RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy17567
         (362 letters)



>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 142

 Score = 54.7 bits (132), Expect = 3e-09
 Identities = 34/93 (36%), Positives = 55/93 (59%), Gaps = 4/93 (4%)

Query: 10  EKKKRIEKRNKEE-EKEENEQEEKEKEDVEEEKNKKKKKK--KKKKKKKKKKKKKKKEEQ 66
           E+ + ++++ K+E E EE +Q+ +EK+  +EEK  KK+ K  KKK+KKKKKKK KK  ++
Sbjct: 50  ERLELMDEKWKKETEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKK 109

Query: 67  SVKEIEESLKERKEKTLTEDSQCDTMIPVFKKG 99
             KE  +S +E  ++   E  +     PV    
Sbjct: 110 EEKEGSKSSEESSDEEE-EGEEDKQEEPVEIME 141



 Score = 35.8 bits (83), Expect = 0.010
 Identities = 22/80 (27%), Positives = 45/80 (56%), Gaps = 12/80 (15%)

Query: 18  RNKEEEKEENEQEEKEKEDVEEEKNKKKKKKK----------KKKKKKKKKKKKKKEEQS 67
           R +E E+ E   E+ +KE   E++  ++K+++          KK+ K++KKK+KKK+++ 
Sbjct: 45  RRREYERLELMDEKWKKE--TEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKK 102

Query: 68  VKEIEESLKERKEKTLTEDS 87
            K+  +  ++   K+  E S
Sbjct: 103 AKKGNKKEEKEGSKSSEESS 122



 Score = 33.5 bits (77), Expect = 0.070
 Identities = 23/74 (31%), Positives = 44/74 (59%), Gaps = 3/74 (4%)

Query: 11  KKKRIEKRNKEEEKEEN--EQEEKEKEDVEE-EKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
           ++KR EK+ K+EEK      + +K+K+  ++ +K KK  KK++K+  K  ++   +EE+ 
Sbjct: 69  QQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSEESSDEEEEG 128

Query: 68  VKEIEESLKERKEK 81
            ++ +E   E  EK
Sbjct: 129 EEDKQEEPVEIMEK 142



 Score = 27.7 bits (62), Expect = 5.1
 Identities = 13/60 (21%), Positives = 37/60 (61%), Gaps = 4/60 (6%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
           R +R    +KKK+ +K+ K+ +K   ++E++  +  EE  +++++ ++ K+++  +  +K
Sbjct: 87  RAKR----QKKKQKKKKKKKAKKGNKKEEKEGSKSSEESSDEEEEGEEDKQEEPVEIMEK 142


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 57.6 bits (139), Expect = 4e-09
 Identities = 22/79 (27%), Positives = 46/79 (58%)

Query: 3   KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
           K   G  E+K++ + + ++++K+E  +EE +    +EE  +K+  K+K+K+K+KK ++ +
Sbjct: 99  KNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPR 158

Query: 63  KEEQSVKEIEESLKERKEK 81
             E+  K      K R +K
Sbjct: 159 DREEEKKRERVRAKSRPKK 177



 Score = 52.6 bits (126), Expect = 1e-07
 Identities = 23/81 (28%), Positives = 47/81 (58%)

Query: 1   MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
             K      +  K  +  + +EE++E EQ ++EK+  +E+  ++ K +K K++ K+K+  
Sbjct: 84  GSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPP 143

Query: 61  KKKEEQSVKEIEESLKERKEK 81
           K+KE++  K++EE     +EK
Sbjct: 144 KEKEKEKEKKVEEPRDREEEK 164



 Score = 52.2 bits (125), Expect = 2e-07
 Identities = 26/82 (31%), Positives = 47/82 (57%), Gaps = 1/82 (1%)

Query: 7   GAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
              E  K  EK  ++ ++E+ +++EK KE+ ++ K K++ K+K+  K+K+K+K+KK EE 
Sbjct: 98  PKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEP 157

Query: 67  SVKEIEESLKERKEKTLTEDSQ 88
             +E EE  +ER          
Sbjct: 158 RDRE-EEKKRERVRAKSRPKKP 178



 Score = 50.3 bits (120), Expect = 1e-06
 Identities = 20/87 (22%), Positives = 49/87 (56%), Gaps = 3/87 (3%)

Query: 3   KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK-- 60
            +     +++ + EK+ K+E+ +E  ++ K KE+ +E++  K+K+K+K+KK ++ + +  
Sbjct: 103 GKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREE 162

Query: 61  -KKKEEQSVKEIEESLKERKEKTLTED 86
            KK+E    K   +   ++K     ++
Sbjct: 163 EKKRERVRAKSRPKKPPKKKPPNKKKE 189



 Score = 49.5 bits (118), Expect = 2e-06
 Identities = 21/73 (28%), Positives = 40/73 (54%)

Query: 15  IEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES 74
           +++  K   K    + +  KE   E   +++K+K++ K++KKKKK+K KEE   ++ +E 
Sbjct: 77  VKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEE 136

Query: 75  LKERKEKTLTEDS 87
            KE++     E  
Sbjct: 137 AKEKRPPKEKEKE 149



 Score = 45.6 bits (108), Expect = 2e-05
 Identities = 22/72 (30%), Positives = 43/72 (59%)

Query: 11  KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
           K++  EKR  +E+++E E++ +E  D EEEK +++ + K + KK  KKK   K+++  +E
Sbjct: 134 KEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEE 193

Query: 71  IEESLKERKEKT 82
            ++    R+   
Sbjct: 194 EKQRQAAREAVK 205



 Score = 44.9 bits (106), Expect = 4e-05
 Identities = 16/73 (21%), Positives = 39/73 (53%)

Query: 9   GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
            ++K++  +  ++ E+E+  +  + K   ++   KK   KKK+  +++K+++  +E    
Sbjct: 147 EKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKG 206

Query: 69  KEIEESLKERKEK 81
           K  E  + E +EK
Sbjct: 207 KPEEPDVNEEREK 219



 Score = 44.1 bits (104), Expect = 7e-05
 Identities = 22/76 (28%), Positives = 37/76 (48%), Gaps = 2/76 (2%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
           RK +  A EK+   EK  ++E+K E  +    +E+ + E+ + K + KK  KKK   KKK
Sbjct: 131 RKPKEEAKEKRPPKEKEKEKEKKVEEPR--DREEEKKRERVRAKSRPKKPPKKKPPNKKK 188

Query: 62  KKEEQSVKEIEESLKE 77
           +  E+  +        
Sbjct: 189 EPPEEEKQRQAAREAV 204



 Score = 44.1 bits (104), Expect = 8e-05
 Identities = 24/75 (32%), Positives = 45/75 (60%), Gaps = 3/75 (4%)

Query: 13  KRIEKRNKEEE--KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
           KR+EK   +    K +  +E K +   EEEK +K++ K++KKKKK+K K++ K+ +  +E
Sbjct: 78  KRVEKGGSKGPAAKTKPAKEPKNESGKEEEK-EKEQVKEEKKKKKEKPKEEPKDRKPKEE 136

Query: 71  IEESLKERKEKTLTE 85
            +E    ++++   E
Sbjct: 137 AKEKRPPKEKEKEKE 151



 Score = 34.1 bits (78), Expect = 0.12
 Identities = 22/90 (24%), Positives = 41/90 (45%), Gaps = 8/90 (8%)

Query: 3   KRRGGAGEK---KKRIEKRNKEEEKEENEQ--EEKEKEDVEEEKNKKKKKKKKKKKKKKK 57
           K+     ++   KKR   R K   K+  ++    K+KE  EEE   K+++  ++  K K 
Sbjct: 152 KKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEE---KQRQAAREAVKGKP 208

Query: 58  KKKKKKEEQSVKEIEESLKERKEKTLTEDS 87
           ++    EE+  +E +   +E     + ED 
Sbjct: 209 EEPDVNEEREKEEDDGKDRETTTSPMEEDE 238



 Score = 32.9 bits (75), Expect = 0.23
 Identities = 17/86 (19%), Positives = 40/86 (46%), Gaps = 2/86 (2%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
           R R     +K  + +  NK++E    E+E++ +   E  K K ++    ++++K++   K
Sbjct: 168 RVRAKSRPKKPPKKKPPNKKKEP--PEEEKQRQAAREAVKGKPEEPDVNEEREKEEDDGK 225

Query: 62  KKEEQSVKEIEESLKERKEKTLTEDS 87
            +E  +    E+  ++  E +    S
Sbjct: 226 DRETTTSPMEEDESRQSSEISRRSSS 251



 Score = 30.6 bits (69), Expect = 1.3
 Identities = 15/65 (23%), Positives = 35/65 (53%), Gaps = 2/65 (3%)

Query: 18  RNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKE 77
           +  E +   +E  ++ ++   +    K K  K+ K +  K+++K+KE+  VKE ++  KE
Sbjct: 65  KCAESKLSSDEAVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQ--VKEEKKKKKE 122

Query: 78  RKEKT 82
           + ++ 
Sbjct: 123 KPKEE 127


>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
           RPA34.5.  This is a family of proteins conserved from
           yeasts to human. Subunit A34.5 of RNA polymerase I is a
           non-essential subunit which is thought to help Pol I
           overcome topological constraints imposed on ribosomal
           DNA during the process of transcription.
          Length = 193

 Score = 55.1 bits (133), Expect = 5e-09
 Identities = 28/74 (37%), Positives = 42/74 (56%)

Query: 6   GGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
           G        +   ++  EKE   + EKE E  EEEK +KKKKK+ KK+KK+KK KK+K  
Sbjct: 119 GAPDGPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMV 178

Query: 66  QSVKEIEESLKERK 79
           +     ++  K++K
Sbjct: 179 EPKGSKKKKKKKKK 192



 Score = 53.6 bits (129), Expect = 2e-08
 Identities = 28/77 (36%), Positives = 43/77 (55%)

Query: 6   GGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
           G         E  ++ E  E+    + EKE   EE+ KK+KKKKK+ KK+KK+KK KKE+
Sbjct: 117 GYGAPDGPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEK 176

Query: 66  QSVKEIEESLKERKEKT 82
               +  +  K++K+K 
Sbjct: 177 MVEPKGSKKKKKKKKKK 193



 Score = 52.8 bits (127), Expect = 3e-08
 Identities = 30/81 (37%), Positives = 46/81 (56%)

Query: 1   MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
           MR    G G       +   E E  E E   K +++ E E+ +KK+KKKKK+ KK+KK+K
Sbjct: 111 MRFFPTGYGAPDGPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEK 170

Query: 61  KKKEEQSVKEIEESLKERKEK 81
           K K+E+ V+      K++K+K
Sbjct: 171 KDKKEKMVEPKGSKKKKKKKK 191



 Score = 45.9 bits (109), Expect = 7e-06
 Identities = 25/61 (40%), Positives = 40/61 (65%), Gaps = 6/61 (9%)

Query: 5   RGGAGEKKKRIEKRNK--EEEKEENEQEEKEKEDVEEEKNKKKK----KKKKKKKKKKKK 58
                E   ++EK  +  EEEK+E +++++ K++ +E+K+KK+K    K  KKKKKKKKK
Sbjct: 133 ETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKK 192

Query: 59  K 59
           K
Sbjct: 193 K 193



 Score = 44.3 bits (105), Expect = 2e-05
 Identities = 21/52 (40%), Positives = 36/52 (69%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKK 53
           +  +    E++++ EK+ K+E K+E ++++ +KE + E K  KKKKKKKKKK
Sbjct: 142 KVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKKK 193



 Score = 37.8 bits (88), Expect = 0.003
 Identities = 13/62 (20%), Positives = 34/62 (54%)

Query: 30  EEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQC 89
             +   + E  + +   K +K+ + ++++KK+KK+++ VK+ ++  K++KEK +      
Sbjct: 125 PSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSK 184

Query: 90  DT 91
             
Sbjct: 185 KK 186


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 53.9 bits (130), Expect = 1e-08
 Identities = 22/73 (30%), Positives = 42/73 (57%), Gaps = 2/73 (2%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E  + IEK  KE E+++  + + +K+  +++K+K K KK  KK  K +KK +K+ E  ++
Sbjct: 66  ELAEEIEKVKKEYEEKQ--KWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLE 123

Query: 70  EIEESLKERKEKT 82
           ++ +S  E     
Sbjct: 124 DLTKSYSETLSTL 136



 Score = 47.0 bits (112), Expect = 2e-06
 Identities = 24/64 (37%), Positives = 40/64 (62%), Gaps = 3/64 (4%)

Query: 21  EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKK---KKKKKEEQSVKEIEESLKE 77
            E K++ ++  +E E V++E  +K+K K KKKK KKKK   K KK +++  K  ++  KE
Sbjct: 58  TEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKE 117

Query: 78  RKEK 81
            ++K
Sbjct: 118 AEDK 121



 Score = 42.0 bits (99), Expect = 1e-04
 Identities = 20/57 (35%), Positives = 37/57 (64%)

Query: 25  EENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
            E  + +K+K+++ EE  K KK+ ++K+K K KKKK KK++   K+ ++  K+ K +
Sbjct: 55  AEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSE 111



 Score = 41.2 bits (97), Expect = 2e-04
 Identities = 19/69 (27%), Positives = 37/69 (53%), Gaps = 1/69 (1%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKE-DVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
           KK  E++ K + K++  +++K+K+ D +++K   K +KK +K+ + K +   K       
Sbjct: 75  KKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLS 134

Query: 71  IEESLKERK 79
               LK RK
Sbjct: 135 TLSELKPRK 143



 Score = 33.1 bits (76), Expect = 0.13
 Identities = 16/55 (29%), Positives = 34/55 (61%)

Query: 28  EQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKT 82
           E  E +K+  E  +  +K KK+ ++K+K K KKKK +++  K+ ++   ++ +K+
Sbjct: 56  EYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKS 110


>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
           membranes [Cell envelope biogenesis, outer membrane].
          Length = 244

 Score = 52.9 bits (127), Expect = 5e-08
 Identities = 21/73 (28%), Positives = 33/73 (45%), Gaps = 1/73 (1%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
            ++ + E    EE+ +   + E   E    +  +K K +KK KK K K K K K +  VK
Sbjct: 53  TEEPQPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVK 112

Query: 70  EIEESLKERKEKT 82
              +  K+   KT
Sbjct: 113 PQPKP-KKPPSKT 124



 Score = 49.4 bits (118), Expect = 7e-07
 Identities = 20/71 (28%), Positives = 29/71 (40%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
           E + + E  EE  +   E E   E    K K+K K +KK KK K K K +   K   +  
Sbjct: 55  EPQPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQ 114

Query: 76  KERKEKTLTED 86
            + K+      
Sbjct: 115 PKPKKPPSKTA 125



 Score = 39.4 bits (92), Expect = 0.001
 Identities = 20/81 (24%), Positives = 27/81 (33%), Gaps = 5/81 (6%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKK-----KKKKKKKKKKKKKKE 64
           E  +   K   E E        K KE  + EK  KK K K     K K K K + K KK 
Sbjct: 61  EPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKKP 120

Query: 65  EQSVKEIEESLKERKEKTLTE 85
                    +   +  +  + 
Sbjct: 121 PSKTAAKAPAAPNQPARPPSA 141



 Score = 32.8 bits (75), Expect = 0.22
 Identities = 11/60 (18%), Positives = 23/60 (38%), Gaps = 5/60 (8%)

Query: 28  EQEEKEKEDVEEEKNKKKKKKKKKKKKKKK-----KKKKKKEEQSVKEIEESLKERKEKT 82
           E   +E +   E   ++ K   + +   +      K+K K E++  K   +   + K K 
Sbjct: 50  EAPTEEPQPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKP 109


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 53.8 bits (130), Expect = 6e-08
 Identities = 26/77 (33%), Positives = 52/77 (67%)

Query: 11  KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
            KK  +K  K  EK E ++EE++KE  ++    KKK+++++++K+KK+++K++EE+  +E
Sbjct: 405 SKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEE 464

Query: 71  IEESLKERKEKTLTEDS 87
            +E  +E+K+K  T   
Sbjct: 465 EKEEEEEKKKKQATLFD 481



 Score = 53.4 bits (129), Expect = 8e-08
 Identities = 25/79 (31%), Positives = 48/79 (60%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E+ + +    K  +K +   E+ EK+  EE+K KKKK    KKK+++++++K+K+E+  +
Sbjct: 397 EEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKE 456

Query: 70  EIEESLKERKEKTLTEDSQ 88
           E EE  +E KE+   +  +
Sbjct: 457 EEEEEAEEEKEEEEEKKKK 475



 Score = 49.1 bits (118), Expect = 2e-06
 Identities = 19/79 (24%), Positives = 56/79 (70%), Gaps = 4/79 (5%)

Query: 5   RGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
              A +K K+I ++ +++ +EE ++++K+    ++++ +++++K+KK+++K++++++ +E
Sbjct: 405 SKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEE 464

Query: 65  EQSVKEIEESLKERKEKTL 83
           E    + EE  K++K+ TL
Sbjct: 465 E----KEEEEEKKKKQATL 479



 Score = 47.2 bits (113), Expect = 8e-06
 Identities = 24/55 (43%), Positives = 40/55 (72%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKK 56
           +KR     EKKK+     K+EE+EE E+E+KE+E  EEE+  +++K+++++KKKK
Sbjct: 421 KKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475



 Score = 46.1 bits (110), Expect = 2e-05
 Identities = 20/67 (29%), Positives = 41/67 (61%)

Query: 21  EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
           EEE E     +K  + +++   K +KK++++KK+KKKK    K+++  +E E+  KE ++
Sbjct: 396 EEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEK 455

Query: 81  KTLTEDS 87
           +   E++
Sbjct: 456 EEEEEEA 462



 Score = 44.1 bits (105), Expect = 7e-05
 Identities = 15/57 (26%), Positives = 43/57 (75%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKK 58
            +++    +K+K+ +    ++++EE E+E+++KE+ +EE+ ++ +++K+++++KKKK
Sbjct: 419 AEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475



 Score = 42.6 bits (101), Expect = 2e-04
 Identities = 16/69 (23%), Positives = 33/69 (47%)

Query: 30  EEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQC 89
            E+E E +   K   KK KK  +K +KK++++KKE++      +  +E +E+   +  + 
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEE 454

Query: 90  DTMIPVFKK 98
                   +
Sbjct: 455 KEEEEEEAE 463



 Score = 42.2 bits (100), Expect = 3e-04
 Identities = 22/54 (40%), Positives = 38/54 (70%)

Query: 1   MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKK 54
            R+      +KK    K+ +EEE+EE E++E+EKE+ EEE  ++K+++++KKKK
Sbjct: 422 KREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475



 Score = 40.3 bits (95), Expect = 0.001
 Identities = 20/68 (29%), Positives = 38/68 (55%)

Query: 21  EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
            EE+ E     K+     ++  +K +KK++++KK+KKKK    +++  +E EE  K+ +E
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEE 454

Query: 81  KTLTEDSQ 88
           K   E+  
Sbjct: 455 KEEEEEEA 462



 Score = 38.0 bits (89), Expect = 0.007
 Identities = 16/68 (23%), Positives = 36/68 (52%)

Query: 33  EKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDTM 92
            +E++E     KK  KK KK  +K +KK+++E++  K+   + K+++E+   E  + +  
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEE 454

Query: 93  IPVFKKGV 100
               ++  
Sbjct: 455 KEEEEEEA 462


>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein.  The proteins in this family
           are designated YL1. These proteins have been shown to be
           DNA-binding and may be a transcription factor.
          Length = 238

 Score = 51.6 bits (124), Expect = 1e-07
 Identities = 25/64 (39%), Positives = 38/64 (59%)

Query: 22  EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
           +E E +++EE EKE   EE+ KKKK+ K K  K+  KKKKKK+  + K  + +    K+K
Sbjct: 64  DEPESDDEEEGEKELQREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPKKK 123

Query: 82  TLTE 85
           +   
Sbjct: 124 SERI 127



 Score = 45.4 bits (108), Expect = 1e-05
 Identities = 23/68 (33%), Positives = 34/68 (50%), Gaps = 1/68 (1%)

Query: 21  EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK-KEEQSVKEIEESLKERK 79
           E E ++ E+ EKE +  E  K KK+ K K  K+  KKKKKK     +S K      K++ 
Sbjct: 65  EPESDDEEEGEKELQREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPKKKS 124

Query: 80  EKTLTEDS 87
           E+     +
Sbjct: 125 ERISWAPT 132



 Score = 30.8 bits (70), Expect = 0.98
 Identities = 20/75 (26%), Positives = 36/75 (48%), Gaps = 16/75 (21%)

Query: 26  ENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE----------------QSVK 69
           +N++   E+    E + KK + K +K+K+KKK+K+  +EE                +  +
Sbjct: 148 QNKEATHERLKEREIRRKKIQAKARKRKEKKKEKELTQEERLAEAKETERINLKSLERYE 207

Query: 70  EIEESLKERKEKTLT 84
           E EE  K+ K + L 
Sbjct: 208 EQEEEKKKAKIQALK 222



 Score = 30.0 bits (68), Expect = 1.8
 Identities = 17/88 (19%), Positives = 38/88 (43%), Gaps = 11/88 (12%)

Query: 5   RGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
           +    E++ + +KR K +  +E  +++K+K+    +  K    + KKK ++        +
Sbjct: 76  KELQREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPKKKSERISWAPTLLD 135

Query: 65  E-----------QSVKEIEESLKERKEK 81
                       Q+ +   E LKER+ +
Sbjct: 136 SPRRKSSRSSTVQNKEATHERLKEREIR 163


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
           subunit.  This is a family of proteins which are
           subunits of the eukaryotic translation initiation factor
           3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
           cerevisiae protein eIF3j (HCR1) has been shown to be
           required for processing of 20S pre-rRNA and binds to 18S
           rRNA and eIF3 subunits Rpg1p and Prt1p.
          Length = 242

 Score = 50.4 bits (121), Expect = 4e-07
 Identities = 19/69 (27%), Positives = 41/69 (59%), Gaps = 4/69 (5%)

Query: 23  EKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE----QSVKEIEESLKER 78
           ++EE+E++E+EK  V  +   KK  K K ++K+K K++K+++     +     +E  ++ 
Sbjct: 38  DEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKL 97

Query: 79  KEKTLTEDS 87
           + + L E+S
Sbjct: 98  RLRKLQEES 106



 Score = 49.7 bits (119), Expect = 5e-07
 Identities = 21/65 (32%), Positives = 37/65 (56%)

Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
          ++ K+  ++EE E+++ E+ K   K K KK  K K ++K+K K E+  K + E  ++  E
Sbjct: 31 DDVKDSWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPE 90

Query: 81 KTLTE 85
            L E
Sbjct: 91 DELAE 95



 Score = 48.9 bits (117), Expect = 1e-06
 Identities = 19/65 (29%), Positives = 39/65 (60%)

Query: 22 EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
          E+++++ ++  ++E+ EE++ +K K   K K KK  K K +++E++ +E EE      E+
Sbjct: 27 EDEDDDVKDSWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEE 86

Query: 82 TLTED 86
             ED
Sbjct: 87 DTPED 91



 Score = 48.5 bits (116), Expect = 1e-06
 Identities = 21/78 (26%), Positives = 41/78 (52%)

Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
            K + K   ++E E+++ ++   E+ +EEK ++K K   K K KK  K K +E++  K 
Sbjct: 15 PAKAVVKDKWDDEDEDDDVKDSWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKR 74

Query: 71 IEESLKERKEKTLTEDSQ 88
           +E    R+ +  T + +
Sbjct: 75 EKEEKGLRELEEDTPEDE 92



 Score = 34.6 bits (80), Expect = 0.048
 Identities = 10/34 (29%), Positives = 13/34 (38%)

Query: 31  EKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
           EK K +   +  KKKK K K K           +
Sbjct: 193 EKLKAEKAAKGGKKKKGKAKAKLNVGGANDDDDD 226



 Score = 34.2 bits (79), Expect = 0.062
 Identities = 10/32 (31%), Positives = 11/32 (34%)

Query: 32  KEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
            EK   E+     KKKK K K K         
Sbjct: 192 NEKLKAEKAAKGGKKKKGKAKAKLNVGGANDD 223



 Score = 33.9 bits (78), Expect = 0.10
 Identities = 12/39 (30%), Positives = 16/39 (41%), Gaps = 4/39 (10%)

Query: 27  NEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
           NE+ + EK      K  KKKK K K K          ++
Sbjct: 192 NEKLKAEKAA----KGGKKKKGKAKAKLNVGGANDDDDD 226



 Score = 32.3 bits (74), Expect = 0.28
 Identities = 10/41 (24%), Positives = 24/41 (58%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKK 50
             K +IE++ K + ++E +   + +ED  E++  +K + +K
Sbjct: 61  ALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKLRLRK 101



 Score = 31.2 bits (71), Expect = 0.64
 Identities = 13/44 (29%), Positives = 28/44 (63%)

Query: 11  KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKK 54
            KK ++ + +E+EK + E+EEK   ++EE+  + +  +K + +K
Sbjct: 58  AKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKLRLRK 101



 Score = 29.2 bits (66), Expect = 2.8
 Identities = 13/58 (22%), Positives = 33/58 (56%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
           + K   + + K+  K + E++EK K + EE+  ++ ++   + +  +K + +K +E+S
Sbjct: 49  KAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKLRLRKLQEES 106


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 51.5 bits (123), Expect = 4e-07
 Identities = 24/65 (36%), Positives = 33/65 (50%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E K   E   K E +++ + EE E+E  EEE    KK KK KK K KK    K +  S  
Sbjct: 301 EDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSGD 360

Query: 70  EIEES 74
           + ++S
Sbjct: 361 DSDDS 365



 Score = 46.5 bits (110), Expect = 1e-05
 Identities = 28/92 (30%), Positives = 48/92 (52%), Gaps = 5/92 (5%)

Query: 3   KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKK----KKK 58
           ++ GG   K  +I+    ++E + +E ++   ED +EEK+KKKKKK  K KKK    KK 
Sbjct: 203 EKGGGGRGKDLKIKDLEGDDEDDGDESDKGG-EDGDEEKSKKKKKKLAKNKKKLDDDKKG 261

Query: 59  KKKKKEEQSVKEIEESLKERKEKTLTEDSQCD 90
           K+   ++    + ++   E +E+    DS   
Sbjct: 262 KRGGDDDADEYDSDDGDDEGREEDYISDSSAS 293



 Score = 41.1 bits (96), Expect = 7e-04
 Identities = 23/56 (41%), Positives = 33/56 (58%), Gaps = 2/56 (3%)

Query: 10  EKKKRIEKRNKEE-EKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
           + ++R +K + E   K E EQ+E  +E  EEEKN+++    KK KK KK K KK  
Sbjct: 296 DPEEREDKLSPEIPAKPEIEQDEDSEES-EEEKNEEEGGLSKKGKKLKKLKGKKNG 350



 Score = 34.9 bits (80), Expect = 0.068
 Identities = 11/50 (22%), Positives = 27/50 (54%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKK 59
           + KK   K+N  ++ + +  ++ +  D++ E +      KK+K+ KK++ 
Sbjct: 340 KLKKLKGKKNGLDKDDSDSGDDSDDSDIDGEDSVSLVTAKKQKEPKKEEP 389



 Score = 32.2 bits (73), Expect = 0.45
 Identities = 14/102 (13%), Positives = 42/102 (41%), Gaps = 12/102 (11%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEE----------NEQEEKEK--EDVEEEKNKKKKKKK 49
           + +RGG  +  +       +E +EE          N+ EE+E         K + ++ + 
Sbjct: 260 KGKRGGDDDADEYDSDDGDDEGREEDYISDSSASGNDPEEREDKLSPEIPAKPEIEQDED 319

Query: 50  KKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDT 91
            ++ +++K +++    +  K++++   ++      +    D 
Sbjct: 320 SEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSGDD 361


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 51.3 bits (122), Expect = 6e-07
 Identities = 17/81 (20%), Positives = 44/81 (54%)

Query: 8    AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
            A E KK+ E+  K+ +  + + EE +K     +   +    + +  ++K +  +KK+E++
Sbjct: 1317 ADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEA 1376

Query: 68   VKEIEESLKERKEKTLTEDSQ 88
             K+ + + K+ +EK   ++++
Sbjct: 1377 KKKADAAKKKAEEKKKADEAK 1397



 Score = 48.6 bits (115), Expect = 4e-06
 Identities = 26/90 (28%), Positives = 47/90 (52%), Gaps = 3/90 (3%)

Query: 2    RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKK---KKKK 58
                  A E+K    ++ KEE K++ +  +K+ E+ ++    KKK ++ KKK    KK  
Sbjct: 1355 AADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAA 1414

Query: 59   KKKKKEEQSVKEIEESLKERKEKTLTEDSQ 88
              KKK +++ K+ EE  K  + K   E+++
Sbjct: 1415 AAKKKADEAKKKAEEKKKADEAKKKAEEAK 1444



 Score = 48.2 bits (114), Expect = 5e-06
 Identities = 29/83 (34%), Positives = 49/83 (59%)

Query: 8    AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
            A E KK+ E+  K EE E+   E  +KE  E +K ++ KKK+ ++KKK ++ KK +EE  
Sbjct: 1670 AEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENK 1729

Query: 68   VKEIEESLKERKEKTLTEDSQCD 90
            +K  E   +  ++K   E+++ D
Sbjct: 1730 IKAEEAKKEAEEDKKKAEEAKKD 1752



 Score = 47.8 bits (113), Expect = 7e-06
 Identities = 26/84 (30%), Positives = 49/84 (58%), Gaps = 3/84 (3%)

Query: 8    AGEKKKRIEKRNKEEEKEENEQEEKEKEDVE---EEKNKKKKKKKKKKKKKKKKKKKKKE 64
            A E KK+ E+  K +E ++  +E K+ E+ +   EE  K  + KKK ++ KK  + KKK 
Sbjct: 1433 ADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKA 1492

Query: 65   EQSVKEIEESLKERKEKTLTEDSQ 88
            E++ K+ +E+ K  + K   ++++
Sbjct: 1493 EEAKKKADEAKKAAEAKKKADEAK 1516



 Score = 47.8 bits (113), Expect = 7e-06
 Identities = 24/81 (29%), Positives = 40/81 (49%)

Query: 8    AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
            A E  K   +   +E +   E+ E  ++  EE K K    KKK ++KKK  + KKK E+ 
Sbjct: 1344 AAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEED 1403

Query: 68   VKEIEESLKERKEKTLTEDSQ 88
             K+ +E  K    K   ++++
Sbjct: 1404 KKKADELKKAAAAKKKADEAK 1424



 Score = 47.8 bits (113), Expect = 7e-06
 Identities = 19/81 (23%), Positives = 39/81 (48%)

Query: 8    AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
            A E KK+ E+  K +E ++  +E K+K D  ++K ++ KK  +  K + +    + E   
Sbjct: 1304 ADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAE 1363

Query: 68   VKEIEESLKERKEKTLTEDSQ 88
             K      K+ + K   + ++
Sbjct: 1364 EKAEAAEKKKEEAKKKADAAK 1384



 Score = 47.8 bits (113), Expect = 8e-06
 Identities = 18/79 (22%), Positives = 34/79 (43%)

Query: 8    AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
            A E KK  E + K EE ++     K+K +  ++  +  K + +    + +  ++K E   
Sbjct: 1311 AEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAE 1370

Query: 68   VKEIEESLKERKEKTLTED 86
             K+ E   K    K   E+
Sbjct: 1371 KKKEEAKKKADAAKKKAEE 1389



 Score = 47.4 bits (112), Expect = 1e-05
 Identities = 27/79 (34%), Positives = 46/79 (58%)

Query: 8    AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
            A E+KK++E+  K+E +E+ + EE +K + E +    ++ KK ++ KKK ++ KK EE  
Sbjct: 1628 AEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDE 1687

Query: 68   VKEIEESLKERKEKTLTED 86
             K  E   KE +E    E+
Sbjct: 1688 KKAAEALKKEAEEAKKAEE 1706



 Score = 47.1 bits (111), Expect = 1e-05
 Identities = 28/78 (35%), Positives = 44/78 (56%), Gaps = 2/78 (2%)

Query: 4    RRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
            ++  A EKKK  E +  EEE +    EE +K   EE+K K ++ KK ++ +KK  +  KK
Sbjct: 1639 KKKEAEEKKKAEELKKAEEENKIKAAEEAKKA--EEDKKKAEEAKKAEEDEKKAAEALKK 1696

Query: 64   EEQSVKEIEESLKERKEK 81
            E +  K+ EE  K+  E+
Sbjct: 1697 EAEEAKKAEELKKKEAEE 1714



 Score = 47.1 bits (111), Expect = 1e-05
 Identities = 24/81 (29%), Positives = 48/81 (59%), Gaps = 2/81 (2%)

Query: 10   EKKKRIEKRNKEEEKEENEQEEKEKEDVE--EEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
            +KKK  E +     K++ ++ +K+ E+ +  +E  KK ++ KK  + KKK ++ KK E++
Sbjct: 1403 DKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEA 1462

Query: 68   VKEIEESLKERKEKTLTEDSQ 88
             K+ EE+ K  + K   E+++
Sbjct: 1463 KKKAEEAKKADEAKKKAEEAK 1483



 Score = 46.7 bits (110), Expect = 2e-05
 Identities = 24/72 (33%), Positives = 41/72 (56%), Gaps = 3/72 (4%)

Query: 12   KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
            K   E +  EE+K++ E+ +K +ED   EK   +  KK+ ++ KK ++ KKKE +  K+ 
Sbjct: 1662 KAAEEAKKAEEDKKKAEEAKKAEED---EKKAAEALKKEAEEAKKAEELKKKEAEEKKKA 1718

Query: 72   EESLKERKEKTL 83
            EE  K  +E  +
Sbjct: 1719 EELKKAEEENKI 1730



 Score = 46.7 bits (110), Expect = 2e-05
 Identities = 25/77 (32%), Positives = 45/77 (58%), Gaps = 6/77 (7%)

Query: 10   EKKKRIEKRNKEEEK------EENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
            E+KK+ E+  K EE+      EE ++ E++K+  EE K  ++ +KK  +  KK+ ++ KK
Sbjct: 1644 EEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKK 1703

Query: 64   EEQSVKEIEESLKERKE 80
             E+  K+  E  K+ +E
Sbjct: 1704 AEELKKKEAEEKKKAEE 1720



 Score = 46.3 bits (109), Expect = 2e-05
 Identities = 21/81 (25%), Positives = 40/81 (49%)

Query: 8    AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
            A E KK+ +   K+ E+ +   E  + E        +  ++K +  +KKK++ KKK + +
Sbjct: 1324 AEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAA 1383

Query: 68   VKEIEESLKERKEKTLTEDSQ 88
             K+ EE  K  + K   E+ +
Sbjct: 1384 KKKAEEKKKADEAKKKAEEDK 1404



 Score = 46.3 bits (109), Expect = 2e-05
 Identities = 27/79 (34%), Positives = 42/79 (53%)

Query: 3    KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
            K++  A +KK    K+  E  K E E    E E  EE+    +KKK++ KKK    KKK 
Sbjct: 1328 KKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKA 1387

Query: 63   KEEQSVKEIEESLKERKEK 81
            +E++   E ++  +E K+K
Sbjct: 1388 EEKKKADEAKKKAEEDKKK 1406



 Score = 46.3 bits (109), Expect = 2e-05
 Identities = 27/78 (34%), Positives = 44/78 (56%), Gaps = 4/78 (5%)

Query: 8    AGEKKKRIEKRNKEEEKEENEQEEKEKEDV----EEEKNKKKKKKKKKKKKKKKKKKKKK 63
            A E KK  E + K EE ++ E++EK+  +      EE  K ++ KKK+ ++KKK ++ KK
Sbjct: 1664 AEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKK 1723

Query: 64   EEQSVKEIEESLKERKEK 81
             E+  K   E  K+  E+
Sbjct: 1724 AEEENKIKAEEAKKEAEE 1741



 Score = 46.3 bits (109), Expect = 2e-05
 Identities = 25/74 (33%), Positives = 48/74 (64%)

Query: 8    AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
            A E +K+  +  K+E +E  + EE +K++ EE+K  ++ KK +++ K K ++ KK+ E+ 
Sbjct: 1683 AEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEED 1742

Query: 68   VKEIEESLKERKEK 81
             K+ EE+ K+ +EK
Sbjct: 1743 KKKAEEAKKDEEEK 1756



 Score = 45.9 bits (108), Expect = 3e-05
 Identities = 27/83 (32%), Positives = 52/83 (62%), Gaps = 2/83 (2%)

Query: 8    AGEKKKRIEKRNKEEEKEENEQEEKEKEDVE--EEKNKKKKKKKKKKKKKKKKKKKKKEE 65
            A  KKK  E + K EEK++ ++ +K+ E+ +  +E  KK ++ KK ++ KKK ++ KK +
Sbjct: 1414 AAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKAD 1473

Query: 66   QSVKEIEESLKERKEKTLTEDSQ 88
            ++ K+ EE+ K  + K   E+++
Sbjct: 1474 EAKKKAEEAKKADEAKKKAEEAK 1496



 Score = 45.9 bits (108), Expect = 3e-05
 Identities = 25/91 (27%), Positives = 51/91 (56%), Gaps = 2/91 (2%)

Query: 2    RKRRGGAGEKKKRIEKRNKEEEKEENEQEEK--EKEDVEEEKNKKKKKKKKKKKKKKKKK 59
            R++     E+ ++ ++  K EEK++ ++ +K  EK+  +E K K ++ KK  + KKK ++
Sbjct: 1267 RRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEE 1326

Query: 60   KKKKEEQSVKEIEESLKERKEKTLTEDSQCD 90
             KKK + + K+ EE+ K  +      ++  D
Sbjct: 1327 AKKKADAAKKKAEEAKKAAEAAKAEAEAAAD 1357



 Score = 45.9 bits (108), Expect = 3e-05
 Identities = 19/78 (24%), Positives = 39/78 (50%)

Query: 11   KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
            KKK    + K EE ++  +  K + +   ++ +  ++K +  +KKK++ KKK +    K 
Sbjct: 1328 KKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKA 1387

Query: 71   IEESLKERKEKTLTEDSQ 88
             E+   +  +K   ED +
Sbjct: 1388 EEKKKADEAKKKAEEDKK 1405



 Score = 45.9 bits (108), Expect = 3e-05
 Identities = 22/78 (28%), Positives = 46/78 (58%), Gaps = 4/78 (5%)

Query: 8    AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKK----KKKKKKKKKK 63
            A EKKK  E + K EE ++ ++ +K+ E+ +++ +  KKK ++ KK     K + +    
Sbjct: 1298 AEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAAD 1357

Query: 64   EEQSVKEIEESLKERKEK 81
            E ++ +E  E+ +++KE+
Sbjct: 1358 EAEAAEEKAEAAEKKKEE 1375



 Score = 45.9 bits (108), Expect = 3e-05
 Identities = 28/86 (32%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 8    AGEKKKRIEKRNKEEEKEENEQEEK---EKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
            A E KK  E + K EE ++ E+ +K   E +  +E K K ++ KK  + KKK ++ KKK 
Sbjct: 1440 AEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKA 1499

Query: 65   EQSVKEIEESLKERKEKTLTEDSQCD 90
            +++ K  E   K  + K   E  + D
Sbjct: 1500 DEAKKAAEAKKKADEAKKAEEAKKAD 1525



 Score = 45.5 bits (107), Expect = 3e-05
 Identities = 26/75 (34%), Positives = 47/75 (62%), Gaps = 2/75 (2%)

Query: 8    AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKK--KKKKKKKKKEE 65
            A EKKK  E + K EE ++ ++ +K+ E+ ++ +  KKK ++ KK    KKK ++ KK +
Sbjct: 1427 AEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKAD 1486

Query: 66   QSVKEIEESLKERKE 80
            ++ K+ EE+ K+  E
Sbjct: 1487 EAKKKAEEAKKKADE 1501



 Score = 45.5 bits (107), Expect = 4e-05
 Identities = 24/80 (30%), Positives = 50/80 (62%), Gaps = 2/80 (2%)

Query: 11   KKKRIEKRNKEEEKEENEQEEKEKEDVE--EEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
            KKK  EK+  +E K++ E+++K+ ++++      KK  + KKK ++KKK  + KK+ +  
Sbjct: 1384 KKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEA 1443

Query: 69   KEIEESLKERKEKTLTEDSQ 88
            K+ +E+ K+ +E    E+++
Sbjct: 1444 KKADEAKKKAEEAKKAEEAK 1463



 Score = 45.5 bits (107), Expect = 4e-05
 Identities = 26/77 (33%), Positives = 42/77 (54%), Gaps = 2/77 (2%)

Query: 8    AGEKKKRIEKRNKEEEKEENEQEEK--EKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
            A EKKK  E +  EE+K+ +E ++K  E +  +E K K ++ KKK    KKK ++ KK  
Sbjct: 1286 AEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAA 1345

Query: 66   QSVKEIEESLKERKEKT 82
            ++ K   E+  +  E  
Sbjct: 1346 EAAKAEAEAAADEAEAA 1362



 Score = 45.1 bits (106), Expect = 4e-05
 Identities = 28/79 (35%), Positives = 51/79 (64%), Gaps = 1/79 (1%)

Query: 10   EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
            EKK + E+  K EE  + + EE +K + E++K ++ KKK+ ++KKK ++ KK +EE  +K
Sbjct: 1604 EKKMKAEEAKKAEE-AKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIK 1662

Query: 70   EIEESLKERKEKTLTEDSQ 88
              EE+ K  ++K   E+++
Sbjct: 1663 AAEEAKKAEEDKKKAEEAK 1681



 Score = 44.7 bits (105), Expect = 6e-05
 Identities = 25/90 (27%), Positives = 42/90 (46%), Gaps = 7/90 (7%)

Query: 8    AGEKKKRIEKRNKEEEK---EENEQEEKEKEDVEEEKNK----KKKKKKKKKKKKKKKKK 60
            A E KK  EK+  +E K   EE ++ ++ K+  EE K K    KKK ++ KK  +  K +
Sbjct: 1292 ADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAE 1351

Query: 61   KKKEEQSVKEIEESLKERKEKTLTEDSQCD 90
             +      +  EE  +  ++K      + D
Sbjct: 1352 AEAAADEAEAAEEKAEAAEKKKEEAKKKAD 1381



 Score = 44.7 bits (105), Expect = 6e-05
 Identities = 28/95 (29%), Positives = 55/95 (57%), Gaps = 4/95 (4%)

Query: 3    KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDV--EEEKNKKKKKKKKKKKKKKKKKK 60
            +++     KK+  E +  EE K++  +E+K+ E++   EE+NK K ++ KK+ ++ KKK 
Sbjct: 1687 EKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKA 1746

Query: 61   K--KKEEQSVKEIEESLKERKEKTLTEDSQCDTMI 93
            +  KK+E+  K+I    KE ++K      + + +I
Sbjct: 1747 EEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVI 1781



 Score = 43.6 bits (102), Expect = 2e-04
 Identities = 26/82 (31%), Positives = 49/82 (59%), Gaps = 3/82 (3%)

Query: 10   EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK---KKKEEQ 66
             KKK    + K EEK++ ++ +K+ E+ +++ ++ KK    KKK  + KKK   KKK ++
Sbjct: 1376 AKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADE 1435

Query: 67   SVKEIEESLKERKEKTLTEDSQ 88
            + K+ EE+ K  + K   E+++
Sbjct: 1436 AKKKAEEAKKADEAKKKAEEAK 1457



 Score = 43.6 bits (102), Expect = 2e-04
 Identities = 26/81 (32%), Positives = 45/81 (55%), Gaps = 3/81 (3%)

Query: 8    AGEKKKRIEKRNKEEEKEENEQEEKEKEDVE---EEKNKKKKKKKKKKKKKKKKKKKKKE 64
            A E KK+ E+  K EE ++  +E K+ ++ +   EE  K  + KKK ++ KKK  + KK 
Sbjct: 1446 ADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKA 1505

Query: 65   EQSVKEIEESLKERKEKTLTE 85
             ++ K+ +E+ K  + K   E
Sbjct: 1506 AEAKKKADEAKKAEEAKKADE 1526



 Score = 43.2 bits (101), Expect = 2e-04
 Identities = 29/87 (33%), Positives = 43/87 (49%), Gaps = 4/87 (4%)

Query: 8    AGEKKKRIEKRNKEEEK----EENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
            A EKKK  E + K EE     +E ++    K+  +E K K ++KKK  + KKK ++ KK 
Sbjct: 1387 AEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKA 1446

Query: 64   EEQSVKEIEESLKERKEKTLTEDSQCD 90
            +E   K  E    E  +K   E  + D
Sbjct: 1447 DEAKKKAEEAKKAEEAKKKAEEAKKAD 1473



 Score = 43.2 bits (101), Expect = 2e-04
 Identities = 32/90 (35%), Positives = 55/90 (61%), Gaps = 10/90 (11%)

Query: 4    RRGGAGEKKKRIEKRNKEEE--------KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKK 55
            ++  A EKKK  E +  EEE        K+E E+++K+ E+ ++++ +KKK    KK+++
Sbjct: 1708 KKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEE 1767

Query: 56   KKKKKKKKEEQSVKEIEESLKERKEKTLTE 85
            KK ++ +KE+++V  IEE L E  EK   E
Sbjct: 1768 KKAEEIRKEKEAV--IEEELDEEDEKRRME 1795



 Score = 43.2 bits (101), Expect = 2e-04
 Identities = 25/80 (31%), Positives = 45/80 (56%), Gaps = 4/80 (5%)

Query: 11   KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE----Q 66
            KKK  E +   E K++ ++ +K +E  + ++ KK ++ KK  + KK ++KKK +E    +
Sbjct: 1496 KKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAE 1555

Query: 67   SVKEIEESLKERKEKTLTED 86
             +K+ EE  K  + K   ED
Sbjct: 1556 ELKKAEEKKKAEEAKKAEED 1575



 Score = 42.4 bits (99), Expect = 3e-04
 Identities = 26/85 (30%), Positives = 50/85 (58%), Gaps = 5/85 (5%)

Query: 8    AGEKKKRIEKRNKEEEKEENEQEEK-----EKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
            A E KK  E + K EE ++ E+E+K     +K++ EE+K  ++ KK +++ K K  ++ K
Sbjct: 1609 AEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAK 1668

Query: 63   KEEQSVKEIEESLKERKEKTLTEDS 87
            K E+  K+ EE+ K  +++    ++
Sbjct: 1669 KAEEDKKKAEEAKKAEEDEKKAAEA 1693



 Score = 42.4 bits (99), Expect = 3e-04
 Identities = 26/73 (35%), Positives = 45/73 (61%), Gaps = 1/73 (1%)

Query: 11   KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
            ++ RIE+  K  E+E+  + E+ K+  EE K K ++ KK +++KKK ++ KKKE +  K+
Sbjct: 1590 EEARIEEVMKLYEEEKKMKAEEAKKA-EEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKK 1648

Query: 71   IEESLKERKEKTL 83
             EE  K  +E  +
Sbjct: 1649 AEELKKAEEENKI 1661



 Score = 42.4 bits (99), Expect = 4e-04
 Identities = 28/86 (32%), Positives = 46/86 (53%), Gaps = 4/86 (4%)

Query: 8    AGEKKKRIEKRNKEEEK---EENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
            A E KK  E + K EE    +E +++ +E +  +E K K ++ KKK  + KK  + KKK 
Sbjct: 1453 AEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKA 1512

Query: 65   EQSVKEIEESLKERKEKTLTEDSQCD 90
            ++  K+ EE+ K  + K   E  + D
Sbjct: 1513 DE-AKKAEEAKKADEAKKAEEAKKAD 1537



 Score = 41.7 bits (97), Expect = 6e-04
 Identities = 26/79 (32%), Positives = 48/79 (60%), Gaps = 5/79 (6%)

Query: 8    AGEKKKRIEKRNKEEEKEENEQEEKEKEDVE---EEKNKKKKKKKKKKKKKKKKKKKKKE 64
            A E KK+ E++ K +E ++  +E K+ ++ +   EE  K ++ KKK ++ KK  + KKK 
Sbjct: 1420 ADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKA 1479

Query: 65   EQSVK--EIEESLKERKEK 81
            E++ K  E ++  +E K+K
Sbjct: 1480 EEAKKADEAKKKAEEAKKK 1498



 Score = 41.7 bits (97), Expect = 6e-04
 Identities = 25/76 (32%), Positives = 47/76 (61%), Gaps = 2/76 (2%)

Query: 8    AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
            A E KK+ ++  K EE ++ ++ +K +E  + ++ KK ++KKK  + KK ++ KK EE  
Sbjct: 1505 AAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEE-- 1562

Query: 68   VKEIEESLKERKEKTL 83
             K+ EE+ K  ++K +
Sbjct: 1563 KKKAEEAKKAEEDKNM 1578



 Score = 41.7 bits (97), Expect = 6e-04
 Identities = 29/100 (29%), Positives = 49/100 (49%), Gaps = 19/100 (19%)

Query: 8    AGEKKKRIEKRNKEEEKEENE---------------QEEKEKEDVEEEKNKKKKKKKKKK 52
            A E KK  E    E E   +E               +E K+K D  ++K ++KKK  + K
Sbjct: 1338 AEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAK 1397

Query: 53   KKKKKKKKK----KKEEQSVKEIEESLKERKEKTLTEDSQ 88
            KK ++ KKK    KK   + K+ +E+ K+ +EK   ++++
Sbjct: 1398 KKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAK 1437



 Score = 41.3 bits (96), Expect = 8e-04
 Identities = 24/80 (30%), Positives = 43/80 (53%), Gaps = 4/80 (5%)

Query: 8    AGEKKKRIEKRNKEEEKEENEQEEKEKED----VEEEKNKKKKKKKKKKKKKKKKKKKKK 63
            A E KK+ E+  K +E ++  +E K+K D      E K K  + KK ++ KK  + KK +
Sbjct: 1472 ADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAE 1531

Query: 64   EEQSVKEIEESLKERKEKTL 83
            E +   E +++ +++K   L
Sbjct: 1532 EAKKADEAKKAEEKKKADEL 1551



 Score = 41.3 bits (96), Expect = 9e-04
 Identities = 31/83 (37%), Positives = 47/83 (56%), Gaps = 11/83 (13%)

Query: 10   EKKKRIEKRNKEEEKEENEQ----EEKEKEDVE-----EEKNKKKKKKKKKKKKKKKKKK 60
             K K  E +  EEEK++ EQ    E +EK+  E     EE+NK K  ++ KK ++ KKK 
Sbjct: 1618 AKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKA 1677

Query: 61   K--KKEEQSVKEIEESLKERKEK 81
            +  KK E+  K+  E+LK+  E+
Sbjct: 1678 EEAKKAEEDEKKAAEALKKEAEE 1700



 Score = 40.9 bits (95), Expect = 0.001
 Identities = 25/84 (29%), Positives = 47/84 (55%), Gaps = 1/84 (1%)

Query: 8    AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNK-KKKKKKKKKKKKKKKKKKKKEEQ 66
            A E KK  E + K EE ++ ++ +K+ E+ +++ ++ KK  + KKK  + KK ++ K+  
Sbjct: 1466 AEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKAD 1525

Query: 67   SVKEIEESLKERKEKTLTEDSQCD 90
              K+ EE+ K  + K   E  + D
Sbjct: 1526 EAKKAEEAKKADEAKKAEEKKKAD 1549



 Score = 40.1 bits (93), Expect = 0.002
 Identities = 23/78 (29%), Positives = 41/78 (52%)

Query: 8    AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
            A E KK  E + K EE ++   E K+  + +++ ++ KK ++ KK  + KK ++ K+   
Sbjct: 1479 AEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADE 1538

Query: 68   VKEIEESLKERKEKTLTE 85
             K+ EE  K  + K   E
Sbjct: 1539 AKKAEEKKKADELKKAEE 1556



 Score = 39.7 bits (92), Expect = 0.002
 Identities = 21/76 (27%), Positives = 47/76 (61%)

Query: 10   EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
            E K + E+  KE E+++ + EE +K++ E++K    KK+++KK ++ +K+K+   E+ + 
Sbjct: 1727 ENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELD 1786

Query: 70   EIEESLKERKEKTLTE 85
            E +E  +   +K + +
Sbjct: 1787 EEDEKRRMEVDKKIKD 1802



 Score = 39.7 bits (92), Expect = 0.002
 Identities = 24/82 (29%), Positives = 46/82 (56%), Gaps = 4/82 (4%)

Query: 11   KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
            KKK  E + K +E ++  + +K+ ++ ++ +  KK  + KK ++ KK  + KK E+  K 
Sbjct: 1489 KKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKA 1548

Query: 71   IE----ESLKERKEKTLTEDSQ 88
             E    E LK+ +EK   E+++
Sbjct: 1549 DELKKAEELKKAEEKKKAEEAK 1570



 Score = 39.0 bits (90), Expect = 0.004
 Identities = 22/80 (27%), Positives = 43/80 (53%)

Query: 1    MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
            +RK          R +   K EE  + ++ +K +E  + ++ KK ++KKK  + KKK ++
Sbjct: 1254 IRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEE 1313

Query: 61   KKKEEQSVKEIEESLKERKE 80
             KK +++ K+ EE+ K+   
Sbjct: 1314 AKKADEAKKKAEEAKKKADA 1333



 Score = 38.6 bits (89), Expect = 0.005
 Identities = 27/78 (34%), Positives = 43/78 (55%), Gaps = 1/78 (1%)

Query: 8    AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
            A EKKK  E +  EE+K    ++ +E +  EE + ++  K  +++KK K ++ KK EE  
Sbjct: 1560 AEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAK 1619

Query: 68   VKEIEESLKERKEKTLTE 85
            +K  EE  K  +EK   E
Sbjct: 1620 IKA-EELKKAEEEKKKVE 1636



 Score = 37.8 bits (87), Expect = 0.009
 Identities = 22/78 (28%), Positives = 45/78 (57%), Gaps = 2/78 (2%)

Query: 10   EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
            E+ K+ ++  K EE ++ ++ +K +E  + ++ KK ++ KK ++KKK ++ KK EE    
Sbjct: 1519 EEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNM 1578

Query: 70   EIE--ESLKERKEKTLTE 85
             +   E  K+ +E  + E
Sbjct: 1579 ALRKAEEAKKAEEARIEE 1596



 Score = 36.7 bits (84), Expect = 0.025
 Identities = 26/89 (29%), Positives = 49/89 (55%), Gaps = 8/89 (8%)

Query: 8    AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
            A E KK  E +  EE K+ +E ++ E++   +E  K ++ KK ++KKK ++ KK +E+++
Sbjct: 1518 AEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKN 1577

Query: 68   V--------KEIEESLKERKEKTLTEDSQ 88
            +        K+ EE+  E   K   E+ +
Sbjct: 1578 MALRKAEEAKKAEEARIEEVMKLYEEEKK 1606



 Score = 35.5 bits (81), Expect = 0.045
 Identities = 23/85 (27%), Positives = 46/85 (54%), Gaps = 6/85 (7%)

Query: 10   EKKKRIEKRNKEEEKEENEQEEKEKEDVE------EEKNKKKKKKKKKKKKKKKKKKKKK 63
            ++ K+ E+  K EEK++ E+ +K +ED        EE  K ++ + ++  K  +++KK K
Sbjct: 1549 DELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMK 1608

Query: 64   EEQSVKEIEESLKERKEKTLTEDSQ 88
             E++ K  E  +K  + K   E+ +
Sbjct: 1609 AEEAKKAEEAKIKAEELKKAEEEKK 1633



 Score = 35.5 bits (81), Expect = 0.048
 Identities = 18/72 (25%), Positives = 34/72 (47%), Gaps = 2/72 (2%)

Query: 10   EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
             K+        EE   + E+ +K +    EE  K ++ KKK +  +K ++ +K E+   +
Sbjct: 1083 AKEDNRADEATEEAFGKAEEAKKTETGKAEEARKAEEAKKKAEDARKAEEARKAED--AR 1140

Query: 70   EIEESLKERKEK 81
            + EE+ K    K
Sbjct: 1141 KAEEARKAEDAK 1152



 Score = 35.1 bits (80), Expect = 0.065
 Identities = 25/89 (28%), Positives = 44/89 (49%), Gaps = 8/89 (8%)

Query: 8    AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKK--------KKK 59
            A E KK  E +  EE+K+ +E ++ E+    EEK K ++ KK ++ K          KK 
Sbjct: 1530 AEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKA 1589

Query: 60   KKKKEEQSVKEIEESLKERKEKTLTEDSQ 88
            ++ + E+ +K  EE  K + E+    +  
Sbjct: 1590 EEARIEEVMKLYEEEKKMKAEEAKKAEEA 1618



 Score = 34.0 bits (77), Expect = 0.16
 Identities = 21/72 (29%), Positives = 37/72 (51%)

Query: 10   EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
            E +     R +   K E  ++  E +  EE+K   + KK ++KKK  + KKK +E +   
Sbjct: 1259 EARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKAD 1318

Query: 70   EIEESLKERKEK 81
            E ++  +E K+K
Sbjct: 1319 EAKKKAEEAKKK 1330



 Score = 33.2 bits (75), Expect = 0.25
 Identities = 17/79 (21%), Positives = 49/79 (62%), Gaps = 1/79 (1%)

Query: 2    RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
             K +    +K+   +K+  EE K++ E+E+K+   +++E+ KK ++ +K+K+   +++  
Sbjct: 1728 NKIKAEEAKKEAEEDKKKAEEAKKD-EEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELD 1786

Query: 62   KKEEQSVKEIEESLKERKE 80
            +++E+   E+++ +K+  +
Sbjct: 1787 EEDEKRRMEVDKKIKDIFD 1805



 Score = 32.8 bits (74), Expect = 0.30
 Identities = 31/97 (31%), Positives = 49/97 (50%), Gaps = 16/97 (16%)

Query: 8    AGEKKKRIEKRNKEEEKEENEQEEKEKEDVE------------EEKNK----KKKKKKKK 51
            A E KK  E+  K EE+  NE+  K +E               EE  K    KK ++KKK
Sbjct: 1232 AEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKK 1291

Query: 52   KKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQ 88
              + KK ++KKK +++ K+ EE+ K  + K   E+++
Sbjct: 1292 ADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAK 1328



 Score = 32.8 bits (74), Expect = 0.35
 Identities = 22/72 (30%), Positives = 39/72 (54%), Gaps = 1/72 (1%)

Query: 8    AGEKKKRIEKRNKEE-EKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
            A E +K  + R  E   K E E++ +E    E+ K  +  KK ++ KK  ++ KK +EE+
Sbjct: 1190 AEELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEER 1249

Query: 67   SVKEIEESLKER 78
            + +EI +  + R
Sbjct: 1250 NNEEIRKFEEAR 1261



 Score = 32.4 bits (73), Expect = 0.45
 Identities = 22/86 (25%), Positives = 44/86 (51%), Gaps = 2/86 (2%)

Query: 5    RGGAGEKKKRIEKRNKEEEK--EENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
            +  A E KK  E+RN EE +  EE       +     +  + +K  + KK ++KKK  + 
Sbjct: 1236 KKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEA 1295

Query: 63   KEEQSVKEIEESLKERKEKTLTEDSQ 88
            K+ +  K+ +E+ K+ +E    ++++
Sbjct: 1296 KKAEEKKKADEAKKKAEEAKKADEAK 1321



 Score = 30.9 bits (69), Expect = 1.4
 Identities = 22/84 (26%), Positives = 38/84 (45%), Gaps = 1/84 (1%)

Query: 2    RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
             K   G  E  K   K    + KE+N  +E  +E   + +  KK +  K ++ +K ++ K
Sbjct: 1062 AKAHVGQDEGLKPSYKDFDFDAKEDNRADEATEEAFGKAEEAKKTETGKAEEARKAEEAK 1121

Query: 62   KKEEQSVKEIEESLKERKEKTLTE 85
            KK E   ++ EE+ K    +   E
Sbjct: 1122 KKAED-ARKAEEARKAEDARKAEE 1144



 Score = 29.7 bits (66), Expect = 3.4
 Identities = 21/72 (29%), Positives = 38/72 (52%)

Query: 10   EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
            E+ ++ E+  K E+  + E   K +E+ + E+ +K +  KK +  KK ++ KK  E++ K
Sbjct: 1185 EEVRKAEELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKK 1244

Query: 70   EIEESLKERKEK 81
              EE   E   K
Sbjct: 1245 AEEERNNEEIRK 1256



 Score = 29.3 bits (65), Expect = 3.6
 Identities = 22/73 (30%), Positives = 36/73 (49%)

Query: 8    AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
            A E +K  E R  E+ ++     + E+E   EE  K +  KK +  KK ++ KK  EE  
Sbjct: 1184 AEEVRKAEELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAK 1243

Query: 68   VKEIEESLKERKE 80
              E E + +E ++
Sbjct: 1244 KAEEERNNEEIRK 1256



 Score = 29.0 bits (64), Expect = 6.0
 Identities = 22/98 (22%), Positives = 45/98 (45%), Gaps = 4/98 (4%)

Query: 10   EKKKRIEKRNKEEEKEENEQ----EEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
            +KKK  E +  EEEK++       EEK+ E++ +EK    +++  ++ +K++ +  KK +
Sbjct: 1742 DKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIK 1801

Query: 66   QSVKEIEESLKERKEKTLTEDSQCDTMIPVFKKGVLYK 103
                     ++  KE  L  +   +      K+    K
Sbjct: 1802 DIFDNFANIIEGGKEGNLVINDSKEMEDSAIKEVADSK 1839


>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 156 to 174 amino acids in length. This domain is
           found associated with pfam07780, pfam01728.
          Length = 154

 Score = 47.3 bits (113), Expect = 1e-06
 Identities = 19/59 (32%), Positives = 41/59 (69%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
           +K +    KE+E+EE E+ E E+ D EE+ ++  +K+  K K++K+++ ++K+++ +KE
Sbjct: 91  RKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKEILKE 149



 Score = 45.7 bits (109), Expect = 4e-06
 Identities = 21/59 (35%), Positives = 37/59 (62%), Gaps = 3/59 (5%)

Query: 7   GAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
           G  +K+K  E+   EEE E  E +E+E+ D   EK   K K++K+++ ++K+K+  KE+
Sbjct: 95  GLDKKEKEEEE---EEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKEILKEQ 150



 Score = 45.4 bits (108), Expect = 6e-06
 Identities = 17/55 (30%), Positives = 36/55 (65%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
           +     +K  +EEE+EE E EE ++E+  +E  +K+  K K++K+++ ++K+K+ 
Sbjct: 92  KLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKEI 146



 Score = 39.6 bits (93), Expect = 6e-04
 Identities = 15/56 (26%), Positives = 39/56 (69%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
           +K +++   +K+E++EE E+E + +E  EEE+  +  +K+  K K++K+++ ++++
Sbjct: 88  KKVRKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQ 143



 Score = 38.4 bits (90), Expect = 0.001
 Identities = 19/56 (33%), Positives = 39/56 (69%), Gaps = 2/56 (3%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKK--KKKKKKKKKKKKK 63
           +KK++ E+  +E E EE ++EE+  E +E+E  K K++K++  ++K+K+  K++ K
Sbjct: 97  DKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKEILKEQMK 152



 Score = 28.4 bits (64), Expect = 3.8
 Identities = 11/50 (22%), Positives = 31/50 (62%)

Query: 32  KEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
           K ++ V +     KK+K+++++++ + ++  +EEQ  + +E+ L + K +
Sbjct: 85  KWRKKVRKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKRE 134



 Score = 27.3 bits (61), Expect = 8.2
 Identities = 11/44 (25%), Positives = 26/44 (59%)

Query: 40  EKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTL 83
           +K +K     KK+K+++++++ + EE   +E  + L E++   L
Sbjct: 88  KKVRKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKL 131


>gnl|CDD|217502 pfam03343, SART-1, SART-1 family.  SART-1 is a protein involved in
           cell cycle arrest and pre-mRNA splicing. It has been
           shown to be a component of U4/U6 x U5 tri-snRNP complex
           in human, Schizosaccharomyces pombe and Saccharomyces
           cerevisiae. SART-1 is a known tumour antigen in a range
           of cancers recognised by T cells.
          Length = 603

 Score = 49.8 bits (119), Expect = 1e-06
 Identities = 27/89 (30%), Positives = 41/89 (46%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E KK+++  N   + +  E    +  DV E    KK KKKKKKKKK++K   + E +   
Sbjct: 242 EVKKKLKINNVSLDDDSTETPASDYYDVSEMVKFKKPKKKKKKKKKRRKDLDEDELEPEA 301

Query: 70  EIEESLKERKEKTLTEDSQCDTMIPVFKK 98
           E   S      K + E++      P  +K
Sbjct: 302 EGLGSSDSGSRKDVEEENARLEDSPKKRK 330



 Score = 35.5 bits (82), Expect = 0.041
 Identities = 26/90 (28%), Positives = 49/90 (54%), Gaps = 15/90 (16%)

Query: 2   RKRRGGAGEKKKRIEKRNK-EEEKEENEQEEK--------EKEDVEE------EKNKKKK 46
           +KR+  A  K+KR E R K  + +E+ E+  K        E +D ++      +K+KK++
Sbjct: 43  KKRQEEAEAKRKREELREKIAKAREKRERNSKLGGIKTLGEDDDDDDDTKAWLKKSKKRQ 102

Query: 47  KKKKKKKKKKKKKKKKKEEQSVKEIEESLK 76
           KKK+ ++KK     +K++E++ +   E L 
Sbjct: 103 KKKEAERKKALLLDEKEKERAAEYTSEDLA 132



 Score = 35.1 bits (81), Expect = 0.059
 Identities = 24/103 (23%), Positives = 42/103 (40%), Gaps = 24/103 (23%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKK--------------- 54
            K+ R     +  +K + E E K K +   EK  K ++K+++  K               
Sbjct: 30  TKESRDAAAYENWKKRQEEAEAKRKREELREKIAKAREKRERNSKLGGIKTLGEDDDDDD 89

Query: 55  ---------KKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQ 88
                    KK++KKK+ E +    ++E  KER  +  +ED  
Sbjct: 90  DTKAWLKKSKKRQKKKEAERKKALLLDEKEKERAAEYTSEDLA 132



 Score = 34.7 bits (80), Expect = 0.077
 Identities = 15/69 (21%), Positives = 32/69 (46%), Gaps = 3/69 (4%)

Query: 9  GEKKKRIEKRNKEEEKEENEQEEKE--KEDVEEEKNKKKKKKKK-KKKKKKKKKKKKKEE 65
          G K   +        KE  +    E  K+  EE + K+K+++ + K  K ++K+++  + 
Sbjct: 16 GLKPLPVPGAKPGSTKESRDAAAYENWKKRQEEAEAKRKREELREKIAKAREKRERNSKL 75

Query: 66 QSVKEIEES 74
            +K + E 
Sbjct: 76 GGIKTLGED 84



 Score = 32.0 bits (73), Expect = 0.51
 Identities = 23/109 (21%), Positives = 49/109 (44%), Gaps = 33/109 (30%)

Query: 11  KKKRIEKRNKEEEKEENEQE---------------EKEKEDVEEEKNKKKKKKK------ 49
           KKK+ +K+ + ++ +E+E E               + E+E+   E + KK+K++      
Sbjct: 279 KKKKKKKKKRRKDLDEDELEPEAEGLGSSDSGSRKDVEEENARLEDSPKKRKEEQEDDDF 338

Query: 50  ------------KKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTED 86
                       K+++  +KK+KK + E   ++I E   E +E+    +
Sbjct: 339 VEDDDDLQASLAKQRRLAQKKRKKLRPEDIARQIAEERSEDEEEAAENN 387



 Score = 30.1 bits (68), Expect = 2.3
 Identities = 17/82 (20%), Positives = 37/82 (45%), Gaps = 12/82 (14%)

Query: 15 IEKRNK----------EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
          IE+ NK               +    ++ ++    E  KK++++ + K+K+++ ++K  +
Sbjct: 5  IEETNKLRAKLGLKPLPVPGAKPGSTKESRDAAAYENWKKRQEEAEAKRKREELREKIAK 64

Query: 65 EQSVKEIEESLKERKEKTLTED 86
           +  +E    L     KTL ED
Sbjct: 65 AREKRERNSKLG--GIKTLGED 84



 Score = 29.3 bits (66), Expect = 3.4
 Identities = 23/100 (23%), Positives = 37/100 (37%), Gaps = 29/100 (29%)

Query: 21  EEEKEENEQEEKEKEDVEEEKNKKKKKKKKK----------------------------- 51
           +++K +N          ++E  KK+++ KKK                             
Sbjct: 216 KKKKSDNLFTLDSGGSTDDEAEKKRQEVKKKLKINNVSLDDDSTETPASDYYDVSEMVKF 275

Query: 52  KKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDT 91
           KK KKKKKKKKK  + + E E   +     +    S+ D 
Sbjct: 276 KKPKKKKKKKKKRRKDLDEDELEPEAEGLGSSDSGSRKDV 315



 Score = 28.6 bits (64), Expect = 5.8
 Identities = 13/84 (15%), Positives = 37/84 (44%), Gaps = 3/84 (3%)

Query: 1   MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
            +         +   +KR +E+E ++  +++ + +    ++ +  +KK+KK + +   ++
Sbjct: 312 RKDVEEENARLEDSPKKRKEEQEDDDFVEDDDDLQASLAKQRRLAQKKRKKLRPEDIARQ 371

Query: 61  KKKEEQSVKEIEESLKERKEKTLT 84
             +E     E EE   E  +  + 
Sbjct: 372 IAEERS---EDEEEAAENNDNGIV 392


>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
          Length = 434

 Score = 49.6 bits (119), Expect = 1e-06
 Identities = 18/66 (27%), Positives = 30/66 (45%), Gaps = 5/66 (7%)

Query: 12  KKRI----EKRNKEE-EKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
           K R+      + K   EK+  +  +K      E+K K+K+K K KK+ +  K   K+ + 
Sbjct: 369 KARVIDELRPKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKP 428

Query: 67  SVKEIE 72
           S    E
Sbjct: 429 SGTSEE 434



 Score = 44.5 bits (106), Expect = 5e-05
 Identities = 19/76 (25%), Positives = 40/76 (52%), Gaps = 2/76 (2%)

Query: 14  RIEKRNKEEEKEE--NEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
           +IE+  +E  K    +E   K K   E++  K  KK   K+ +KK+K+K+K + +     
Sbjct: 359 KIERYIEEPLKARVIDELRPKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRD 418

Query: 72  EESLKERKEKTLTEDS 87
            +++ +R++ + T + 
Sbjct: 419 TKNIGKRRKPSGTSEE 434



 Score = 38.8 bits (91), Expect = 0.004
 Identities = 16/54 (29%), Positives = 28/54 (51%), Gaps = 3/54 (5%)

Query: 11  KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
            +K+  K +K+   +  E++EKEKE     K KK+ +  K   K++K     +E
Sbjct: 384 SEKKTGKPSKKVLAKRAEKKEKEKEKP---KVKKRHRDTKNIGKRRKPSGTSEE 434



 Score = 38.4 bits (90), Expect = 0.005
 Identities = 16/64 (25%), Positives = 24/64 (37%), Gaps = 15/64 (23%)

Query: 3   KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
           K+ G   +K        KE+EK               EK K KK+ +  K   K++K   
Sbjct: 386 KKTGKPSKKVLAKRAEKKEKEK---------------EKPKVKKRHRDTKNIGKRRKPSG 430

Query: 63  KEEQ 66
             E+
Sbjct: 431 TSEE 434


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 48.7 bits (116), Expect = 2e-06
 Identities = 24/92 (26%), Positives = 44/92 (47%), Gaps = 9/92 (9%)

Query: 3   KRRGGAGEKKKRIEKRNKEEEKEENEQEE------KEKEDVEEEKNKKKKKKKKKKKKKK 56
           ++R  A +  K+ E+  K+ E+++ + EE       E +   E + +KK K++ KK+ ++
Sbjct: 95  EQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEE 154

Query: 57  KKKKKKKEEQSVKEIEESLK---ERKEKTLTE 85
           + K K   E   K  E   K   E K K   +
Sbjct: 155 EAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAK 186



 Score = 46.0 bits (109), Expect = 1e-05
 Identities = 15/70 (21%), Positives = 36/70 (51%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
            K+ ++R K+ E++  E E++   +   +K  +++   +K  K+ ++  K+ EE+  +  
Sbjct: 63  AKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAE 122

Query: 72  EESLKERKEK 81
           E   K+  E 
Sbjct: 123 EAKAKQAAEA 132



 Score = 44.4 bits (105), Expect = 5e-05
 Identities = 23/80 (28%), Positives = 38/80 (47%), Gaps = 1/80 (1%)

Query: 8   AGEKKKRIEKRNKEEEKEENEQEEKEK-EDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
           A E K +     K + + E E++ KE+ +   EE+ K K   + KKK  + KKK + E +
Sbjct: 121 AEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAK 180

Query: 67  SVKEIEESLKERKEKTLTED 86
           +  E +   K  + K   E 
Sbjct: 181 AKAEAKAKAKAEEAKAKAEA 200



 Score = 41.4 bits (97), Expect = 5e-04
 Identities = 19/77 (24%), Positives = 38/77 (49%), Gaps = 6/77 (7%)

Query: 11  KKKRIEKRNKEEEKEENEQEEKE-KEDVEEEKNKKKKKKKKKKKKKKKK-----KKKKKE 64
           KK   +    E+++   +  +KE ++    EK  K+ ++  K+ ++K+K     K K+  
Sbjct: 71  KKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAA 130

Query: 65  EQSVKEIEESLKERKEK 81
           E   K   E+ K+ KE+
Sbjct: 131 EAKAKAEAEAEKKAKEE 147



 Score = 40.6 bits (95), Expect = 7e-04
 Identities = 23/85 (27%), Positives = 35/85 (41%), Gaps = 6/85 (7%)

Query: 2   RKRRGGAGEKKKRIEKRNK--EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKK 59
            K +  A  +KK  E+  K  EEE +     E +K+   E K K + + K K + K K K
Sbjct: 132 AKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKK-AAEAKKKAEAEAKAKAEAKAKAK 190

Query: 60  ---KKKKEEQSVKEIEESLKERKEK 81
               K K E +  +       + E 
Sbjct: 191 AEEAKAKAEAAKAKAAAEAAAKAEA 215



 Score = 40.2 bits (94), Expect = 0.001
 Identities = 22/92 (23%), Positives = 43/92 (46%), Gaps = 11/92 (11%)

Query: 1   MRKRRGGAGEKKKRIEKRNK-EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKK- 58
             K+R     ++K +E+R   E+  ++ EQ  K+ E+ +++  + K K+  + K K +  
Sbjct: 80  AEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAE 139

Query: 59  ---------KKKKKEEQSVKEIEESLKERKEK 81
                    KK+ +EE   K   E+ K+  E 
Sbjct: 140 AEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEA 171



 Score = 39.8 bits (93), Expect = 0.001
 Identities = 20/78 (25%), Positives = 41/78 (52%), Gaps = 7/78 (8%)

Query: 11  KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
           +KK   K+ +E +K+  +Q E+ ++    E+ ++K+ +++   +K  K   + E+ + K+
Sbjct: 58  QKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAK---QAEQAA-KQ 113

Query: 71  IEESLK---ERKEKTLTE 85
            EE  K   E K K   E
Sbjct: 114 AEEKQKQAEEAKAKQAAE 131



 Score = 39.8 bits (93), Expect = 0.001
 Identities = 17/80 (21%), Positives = 30/80 (37%), Gaps = 2/80 (2%)

Query: 8   AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK--KKKEE 65
             +K+   E + K   + + +  E +K+   E K K + K K K ++ K K +  K K  
Sbjct: 147 EAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKAKAEAAKAKAA 206

Query: 66  QSVKEIEESLKERKEKTLTE 85
                  E+          E
Sbjct: 207 AEAAAKAEAEAAAAAAAEAE 226



 Score = 39.0 bits (91), Expect = 0.003
 Identities = 20/71 (28%), Positives = 30/71 (42%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
            K +  A  KKK  E + K E + + + E K K   EE K K +  K K   +   K + 
Sbjct: 156 AKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKAKAEAAKAKAAAEAAAKAEA 215

Query: 62  KKEEQSVKEIE 72
           +    +  E E
Sbjct: 216 EAAAAAAAEAE 226



 Score = 31.3 bits (71), Expect = 0.81
 Identities = 13/70 (18%), Positives = 38/70 (54%), Gaps = 8/70 (11%)

Query: 20  KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKK--------KKKKKEEQSVKEI 71
             ++    +Q++K     E+E+ KK +++ ++ +K++          +++   E++ K+ 
Sbjct: 48  VAQQANRIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQA 107

Query: 72  EESLKERKEK 81
           E++ K+ +EK
Sbjct: 108 EQAAKQAEEK 117


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 48.4 bits (116), Expect = 2e-06
 Identities = 20/72 (27%), Positives = 46/72 (63%), Gaps = 6/72 (8%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E  ++++K  +EEE++      K  E+  +E+ ++KK++KKK++++ K  K   EEQ  +
Sbjct: 256 EVLRKVDKTREEEEEKIL----KAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQ--R 309

Query: 70  EIEESLKERKEK 81
           ++EE  ++++ +
Sbjct: 310 KLEEKERKKQAR 321



 Score = 42.2 bits (100), Expect = 2e-04
 Identities = 17/50 (34%), Positives = 34/50 (68%), Gaps = 3/50 (6%)

Query: 17  KRNKEEEKEENEQEEKEKEDVEEEKNKKKK---KKKKKKKKKKKKKKKKK 63
            +  EEE++E  QE+KE++  EE + K  K   ++++K ++K++KK+ +K
Sbjct: 273 LKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQARK 322



 Score = 38.4 bits (90), Expect = 0.004
 Identities = 11/47 (23%), Positives = 32/47 (68%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKK 56
            +++R E+  +++E+++ E+ E +   +  E+ +K ++K++KK+ +K
Sbjct: 276 AEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQARK 322



 Score = 29.1 bits (66), Expect = 3.5
 Identities = 14/52 (26%), Positives = 28/52 (53%), Gaps = 8/52 (15%)

Query: 3   KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKK 54
           +R+  A EKK+  +K        E  + +  K   EE++  ++K++KK+ +K
Sbjct: 279 ERQEEAQEKKEEKKK--------EEREAKLAKLSPEEQRKLEEKERKKQARK 322


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 48.5 bits (116), Expect = 2e-06
 Identities = 18/67 (26%), Positives = 36/67 (53%)

Query: 14  RIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
             E  +  EE+ ++E   +  E   E  NK  + K+K K ++ K+K++K+ E+  KE ++
Sbjct: 242 LEESDDDGEEESDDESAWEGFESEYEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQ 301

Query: 74  SLKERKE 80
             K+  +
Sbjct: 302 LKKKLAQ 308



 Score = 42.0 bits (99), Expect = 4e-04
 Identities = 19/82 (23%), Positives = 39/82 (47%), Gaps = 15/82 (18%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEE----KNKKKKKKKKKKKKKKK---------KKKKK 62
              + EEE ++    E  + + E      + K+K K ++ K+K++K         K+ KK
Sbjct: 245 SDDDGEEESDDESAWEGFESEYEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKK 304

Query: 63  KEEQ--SVKEIEESLKERKEKT 82
           K  Q   +KEI + + ++++  
Sbjct: 305 KLAQLARLKEIAKEVAQKEKAR 326



 Score = 40.1 bits (94), Expect = 0.001
 Identities = 17/61 (27%), Positives = 32/61 (52%)

Query: 21  EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
            EE +++ +EE + E   E    + +   K  + K+K K ++ +E+  KE+E   KE K+
Sbjct: 242 LEESDDDGEEESDDESAWEGFESEYEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQ 301

Query: 81  K 81
            
Sbjct: 302 L 302



 Score = 38.5 bits (90), Expect = 0.004
 Identities = 14/70 (20%), Positives = 39/70 (55%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           ++K + ++  ++  KE   + ++EK+  ++     + K+  K+  +K+K + +K+EQ  +
Sbjct: 276 KRKTKAQRNKEKRRKELEREAKEEKQLKKKLAQLARLKEIAKEVAQKEKARARKKEQRKE 335

Query: 70  EIEESLKERK 79
             E+   +R+
Sbjct: 336 RGEKKKLKRR 345



 Score = 36.6 bits (85), Expect = 0.015
 Identities = 16/61 (26%), Positives = 39/61 (63%), Gaps = 8/61 (13%)

Query: 11  KKKRIEKRNKEEEKEENEQEEKE--------KEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
           +K+R E   + +E+++ +++  +        KE  ++EK + +KK+++K++ +KKK K++
Sbjct: 286 EKRRKELEREAKEEKQLKKKLAQLARLKEIAKEVAQKEKARARKKEQRKERGEKKKLKRR 345

Query: 63  K 63
           K
Sbjct: 346 K 346



 Score = 32.0 bits (73), Expect = 0.41
 Identities = 23/90 (25%), Positives = 46/90 (51%), Gaps = 13/90 (14%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEEN-------EQEEKEKEDVEEEKNKKKKKKKKKKKK 54
           +++R    E++ + EK+ K++  +         E  +KEK     +K ++K++ +KKK K
Sbjct: 285 KEKRRKELEREAKEEKQLKKKLAQLARLKEIAKEVAQKEKA-RARKKEQRKERGEKKKLK 343

Query: 55  KKKKKKKKKEEQSVK-----EIEESLKERK 79
           ++K  K K  E  ++     E+  SL+  K
Sbjct: 344 RRKLGKHKYPEPPLEVVLPDELSGSLRTLK 373



 Score = 30.8 bits (70), Expect = 1.2
 Identities = 18/74 (24%), Positives = 35/74 (47%), Gaps = 4/74 (5%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK----KKKKKKEEQSVKEI 71
           E  NK    +   + ++ KE   +E  ++ K++K+ KKK  +    K+  K+  Q  K  
Sbjct: 267 EPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKKLAQLARLKEIAKEVAQKEKAR 326

Query: 72  EESLKERKEKTLTE 85
               ++RKE+   +
Sbjct: 327 ARKKEQRKERGEKK 340



 Score = 29.7 bits (67), Expect = 2.3
 Identities = 15/54 (27%), Positives = 30/54 (55%)

Query: 35  EDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQ 88
           E+ E+E   +KK+++ ++ ++KK +K   E   + E+ E L E  +    E+S 
Sbjct: 201 EEYEKEVKAEKKRQELERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESD 254



 Score = 28.1 bits (63), Expect = 6.8
 Identities = 12/50 (24%), Positives = 23/50 (46%)

Query: 15 IEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
          +E++  E        E+ +++    +    +   KKK++KK KK  K  E
Sbjct: 21 LEEQRDERITGGVLAEKPDEDLFVVDTTGDEATIKKKRRKKLKKPLKSDE 70


>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
          Length = 470

 Score = 48.2 bits (115), Expect = 4e-06
 Identities = 34/132 (25%), Positives = 52/132 (39%), Gaps = 16/132 (12%)

Query: 6   GGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
               +  K+  K++K++  +++    +   + +E +NK+  KKKKKKKKKKKKK   +  
Sbjct: 16  QNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKKKKKNLGEAY 75

Query: 66  Q---------SVKEIEESLKERKEKTLTEDSQCDTMIP------VFKKGVLYKGIYCPSL 110
                     S    +++   RK     E     T  P       FK G    G      
Sbjct: 76  DLAYDLPVVWSSAAFQDNSHIRKLGNWPEQEWKQTQPPTIPVSKQFKDGEYPVGEIQEYP 135

Query: 111 -TNSFREPSLEM 121
             NS R  S E 
Sbjct: 136 GENSSRTSSEEK 147



 Score = 35.8 bits (83), Expect = 0.029
 Identities = 15/57 (26%), Positives = 26/57 (45%)

Query: 25 EENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
            NE  E E +  +++  +K  KKK KK KK          + + E +E+  ++  K
Sbjct: 1  AMNENGENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNK 57



 Score = 31.2 bits (71), Expect = 0.91
 Identities = 13/49 (26%), Positives = 25/49 (51%)

Query: 38 EEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTED 86
           E    + K++K++ K+K  KKK KK ++ V + +  L E   +    +
Sbjct: 3  NENGENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAE 51


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 47.8 bits (114), Expect = 5e-06
 Identities = 33/107 (30%), Positives = 56/107 (52%), Gaps = 24/107 (22%)

Query: 20  KEEEKE----ENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
           KE EKE    E E+E++EK++ EE+   K+++  ++++K++KKKK KK +++  E E   
Sbjct: 29  KEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELLN 88

Query: 76  KE-----RKEKTLTEDSQCDTMIPVFKKGVLYKGIYCPSLTNSFREP 117
           K      R  K +T++               Y   Y  SLTN + +P
Sbjct: 89  KTKPIWTRNPKDVTKEE--------------YAAFY-KSLTNDWEDP 120



 Score = 40.9 bits (96), Expect = 8e-04
 Identities = 23/79 (29%), Positives = 45/79 (56%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E++++ EK+ +EE+  + E+E  E+E+ EE+K K KK K+   + +   K K    ++ K
Sbjct: 40  EEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELLNKTKPIWTRNPK 99

Query: 70  EIEESLKERKEKTLTEDSQ 88
           ++ +       K+LT D +
Sbjct: 100 DVTKEEYAAFYKSLTNDWE 118


>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
           complex aNOP56 subunit; Provisional.
          Length = 414

 Score = 47.7 bits (114), Expect = 5e-06
 Identities = 26/53 (49%), Positives = 38/53 (71%), Gaps = 3/53 (5%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
           K+ + KR  EE KE+  +  K+K   EE+K +K+KKKKK+KKK KK+KKK ++
Sbjct: 365 KEELNKR-IEEIKEKYPKPPKKKR--EEKKPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 39.6 bits (93), Expect = 0.002
 Identities = 18/45 (40%), Positives = 31/45 (68%), Gaps = 5/45 (11%)

Query: 27  NEQEEKEKEDVEEE-----KNKKKKKKKKKKKKKKKKKKKKKEEQ 66
            E+  K  E+++E+     K K+++KK +K+KKKKK+KKK K+ +
Sbjct: 365 KEELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRK 409



 Score = 33.0 bits (76), Expect = 0.23
 Identities = 16/50 (32%), Positives = 34/50 (68%), Gaps = 1/50 (2%)

Query: 31  EKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
           E+  + +EE K +K  K  KKK+++KK +K+KK+++  K+ ++  K+ ++
Sbjct: 366 EELNKRIEEIK-EKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 32.6 bits (75), Expect = 0.32
 Identities = 16/53 (30%), Positives = 35/53 (66%), Gaps = 2/53 (3%)

Query: 31  EKEKEDVEE--EKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
           ++ KE++ +  E+ K+K  K  KKK+++KK +K+K+++  K+  +  K++  K
Sbjct: 362 DELKEELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414


>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19. 
           Med19 represents a family of conserved proteins which
           are members of the multi-protein co-activator Mediator
           complex. Mediator is required for activation of RNA
           polymerase II transcription by DNA binding
           transactivators.
          Length = 178

 Score = 45.6 bits (108), Expect = 7e-06
 Identities = 25/61 (40%), Positives = 36/61 (59%), Gaps = 4/61 (6%)

Query: 11  KKKRIEKRNKEEEKEENEQE---EKEKEDVEEEKNKKKKKKK-KKKKKKKKKKKKKKEEQ 66
           KKK   K  K   ++   +E   + E     E+K+KKKK +  K++KKKKK+KKKKK+  
Sbjct: 110 KKKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKKRH 169

Query: 67  S 67
           S
Sbjct: 170 S 170



 Score = 42.9 bits (101), Expect = 6e-05
 Identities = 22/56 (39%), Positives = 34/56 (60%), Gaps = 3/56 (5%)

Query: 17  KRNKEEEKEENEQEEKEKE---DVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           K++K + K+   Q+   +E   D E  K  +KK KKKK +  K++KKKKKE++  K
Sbjct: 111 KKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKK 166



 Score = 30.6 bits (69), Expect = 0.74
 Identities = 17/66 (25%), Positives = 31/66 (46%), Gaps = 5/66 (7%)

Query: 14  RIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKK-----KKKKKEEQSV 68
            I+   K+ + +  +   ++    E   + +  K  +KK KKKK      +KKKK+E+  
Sbjct: 105 HIQPPKKKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKK 164

Query: 69  KEIEES 74
           K+   S
Sbjct: 165 KKKRHS 170



 Score = 30.6 bits (69), Expect = 0.79
 Identities = 18/55 (32%), Positives = 29/55 (52%), Gaps = 15/55 (27%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
           E  K  EK++K+++                E +K++KKKKK+KKKKKK+   +  
Sbjct: 135 EGLKGHEKKHKKKKH---------------EDDKERKKKKKEKKKKKKRHSPEHP 174



 Score = 28.3 bits (63), Expect = 5.1
 Identities = 16/61 (26%), Positives = 27/61 (44%), Gaps = 18/61 (29%)

Query: 39  EEKNKKKKKKKKKKKKK------------------KKKKKKKKEEQSVKEIEESLKERKE 80
           +   KK K K KK + +                  KK KKKK E+   ++ ++  K++K+
Sbjct: 107 QPPKKKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKK 166

Query: 81  K 81
           K
Sbjct: 167 K 167


>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein.  The YqfQ-like protein family
           includes the B. subtilis YqfQ protein, also known as
           VrrA, which is functionally uncharacterized. This family
           of proteins is found in bacteria. Proteins in this
           family are typically between 146 and 237 amino acids in
           length. There are two conserved sequence motifs: QYGP
           and PKLY.
          Length = 155

 Score = 45.1 bits (107), Expect = 8e-06
 Identities = 21/47 (44%), Positives = 31/47 (65%)

Query: 19  NKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
           ++EEE EE   +E E+ED  E K + K+KKK++  K K +K+K K E
Sbjct: 96  DEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTE 142



 Score = 44.4 bits (105), Expect = 2e-05
 Identities = 19/68 (27%), Positives = 35/68 (51%), Gaps = 1/68 (1%)

Query: 14  RIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
           +I +       +E E+ E+E  D  E+++  + K + K+KKK++  K K E++  K   +
Sbjct: 86  KIFRELSSS-DDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPK 144

Query: 74  SLKERKEK 81
             K  K K
Sbjct: 145 KPKPSKPK 152



 Score = 38.6 bits (90), Expect = 0.001
 Identities = 16/57 (28%), Positives = 29/57 (50%)

Query: 26  ENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKT 82
           E    + E+E+ EEE   + +++   + K + K+KKK+E    K  +E  K   +K 
Sbjct: 90  ELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKP 146



 Score = 34.3 bits (79), Expect = 0.036
 Identities = 16/63 (25%), Positives = 33/63 (52%)

Query: 1   MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
            R+      E+++  E+   E E+E+  + + E ++ ++ +  K K +K+K K + KK K
Sbjct: 88  FRELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPK 147

Query: 61  KKK 63
             K
Sbjct: 148 PSK 150



 Score = 29.0 bits (65), Expect = 2.3
 Identities = 11/43 (25%), Positives = 22/43 (51%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKK 52
           E++   E + + +EK++ E  + + E  + +   KK K  K K
Sbjct: 110 EQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSKPK 152


>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6.  The surfeit locus
           protein SURF-6 is shown to be a component of the
           nucleolar matrix and has a strong binding capacity for
           nucleic acids.
          Length = 206

 Score = 45.0 bits (107), Expect = 1e-05
 Identities = 23/54 (42%), Positives = 37/54 (68%), Gaps = 1/54 (1%)

Query: 11  KKKRIEKRNKE-EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
           K+K+ +K  KE +E++E  +++K +   + E+N KK+K  KK KKKKK KKK +
Sbjct: 151 KEKQKKKSKKEWKERKEKVEKKKAERQKKREENLKKRKDDKKNKKKKKAKKKGR 204



 Score = 39.2 bits (92), Expect = 0.001
 Identities = 24/59 (40%), Positives = 41/59 (69%), Gaps = 7/59 (11%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKK------KKKKKKKKKKKKKKKKE 64
           KK ++++ K+++K + E +E+ KE VE++K +++KK      K+K  KK KKKKK KK+
Sbjct: 145 KKALKRKEKQKKKSKKEWKER-KEKVEKKKAERQKKREENLKKRKDDKKNKKKKKAKKK 202



 Score = 37.7 bits (88), Expect = 0.004
 Identities = 28/80 (35%), Positives = 45/80 (56%), Gaps = 9/80 (11%)

Query: 10  EKKKRIEKR--NKEEEKEENEQEEKEKEDVEEEKNK-------KKKKKKKKKKKKKKKKK 60
            +KK++E+   +K  E EE E+  K     E  K K       K  K+K+K+KKK KK+ 
Sbjct: 103 ARKKKLEELDEDKAAEIEEKEKWTKALAKAEGVKVKDDEKLLKKALKRKEKQKKKSKKEW 162

Query: 61  KKKEEQSVKEIEESLKERKE 80
           K+++E+  K+  E  K+R+E
Sbjct: 163 KERKEKVEKKKAERQKKREE 182



 Score = 36.9 bits (86), Expect = 0.008
 Identities = 29/98 (29%), Positives = 50/98 (51%), Gaps = 18/98 (18%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKK----------- 50
           ++RR     K ++ +KR + ++KE+ ++ E E+   EE K+KKK    +           
Sbjct: 10  QRRRKREQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPIENAEGNIVFSKV 69

Query: 51  -------KKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
                   KK  K KKKKKK++   K++ + L+ RK+K
Sbjct: 70  EFADGEQAKKDLKLKKKKKKKKTDYKQLLKKLEARKKK 107



 Score = 35.7 bits (83), Expect = 0.019
 Identities = 13/45 (28%), Positives = 30/45 (66%)

Query: 38 EEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKT 82
             + +++K++++K +KK+K+K+ KK+E + K   E +K  + K+
Sbjct: 6  ALLEQRRRKREQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENKS 50



 Score = 35.0 bits (81), Expect = 0.031
 Identities = 16/56 (28%), Positives = 33/56 (58%)

Query: 32 KEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDS 87
            +E + E++ +K++++K +KK+K+K+ KKK++ Q  +  E   +E K K      
Sbjct: 2  SSREALLEQRRRKREQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPI 57



 Score = 32.3 bits (74), Expect = 0.23
 Identities = 14/58 (24%), Positives = 29/58 (50%)

Query: 24 KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
              E   +++    E++  +KK+K+K+ KKK+  +K + EE   +E +   K    +
Sbjct: 1  PSSREALLEQRRRKREQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPIE 58



 Score = 30.0 bits (68), Expect = 1.6
 Identities = 15/57 (26%), Positives = 33/57 (57%)

Query: 15 IEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
          +E+R ++ E+ +  +++K KE  ++E  +K + ++ K ++ K KKK    E +   I
Sbjct: 8  LEQRRRKREQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPIENAEGNI 64



 Score = 28.4 bits (64), Expect = 4.1
 Identities = 10/58 (17%), Positives = 34/58 (58%)

Query: 45  KKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDTMIPVFKKGVLY 102
           +++++K++++K +KK+K+KE +  ++ ++S  E  +    +  +    I   +  +++
Sbjct: 9   EQRRRKREQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPIENAEGNIVF 66



 Score = 28.0 bits (63), Expect = 5.9
 Identities = 11/55 (20%), Positives = 31/55 (56%)

Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
          E+++R  ++ K  +K++ ++ +K+++  + E  + K ++ K KKK    +  +  
Sbjct: 9  EQRRRKREQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPIENAEGN 63


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 46.3 bits (111), Expect = 2e-05
 Identities = 19/92 (20%), Positives = 49/92 (53%), Gaps = 8/92 (8%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKK--------KKKKKKK 61
           E KK  E   KE   E  E+  K + + E+E  +++ + +K +K+         +K +  
Sbjct: 46  EAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELL 105

Query: 62  KKEEQSVKEIEESLKERKEKTLTEDSQCDTMI 93
           +K E+ +++ E+ L++++++   ++ + + +I
Sbjct: 106 EKREEELEKKEKELEQKQQELEKKEEELEELI 137



 Score = 44.4 bits (106), Expect = 7e-05
 Identities = 19/76 (25%), Positives = 44/76 (57%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           + +   EK  +E   E  + E++  +  E    K +  +K++++ +KK+K+ ++++Q ++
Sbjct: 68  KLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELE 127

Query: 70  EIEESLKERKEKTLTE 85
           + EE L+E  E+ L E
Sbjct: 128 KKEEELEELIEEQLQE 143



 Score = 40.9 bits (97), Expect = 7e-04
 Identities = 18/62 (29%), Positives = 42/62 (67%), Gaps = 1/62 (1%)

Query: 13  KRIEKRNKE-EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
           +++EKR  + EE  + + E  EK + E EK +K+ ++K+++ +KK+++ ++  E+ ++E+
Sbjct: 85  QKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQEL 144

Query: 72  EE 73
           E 
Sbjct: 145 ER 146



 Score = 40.5 bits (96), Expect = 0.001
 Identities = 22/81 (27%), Positives = 44/81 (54%), Gaps = 4/81 (4%)

Query: 4   RRGGAGEKKKRIEKRNK---EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
           R+  A  K K  E+  K   EE K+E E  +KE     +E+  K + + +K+ ++++ + 
Sbjct: 25  RKKIAEAKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNEL 84

Query: 61  KKKEEQSVKEIEESLKERKEK 81
           +K E++ + + EE+L  + E 
Sbjct: 85  QKLEKR-LLQKEENLDRKLEL 104



 Score = 36.7 bits (86), Expect = 0.016
 Identities = 17/79 (21%), Positives = 49/79 (62%), Gaps = 5/79 (6%)

Query: 8   AGEKKKRIEKRNKEEEKE-ENEQEEKE---KEDVEEEKNKKKKKKKKKKKKKKKKKKK-K 62
             E ++  ++  +E +KE E  ++E     KE++ + +N+ +K+ ++++ + +K +K+  
Sbjct: 33  IKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLL 92

Query: 63  KEEQSVKEIEESLKERKEK 81
           ++E+++    E L++R+E+
Sbjct: 93  QKEENLDRKLELLEKREEE 111


>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
           This is a family of fungal proteins whose function is
           unknown.
          Length = 130

 Score = 43.0 bits (102), Expect = 2e-05
 Identities = 19/62 (30%), Positives = 34/62 (54%), Gaps = 14/62 (22%)

Query: 1   MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
            +++R G   KK+RI  R + E  +E             EK K+ +K ++KK K+++K+K
Sbjct: 83  KKRKRPG---KKRRIALRLRRERTKE-----------RAEKEKRTRKNREKKFKRRQKEK 128

Query: 61  KK 62
           +K
Sbjct: 129 EK 130



 Score = 37.6 bits (88), Expect = 0.002
 Identities = 16/58 (27%), Positives = 34/58 (58%), Gaps = 2/58 (3%)

Query: 24  KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
           K+ N + EKE    +E+K K+K+  KK++   + ++++ KE    +  + + K R++K
Sbjct: 65  KKHNAKVEKELLREKEKKKKRKRPGKKRRIALRLRRERTKERAEKE--KRTRKNREKK 120


>gnl|CDD|219901 pfam08555, DUF1754, Eukaryotic family of unknown function
          (DUF1754).  This is a eukaryotic protein family of
          unknown function.
          Length = 90

 Score = 42.0 bits (99), Expect = 2e-05
 Identities = 22/48 (45%), Positives = 26/48 (54%)

Query: 41 KNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQ 88
          K KK   KKKKKKKKKK K K++     +E E+S  E   K   ED  
Sbjct: 13 KGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEEDED 60



 Score = 35.5 bits (82), Expect = 0.005
 Identities = 18/62 (29%), Positives = 30/62 (48%)

Query: 43  KKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDTMIPVFKKGVLY 102
           K K KK   KKKKKKKKKK K ++ V   +E  ++   ++  ++ + D      ++    
Sbjct: 11  KLKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEEDEDNEKIEQEEDG 70

Query: 103 KG 104
             
Sbjct: 71  MN 72



 Score = 34.7 bits (80), Expect = 0.009
 Identities = 20/45 (44%), Positives = 25/45 (55%)

Query: 43 KKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDS 87
            K K K KK   KKKKKKKK++   KE   + KE +EK+  E  
Sbjct: 7  GGKLKLKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESD 51



 Score = 34.3 bits (79), Expect = 0.013
 Identities = 22/61 (36%), Positives = 32/61 (52%)

Query: 32 KEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDT 91
          K K    + K KKKKKKKK K K++   +K++EE+S  E +    E  E     + + D 
Sbjct: 11 KLKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEEDEDNEKIEQEEDG 70

Query: 92 M 92
          M
Sbjct: 71 M 71


>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.
          This domain is found to the N-terminus of bacterial
          signal peptidases of the S49 family (pfam01343).
          Length = 154

 Score = 43.7 bits (104), Expect = 2e-05
 Identities = 21/65 (32%), Positives = 33/65 (50%), Gaps = 1/65 (1%)

Query: 3  KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
           ++    + +  I   N EE K+  E  E    D +E K  +K +KK +K K K +KKK 
Sbjct: 32 AQKKKGKKGELEITDLN-EEYKDLKESLEAALLDKKELKAWEKAEKKAEKAKAKAEKKKA 90

Query: 63 KEEQS 67
          K+E+ 
Sbjct: 91 KKEEP 95



 Score = 37.9 bits (89), Expect = 0.002
 Identities = 17/48 (35%), Positives = 26/48 (54%)

Query: 27 NEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES 74
          NE+ +  KE +E     KK+ K  +K +KK +K K K E+   + EE 
Sbjct: 48 NEEYKDLKESLEAALLDKKELKAWEKAEKKAEKAKAKAEKKKAKKEEP 95



 Score = 28.6 bits (65), Expect = 2.7
 Identities = 15/50 (30%), Positives = 22/50 (44%), Gaps = 16/50 (32%)

Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
          KK ++   K E+K E                K K K +KKK KK++ K +
Sbjct: 65 KKELKAWEKAEKKAE----------------KAKAKAEKKKAKKEEPKPR 98



 Score = 28.6 bits (65), Expect = 3.0
 Identities = 14/50 (28%), Positives = 21/50 (42%), Gaps = 17/50 (34%)

Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKK 59
          ++ K  EK  K+ EK                  K K +KKK KK++ K +
Sbjct: 66 KELKAWEKAEKKAEKA-----------------KAKAEKKKAKKEEPKPR 98


>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
          Length = 158

 Score = 43.6 bits (103), Expect = 3e-05
 Identities = 24/64 (37%), Positives = 34/64 (53%)

Query: 22  EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
            E+ E EQ+E  KE  E+E  K KK+KK+KK+KK  +K  KK+     +       +K  
Sbjct: 92  PEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTT 151

Query: 82  TLTE 85
           T  E
Sbjct: 152 TKKE 155



 Score = 40.9 bits (96), Expect = 2e-04
 Identities = 20/64 (31%), Positives = 35/64 (54%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
            +K++ E   + EEKE  + ++++KE  E++  +K  KKK  K  K   KK  K+  + K
Sbjct: 95  SEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTTKK 154

Query: 70  EIEE 73
           E  +
Sbjct: 155 EEGK 158



 Score = 40.1 bits (94), Expect = 4e-04
 Identities = 22/62 (35%), Positives = 37/62 (59%)

Query: 21  EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
           EE ++E ++  KE E+ E  K KK+KK+KK+KK  +K  KKK  + +    +++ K+   
Sbjct: 93  EESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTT 152

Query: 81  KT 82
           K 
Sbjct: 153 KK 154



 Score = 39.7 bits (93), Expect = 5e-04
 Identities = 21/62 (33%), Positives = 33/62 (53%)

Query: 20  KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERK 79
           KE+++   E EEKE    ++EK +KK+KK  +K  KKK  K  K        + + K+ +
Sbjct: 97  KEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTTKKEE 156

Query: 80  EK 81
            K
Sbjct: 157 GK 158



 Score = 37.8 bits (88), Expect = 0.003
 Identities = 18/64 (28%), Positives = 35/64 (54%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
            ++ ++E  +E E++E  K   E+++ K+KK  +K  KKK  K  K   +++ K+     
Sbjct: 95  SEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTTKK 154

Query: 76  KERK 79
           +E K
Sbjct: 155 EEGK 158



 Score = 36.7 bits (85), Expect = 0.006
 Identities = 12/58 (20%), Positives = 30/58 (51%)

Query: 30  EEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDS 87
            E+ +++ +E   + ++K+  K KK+KK+KK+KK  + + + + +   +         
Sbjct: 92  PEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKK 149



 Score = 33.6 bits (77), Expect = 0.061
 Identities = 20/63 (31%), Positives = 34/63 (53%)

Query: 38  EEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDTMIPVFK 97
           E EK +K+  K+ ++K+  K KK+KKE++  K  E+  K++  KT    ++  T     K
Sbjct: 94  ESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTTK 153

Query: 98  KGV 100
           K  
Sbjct: 154 KEE 156



 Score = 32.0 bits (73), Expect = 0.24
 Identities = 13/56 (23%), Positives = 35/56 (62%)

Query: 33  EKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQ 88
           E+ + E+++  K+ ++K+  K KK+KK+KK+++ + K  ++   +  + T  + ++
Sbjct: 93  EESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATK 148


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 45.1 bits (107), Expect = 3e-05
 Identities = 20/72 (27%), Positives = 39/72 (54%), Gaps = 1/72 (1%)

Query: 11  KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
           K K+ +K N E +K+  ++++K+KE  E +   + K   K  KK KK KKK  + +  ++
Sbjct: 56  KDKKEDK-NNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNED 114

Query: 71  IEESLKERKEKT 82
           ++ +  +  E  
Sbjct: 115 VDNAFNKIAELA 126



 Score = 42.0 bits (99), Expect = 3e-04
 Identities = 18/64 (28%), Positives = 36/64 (56%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           +K    +K++++++K++ E++E + E   +   K  KK KK KKK  K K  +  + +  
Sbjct: 61  DKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAFN 120

Query: 70  EIEE 73
           +I E
Sbjct: 121 KIAE 124



 Score = 42.0 bits (99), Expect = 3e-04
 Identities = 24/64 (37%), Positives = 35/64 (54%), Gaps = 2/64 (3%)

Query: 20 KEEEKEENEQEEKEKED--VEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKE 77
          KE     +E+E K       +++K  K  + KKK +KKKKKKK+KKE +S  E +   K 
Sbjct: 36 KEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKT 95

Query: 78 RKEK 81
           K+ 
Sbjct: 96 PKKS 99



 Score = 40.9 bits (96), Expect = 8e-04
 Identities = 26/77 (33%), Positives = 42/77 (54%), Gaps = 3/77 (3%)

Query: 5   RGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
                  K+ +   ++EE K      +K+K+   E+KN + KKK +KKKKKKK+KK+ K 
Sbjct: 29  SNVLVLSKEILSTFSEEENKVATTSTKKDKK---EDKNNESKKKSEKKKKKKKEKKEPKS 85

Query: 65  EQSVKEIEESLKERKEK 81
           E   K   ++ K+ K+ 
Sbjct: 86  EGETKLGFKTPKKSKKT 102



 Score = 40.5 bits (95), Expect = 9e-04
 Identities = 18/69 (26%), Positives = 28/69 (40%), Gaps = 1/69 (1%)

Query: 21 EEEKEENEQEEKEKED-VEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERK 79
                     KE      EE+NK      KK KK+ K  + KK+ +  K+ ++  KE K
Sbjct: 25 SYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPK 84

Query: 80 EKTLTEDSQ 88
           +  T+   
Sbjct: 85 SEGETKLGF 93



 Score = 40.5 bits (95), Expect = 0.001
 Identities = 20/75 (26%), Positives = 43/75 (57%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
               E+ NK       + ++++K +  ++K++KKKKKKK+KK+ K + + K   ++ K+ 
Sbjct: 40  STFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKS 99

Query: 72  EESLKERKEKTLTED 86
           +++ K+  +    ED
Sbjct: 100 KKTKKKPPKPKPNED 114


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 45.4 bits (108), Expect = 4e-05
 Identities = 20/73 (27%), Positives = 44/73 (60%), Gaps = 1/73 (1%)

Query: 9   GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
            E  + I + + E  +   E E+ EKE V+E +  K++ ++ +K+ +  +  K+K E+ +
Sbjct: 203 EEVLREINEISSELPELREELEKLEKE-VKELEELKEEIEELEKELESLEGSKRKLEEKI 261

Query: 69  KEIEESLKERKEK 81
           +E+EE ++E K++
Sbjct: 262 RELEERIEELKKE 274



 Score = 44.3 bits (105), Expect = 9e-05
 Identities = 25/87 (28%), Positives = 47/87 (54%), Gaps = 8/87 (9%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKK-------KKKKKKKK 54
              +    EK + +E+R +E +KE  E EEK KE ++E K K ++        ++   + 
Sbjct: 251 EGSKRKLEEKIRELEERIEELKKEIEELEEKVKE-LKELKEKAEEYIKLSEFYEEYLDEL 309

Query: 55  KKKKKKKKKEEQSVKEIEESLKERKEK 81
           ++ +K+  + E+ +  IEE +KE +EK
Sbjct: 310 REIEKRLSRLEEEINGIEERIKELEEK 336



 Score = 39.7 bits (93), Expect = 0.002
 Identities = 26/72 (36%), Positives = 48/72 (66%), Gaps = 1/72 (1%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E+ +++EK  KE E+ + E EE EKE    E +K+K ++K ++ +++ ++ KK+ E+ ++
Sbjct: 221 EELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEE-LE 279

Query: 70  EIEESLKERKEK 81
           E  + LKE KEK
Sbjct: 280 EKVKELKELKEK 291



 Score = 38.9 bits (91), Expect = 0.003
 Identities = 15/72 (20%), Positives = 39/72 (54%), Gaps = 1/72 (1%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKE-DVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
           +  + + +  K  E+   E EE EK+   EE +  +++  +  ++    + + ++ E+  
Sbjct: 630 KAFEELAETEKRLEELRKELEELEKKYSEEEYEELREEYLELSRELAGLRAELEELEKRR 689

Query: 69  KEIEESLKERKE 80
           +EI+++L++ KE
Sbjct: 690 EEIKKTLEKLKE 701



 Score = 38.1 bits (89), Expect = 0.007
 Identities = 24/68 (35%), Positives = 44/68 (64%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E +++++K N EE +++ E+ EK KE + + K + K  KK+ +K ++ KKK  + E+ + 
Sbjct: 507 ELEEKLKKYNLEELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKKLD 566

Query: 70  EIEESLKE 77
           E+EE L E
Sbjct: 567 ELEEELAE 574



 Score = 37.7 bits (88), Expect = 0.008
 Identities = 19/87 (21%), Positives = 49/87 (56%), Gaps = 2/87 (2%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
            +   G  E+ K +E++ +  E+ + + +E EK   E E+  +  ++ K KK++ ++ KK
Sbjct: 320 EEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKK 379

Query: 62  KKEEQSVKEIEESLK--ERKEKTLTED 86
           +    + +++E+ L+  E+ ++ + E+
Sbjct: 380 RLTGLTPEKLEKELEELEKAKEEIEEE 406



 Score = 37.0 bits (86), Expect = 0.016
 Identities = 19/73 (26%), Positives = 49/73 (67%), Gaps = 1/73 (1%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE-EQSV 68
           E +KR+E+  KE E+ E +  E+E E++ EE  +  ++    + + ++ +K+++E ++++
Sbjct: 637 ETEKRLEELRKELEELEKKYSEEEYEELREEYLELSRELAGLRAELEELEKRREEIKKTL 696

Query: 69  KEIEESLKERKEK 81
           ++++E L+ER++ 
Sbjct: 697 EKLKEELEEREKA 709



 Score = 36.2 bits (84), Expect = 0.029
 Identities = 20/74 (27%), Positives = 40/74 (54%), Gaps = 3/74 (4%)

Query: 11  KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE-EQSVK 69
           ++   E R  E+     E+E    E+  +E  +K+++ ++ KKK K+ +K+ +E E+  +
Sbjct: 303 EEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEERHE 362

Query: 70  EIEE--SLKERKEK 81
             EE  + KE  E+
Sbjct: 363 LYEEAKAKKEELER 376



 Score = 35.8 bits (83), Expect = 0.037
 Identities = 21/76 (27%), Positives = 44/76 (57%), Gaps = 4/76 (5%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEK--EDVEEEKNKKKKKKKKKKKKKKKKKKKKKE--E 65
           E+ + ++K+ KE EK   E EE+ +  E+ + +K + ++ KK+      +K +K+ E  E
Sbjct: 338 ERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELE 397

Query: 66  QSVKEIEESLKERKEK 81
           ++ +EIEE + +   +
Sbjct: 398 KAKEEIEEEISKITAR 413



 Score = 35.8 bits (83), Expect = 0.039
 Identities = 21/58 (36%), Positives = 35/58 (60%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           KKR+     E+ ++E E+ EK KE++EEE +K   +  + KK+ K+ KK  +E +  K
Sbjct: 378 KKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIEELKKAK 435



 Score = 35.0 bits (81), Expect = 0.064
 Identities = 21/91 (23%), Positives = 44/91 (48%), Gaps = 5/91 (5%)

Query: 1   MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
           + + +    E +K +E     + K E  ++ +E E+  EE  K+ ++ ++K K+ K+ K+
Sbjct: 233 LEELKEEIEELEKELESLEGSKRKLE--EKIRELEERIEELKKEIEELEEKVKELKELKE 290

Query: 61  KKKEEQSVKEIEESLKERK---EKTLTEDSQ 88
           K +E   + E  E   +     EK L+   +
Sbjct: 291 KAEEYIKLSEFYEEYLDELREIEKRLSRLEE 321



 Score = 35.0 bits (81), Expect = 0.066
 Identities = 18/72 (25%), Positives = 40/72 (55%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
            ++        EE  +E  + EK    +EEE N  +++ K+ ++K+++ ++ KK+ + ++
Sbjct: 292 AEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELE 351

Query: 70  EIEESLKERKEK 81
           +  E L+ER E 
Sbjct: 352 KRLEELEERHEL 363



 Score = 33.9 bits (78), Expect = 0.14
 Identities = 20/73 (27%), Positives = 43/73 (58%), Gaps = 4/73 (5%)

Query: 11  KKKRIEKRNKEEE----KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
           ++ R E      E    + E E+ EK +E++++   K K++ ++++K KK+ +K +K  +
Sbjct: 662 EELREEYLELSRELAGLRAELEELEKRREEIKKTLEKLKEELEEREKAKKELEKLEKALE 721

Query: 67  SVKEIEESLKERK 79
            V+E+ E +K+ K
Sbjct: 722 RVEELREKVKKYK 734



 Score = 33.5 bits (77), Expect = 0.17
 Identities = 17/64 (26%), Positives = 39/64 (60%)

Query: 17  KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLK 76
           ++  E E++E ++ E+E +   EE  + +K+ ++ +K+ ++ +KK  EE+  +  EE L+
Sbjct: 611 EKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEEYEELREEYLE 670

Query: 77  ERKE 80
             +E
Sbjct: 671 LSRE 674



 Score = 33.5 bits (77), Expect = 0.19
 Identities = 23/90 (25%), Positives = 47/90 (52%), Gaps = 5/90 (5%)

Query: 1   MRKRRGGAGEKKKRIEKRNKE--EEKEENEQEEKEKEDVEEE-KNKKKKKKKKKKKKKKK 57
           +R+      ++ K +E+  +E  E ++E E  E  K  +EE+ +  +++ ++ KK+ ++ 
Sbjct: 219 LREELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEEL 278

Query: 58  KKKKK--KEEQSVKEIEESLKERKEKTLTE 85
           ++K K  KE +   E    L E  E+ L E
Sbjct: 279 EEKVKELKELKEKAEEYIKLSEFYEEYLDE 308



 Score = 33.1 bits (76), Expect = 0.26
 Identities = 17/72 (23%), Positives = 40/72 (55%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E+   + K  +E   E  E+ E+  +++E   N+  + K  +K+ ++++K+ KK E+ + 
Sbjct: 570 EELAELLKELEELGFESVEELEERLKELEPFYNEYLELKDAEKELEREEKELKKLEEELD 629

Query: 70  EIEESLKERKEK 81
           +  E L E +++
Sbjct: 630 KAFEELAETEKR 641



 Score = 33.1 bits (76), Expect = 0.28
 Identities = 21/77 (27%), Positives = 43/77 (55%), Gaps = 4/77 (5%)

Query: 1   MRKRRGGAGEKKKRIEKRNKEEEKE----ENEQEEKEKEDVEEEKNKKKKKKKKKKKKKK 56
           + +   G   + + +EKR +E +K     + E EE+EK   E EK +K  ++ ++ ++K 
Sbjct: 671 LSRELAGLRAELEELEKRREEIKKTLEKLKEELEEREKAKKELEKLEKALERVEELREKV 730

Query: 57  KKKKKKKEEQSVKEIEE 73
           KK K   +E+++ ++ E
Sbjct: 731 KKYKALLKERALSKVGE 747



 Score = 32.3 bits (74), Expect = 0.39
 Identities = 13/65 (20%), Positives = 35/65 (53%), Gaps = 1/65 (1%)

Query: 17  KRNKEEEKEENEQEEKEKEDVEEE-KNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
            +  +   E  E+  K  E++EE  K K+K+ ++  ++  +   +  +  + ++++E+ +
Sbjct: 171 IKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEV 230

Query: 76  KERKE 80
           KE +E
Sbjct: 231 KELEE 235



 Score = 32.3 bits (74), Expect = 0.41
 Identities = 14/74 (18%), Positives = 42/74 (56%), Gaps = 1/74 (1%)

Query: 8   AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
            GE  K I++R  E  ++  ++ E  +E ++E++ + ++  ++  +   +  + ++E + 
Sbjct: 167 LGEVIKEIKRR-IERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEK 225

Query: 68  VKEIEESLKERKEK 81
           +++  + L+E KE+
Sbjct: 226 LEKEVKELEELKEE 239



 Score = 31.2 bits (71), Expect = 0.89
 Identities = 22/76 (28%), Positives = 44/76 (57%), Gaps = 3/76 (3%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV---KEIE 72
            K   EE   E ++ EKE +++EE++ K +K+ ++ +K  KK+ +  K ++     KE+E
Sbjct: 450 RKELLEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIKLKELAEQLKELE 509

Query: 73  ESLKERKEKTLTEDSQ 88
           E LK+   + L + ++
Sbjct: 510 EKLKKYNLEELEKKAE 525



 Score = 31.2 bits (71), Expect = 1.0
 Identities = 21/65 (32%), Positives = 39/65 (60%), Gaps = 3/65 (4%)

Query: 21  EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV---KEIEESLKE 77
           E  KE  E+   E + +E+E  + ++K++K +K+ ++ +K  K+E  +   KE+ E LKE
Sbjct: 448 EHRKELLEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIKLKELAEQLKE 507

Query: 78  RKEKT 82
            +EK 
Sbjct: 508 LEEKL 512



 Score = 30.8 bits (70), Expect = 1.3
 Identities = 18/70 (25%), Positives = 37/70 (52%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           +K K +EKR +E E+     EE + +  E E+ KK+      +K +K+ ++ +K ++ ++
Sbjct: 345 KKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIE 404

Query: 70  EIEESLKERK 79
           E    +  R 
Sbjct: 405 EEISKITARI 414



 Score = 30.8 bits (70), Expect = 1.5
 Identities = 25/74 (33%), Positives = 41/74 (55%), Gaps = 3/74 (4%)

Query: 8   AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
           A  KK+ +E+  K       E+ EKE E++E+ K + +++  K   +  + KK+ KE + 
Sbjct: 367 AKAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELK- 425

Query: 68  VKEIEESLKERKEK 81
            K IEE LK+ K K
Sbjct: 426 -KAIEE-LKKAKGK 437



 Score = 30.0 bits (68), Expect = 2.0
 Identities = 27/93 (29%), Positives = 47/93 (50%), Gaps = 13/93 (13%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEE--KEKEDVEEEKNKKKKKKKKKKKKKKKKK 59
            K      EK++++ K  +E EK   ++ E  K KE  E+ K  ++K KK   ++ +KK 
Sbjct: 465 EKELKEIEEKERKLRKELRELEKVLKKESELIKLKELAEQLKELEEKLKKYNLEELEKKA 524

Query: 60  KK-----------KKEEQSVKEIEESLKERKEK 81
           ++           K E +S+K+  E L+E K+K
Sbjct: 525 EEYEKLKEKLIKLKGEIKSLKKELEKLEELKKK 557



 Score = 28.9 bits (65), Expect = 5.0
 Identities = 13/64 (20%), Positives = 38/64 (59%), Gaps = 2/64 (3%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE--QSVKEIEE 73
           E ++ ++E E+ E+ +K+  ++E++ ++ +++  +  K+ ++   +  EE  + +KE+E 
Sbjct: 540 EIKSLKKELEKLEELKKKLAELEKKLDELEEELAELLKELEELGFESVEELEERLKELEP 599

Query: 74  SLKE 77
              E
Sbjct: 600 FYNE 603


>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain.  This is a family of proteins of
          approximately 300 residues, found in plants and
          vertebrates. They contain a highly conserved DDRGK
          motif.
          Length = 189

 Score = 43.5 bits (103), Expect = 4e-05
 Identities = 24/78 (30%), Positives = 54/78 (69%), Gaps = 4/78 (5%)

Query: 7  GAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKK----KKK 62
          GA ++ K  EK+ + +++E  E+E +E++ +EE++  ++K++++ +++++KKK    +K+
Sbjct: 4  GAKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKE 63

Query: 63 KEEQSVKEIEESLKERKE 80
          +EEQ+ KE EE  K +  
Sbjct: 64 REEQARKEQEEYEKLKSS 81



 Score = 41.6 bits (98), Expect = 2e-04
 Identities = 20/62 (32%), Positives = 45/62 (72%)

Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERK 79
          K+  K E +Q  +++ + EEE+ +++KK ++K++ ++K++++ +EE+  K+ EE  KER+
Sbjct: 6  KKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKERE 65

Query: 80 EK 81
          E+
Sbjct: 66 EQ 67



 Score = 39.3 bits (92), Expect = 0.001
 Identities = 23/77 (29%), Positives = 46/77 (59%), Gaps = 2/77 (2%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E+KK  EKR  E ++EE  +EE+EK+  EEE+ +++++ +K++++ +K K     E+   
Sbjct: 30  ERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQEEYEKLKSSFVVEEE-- 87

Query: 70  EIEESLKERKEKTLTED 86
             ++   + +   L ED
Sbjct: 88  GTDKLSADEESNELLED 104



 Score = 37.0 bits (86), Expect = 0.007
 Identities = 16/78 (20%), Positives = 51/78 (65%)

Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
          +++R  +  + EE+++ E++ + +   EEE  ++++KKK+++++K+++++ +KE++  ++
Sbjct: 18 RQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQEEYEK 77

Query: 71 IEESLKERKEKTLTEDSQ 88
          ++ S    +E T    + 
Sbjct: 78 LKSSFVVEEEGTDKLSAD 95



 Score = 30.4 bits (69), Expect = 1.1
 Identities = 19/76 (25%), Positives = 42/76 (55%), Gaps = 2/76 (2%)

Query: 3   KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
           +++     + +R E+   EEE+E+ ++EE+ KE   EE+ +K++++ +K K     +++ 
Sbjct: 31  RKKLEEKREGERKEEEELEEEREKKKEEEERKE--REEQARKEQEEYEKLKSSFVVEEEG 88

Query: 63  KEEQSVKEIEESLKER 78
            ++ S  E    L E 
Sbjct: 89  TDKLSADEESNELLED 104


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
           found at the N terminus of SMC proteins. The SMC
           (structural maintenance of chromosomes) superfamily
           proteins have ATP-binding domains at the N- and
           C-termini, and two extended coiled-coil domains
           separated by a hinge in the middle. The eukaryotic SMC
           proteins form two kind of heterodimers: the SMC1/SMC3
           and the SMC2/SMC4 types. These heterodimers constitute
           an essential part of higher order complexes, which are
           involved in chromatin and DNA dynamics. This family also
           includes the RecF and RecN proteins that are involved in
           DNA metabolism and recombination.
          Length = 1162

 Score = 45.3 bits (107), Expect = 4e-05
 Identities = 27/72 (37%), Positives = 47/72 (65%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E   ++ K NKEEEKE+  QEE+ K   +EE+  K +  K +++K   ++K K+ E+ +K
Sbjct: 266 EILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKELK 325

Query: 70  EIEESLKERKEK 81
           ++E+ LK+ KE+
Sbjct: 326 KLEKELKKEKEE 337



 Score = 43.8 bits (103), Expect = 1e-04
 Identities = 23/72 (31%), Positives = 44/72 (61%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E++K  + + +E +    E+EE + E ++ E+ K   ++K K+ +K+ KK +K+ ++  +
Sbjct: 277 EEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKE 336

Query: 70  EIEESLKERKEK 81
           EIEE  KE KE 
Sbjct: 337 EIEELEKELKEL 348



 Score = 43.0 bits (101), Expect = 2e-04
 Identities = 24/80 (30%), Positives = 50/80 (62%), Gaps = 1/80 (1%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKK-KKKKKKKKKKKKKEEQSV 68
           EK+K++++   +   +E E+ + E   +E  K   ++K K+ +K+ KK +K+ KKE++ +
Sbjct: 279 EKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEI 338

Query: 69  KEIEESLKERKEKTLTEDSQ 88
           +E+E+ LKE + K   E+ +
Sbjct: 339 EELEKELKELEIKREAEEEE 358



 Score = 41.9 bits (98), Expect = 5e-04
 Identities = 23/92 (25%), Positives = 45/92 (48%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
            ++     +++K+ E +  EE +E  E ++ +  + +EE  K+  K  K K + KK +  
Sbjct: 415 SEQEEDLLKEEKKEELKIVEELEESLETKQGKLTEEKEELEKQALKLLKDKLELKKSEDL 474

Query: 62  KKEEQSVKEIEESLKERKEKTLTEDSQCDTMI 93
            KE + VK +E+       + L E SQ ++  
Sbjct: 475 LKETKLVKLLEQLELLLLRQKLEEASQKESKA 506



 Score = 41.9 bits (98), Expect = 5e-04
 Identities = 19/74 (25%), Positives = 42/74 (56%)

Query: 15  IEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES 74
            E+    +  +EN++EEKEK+  EEE     K++++ K +  K +++K +++   +  E 
Sbjct: 263 KEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEK 322

Query: 75  LKERKEKTLTEDSQ 88
             ++ EK L ++ +
Sbjct: 323 ELKKLEKELKKEKE 336



 Score = 41.9 bits (98), Expect = 5e-04
 Identities = 20/71 (28%), Positives = 41/71 (57%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           ++++ IE   +E EKEE    +  KE+ EEEK KK ++++ K   K++++ K +  +  +
Sbjct: 249 DEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLER 308

Query: 70  EIEESLKERKE 80
              +  ++ KE
Sbjct: 309 RKVDDEEKLKE 319



 Score = 41.5 bits (97), Expect = 6e-04
 Identities = 21/76 (27%), Positives = 38/76 (50%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
             K   E+ +  +E   +EQEE E    E EK ++   +  K+ K+++K+KK +EE+   
Sbjct: 232 YLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKL 291

Query: 70  EIEESLKERKEKTLTE 85
             +E  + + E    E
Sbjct: 292 LAKEEEELKSELLKLE 307



 Score = 41.5 bits (97), Expect = 7e-04
 Identities = 17/83 (20%), Positives = 38/83 (45%), Gaps = 1/83 (1%)

Query: 6   GGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
            G+ EK+K+ ++R K+  +E     E   +  E +  + K K++ KK  +  + K+K E 
Sbjct: 164 AGSREKRKK-KERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLEL 222

Query: 66  QSVKEIEESLKERKEKTLTEDSQ 88
           +    +     +  E+ +    +
Sbjct: 223 EEENLLYLDYLKLNEERIDLLQE 245



 Score = 41.1 bits (96), Expect = 0.001
 Identities = 22/81 (27%), Positives = 45/81 (55%), Gaps = 2/81 (2%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEE--EKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
           E+K R++K  +EEEK E   +EKE  + EE  EK K +++K++K K ++++ +  +EE  
Sbjct: 753 EEKSRLKKEEEEEEKSELSLKEKELAEEEEKTEKLKVEEEKEEKLKAQEEELRALEEELK 812

Query: 68  VKEIEESLKERKEKTLTEDSQ 88
            +      ++   +   +  +
Sbjct: 813 EEAELLEEEQLLIEQEEKIKE 833



 Score = 39.6 bits (92), Expect = 0.002
 Identities = 23/81 (28%), Positives = 48/81 (59%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E++++   + +EEE+E++E   KEKE  EEE+  +K K +++K++K K ++++      +
Sbjct: 751 EEEEKSRLKKEEEEEEKSELSLKEKELAEEEEKTEKLKVEEEKEEKLKAQEEELRALEEE 810

Query: 70  EIEESLKERKEKTLTEDSQCD 90
             EE+    +E+ L E  +  
Sbjct: 811 LKEEAELLEEEQLLIEQEEKI 831



 Score = 39.6 bits (92), Expect = 0.003
 Identities = 19/77 (24%), Positives = 45/77 (58%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
           K+  E +++  + E  + +++EK    E++ KK +K+ KK+K++ ++ +K+ +E  +K  
Sbjct: 294 KEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKELEIKRE 353

Query: 72  EESLKERKEKTLTEDSQ 88
            E  +E + + L E  +
Sbjct: 354 AEEEEEEQLEKLQEKLE 370



 Score = 39.6 bits (92), Expect = 0.003
 Identities = 22/83 (26%), Positives = 42/83 (50%), Gaps = 4/83 (4%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKK----KKKKKKKKKKKKKKKKEE 65
           EKK + E+     ++EE  + E  K +  +  +++K K    + KK +K+ KK+K++ EE
Sbjct: 281 EKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEE 340

Query: 66  QSVKEIEESLKERKEKTLTEDSQ 88
              +  E  +K   E+   E  +
Sbjct: 341 LEKELKELEIKREAEEEEEEQLE 363



 Score = 38.4 bits (89), Expect = 0.005
 Identities = 19/79 (24%), Positives = 41/79 (51%), Gaps = 2/79 (2%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           EK+   E+   E+ K E E+EEK K   EE +  +++ K++ +  ++++   ++EE+   
Sbjct: 774 EKELAEEEEKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEK--I 831

Query: 70  EIEESLKERKEKTLTEDSQ 88
           + EE  +   E    +  +
Sbjct: 832 KEEELEELALELKEEQKLE 850



 Score = 38.0 bits (88), Expect = 0.007
 Identities = 21/73 (28%), Positives = 44/73 (60%)

Query: 9   GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
            E++ ++  + +EE K E  + E+ K D EE+  + +K+ KK +K+ KK+K++ +E +  
Sbjct: 285 QEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKE 344

Query: 69  KEIEESLKERKEK 81
            +  E  +E +E+
Sbjct: 345 LKELEIKREAEEE 357



 Score = 36.9 bits (85), Expect = 0.016
 Identities = 19/80 (23%), Positives = 45/80 (56%), Gaps = 2/80 (2%)

Query: 11  KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK- 69
            ++R     ++ ++ E E ++ EKE  +E++  ++ +K+ K+ + K++ ++++EEQ  K 
Sbjct: 306 LERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKL 365

Query: 70  -EIEESLKERKEKTLTEDSQ 88
            E  E L+E        +S+
Sbjct: 366 QEKLEQLEEELLAKKKLESE 385



 Score = 35.3 bits (81), Expect = 0.048
 Identities = 16/56 (28%), Positives = 29/56 (51%), Gaps = 1/56 (1%)

Query: 10   EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
            E+ KR+    +E           E E+ EE  NK + KK++ +++KK+  ++  EE
Sbjct: 961  ERNKRLLLAKEELGNVNLMAIA-EFEEKEERYNKDELKKERLEEEKKELLREIIEE 1015



 Score = 35.3 bits (81), Expect = 0.049
 Identities = 19/77 (24%), Positives = 45/77 (58%), Gaps = 2/77 (2%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           + +    +R K +++E+ ++ EKE +  + EK  KK+K++ ++ +K+ K+ + K E   +
Sbjct: 300 KSELLKLERRKVDDEEKLKESEKELK--KLEKELKKEKEEIEELEKELKELEIKREAEEE 357

Query: 70  EIEESLKERKEKTLTED 86
           E E+  K +++    E+
Sbjct: 358 EEEQLEKLQEKLEQLEE 374



 Score = 34.6 bits (79), Expect = 0.096
 Identities = 23/83 (27%), Positives = 43/83 (51%), Gaps = 7/83 (8%)

Query: 10  EKKKRIEKRNKEEEKEENE-------QEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
           EK+ + EK   EE ++E +        EE+E+E +E+ + K ++ +++   KKK + ++ 
Sbjct: 328 EKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLESERL 387

Query: 63  KEEQSVKEIEESLKERKEKTLTE 85
                +KE E  LK  +EK    
Sbjct: 388 SSAAKLKEEELELKNEEEKEAKL 410



 Score = 34.2 bits (78), Expect = 0.12
 Identities = 22/82 (26%), Positives = 40/82 (48%), Gaps = 1/82 (1%)

Query: 8   AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK-KKKEEQ 66
             E K+   KR  EEE+EE  ++ +EK +  EE+   KKK + ++     K K ++ E +
Sbjct: 342 EKELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELELK 401

Query: 67  SVKEIEESLKERKEKTLTEDSQ 88
           + +E E  L     +   +  +
Sbjct: 402 NEEEKEAKLLLELSEQEEDLLK 423



 Score = 34.2 bits (78), Expect = 0.14
 Identities = 22/83 (26%), Positives = 43/83 (51%), Gaps = 4/83 (4%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKED----VEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
           +     E++ KEEE EE   E KE++      EEE  + +++  K++  ++   K+++ E
Sbjct: 822 QLLIEQEEKIKEEELEELALELKEEQKLEKLAEEELERLEEEITKEELLQELLLKEEELE 881

Query: 66  QSVKEIEESLKERKEKTLTEDSQ 88
           +   + E   KE KEK   ++ +
Sbjct: 882 EQKLKDELESKEEKEKEEKKELE 904



 Score = 33.8 bits (77), Expect = 0.14
 Identities = 15/76 (19%), Positives = 34/76 (44%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E +   E     EE +  E + KE+     E  + K+K + +++        K  E+ + 
Sbjct: 182 ETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEENLLYLDYLKLNEERID 241

Query: 70  EIEESLKERKEKTLTE 85
            ++E L++ +E+  + 
Sbjct: 242 LLQELLRDEQEEIESS 257



 Score = 33.4 bits (76), Expect = 0.19
 Identities = 19/79 (24%), Positives = 43/79 (54%), Gaps = 6/79 (7%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           EK+ +  ++  ++EKEE E+ EKE       K  + K++ +++++++ +K ++K EQ  +
Sbjct: 321 EKELKKLEKELKKEKEEIEELEKEL------KELEIKREAEEEEEEQLEKLQEKLEQLEE 374

Query: 70  EIEESLKERKEKTLTEDSQ 88
           E+    K   E+  +    
Sbjct: 375 ELLAKKKLESERLSSAAKL 393



 Score = 33.4 bits (76), Expect = 0.23
 Identities = 20/89 (22%), Positives = 43/89 (48%), Gaps = 13/89 (14%)

Query: 13  KRIEKRNKEEEKEENE-QEEKEKEDVEEE------------KNKKKKKKKKKKKKKKKKK 59
           K  E + KE+ K+  E  + KEK ++EEE            +     ++  + ++++ + 
Sbjct: 197 KLQELKLKEQAKKALEYYQLKEKLELEEENLLYLDYLKLNEERIDLLQELLRDEQEEIES 256

Query: 60  KKKKEEQSVKEIEESLKERKEKTLTEDSQ 88
            K++ E+  + + + LKE KE+   +  Q
Sbjct: 257 SKQELEKEEEILAQVLKENKEEEKEKKLQ 285



 Score = 31.5 bits (71), Expect = 0.78
 Identities = 11/64 (17%), Positives = 29/64 (45%)

Query: 10   EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
                 +     EE++E   ++E +KE +EEEK +  ++  ++  ++ K+  +     +  
Sbjct: 974  GNVNLMAIAEFEEKEERYNKDELKKERLEEEKKELLREIIEETCQRFKEFLELFVSINRG 1033

Query: 70   EIEE 73
              + 
Sbjct: 1034 LNKV 1037



 Score = 31.1 bits (70), Expect = 1.1
 Identities = 10/79 (12%), Positives = 36/79 (45%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E    +    +E + +E + +E+ K+ +E  + K+K + +++        K  +E   + 
Sbjct: 184 ENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEENLLYLDYLKLNEERIDLL 243

Query: 70  EIEESLKERKEKTLTEDSQ 88
           +     ++ + ++  ++ +
Sbjct: 244 QELLRDEQEEIESSKQELE 262


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
           family consists of several hypothetical bacterial
           proteins of around 200 residues in length. The function
           of this family is unknown.
          Length = 214

 Score = 43.6 bits (103), Expect = 5e-05
 Identities = 12/59 (20%), Positives = 35/59 (59%)

Query: 9   GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
            +++    +  KEEEKE    E+KE +   E+++++ +++ +++ ++   + +K+ E+ 
Sbjct: 55  DDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEK 113



 Score = 42.8 bits (101), Expect = 9e-05
 Identities = 20/80 (25%), Positives = 43/80 (53%)

Query: 9   GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
                     +++E K+ ++QE  E E+V+EE+ +    + K+ K   +K+ ++ EE++ 
Sbjct: 37  PSSPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENE 96

Query: 69  KEIEESLKERKEKTLTEDSQ 88
           +E EES  E +++T  +   
Sbjct: 97  EEDEESSDENEKETEEKTES 116


>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
           complex component).  RNases H are enzymes that
           specifically hydrolyse RNA when annealed to a
           complementary DNA and are present in all living
           organisms. In yeast RNase H2 is composed of a complex of
           three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
           family represents the homologues of Ydr279p. It is not
           known whether non yeast proteins in this family fulfil
           the same function.
          Length = 287

 Score = 44.2 bits (105), Expect = 5e-05
 Identities = 18/55 (32%), Positives = 30/55 (54%), Gaps = 1/55 (1%)

Query: 10  EKKKRIEKRNKEEEKE-ENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
           E K   +   + ++K+ E E++ +  E   E+K K K++ KKKK K+ K  K  K
Sbjct: 219 EFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKGVKALK 273



 Score = 41.9 bits (99), Expect = 3e-04
 Identities = 18/49 (36%), Positives = 24/49 (48%)

Query: 15  IEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
            +KR  EE+ E  E   ++K   +EE  KKK K+ K  K  KK   K  
Sbjct: 232 KKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKGVKALKKVVAKGM 280



 Score = 38.5 bits (90), Expect = 0.004
 Identities = 18/54 (33%), Positives = 35/54 (64%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
            K   +  KE  +++ + +EDVE  +++ +KK+K K++ KKKK K+ K  +++K
Sbjct: 220 FKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKGVKALK 273



 Score = 35.8 bits (83), Expect = 0.025
 Identities = 19/59 (32%), Positives = 34/59 (57%)

Query: 11  KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
            K  +    K  +K   E ++K++E  E+ +  + + +KK+K K++ KKKK KE + VK
Sbjct: 212 LKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKGVK 270



 Score = 33.8 bits (78), Expect = 0.091
 Identities = 14/55 (25%), Positives = 29/55 (52%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
              +        +E ++ ++E +E  +  E + +KK+K K++ KKKK K+ K  +
Sbjct: 216 LIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKGVK 270



 Score = 28.8 bits (65), Expect = 4.0
 Identities = 16/54 (29%), Positives = 23/54 (42%), Gaps = 6/54 (11%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKK 55
           R+        + R EK+ K +E      E K+K+  E +  K  KK   K  KK
Sbjct: 235 RETEEDVEAAESRAEKKRKSKE------EIKKKKPKESKGVKALKKVVAKGMKK 282


>gnl|CDD|187751 cd09270, RNase_H2-B, Ribonuclease H2-B is a subunit of the
           eukaryotic RNase H complex which cleaves RNA-DNA
           hybrids.  Ribonuclease H2B is one of the three proteins
           of eukaryotic RNase H2 complex that is required for
           nucleic acid binding and hydrolysis. RNase H is
           classified into two families, type I (prokaryotic RNase
           HI, eukaryotic RNase H1 and viral RNase H) and type II
           (prokaryotic RNase HII and HIII, and eukaryotic RNase
           H2/HII). RNase H endonucleolytically hydrolyzes an RNA
           strand when it is annealed to a complementary DNA strand
           in the presence of divalent cations, in DNA replication
           and repair. The enzyme can be found in bacteria,
           archaea, and eukaryotes. Most prokaryotic and eukaryotic
           genomes contain multiple RNase H genes. Despite a lack
           of evidence for homology from sequence comparisons, type
           I and type II RNase H share a common fold and similar
           steric configurations of the four acidic active-site
           residues, suggesting identical or very similar catalytic
           mechanisms. Eukaryotic RNase HII is active during
           replication and is believed to play a role in removal of
           Okazaki fragment primers and single ribonucleotides in
           DNA-DNA duplexes. Eukaryotic RNase HII is functional
           when it forms a complex with RNase H2B and RNase H2C
           proteins. It is speculated that the two accessory
           subunits are required for correct folding of the
           catalytic subunit of RNase HII. Mutations in the three
           subunits of human RNase HII cause neurological disorder.
          Length = 211

 Score = 43.1 bits (102), Expect = 6e-05
 Identities = 21/53 (39%), Positives = 33/53 (62%)

Query: 11  KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
           K +R++K+  + ++ E  +++K+  + EEEK KKKKK  KKKK KK      K
Sbjct: 152 KVERLKKKELDIKEAEAARDKKKSNNAEEEKKKKKKKSAKKKKLKKVAAVGMK 204



 Score = 31.5 bits (72), Expect = 0.50
 Identities = 13/48 (27%), Positives = 26/48 (54%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK 57
           +K+  I++     +K+++   E+EK+  +++  KKKK KK      K 
Sbjct: 158 KKELDIKEAEAARDKKKSNNAEEEKKKKKKKSAKKKKLKKVAAVGMKA 205



 Score = 30.4 bits (69), Expect = 1.0
 Identities = 15/40 (37%), Positives = 26/40 (65%)

Query: 30  EEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           ++KE +  E E  + KKK    +++KKKKKKK  +++ +K
Sbjct: 157 KKKELDIKEAEAARDKKKSNNAEEEKKKKKKKSAKKKKLK 196



 Score = 30.0 bits (68), Expect = 1.6
 Identities = 16/48 (33%), Positives = 28/48 (58%), Gaps = 2/48 (4%)

Query: 28  EQEEKEKEDVEE--EKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
           E+ +K++ D++E      KKK    +++KKKKKKK  K+++  K    
Sbjct: 154 ERLKKKELDIKEAEAARDKKKSNNAEEEKKKKKKKSAKKKKLKKVAAV 201


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 43.6 bits (103), Expect = 1e-04
 Identities = 13/51 (25%), Positives = 35/51 (68%)

Query: 13  KRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
           K++ +   E +  E  + + ++E +++E+N+KK+K++  + K+K++K ++K
Sbjct: 383 KKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERK 433



 Score = 42.9 bits (101), Expect = 2e-04
 Identities = 13/49 (26%), Positives = 33/49 (67%)

Query: 18  RNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
           +   E K+E +  E+ +   +EEK K+++ +KK+K++  + K+K+++++
Sbjct: 383 KKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDE 431



 Score = 42.5 bits (100), Expect = 2e-04
 Identities = 15/51 (29%), Positives = 32/51 (62%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
           +  +  ++ +  EE E   +EEK K++  E+K K++  + K+K++K ++KK
Sbjct: 384 KLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434



 Score = 42.5 bits (100), Expect = 2e-04
 Identities = 15/52 (28%), Positives = 30/52 (57%)

Query: 11  KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
           KK    K   +  +E   + ++EK   EE + K+K++  + K+K++K ++KK
Sbjct: 383 KKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434



 Score = 42.1 bits (99), Expect = 3e-04
 Identities = 17/50 (34%), Positives = 29/50 (58%), Gaps = 2/50 (4%)

Query: 16  EKRNKEEEKEENEQEEKEKED--VEEEKNKKKKKKKKKKKKKKKKKKKKK 63
               K+E     E E K KE+   +EE  KK+K++  + K+K++K ++KK
Sbjct: 385 LGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434



 Score = 39.8 bits (93), Expect = 0.002
 Identities = 13/52 (25%), Positives = 31/52 (59%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
           +K   ++      E+ E + +E++ +  E EK +K++  + K+K++K ++KK
Sbjct: 383 KKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434



 Score = 37.5 bits (87), Expect = 0.008
 Identities = 16/62 (25%), Positives = 36/62 (58%), Gaps = 1/62 (1%)

Query: 10  EKKKRIEKRNKEEEKEENEQ-EEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
           E  K   K+ ++  K+  E  +E +  +  E K K++K K+++ +KK+K++  + +E+  
Sbjct: 369 ELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQ 428

Query: 69  KE 70
           K+
Sbjct: 429 KD 430



 Score = 36.3 bits (84), Expect = 0.019
 Identities = 17/55 (30%), Positives = 31/55 (56%), Gaps = 2/55 (3%)

Query: 1   MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKK 55
            +K      E     E   K +E E+ +QEE EK+  +E+ ++ K+K++K ++KK
Sbjct: 382 TKKLGEVKDETDASEEAEAKAKE-EKLKQEENEKKQ-KEQADEDKEKRQKDERKK 434



 Score = 35.5 bits (82), Expect = 0.034
 Identities = 14/63 (22%), Positives = 33/63 (52%)

Query: 19  NKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKER 78
             + ++   E  +K ++  ++    K +    ++ + K K++K K+E++ K+ +E   E 
Sbjct: 364 GDKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADED 423

Query: 79  KEK 81
           KEK
Sbjct: 424 KEK 426



 Score = 32.1 bits (73), Expect = 0.53
 Identities = 17/71 (23%), Positives = 35/71 (49%), Gaps = 2/71 (2%)

Query: 11  KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
           K     K N +   ++ ++  KE     ++  KK  + K +    ++ + K KEE+   +
Sbjct: 351 KLYEEVKSNTDLSGDKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKL--K 408

Query: 71  IEESLKERKEK 81
            EE+ K++KE+
Sbjct: 409 QEENEKKQKEQ 419



 Score = 31.3 bits (71), Expect = 0.80
 Identities = 17/71 (23%), Positives = 34/71 (47%)

Query: 20  KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERK 79
            EE K   +    +++++ +E NKK +   KK  + K +    +E ++  + E+  +E  
Sbjct: 353 YEEVKSNTDLSGDKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEEN 412

Query: 80  EKTLTEDSQCD 90
           EK   E +  D
Sbjct: 413 EKKQKEQADED 423


>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
          (DUF2058).  This domain, found in various prokaryotic
          proteins, has no known function.
          Length = 177

 Score = 42.2 bits (100), Expect = 1e-04
 Identities = 15/47 (31%), Positives = 30/47 (63%)

Query: 42 NKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQ 88
          +KKK KK KK+K+K++K+ +K  +    E++++ +E K +    D +
Sbjct: 13 DKKKAKKAKKEKRKQRKQARKGADDGDDELKQAAEEAKAEKAERDRE 59



 Score = 39.1 bits (92), Expect = 0.001
 Identities = 14/45 (31%), Positives = 30/45 (66%)

Query: 37 VEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
          V+++K KK KK+K+K++K+ +K     +++  +  EE+  E+ E+
Sbjct: 12 VDKKKAKKAKKEKRKQRKQARKGADDGDDELKQAAEEAKAEKAER 56



 Score = 34.1 bits (79), Expect = 0.054
 Identities = 14/65 (21%), Positives = 34/65 (52%)

Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
          KK+ +K  KE+ K+  +  +   +  +E K   ++ K +K ++ ++  ++++ E   K I
Sbjct: 14 KKKAKKAKKEKRKQRKQARKGADDGDDELKQAAEEAKAEKAERDRELNRQRQAEAEQKAI 73

Query: 72 EESLK 76
             +K
Sbjct: 74 VAQIK 78



 Score = 32.2 bits (74), Expect = 0.21
 Identities = 14/70 (20%), Positives = 41/70 (58%), Gaps = 2/70 (2%)

Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
          +K+KR  K+ K+  K  ++ +++ K+  EE K +K ++ ++  ++++ + ++K     +K
Sbjct: 21 KKEKR--KQRKQARKGADDGDDELKQAAEEAKAEKAERDRELNRQRQAEAEQKAIVAQIK 78

Query: 70 EIEESLKERK 79
          ++ E  +  +
Sbjct: 79 QLIEQNRIDR 88


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 43.9 bits (104), Expect = 1e-04
 Identities = 18/59 (30%), Positives = 34/59 (57%)

Query: 12   KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
            ++ +E++ + EEKE  +++  + +   +    +K K KKK+KKKKK    K ++ SV  
Sbjct: 1138 EEALEEQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVG 1196



 Score = 38.5 bits (90), Expect = 0.005
 Identities = 23/92 (25%), Positives = 43/92 (46%), Gaps = 11/92 (11%)

Query: 10   EKKKRIEK-RNKEEE----------KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKK 58
            +K+K +EK +N   +          +E  E++E+ +E    ++ + K K K K  K +K 
Sbjct: 1113 KKEKELEKLKNTTPKDMWLEDLDKFEEALEEQEEVEEKEIAKEQRLKSKTKGKASKLRKP 1172

Query: 59   KKKKKEEQSVKEIEESLKERKEKTLTEDSQCD 90
            K KKKE++  K   +  K+      ++    D
Sbjct: 1173 KLKKKEKKKKKSSADKSKKASVVGNSKRVDSD 1204



 Score = 36.2 bits (84), Expect = 0.028
 Identities = 21/82 (25%), Positives = 38/82 (46%), Gaps = 21/82 (25%)

Query: 10   EKKKRIEKRNKEEEKEENE---------------------QEEKEKEDVEEEKNKKKKKK 48
            EK +++    +++EKE  +                     +E++E E+ E  K ++ K K
Sbjct: 1102 EKVEKLNAELEKKEKELEKLKNTTPKDMWLEDLDKFEEALEEQEEVEEKEIAKEQRLKSK 1161

Query: 49   KKKKKKKKKKKKKKKEEQSVKE 70
             K K  K +K K KK+E+  K+
Sbjct: 1162 TKGKASKLRKPKLKKKEKKKKK 1183



 Score = 34.3 bits (79), Expect = 0.13
 Identities = 23/79 (29%), Positives = 37/79 (46%)

Query: 12   KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
             KR +  +    K  +  ++K K+ +E      KKKKK +KK  +KKK K + +Q+    
Sbjct: 1289 SKRPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQ 1348

Query: 72   EESLKERKEKTLTEDSQCD 90
               L  R  K  ++ S  D
Sbjct: 1349 SSRLLRRPRKKKSDSSSED 1367



 Score = 32.7 bits (75), Expect = 0.37
 Identities = 18/85 (21%), Positives = 30/85 (35%), Gaps = 1/85 (1%)

Query: 3    KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
             + G      KR+              + +     +     KKK KK+ +      KKKK
Sbjct: 1266 SKEGKPKNAPKRVSAVQYSPPPPSKRPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKK 1325

Query: 63   KEEQSVKEIEESLKERKEKTLTEDS 87
            K E+     ++S K R ++     S
Sbjct: 1326 KSEKKTARKKKS-KTRVKQASASQS 1349



 Score = 30.0 bits (68), Expect = 2.4
 Identities = 19/89 (21%), Positives = 37/89 (41%), Gaps = 7/89 (7%)

Query: 9    GEKKKRIEKRNK---EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKK----KKKKKK 61
            G K     K+      E      +++K+ E     K K K + K+    +     ++ +K
Sbjct: 1299 GSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQSSRLLRRPRK 1358

Query: 62   KKEEQSVKEIEESLKERKEKTLTEDSQCD 90
            KK + S ++ ++S  +  E    ED + D
Sbjct: 1359 KKSDSSSEDDDDSEVDDSEDEDDEDDEDD 1387



 Score = 29.2 bits (66), Expect = 4.5
 Identities = 12/80 (15%), Positives = 32/80 (40%)

Query: 8    AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
               KK+        ++K+++E++   K+  +    +    +  +  ++ +KKK     + 
Sbjct: 1308 KKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQSSRLLRRPRKKKSDSSSED 1367

Query: 68   VKEIEESLKERKEKTLTEDS 87
              + E    E ++    ED 
Sbjct: 1368 DDDSEVDDSEDEDDEDDEDD 1387



 Score = 28.9 bits (65), Expect = 5.4
 Identities = 17/78 (21%), Positives = 31/78 (39%), Gaps = 5/78 (6%)

Query: 13   KRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKK---KKKKKKKKKKKKKEEQSVK 69
                    +      ++ +K  E       KKKK +KK   KKK K + K+    + S  
Sbjct: 1293 DGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQSS-- 1350

Query: 70   EIEESLKERKEKTLTEDS 87
             +    +++K  + +ED 
Sbjct: 1351 RLLRRPRKKKSDSSSEDD 1368


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 43.4 bits (103), Expect = 1e-04
 Identities = 15/68 (22%), Positives = 30/68 (44%)

Query: 19  NKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKER 78
                + E E+ E       E K  +KK++K +KK +K++ +K   ++  +   +  K  
Sbjct: 390 KPLLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGP 449

Query: 79  KEKTLTED 86
             +T   D
Sbjct: 450 DGETKKVD 457



 Score = 42.6 bits (101), Expect = 2e-04
 Identities = 17/50 (34%), Positives = 26/50 (52%), Gaps = 3/50 (6%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKK 59
           E+KK  +K+ K E+K E E+ EK      ++K +   KK K    + KK 
Sbjct: 410 ERKKLRKKQRKAEKKAEKEEAEKAA---AKKKAEAAAKKAKGPDGETKKV 456



 Score = 34.9 bits (81), Expect = 0.061
 Identities = 11/44 (25%), Positives = 22/44 (50%)

Query: 45  KKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQ 88
             ++KK +KK++K +KK ++E++ K   +   E   K       
Sbjct: 408 PAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDG 451



 Score = 31.8 bits (73), Expect = 0.57
 Identities = 15/56 (26%), Positives = 30/56 (53%), Gaps = 5/56 (8%)

Query: 27  NEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKT 82
           +++    + + EE +N      ++KK +KK++K +KK E+     EE+ K   +K 
Sbjct: 388 HDKPLLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEK-----EEAEKAAAKKK 438



 Score = 29.5 bits (67), Expect = 3.0
 Identities = 10/59 (16%), Positives = 24/59 (40%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
               E+   E       + +K ++   + + K +K++ +K   KKK +   K+ +    
Sbjct: 393 LAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDG 451



 Score = 29.1 bits (66), Expect = 4.0
 Identities = 11/44 (25%), Positives = 22/44 (50%), Gaps = 1/44 (2%)

Query: 1   MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKK 44
           +RK++  A +K ++ E   K   K++ E   K+ +  + E  K 
Sbjct: 414 LRKKQRKAEKKAEKEEAE-KAAAKKKAEAAAKKAKGPDGETKKV 456


>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2.  It is vital for
           effective cell-replication that replication is not
           stalled at any point by, for instance, damaged bases.
           Rtf2 stabilizes the replication fork stalled at the
           site-specific replication barrier RTS1 by preventing
           replication restart until completion of DNA synthesis by
           a converging replication fork initiated at a flanking
           origin. The RTS1 element terminates replication forks
           that are moving in the cen2-distal direction while
           allowing forks moving in the cen2-proximal direction to
           pass through the region. Rtf2 contains a C2HC2 motif
           related to the C3HC4 RING-finger motif, and would appear
           to fold up, creating a RING finger-like structure but
           forming only one functional Zn2+ ion-binding site.
          Length = 254

 Score = 42.7 bits (101), Expect = 1e-04
 Identities = 23/73 (31%), Positives = 35/73 (47%), Gaps = 11/73 (15%)

Query: 27  NEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIE-----------ESL 75
            E+ E  K  +EEE+ KKKKKKKKKK KK        E      +            E+ 
Sbjct: 166 EEEVELLKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAK 225

Query: 76  KERKEKTLTEDSQ 88
           K +K++++  D++
Sbjct: 226 KLKKKRSIAPDNE 238



 Score = 38.5 bits (90), Expect = 0.003
 Identities = 19/62 (30%), Positives = 25/62 (40%), Gaps = 1/62 (1%)

Query: 24  KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTL 83
           +EE E  +   E+ E  K KKKKKKKK KK        +    S    E S    +    
Sbjct: 166 EEEVELLKARLEE-ERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEA 224

Query: 84  TE 85
            +
Sbjct: 225 KK 226



 Score = 30.0 bits (68), Expect = 1.7
 Identities = 14/60 (23%), Positives = 26/60 (43%), Gaps = 5/60 (8%)

Query: 34  KEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDTMI 93
           +E+VE       K + ++++ KKKKKKKKK+ +       S +      +  +       
Sbjct: 166 EEEVEL-----LKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQ 220



 Score = 28.5 bits (64), Expect = 4.7
 Identities = 14/66 (21%), Positives = 25/66 (37%), Gaps = 2/66 (3%)

Query: 1   MRKRRGGAGEKK--KRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKK 58
           + + R    +KK  K+ +K N      E         ++     +  + KK KKK+    
Sbjct: 176 LEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAKKLKKKRSIAP 235

Query: 59  KKKKKE 64
             +K E
Sbjct: 236 DNEKSE 241


>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355).  This
          family of proteins is found in bacteria and viruses.
          Proteins in this family are typically between 180 and
          214 amino acids in length.
          Length = 125

 Score = 40.7 bits (96), Expect = 1e-04
 Identities = 20/78 (25%), Positives = 40/78 (51%), Gaps = 1/78 (1%)

Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
          E+K   +K   +   +E  + EK++E+ + E  K  K   ++K + + +K +K+ E+   
Sbjct: 4  EEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEELEA 63

Query: 70 EIEES-LKERKEKTLTED 86
          E+    LK   +K L+E 
Sbjct: 64 ELARRELKAEAKKMLSEK 81



 Score = 38.4 bits (90), Expect = 0.001
 Identities = 19/61 (31%), Positives = 38/61 (62%)

Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
          E E+E+   +++  + + +EK K +KK+++KK + +K  K   EE++  E+E+  KE +E
Sbjct: 1  EPEEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEE 60

Query: 81 K 81
           
Sbjct: 61 L 61



 Score = 33.8 bits (78), Expect = 0.039
 Identities = 19/60 (31%), Positives = 34/60 (56%)

Query: 26 ENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTE 85
          E E+E+   +   ++   K+K K +KK+++KK + +K  + S +E  E   E+ EK L E
Sbjct: 1  EPEEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEE 60



 Score = 33.0 bits (76), Expect = 0.068
 Identities = 21/70 (30%), Positives = 36/70 (51%), Gaps = 5/70 (7%)

Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKK-----KKKKKEEQSVKEIEES 74
           EEEK   ++E  +    E+ K +KK+++KK + +K  K     K + + E+  KE+EE 
Sbjct: 2  PEEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEEL 61

Query: 75 LKERKEKTLT 84
            E   + L 
Sbjct: 62 EAELARRELK 71



 Score = 32.2 bits (74), Expect = 0.12
 Identities = 16/50 (32%), Positives = 30/50 (60%), Gaps = 1/50 (2%)

Query: 10 EKKKRIEKRNKEEEKE-ENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKK 58
           + +++ K + EE+ E E E+ EKE E++E E  +++ K + KK   +K 
Sbjct: 33 SEAEKLAKMSAEEKAEYELEKLEKELEELEAELARRELKAEAKKMLSEKG 82


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 43.3 bits (102), Expect = 1e-04
 Identities = 16/70 (22%), Positives = 39/70 (55%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
           K+  E+R K+E+++  E ++K+  + E  K  +K++   +++KK+ ++  K+     K+ 
Sbjct: 75  KRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQA 134

Query: 72  EESLKERKEK 81
           EE+  +    
Sbjct: 135 EEAAAKAAAA 144



 Score = 41.3 bits (97), Expect = 5e-04
 Identities = 12/60 (20%), Positives = 25/60 (41%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
           +K+ + E K++   E K+K   E +    K   + K   +K    K  E+ +  +    +
Sbjct: 200 KKKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAEAKAAAEKAAAAKAAEKAAAAKAAAEV 259



 Score = 40.9 bits (96), Expect = 8e-04
 Identities = 17/82 (20%), Positives = 34/82 (41%), Gaps = 10/82 (12%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKK---------- 59
           E+ K++EK     ++++ + EE  K+   ++K  ++   K     K K +          
Sbjct: 101 ERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAA 160

Query: 60  KKKKEEQSVKEIEESLKERKEK 81
           KK   E   K   E+ K+   +
Sbjct: 161 KKAAAEAKKKAEAEAAKKAAAE 182



 Score = 40.6 bits (95), Expect = 8e-04
 Identities = 19/76 (25%), Positives = 43/76 (56%), Gaps = 2/76 (2%)

Query: 8   AGEKKKRIEKRNKEE--EKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
           A E++K+ E++  EE  +K+  EQE  ++ + E    +++KK+ ++  K+   K+K+ EE
Sbjct: 77  AEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEE 136

Query: 66  QSVKEIEESLKERKEK 81
            + K    +  + + +
Sbjct: 137 AAAKAAAAAKAKAEAE 152



 Score = 37.5 bits (87), Expect = 0.009
 Identities = 22/83 (26%), Positives = 40/83 (48%), Gaps = 4/83 (4%)

Query: 3   KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKK----KKKKKKKK 58
           K R  A E+KK+ E+  K+   ++ + EE   +     K K + + K+     KK   + 
Sbjct: 108 KERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEA 167

Query: 59  KKKKKEEQSVKEIEESLKERKEK 81
           KKK + E + K   E+ K+ + +
Sbjct: 168 KKKAEAEAAKKAAAEAKKKAEAE 190



 Score = 36.7 bits (85), Expect = 0.014
 Identities = 21/85 (24%), Positives = 34/85 (40%), Gaps = 6/85 (7%)

Query: 3   KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKK------KKKKKKKKKKKK 56
            +R  A  KK   E + K E +   +   + K+  E E   K      KK + + KKK  
Sbjct: 153 AKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAA 212

Query: 57  KKKKKKKEEQSVKEIEESLKERKEK 81
            + KKK   ++     ++  E K  
Sbjct: 213 AEAKKKAAAEAKAAAAKAAAEAKAA 237



 Score = 36.7 bits (85), Expect = 0.016
 Identities = 19/83 (22%), Positives = 30/83 (36%), Gaps = 6/83 (7%)

Query: 8   AGEKKKRIE----KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
           A E KK+ E     +   E K++ E E K+K   E +K  K   + K    K   + K  
Sbjct: 180 AAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKK--KAAAEAKAAAAKAAAEAKAA 237

Query: 64  EEQSVKEIEESLKERKEKTLTED 86
            E++            +     D
Sbjct: 238 AEKAAAAKAAEKAAAAKAAAEVD 260



 Score = 35.6 bits (82), Expect = 0.036
 Identities = 15/72 (20%), Positives = 31/72 (43%), Gaps = 7/72 (9%)

Query: 16  EKRNKEEEK-----EENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK--KKKEEQSV 68
           +K+  E+E+     +E    +++K+  EE   +   K+K+ ++   K     K K E   
Sbjct: 94  QKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEA 153

Query: 69  KEIEESLKERKE 80
           K    + K+   
Sbjct: 154 KRAAAAAKKAAA 165



 Score = 34.4 bits (79), Expect = 0.083
 Identities = 17/79 (21%), Positives = 29/79 (36%), Gaps = 10/79 (12%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKED------VEEEKNKKKKKKKKKKKKKKKK----K 59
           +KK+  E   +   K++  +E   K         E E  +     KK   + KKK     
Sbjct: 116 QKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEA 175

Query: 60  KKKKEEQSVKEIEESLKER 78
            KK   ++ K+ E     +
Sbjct: 176 AKKAAAEAKKKAEAEAAAK 194



 Score = 33.6 bits (77), Expect = 0.14
 Identities = 15/76 (19%), Positives = 41/76 (53%), Gaps = 4/76 (5%)

Query: 13  KRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIE 72
           KR E++ K++E ++  +E ++K+  E+E+    K+ +K++   +++KK+ +E      ++
Sbjct: 75  KRAEEQRKKKE-QQQAEELQQKQAAEQER---LKQLEKERLAAQEQKKQAEEAAKQAALK 130

Query: 73  ESLKERKEKTLTEDSQ 88
           +   E         ++
Sbjct: 131 QKQAEEAAAKAAAAAK 146



 Score = 33.2 bits (76), Expect = 0.22
 Identities = 19/76 (25%), Positives = 45/76 (59%), Gaps = 3/76 (3%)

Query: 6   GGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
           G   E+  R +++ K  ++ E EQ +K+++   EE  +K+  ++++ K+ +K++   +E+
Sbjct: 58  GAVVEQYNRQQQQQKSAKRAE-EQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQ 116

Query: 66  QSVKEIEESLKERKEK 81
           +  K+ EE+ K+   K
Sbjct: 117 K--KQAEEAAKQAALK 130



 Score = 31.3 bits (71), Expect = 0.80
 Identities = 18/83 (21%), Positives = 33/83 (39%), Gaps = 4/83 (4%)

Query: 3   KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKK--- 59
             +  A ++K+  E   K     + + E + K      K    + KKK + +  KK    
Sbjct: 123 AAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAE 182

Query: 60  -KKKKEEQSVKEIEESLKERKEK 81
            KKK E ++  +     K++ E 
Sbjct: 183 AKKKAEAEAAAKAAAEAKKKAEA 205



 Score = 30.9 bits (70), Expect = 0.99
 Identities = 17/85 (20%), Positives = 35/85 (41%), Gaps = 1/85 (1%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
             +   A + K   E +      ++   E K+K + E  K    + KKK + +   K   
Sbjct: 138 AAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAA 197

Query: 62  KKEEQSVKEIEESLK-ERKEKTLTE 85
           + ++++  E ++    E K+K   E
Sbjct: 198 EAKKKAEAEAKKKAAAEAKKKAAAE 222



 Score = 27.8 bits (62), Expect = 9.0
 Identities = 15/65 (23%), Positives = 38/65 (58%), Gaps = 13/65 (20%)

Query: 37  VEEEKNKKKKKKKKKKKKKKKKKKKK------------KEEQSVKEIE-ESLKERKEKTL 83
           V E+ N++++++K  K+ ++++KKK+             E++ +K++E E L  +++K  
Sbjct: 60  VVEQYNRQQQQQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQ 119

Query: 84  TEDSQ 88
            E++ 
Sbjct: 120 AEEAA 124


>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR).  This
           family consists of several bovine specific leukaemia
           virus receptors which are thought to function as
           transmembrane proteins, although their exact function is
           unknown.
          Length = 561

 Score = 43.1 bits (101), Expect = 2e-04
 Identities = 22/56 (39%), Positives = 34/56 (60%)

Query: 20  KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
           K    E ++  EK      E+K+KK KKK+KK+K+K++ K KKKE +  K +  +L
Sbjct: 177 KHRNAETSKSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFKSLLLAL 232



 Score = 40.0 bits (93), Expect = 0.002
 Identities = 19/53 (35%), Positives = 32/53 (60%)

Query: 23  EKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
           +K  N +  K  E  +    +KK KK KKK+KK+K+K++ K+++   E  +SL
Sbjct: 176 QKHRNAETSKSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFKSL 228



 Score = 39.3 bits (91), Expect = 0.002
 Identities = 17/62 (27%), Positives = 32/62 (51%)

Query: 15  IEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES 74
           +     + E +E +  E E+     +  KKK++K+K++KKKKKK    +   S    E+ 
Sbjct: 254 VSGTAPDSEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHRCHHSDGGAEQP 313

Query: 75  LK 76
           ++
Sbjct: 314 VQ 315



 Score = 38.5 bits (89), Expect = 0.005
 Identities = 27/123 (21%), Positives = 45/123 (36%), Gaps = 12/123 (9%)

Query: 18  RNKEEEKEENEQEEKEKEDV----EEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
           RN E  K        EK DV    ++ K  KKK+KK+K+K++ K KKK+ E      +  
Sbjct: 179 RNAETSKS------PEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFKSLLLAL 232

Query: 74  SLKERKEKTLTEDSQCDTMIPVFKKG--VLYKGIYCPSLTNSFREPSLEMSYQRYSHRQR 131
                   ++ E  +      V                   + + P  +   QR    ++
Sbjct: 233 DDSPASAASVAEADEASLANTVSGTAPDSEPDEPKDAEAEETKKSPKHKKKKQRKEKEEK 292

Query: 132 QKS 134
           +K 
Sbjct: 293 KKK 295



 Score = 37.4 bits (86), Expect = 0.011
 Identities = 20/71 (28%), Positives = 34/71 (47%)

Query: 22  EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
              +    E K+ E  E +K+ K KKKK++K+K++KKKKKK         +   ++  + 
Sbjct: 257 TAPDSEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHRCHHSDGGAEQPVQN 316

Query: 82  TLTEDSQCDTM 92
              E+     M
Sbjct: 317 GAVEEEPLPPM 327



 Score = 36.6 bits (84), Expect = 0.019
 Identities = 23/103 (22%), Positives = 46/103 (44%), Gaps = 34/103 (33%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEK----------------------------------E 35
           +K K+ EK+ KE+E+++++++E E                                    
Sbjct: 200 KKPKKKEKKEKEKERDKDKKKEVEGFKSLLLALDDSPASAASVAEADEASLANTVSGTAP 259

Query: 36  DVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKER 78
           D E ++ K  + ++ KK  K KKKK++KE++  K+ ++    R
Sbjct: 260 DSEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHR 302



 Score = 34.7 bits (79), Expect = 0.082
 Identities = 9/41 (21%), Positives = 28/41 (68%)

Query: 28  EQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
           E+E + ++ +E++K +KKK++K+K+ +++      + ++ +
Sbjct: 85  EEERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTESDEDI 125



 Score = 33.9 bits (77), Expect = 0.15
 Identities = 10/41 (24%), Positives = 25/41 (60%)

Query: 35  EDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
           E+    + + +K K++KKK++K+K+ +++      E +E +
Sbjct: 85  EEERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTESDEDI 125



 Score = 30.8 bits (69), Expect = 1.2
 Identities = 17/71 (23%), Positives = 33/71 (46%), Gaps = 4/71 (5%)

Query: 9   GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
           G          K+ E EE ++  K K+    +K +K+K++KKKKKK    +    +  + 
Sbjct: 256 GTAPDSEPDEPKDAEAEETKKSPKHKK----KKQRKEKEEKKKKKKHHHHRCHHSDGGAE 311

Query: 69  KEIEESLKERK 79
           + ++    E +
Sbjct: 312 QPVQNGAVEEE 322



 Score = 30.4 bits (68), Expect = 1.7
 Identities = 10/50 (20%), Positives = 29/50 (58%)

Query: 41  KNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCD 90
           K +++++ +++ +K K++KKK+++E+  +    SL    ++ +      D
Sbjct: 83  KLEEERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTESDEDIAPAQMVD 132



 Score = 30.0 bits (67), Expect = 1.9
 Identities = 21/82 (25%), Positives = 37/82 (45%), Gaps = 1/82 (1%)

Query: 7   GAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
           G     +  E ++ E E+ +   + K+K+  +E++ KKKKKK    +        ++  Q
Sbjct: 256 GTAPDSEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHRCHHSDGGAEQPVQ 315

Query: 67  SVKEIEESL-KERKEKTLTEDS 87
           +    EE L      K L E+S
Sbjct: 316 NGAVEEEPLPPMSSYKLLAENS 337



 Score = 29.6 bits (66), Expect = 2.7
 Identities = 9/27 (33%), Positives = 22/27 (81%)

Query: 37  VEEEKNKKKKKKKKKKKKKKKKKKKKK 63
           V+ E+ ++ +++ +K K++KKK++K+K
Sbjct: 82  VKLEEERRHRQRLEKDKREKKKREKEK 108



 Score = 28.9 bits (64), Expect = 4.4
 Identities = 15/66 (22%), Positives = 33/66 (50%), Gaps = 4/66 (6%)

Query: 21  EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
           EEE+   ++ EK+K     EK K++K+K+ +++      +  ++    + ++   +E  E
Sbjct: 85  EEERRHRQRLEKDKR----EKKKREKEKRGRRRHHSLGTESDEDIAPAQMVDIVTEEMPE 140

Query: 81  KTLTED 86
             L  D
Sbjct: 141 NALPSD 146



 Score = 28.5 bits (63), Expect = 6.2
 Identities = 12/58 (20%), Positives = 31/58 (53%), Gaps = 4/58 (6%)

Query: 24  KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE----QSVKEIEESLKE 77
           K E E+  +++ + ++ + KK++K+K+ +++      +  E+    Q V  + E + E
Sbjct: 83  KLEEERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTESDEDIAPAQMVDIVTEEMPE 140


>gnl|CDD|200340 TIGR03927, T7SS_EssA_Firm, type VII secretion protein EssA.
           Members of this family are associated with type VII
           secretion of WXG100 family targets in the Firmicutes,
           but not in the Actinobacteria. This highly divergent
           protein family consists largely of a central region of
           highly polar low-complexity sequence containing
           occasional LF motifs in weak repeats about 17 residues
           in length, flanked by hydrophobic N- and C-terminal
           regions [Protein fate, Protein and peptide secretion and
           trafficking].
          Length = 150

 Score = 41.2 bits (97), Expect = 2e-04
 Identities = 20/86 (23%), Positives = 44/86 (51%), Gaps = 6/86 (6%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKK---KKKKEEQSV 68
           KK IE  N +  +EE E +++     E++K   +K K+K ++++ K +   +   E  +V
Sbjct: 33  KKDIEI-NTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEELKNQLFSENATENNTV 91

Query: 69  KEIEESL--KERKEKTLTEDSQCDTM 92
           K  ++ L   E ++ + + +S  +  
Sbjct: 92  KATKKQLFSSEYEQTSSSSESTSEEE 117



 Score = 37.4 bits (87), Expect = 0.003
 Identities = 18/78 (23%), Positives = 39/78 (50%), Gaps = 6/78 (7%)

Query: 17  KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ-----SVKEI 71
           K N+ E+K+     +  +E+ E +K     +++KK   +K K+K ++EE      S    
Sbjct: 27  KPNRYEKKDIEINTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEELKNQLFSENAT 86

Query: 72  E-ESLKERKEKTLTEDSQ 88
           E  ++K  K++  + + +
Sbjct: 87  ENNTVKATKKQLFSSEYE 104



 Score = 34.7 bits (80), Expect = 0.030
 Identities = 15/63 (23%), Positives = 27/63 (42%), Gaps = 2/63 (3%)

Query: 19 NKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKER 78
           K    E+ + E       EE +  K+    +++KK   +K K+K EQ  +E++  L   
Sbjct: 26 VKPNRYEKKDIEINTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQ--EELKNQLFSE 83

Query: 79 KEK 81
             
Sbjct: 84 NAT 86



 Score = 30.0 bits (68), Expect = 0.89
 Identities = 13/73 (17%), Positives = 28/73 (38%)

Query: 11  KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
                +K+   ++ +E  ++E+ K  +  E   +    K  KK+    + ++    S   
Sbjct: 54  FTPEEQKKITFQKHKEKPEQEELKNQLFSENATENNTVKATKKQLFSSEYEQTSSSSEST 113

Query: 71  IEESLKERKEKTL 83
            EE  K+     L
Sbjct: 114 SEEETKKTSSILL 126


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
           the C subunit of DNA polymerase delta. It carries the
           essential residues for binding to the Pol1 subunit of
           polymerase alpha, from residues 293-332, which are
           characterized by the motif D--G--VT, referred to as the
           DPIM motif. The first 160 residues of the protein form
           the minimal domain for binding to the B subunit, Cdc1,
           of polymerase delta, the final 10 C-terminal residues,
           362-372, being the DNA sliding clamp, PCNA, binding
           motif.
          Length = 427

 Score = 42.9 bits (101), Expect = 2e-04
 Identities = 19/93 (20%), Positives = 44/93 (47%), Gaps = 5/93 (5%)

Query: 7   GAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEE----KNKKKKKKKKKKKKKKKKKKKK 62
               K+K+ +K   E   +E  +EE  K DV  E    +     + + + + K   ++  
Sbjct: 223 KKKTKEKKEKKEASESTVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSD 282

Query: 63  KEEQSVKEIEESLKERKEKTLTEDSQCDTMIPV 95
            EE++ ++ +E  ++R +K + ++ + + M  V
Sbjct: 283 SEEETEEKEKEK-RKRLKKMMEDEDEDEEMEIV 314



 Score = 40.2 bits (94), Expect = 0.001
 Identities = 25/85 (29%), Positives = 43/85 (50%), Gaps = 7/85 (8%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKK-------KKKKKKKKKKKK 62
           E K +  K+ ++  KE   ++ + K  V+    K+   KK        KKK K+KK+KK+
Sbjct: 175 EVKVKSAKKTQDTSKETTTEKTEGKTSVKAASLKRNPPKKSNIMSSFFKKKTKEKKEKKE 234

Query: 63  KEEQSVKEIEESLKERKEKTLTEDS 87
             E +VKE  E    +++  L ++S
Sbjct: 235 ASESTVKEESEEESGKRDVILEDES 259



 Score = 37.1 bits (86), Expect = 0.013
 Identities = 21/73 (28%), Positives = 31/73 (42%), Gaps = 10/73 (13%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEE-----KEENEQEE--KEKEDVEEEKNKKKKKKKKKKKK 54
            +RRG     KK+  K   +EE     K+  E E   +++ +    K K K         
Sbjct: 349 GRRRGRRRVMKKKTFK---DEEGYLVTKKVYEWESFSEDEAEPPPTKPKPKVSTPAVPAA 405

Query: 55  KKKKKKKKKEEQS 67
            KK K  KK++QS
Sbjct: 406 AKKPKAPKKKKQS 418



 Score = 35.6 bits (82), Expect = 0.034
 Identities = 17/78 (21%), Positives = 41/78 (52%), Gaps = 3/78 (3%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E ++   KR+   E E  E    ++++ E+E     ++   +++ ++K+K+K+K    +K
Sbjct: 243 ESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKRKR---LK 299

Query: 70  EIEESLKERKEKTLTEDS 87
           ++ E   E +E  +  +S
Sbjct: 300 KMMEDEDEDEEMEIVPES 317



 Score = 34.4 bits (79), Expect = 0.084
 Identities = 19/61 (31%), Positives = 35/61 (57%)

Query: 20  KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERK 79
            E+E E+  +   E+ D EEE  +K+K+K+K+ KK  + + + +E + V E     +E +
Sbjct: 266 DEDEDEDEPKPSGERSDSEEETEEKEKEKRKRLKKMMEDEDEDEEMEIVPESPVEEEESE 325

Query: 80  E 80
           E
Sbjct: 326 E 326



 Score = 32.5 bits (74), Expect = 0.33
 Identities = 12/46 (26%), Positives = 24/46 (52%)

Query: 21  EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
           EE +E       +KE+ +EE        +++ +++  KKK  K+E+
Sbjct: 322 EESEEPEPPPLPKKEEEKEEVTVSPDGGRRRGRRRVMKKKTFKDEE 367



 Score = 31.3 bits (71), Expect = 0.78
 Identities = 16/80 (20%), Positives = 31/80 (38%), Gaps = 21/80 (26%)

Query: 3   KRRGGAGEKKKRIEKRNKEEEKEENEQEEK---------------------EKEDVEEEK 41
                  EK+K   KR K+  ++E+E EE                       K++ E+E+
Sbjct: 282 DSEEETEEKEKEKRKRLKKMMEDEDEDEEMEIVPESPVEEEESEEPEPPPLPKKEEEKEE 341

Query: 42  NKKKKKKKKKKKKKKKKKKK 61
                   +++ +++  KKK
Sbjct: 342 VTVSPDGGRRRGRRRVMKKK 361



 Score = 31.0 bits (70), Expect = 1.0
 Identities = 15/48 (31%), Positives = 25/48 (52%)

Query: 41  KNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQ 88
             KK K+KK+KK+  +   K++ EE+S K       E  E T  ++ +
Sbjct: 222 FKKKTKEKKEKKEASESTVKEESEEESGKRDVILEDESAEPTGLDEDE 269


>gnl|CDD|203444 pfam06424, PRP1_N, PRP1 splicing factor, N-terminal.  This domain
           is specific to the N-terminal part of the prp1 splicing
           factor, which is involved in mRNA splicing (and possibly
           also poly(A)+ RNA nuclear export and cell cycle
           progression). This domain is specific to the N terminus
           of the RNA splicing factor encoded by prp1. It is
           involved in mRNA splicing and possibly also poly(A)and
           RNA nuclear export and cell cycle progression.
          Length = 131

 Score = 40.7 bits (96), Expect = 2e-04
 Identities = 19/87 (21%), Positives = 50/87 (57%), Gaps = 10/87 (11%)

Query: 9   GEKKKRIEKRNKEEEKE------ENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
            +++    KR ++ + E      + + E++E + + E  +++  +++KK++++K+K++ +
Sbjct: 32  DDEEDEDPKRYQDGDNEGLFSDGKYDDEDEEADRIYESIDERMDERRKKRREQKEKEEIE 91

Query: 63  KEEQSVKEIEES---LKERKEKTLTED 86
           K  +   +I++    LK R   T+TED
Sbjct: 92  KYREENPKIQQQFADLK-RNLATVTED 117


>gnl|CDD|216292 pfam01086, Clathrin_lg_ch, Clathrin light chain. 
          Length = 225

 Score = 41.7 bits (98), Expect = 2e-04
 Identities = 19/70 (27%), Positives = 38/70 (54%), Gaps = 11/70 (15%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVE---EEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
            +  RIE+R++  EK++ E  EK +++++   E  N        +KK+K K + +K+ EQ
Sbjct: 120 RRDLRIEERDEASEKKKEELIEKAQKEIDDFYENYN--------EKKEKTKAQNRKEAEQ 171

Query: 67  SVKEIEESLK 76
            + E ++   
Sbjct: 172 FLAERDDFSS 181



 Score = 31.6 bits (72), Expect = 0.52
 Identities = 21/77 (27%), Positives = 39/77 (50%), Gaps = 14/77 (18%)

Query: 20  KEEEKEENE--QEEKEKEDVEEEKNKKKKKKKKKKKKKKKK------------KKKKKEE 65
              E EE E  ++ +E+ D+  E+  +  +KKK++  +K +            KK+K + 
Sbjct: 104 DRVEGEEPESIRKWRERRDLRIEERDEASEKKKEELIEKAQKEIDDFYENYNEKKEKTKA 163

Query: 66  QSVKEIEESLKERKEKT 82
           Q+ KE E+ L ER + +
Sbjct: 164 QNRKEAEQFLAERDDFS 180



 Score = 29.3 bits (66), Expect = 2.4
 Identities = 16/62 (25%), Positives = 37/62 (59%), Gaps = 4/62 (6%)

Query: 25  EENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIE---ESLKERKEK 81
           + +  E +E E + + + ++  + +++ +  +KKK++  E+ + KEI+   E+  E+KEK
Sbjct: 102 QADRVEGEEPESIRKWRERRDLRIEERDEASEKKKEELIEK-AQKEIDDFYENYNEKKEK 160

Query: 82  TL 83
           T 
Sbjct: 161 TK 162


>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
          Length = 813

 Score = 42.4 bits (100), Expect = 3e-04
 Identities = 18/86 (20%), Positives = 35/86 (40%), Gaps = 1/86 (1%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
           R  R   G+  +   K+    +K    ++  +K + E +   + +KK K K  KK  +K 
Sbjct: 729 RAPRN-VGKTAREKAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKA 787

Query: 62  KKEEQSVKEIEESLKERKEKTLTEDS 87
           KK     ++I  + K ++        
Sbjct: 788 KKPSAKTQKIAAATKAKRAAKKKVAE 813


>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
           envelope biogenesis, outer membrane].
          Length = 387

 Score = 42.2 bits (99), Expect = 3e-04
 Identities = 19/72 (26%), Positives = 46/72 (63%), Gaps = 1/72 (1%)

Query: 10  EKKKRIEKRNKE-EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
           ++KK+ E+  +E + K+  EQE  ++ + E  K ++++K+ ++ +K+ + ++K++EEQ+ 
Sbjct: 80  QRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQAR 139

Query: 69  KEIEESLKERKE 80
           K   E  K+ + 
Sbjct: 140 KAAAEQKKKAEA 151



 Score = 40.3 bits (94), Expect = 0.001
 Identities = 25/85 (29%), Positives = 45/85 (52%), Gaps = 4/85 (4%)

Query: 1   MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKK- 59
           + K R  A E++K+ E+  K+ + E+ +QEE+ ++   E+K K +  K K   +  K K 
Sbjct: 106 LEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKLKA 165

Query: 60  ---KKKKEEQSVKEIEESLKERKEK 81
               KKK E++ K  EE+  + +  
Sbjct: 166 AAEAKKKAEEAAKAAEEAKAKAEAA 190



 Score = 39.5 bits (92), Expect = 0.002
 Identities = 19/84 (22%), Positives = 47/84 (55%), Gaps = 3/84 (3%)

Query: 1   MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKK---KKKKKKKK 57
           ++ ++    E+ K++EK   + ++++ + EE EK+   E+K ++++ +K   ++KKK + 
Sbjct: 92  LKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEA 151

Query: 58  KKKKKKEEQSVKEIEESLKERKEK 81
            K K   E +  +     K++ E+
Sbjct: 152 AKAKAAAEAAKLKAAAEAKKKAEE 175



 Score = 39.2 bits (91), Expect = 0.003
 Identities = 14/78 (17%), Positives = 33/78 (42%)

Query: 4   RRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
            +     K K      K++ + E +   ++ +   E K K +KK +   ++K   +KKK 
Sbjct: 177 AKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKAEAAAEEKAAAEKKKA 236

Query: 64  EEQSVKEIEESLKERKEK 81
             ++  +   +  +  E+
Sbjct: 237 AAKAKADKAAAAAKAAER 254



 Score = 38.0 bits (88), Expect = 0.007
 Identities = 19/80 (23%), Positives = 50/80 (62%), Gaps = 4/80 (5%)

Query: 6   GGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKK----KKKKKKKKKKKKKKKKK 61
           G   ++  RI+ +    +K E ++++KE++  EE K K+    ++ K+ +K++ K ++++
Sbjct: 58  GAVVQQYGRIQSQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQ 117

Query: 62  KKEEQSVKEIEESLKERKEK 81
           K+ E++ K+ +   K+++E+
Sbjct: 118 KQAEEAEKQAQLEQKQQEEQ 137



 Score = 35.3 bits (81), Expect = 0.039
 Identities = 20/62 (32%), Positives = 28/62 (45%), Gaps = 1/62 (1%)

Query: 3   KRRGGAGEKKKRIEKRNK-EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
           K +  A   KK+ E   K   EK + E E K K + + E   ++K   +KKK   K K  
Sbjct: 184 KAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKAEAAAEEKAAAEKKKAAAKAKAD 243

Query: 62  KK 63
           K 
Sbjct: 244 KA 245



 Score = 34.9 bits (80), Expect = 0.049
 Identities = 21/97 (21%), Positives = 39/97 (40%), Gaps = 14/97 (14%)

Query: 8   AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKK-------- 59
           A  ++K+ E++ ++   E+ ++ E  K     E  K K   + KKK ++  K        
Sbjct: 127 AQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAKAAEEAKAK 186

Query: 60  ------KKKKEEQSVKEIEESLKERKEKTLTEDSQCD 90
                 KKK E ++    E++  E + K   E     
Sbjct: 187 AEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKAEA 223



 Score = 34.2 bits (78), Expect = 0.090
 Identities = 12/74 (16%), Positives = 30/74 (40%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E     +K   E +    + + + +   + EK  +   ++K   +KKK   K K +++  
Sbjct: 188 EAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKAEAAAEEKAAAEKKKAAAKAKADKAAA 247

Query: 70  EIEESLKERKEKTL 83
             + + ++     L
Sbjct: 248 AAKAAERKAAAAAL 261



 Score = 33.8 bits (77), Expect = 0.12
 Identities = 20/74 (27%), Positives = 29/74 (39%)

Query: 3   KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
           K +  A  KKK  E     EE +   +    K+  E E     +K K + + K K +KK 
Sbjct: 162 KLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKA 221

Query: 63  KEEQSVKEIEESLK 76
           +     K   E  K
Sbjct: 222 EAAAEEKAAAEKKK 235



 Score = 33.0 bits (75), Expect = 0.26
 Identities = 21/79 (26%), Positives = 34/79 (43%), Gaps = 4/79 (5%)

Query: 3   KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
            +   A E KK+ E+  K  E+ + + E    +   E + K   +K K + + K K +KK
Sbjct: 161 AKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKK 220

Query: 63  KEEQSVKEIEESLKERKEK 81
            E       EE     K+K
Sbjct: 221 AE----AAAEEKAAAEKKK 235



 Score = 31.5 bits (71), Expect = 0.63
 Identities = 15/54 (27%), Positives = 23/54 (42%), Gaps = 2/54 (3%)

Query: 8   AGEKKKRIEKRNKE--EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKK 59
           A  +K + E   K   E+K E   EEK   + ++   K K  K     K  ++K
Sbjct: 202 AAAEKAKAEAEAKAKAEKKAEAAAEEKAAAEKKKAAAKAKADKAAAAAKAAERK 255


>gnl|CDD|215521 PLN02967, PLN02967, kinase.
          Length = 581

 Score = 42.3 bits (99), Expect = 3e-04
 Identities = 19/95 (20%), Positives = 41/95 (43%), Gaps = 13/95 (13%)

Query: 3   KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKK---------- 52
            R      KK  +E     E  EE  +    ++   ++++KK  ++ ++K          
Sbjct: 74  TRSVKRATKKTVVEI---SEPLEEGSELVVNEDAALDKESKKTPRRTRRKAAAASSDVEE 130

Query: 53  KKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDS 87
           +K +KK +K+++ + + E  E      E +  E+S
Sbjct: 131 EKTEKKVRKRRKVKKMDEDVEDQGSESEVSDVEES 165



 Score = 37.7 bits (87), Expect = 0.008
 Identities = 19/94 (20%), Positives = 46/94 (48%), Gaps = 14/94 (14%)

Query: 9   GEKKKRIEKRNKEEEKEENEQEEKE------KEDVEEEKNKKKKKKKKKKK--------K 54
            +K  R  KR  ++   E  +  +E       ED   +K  KK  ++ ++K        +
Sbjct: 70  KKKPTRSVKRATKKTVVEISEPLEEGSELVVNEDAALDKESKKTPRRTRRKAAAASSDVE 129

Query: 55  KKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQ 88
           ++K +KK ++ + VK+++E ++++  ++   D +
Sbjct: 130 EEKTEKKVRKRRKVKKMDEDVEDQGSESEVSDVE 163



 Score = 36.9 bits (85), Expect = 0.016
 Identities = 23/80 (28%), Positives = 32/80 (40%), Gaps = 11/80 (13%)

Query: 6   GGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
                KK     R K               DVEEEK +KK +K++K KK  +  + +  E
Sbjct: 106 LDKESKKTPRRTRRK---------AAAASSDVEEEKTEKKVRKRRKVKKMDEDVEDQGSE 156

Query: 66  QSVKEIEESLKERKEKTLTE 85
             V ++EES  E       E
Sbjct: 157 SEVSDVEES--EFVTSLENE 174



 Score = 33.9 bits (77), Expect = 0.13
 Identities = 22/81 (27%), Positives = 37/81 (45%), Gaps = 4/81 (4%)

Query: 11  KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
           + +R   +NK      + ++ +    V+EE ++     KKK  +  K+  KK     V E
Sbjct: 32  RGERWRLQNKFLLAAGSRKKIESALAVDEEPDENGAVSKKKPTRSVKRATKKT----VVE 87

Query: 71  IEESLKERKEKTLTEDSQCDT 91
           I E L+E  E  + ED+  D 
Sbjct: 88  ISEPLEEGSELVVNEDAALDK 108



 Score = 31.9 bits (72), Expect = 0.48
 Identities = 15/74 (20%), Positives = 30/74 (40%), Gaps = 8/74 (10%)

Query: 8   AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
           AG +KK       +EE +EN            +K   +  K+  KK   +  +  +E   
Sbjct: 46  AGSRKKIESALAVDEEPDEN--------GAVSKKKPTRSVKRATKKTVVEISEPLEEGSE 97

Query: 68  VKEIEESLKERKEK 81
           +   E++  +++ K
Sbjct: 98  LVVNEDAALDKESK 111


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 42.1 bits (100), Expect = 3e-04
 Identities = 21/79 (26%), Positives = 47/79 (59%), Gaps = 2/79 (2%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E ++ +E++ +E E    E E K KE++EE+K K ++++ K  ++ +K+ ++  +E + K
Sbjct: 527 ELERELEQKAEEAEALLKEAE-KLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKE-AKK 584

Query: 70  EIEESLKERKEKTLTEDSQ 88
           E +E +KE ++      + 
Sbjct: 585 EADEIIKELRQLQKGGYAS 603



 Score = 37.5 bits (88), Expect = 0.010
 Identities = 21/89 (23%), Positives = 43/89 (48%), Gaps = 27/89 (30%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKEDVEEEK---------------------------NKK 44
           +++ EK  +EE+K   E E++ ++ ++E K                             +
Sbjct: 554 EEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKGGYASVKAHELIEAR 613

Query: 45  KKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
           K+  K  +KK+KKKKK+K++++ +K  +E
Sbjct: 614 KRLNKANEKKEKKKKKQKEKQEELKVGDE 642



 Score = 36.3 bits (85), Expect = 0.027
 Identities = 24/74 (32%), Positives = 48/74 (64%), Gaps = 3/74 (4%)

Query: 9   GEKKKRIEK--RNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
           GE K+++ +   + EE + E EQ+ +E E + +E  K K++ ++KK+K ++++ K  EE 
Sbjct: 512 GEDKEKLNELIASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEA 571

Query: 67  SVKEIEESLKERKE 80
             KE ++++KE K+
Sbjct: 572 E-KEAQQAIKEAKK 584



 Score = 35.2 bits (82), Expect = 0.060
 Identities = 18/57 (31%), Positives = 31/57 (54%), Gaps = 1/57 (1%)

Query: 14  RIEKRNKEEEKEEN-EQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
            I K  ++ +K      +  E  +  +  NK  +KK+KKKKK+K+K+++ K    VK
Sbjct: 588 EIIKELRQLQKGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEELKVGDEVK 644



 Score = 33.3 bits (77), Expect = 0.25
 Identities = 20/68 (29%), Positives = 37/68 (54%), Gaps = 2/68 (2%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E ++ I++  KE ++   E  + +K      K  +  + +K+  K  +KK+KKK++Q  K
Sbjct: 574 EAQQAIKEAKKEADEIIKELRQLQKGGYASVKAHELIEARKRLNKANEKKEKKKKKQ--K 631

Query: 70  EIEESLKE 77
           E +E LK 
Sbjct: 632 EKQEELKV 639



 Score = 32.9 bits (76), Expect = 0.29
 Identities = 23/97 (23%), Positives = 42/97 (43%), Gaps = 31/97 (31%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEK-------------NKKKKK--------- 47
           E +KR+ K N+++EK++ +Q+EK++E    ++             +    K         
Sbjct: 611 EARKRLNKANEKKEKKKKKQKEKQEELKVGDEVKYLSLGQKGEVLSIPDDKEAIVQAGIM 670

Query: 48  ---------KKKKKKKKKKKKKKKKEEQSVKEIEESL 75
                    +K +K KKKKKKK K  +   + +   L
Sbjct: 671 KMKVPLSDLEKIQKPKKKKKKKPKTVKPKPRTVSLEL 707



 Score = 32.1 bits (74), Expect = 0.46
 Identities = 20/70 (28%), Positives = 39/70 (55%), Gaps = 9/70 (12%)

Query: 17  KRNKEEEKEENEQ-----EEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
           K+   E+KE+  +     EE E+E  ++ +  +   K+ +K K++ ++KK+K    ++E 
Sbjct: 508 KKLIGEDKEKLNELIASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEK----LQEE 563

Query: 72  EESLKERKEK 81
           E+ L E  EK
Sbjct: 564 EDKLLEEAEK 573



 Score = 31.3 bits (72), Expect = 0.89
 Identities = 20/69 (28%), Positives = 40/69 (57%), Gaps = 8/69 (11%)

Query: 22  EEKEENEQEEKEK-----EDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLK 76
           EE ++   E+KEK       +EE + + ++K ++ +   K+ +K K+E   ++E +E L+
Sbjct: 505 EEAKKLIGEDKEKLNELIASLEELERELEQKAEEAEALLKEAEKLKEE---LEEKKEKLQ 561

Query: 77  ERKEKTLTE 85
           E ++K L E
Sbjct: 562 EEEDKLLEE 570


>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
          Length = 746

 Score = 42.1 bits (100), Expect = 4e-04
 Identities = 13/71 (18%), Positives = 38/71 (53%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
            +      + KK  +K+   + K++  + +  +     +  K +++++ +++++KKKKK+
Sbjct: 107 AEAAARRPKAKKAAKKKKGPKPKKKKPKRKAARGGKRGKGGKGRRRRRGRRRRRKKKKKQ 166

Query: 62  KKEEQSVKEIE 72
           K  E+  +E+ 
Sbjct: 167 KPTEKIPREVV 177



 Score = 41.0 bits (97), Expect = 9e-04
 Identities = 14/74 (18%), Positives = 39/74 (52%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
              R    +K  + +K  K ++K+   +  +  +  +  K +++++ +++++KKKKK+K 
Sbjct: 109 AAARRPKAKKAAKKKKGPKPKKKKPKRKAARGGKRGKGGKGRRRRRGRRRRRKKKKKQKP 168

Query: 62  KKEEQSVKEIEESL 75
            ++      I E++
Sbjct: 169 TEKIPREVVIPETI 182



 Score = 37.9 bits (89), Expect = 0.007
 Identities = 13/72 (18%), Positives = 24/72 (33%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E +K   KR  EEE +   +E    E  EE K +       ++  +     +     +  
Sbjct: 42  EARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAAAAPAEEAAEAAAAAEAAARPAED 101

Query: 70  EIEESLKERKEK 81
           E     +    +
Sbjct: 102 EAARPAEAAARR 113



 Score = 36.8 bits (86), Expect = 0.019
 Identities = 12/74 (16%), Positives = 28/74 (37%)

Query: 8   AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
            G + K      +EEE  + E + + +E+ + E  +    + +++ K +       EE +
Sbjct: 27  LGIEVKSHSSTVEEEEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAAAAPAEEAA 86

Query: 68  VKEIEESLKERKEK 81
                     R  +
Sbjct: 87  EAAAAAEAAARPAE 100



 Score = 36.8 bits (86), Expect = 0.019
 Identities = 16/77 (20%), Positives = 35/77 (45%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
           +  +   G K K+ + + K     +  +  K +      + ++KKKKK+K  +K  ++  
Sbjct: 118 KAAKKKKGPKPKKKKPKRKAARGGKRGKGGKGRRRRRGRRRRRKKKKKQKPTEKIPREVV 177

Query: 62  KKEEQSVKEIEESLKER 78
             E  +V E+ E +  +
Sbjct: 178 IPETITVAELAEKMAVK 194



 Score = 33.7 bits (78), Expect = 0.15
 Identities = 11/83 (13%), Positives = 44/83 (53%), Gaps = 2/83 (2%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
             R   A  ++ + +K  K+++  + ++++ +++       K+ K  K +++++ +++++
Sbjct: 103 AARPAEAAARRPKAKKAAKKKKGPKPKKKKPKRK--AARGGKRGKGGKGRRRRRGRRRRR 160

Query: 62  KKEEQSVKEIEESLKERKEKTLT 84
           KK+++     +   +    +T+T
Sbjct: 161 KKKKKQKPTEKIPREVVIPETIT 183



 Score = 32.1 bits (74), Expect = 0.48
 Identities = 14/81 (17%), Positives = 33/81 (40%), Gaps = 9/81 (11%)

Query: 11 KKKRIEKRNKE---------EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
           K R+ +  KE         E+ +E   E K      EE+  +K++ K++ +++ K + +
Sbjct: 2  SKVRVYELAKELGVSSKELLEKLKELGIEVKSHSSTVEEEEARKEEAKREAEEEAKAEAE 61

Query: 62 KKEEQSVKEIEESLKERKEKT 82
          +      +E  ++        
Sbjct: 62 EAAAAEAEEEAKAEAAAAAPA 82


>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 926

 Score = 41.9 bits (98), Expect = 4e-04
 Identities = 13/50 (26%), Positives = 25/50 (50%)

Query: 3   KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKK 52
            +  G  +K+ +IE  + E E E    +++E++ ++       KKKK  K
Sbjct: 801 PKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAPAILNVKKKKPYK 850



 Score = 36.9 bits (85), Expect = 0.017
 Identities = 9/50 (18%), Positives = 19/50 (38%)

Query: 48  KKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDTMIPVFK 97
           K   + K+K K+K   +++   E      E +  T+ +  +     P   
Sbjct: 792 KAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAPAIL 841



 Score = 35.3 bits (81), Expect = 0.054
 Identities = 14/61 (22%), Positives = 27/61 (44%), Gaps = 7/61 (11%)

Query: 11  KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKK-------KKKKKKKKKKK 63
           K     K+  +E+   +++ + E   VE E  +   K++++K          KKKK  K 
Sbjct: 792 KAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAPAILNVKKKKPYKV 851

Query: 64  E 64
           +
Sbjct: 852 D 852



 Score = 31.8 bits (72), Expect = 0.70
 Identities = 14/59 (23%), Positives = 26/59 (44%), Gaps = 5/59 (8%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
           ++K + +  N +E K E+   E E      E+   K++++K          KKK+   V
Sbjct: 798 KQKPKEKGPNDKEIKIESPSVETEG-----ERCTIKQREEKGIDAPAILNVKKKKPYKV 851



 Score = 31.5 bits (71), Expect = 0.93
 Identities = 18/88 (20%), Positives = 30/88 (34%), Gaps = 4/88 (4%)

Query: 11  KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
           +++  E     EE       +     +        + K+K K+K    K+ K E  SV  
Sbjct: 761 EEEEFEYPRMYEEASGKSVRKVNTAVLSTTIKAAARAKQKPKEKGPNDKEIKIESPSV-- 818

Query: 71  IEESLKERKEKTLTEDSQCDTMIPVFKK 98
             E+  ER      E+   D    +  K
Sbjct: 819 --ETEGERCTIKQREEKGIDAPAILNVK 844



 Score = 30.7 bits (69), Expect = 1.4
 Identities = 12/60 (20%), Positives = 27/60 (45%), Gaps = 4/60 (6%)

Query: 28  EQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDS 87
           +Q+ KEK   ++E     K +    + + ++   K+ E+   +    L  +K+K    D+
Sbjct: 798 KQKPKEKGPNDKEI----KIESPSVETEGERCTIKQREEKGIDAPAILNVKKKKPYKVDN 853



 Score = 30.7 bits (69), Expect = 1.5
 Identities = 10/60 (16%), Positives = 27/60 (45%)

Query: 24  KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTL 83
           K     ++K KE    +K  K +    + + ++   K+++E+        ++K++K   +
Sbjct: 792 KAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAPAILNVKKKKPYKV 851



 Score = 28.4 bits (63), Expect = 8.6
 Identities = 15/57 (26%), Positives = 19/57 (33%), Gaps = 1/57 (1%)

Query: 3   KRRGGAGEKKKRIEKRNKEEEKEENEQE-EKEKEDVEEEKNKKKKKKKKKKKKKKKK 58
             R     K+K    +  + E    E E E+      EEK          KKKK  K
Sbjct: 794 AARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAPAILNVKKKKPYK 850


>gnl|CDD|237200 PRK12778, PRK12778, putative bifunctional 2-polyprenylphenol
           hydroxylase/glutamate synthase subunit beta;
           Provisional.
          Length = 752

 Score = 41.7 bits (98), Expect = 5e-04
 Identities = 18/71 (25%), Positives = 31/71 (43%), Gaps = 1/71 (1%)

Query: 20  KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE-SLKER 78
           K  E EE  + E+       E  K  K K++   ++    +   E ++    EE +L   
Sbjct: 261 KTIEGEELLKLEERTAAWRAELRKSMKPKERTAIERVPMPELDPEYRAHNRFEEVNLGLT 320

Query: 79  KEKTLTEDSQC 89
           KE+ +TE  +C
Sbjct: 321 KEQAMTEAKRC 331


>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota).  DNA
           Topoisomerase I (eukaryota), DNA topoisomerase V,
           Vaccina virus topoisomerase, Variola virus
           topoisomerase, Shope fibroma virus topoisomeras.
          Length = 391

 Score = 41.2 bits (97), Expect = 5e-04
 Identities = 19/72 (26%), Positives = 40/72 (55%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
            + KR++K     E   + + + + +   + +    + K+KKK+KKK++KKKK+ E+  +
Sbjct: 291 YQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEE 350

Query: 70  EIEESLKERKEK 81
            IE+   +  +K
Sbjct: 351 RIEKLEVQATDK 362



 Score = 39.3 bits (92), Expect = 0.002
 Identities = 24/72 (33%), Positives = 39/72 (54%), Gaps = 1/72 (1%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           EK K ++ + K  +K     E       + +   ++  +K   + K+KKK+KKKEE+  K
Sbjct: 284 EKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKK 343

Query: 70  EIEESLKERKEK 81
           +IE  L+ER EK
Sbjct: 344 QIER-LEERIEK 354



 Score = 34.6 bits (80), Expect = 0.069
 Identities = 19/58 (32%), Positives = 35/58 (60%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
            K+K   K  ++ EK + E +EK+KE  +EEK KK+ ++ +++ +K + +   KEE  
Sbjct: 309 LKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEERIEKLEVQATDKEENK 366



 Score = 31.2 bits (71), Expect = 0.80
 Identities = 15/62 (24%), Positives = 36/62 (58%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
              KR  K   E + E+ + E KEK+  ++++ KKKK+ ++ +++ +K + +  +++  K
Sbjct: 307 SDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEERIEKLEVQATDKEENK 366

Query: 70  EI 71
            +
Sbjct: 367 TV 368



 Score = 28.1 bits (63), Expect = 8.5
 Identities = 14/69 (20%), Positives = 29/69 (42%), Gaps = 3/69 (4%)

Query: 24  KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL---KERKE 80
           K   +  EK +E ++  K + K+ KK     +     K+K +   +   E L    + K+
Sbjct: 273 KTHEKSMEKLQEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKK 332

Query: 81  KTLTEDSQC 89
           K   ++ + 
Sbjct: 333 KEKKKEEKK 341


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 41.5 bits (98), Expect = 5e-04
 Identities = 19/77 (24%), Positives = 32/77 (41%), Gaps = 4/77 (5%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E KK+  ++  +     +   +   +  E +  KKK K   K    K   KKK +     
Sbjct: 43  ESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLK----D 98

Query: 70  EIEESLKERKEKTLTED 86
           E++ S K  K+  L +D
Sbjct: 99  ELDSSKKAEKKNALDKD 115



 Score = 38.4 bits (90), Expect = 0.004
 Identities = 20/89 (22%), Positives = 33/89 (37%), Gaps = 10/89 (11%)

Query: 10  EKKKRIEKRNKEEEKEENEQEE--------KEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
           E K+ +E + K  E+ +              +  + +  K K K   K    K   KKK 
Sbjct: 37  EIKEALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKL 96

Query: 62  KKEEQSVK--EIEESLKERKEKTLTEDSQ 88
           K E  S K  E + +L +  +    +D  
Sbjct: 97  KDELDSSKKAEKKNALDKDDDLNYVKDID 125



 Score = 36.9 bits (86), Expect = 0.014
 Identities = 16/83 (19%), Positives = 30/83 (36%)

Query: 17 KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLK 76
             K E   E E ++K K+   + K+K    K++ K+  + KKK  ++   V      + 
Sbjct: 4  ASTKAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSGMV 63

Query: 77 ERKEKTLTEDSQCDTMIPVFKKG 99
          +  +     D          K  
Sbjct: 64 KDTDDATESDIPKKKTKTAAKAA 86



 Score = 30.0 bits (68), Expect = 2.0
 Identities = 11/63 (17%), Positives = 27/63 (42%)

Query: 20  KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERK 79
             ++  E++  +K+ +   +    K   KKK K +    KK +K+    K+ + +  +  
Sbjct: 65  DTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDI 124

Query: 80  EKT 82
           +  
Sbjct: 125 DVL 127



 Score = 30.0 bits (68), Expect = 2.3
 Identities = 10/79 (12%), Positives = 25/79 (31%)

Query: 8   AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
                   +  +  E     ++ +   +    +   KKK K +    KK +KK   ++  
Sbjct: 57  IFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDD 116

Query: 68  VKEIEESLKERKEKTLTED 86
                + +    +    +D
Sbjct: 117 DLNYVKDIDVLNQADDDDD 135



 Score = 29.2 bits (66), Expect = 3.4
 Identities = 10/57 (17%), Positives = 22/57 (38%), Gaps = 1/57 (1%)

Query: 31  EKEKEDVEEEKNKKKKKK-KKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTED 86
             +    +    KK K +    KK +KK    K ++ +  +  + L +  +    +D
Sbjct: 82  AAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDD 138



 Score = 28.8 bits (65), Expect = 4.8
 Identities = 13/65 (20%), Positives = 24/65 (36%)

Query: 23 EKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKT 82
              + + E   E+  ++K KK   K K K    K++ K+  E   K  E+  +     +
Sbjct: 1  MTTASTKAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLS 60

Query: 83 LTEDS 87
               
Sbjct: 61 GMVKD 65



 Score = 28.4 bits (64), Expect = 7.0
 Identities = 12/66 (18%), Positives = 33/66 (50%)

Query: 20  KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERK 79
            +++ ++++ +E + ED +++    + ++KK+ K+ +K            + E   + RK
Sbjct: 143 DDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQARK 202

Query: 80  EKTLTE 85
           +  LT 
Sbjct: 203 DAKLTA 208


>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin.  Trichoplein
           or mitostatin, was first defined as a meiosis-specific
           nuclear structural protein. It has since been linked
           with mitochondrial movement. It is associated with the
           mitochondrial outer membrane, and over-expression leads
           to reduction in mitochondrial motility whereas lack of
           it enhances mitochondrial movement. The activity appears
           to be mediated through binding the mitochondria to the
           actin intermediate filaments (IFs).
          Length = 349

 Score = 41.0 bits (97), Expect = 7e-04
 Identities = 22/72 (30%), Positives = 48/72 (66%), Gaps = 3/72 (4%)

Query: 12  KKRIEKRNKEEEKEEN---EQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
           KKRI+   KEEE+  +   E+E  +    EEE+ +K+K+++++ +   +++ +++E++  
Sbjct: 29  KKRIKAEEKEEERRIDEMMEEERLKALAEEEERERKRKEERREGRAVLQEQIEEREKRRQ 88

Query: 69  KEIEESLKERKE 80
           +E EE L+ER++
Sbjct: 89  EEYEERLQEREQ 100



 Score = 34.9 bits (81), Expect = 0.062
 Identities = 15/60 (25%), Positives = 41/60 (68%)

Query: 22 EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
          EEK+  + EEKE+E   +E  ++++ K   +++++++K+K++  +    ++E ++ER+++
Sbjct: 27 EEKKRIKAEEKEEERRIDEMMEEERLKALAEEEERERKRKEERREGRAVLQEQIEEREKR 86



 Score = 33.0 bits (76), Expect = 0.21
 Identities = 26/84 (30%), Positives = 52/84 (61%), Gaps = 6/84 (7%)

Query: 3   KRRGGAGEKKKRIEKR--NKEEEKEENEQEEKEK----EDVEEEKNKKKKKKKKKKKKKK 56
           +RR G    +++IE+R   ++EE EE  QE ++     E ++EE   + ++K++K+KK +
Sbjct: 68  ERREGRAVLQEQIEEREKRRQEEYEERLQEREQMDEIIERIQEEDEAEAQEKREKQKKLR 127

Query: 57  KKKKKKKEEQSVKEIEESLKERKE 80
           ++  +  EE+  ++ EE  +ER+E
Sbjct: 128 EEIDEFNEERIERKEEEKEREREE 151



 Score = 32.6 bits (75), Expect = 0.33
 Identities = 20/78 (25%), Positives = 50/78 (64%), Gaps = 8/78 (10%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEK---EDVEEEKNKKKKKKKKKKKKKKKKKKK-----KK 63
            + IE+  +E+E E  E+ EK+K   E+++E   ++ ++K+++K+++++++ K     ++
Sbjct: 102 DEIIERIQEEDEAEAQEKREKQKKLREEIDEFNEERIERKEEEKEREREEELKILEYQRE 161

Query: 64  EEQSVKEIEESLKERKEK 81
           + +  +E E   +ERKE+
Sbjct: 162 KAEREEEREAERRERKEE 179



 Score = 31.4 bits (72), Expect = 0.60
 Identities = 20/83 (24%), Positives = 50/83 (60%), Gaps = 12/83 (14%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKK------KKKKKKKKKKKKKKKK 63
           E+++RI++   EEE+ +   EE+E+E   +E+ ++ +       ++++K+++++ +++ +
Sbjct: 38  EEERRIDEM-MEEERLKALAEEEERERKRKEERREGRAVLQEQIEEREKRRQEEYEERLQ 96

Query: 64  EEQSVKEI-----EESLKERKEK 81
           E + + EI     EE   E +EK
Sbjct: 97  EREQMDEIIERIQEEDEAEAQEK 119



 Score = 31.4 bits (72), Expect = 0.64
 Identities = 15/83 (18%), Positives = 41/83 (49%), Gaps = 13/83 (15%)

Query: 12  KKRIEKRNKEEEKEENE-----QEEKEKEDVEEEKNKKKKKKKKKK--------KKKKKK 58
           K+   +  + EE+ E E     Q E E+ + E  + ++ K+ + ++        K++++ 
Sbjct: 246 KEERLQEERAEEEAERERMLEKQAEDEELEQENAEKRRMKRLEHRRELEQQIEEKEERRA 305

Query: 59  KKKKKEEQSVKEIEESLKERKEK 81
            ++++E +  + + E   ER+ +
Sbjct: 306 AEREEELEEGERLREEEAERQAR 328



 Score = 31.4 bits (72), Expect = 0.77
 Identities = 23/94 (24%), Positives = 54/94 (57%), Gaps = 14/94 (14%)

Query: 1   MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEK---EDVEEEKNKKKKKKKKKKKKKKK 57
             +RR    EK++ + +   ++E+ E+E+EE ++   +  +EE  +K+++K+K++ +K++
Sbjct: 170 EAERRERKEEKEREVARLRAQQEEAEDEREELDELRADLYQEEYERKERQKEKEEAEKRR 229

Query: 58  KKKKK-----------KEEQSVKEIEESLKERKE 80
           ++K++           KEE+  +E  E   ER+ 
Sbjct: 230 RQKQELQRAREEQIEEKEERLQEERAEEEAERER 263



 Score = 31.0 bits (71), Expect = 0.94
 Identities = 21/86 (24%), Positives = 51/86 (59%), Gaps = 14/86 (16%)

Query: 10  EKKKRIEKRNKE-----------EEKEENEQEEKEKEDVEEEKNKK--KKKKKKKKKKKK 56
           ++ +  EKR K+           EE+ E ++EEKE+E  EE K  +  ++K +++++++ 
Sbjct: 112 DEAEAQEKREKQKKLREEIDEFNEERIERKEEEKEREREEELKILEYQREKAEREEEREA 171

Query: 57  KKKKKKKE-EQSVKEIEESLKERKEK 81
           +++++K+E E+ V  +    +E +++
Sbjct: 172 ERRERKEEKEREVARLRAQQEEAEDE 197



 Score = 30.3 bits (69), Expect = 1.7
 Identities = 15/85 (17%), Positives = 48/85 (56%), Gaps = 14/85 (16%)

Query: 11  KKKRIEKRNKEEEKEE------------NEQEEKEKEDVEEEKNK--KKKKKKKKKKKKK 56
           +K++ E   +  +K+E             E+ ++E+ + E E+ +  +K+ + ++ +++ 
Sbjct: 219 QKEKEEAEKRRRQKQELQRAREEQIEEKEERLQEERAEEEAERERMLEKQAEDEELEQEN 278

Query: 57  KKKKKKKEEQSVKEIEESLKERKEK 81
            +K++ K  +  +E+E+ ++E++E+
Sbjct: 279 AEKRRMKRLEHRRELEQQIEEKEER 303



 Score = 30.3 bits (69), Expect = 1.8
 Identities = 15/73 (20%), Positives = 44/73 (60%), Gaps = 3/73 (4%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK---KKKEEQ 66
           E+ +R E+  + E +EE +  E ++E  E E+ ++ +++++K++K+++  +   +++E +
Sbjct: 136 ERIERKEEEKEREREEELKILEYQREKAEREEEREAERRERKEEKEREVARLRAQQEEAE 195

Query: 67  SVKEIEESLKERK 79
             +E  + L+   
Sbjct: 196 DEREELDELRADL 208



 Score = 29.9 bits (68), Expect = 2.0
 Identities = 12/76 (15%), Positives = 46/76 (60%), Gaps = 2/76 (2%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           EK+   E+  +E  ++   +  + + ++E++  +K++++  +++++ ++ ++ +EE+   
Sbjct: 266 EKQAEDEELEQENAEKRRMKRLEHRRELEQQIEEKEERRAAEREEELEEGERLREEE--A 323

Query: 70  EIEESLKERKEKTLTE 85
           E +  ++E +++ L E
Sbjct: 324 ERQARIEEERQRLLKE 339



 Score = 29.5 bits (67), Expect = 2.9
 Identities = 18/96 (18%), Positives = 48/96 (50%), Gaps = 27/96 (28%)

Query: 13  KRIEKRNKEEEKEENEQEEKEKED-----------VEEEKNKKKKKKKKKKK-------- 53
           + +E+   +   EE E+E K KE+           +EE + +++++ +++ +        
Sbjct: 45  EMMEEERLKALAEEEERERKRKEERREGRAVLQEQIEEREKRRQEEYEERLQEREQMDEI 104

Query: 54  --------KKKKKKKKKKEEQSVKEIEESLKERKEK 81
                   + + ++K++K+++  +EI+E  +ER E+
Sbjct: 105 IERIQEEDEAEAQEKREKQKKLREEIDEFNEERIER 140


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 728

 Score = 41.2 bits (97), Expect = 7e-04
 Identities = 24/83 (28%), Positives = 43/83 (51%), Gaps = 9/83 (10%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKK-------KKKKKKKKKKKKKKK 62
           E    IE+  +E E EE   EE+ +E    +KN  ++K       K+ + KK KK+ K +
Sbjct: 389 ENDAEIEELRRELEGEEESDEEENEE--PSKKNVGRRKFGPENGEKEAESKKLKKENKNE 446

Query: 63  KEEQSVKEIEESLKERKEKTLTE 85
            +E+   + EE L++ +E  + +
Sbjct: 447 FKEKKESDEEEELEDEEEAKVEK 469



 Score = 40.8 bits (96), Expect = 0.001
 Identities = 25/71 (35%), Positives = 38/71 (53%), Gaps = 2/71 (2%)

Query: 13  KRIEKRNKEEEKEENEQEEKEKEDVEE--EKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
           +R E R KEE   E E+  +E E  EE  E+  ++  KK   ++K   +  +KE +S K 
Sbjct: 380 QRAEARKKEENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKL 439

Query: 71  IEESLKERKEK 81
            +E+  E KEK
Sbjct: 440 KKENKNEFKEK 450



 Score = 38.1 bits (89), Expect = 0.006
 Identities = 20/61 (32%), Positives = 37/61 (60%)

Query: 8   AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
              + K+++K NK E KE+ E +E+E+ + EEE   +K   K  K+ +K +K++++EE  
Sbjct: 432 KEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELD 491

Query: 68  V 68
            
Sbjct: 492 E 492



 Score = 37.7 bits (88), Expect = 0.008
 Identities = 25/74 (33%), Positives = 36/74 (48%), Gaps = 1/74 (1%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEK-EKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
           +  KR EK  KEEE+EE ++E    K      K+ KK+  KKK   K  K   K  + +V
Sbjct: 473 KLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQDSKKKSSSKLDKAANKISKAAV 532

Query: 69  KEIEESLKERKEKT 82
           K  ++  KE+    
Sbjct: 533 KVKKKKKKEKSIDL 546



 Score = 37.0 bits (86), Expect = 0.015
 Identities = 22/85 (25%), Positives = 44/85 (51%), Gaps = 10/85 (11%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK----------KEE 65
           E+ ++EE +E +++    ++   E   K+ + KK KK+ K + K+KK          +EE
Sbjct: 405 EESDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEE 464

Query: 66  QSVKEIEESLKERKEKTLTEDSQCD 90
             V+++   L +R EK   E+ + +
Sbjct: 465 AKVEKVANKLLKRSEKAQKEEEEEE 489



 Score = 36.6 bits (85), Expect = 0.019
 Identities = 17/75 (22%), Positives = 38/75 (50%), Gaps = 1/75 (1%)

Query: 17  KRNKEEEKEENEQEEKE-KEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
           +R +  +KEEN+ E +E + ++E E+   +++ ++  KK   ++K   E    +   + L
Sbjct: 380 QRAEARKKEENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKL 439

Query: 76  KERKEKTLTEDSQCD 90
           K+  +    E  + D
Sbjct: 440 KKENKNEFKEKKESD 454



 Score = 36.6 bits (85), Expect = 0.020
 Identities = 23/81 (28%), Positives = 42/81 (51%), Gaps = 3/81 (3%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEE---KNKKKKKKKKKKKKKKKKKKKKKEEQ 66
           E+ K  +  NK  ++ E  Q+E+E+E+++EE           K  KK+  KKK   K ++
Sbjct: 463 EEAKVEKVANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQDSKKKSSSKLDK 522

Query: 67  SVKEIEESLKERKEKTLTEDS 87
           +  +I ++  + K+K   E S
Sbjct: 523 AANKISKAAVKVKKKKKKEKS 543



 Score = 35.0 bits (81), Expect = 0.061
 Identities = 21/75 (28%), Positives = 42/75 (56%), Gaps = 2/75 (2%)

Query: 3   KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
            RR    E  ++  +  K +++ +NE +EK++ D EEE   +++ K +K   K  K+ +K
Sbjct: 421 GRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEK 480

Query: 63  KEEQSVKEIEESLKE 77
            +++  +E EE  +E
Sbjct: 481 AQKE--EEEEELDEE 493



 Score = 35.0 bits (81), Expect = 0.061
 Identities = 17/57 (29%), Positives = 30/57 (52%), Gaps = 14/57 (24%)

Query: 21  EEEKEENEQEEKEKE--------------DVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
           E+EK+E  +EE  KE               +++ K K+K+K++   K +  KK+K+K
Sbjct: 596 EKEKKEVIEEEDPKEIDLTLPGWGSWAGDGIKKRKKKRKRKRRFLTKIEGVKKEKRK 652



 Score = 33.5 bits (77), Expect = 0.17
 Identities = 19/103 (18%), Positives = 43/103 (41%), Gaps = 19/103 (18%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKK-------------- 55
           E K   +++ + +E+EE E EE+ K +    K  K+ +K +K++++              
Sbjct: 442 ENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDEENPWLKTTS 501

Query: 56  -----KKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDTMI 93
                 KK+  KK+  S  +   +   +    + +  + +  I
Sbjct: 502 SVGKSAKKQDSKKKSSSKLDKAANKISKAAVKVKKKKKKEKSI 544



 Score = 33.1 bits (76), Expect = 0.24
 Identities = 22/78 (28%), Positives = 36/78 (46%), Gaps = 25/78 (32%)

Query: 23  EKEENEQE-EKEKEDVEEEKNKK----------------KKKKKKKKKKKKKKKKKKKEE 65
             ++   E EKEK++V EE++ K                 KK+KKK+K+K++   K +  
Sbjct: 587 AGDDVVAEFEKEKKEVIEEEDPKEIDLTLPGWGSWAGDGIKKRKKKRKRKRRFLTKIEGV 646

Query: 66  QSVKEIEESLKERKEKTL 83
           +  K        RK+K L
Sbjct: 647 KKEK--------RKDKKL 656



 Score = 33.1 bits (76), Expect = 0.25
 Identities = 16/94 (17%), Positives = 37/94 (39%), Gaps = 14/94 (14%)

Query: 8   AGEKKKRIEKRNKEEEKEENEQEEKEKED--------------VEEEKNKKKKKKKKKKK 53
                +R       E++ E+++ +KE ++               +EE+ K +K   K  K
Sbjct: 417 KKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLK 476

Query: 54  KKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDS 87
           + +K +K+++EE+  +E              +  
Sbjct: 477 RSEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQ 510



 Score = 32.7 bits (75), Expect = 0.30
 Identities = 16/60 (26%), Positives = 27/60 (45%), Gaps = 1/60 (1%)

Query: 11  KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK-EEQSVK 69
               + K  K+++ ++    + +K   +  K   K KKKKKK+K          EE S+K
Sbjct: 499 TTSSVGKSAKKQDSKKKSSSKLDKAANKISKAAVKVKKKKKKEKSIDLDDDLIDEEDSIK 558



 Score = 32.0 bits (73), Expect = 0.53
 Identities = 20/39 (51%), Positives = 24/39 (61%), Gaps = 2/39 (5%)

Query: 38  EEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLK 76
           EE K K+ KK K KK  +  KK+K KEE   KE EE +K
Sbjct: 216 EEAKAKRIKKIKSKKYHRVHKKEKLKEEL--KEFEELVK 252



 Score = 32.0 bits (73), Expect = 0.57
 Identities = 24/81 (29%), Positives = 34/81 (41%), Gaps = 7/81 (8%)

Query: 10  EKKKRIEKRNKE-----EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
           EK K   K  +E      E    E E+ E+   EE    K K   K  K   K  + K +
Sbjct: 238 EKLKEELKEFEELVKADPEAALEELEKLERRRAEERMTLKHKNTSKWAKSMLKTGRAKYD 297

Query: 65  EQSVKEIEESLKERKEKTLTE 85
           E++ K +EE L  R+ + L  
Sbjct: 298 EEARKALEEQL--RQGEELRR 316



 Score = 31.6 bits (72), Expect = 0.73
 Identities = 18/86 (20%), Positives = 41/86 (47%), Gaps = 20/86 (23%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKK------------ 59
           +++IE ++  EE E+ + + +E+++ ++E +   +     +KK  K K            
Sbjct: 315 RRKIEGKSVSEEDEDEDSDSEEEDEDDDEDDDDGENPWMLRKKLGKLKEGEDDEENSGLL 374

Query: 60  --------KKKKEEQSVKEIEESLKE 77
                   + +K+E++  EIEE  +E
Sbjct: 375 SMKFMQRAEARKKEENDAEIEELRRE 400



 Score = 29.6 bits (67), Expect = 2.7
 Identities = 18/80 (22%), Positives = 40/80 (50%)

Query: 8   AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
             E  K+   R K   +   ++ E +K   E +   K+KK+  ++++ + +++ K E+ +
Sbjct: 412 NEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVA 471

Query: 68  VKEIEESLKERKEKTLTEDS 87
            K ++ S K +KE+   E  
Sbjct: 472 NKLLKRSEKAQKEEEEEELD 491


>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
          Length = 213

 Score = 40.3 bits (95), Expect = 7e-04
 Identities = 14/64 (21%), Positives = 35/64 (54%), Gaps = 2/64 (3%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E K +++ +  E++ ++ E   ++++     + +K K K  +KK+K + +K + +   V 
Sbjct: 105 EAKAKVQAQRAEQQAKKREAAGEKEK--APRRERKPKPKAPRKKRKPRAQKPEPQHTPVS 162

Query: 70  EIEE 73
           +I E
Sbjct: 163 DISE 166



 Score = 34.5 bits (80), Expect = 0.046
 Identities = 9/49 (18%), Positives = 27/49 (55%)

Query: 22  EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
           EE +   Q ++ ++  ++ +   +K+K  ++++K K K  +K+ +   +
Sbjct: 104 EEAKAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQ 152



 Score = 33.7 bits (78), Expect = 0.085
 Identities = 13/53 (24%), Positives = 29/53 (54%)

Query: 21  EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
           EE K + + +  E++  + E   +K+K  ++++K K K  +KK +   ++ E 
Sbjct: 104 EEAKAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEP 156



 Score = 33.0 bits (76), Expect = 0.15
 Identities = 8/46 (17%), Positives = 24/46 (52%)

Query: 40  EKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTE 85
           E+  KK++   +K+K  ++++K K +   K+ +   ++ + +    
Sbjct: 116 EQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPV 161



 Score = 31.8 bits (73), Expect = 0.40
 Identities = 7/55 (12%), Positives = 26/55 (47%)

Query: 31  EKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTE 85
            +  E   +++    +K+K  ++++K K K  ++++  +  +   +      ++E
Sbjct: 112 AQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISE 166



 Score = 27.6 bits (62), Expect = 8.0
 Identities = 14/64 (21%), Positives = 30/64 (46%), Gaps = 5/64 (7%)

Query: 11  KKKRIEKRNK--EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE---E 65
           + +R E++ K  E   E+ +   +E++   +   KK+K + +K + +        E    
Sbjct: 111 QAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISELTVG 170

Query: 66  QSVK 69
           Q+VK
Sbjct: 171 QAVK 174



 Score = 27.6 bits (62), Expect = 9.2
 Identities = 13/64 (20%), Positives = 33/64 (51%), Gaps = 1/64 (1%)

Query: 26  ENEQEEKEKEDVEEEKNK-KKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLT 84
           E E  E  ++ +EE K K + ++ +++ KK++   +K+K  +  ++ +     +K K   
Sbjct: 92  EEEHVEHARKQLEEAKAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRA 151

Query: 85  EDSQ 88
           +  +
Sbjct: 152 QKPE 155


>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 438

 Score = 41.0 bits (96), Expect = 8e-04
 Identities = 20/71 (28%), Positives = 39/71 (54%), Gaps = 3/71 (4%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
            +KKR   +N+E  K ENE++ +E  ++E EK    +++  +     + K+ +++ +S  
Sbjct: 141 IEKKRENNKNEERLKFENEKKLEESLELEREK---FEEQLHEANLDLEFKENEEQRESKW 197

Query: 70  EIEESLKERKE 80
            I + LK R E
Sbjct: 198 AILKKLKRRAE 208



 Score = 37.1 bits (86), Expect = 0.012
 Identities = 20/76 (26%), Positives = 45/76 (59%), Gaps = 1/76 (1%)

Query: 11  KKKRIEKRNKEEEKE-ENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           K + +E  N E++KE E  ++E ++   E +K  +   +  +KK++  K +++ + ++ K
Sbjct: 101 KLEALELLNLEKDKELELLEKELDELSKELQKQLQNTAEIIEKKRENNKNEERLKFENEK 160

Query: 70  EIEESLKERKEKTLTE 85
           ++EESL+  +EK   +
Sbjct: 161 KLEESLELEREKFEEQ 176



 Score = 36.3 bits (84), Expect = 0.019
 Identities = 17/72 (23%), Positives = 37/72 (51%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E   + E    E  +  N +++KE E +E+E ++  K+ +K+ +   +  +KK+E    +
Sbjct: 92  ELALQDEIAKLEALELLNLEKDKELELLEKELDELSKELQKQLQNTAEIIEKKRENNKNE 151

Query: 70  EIEESLKERKEK 81
           E  +   E+K +
Sbjct: 152 ERLKFENEKKLE 163



 Score = 32.9 bits (75), Expect = 0.24
 Identities = 17/70 (24%), Positives = 36/70 (51%), Gaps = 1/70 (1%)

Query: 10  EKKKRIEKRNKE-EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
           +KK++    N++ ++ E   Q+E  K +  E  N +K K+ +  +K+  +  K+ ++Q  
Sbjct: 76  QKKEQKNLFNEQIKQFELALQDEIAKLEALELLNLEKDKELELLEKELDELSKELQKQLQ 135

Query: 69  KEIEESLKER 78
              E   K+R
Sbjct: 136 NTAEIIEKKR 145



 Score = 32.5 bits (74), Expect = 0.39
 Identities = 14/79 (17%), Positives = 34/79 (43%), Gaps = 2/79 (2%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           + +   +K   + E++   Q++++K    E+  + +   + +  K +  +    E+   K
Sbjct: 57  QYRSAKKKELSQLEEQLINQKKEQKNLFNEQIKQFELALQDEIAKLEALELLNLEKD--K 114

Query: 70  EIEESLKERKEKTLTEDSQ 88
           E+E   KE  E +     Q
Sbjct: 115 ELELLEKELDELSKELQKQ 133



 Score = 31.3 bits (71), Expect = 0.75
 Identities = 25/82 (30%), Positives = 42/82 (51%), Gaps = 8/82 (9%)

Query: 10  EKKKRIEKRNKEEEKEENEQE-------EKEKEDVEEE-KNKKKKKKKKKKKKKKKKKKK 61
           E + RI    +E EKE NE+        +KE   +EE+  N+KK++K    ++ K+ +  
Sbjct: 35  EDQSRILNTLEEFEKEANEKRAQYRSAKKKELSQLEEQLINQKKEQKNLFNEQIKQFELA 94

Query: 62  KKEEQSVKEIEESLKERKEKTL 83
            ++E +  E  E L   K+K L
Sbjct: 95  LQDEIAKLEALELLNLEKDKEL 116



 Score = 29.8 bits (67), Expect = 2.5
 Identities = 11/87 (12%), Positives = 41/87 (47%), Gaps = 8/87 (9%)

Query: 10  EKKKRIEKRNKEEEKEENEQEE---KEKEDVEEEKNKKKKKKKKKKKKKKK-----KKKK 61
           E  +  E+   ++++++N   E   + +  +++E  K +  +    +K K+     K+  
Sbjct: 65  ELSQLEEQLINQKKEQKNLFNEQIKQFELALQDEIAKLEALELLNLEKDKELELLEKELD 124

Query: 62  KKEEQSVKEIEESLKERKEKTLTEDSQ 88
           +  ++  K+++ + +  ++K     ++
Sbjct: 125 ELSKELQKQLQNTAEIIEKKRENNKNE 151


>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987).  A
           family of uncharacterized proteins found by clustering
           human gut metagenomic sequences.
          Length = 379

 Score = 40.7 bits (96), Expect = 8e-04
 Identities = 24/63 (38%), Positives = 35/63 (55%), Gaps = 1/63 (1%)

Query: 20  KEEEKEENEQEEKEKEDVEEEKNK-KKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKER 78
           +E E+E  E+ E+E ++ E EK   + +KK  +KK KK  KK K EE   +E+ E   E 
Sbjct: 64  EEIEEELREEYEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEELLELEAEE 123

Query: 79  KEK 81
            E 
Sbjct: 124 PEP 126



 Score = 38.0 bits (89), Expect = 0.005
 Identities = 30/92 (32%), Positives = 44/92 (47%), Gaps = 12/92 (13%)

Query: 6   GGAGEKK-----------KRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKK 54
             +GE+K           + IE+  +EE +EE ++ E EKE  E EK K  +KK KK  K
Sbjct: 46  AESGERKSAVDKLAMKPLEEIEEELREEYEEELKEYEAEKEIWEAEK-KGLEKKAKKAIK 104

Query: 55  KKKKKKKKKEEQSVKEIEESLKERKEKTLTED 86
           K K ++   EE    E EE     + + +  D
Sbjct: 105 KGKDEEALAEELLELEAEEPEPPLRPRLIVND 136


>gnl|CDD|218747 pfam05786, Cnd2, Condensin complex subunit 2.  This family consists
           of several Barren protein homologues from several
           eukaryotic organisms. In Drosophila Barren (barr) is
           required for sister-chromatid segregation in mitosis.
           barr encodes a novel protein that is present in
           proliferating cells and has homologues in yeast and
           human. Mitotic defects in barr embryos become apparent
           during cycle 16, resulting in a loss of PNS and CNS
           neurons. Centromeres move apart at the
           metaphase-anaphase transition and Cyclin B is degraded,
           but sister chromatids remain connected, resulting in
           chromatin bridging. Barren protein localises to
           chromatin throughout mitosis. Colocalisation and
           biochemical experiments indicate that Barren associates
           with Topoisomerase II throughout mitosis and alters the
           activity of Topoisomerase II. It has been suggested that
           this association is required for proper chromosomal
           segregation by facilitating the decatenation of
           chromatids at anaphase. This family forms one of the
           three non-structural maintenance of chromosomes (SMC)
           subunits of the mitotic condensation complex along with
           Cnd1 and Cnd3.
          Length = 719

 Score = 40.9 bits (96), Expect = 9e-04
 Identities = 22/88 (25%), Positives = 37/88 (42%), Gaps = 16/88 (18%)

Query: 11  KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
                E      E ++ E +++E  D EE+  KKK KKK+++K                 
Sbjct: 151 DSDNKEAPETGREGDDGEDDDEEGSDGEEDGAKKKAKKKRQRK-------------PEST 197

Query: 71  IEESLKERKEKTLTEDSQCDTMIPVFKK 98
           +E++ +    K L  +   D   P+FKK
Sbjct: 198 LEKNFEALNVKKLDLEFAVD---PLFKK 222



 Score = 29.0 bits (65), Expect = 4.6
 Identities = 9/35 (25%), Positives = 14/35 (40%)

Query: 44  KKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKER 78
                K   +K+KK+ K+  E      +EE   E 
Sbjct: 435 AAADTKSAAEKQKKRAKEPFEIDFGAPLEEIDFEV 469


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
           consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 40.8 bits (95), Expect = 0.001
 Identities = 17/89 (19%), Positives = 41/89 (46%), Gaps = 8/89 (8%)

Query: 8   AGEKKKRIEKRNKEEEKEENE---QEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
           A +++  + ++ +E +           KE + V E + ++ +K + + KK  ++  K K+
Sbjct: 243 ADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEEALKAKD 302

Query: 65  ---EQSVKEIEESLKERKEKTLTEDSQCD 90
                  +E + S KE ++K    ++Q  
Sbjct: 303 HKAFDLKQESKASEKEAEDK--ELEAQKK 329



 Score = 40.0 bits (93), Expect = 0.002
 Identities = 25/99 (25%), Positives = 43/99 (43%), Gaps = 12/99 (12%)

Query: 7   GAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKK-KKKKKKKKKKKKKKEE 65
              E K+  E + +E EK    Q E +K D E  K K  K    K++ K  +K+ + KE 
Sbjct: 268 SPKEDKQVAENQKREIEKA---QIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKEL 324

Query: 66  QSVKEIEESLKE--------RKEKTLTEDSQCDTMIPVF 96
           ++ K+ E   ++          + T   +   D+  PV+
Sbjct: 325 EAQKKREPVAEDLQKTKPQVEAQPTSLNEDAIDSSNPVY 363



 Score = 39.2 bits (91), Expect = 0.002
 Identities = 21/80 (26%), Positives = 39/80 (48%), Gaps = 3/80 (3%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV- 68
            K        ++++  EN++ E EK  +E +KN ++  K K  K    K++ K  E+   
Sbjct: 261 PKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAE 320

Query: 69  KEIEESLKERKEKTLTEDSQ 88
            +  E+ K+R+   + ED Q
Sbjct: 321 DKELEAQKKREP--VAEDLQ 338



 Score = 37.3 bits (86), Expect = 0.010
 Identities = 18/82 (21%), Positives = 38/82 (46%), Gaps = 1/82 (1%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKE-DVEEEKNKKKKKKKKKKKKKKKKKK 60
           ++ +    E  K+    +K ++K +  Q+  +K+ D   +K ++ K   K       K+ 
Sbjct: 213 KRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKED 272

Query: 61  KKKEEQSVKEIEESLKERKEKT 82
           K+  E   +EIE++  E K+  
Sbjct: 273 KQVAENQKREIEKAQIEIKKND 294



 Score = 36.1 bits (83), Expect = 0.028
 Identities = 12/74 (16%), Positives = 37/74 (50%), Gaps = 1/74 (1%)

Query: 8   AGEKKKRIEKRNKEEEKEENEQEEKEKEDVE-EEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
           A + +++ +      +K+ +E  +K++E     +       K+ K+  + +K++ +K + 
Sbjct: 229 ADKAQQKADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQI 288

Query: 67  SVKEIEESLKERKE 80
            +K+ +E   + K+
Sbjct: 289 EIKKNDEEALKAKD 302



 Score = 35.7 bits (82), Expect = 0.034
 Identities = 15/72 (20%), Positives = 42/72 (58%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           ++K    + N +++++E  Q+++E +++ +  +    K+ K+  + +K++ +K + +  K
Sbjct: 233 QQKADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKK 292

Query: 70  EIEESLKERKEK 81
             EE+LK +  K
Sbjct: 293 NDEEALKAKDHK 304



 Score = 33.4 bits (76), Expect = 0.17
 Identities = 13/70 (18%), Positives = 31/70 (44%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           + +K +  R    + +E E +E  K   + ++   KK+    K ++K    +   ++   
Sbjct: 189 DNEKGVNFRRDMTDLKERESQEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRD 248

Query: 70  EIEESLKERK 79
           E+ +  +E K
Sbjct: 249 EVRQKQQEAK 258



 Score = 31.9 bits (72), Expect = 0.58
 Identities = 18/74 (24%), Positives = 38/74 (51%), Gaps = 4/74 (5%)

Query: 12  KKRIEKRNKEEEKEENEQEE----KEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
           KK +E   ++ EK  N + +    KE+E  E+ K  ++ K++  KK+    K ++K + +
Sbjct: 180 KKVVEALREDNEKGVNFRRDMTDLKERESQEDAKRAQQLKEELDKKQIDADKAQQKADFA 239

Query: 68  VKEIEESLKERKEK 81
               ++   E ++K
Sbjct: 240 QDNADKQRDEVRQK 253


>gnl|CDD|222160 pfam13476, AAA_23, AAA domain. 
          Length = 204

 Score = 39.5 bits (92), Expect = 0.001
 Identities = 17/62 (27%), Positives = 40/62 (64%)

Query: 20  KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERK 79
            +E  EE + + KEK++  EE  K+ ++ + +K   +K  ++K++++ ++E++E L+E  
Sbjct: 141 GQERLEELKFKRKEKKERLEELEKELEELEDEKDLLEKLLEEKEKKKELEELKEELEELL 200

Query: 80  EK 81
           E+
Sbjct: 201 EE 202



 Score = 36.0 bits (83), Expect = 0.015
 Identities = 18/61 (29%), Positives = 40/61 (65%)

Query: 19  NKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKER 78
             EE K + +++++  E++E+E  + + +K   +K  ++K+KKK+ E+  +E+EE L+E 
Sbjct: 144 RLEELKFKRKEKKERLEELEKELEELEDEKDLLEKLLEEKEKKKELEELKEELEELLEEL 203

Query: 79  K 79
           +
Sbjct: 204 E 204


>gnl|CDD|221952 pfam13166, AAA_13, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           conjugative transfer proteins. This family includes the
           PrrC protein that is thought to be the active component
           of the anticodon nuclease.
          Length = 713

 Score = 40.8 bits (96), Expect = 0.001
 Identities = 27/79 (34%), Positives = 42/79 (53%), Gaps = 5/79 (6%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E + +IE+  KE +K E + E+ E E   E+K ++ +K K K   K  KK  KK + ++ 
Sbjct: 92  EIEAQIEELKKELKKLEEKIEQLEAE--IEKKEEELEKAKNKFLDKAWKKLAKKYDSNLS 149

Query: 70  EIEESLKER---KEKTLTE 85
           E  + L  +   KEK L E
Sbjct: 150 EALKGLNYKKNFKEKLLKE 168



 Score = 38.1 bits (89), Expect = 0.007
 Identities = 20/69 (28%), Positives = 37/69 (53%), Gaps = 5/69 (7%)

Query: 9   GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
           GE+   IE + +E +KE  + EEK      E+   + +KK+++ +K K K   K  ++  
Sbjct: 87  GEENIEIEAQIEELKKELKKLEEKI-----EQLEAEIEKKEEELEKAKNKFLDKAWKKLA 141

Query: 69  KEIEESLKE 77
           K+ + +L E
Sbjct: 142 KKYDSNLSE 150



 Score = 35.4 bits (82), Expect = 0.047
 Identities = 23/84 (27%), Positives = 39/84 (46%), Gaps = 11/84 (13%)

Query: 9   GEKKKRIEKRNKEEEK--EENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
            + +++IE+   E EK  EE E+ + +  D   +K  KK      +  K    KK  +E+
Sbjct: 105 KKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKYDSNLSEALKGLNYKKNFKEK 164

Query: 67  SVKEIE---------ESLKERKEK 81
            +KE++          SL+E K K
Sbjct: 165 LLKELKSVILNASSLLSLEELKAK 188



 Score = 35.0 bits (81), Expect = 0.071
 Identities = 16/76 (21%), Positives = 40/76 (52%), Gaps = 1/76 (1%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKEDVEEE-KNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
            ++I+   KE+ K + +       +++E+    +K+KK  +K     +K+ K+ E  +K 
Sbjct: 381 NEKIDNLKKEKNKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQLEAEIKA 440

Query: 71  IEESLKERKEKTLTED 86
           +E+ +KE +++    +
Sbjct: 441 LEKEIKELEKQLTNIE 456



 Score = 34.2 bits (79), Expect = 0.096
 Identities = 33/151 (21%), Positives = 70/151 (46%), Gaps = 8/151 (5%)

Query: 34  KEDVEEEKNKKKKKKKKKKKKKKKKKK----KKKEEQSVKEIEESLKERKEKTLTEDSQC 89
           +E++E E   ++ KK+ KK ++K ++     +KKEE+  K   + L +  +K   +    
Sbjct: 88  EENIEIEAQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKY--- 144

Query: 90  DTMIPVFKKGVLYKGIYCPSLTNSFREPSLEMSYQRYSHRQRQKSLIIVNLVDMVLKAVL 149
           D+ +    KG+ YK  +   L    +   L  S    S  + +  +  +   +    A+L
Sbjct: 145 DSNLSEALKGLNYKKNFKEKLLKELKSVILNASS-LLSLEELKAKIKTLFSSNKPELALL 203

Query: 150 TVAWISYNEVELSQIDSTKLTWTVYTMMSNL 180
           T++ I ++E+E ++I    +  +    +S L
Sbjct: 204 TLSVIDFDEIEQAEILEKSIIGSSDVPISEL 234


>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
          Length = 211

 Score = 39.7 bits (93), Expect = 0.001
 Identities = 16/75 (21%), Positives = 44/75 (58%)

Query: 9  GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
             ++   K N+ +E+++ ++E+ E E++E+E+  +  ++  + K ++ K +  K ++  
Sbjct: 11 ENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDENNKLKEEN 70

Query: 69 KEIEESLKERKEKTL 83
          K++E  L+  K++ L
Sbjct: 71 KKLENELEALKDRLL 85



 Score = 37.0 bits (86), Expect = 0.007
 Identities = 21/85 (24%), Positives = 44/85 (51%), Gaps = 9/85 (10%)

Query: 10 EKKKRIEKRNKEEEKEENEQEEKEK-EDVEEEKNKKKKKKKKKKKK--------KKKKKK 60
          +  K         ++ EN++E+K K ED+E E+ +K++  +  ++         K +  K
Sbjct: 6  KDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDENNK 65

Query: 61 KKKEEQSVKEIEESLKERKEKTLTE 85
           K+E + ++   E+LK+R  +T+ E
Sbjct: 66 LKEENKKLENELEALKDRLLRTVAE 90



 Score = 35.9 bits (83), Expect = 0.016
 Identities = 16/65 (24%), Positives = 38/65 (58%)

Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
           ++  ++ K EN +E+  KE+  +E++K K++  + ++ +K++  +  EE +  +IEE  
Sbjct: 1  MEKECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELK 60

Query: 76 KERKE 80
           E  +
Sbjct: 61 DENNK 65



 Score = 34.7 bits (80), Expect = 0.038
 Identities = 18/71 (25%), Positives = 39/71 (54%), Gaps = 5/71 (7%)

Query: 16 EKRNKEEEKEENEQE-EKEKEDVEEEKNKKKKKK----KKKKKKKKKKKKKKKEEQSVKE 70
          EK  K+ + E  E++  KE E+ EE+K K++  +    +K++  +  ++  + + + +K+
Sbjct: 2  EKECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKD 61

Query: 71 IEESLKERKEK 81
              LKE  +K
Sbjct: 62 ENNKLKEENKK 72


>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4;
          Provisional.
          Length = 191

 Score = 39.4 bits (92), Expect = 0.001
 Identities = 22/58 (37%), Positives = 32/58 (55%), Gaps = 4/58 (6%)

Query: 18 RNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKK---KKKKKKKEEQSVKEIE 72
          ++  E  ++   ++   EDVE    K  KKKK   +KKK   K KKKKKE++ V E+ 
Sbjct: 25 KSTTESVDKVN-DDIFPEDVEIPSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELA 81



 Score = 33.6 bits (77), Expect = 0.098
 Identities = 18/67 (26%), Positives = 29/67 (43%)

Query: 24 KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTL 83
          K   E  +K  +D+  E  +   KK  KKKK   +KKK  ++   K+ E+          
Sbjct: 25 KSTTESVDKVNDDIFPEDVEIPSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEE 84

Query: 84 TEDSQCD 90
            DS+ +
Sbjct: 85 LSDSEEN 91



 Score = 29.8 bits (67), Expect = 1.4
 Identities = 9/57 (15%), Positives = 26/57 (45%)

Query: 11  KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
             ++ +   K ++K++ ++E  E    E   +++ ++  KK   +  K +    E +
Sbjct: 57  TPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEENDKKVDYELPKVQNTAAEVN 113



 Score = 28.2 bits (63), Expect = 4.6
 Identities = 15/64 (23%), Positives = 29/64 (45%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
           +K+     +K  +  K+ K+E++E  E   +E  D EE +   KK   +  K +    + 
Sbjct: 53  KKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEENDKKVDYELPKVQNTAAEV 112

Query: 62  KKEE 65
             E+
Sbjct: 113 NHED 116


>gnl|CDD|218215 pfam04696, Pinin_SDK_memA, pinin/SDK/memA/ protein conserved
           region.  Members of this family have very varied
           localisations within the eukaryotic cell. pinin is known
           to localise at the desmosomes and is implicated in
           anchoring intermediate filaments to the desmosomal
           plaque. SDK2/3 is a dynamically localised nuclear
           protein thought to be involved in modulation of
           alternative pre-mRNA splicing. memA is a tumour marker
           preferentially expressed in human melanoma cell lines. A
           common feature of the members of this family is that
           they may all participate in regulating protein-protein
           interactions.
          Length = 131

 Score = 38.2 bits (89), Expect = 0.001
 Identities = 17/74 (22%), Positives = 43/74 (58%)

Query: 3   KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
            R     +++  IE++ +E+EK+E E+  KEK ++ EE+ +K+ + +K ++K + +K ++
Sbjct: 28  SRLTSKEKRRAEIEQKLEEQEKQEREELRKEKRELFEERRRKQLELRKLEQKMEDEKLQE 87

Query: 63  KEEQSVKEIEESLK 76
              +    +   ++
Sbjct: 88  TWHEHNLALANFIR 101



 Score = 34.8 bits (80), Expect = 0.022
 Identities = 17/74 (22%), Positives = 43/74 (58%), Gaps = 1/74 (1%)

Query: 3  KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
          K          + ++R + E+K E EQE++E+E++ +EK +  +++++K+ + +K ++K 
Sbjct: 22 KFSQEESRLTSKEKRRAEIEQKLE-EQEKQEREELRKEKRELFEERRRKQLELRKLEQKM 80

Query: 63 KEEQSVKEIEESLK 76
          ++E+  +   E   
Sbjct: 81 EDEKLQETWHEHNL 94



 Score = 33.2 bits (76), Expect = 0.063
 Identities = 17/74 (22%), Positives = 42/74 (56%)

Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
          +K  + E R   +EK   E E+K +E  ++E+ + +K+K++  +++++K+ + ++ +   
Sbjct: 21 QKFSQEESRLTSKEKRRAEIEQKLEEQEKQEREELRKEKRELFEERRRKQLELRKLEQKM 80

Query: 70 EIEESLKERKEKTL 83
          E E+  +   E  L
Sbjct: 81 EDEKLQETWHEHNL 94


>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex.  This
          entry is characterized by proteins with alternating
          conserved and low-complexity regions. Bud13 together
          with Snu17p and a newly identified factor,
          Pml1p/Ylr016c, form a novel trimeric complex. called
          The RES complex, pre-mRNA retention and splicing
          complex. Subunits of this complex are not essential for
          viability of yeasts but they are required for efficient
          splicing in vitro and in vivo. Furthermore,
          inactivation of this complex causes pre-mRNA leakage
          from the nucleus. Bud13 contains a unique,
          phylogenetically conserved C-terminal region of unknown
          function.
          Length = 141

 Score = 38.4 bits (90), Expect = 0.001
 Identities = 22/79 (27%), Positives = 48/79 (60%), Gaps = 2/79 (2%)

Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKK--KKKKKKKKKKKKKKKKKKKEEQS 67
          +K  RI    ++ E++E E+EEKE+++ +E++  K   +K++++K+ ++ +K K K    
Sbjct: 6  DKSGRIIDIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPLAR 65

Query: 68 VKEIEESLKERKEKTLTED 86
            + E+  +E KE+   +D
Sbjct: 66 YADDEDYDEELKEQERWDD 84


>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
          Length = 135

 Score = 38.5 bits (90), Expect = 0.001
 Identities = 21/35 (60%), Positives = 27/35 (77%)

Query: 29  QEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
           +E+K  + V +EK K+KK+KKKKKKKKKKK  KK 
Sbjct: 96  EEQKIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKA 130



 Score = 37.7 bits (88), Expect = 0.002
 Identities = 26/52 (50%), Positives = 34/52 (65%), Gaps = 2/52 (3%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
           + RI +    EE++  +Q  KEK   +++K KKKKKKKKKKK  KK  KKKK
Sbjct: 86  RDRIVRAFLVEEQKIVKQVLKEKA--KQKKQKKKKKKKKKKKTSKKAAKKKK 135



 Score = 37.0 bits (86), Expect = 0.003
 Identities = 20/37 (54%), Positives = 25/37 (67%), Gaps = 4/37 (10%)

Query: 37  VEEEK----NKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           VEE+K      K+K K+KK+KKKKKKKKKKK  +   
Sbjct: 95  VEEQKIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKAA 131



 Score = 36.6 bits (85), Expect = 0.005
 Identities = 18/29 (62%), Positives = 21/29 (72%)

Query: 41  KNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           K K K+KK+KKKKKKKKKKK  K+    K
Sbjct: 106 KEKAKQKKQKKKKKKKKKKKTSKKAAKKK 134



 Score = 35.0 bits (81), Expect = 0.015
 Identities = 18/37 (48%), Positives = 28/37 (75%)

Query: 33  EKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E++ V++   +K K+KK+KKKKKKKKKKK  ++ + K
Sbjct: 97  EQKIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKK 133



 Score = 35.0 bits (81), Expect = 0.019
 Identities = 15/38 (39%), Positives = 25/38 (65%)

Query: 39  EEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLK 76
           EE+   K+  K+K K+KK+KKKKKK+++     + + K
Sbjct: 96  EEQKIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKK 133



 Score = 31.6 bits (72), Expect = 0.25
 Identities = 16/40 (40%), Positives = 29/40 (72%)

Query: 40  EKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERK 79
           E+ K  K+  K+K K+KK+KKKKK+++  K  +++ K++K
Sbjct: 96  EEQKIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135



 Score = 28.1 bits (63), Expect = 4.2
 Identities = 13/34 (38%), Positives = 23/34 (67%)

Query: 49  KKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKT 82
           K+  K+K K+KK+KK+++  K+ + S K  K+K 
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135



 Score = 27.3 bits (61), Expect = 7.1
 Identities = 13/41 (31%), Positives = 25/41 (60%)

Query: 41  KNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
             ++K  K+  K+K K+KK+KKK+++  K+       +K+K
Sbjct: 95  VEEQKIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135


>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase;
          Provisional.
          Length = 330

 Score = 39.8 bits (94), Expect = 0.001
 Identities = 15/44 (34%), Positives = 27/44 (61%), Gaps = 1/44 (2%)

Query: 27 NEQEEKEKEDVEEEK-NKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
          NEQ ++ KE+++    +KK+ K   K +KKK+K++ K  +   K
Sbjct: 46 NEQYKEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAKSK 89



 Score = 39.0 bits (92), Expect = 0.002
 Identities = 19/62 (30%), Positives = 32/62 (51%), Gaps = 3/62 (4%)

Query: 2  RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
          ++++   GE    +   N E+ KE  E+ +    D +E K   K +KKK+K++ K  K K
Sbjct: 31 QRKKSQKGEL--EVTNLN-EQYKEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAK 87

Query: 62 KK 63
           K
Sbjct: 88 SK 89



 Score = 34.4 bits (80), Expect = 0.076
 Identities = 13/54 (24%), Positives = 25/54 (46%), Gaps = 8/54 (14%)

Query: 9  GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
           E+ K +++  K    ++ E +   K         +KKK+K++ K  K K K +
Sbjct: 46 NEQYKEMKEELKAALLDKKELKAWHKA--------QKKKEKQEAKAAKAKSKPR 91


>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62. 
          Length = 217

 Score = 39.4 bits (92), Expect = 0.001
 Identities = 15/60 (25%), Positives = 32/60 (53%)

Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
          + K   +K N E   + N Q++  ++     K +  ++ KK   ++KK++KKK +++ V 
Sbjct: 20 KYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKKKKVP 79



 Score = 38.6 bits (90), Expect = 0.002
 Identities = 17/50 (34%), Positives = 30/50 (60%)

Query: 7  GAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKK 56
          G  E   +I  ++K  EK +   + +  E V++  +++KK++KKK KKKK
Sbjct: 28 GNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKKKK 77



 Score = 37.9 bits (88), Expect = 0.004
 Identities = 18/58 (31%), Positives = 31/58 (53%), Gaps = 1/58 (1%)

Query: 5  RGGAGEKKKRIEKRNKEEEKEE-NEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
          +  A + K   E  NK   +++  E+ +   +    E+ KK   ++KK++KKK KKKK
Sbjct: 20 KYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKKKK 77



 Score = 33.2 bits (76), Expect = 0.13
 Identities = 11/57 (19%), Positives = 20/57 (35%)

Query: 32 KEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQ 88
           E E  +  K+K   +   K   + K  +K K     +  E   K   ++   E  +
Sbjct: 16 LESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKK 72



 Score = 31.7 bits (72), Expect = 0.44
 Identities = 16/77 (20%), Positives = 33/77 (42%)

Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
          K+ +     E+ K   ++   E  +    ++K  +K K   K +  ++ KK   Q  KE 
Sbjct: 10 KRVVRALESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEE 69

Query: 72 EESLKERKEKTLTEDSQ 88
          ++  K++K       +Q
Sbjct: 70 KKKPKKKKVPLQVNPAQ 86



 Score = 29.0 bits (65), Expect = 3.2
 Identities = 13/24 (54%), Positives = 14/24 (58%)

Query: 39  EEKNKKKKKKKKKKKKKKKKKKKK 62
             K  K   KK KK KKKKKKK+ 
Sbjct: 194 HYKGDKSSAKKDKKSKKKKKKKRS 217



 Score = 28.2 bits (63), Expect = 5.0
 Identities = 13/24 (54%), Positives = 14/24 (58%)

Query: 42  NKKKKKKKKKKKKKKKKKKKKKEE 65
           + K  K   KK KK KKKKKKK  
Sbjct: 194 HYKGDKSSAKKDKKSKKKKKKKRS 217



 Score = 27.5 bits (61), Expect = 9.9
 Identities = 12/24 (50%), Positives = 15/24 (62%)

Query: 38  EEEKNKKKKKKKKKKKKKKKKKKK 61
             + +K   KK KK KKKKKKK+ 
Sbjct: 194 HYKGDKSSAKKDKKSKKKKKKKRS 217


>gnl|CDD|219913 pfam08576, DUF1764, Eukaryotic protein of unknown function
          (DUF1764).  This is a family of eukaryotic proteins of
          unknown function. This family contains many
          hypothetical proteins.
          Length = 98

 Score = 37.4 bits (87), Expect = 0.001
 Identities = 22/72 (30%), Positives = 38/72 (52%), Gaps = 13/72 (18%)

Query: 22 EEKEENEQEEKEKEDV-------EEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES 74
          EEK+  + ++++ +D+       +++K +  K  + K  KK +KK KKK+E    E  E 
Sbjct: 2  EEKKNEKTDKRDIDDIFSNIKKRKKKKKRTAKTARPKATKKGQKKDKKKDEF--PEFPEE 59

Query: 75 LKERKEKTLTED 86
           K R+    TED
Sbjct: 60 SKRRR----TED 67



 Score = 32.4 bits (74), Expect = 0.080
 Identities = 18/70 (25%), Positives = 30/70 (42%), Gaps = 10/70 (14%)

Query: 17 KRNKEEEKEENE----------QEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
          K  K+ EK +            + +K+K+   +    K  KK +KK KKK +  +  EE 
Sbjct: 1  KEEKKNEKTDKRDIDDIFSNIKKRKKKKKRTAKTARPKATKKGQKKDKKKDEFPEFPEES 60

Query: 67 SVKEIEESLK 76
            +  E+ L 
Sbjct: 61 KRRRTEDGLP 70


>gnl|CDD|218435 pfam05104, Rib_recp_KP_reg, Ribosome receptor lysine/proline rich
          region.  This highly conserved region is found towards
          the C-terminus of the transmembrane domain. The
          function is unclear.
          Length = 151

 Score = 38.4 bits (89), Expect = 0.001
 Identities = 22/53 (41%), Positives = 31/53 (58%), Gaps = 2/53 (3%)

Query: 24 KEENEQEEKEKEDVEEEKNKKKKKKKKKKKK--KKKKKKKKKEEQSVKEIEES 74
          KE + +E   K+  E  K + +K  KKKK+K  +KK K KKKEE+   +I E 
Sbjct: 2  KETSYEEALAKQRKESGKTQSQKSDKKKKEKVSEKKGKSKKKEEKPNGKIPEH 54



 Score = 35.0 bits (80), Expect = 0.021
 Identities = 24/77 (31%), Positives = 39/77 (50%), Gaps = 2/77 (2%)

Query: 20 KEEEKEEN-EQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKER 78
          KE   EE   ++ KE    + +K+ KKKK+K  +KK K KKK++K    + E  E  +E 
Sbjct: 2  KETSYEEALAKQRKESGKTQSQKSDKKKKEKVSEKKGKSKKKEEKPNGKIPE-HEPNQEV 60

Query: 79 KEKTLTEDSQCDTMIPV 95
           E  +  + +    + V
Sbjct: 61 TEVEVIIEKEPVPAVAV 77



 Score = 30.7 bits (69), Expect = 0.60
 Identities = 14/39 (35%), Positives = 25/39 (64%)

Query: 17 KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKK 55
          K+ KE  K ++++ +K+K++   EK  K KKK++K   K
Sbjct: 12 KQRKESGKTQSQKSDKKKKEKVSEKKGKSKKKEEKPNGK 50


>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207).  This
           family is found in eukaryotes; it has several conserved
           tryptophan residues. The function is not known.
          Length = 261

 Score = 39.3 bits (92), Expect = 0.001
 Identities = 16/86 (18%), Positives = 40/86 (46%), Gaps = 9/86 (10%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKK---------KKKKKKKKKKKKKKKKK 62
           K+   ++  ++  EE +++E+EKE  E E  ++          ++K ++  K++  K KK
Sbjct: 96  KQAQRQKKLQKLLEEKQKQEREKEREEAELRQRLAKEKYEEWCRQKAQQAAKQRTPKHKK 155

Query: 63  KEEQSVKEIEESLKERKEKTLTEDSQ 88
           +  +S         + +     E+++
Sbjct: 156 EAAESASSSLSGSAKPERNVSQEEAK 181



 Score = 37.0 bits (86), Expect = 0.011
 Identities = 21/77 (27%), Positives = 44/77 (57%), Gaps = 13/77 (16%)

Query: 10  EKKKRIEKRNKEEEK------------EENEQEEKEKEDVEE-EKNKKKKKKKKKKKKKK 56
             K+R  K  KE  +            E N  +E+ K+ ++E E  K K++++K++++++
Sbjct: 145 AAKQRTPKHKKEAAESASSSLSGSAKPERNVSQEEAKKRLQEWELKKLKQQQQKREEERR 204

Query: 57  KKKKKKKEEQSVKEIEE 73
           K++KK++EE+  K+  E
Sbjct: 205 KQRKKQQEEEERKQKAE 221



 Score = 36.2 bits (84), Expect = 0.017
 Identities = 14/54 (25%), Positives = 38/54 (70%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
            K     ++EE K+  ++ E +K   +++K +++++K++KK+++++++K+K EE
Sbjct: 169 AKPERNVSQEEAKKRLQEWELKKLKQQQQKREEERRKQRKKQQEEEERKQKAEE 222



 Score = 32.0 bits (73), Expect = 0.40
 Identities = 13/50 (26%), Positives = 38/50 (76%), Gaps = 3/50 (6%)

Query: 15  IEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
            +KR +E E ++ +Q+++++   EEE+ K++KK+++++++K+K ++  ++
Sbjct: 180 AKKRLQEWELKKLKQQQQKR---EEERRKQRKKQQEEEERKQKAEEAWQK 226



 Score = 31.6 bits (72), Expect = 0.55
 Identities = 18/106 (16%), Positives = 54/106 (50%), Gaps = 26/106 (24%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKED-----------------------VE 38
            K R  A  +++  +++ +E  +++ +Q  K++                         V 
Sbjct: 117 EKEREEAELRQRLAKEKYEEWCRQKAQQAAKQRTPKHKKEAAESASSSLSGSAKPERNVS 176

Query: 39  EEKNKKKKKK---KKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
           +E+ KK+ ++   KK K++++K+++++++++  ++ EE  K++ E+
Sbjct: 177 QEEAKKRLQEWELKKLKQQQQKREEERRKQRKKQQEEEERKQKAEE 222



 Score = 29.3 bits (66), Expect = 2.5
 Identities = 14/49 (28%), Positives = 31/49 (63%)

Query: 14  RIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
            ++K  ++++K E E+ ++ K+  EEE+ K+K ++  +K  K  K++ K
Sbjct: 188 ELKKLKQQQQKREEERRKQRKKQQEEEERKQKAEEAWQKWMKNVKQRPK 236



 Score = 29.3 bits (66), Expect = 2.8
 Identities = 16/68 (23%), Positives = 35/68 (51%)

Query: 14  RIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
             +        +E + E + +E  E   + K+ +++KK +K  ++K+K++ E+  +E E 
Sbjct: 66  LSDSSTYSRSLKEVKLERQAQEAYENWLSAKQAQRQKKLQKLLEEKQKQEREKEREEAEL 125

Query: 74  SLKERKEK 81
             +  KEK
Sbjct: 126 RQRLAKEK 133



 Score = 28.5 bits (64), Expect = 4.7
 Identities = 16/58 (27%), Positives = 37/58 (63%), Gaps = 4/58 (6%)

Query: 10  EKKKRIE----KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
           E KKR++    K+ K+++++  E+  K+++  +EE+ +K+K ++  +K  K  K++ K
Sbjct: 179 EAKKRLQEWELKKLKQQQQKREEERRKQRKKQQEEEERKQKAEEAWQKWMKNVKQRPK 236


>gnl|CDD|220839 pfam10661, EssA, WXG100 protein secretion system (Wss), protein
          EssA.  The WXG100 protein secretion system (Wss) is
          responsible for the secretion of WXG100 proteins
          (pfam06013) such as ESAT-6 and CFP-10 in Mycobacterium
          tuberculosis or EsxA and EsxB in Staphylococcus aureus.
          In S. aureus, the Wss seems to be encoded by a locus of
          eight CDS, called ess (eSAT-6 secretion system). This
          locus encodes, amongst several other proteins, EssA, a
          protein predicted to possess one transmembrane domain.
          Due to its predicted membrane location and its absolute
          requirement for WXG100 protein secretion, it has been
          speculated that EssA could form a secretion apparatus
          in conjunction with the polytopic membrane protein
          EsaA, YukC (pfam10140) and YukAB, which is a
          membrane-bound ATPase containing Ftsk/SpoIIIE domains
          (pfam01580) called EssC in S. aureus and Snm1/Snm2 in
          Mycobacterium tuberculosis. Proteins homologous to
          EssA, YukC, EsaA and YukD seem absent from
          mycobacteria.
          Length = 145

 Score = 38.3 bits (89), Expect = 0.002
 Identities = 20/74 (27%), Positives = 41/74 (55%), Gaps = 9/74 (12%)

Query: 11 KKKRIEKRNKEEEKEENEQEEKEK-------EDVEEEKNKKKKKKKKKKKKKKKK--KKK 61
          K  R++K ++E+ +++  + E +K        + EEE NKKK  ++K+ +  +     + 
Sbjct: 18 KVDRLQKTDQEKNEKKLRETELDKLGIELFTTETEEEINKKKNAEQKEMEDIENSLFSED 77

Query: 62 KKEEQSVKEIEESL 75
          K+   +VKE ++SL
Sbjct: 78 KEGNVAVKETKDSL 91



 Score = 31.4 bits (71), Expect = 0.30
 Identities = 18/86 (20%), Positives = 42/86 (48%), Gaps = 9/86 (10%)

Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKN-----KKKKKKKKKKKKKKKKKKKKKE 64
          E   +++ +    +K + E+ EK+  + E +K        + +++  KKK  ++K+ +  
Sbjct: 10 EDDGKMQFKVDRLQKTDQEKNEKKLRETELDKLGIELFTTETEEEINKKKNAEQKEMEDI 69

Query: 65 EQSV----KEIEESLKERKEKTLTED 86
          E S+    KE   ++KE K+   + +
Sbjct: 70 ENSLFSEDKEGNVAVKETKDSLFSSE 95


>gnl|CDD|221654 pfam12589, WBS_methylT, Methyltransferase involved in
          Williams-Beuren syndrome.  This domain family is found
          in eukaryotes, and is typically between 72 and 83 amino
          acids in length. The family is found in association
          with pfam08241. This family is made up of
          S-adenosylmethionine-dependent methyltransferases. The
          proteins are deleted in Williams-Beuren syndrome (WBS),
          a complex developmental disorder with multisystemic
          manifestations including supravalvular aortic stenosis
          (SVAS) and a specific cognitive phenotype.
          Length = 85

 Score = 36.9 bits (86), Expect = 0.002
 Identities = 18/51 (35%), Positives = 28/51 (54%), Gaps = 2/51 (3%)

Query: 18 RNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKK--KKKKKEEQ 66
              EE EE++++    +     +  ++KKKKKKK KKK K    +KKE+ 
Sbjct: 14 NGLGEEGEEDDEQIDASKVRRISQRNRRKKKKKKKLKKKSKEWILRKKEQM 64


>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
          Length = 651

 Score = 40.0 bits (94), Expect = 0.002
 Identities = 19/43 (44%), Positives = 35/43 (81%)

Query: 20  KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
           K ++KEE ++E++EKE ++E+K  +K KK+++KKKK+ +K +K
Sbjct: 546 KLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588



 Score = 39.2 bits (92), Expect = 0.003
 Identities = 24/83 (28%), Positives = 44/83 (53%), Gaps = 12/83 (14%)

Query: 19  NKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKER 78
           +KEE + E E++E  K        ++K+ +K KK+++KKKK+ +K E++     E  K +
Sbjct: 549 DKEELQREKEEKEALK--------EQKRLRKLKKQEEKKKKELEKLEKAKIPPAEFFKRQ 600

Query: 79  KEKTLTEDSQCDTMIPV-FKKGV 100
           ++K    D   +T +P     G 
Sbjct: 601 EDKYSAFD---ETGLPTHDADGE 620



 Score = 36.2 bits (84), Expect = 0.027
 Identities = 16/40 (40%), Positives = 30/40 (75%)

Query: 34  KEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
           K D +EE  ++K++K+  K++K+ +K KK+EE+  KE+E+
Sbjct: 546 KLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEK 585



 Score = 35.4 bits (82), Expect = 0.051
 Identities = 13/43 (30%), Positives = 31/43 (72%)

Query: 37  VEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERK 79
            ++E+ +++K++K+  K++K+ +K KK+E+  K+  E L++ K
Sbjct: 548 DDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAK 590



 Score = 35.0 bits (81), Expect = 0.065
 Identities = 16/49 (32%), Positives = 32/49 (65%), Gaps = 4/49 (8%)

Query: 22  EEKEENE----QEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
           E+K +       ++KE+   E+E+ +  K++K+ +K KK+++KKKKE +
Sbjct: 536 EDKPDGPSVWKLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELE 584



 Score = 34.2 bits (79), Expect = 0.11
 Identities = 12/48 (25%), Positives = 30/48 (62%)

Query: 39  EEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTED 86
           ++K + +++K++K+  K++K+ +K ++Q  K+ +E  K  K K    +
Sbjct: 548 DDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAKIPPAE 595



 Score = 33.8 bits (78), Expect = 0.14
 Identities = 21/79 (26%), Positives = 39/79 (49%), Gaps = 14/79 (17%)

Query: 17  KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKK-------KKKKKKKKKKEEQSV- 68
           +R KEE++   EQ+   K   +EEK KK+ +K +K K       K+++ K    +E  + 
Sbjct: 554 QREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAKIPPAEFFKRQEDKYSAFDETGLP 613

Query: 69  ------KEIEESLKERKEK 81
                 +EI +  +++  K
Sbjct: 614 THDADGEEISKKERKKLSK 632



 Score = 32.3 bits (74), Expect = 0.38
 Identities = 22/93 (23%), Positives = 39/93 (41%), Gaps = 23/93 (24%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNK-------KKKKKKKKK------------ 52
           +K   K  K   K + ++E+K+KE  + EK K       K+++ K               
Sbjct: 559 EKEALKEQKRLRKLKKQEEKKKKELEKLEKAKIPPAEFFKRQEDKYSAFDETGLPTHDAD 618

Query: 53  ----KKKKKKKKKKKEEQSVKEIEESLKERKEK 81
                KK++KK  K+ ++  K  EE L +  + 
Sbjct: 619 GEEISKKERKKLSKEYDKQAKLHEEYLAKGGKS 651



 Score = 29.6 bits (67), Expect = 2.9
 Identities = 10/40 (25%), Positives = 26/40 (65%)

Query: 41  KNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
           K   K++ +++K++K+  K++K+  +  K+ E+  KE ++
Sbjct: 546 KLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEK 585


>gnl|CDD|219564 pfam07771, TSGP1, Tick salivary peptide group 1.  This contains a
           group of peptides derived from a salivary gland cDNA
           library of the tick Ixodes scapularis. Also present are
           peptides from a related tick species, Ixodes ricinus.
           They are characterized by a putative signal peptide
           indicative of secretion and conserved cysteine residues.
          Length = 120

 Score = 37.6 bits (87), Expect = 0.002
 Identities = 19/48 (39%), Positives = 25/48 (52%)

Query: 18  RNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
           +N E     +  E    +D   E  +K KKKKKK KK KK KK  K++
Sbjct: 73  QNGECHLTTSSGEPSHPDDHPPEPTEKPKKKKKKSKKTKKPKKSSKKD 120



 Score = 31.4 bits (71), Expect = 0.25
 Identities = 13/20 (65%), Positives = 14/20 (70%)

Query: 47  KKKKKKKKKKKKKKKKKEEQ 66
           K KKKKKK KK KK KK  +
Sbjct: 99  KPKKKKKKSKKTKKPKKSSK 118



 Score = 31.0 bits (70), Expect = 0.30
 Identities = 13/21 (61%), Positives = 15/21 (71%)

Query: 46  KKKKKKKKKKKKKKKKKKEEQ 66
           K KKKKKK KK KK KK  ++
Sbjct: 99  KPKKKKKKSKKTKKPKKSSKK 119



 Score = 29.5 bits (66), Expect = 1.2
 Identities = 13/22 (59%), Positives = 16/22 (72%)

Query: 49  KKKKKKKKKKKKKKKEEQSVKE 70
           K KKKKKK KK KK ++ S K+
Sbjct: 99  KPKKKKKKSKKTKKPKKSSKKD 120



 Score = 28.3 bits (63), Expect = 2.9
 Identities = 12/20 (60%), Positives = 13/20 (65%)

Query: 48  KKKKKKKKKKKKKKKKEEQS 67
           K KKKKKK KK KK K+   
Sbjct: 99  KPKKKKKKSKKTKKPKKSSK 118


>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM).  This
           family consists of several Plasmodium falciparum SPAM
           (secreted polymorphic antigen associated with
           merozoites) proteins. Variation among SPAM alleles is
           the result of deletions and amino acid substitutions in
           non-repetitive sequences within and flanking the alanine
           heptad-repeat domain. Heptad repeats in which the a and
           d position contain hydrophobic residues generate
           amphipathic alpha-helices which give rise to helical
           bundles or coiled-coil structures in proteins. SPAM is
           an example of a P. falciparum antigen in which a
           repetitive sequence has features characteristic of a
           well-defined structural element.
          Length = 164

 Score = 38.3 bits (89), Expect = 0.002
 Identities = 19/82 (23%), Positives = 47/82 (57%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E  K  ++ + EEE+EE+E+E +E ED+E+E+   + ++++++ ++     K  E++++ 
Sbjct: 41  EDVKDEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDLKDIEKKNIN 100

Query: 70  EIEESLKERKEKTLTEDSQCDT 91
           +I  S ++   + L   +    
Sbjct: 101 DIFNSTQDDNAQNLISKNYKKN 122



 Score = 36.8 bits (85), Expect = 0.007
 Identities = 17/74 (22%), Positives = 33/74 (44%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E+ +   +  ++ E EE   E++E+E+ +EE N   K  +KK         +    Q++ 
Sbjct: 56  EEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDLKDIEKKNINDIFNSTQDDNAQNLI 115

Query: 70  EIEESLKERKEKTL 83
                  E+ +KT 
Sbjct: 116 SKNYKKNEKSKKTA 129



 Score = 33.7 bits (77), Expect = 0.059
 Identities = 20/88 (22%), Positives = 37/88 (42%), Gaps = 6/88 (6%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEE------KNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E+   EEE EE E  E E+E VE+E      +      K  +KK         +++ +  
Sbjct: 54  EEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDLKDIEKKNINDIFNSTQDDNAQN 113

Query: 70  EIEESLKERKEKTLTEDSQCDTMIPVFK 97
            I ++ K+ ++   T +    T+  +  
Sbjct: 114 LISKNYKKNEKSKKTAEDIVKTLFGLLN 141



 Score = 31.8 bits (72), Expect = 0.33
 Identities = 15/65 (23%), Positives = 27/65 (41%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
           E+ ++E+  +  + E+K   D+         +    K  KK +K KK  E  VK +   L
Sbjct: 81  EEEDEEDNVDLKDIEKKNINDIFNSTQDDNAQNLISKNYKKNEKSKKTAEDIVKTLFGLL 140

Query: 76  KERKE 80
               +
Sbjct: 141 NGNNQ 145



 Score = 27.1 bits (60), Expect = 9.4
 Identities = 21/85 (24%), Positives = 47/85 (55%), Gaps = 12/85 (14%)

Query: 16 EKRNKEEEKEENEQEE------------KEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
          EK+ +EE+K+EN  E             KE EDV++EK +  ++++++ +++ ++ +  +
Sbjct: 10 EKKPEEEKKDENLLEHVKITSWDKEDIIKENEDVKDEKQEDDEEEEEEDEEEIEEPEDIE 69

Query: 64 EEQSVKEIEESLKERKEKTLTEDSQ 88
          +E+ + E EE  +E +E  +     
Sbjct: 70 DEEEIVEDEEEEEEDEEDNVDLKDI 94


>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
          Length = 246

 Score = 38.9 bits (91), Expect = 0.002
 Identities = 16/52 (30%), Positives = 26/52 (50%)

Query: 22  EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
           E + E   E  ++  V   K + K K K K K K  KK +++ ++ VK +E 
Sbjct: 75  EPEPEPIPEPPKEAPVVIPKPEPKPKPKPKPKPKPVKKVEEQPKREVKPVEP 126



 Score = 35.8 bits (83), Expect = 0.021
 Identities = 16/58 (27%), Positives = 24/58 (41%), Gaps = 1/58 (1%)

Query: 21  EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKER 78
           E  KE      K  E   + K K K K  KK +++ K++ K  E +     E +   R
Sbjct: 83  EPPKEAPVVIPKP-EPKPKPKPKPKPKPVKKVEEQPKREVKPVEPRPASPFENTAPAR 139



 Score = 30.8 bits (70), Expect = 0.83
 Identities = 18/53 (33%), Positives = 22/53 (41%), Gaps = 2/53 (3%)

Query: 20  KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIE 72
            E E E     E  KE        + K K K K K K    KK EEQ  +E++
Sbjct: 72  VEPEPEPEPIPEPPKEAPVVIPKPEPKPKPKPKPKPKP--VKKVEEQPKREVK 122


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 40.1 bits (94), Expect = 0.002
 Identities = 15/72 (20%), Positives = 38/72 (52%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E+ + +E   +E E+   E +EK +   EE + ++   ++ ++   + ++ K++ E+ + 
Sbjct: 309 ERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLS 368

Query: 70  EIEESLKERKEK 81
            + E L+E  E 
Sbjct: 369 ALLEELEELFEA 380



 Score = 39.7 bits (93), Expect = 0.002
 Identities = 18/72 (25%), Positives = 38/72 (52%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E K++IE   +E E+ E   EE E+   E E+ K++ ++K     ++ ++  +   + + 
Sbjct: 327 ELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELA 386

Query: 70  EIEESLKERKEK 81
           E+E  L E + +
Sbjct: 387 ELEAELAEIRNE 398



 Score = 38.5 bits (90), Expect = 0.005
 Identities = 22/85 (25%), Positives = 50/85 (58%), Gaps = 1/85 (1%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E ++ +E+  KE E+ + E EE E E  E E   K+ +++K++ +++ ++ + +  +  +
Sbjct: 846 ELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKE 905

Query: 70  EIEESLKERKEKTLTEDSQCDTMIP 94
           EIE+ L+ER E+   +  + +  +P
Sbjct: 906 EIEK-LRERLEELEAKLERLEVELP 929



 Score = 38.5 bits (90), Expect = 0.006
 Identities = 29/151 (19%), Positives = 54/151 (35%), Gaps = 33/151 (21%)

Query: 7   GAGEKKKRIEK-RNKEEEKEENEQE--------EKEKEDVEEEKNKKKKKKKKKKKKKK- 56
           G  + K+R E+   K E  EEN +         EK+ E +E +  K ++ ++ K + ++ 
Sbjct: 166 GVSKYKERKEEAERKLERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELREL 225

Query: 57  -------KKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDTMIPVFKKGVLYKGIYCPS 109
                  K K+ +KE + ++E    L+E  E+   E  + +  I   K            
Sbjct: 226 ELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKS----------- 274

Query: 110 LTNSFREPSLEMSYQRYSHRQRQKSLIIVNL 140
                    L    +       +    I  L
Sbjct: 275 -----ELEELREELEELQEELLELKEEIEEL 300



 Score = 37.4 bits (87), Expect = 0.013
 Identities = 15/75 (20%), Positives = 41/75 (54%), Gaps = 3/75 (4%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKE---DVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
           E ++ +E+  +E  + + E EE E E     E  +  + + ++ +++ ++ K+K +  ++
Sbjct: 278 ELREELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKE 337

Query: 67  SVKEIEESLKERKEK 81
            ++E E  L+E ++ 
Sbjct: 338 ELEERETLLEELEQL 352



 Score = 35.1 bits (81), Expect = 0.061
 Identities = 20/90 (22%), Positives = 49/90 (54%), Gaps = 4/90 (4%)

Query: 1   MRKRRGGAGEKKKRIEKR--NKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKK 58
           +R+R      + + +E+R    +E+ E  ++E +E+E + EE  +   + ++ K++ ++K
Sbjct: 307 LRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEK 366

Query: 59  KKKKKEEQSVKEIEESLKERKEKTLTEDSQ 88
                EE  ++E+ E+L+E   +   E ++
Sbjct: 367 LSALLEE--LEELFEALREELAELEAELAE 394



 Score = 34.7 bits (80), Expect = 0.074
 Identities = 22/83 (26%), Positives = 48/83 (57%), Gaps = 2/83 (2%)

Query: 10  EKKKRIEKRNKEEEKEENE--QEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
           E K+ +E+   E+E+ E+E  + E+EKE++EEE  + + +  + K++ +K +++ +E ++
Sbjct: 860 ELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEELEA 919

Query: 68  VKEIEESLKERKEKTLTEDSQCD 90
             E  E      E+ L E+ +  
Sbjct: 920 KLERLEVELPELEEELEEEYEDT 942



 Score = 34.3 bits (79), Expect = 0.10
 Identities = 21/80 (26%), Positives = 54/80 (67%), Gaps = 1/80 (1%)

Query: 3   KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKK-KKKKKKKKKKK 61
           +RR  A E++    ++ +E  ++E E+ E+E E++EE+ ++ +++ ++ +K+ ++ K++ 
Sbjct: 806 ERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEEL 865

Query: 62  KKEEQSVKEIEESLKERKEK 81
           ++ E   +E+E+ LKE +E+
Sbjct: 866 EELEAEKEELEDELKELEEE 885



 Score = 33.5 bits (77), Expect = 0.20
 Identities = 16/85 (18%), Positives = 40/85 (47%), Gaps = 6/85 (7%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKE--DVEEEKNKKKKKKKKKKKKKK----KKKKKKK 63
             ++ + +   E  +  NE EE ++E   +EE   +  ++ +  K++ K    + ++ + 
Sbjct: 380 ALREELAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQT 439

Query: 64  EEQSVKEIEESLKERKEKTLTEDSQ 88
           E + + E  E L+E+ E+      +
Sbjct: 440 ELEELNEELEELEEQLEELRDRLKE 464



 Score = 33.5 bits (77), Expect = 0.20
 Identities = 18/79 (22%), Positives = 43/79 (54%), Gaps = 7/79 (8%)

Query: 10  EKKKRIEKRNKEEEK--EENEQEEKEKEDVEEEKNK---KKKKKKKKKKKKKK--KKKKK 62
           E K  +E+  +E E+  EE  + ++E E++E E +    + ++ + + ++ ++  ++ K+
Sbjct: 271 ELKSELEELREELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKE 330

Query: 63  KEEQSVKEIEESLKERKEK 81
           K E   +E+EE     +E 
Sbjct: 331 KIEALKEELEERETLLEEL 349



 Score = 33.1 bits (76), Expect = 0.22
 Identities = 16/86 (18%), Positives = 37/86 (43%), Gaps = 1/86 (1%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKK-KKKKKKKKK 60
            K      E      ++   E +E  E+ E++   + EE  +  +  +++  + + +  +
Sbjct: 335 LKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAE 394

Query: 61  KKKEEQSVKEIEESLKERKEKTLTED 86
            + E + +K   ESL+ER E+     
Sbjct: 395 IRNELEELKREIESLEERLERLSERL 420



 Score = 33.1 bits (76), Expect = 0.25
 Identities = 17/87 (19%), Positives = 44/87 (50%), Gaps = 11/87 (12%)

Query: 10  EKKKRIEKRNKEEEKEENEQE---------EKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
           E ++++E+  +E    E E E         E+E E++EEE  + +++ ++ +++ +  ++
Sbjct: 713 ELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEE 772

Query: 61  KKKEEQSVKEIEESLKERKEKTLTEDS 87
              + +   EIEE  ++R+      + 
Sbjct: 773 ALAKLKE--EIEELEEKRQALQEELEE 797



 Score = 33.1 bits (76), Expect = 0.28
 Identities = 21/79 (26%), Positives = 44/79 (55%), Gaps = 4/79 (5%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E +  +++  +E+E+ E E  E E E  E     K++ +K +++ ++ + K ++ E  + 
Sbjct: 874 ELEDELKELEEEKEELEEELRELESELAE----LKEEIEKLRERLEELEAKLERLEVELP 929

Query: 70  EIEESLKERKEKTLTEDSQ 88
           E+EE L+E  E TL  + +
Sbjct: 930 ELEEELEEEYEDTLETELE 948



 Score = 32.0 bits (73), Expect = 0.57
 Identities = 17/73 (23%), Positives = 48/73 (65%), Gaps = 2/73 (2%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNK-KKKKKKKKKKKKKKKKKKKKEEQSV 68
           E   ++++   EE +E+ +  ++E E++EEE  + +++    +++ +  ++++++ EQ +
Sbjct: 772 EALAKLKEEI-EELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEI 830

Query: 69  KEIEESLKERKEK 81
           +E+EE ++E +EK
Sbjct: 831 EELEEEIEELEEK 843



 Score = 32.0 bits (73), Expect = 0.66
 Identities = 17/84 (20%), Positives = 41/84 (48%), Gaps = 2/84 (2%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEE--KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKK 59
             R         ++++  KE E  +EE  + E+E E+++EE  + +K+ ++ K + ++ +
Sbjct: 221 ELRELELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELR 280

Query: 60  KKKKEEQSVKEIEESLKERKEKTL 83
           ++ +E Q      +   E  E  +
Sbjct: 281 EELEELQEELLELKEEIEELEGEI 304



 Score = 31.6 bits (72), Expect = 0.77
 Identities = 24/107 (22%), Positives = 50/107 (46%), Gaps = 17/107 (15%)

Query: 2   RKRRGGAGEKKKRIEKRNKE-----EEKEEN-EQEEKEKEDVEEE-----------KNKK 44
           RK      E+  R+E+  +E     EE E+  E+ + E E++ EE           K + 
Sbjct: 238 RKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEI 297

Query: 45  KKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDT 91
           ++ + +    +++ ++ + E + ++E  E LKE+ E    E  + +T
Sbjct: 298 EELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERET 344



 Score = 30.8 bits (70), Expect = 1.2
 Identities = 16/68 (23%), Positives = 40/68 (58%), Gaps = 2/68 (2%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE--QSVKEIEE 73
           E R + EE E   +E K +    EE+ ++ + + ++ +++ ++ +++ EE  + ++E+EE
Sbjct: 706 ELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEE 765

Query: 74  SLKERKEK 81
            L+  +E 
Sbjct: 766 ELESLEEA 773



 Score = 30.8 bits (70), Expect = 1.2
 Identities = 15/88 (17%), Positives = 46/88 (52%), Gaps = 16/88 (18%)

Query: 10   EKKKRIE-KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKK-------------KKKKK 55
             K +R+E +  + EE+ E E E+  + ++E E  + +++ +              + +++
Sbjct: 919  AKLERLEVELPELEEELEEEYEDTLETELEREIERLEEEIEALGPVNLRAIEEYEEVEER 978

Query: 56   KKKKKKKKE--EQSVKEIEESLKERKEK 81
             ++ K ++E  E++ +++ E ++E  ++
Sbjct: 979  YEELKSQREDLEEAKEKLLEVIEELDKE 1006



 Score = 30.5 bits (69), Expect = 1.8
 Identities = 18/71 (25%), Positives = 43/71 (60%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E ++R++   +E E  E  +E  E+E  E E+  ++ ++K  + +++ ++ +K+ E+  +
Sbjct: 804 EAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKE 863

Query: 70  EIEESLKERKE 80
           E+EE   E++E
Sbjct: 864 ELEELEAEKEE 874



 Score = 30.1 bits (68), Expect = 2.0
 Identities = 20/97 (20%), Positives = 54/97 (55%), Gaps = 7/97 (7%)

Query: 2   RKRRGGAGEKKKRIEKR--NKEEEKEE-NEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK- 57
           ++      E+ ++++ R    EEE EE  E+ E+ +E +EE + + +  ++   K K++ 
Sbjct: 722 KRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEI 781

Query: 58  ---KKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDT 91
              ++K++  ++ ++E+EE L+E + +    + + ++
Sbjct: 782 EELEEKRQALQEELEELEEELEEAERRLDALERELES 818



 Score = 29.7 bits (67), Expect = 2.9
 Identities = 17/80 (21%), Positives = 41/80 (51%), Gaps = 4/80 (5%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKK----KKKKKKKKKKKKKKKKEE 65
           E+ +++    +E ++E  E+     E++EE     +++    + +  + + + ++ K+E 
Sbjct: 347 EELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREI 406

Query: 66  QSVKEIEESLKERKEKTLTE 85
           +S++E  E L ER E    E
Sbjct: 407 ESLEERLERLSERLEDLKEE 426



 Score = 29.3 bits (66), Expect = 4.2
 Identities = 13/82 (15%), Positives = 42/82 (51%), Gaps = 6/82 (7%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKE------KEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
           E ++ +E+  +E E+ +   EE E      +E + + K + ++ ++K++  +++ ++ ++
Sbjct: 741 ELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEE 800

Query: 64  EEQSVKEIEESLKERKEKTLTE 85
           E +  +   ++L+   E     
Sbjct: 801 ELEEAERRLDALERELESLEQR 822



 Score = 28.5 bits (64), Expect = 7.0
 Identities = 11/71 (15%), Positives = 36/71 (50%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E ++ + +   + EK E E +  + E    E   ++ +++ ++ +++ ++ K++     +
Sbjct: 671 ELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAALEE 730

Query: 70  EIEESLKERKE 80
           E+E+     +E
Sbjct: 731 ELEQLQSRLEE 741



 Score = 28.1 bits (63), Expect = 8.3
 Identities = 13/82 (15%), Positives = 43/82 (52%), Gaps = 1/82 (1%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
            ++K      + EE  E  +EE  + + E  + + + ++ K++ +  +++ ++  E+ ++
Sbjct: 363 LEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIESLEERLERLSER-LE 421

Query: 70  EIEESLKERKEKTLTEDSQCDT 91
           +++E LKE + +     ++ + 
Sbjct: 422 DLKEELKELEAELEELQTELEE 443


>gnl|CDD|227931 COG5644, COG5644, Uncharacterized conserved protein [Function
           unknown].
          Length = 869

 Score = 39.7 bits (92), Expect = 0.002
 Identities = 22/82 (26%), Positives = 35/82 (42%), Gaps = 3/82 (3%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVE--EEKNKKKKKKKKKKKKKKKKK 59
              R   G+KK R E     E+  E  +      D +    K  KKK  ++ ++ K+ KK
Sbjct: 605 SMDRILHGQKK-RAEGAVVFEKPLEATENFNPWLDRKMRRIKRIKKKAYRRIRRDKRLKK 663

Query: 60  KKKKEEQSVKEIEESLKERKEK 81
           K  +EE + +    S K+R   
Sbjct: 664 KMPEEENTQENHLGSEKKRHGG 685



 Score = 29.3 bits (65), Expect = 3.8
 Identities = 14/86 (16%), Positives = 40/86 (46%), Gaps = 2/86 (2%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
           KK+  +R + +++   +++  E+E+ +E     +KK+         K+ + ++++    +
Sbjct: 648 KKKAYRRIRRDKR--LKKKMPEEENTQENHLGSEKKRHGGVPDILLKEIEVEDDEKTPIL 705

Query: 72  EESLKERKEKTLTEDSQCDTMIPVFK 97
                E  E+ L+  +Q + +   F 
Sbjct: 706 SPGGDEEVEEGLSIKTQEELVALAFA 731


>gnl|CDD|218684 pfam05672, MAP7, MAP7 (E-MAP-115) family.  The organisation of
           microtubules varies with the cell type and is presumably
           controlled by tissue-specific microtubule-associated
           proteins (MAPs). The 115-kDa epithelial MAP
           (E-MAP-115/MAP7) has been identified as a
           microtubule-stabilising protein predominantly expressed
           in cell lines of epithelial origin. The binding of this
           microtubule associated protein is nucleotide
           independent.
          Length = 171

 Score = 38.2 bits (88), Expect = 0.002
 Identities = 20/72 (27%), Positives = 45/72 (62%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E K+R  +     E+E   QEE+   + EE+  +K ++++K++++++++ +K+KEE   +
Sbjct: 59  ELKRRAAEERLRREEEARRQEEERAREKEEKAKRKAEEEEKQEQEEQERIQKQKEEAEAR 118

Query: 70  EIEESLKERKEK 81
             EE+ + R E+
Sbjct: 119 AREEAERMRLER 130



 Score = 36.2 bits (83), Expect = 0.009
 Identities = 16/65 (24%), Positives = 44/65 (67%)

Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
          ++R   E++E+ EQE +E+E+ +  + ++ K++  +++ +++++ +++EE+  +E EE  
Sbjct: 31 KRRQAREQREQEEQERREQEEQDRLEREELKRRAAEERLRREEEARRQEEERAREKEEKA 90

Query: 76 KERKE 80
          K + E
Sbjct: 91 KRKAE 95



 Score = 36.2 bits (83), Expect = 0.010
 Identities = 20/80 (25%), Positives = 50/80 (62%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
           R RR     +++    R KEE+ +   +EE+++E  E+E+ +K+K++ + + +++ ++ +
Sbjct: 68  RLRREEEARRQEEERAREKEEKAKRKAEEEEKQEQEEQERIQKQKEEAEARAREEAERMR 127

Query: 62  KKEEQSVKEIEESLKERKEK 81
            + E+  ++IE+   ERK++
Sbjct: 128 LEREKHFQQIEQERLERKKR 147



 Score = 36.2 bits (83), Expect = 0.010
 Identities = 19/75 (25%), Positives = 55/75 (73%), Gaps = 3/75 (4%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNK---KKKKKKKKKKKKKKKKKKKKEEQ 66
           EK+++  ++ ++EE+E  EQEE+++ + EE K +   ++ +++++ +++++++ ++KEE+
Sbjct: 30  EKRRQAREQREQEEQERREQEEQDRLEREELKRRAAEERLRREEEARRQEEERAREKEEK 89

Query: 67  SVKEIEESLKERKEK 81
           + ++ EE  K+ +E+
Sbjct: 90  AKRKAEEEEKQEQEE 104



 Score = 35.5 bits (81), Expect = 0.021
 Identities = 22/87 (25%), Positives = 56/87 (64%), Gaps = 4/87 (4%)

Query: 3   KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKK----KKKKKKKKKKKKKK 58
           ++R  A E++++ E+  +E+E+++  + E+ K    EE+ ++    +++++++ ++K++K
Sbjct: 30  EKRRQAREQREQEEQERREQEEQDRLEREELKRRAAEERLRREEEARRQEEERAREKEEK 89

Query: 59  KKKKKEEQSVKEIEESLKERKEKTLTE 85
            K+K EE+  +E EE  + +K+K   E
Sbjct: 90  AKRKAEEEEKQEQEEQERIQKQKEEAE 116



 Score = 33.5 bits (76), Expect = 0.071
 Identities = 15/73 (20%), Positives = 45/73 (61%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
           E+  +EEE    E+E   +++ + ++  ++++K++++++++ +K+K++ E   +E  E +
Sbjct: 67  ERLRREEEARRQEEERAREKEEKAKRKAEEEEKQEQEEQERIQKQKEEAEARAREEAERM 126

Query: 76  KERKEKTLTEDSQ 88
           +  +EK   +  Q
Sbjct: 127 RLEREKHFQQIEQ 139



 Score = 32.0 bits (72), Expect = 0.25
 Identities = 19/71 (26%), Positives = 44/71 (61%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E+ +   +  K    EE  + E+E    EEE+ ++K++K K+K ++++K++++++E+  K
Sbjct: 51  EQDRLEREELKRRAAEERLRREEEARRQEEERAREKEEKAKRKAEEEEKQEQEEQERIQK 110

Query: 70  EIEESLKERKE 80
           + EE+    +E
Sbjct: 111 QKEEAEARARE 121



 Score = 31.2 bits (70), Expect = 0.47
 Identities = 20/80 (25%), Positives = 50/80 (62%), Gaps = 1/80 (1%)

Query: 4   RRGGAGEKKKRIEKRNKE-EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
           RR      +++ EK  ++ EE+E+ EQEE+E+   ++E+ + + +++ ++ + +++K  +
Sbjct: 76  RRQEEERAREKEEKAKRKAEEEEKQEQEEQERIQKQKEEAEARAREEAERMRLEREKHFQ 135

Query: 63  KEEQSVKEIEESLKERKEKT 82
           + EQ   E ++ L+E  ++T
Sbjct: 136 QIEQERLERKKRLEEIMKRT 155


>gnl|CDD|240341 PTZ00272, PTZ00272, heat shock protein 83 kDa (Hsp83); Provisional.
          Length = 701

 Score = 39.7 bits (92), Expect = 0.002
 Identities = 25/73 (34%), Positives = 44/73 (60%), Gaps = 1/73 (1%)

Query: 15  IEKRNKEEEKEENEQEEKEK-EDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
           +EK  ++E  +E+E++ K+  ED EE K ++ K+  + KKKK KK K+  +E  V+   +
Sbjct: 207 VEKTTEKEVTDEDEEDTKKADEDGEEPKVEEVKEGDEGKKKKTKKVKEVTKEYEVQNKHK 266

Query: 74  SLKERKEKTLTED 86
            L  R  K +T++
Sbjct: 267 PLWTRDPKDVTKE 279



 Score = 31.6 bits (71), Expect = 0.82
 Identities = 18/55 (32%), Positives = 33/55 (60%), Gaps = 1/55 (1%)

Query: 12  KKRIEKRNKEEEKEENEQ-EEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
           +K +   ++E+ K+ +E  EE + E+V+E    KKKK KK K+  K+ + + K +
Sbjct: 212 EKEVTDEDEEDTKKADEDGEEPKVEEVKEGDEGKKKKTKKVKEVTKEYEVQNKHK 266


>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
           Mitofilin controls mitochondrial cristae morphology.
           Mitofilin is enriched in the narrow space between the
           inner boundary and the outer membranes, where it forms a
           homotypic interaction and assembles into a large
           multimeric protein complex. The first 78 amino acids
           contain a typical amino-terminal-cleavable mitochondrial
           presequence rich in positive-charged and hydroxylated
           residues and a membrane anchor domain. In addition, it
           has three centrally located coiled coil domains.
          Length = 493

 Score = 39.6 bits (93), Expect = 0.002
 Identities = 20/80 (25%), Positives = 44/80 (55%), Gaps = 7/80 (8%)

Query: 13  KRIEKRNKEEEKE-EN---EQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
           K++ +   EEE+E E    E+ E+    +EEE   + + K+   +K+ + + ++++E+  
Sbjct: 177 KKLAELKAEEEEELERALKEKREELLSKLEEELLARLESKEAALEKQLRLEFEREKEELR 236

Query: 69  KEIEESLK---ERKEKTLTE 85
           K+ EE L+   ER+ +   +
Sbjct: 237 KKYEEKLRQELERQAEAHEQ 256



 Score = 39.2 bits (92), Expect = 0.003
 Identities = 12/77 (15%), Positives = 40/77 (51%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
           ++ ++++ +E   +  E+     E  E    K+ + + +++K++ +KK ++K  Q ++  
Sbjct: 191 ERALKEKREELLSKLEEELLARLESKEAALEKQLRLEFEREKEELRKKYEEKLRQELERQ 250

Query: 72  EESLKERKEKTLTEDSQ 88
            E+ +++ +  L   + 
Sbjct: 251 AEAHEQKLKNELALQAI 267



 Score = 33.1 bits (76), Expect = 0.24
 Identities = 22/79 (27%), Positives = 42/79 (53%), Gaps = 3/79 (3%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E++      +KE   E+  + E E+E  E  K  ++K +++ +++ +  ++K K E +++
Sbjct: 206 EEELLARLESKEAALEKQLRLEFEREKEELRKKYEEKLRQELERQAEAHEQKLKNELALQ 265

Query: 70  EIE---ESLKERKEKTLTE 85
            IE   E  KE KEK   E
Sbjct: 266 AIELQREFNKEIKEKVEEE 284



 Score = 30.8 bits (70), Expect = 1.3
 Identities = 14/83 (16%), Positives = 36/83 (43%), Gaps = 4/83 (4%)

Query: 14  RIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
            + K +K  + E  + +   +  + EE  + + K      +    K     ++ V+ +EE
Sbjct: 67  SVTKLSKGPKDEPKQVKIPRQSGLSEEVAEAEAKATSVAAEATTPKS---IQELVEALEE 123

Query: 74  SLKERKEKTLTEDSQCDTMIPVF 96
            L+E  ++T ++      ++ +F
Sbjct: 124 LLEELLKETASDPVV-QELVSIF 145



 Score = 28.1 bits (63), Expect = 7.6
 Identities = 14/85 (16%), Positives = 38/85 (44%), Gaps = 2/85 (2%)

Query: 3   KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
           K      + +  I    +E ++   +  E + E+ EE +   K+K+++   K +++   +
Sbjct: 153 KEDNLKDDLESLIASAKEELDQLSKKLAELKAEEEEELERALKEKREELLSKLEEELLAR 212

Query: 63  KEEQSVKEIEESLKE--RKEKTLTE 85
            E +     ++   E  R+++ L +
Sbjct: 213 LESKEAALEKQLRLEFEREKEELRK 237


>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein.  Function of MutS2 is
           unknown. It should not be considered a DNA mismatch
           repair protein. It is likely a DNA mismatch binding
           protein of unknown cellular function [DNA metabolism,
           Other].
          Length = 771

 Score = 39.4 bits (92), Expect = 0.002
 Identities = 24/81 (29%), Positives = 47/81 (58%), Gaps = 1/81 (1%)

Query: 6   GGAGEK-KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
           G   E+    IEK +  E++ E + E  EK   E+EK KK+ +++ ++ K++++ KK + 
Sbjct: 507 GEFKEEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLEL 566

Query: 65  EQSVKEIEESLKERKEKTLTE 85
           E+  +E  ++LK+  E  + E
Sbjct: 567 EKEAQEALKALKKEVESIIRE 587



 Score = 39.4 bits (92), Expect = 0.003
 Identities = 19/68 (27%), Positives = 43/68 (63%), Gaps = 3/68 (4%)

Query: 17  KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE---QSVKEIEE 73
           ++  E++ E  E+  KE+E +++E  ++ ++ K++++ KK + +K+ +E      KE+E 
Sbjct: 524 EKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEAQEALKALKKEVES 583

Query: 74  SLKERKEK 81
            ++E KEK
Sbjct: 584 IIRELKEK 591



 Score = 37.1 bits (86), Expect = 0.014
 Identities = 23/83 (27%), Positives = 42/83 (50%), Gaps = 13/83 (15%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKK------------KKKKKKKKKK 57
           +K + +EK  KE+EK + E E +E E+++E +  KK +            KK+ +   ++
Sbjct: 529 QKNEHLEKLLKEQEKLKKELE-QEMEELKERERNKKLELEKEAQEALKALKKEVESIIRE 587

Query: 58  KKKKKKEEQSVKEIEESLKERKE 80
            K+KK  +    +  E L + KE
Sbjct: 588 LKEKKIHKAKEIKSIEDLVKLKE 610



 Score = 36.0 bits (83), Expect = 0.030
 Identities = 23/73 (31%), Positives = 46/73 (63%), Gaps = 3/73 (4%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEE--EKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
             +K +E++N+  EK   EQE K K+++E+  E+ K++++ KK + +K+ ++  K  ++ 
Sbjct: 522 ALEKELEQKNEHLEKLLKEQE-KLKKELEQEMEELKERERNKKLELEKEAQEALKALKKE 580

Query: 68  VKEIEESLKERKE 80
           V+ I   LKE+K 
Sbjct: 581 VESIIRELKEKKI 593



 Score = 33.3 bits (76), Expect = 0.21
 Identities = 26/74 (35%), Positives = 42/74 (56%), Gaps = 1/74 (1%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
            KK+  ++  + +E+E N++ E EKE  E  K  KK+ +   ++ K+KK  K KE +S+ 
Sbjct: 544 LKKELEQEMEELKERERNKKLELEKEAQEALKALKKEVESIIRELKEKKIHKAKEIKSI- 602

Query: 70  EIEESLKERKEKTL 83
           E    LKE K+K  
Sbjct: 603 EDLVKLKETKQKIP 616



 Score = 28.6 bits (64), Expect = 5.5
 Identities = 17/74 (22%), Positives = 34/74 (45%), Gaps = 1/74 (1%)

Query: 15  IEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES 74
           I ++ K    E  E+     E +   + K+ ++K +  +K  K+++K K+E   +  E  
Sbjct: 498 IIEQAKTFYGEFKEEINVLIEKLSALE-KELEQKNEHLEKLLKEQEKLKKELEQEMEELK 556

Query: 75  LKERKEKTLTEDSQ 88
            +ER +K   E   
Sbjct: 557 ERERNKKLELEKEA 570


>gnl|CDD|218899 pfam06102, DUF947, Domain of unknown function (DUF947).  Family of
           eukaryotic proteins with unknown function.
          Length = 168

 Score = 38.0 bits (89), Expect = 0.002
 Identities = 19/78 (24%), Positives = 46/78 (58%), Gaps = 4/78 (5%)

Query: 13  KRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKK----KKKKKKKKKKKKKKKEEQSV 68
           + ++ R K  + ++ E+E  ++   +E++  K+ KK    KK + KK   KKK  E +  
Sbjct: 89  QSMKSRLKTLKNKDREREILKEHKKQEKELIKEGKKPYYLKKSEIKKLVLKKKFDELKKS 148

Query: 69  KEIEESLKERKEKTLTED 86
           K+++++L+++++K   ++
Sbjct: 149 KQLDKALEKKRKKNAGKE 166



 Score = 33.0 bits (76), Expect = 0.13
 Identities = 24/88 (27%), Positives = 40/88 (45%), Gaps = 10/88 (11%)

Query: 4   RRGGAGEKKKRIEKRNKEEEKEE---------NEQEEKEKEDVEEEKNKKKKKKKKKKKK 54
           R     E +K ++K    EEKEE         +  +  + +D E E  K+ KK++K+  K
Sbjct: 61  REKEIEELEKALKKTKDSEEKEELKRTLQSMKSRLKTLKNKDREREILKEHKKQEKELIK 120

Query: 55  KKKKKK-KKKEEQSVKEIEESLKERKEK 81
           + KK    KK E     +++   E K+ 
Sbjct: 121 EGKKPYYLKKSEIKKLVLKKKFDELKKS 148



 Score = 30.7 bits (70), Expect = 0.60
 Identities = 19/67 (28%), Positives = 37/67 (55%), Gaps = 1/67 (1%)

Query: 16  EKRNKEEEKEENE-QEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES 74
           + R KE E+ E   ++ K+ E+ EE K   +  K + K  K K ++++  ++  K+ +E 
Sbjct: 59  DYREKEIEELEKALKKTKDSEEKEELKRTLQSMKSRLKTLKNKDREREILKEHKKQEKEL 118

Query: 75  LKERKEK 81
           +KE K+ 
Sbjct: 119 IKEGKKP 125


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 39.7 bits (93), Expect = 0.002
 Identities = 19/75 (25%), Positives = 38/75 (50%), Gaps = 3/75 (4%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ--- 66
             +K IE  N ++E+ E E EE E    + E      KK++ + + + ++ ++K E+   
Sbjct: 851 SIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEA 910

Query: 67  SVKEIEESLKERKEK 81
            +++  + L E K K
Sbjct: 911 QIEKKRKRLSELKAK 925



 Score = 34.3 bits (79), Expect = 0.12
 Identities = 17/91 (18%), Positives = 47/91 (51%), Gaps = 5/91 (5%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKK----KKKKKKK 57
            +      E ++++ +   E+E  E E +E + E   + K + K  +K+      KK++ 
Sbjct: 808 SRIEARLREIEQKLNRLTLEKEYLEKEIQELQ-EQRIDLKEQIKSIEKEIENLNGKKEEL 866

Query: 58  KKKKKKEEQSVKEIEESLKERKEKTLTEDSQ 88
           +++ ++ E +++++E  L + K++    ++Q
Sbjct: 867 EEELEELEAALRDLESRLGDLKKERDELEAQ 897



 Score = 33.9 bits (78), Expect = 0.16
 Identities = 21/83 (25%), Positives = 45/83 (54%), Gaps = 3/83 (3%)

Query: 9   GEKKKRIEKRNKEEEKEENEQEEKEKE--DVEEEK-NKKKKKKKKKKKKKKKKKKKKKEE 65
           GE +K IE+  +EEEK +   EE E++   +E+E  N K + K+ + + ++ ++   K E
Sbjct: 719 GEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLE 778

Query: 66  QSVKEIEESLKERKEKTLTEDSQ 88
           +++ ++E  L   +   +  +  
Sbjct: 779 EALNDLEARLSHSRIPEIQAELS 801



 Score = 32.7 bits (75), Expect = 0.35
 Identities = 12/72 (16%), Positives = 37/72 (51%), Gaps = 3/72 (4%)

Query: 12  KKRIEKRNKEEEKEENE--QEEKEKEDVEEE-KNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
             + E+  +E E+ E      E    D+++E    + + ++ ++K ++ + + +K+ + +
Sbjct: 860 NGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRL 919

Query: 69  KEIEESLKERKE 80
            E++  L+  +E
Sbjct: 920 SELKAKLEALEE 931



 Score = 32.0 bits (73), Expect = 0.53
 Identities = 18/88 (20%), Positives = 45/88 (51%), Gaps = 7/88 (7%)

Query: 1   MRKRRGGAGEKKKRIEKRNKE------EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKK 54
           +++       + +RIE R  E      +   +  + EKE E +E+E+ K K++ ++ ++ 
Sbjct: 686 LKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEED 745

Query: 55  -KKKKKKKKKEEQSVKEIEESLKERKEK 81
               +++ +  +  +KE+E  ++E +E 
Sbjct: 746 LSSLEQEIENVKSELKELEARIEELEED 773



 Score = 32.0 bits (73), Expect = 0.62
 Identities = 20/87 (22%), Positives = 45/87 (51%), Gaps = 7/87 (8%)

Query: 2   RKRRGGAGEKKKRIEK-RNKEEEKEEN-EQEEKEKEDVEEEKNKKKKKKKKK-----KKK 54
            K      E ++ IE+     +EK +  E+  +E+E  E  +   K+K++ +     K+K
Sbjct: 173 EKALEELEEVEENIERLDLIIDEKRQQLERLRREREKAERYQALLKEKREYEGYELLKEK 232

Query: 55  KKKKKKKKKEEQSVKEIEESLKERKEK 81
           +  +++K+  E+ +  +EE L++  E+
Sbjct: 233 EALERQKEAIERQLASLEEELEKLTEE 259



 Score = 31.6 bits (72), Expect = 0.84
 Identities = 23/82 (28%), Positives = 44/82 (53%), Gaps = 6/82 (7%)

Query: 10  EKKKRIEKRNKEEEKEENEQE-EKEKEDVEEE---KNKKKKKKKKKKKKKKKKKKKKKEE 65
           EK++++E+  +E EK E  Q   KEK + E     K K+  +++K+  +++    +++ E
Sbjct: 195 EKRQQLERLRREREKAERYQALLKEKREYEGYELLKEKEALERQKEAIERQLASLEEELE 254

Query: 66  QSVKEIEESLKE--RKEKTLTE 85
           +  +EI E  K     E+ L E
Sbjct: 255 KLTEEISELEKRLEEIEQLLEE 276



 Score = 31.2 bits (71), Expect = 1.1
 Identities = 18/80 (22%), Positives = 44/80 (55%), Gaps = 8/80 (10%)

Query: 7   GAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK----KKKKK 62
           G  E  ++ EK    EE EE E+  +  + + +EK ++ ++ +++++K ++     K+K+
Sbjct: 164 GVAEFDRKKEK--ALEELEEVEENIERLDLIIDEKRQQLERLRREREKAERYQALLKEKR 221

Query: 63  KEEQS--VKEIEESLKERKE 80
           + E    +KE E   ++++ 
Sbjct: 222 EYEGYELLKEKEALERQKEA 241



 Score = 30.8 bits (70), Expect = 1.6
 Identities = 19/76 (25%), Positives = 40/76 (52%), Gaps = 7/76 (9%)

Query: 16  EKRNKEEEKEENEQEEKEK-EDVEEEKNKK-----KKKKKKKKKKKKKKKKKKKEEQSVK 69
           E+    E++ EN   +KE+ E+  EE          +    KK++ + + + ++ E+ ++
Sbjct: 847 EQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIE 906

Query: 70  EIEESLKERKEKTLTE 85
           E+E  + E+K K L+E
Sbjct: 907 ELEAQI-EKKRKRLSE 921



 Score = 29.3 bits (66), Expect = 3.8
 Identities = 18/95 (18%), Positives = 40/95 (42%), Gaps = 18/95 (18%)

Query: 11  KKKRIEKRNKEEEKEENEQE----EKEKEDVEEEKNKKKKK-------------KKKKKK 53
           ++   +  + E+E E  + E    E   E++EE+ +K ++               + + +
Sbjct: 740 EELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLSHSRIPEIQAE 799

Query: 54  KKKKKKKKKKEEQSVKEIEESLKERK-EKTLTEDS 87
             K +++  + E  ++EIE+ L     EK   E  
Sbjct: 800 LSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKE 834



 Score = 29.3 bits (66), Expect = 4.4
 Identities = 15/85 (17%), Positives = 40/85 (47%), Gaps = 9/85 (10%)

Query: 10  EKKKRIEKRNKEEEKEENE----QEEKEK--EDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
           + ++++EK  +E  + + E    QEE ++  E++ +        + K  + +++K+ K  
Sbjct: 389 DYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKAL 448

Query: 64  EEQSVKEIEESLKERKEKTLTEDSQ 88
           E   +K+ E  L++        + +
Sbjct: 449 E---IKKQEWKLEQLAADLSKYEQE 470



 Score = 28.9 bits (65), Expect = 4.9
 Identities = 17/83 (20%), Positives = 31/83 (37%), Gaps = 7/83 (8%)

Query: 4   RRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKK-------KKKKKKKK 56
            R     ++   E  +        E +  E E+ +E+K  + KK+            K +
Sbjct: 409 DRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYE 468

Query: 57  KKKKKKKEEQSVKEIEESLKERK 79
           ++    KEE    E E S  +R+
Sbjct: 469 QELYDLKEEYDRVEKELSKLQRE 491



 Score = 28.9 bits (65), Expect = 5.2
 Identities = 21/76 (27%), Positives = 40/76 (52%), Gaps = 11/76 (14%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E+  RIE R +E E++ N     EKE +E+E  + ++++   K++ K          S++
Sbjct: 805 EEVSRIEARLREIEQKLNR-LTLEKEYLEKEIQELQEQRIDLKEQIK----------SIE 853

Query: 70  EIEESLKERKEKTLTE 85
           +  E+L  +KE+   E
Sbjct: 854 KEIENLNGKKEELEEE 869



 Score = 28.9 bits (65), Expect = 5.6
 Identities = 15/70 (21%), Positives = 36/70 (51%), Gaps = 1/70 (1%)

Query: 13  KRIEKRNKEEEKEENEQEEKEKE-DVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
           +R  +    EE  E ++E ++   ++EE   +  + + + K  ++K +K K+E   +K  
Sbjct: 348 ERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKRE 407

Query: 72  EESLKERKEK 81
            + L+E  ++
Sbjct: 408 LDRLQEELQR 417


>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
           [General function prediction only].
          Length = 239

 Score = 38.5 bits (90), Expect = 0.002
 Identities = 15/88 (17%), Positives = 40/88 (45%)

Query: 7   GAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
            A + ++ +   N E +  +      E E  E  +  +K +K+ +  K++ ++ +K   +
Sbjct: 83  SAVKDERELRALNIEIQIAKERINSLEDELAELMEEIEKLEKEIEDLKERLERLEKNLAE 142

Query: 67  SVKEIEESLKERKEKTLTEDSQCDTMIP 94
           +   +EE + E +E+     S+ + +  
Sbjct: 143 AEARLEEEVAEIREEGQELSSKREELKE 170



 Score = 32.7 bits (75), Expect = 0.19
 Identities = 16/82 (19%), Positives = 37/82 (45%), Gaps = 15/82 (18%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKK---------------KKKKKKKKKKKKK 60
           E+  + EEK    ++E+E   +  E    K++               +K +K+ +  K++
Sbjct: 73  ERIKRAEEKLSAVKDERELRALNIEIQIAKERINSLEDELAELMEEIEKLEKEIEDLKER 132

Query: 61  KKKEEQSVKEIEESLKERKEKT 82
            ++ E+++ E E  L+E   + 
Sbjct: 133 LERLEKNLAEAEARLEEEVAEI 154



 Score = 31.2 bits (71), Expect = 0.73
 Identities = 18/91 (19%), Positives = 42/91 (46%), Gaps = 14/91 (15%)

Query: 10  EKKKRIEKRNKEEEKEENEQE---------EKEKEDVEEEKNK-----KKKKKKKKKKKK 55
             + RI++  K  +K + E E         E E ED+E + ++     ++ +++ K+ ++
Sbjct: 21  RLEPRIKEIRKALKKAKAELEALNKALEALEIELEDLENQVSQLESEIQEIRERIKRAEE 80

Query: 56  KKKKKKKKEEQSVKEIEESLKERKEKTLTED 86
           K    K + E     IE  + + +  +L ++
Sbjct: 81  KLSAVKDERELRALNIEIQIAKERINSLEDE 111



 Score = 29.6 bits (67), Expect = 1.9
 Identities = 17/82 (20%), Positives = 34/82 (41%), Gaps = 9/82 (10%)

Query: 10  EKKKRIEKRNKEEEKEENE--------QEEKEKEDVEEEKNKKKKKKKKKKK-KKKKKKK 60
              K +E    E E  EN+        QE +E+    EEK    K +++ +    + +  
Sbjct: 42  ALNKALEALEIELEDLENQVSQLESEIQEIRERIKRAEEKLSAVKDERELRALNIEIQIA 101

Query: 61  KKKEEQSVKEIEESLKERKEKT 82
           K++      E+ E ++E ++  
Sbjct: 102 KERINSLEDELAELMEEIEKLE 123


>gnl|CDD|237869 PRK14962, PRK14962, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 472

 Score = 39.4 bits (92), Expect = 0.002
 Identities = 18/53 (33%), Positives = 33/53 (62%)

Query: 26  ENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKER 78
            N QE   +E  +    ++K+KKK++ K K++K++  + E+  KE+ E LKE+
Sbjct: 337 PNVQENDVEEKNDNSNVQQKEKKKEESKAKEEKQEDIEFEKRFKELMEELKEK 389


>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
           eukaryotic snRNP [Transcription].
          Length = 564

 Score = 39.3 bits (92), Expect = 0.003
 Identities = 17/57 (29%), Positives = 28/57 (49%)

Query: 8   AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
             E K+ I      +   E  + +K  E++ EE  ++   K KKKK+KKK+  +K  
Sbjct: 386 LSELKEAIAYYESAKTALEKAEGKKAIEEIREELIEEGLLKSKKKKRKKKEWFEKFR 442



 Score = 38.9 bits (91), Expect = 0.004
 Identities = 14/50 (28%), Positives = 27/50 (54%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
           E     E  +   ++ + K+ +EE + +  ++   K KKKK+KKK+  E+
Sbjct: 391 EAIAYYESAKTALEKAEGKKAIEEIREELIEEGLLKSKKKKRKKKEWFEK 440



 Score = 31.6 bits (72), Expect = 0.72
 Identities = 17/78 (21%), Positives = 31/78 (39%), Gaps = 3/78 (3%)

Query: 8   AGEKKKRIEKRNKEEEKEENEQEEKEKE---DVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
             +K K   +  +   K+  + +  +      + E K      +  K   +K + KK  E
Sbjct: 354 ELDKSKTPSENAQRYFKKYKKLKGAKVNLDRQLSELKEAIAYYESAKTALEKAEGKKAIE 413

Query: 65  EQSVKEIEESLKERKEKT 82
           E   + IEE L + K+K 
Sbjct: 414 EIREELIEEGLLKSKKKK 431



 Score = 30.4 bits (69), Expect = 1.7
 Identities = 13/64 (20%), Positives = 31/64 (48%), Gaps = 2/64 (3%)

Query: 24  KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE--IEESLKERKEK 81
           K      +++  +++E     +  K   +K + KK  ++  E+ ++E  ++   K+RK+K
Sbjct: 376 KGAKVNLDRQLSELKEAIAYYESAKTALEKAEGKKAIEEIREELIEEGLLKSKKKKRKKK 435

Query: 82  TLTE 85
              E
Sbjct: 436 EWFE 439



 Score = 29.7 bits (67), Expect = 2.8
 Identities = 17/76 (22%), Positives = 29/76 (38%), Gaps = 6/76 (7%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKK--KKKKKKKKKKKKKKKKKKEE-- 65
           E  K +   +    +E   + +K K   E  +   K  KK K  K    ++  + KE   
Sbjct: 335 EGLKSVRLADFYGNEEIKIELDKSKTPSENAQRYFKKYKKLKGAKVNLDRQLSELKEAIA 394

Query: 66  --QSVKEIEESLKERK 79
             +S K   E  + +K
Sbjct: 395 YYESAKTALEKAEGKK 410



 Score = 29.3 bits (66), Expect = 3.3
 Identities = 18/100 (18%), Positives = 38/100 (38%), Gaps = 28/100 (28%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKED----------------------VEEEKNKKKKK 47
           ++ K++E + +++E E  E E+  +E                       + +    ++ K
Sbjct: 293 KELKKLENKLEKQEDELEELEKAAEELRQKGELLYANLQLIEEGLKSVRLADFYGNEEIK 352

Query: 48  KKKKKKKK------KKKKKKKKEEQSVKEIEESLKERKEK 81
            +  K K       +  KK KK + +   ++  L E KE 
Sbjct: 353 IELDKSKTPSENAQRYFKKYKKLKGAKVNLDRQLSELKEA 392



 Score = 28.5 bits (64), Expect = 6.6
 Identities = 14/40 (35%), Positives = 23/40 (57%)

Query: 38  EEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKE 77
           E +K K+   + +KK +K+ KK + K E+   E+EE  K 
Sbjct: 276 ERDKIKQLASELEKKLEKELKKLENKLEKQEDELEELEKA 315



 Score = 28.1 bits (63), Expect = 9.2
 Identities = 12/45 (26%), Positives = 28/45 (62%)

Query: 37  VEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
           ++E+  + K K+   + +KK +K+ KK E  +++ E+ L+E ++ 
Sbjct: 271 LDEKFERDKIKQLASELEKKLEKELKKLENKLEKQEDELEELEKA 315



 Score = 28.1 bits (63), Expect = 9.2
 Identities = 12/53 (22%), Positives = 33/53 (62%), Gaps = 1/53 (1%)

Query: 33  EKEDVEEEKNKKKKK-KKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLT 84
           E++ +++  ++ +KK +K+ KK + K +K++ E + +++  E L+++ E    
Sbjct: 276 ERDKIKQLASELEKKLEKELKKLENKLEKQEDELEELEKAAEELRQKGELLYA 328


>gnl|CDD|221733 pfam12720, DUF3807, Protein of unknown function (DUF3807).  This is
           a family of conserved fungal proteins of unknown
           function.
          Length = 169

 Score = 37.8 bits (88), Expect = 0.003
 Identities = 17/69 (24%), Positives = 34/69 (49%), Gaps = 3/69 (4%)

Query: 16  EKRNKEEEKEENEQE---EKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIE 72
           E+  KEE + E E E     +   V  E +  +K+ +++   +K+K  + KE +S K+  
Sbjct: 68  ERELKEEAEAEEEGEVDASPDAGAVAGESSADRKEAEQQGAAQKRKSCRDKERKSAKDPR 127

Query: 73  ESLKERKEK 81
              ++  +K
Sbjct: 128 GGTQDVVDK 136



 Score = 34.7 bits (80), Expect = 0.028
 Identities = 13/60 (21%), Positives = 21/60 (35%), Gaps = 1/60 (1%)

Query: 8   AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
           AG          KE E++   Q+ K   D  E K+ K  +   +    K +      E+ 
Sbjct: 89  AGAVAGESSADRKEAEQQGAAQKRKSCRDK-ERKSAKDPRGGTQDVVDKSQASLDYGEEE 147



 Score = 32.0 bits (73), Expect = 0.24
 Identities = 15/92 (16%), Positives = 40/92 (43%), Gaps = 7/92 (7%)

Query: 1   MRKRR----GGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKK 56
           +R+R       A E++  ++              ++++ + +    K+K  + K++K  K
Sbjct: 66  LRERELKEEAEA-EEEGEVDASPDAGAVAGESSADRKEAEQQGAAQKRKSCRDKERKSAK 124

Query: 57  KKKKKKKEEQSVKEIEESLKERKEKTLTEDSQ 88
             +     +  V + + SL   +E+T  +++Q
Sbjct: 125 DPRGGT--QDVVDKSQASLDYGEEETQQQEAQ 154


>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
           domain.  This domain is found in a number of different
           types of plant proteins including NAM-like proteins.
          Length = 147

 Score = 37.4 bits (87), Expect = 0.003
 Identities = 23/68 (33%), Positives = 37/68 (54%), Gaps = 3/68 (4%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKK---KKKKKEEQSVKEIE 72
                 EE E+ + E   +    E + K K+K ++ K K KK+   K+K+KEE+ +K + 
Sbjct: 40  PGSTSNEENEDEDDESTAESKRPEGRKKAKEKLRRDKLKAKKEEAEKEKEKEERFMKALA 99

Query: 73  ESLKERKE 80
           E+ KER E
Sbjct: 100 EAEKERAE 107



 Score = 36.6 bits (85), Expect = 0.005
 Identities = 18/64 (28%), Positives = 39/64 (60%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
            + KR E R K +EK   ++ + +KE+ E+EK K+++  K   + +K++ + +K++   K
Sbjct: 57  AESKRPEGRKKAKEKLRRDKLKAKKEEAEKEKEKEERFMKALAEAEKERAELEKKKAEAK 116

Query: 70  EIEE 73
            ++E
Sbjct: 117 LMKE 120



 Score = 34.7 bits (80), Expect = 0.025
 Identities = 17/65 (26%), Positives = 40/65 (61%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
            KR  G  + K+++ +   + +KEE E+E++++E   +   + +K++ + +KKK + K  
Sbjct: 59  SKRPEGRKKAKEKLRRDKLKAKKEEAEKEKEKEERFMKALAEAEKERAELEKKKAEAKLM 118

Query: 62  KKEEQ 66
           K+E++
Sbjct: 119 KEEKK 123



 Score = 30.8 bits (70), Expect = 0.58
 Identities = 15/51 (29%), Positives = 29/51 (56%)

Query: 31  EKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
            KEK   ++ K KK++ +K+K+K+++  K   + E+   E+E+   E K  
Sbjct: 68  AKEKLRRDKLKAKKEEAEKEKEKEERFMKALAEAEKERAELEKKKAEAKLM 118


>gnl|CDD|206228 pfam14058, PcfK, PcfK-like protein.  The PcfK-like protein family
           includes the Enterococcus faecalis PcfK protein, which
           is functionally uncharacterized. This family of proteins
           is found in bacteria and viruses. Proteins in this
           family are typically between 137 and 257 amino acids in
           length. There are two completely conserved residues (D
           and L) that may be functionally important.
          Length = 136

 Score = 37.0 bits (86), Expect = 0.003
 Identities = 14/57 (24%), Positives = 27/57 (47%), Gaps = 9/57 (15%)

Query: 19  NKEEEKEENEQE---------EKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
           + +  K  N +          E+EK +  +E  K  ++++ +K +K+ KK KK E  
Sbjct: 71  DIKVGKPINCRVTVNHTVELTEEEKAEARKEALKAYQQEELRKIQKRSKKSKKAEPV 127



 Score = 34.7 bits (80), Expect = 0.019
 Identities = 16/58 (27%), Positives = 29/58 (50%), Gaps = 3/58 (5%)

Query: 10  EKKKRIEKR---NKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
           +  K I  R   N   E  E E+ E  KE ++  + ++ +K +K+ KK KK +  + +
Sbjct: 73  KVGKPINCRVTVNHTVELTEEEKAEARKEALKAYQQEELRKIQKRSKKSKKAEPVQGQ 130


>gnl|CDD|224114 COG1193, COG1193, Mismatch repair ATPase (MutS family) [DNA
           replication, recombination, and repair].
          Length = 753

 Score = 38.9 bits (91), Expect = 0.004
 Identities = 24/93 (25%), Positives = 38/93 (40%), Gaps = 17/93 (18%)

Query: 10  EKKKRIEKRNKEEEKEENEQE------EKEKEDVEEEKNKKK-----------KKKKKKK 52
           E +K +E+  +E EK  +E E         K  + E K +             ++ K + 
Sbjct: 515 EVRKELEEELEEVEKLLDEVELLTGANSGGKTSLLELKAQIVVLAHMGLPVPAEEAKVEA 574

Query: 53  KKKKKKKKKKKEEQSVKEIEESLKERKEKTLTE 85
             + K  KKKK        E +LK++K K L E
Sbjct: 575 VDEVKFLKKKKGILDAGAFESTLKDKKNKVLPE 607



 Score = 32.3 bits (74), Expect = 0.50
 Identities = 12/65 (18%), Positives = 23/65 (35%)

Query: 9   GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
           G K   +E + +             +E   E  ++ K  KKKK        +   +++  
Sbjct: 543 GGKTSLLELKAQIVVLAHMGLPVPAEEAKVEAVDEVKFLKKKKGILDAGAFESTLKDKKN 602

Query: 69  KEIEE 73
           K + E
Sbjct: 603 KVLPE 607


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 38.8 bits (91), Expect = 0.004
 Identities = 19/76 (25%), Positives = 42/76 (55%), Gaps = 4/76 (5%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEK--EKEDVEEEKNKKKKKKKKK--KKKKKKKKKKKKEE 65
           + +  +E+  KE   E    E +  ++E+  + K +   KK++   KK+K+   K+K  +
Sbjct: 62  KLRAELERELKERRNELQRLERRLLQREETLDRKMESLDKKEENLEKKEKELSNKEKNLD 121

Query: 66  QSVKEIEESLKERKEK 81
           +  +E+EE + E++E+
Sbjct: 122 EKEEELEELIAEQREE 137



 Score = 38.0 bits (89), Expect = 0.007
 Identities = 26/97 (26%), Positives = 51/97 (52%), Gaps = 10/97 (10%)

Query: 1   MRKRRGGAGEKKKRIE---KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK 57
             K+ G A E  KRI    K+  E  K+E   E KE+      + +++ K+++ + ++ +
Sbjct: 23  AEKKLGSAEELAKRIIEEAKKEAETLKKEALLEAKEEVHKLRAELERELKERRNELQRLE 82

Query: 58  KKKKKKEEQ------SVKEIEESLKERKEKTLTEDSQ 88
           ++  ++EE       S+ + EE+L E+KEK L+   +
Sbjct: 83  RRLLQREETLDRKMESLDKKEENL-EKKEKELSNKEK 118



 Score = 37.2 bits (87), Expect = 0.010
 Identities = 21/82 (25%), Positives = 42/82 (51%), Gaps = 8/82 (9%)

Query: 8   AGEKKKRIEKRNKEE---EKEENEQEEKEKED---VEEEKNKKKKKKKKKKKKKKKKKKK 61
           A   KK      KEE    + E E+E KE+ +     E +  ++++   +K +   KK++
Sbjct: 45  AETLKKEALLEAKEEVHKLRAELERELKERRNELQRLERRLLQREETLDRKMESLDKKEE 104

Query: 62  K--KEEQSVKEIEESLKERKEK 81
              K+E+ +   E++L E++E+
Sbjct: 105 NLEKKEKELSNKEKNLDEKEEE 126



 Score = 34.1 bits (79), Expect = 0.12
 Identities = 23/83 (27%), Positives = 43/83 (51%), Gaps = 7/83 (8%)

Query: 10  EKKKRIEKRNKEEEKE----ENEQEEKEKEDVEEEKNKKKKKKKKKKKKK---KKKKKKK 62
           E K+ + K   E E+E     NE +  E+  ++ E+   +K +   KK++   KK+K+  
Sbjct: 55  EAKEEVHKLRAELERELKERRNELQRLERRLLQREETLDRKMESLDKKEENLEKKEKELS 114

Query: 63  KEEQSVKEIEESLKERKEKTLTE 85
            +E+++ E EE L+E   +   E
Sbjct: 115 NKEKNLDEKEEELEELIAEQREE 137



 Score = 31.4 bits (72), Expect = 0.85
 Identities = 22/85 (25%), Positives = 45/85 (52%), Gaps = 3/85 (3%)

Query: 1   MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEE-EKNKKKKKKKKKKKKKKKKK 59
           + K+     +K+K +  + K  +++E E EE   E  EE E+     +++ K+   ++ +
Sbjct: 99  LDKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQREELERISGLTQEEAKEILLEEVE 158

Query: 60  KKKKEEQS--VKEIEESLKERKEKT 82
           ++ + E +  +KEIEE  KE  +K 
Sbjct: 159 EEARHEAAKLIKEIEEEAKEEADKK 183



 Score = 31.0 bits (71), Expect = 1.0
 Identities = 15/62 (24%), Positives = 36/62 (58%), Gaps = 1/62 (1%)

Query: 13  KRIEKRNKE-EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
           +R+E+R  + EE  + + E  +K++   EK +K+   K+K   +K+++ ++   +  +E+
Sbjct: 79  QRLERRLLQREETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQREEL 138

Query: 72  EE 73
           E 
Sbjct: 139 ER 140



 Score = 29.5 bits (67), Expect = 2.7
 Identities = 19/108 (17%), Positives = 43/108 (39%), Gaps = 19/108 (17%)

Query: 28  EQEEKEKEDVEEEKNKKKKKKKKKK---KKKKKKKKKKKEEQSVKEIEESLKERKEKTLT 84
              E+  + + EE  K+ +  KK+     K++  K + + E+ +KE    L+  + + L 
Sbjct: 28  GSAEELAKRIIEEAKKEAETLKKEALLEAKEEVHKLRAELERELKERRNELQRLERRLLQ 87

Query: 85  EDSQCDTMIPVFKKGVLYKGIYCPSLTNSFREPSLEMSYQRYSHRQRQ 132
            +   D  +    K                +E +LE   +  S++++ 
Sbjct: 88  REETLDRKMESLDK----------------KEENLEKKEKELSNKEKN 119


>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
           homolog [Translation, ribosomal structure and
           biogenesis].
          Length = 395

 Score = 38.1 bits (89), Expect = 0.005
 Identities = 13/54 (24%), Positives = 32/54 (59%), Gaps = 1/54 (1%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
           ++ +EKR  E+ KE+  +   + +   ++K +  + ++KKK+KK K +++  + 
Sbjct: 343 REELEKR-IEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQN 395



 Score = 32.4 bits (74), Expect = 0.32
 Identities = 12/52 (23%), Positives = 27/52 (51%)

Query: 28  EQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERK 79
           E+ EK  E ++E+  K   K K ++ KK++  + +++++  K   E    + 
Sbjct: 344 EELEKRIEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQN 395



 Score = 32.4 bits (74), Expect = 0.41
 Identities = 12/57 (21%), Positives = 31/57 (54%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIE 72
            + +    +EE E+  ++ ++   +   K K ++ KK++  + ++KKKE+++  E  
Sbjct: 335 GEPDGISLREELEKRIEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERR 391



 Score = 31.6 bits (72), Expect = 0.56
 Identities = 15/53 (28%), Positives = 27/53 (50%), Gaps = 1/53 (1%)

Query: 21  EEEKEENEQEEKEKEDVEEEKNKK-KKKKKKKKKKKKKKKKKKKEEQSVKEIE 72
            EE E+  ++ KEK      K K  + KK++  + ++KKK+KK + +      
Sbjct: 343 REELEKRIEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQN 395



 Score = 29.3 bits (66), Expect = 3.6
 Identities = 14/53 (26%), Positives = 27/53 (50%), Gaps = 9/53 (16%)

Query: 34  KEDVEE------EKNKKKKKKKKKKKKKKK---KKKKKKEEQSVKEIEESLKE 77
           +E++E+      EK  K   K K ++ KK+   + ++KK+E+  K     L+ 
Sbjct: 343 REELEKRIEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQN 395



 Score = 28.5 bits (64), Expect = 6.7
 Identities = 13/60 (21%), Positives = 28/60 (46%)

Query: 22  EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
            E +     E+ ++ +E+ K K  K   K K ++ KK++  +  +  KE +   + R  +
Sbjct: 335 GEPDGISLREELEKRIEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQ 394



 Score = 28.1 bits (63), Expect = 8.6
 Identities = 6/48 (12%), Positives = 26/48 (54%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK 57
           ++ ++++++  +   +   + +K++      + KK+KK K +++  + 
Sbjct: 348 KRIEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQN 395


>gnl|CDD|237258 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
          Length = 925

 Score = 38.5 bits (90), Expect = 0.005
 Identities = 18/77 (23%), Positives = 30/77 (38%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E K  + +  + EE+ +N  E K +    +   KK  K    K    K      E +  +
Sbjct: 849 EIKLELSEIQEAEEEIQNINENKNEFVEFKNDPKKLNKLIIAKDVLIKLVISSDEIKQDE 908

Query: 70  EIEESLKERKEKTLTED 86
           +  +  K+  EKT  E 
Sbjct: 909 KTTKKKKKDLEKTDEEA 925



 Score = 32.3 bits (74), Expect = 0.48
 Identities = 16/72 (22%), Positives = 34/72 (47%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E   +I +  +E  K    Q E+     EE +   +++K  K++ K +  + ++ E+ ++
Sbjct: 806 ELIDQIIESEEERLKALRIQREEMLMRPEELELINEEQKNLKQEIKLELSEIQEAEEEIQ 865

Query: 70  EIEESLKERKEK 81
            I E+  E  E 
Sbjct: 866 NINENKNEFVEF 877


>gnl|CDD|218148 pfam04557, tRNA_synt_1c_R2, Glutaminyl-tRNA synthetase,
          non-specific RNA binding region part 2.  This is a
          region found N terminal to the catalytic domain of
          glutaminyl-tRNA synthetase (EC 6.1.1.18) in eukaryotes
          but not in Escherichia coli. This region is thought to
          bind RNA in a non-specific manner, enhancing
          interactions between the tRNA and enzyme, but is not
          essential for enzyme function.
          Length = 83

 Score = 35.4 bits (82), Expect = 0.005
 Identities = 13/31 (41%), Positives = 18/31 (58%)

Query: 40 EKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
          E +  KKKKKKKKKK +      K +++  E
Sbjct: 19 EADLVKKKKKKKKKKAEDTAATAKAKKATAE 49



 Score = 34.6 bits (80), Expect = 0.008
 Identities = 13/35 (37%), Positives = 17/35 (48%)

Query: 46 KKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
            KKKKKKKKKK +      ++ K   E + E   
Sbjct: 22 LVKKKKKKKKKKAEDTAATAKAKKATAEDVSEGAM 56



 Score = 34.3 bits (79), Expect = 0.011
 Identities = 19/51 (37%), Positives = 25/51 (49%), Gaps = 6/51 (11%)

Query: 32 KEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE----IEESLKER 78
          K + D+   K KKKKKKKK +      K KK   + V E    +E  L E+
Sbjct: 17 KTEADLV--KKKKKKKKKKAEDTAATAKAKKATAEDVSEGAMFMEGFLGEK 65



 Score = 28.5 bits (64), Expect = 1.4
 Identities = 13/43 (30%), Positives = 22/43 (51%)

Query: 44 KKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTED 86
          K +    KKKKKKKKKK +    + K  + + ++  E  +  +
Sbjct: 17 KTEADLVKKKKKKKKKKAEDTAATAKAKKATAEDVSEGAMFME 59


>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97
          family.  This model family represents the major capsid
          protein component of the heads (capsids) of
          bacteriophage HK97, phi-105, P27, and related phage.
          This model represents one of several analogous families
          lacking detectable sequence similarity. The gene
          encoding this component is typically located in an
          operon encoding the small and large terminase subunits,
          the portal protein and the prohead or maturation
          protease [Mobile and extrachromosomal element
          functions, Prophage functions].
          Length = 384

 Score = 38.1 bits (89), Expect = 0.006
 Identities = 17/82 (20%), Positives = 38/82 (46%)

Query: 1  MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
          +++ R    E  K + K  ++E+  E E+E+ E + ++EE +K   +  + ++   + + 
Sbjct: 1  LKELREALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEA 60

Query: 61 KKKEEQSVKEIEESLKERKEKT 82
          K          EE  +E K + 
Sbjct: 61 KPAASGEGGGGEEEEEEAKAEA 82



 Score = 31.9 bits (73), Expect = 0.52
 Identities = 11/73 (15%), Positives = 31/73 (42%), Gaps = 3/73 (4%)

Query: 1  MRKRRGGAGEKKKRIEKRNKE---EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK 57
            + +      K+ I+K + E    E+  +E E K     E    ++++++ K +  + +
Sbjct: 27 AEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAASGEGGGGEEEEEEAKAEAAEFR 86

Query: 58 KKKKKKEEQSVKE 70
             +  ++   +E
Sbjct: 87 AYLRGGDDALAEE 99



 Score = 29.6 bits (67), Expect = 2.3
 Identities = 10/66 (15%), Positives = 30/66 (45%), Gaps = 6/66 (9%)

Query: 23 EKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKT 82
           KE  E   +  +++ +    +K  + +++K +    K++     + +++  + +R E+ 
Sbjct: 1  LKELREALAELAKELRKLTEDEKLAEAEEEKAEYDALKEE-----IDKLDAEI-DRLEEL 54

Query: 83 LTEDSQ 88
          L E   
Sbjct: 55 LDELEA 60


>gnl|CDD|221244 pfam11818, DUF3340, C-terminal domain of tail specific protease
           (DUF3340).  This presumed domain is found at the
           C-terminus of tail specific proteases. Its function is
           unknown. This family is found in bacteria and
           eukaryotes. This presumed domain is typically between 88
           to 187 amino acids in length.
          Length = 139

 Score = 36.4 bits (85), Expect = 0.006
 Identities = 19/77 (24%), Positives = 44/77 (57%), Gaps = 8/77 (10%)

Query: 12  KKRIEKRNKE----EEKEENEQEEKEKEDV---EEEKNKKKKKKKKKKKKKKKKKKKKKE 64
           K RI K + E     E     +EE++ + V   E E+  ++++ +KK+ K++ +++K   
Sbjct: 45  KARIAK-DPEFQYLNEDIAWLKEERDDKTVSLNEAERKAEREEDEKKRLKRENERRKALG 103

Query: 65  EQSVKEIEESLKERKEK 81
           E+ +K +++  K+++E 
Sbjct: 104 EEPLKSLDDLPKDKEEP 120



 Score = 30.6 bits (70), Expect = 0.50
 Identities = 13/46 (28%), Positives = 24/46 (52%)

Query: 19  NKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
           N+ E K E E++EK++   E E+ K   ++  K      K K++ +
Sbjct: 76  NEAERKAEREEDEKKRLKRENERRKALGEEPLKSLDDLPKDKEEPD 121


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 38.2 bits (89), Expect = 0.006
 Identities = 18/71 (25%), Positives = 42/71 (59%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
            ++   E R  EEE  E  + E+  ++  EEK +K +   ++ ++ K+K + ++ ++ ++
Sbjct: 503 IEELEKELRELEEELIELLELEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELR 562

Query: 70  EIEESLKERKE 80
           ++E+ L+E KE
Sbjct: 563 QLEDRLQELKE 573



 Score = 37.8 bits (88), Expect = 0.010
 Identities = 16/72 (22%), Positives = 42/72 (58%), Gaps = 2/72 (2%)

Query: 9   GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
            + ++   +  +E ++ ENE++ +EK   EE +  +++ ++ +++ ++  KK  + EQ +
Sbjct: 661 EKVEELEAEIRRELQRIENEEQLEEKL--EELEQLEEELEQLREELEELLKKLGEIEQLI 718

Query: 69  KEIEESLKERKE 80
           +E+E    E +E
Sbjct: 719 EELESRKAELEE 730



 Score = 37.0 bits (86), Expect = 0.017
 Identities = 17/87 (19%), Positives = 44/87 (50%), Gaps = 2/87 (2%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
           R+R     +K K +E+R  + E+     E  E E   E +  +++ + + +K   + + +
Sbjct: 592 RERLKELKKKLKELEERLSQLEELLQSLELSEAE--NELEEAEEELESELEKLNLQAELE 649

Query: 62  KKEEQSVKEIEESLKERKEKTLTEDSQ 88
           +  + +++E+EE ++E + +   E  +
Sbjct: 650 ELLQAALEELEEKVEELEAEIRRELQR 676



 Score = 36.7 bits (85), Expect = 0.020
 Identities = 22/74 (29%), Positives = 41/74 (55%), Gaps = 2/74 (2%)

Query: 10  EKKKRIE--KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
           EK +R+E  +R  EE +EE E      E++EE   K K  +++ +K ++K +K + E + 
Sbjct: 288 EKIERLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEE 347

Query: 68  VKEIEESLKERKEK 81
           + E +  L +  E+
Sbjct: 348 LAEEKNELAKLLEE 361



 Score = 36.3 bits (84), Expect = 0.026
 Identities = 16/72 (22%), Positives = 43/72 (59%), Gaps = 1/72 (1%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E+ + +E+  +E E++    EE E+E +EE + + +  +   ++ ++  +K K  E+ ++
Sbjct: 274 EELRELERLLEELEEKIERLEELERE-IEELEEELEGLRALLEELEELLEKLKSLEERLE 332

Query: 70  EIEESLKERKEK 81
           ++EE L++ + +
Sbjct: 333 KLEEKLEKLESE 344



 Score = 35.5 bits (82), Expect = 0.043
 Identities = 21/73 (28%), Positives = 38/73 (52%), Gaps = 3/73 (4%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E  K +E+R KE E+     EE EKE  +  +  K+ ++  ++ K++  +     E+  +
Sbjct: 354 ELAKLLEERLKELEERL---EELEKELEKALERLKQLEEAIQELKEELAELSAALEEIQE 410

Query: 70  EIEESLKERKEKT 82
           E+EE  KE +E  
Sbjct: 411 ELEELEKELEELE 423



 Score = 35.5 bits (82), Expect = 0.043
 Identities = 19/73 (26%), Positives = 39/73 (53%), Gaps = 2/73 (2%)

Query: 8   AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
             E  K  +   K EE E    E  E  +   E  +++ K+ KK ++ ++++++++ EQ 
Sbjct: 176 LKEVIKEAKA--KIEELEGQLSELLEDIEDLLEALEEELKELKKLEEIQEEQEEEELEQE 233

Query: 68  VKEIEESLKERKE 80
           ++ +EE L E +E
Sbjct: 234 IEALEERLAELEE 246



 Score = 35.1 bits (81), Expect = 0.057
 Identities = 16/72 (22%), Positives = 43/72 (59%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E ++++EK     E+ E  +E+ + + ++EE  + + + ++ K+  ++ +  +  ++ ++
Sbjct: 530 ELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRLLRTRKEELE 589

Query: 70  EIEESLKERKEK 81
           E+ E LKE K+K
Sbjct: 590 ELRERLKELKKK 601



 Score = 35.1 bits (81), Expect = 0.059
 Identities = 18/75 (24%), Positives = 36/75 (48%), Gaps = 1/75 (1%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS-V 68
            +  R      EE  E+ +  E+  E +EE+  K + + ++  ++K +  K  +E    +
Sbjct: 307 LEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLEERLKEL 366

Query: 69  KEIEESLKERKEKTL 83
           +E  E L++  EK L
Sbjct: 367 EERLEELEKELEKAL 381



 Score = 35.1 bits (81), Expect = 0.063
 Identities = 20/77 (25%), Positives = 42/77 (54%), Gaps = 2/77 (2%)

Query: 8   AGEKKKRIEKRNKEE--EKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
             +K + I++  +EE  E+E    EE+  E  EE++  ++ K +  + +  + +  K  E
Sbjct: 214 ELKKLEEIQEEQEEEELEQEIEALEERLAELEEEKERLEELKARLLEIESLELEALKIRE 273

Query: 66  QSVKEIEESLKERKEKT 82
           + ++E+E  L+E +EK 
Sbjct: 274 EELRELERLLEELEEKI 290



 Score = 34.4 bits (79), Expect = 0.10
 Identities = 15/77 (19%), Positives = 43/77 (55%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E K++++ +  +EE  + E   +E +++ EE    + +K++ ++ +++ K+ KK+ + ++
Sbjct: 547 ELKEKLQLQQLKEELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKLKELE 606

Query: 70  EIEESLKERKEKTLTED 86
           E    L+E  +     +
Sbjct: 607 ERLSQLEELLQSLELSE 623



 Score = 34.4 bits (79), Expect = 0.11
 Identities = 20/83 (24%), Positives = 40/83 (48%), Gaps = 11/83 (13%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE---- 65
           E+ +++E R +E ++   E         E E+ +++ K+ KKK K+ +++  + EE    
Sbjct: 559 EELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKLKELEERLSQLEELLQS 618

Query: 66  -------QSVKEIEESLKERKEK 81
                    ++E EE L+   EK
Sbjct: 619 LELSEAENELEEAEEELESELEK 641



 Score = 34.0 bits (78), Expect = 0.15
 Identities = 22/76 (28%), Positives = 43/76 (56%), Gaps = 1/76 (1%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E ++ +  R KEE +   E EE EKE  E E+   +  + ++  K++ ++K +K E  ++
Sbjct: 485 ELEEEL-SREKEEAELREEIEELEKELRELEEELIELLELEEALKEELEEKLEKLENLLE 543

Query: 70  EIEESLKERKEKTLTE 85
           E+EE  ++ + + L E
Sbjct: 544 ELEELKEKLQLQQLKE 559



 Score = 33.6 bits (77), Expect = 0.18
 Identities = 21/72 (29%), Positives = 48/72 (66%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
             ++R+EK  ++ EK E+E EE  +E  E  K  +++ K+ +++ ++ +K+ +K  + +K
Sbjct: 326 SLEERLEKLEEKLEKLESELEELAEEKNELAKLLEERLKELEERLEELEKELEKALERLK 385

Query: 70  EIEESLKERKEK 81
           ++EE+++E KE+
Sbjct: 386 QLEEAIQELKEE 397



 Score = 33.6 bits (77), Expect = 0.20
 Identities = 17/89 (19%), Positives = 49/89 (55%), Gaps = 6/89 (6%)

Query: 10  EKKKRIEKRNK---EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKK---KKKKKKKKKKK 63
           E+K+R+E+      E E  E E  +  +E++ E +   ++ ++K +   + +++ ++ ++
Sbjct: 246 EEKERLEELKARLLEIESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELEE 305

Query: 64  EEQSVKEIEESLKERKEKTLTEDSQCDTM 92
           E + ++ + E L+E  EK  + + + + +
Sbjct: 306 ELEGLRALLEELEELLEKLKSLEERLEKL 334



 Score = 33.2 bits (76), Expect = 0.24
 Identities = 20/81 (24%), Positives = 45/81 (55%), Gaps = 2/81 (2%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E + R E    E+E  E E+E  E  ++EE   ++ ++K +K +   ++ ++ KE+  ++
Sbjct: 496 EAELREEIEELEKELRELEEELIELLELEEALKEELEEKLEKLENLLEELEELKEKLQLQ 555

Query: 70  EIEESLK--ERKEKTLTEDSQ 88
           +++E L+  E + + L E  +
Sbjct: 556 QLKEELRQLEDRLQELKELLE 576



 Score = 31.3 bits (71), Expect = 1.0
 Identities = 20/73 (27%), Positives = 44/73 (60%)

Query: 8   AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
             E ++ +EK    EE+ E  +E+ EK + E E+  ++K +  K  +++ K+ +++ E+ 
Sbjct: 314 LEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLEERLKELEERLEEL 373

Query: 68  VKEIEESLKERKE 80
            KE+E++L+  K+
Sbjct: 374 EKELEKALERLKQ 386



 Score = 31.3 bits (71), Expect = 1.0
 Identities = 15/87 (17%), Positives = 43/87 (49%), Gaps = 8/87 (9%)

Query: 11  KKKRIEKRNKEEEKEENEQEEKEK--------EDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
           +K+  +   + ++ EE  QE KE+        E+++EE  + +K+ ++ +++ ++ +++ 
Sbjct: 374 EKELEKALERLKQLEEAIQELKEELAELSAALEEIQEELEELEKELEELERELEELEEEI 433

Query: 63  KEEQSVKEIEESLKERKEKTLTEDSQC 89
           K+ +      ES +    +      +C
Sbjct: 434 KKLEEQINQLESKELMIAELAGAGEKC 460



 Score = 31.3 bits (71), Expect = 1.1
 Identities = 15/76 (19%), Positives = 41/76 (53%), Gaps = 3/76 (3%)

Query: 9   GEKKKRIEKRNKEEE---KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
            E +K + +  + E    +EE  +E++E E  EE +  +K+ ++ +++  +  + ++  +
Sbjct: 469 EEHEKELLELYELELEELEEELSREKEEAELREEIEELEKELRELEEELIELLELEEALK 528

Query: 66  QSVKEIEESLKERKEK 81
           + ++E  E L+   E+
Sbjct: 529 EELEEKLEKLENLLEE 544



 Score = 30.5 bits (69), Expect = 1.5
 Identities = 15/72 (20%), Positives = 41/72 (56%), Gaps = 3/72 (4%)

Query: 13  KRIEKRNKE--EEKEENEQEEKE-KEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           + +E+R  E  EEKE  E+ +    E    E    K ++++ ++ ++  ++ +++ + ++
Sbjct: 235 EALEERLAELEEEKERLEELKARLLEIESLELEALKIREEELRELERLLEELEEKIERLE 294

Query: 70  EIEESLKERKEK 81
           E+E  ++E +E+
Sbjct: 295 ELEREIEELEEE 306



 Score = 30.1 bits (68), Expect = 2.3
 Identities = 18/65 (27%), Positives = 42/65 (64%)

Query: 17  KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLK 76
           +  +E EKE  E  E E E++EEE +++K++ + +++ ++ +K+ ++ E+ + E+ E  +
Sbjct: 466 ELPEEHEKELLELYELELEELEEELSREKEEAELREEIEELEKELRELEEELIELLELEE 525

Query: 77  ERKEK 81
             KE+
Sbjct: 526 ALKEE 530



 Score = 29.7 bits (67), Expect = 3.0
 Identities = 11/66 (16%), Positives = 35/66 (53%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
             + + EEK E  +   E+ +  +EK + ++ K++ ++ + + ++ K+  + ++ +    
Sbjct: 526 ALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRLLRTRK 585

Query: 76  KERKEK 81
           +E +E 
Sbjct: 586 EELEEL 591



 Score = 29.7 bits (67), Expect = 3.0
 Identities = 17/73 (23%), Positives = 35/73 (47%), Gaps = 3/73 (4%)

Query: 11  KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
           ++   E     EE EE  ++  E E + EE   +K + ++ KK+ +K +K  +    ++E
Sbjct: 691 EQLEEELEQLREELEELLKKLGEIEQLIEELESRKAELEELKKELEKLEKALEL---LEE 747

Query: 71  IEESLKERKEKTL 83
           + E L +   +  
Sbjct: 748 LREKLGKAGLRAD 760



 Score = 29.0 bits (65), Expect = 5.2
 Identities = 15/73 (20%), Positives = 44/73 (60%), Gaps = 1/73 (1%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
            + +++  + + EE  +   EE E++ VEE + + +++ ++ + +++ ++K ++ EQ  +
Sbjct: 637 SELEKLNLQAELEELLQAALEELEEK-VEELEAEIRRELQRIENEEQLEEKLEELEQLEE 695

Query: 70  EIEESLKERKEKT 82
           E+E+  +E +E  
Sbjct: 696 ELEQLREELEELL 708


>gnl|CDD|203848 pfam08079, Ribosomal_L30_N, Ribosomal L30 N-terminal domain.
          This presumed domain is found at the N-terminus of
          Ribosomal L30 proteins and has been termed RL30NT or
          NUC018.
          Length = 71

 Score = 34.9 bits (81), Expect = 0.006
 Identities = 17/41 (41%), Positives = 22/41 (53%), Gaps = 2/41 (4%)

Query: 40 EKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
           K  KK+  KK  +KKK+K   K+ E+ VKE      ER E
Sbjct: 14 AKRAKKRAAKKAARKKKRKLIFKRAEKYVKEYRA--AERDE 52



 Score = 29.5 bits (67), Expect = 0.40
 Identities = 15/48 (31%), Positives = 26/48 (54%), Gaps = 2/48 (4%)

Query: 40 EKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKE--RKEKTLTE 85
          ++N+K + K+ KK+  KK  +KKK +   K  E+ +KE    E+    
Sbjct: 7  KRNEKLRAKRAKKRAAKKAARKKKRKLIFKRAEKYVKEYRAAERDEIR 54



 Score = 26.8 bits (60), Expect = 4.4
 Identities = 13/39 (33%), Positives = 22/39 (56%), Gaps = 2/39 (5%)

Query: 43 KKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
          KK+K+ +K + K+ KK+  KK  +  K     + +R EK
Sbjct: 4  KKRKRNEKLRAKRAKKRAAKKAARKKKRKL--IFKRAEK 40


>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
           subunit G; Reviewed.
          Length = 197

 Score = 37.4 bits (86), Expect = 0.006
 Identities = 19/81 (23%), Positives = 50/81 (61%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           +KKK +  R ++ EK   E+EE E+    E + +K  +++ +++++++++++  EEQS  
Sbjct: 116 KKKKSLIIRQEQIEKARQEREELEERMEWERREEKIDEREDQEEQEREREEQTIEEQSDD 175

Query: 70  EIEESLKERKEKTLTEDSQCD 90
              E +++ + +T ++D + +
Sbjct: 176 SEHEIIEQDESETESDDDKTE 196



 Score = 29.3 bits (65), Expect = 2.3
 Identities = 16/77 (20%), Positives = 49/77 (63%), Gaps = 8/77 (10%)

Query: 14  RIEKRNKEEEKEENEQEEK---EKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
           RI  + +  +K++ ++++     +E +E+ + ++++ +++ + +++++K  ++E+Q    
Sbjct: 102 RIRDQLRSVKKDDIKKKKSLIIRQEQIEKARQEREELEERMEWERREEKIDEREDQ---- 157

Query: 71  IEESLKERKEKTLTEDS 87
            EE  +ER+E+T+ E S
Sbjct: 158 -EEQEREREEQTIEEQS 173



 Score = 28.1 bits (62), Expect = 5.1
 Identities = 12/60 (20%), Positives = 36/60 (60%)

Query: 8   AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
           A ++++ +E+R + E +EE   E +++E+ E E+ ++  +++    + +  ++ + E +S
Sbjct: 131 ARQEREELEERMEWERREEKIDEREDQEEQEREREEQTIEEQSDDSEHEIIEQDESETES 190



 Score = 28.1 bits (62), Expect = 5.9
 Identities = 11/53 (20%), Positives = 31/53 (58%)

Query: 13  KRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
           +R E++  E E +E ++ E+E++ +EE+ +  + +  ++ + + +    K E+
Sbjct: 145 ERREEKIDEREDQEEQEREREEQTIEEQSDDSEHEIIEQDESETESDDDKTEK 197


>gnl|CDD|219547 pfam07741, BRF1, Brf1-like TBP-binding domain.  This region
          covers both the Brf homology II and III regions. This
          region is involved in binding TATA binding protein.
          Length = 95

 Score = 35.3 bits (82), Expect = 0.006
 Identities = 20/53 (37%), Positives = 32/53 (60%), Gaps = 3/53 (5%)

Query: 28 EQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE-IEESLKERK 79
          EQEEKE +   +E N   KKKKK+  K KKK+ +     +  E +++ L+++K
Sbjct: 31 EQEEKELKQKADEGNNSGKKKKKR--KAKKKRDEAGPASTAAEAVKKMLQKKK 81



 Score = 34.2 bits (79), Expect = 0.015
 Identities = 20/72 (27%), Positives = 33/72 (45%), Gaps = 11/72 (15%)

Query: 27 NEQEEKEKEDV-----------EEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
          +E+E K KE +           +EEK  K+K  +     KKKKK+K K+++       + 
Sbjct: 10 SEEEVKLKERIWDELNKDYLEEQEEKELKQKADEGNNSGKKKKKRKAKKKRDEAGPASTA 69

Query: 76 KERKEKTLTEDS 87
           E  +K L +  
Sbjct: 70 AEAVKKMLQKKK 81


>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
          Length = 508

 Score = 38.2 bits (89), Expect = 0.006
 Identities = 16/85 (18%), Positives = 42/85 (49%), Gaps = 8/85 (9%)

Query: 5   RGGAGEKKKRIEKRNKE------EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKK--K 56
           +  A EK +      KE       ++ +  + E+E+   EEE+  +K+++   + +K   
Sbjct: 43  QKEAEEKLEAALLEAKELLLRERNQQRQEARREREELQREEERLVQKEEQLDARAEKLDN 102

Query: 57  KKKKKKKEEQSVKEIEESLKERKEK 81
            + + ++ E+++   E  L+E +++
Sbjct: 103 LENQLEEREKALSARELELEELEKQ 127


>gnl|CDD|223649 COG0576, GrpE, Molecular chaperone GrpE (heat shock protein)
          [Posttranslational modification, protein turnover,
          chaperones].
          Length = 193

 Score = 36.9 bits (86), Expect = 0.007
 Identities = 22/91 (24%), Positives = 47/91 (51%), Gaps = 11/91 (12%)

Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE----- 64
             +  E+  K EE+E  E+E +E+ ++EEE+ +  + + + ++ K K  + + E     
Sbjct: 11 PDAEETEEAEKSEEEEAEEEEPEEENELEEEQQEIAELEAQLEELKDKYLRAQAEFENLR 70

Query: 65 EQSVKEIEESLKERKEKTLTEDSQCDTMIPV 95
          +++ +E EE+ K   EK   +      ++PV
Sbjct: 71 KRTEREREEAKKYAIEKFAKD------LLPV 95



 Score = 35.0 bits (81), Expect = 0.028
 Identities = 17/72 (23%), Positives = 43/72 (59%), Gaps = 2/72 (2%)

Query: 12 KKRIEKRNKEEEKEENEQEEKE--KEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
          K++  +    EE EE E+ E+E  +E+  EE+N+ ++++++  + + + ++ K +    +
Sbjct: 4  KEQKTEEPDAEETEEAEKSEEEEAEEEEPEEENELEEEQQEIAELEAQLEELKDKYLRAQ 63

Query: 70 EIEESLKERKEK 81
             E+L++R E+
Sbjct: 64 AEFENLRKRTER 75



 Score = 31.5 bits (72), Expect = 0.41
 Identities = 15/58 (25%), Positives = 40/58 (68%)

Query: 26 ENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTL 83
          + EQ+ +E +  E E+ +K ++++ ++++ +++ + ++E+Q + E+E  L+E K+K L
Sbjct: 3  DKEQKTEEPDAEETEEAEKSEEEEAEEEEPEEENELEEEQQEIAELEAQLEELKDKYL 60


>gnl|CDD|240576 cd12932, RRP7_like, RRP7 domain ribosomal RNA-processing protein 7
           (Rrp7p), ribosomal RNA-processing protein 7 homolog A
           (Rrp7A), and similar proteins.  This CD corresponds to
           the RRP7 domain of Rrp7p and Rrp7A. Rrp7p is encoded by
           YCL031C gene from Saccharomyces cerevisiae. It is an
           essential yeast protein involved in pre-rRNA processing
           and ribosome assembly, and is speculated to be required
           for correct assembly of rpS27 into the pre-ribosomal
           particle. Rrp7A, also termed gastric cancer antigen
           Zg14, is the Rrp7p homolog mainly found in Metazoans.
           The cellular function of Rrp7A remains unclear
           currently. Both Rrp7p and Rrp7A harbor an N-terminal RNA
           recognition motif (RRM), also termed RBD (RNA binding
           domain) or RNP (ribonucleoprotein domain), and a
           C-terminal RRP7 domain.
          Length = 118

 Score = 35.7 bits (83), Expect = 0.007
 Identities = 25/74 (33%), Positives = 35/74 (47%), Gaps = 23/74 (31%)

Query: 16  EKRNKEEEKEENEQEEKEKED--------------VEEEKNKKKKKKKKKKKKKKKKKK- 60
           +KR +EE++E  E   +  ED                EE  + K K+K+KKKKKKK+ + 
Sbjct: 31  DKREEEEKEEAKEARNEPDEDGFVTVTRGGRKGKTAREEAVEAKAKEKEKKKKKKKELED 90

Query: 61  --------KKKEEQ 66
                   KKKEE 
Sbjct: 91  FYRFQIREKKKEEL 104



 Score = 27.2 bits (61), Expect = 5.7
 Identities = 16/70 (22%), Positives = 33/70 (47%), Gaps = 19/70 (27%)

Query: 21 EEEKEENEQEEKEKEDVEEE--------KNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIE 72
          ++ +EE ++E KE  +  +E          +K K  +++  + K K+K+KK+        
Sbjct: 31 DKREEEEKEEAKEARNEPDEDGFVTVTRGGRKGKTAREEAVEAKAKEKEKKK-------- 82

Query: 73 ESLKERKEKT 82
             K++KE  
Sbjct: 83 ---KKKKELE 89


>gnl|CDD|149438 pfam08374, Protocadherin, Protocadherin.  The structure of
           protocadherins is similar to that of classic cadherins
           (pfam00028), but particularly on the cytoplasmic domains
           they also have some unique features. They are expressed
           in a variety of organisms and are found in high
           concentrations in the brain where they seem to be
           localised mainly at cell-cell contact sites. Their
           expression seems to be developmentally regulated.
          Length = 223

 Score = 37.1 bits (86), Expect = 0.007
 Identities = 28/102 (27%), Positives = 45/102 (44%), Gaps = 15/102 (14%)

Query: 31  EKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES-----LKERKEKTLTE 85
           +  K++ E+  +  ++ K+KKKKK KKKK  K    +   +EES     + E+K +TL+ 
Sbjct: 73  QAGKKETEDWFSPNQENKQKKKKKDKKKKSPKSLLLNFVTVEESKPDDEVHEQKSETLSL 132

Query: 86  DSQCDTMIPVFKKGVLYKGIYCPSLTNSFREPSLEMSYQRYS 127
             + +             G Y P  T     P L   Y+  S
Sbjct: 133 PIELEEQSM---------GRYLP-TTFKPGSPDLARHYKSAS 164


>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
          Length = 1465

 Score = 38.3 bits (89), Expect = 0.007
 Identities = 18/60 (30%), Positives = 29/60 (48%)

Query: 13   KRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIE 72
            K ++K +KE+ K E  +E+ ++     E    KK  ++  KK   KK  KK  +S    E
Sbjct: 1163 KELDKLDKEDAKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETTEE 1222



 Score = 36.0 bits (83), Expect = 0.035
 Identities = 21/97 (21%), Positives = 37/97 (38%), Gaps = 12/97 (12%)

Query: 3    KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNK------------KKKKKKK 50
              RG +G  KK   +  K+   ++  ++  E E  EE                K K +  
Sbjct: 1186 AARGESGAAKKVSRQAPKKPAPKKTTKKASESETTEETYGSSAMETENVAEVVKPKGRAG 1245

Query: 51   KKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDS 87
             KKK     K+K+EE  + ++++ L      +    S
Sbjct: 1246 AKKKAPAAAKEKEEEDEILDLKDRLAAYNLDSAPAQS 1282



 Score = 34.5 bits (79), Expect = 0.099
 Identities = 17/89 (19%), Positives = 36/89 (40%)

Query: 15   IEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES 74
            +EK   + +KE+ + EE  ++        +    KK  ++  KK   KK  +   E E +
Sbjct: 1161 LEKELDKLDKEDAKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETT 1220

Query: 75   LKERKEKTLTEDSQCDTMIPVFKKGVLYK 103
             +      +  ++  + + P  + G   K
Sbjct: 1221 EETYGSSAMETENVAEVVKPKGRAGAKKK 1249



 Score = 33.7 bits (77), Expect = 0.16
 Identities = 14/62 (22%), Positives = 32/62 (51%), Gaps = 7/62 (11%)

Query: 20   KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKK-------KKKKKKKKKKKEEQSVKEIE 72
            K+ +  E E ++ +KED + E+ ++K ++   +       K  ++  KK   +++ K+  
Sbjct: 1156 KDLDALEKELDKLDKEDAKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKAS 1215

Query: 73   ES 74
            ES
Sbjct: 1216 ES 1217


>gnl|CDD|227492 COG5163, NOP7, Protein required for biogenesis of the 60S ribosomal
           subunit [Translation, ribosomal structure and
           biogenesis].
          Length = 591

 Score = 37.7 bits (87), Expect = 0.008
 Identities = 18/64 (28%), Positives = 36/64 (56%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
           E  + +EE +  ++ E E + ++  +  +  K   K K KK+K  +++EE+ +K I  S 
Sbjct: 492 EDDDDDEELQAQKELELEAQGIKYSETSEADKDVNKSKNKKRKVDEEEEEKKLKMIMMSN 551

Query: 76  KERK 79
           K++K
Sbjct: 552 KQKK 555



 Score = 37.0 bits (85), Expect = 0.013
 Identities = 21/78 (26%), Positives = 40/78 (51%), Gaps = 8/78 (10%)

Query: 14  RIEKRNKE----EEKEENEQEEKEKEDVEEE----KNKKKKKKKKKKKKKKKKKKKKKEE 65
           R E    E    EE +++++E + ++++E E    K  +  +  K   K K KK+K  EE
Sbjct: 479 RFEDVRYEHVAGEEDDDDDEELQAQKELELEAQGIKYSETSEADKDVNKSKNKKRKVDEE 538

Query: 66  QSVKEIEESLKERKEKTL 83
           +  K+++  +   K+K L
Sbjct: 539 EEEKKLKMIMMSNKQKKL 556



 Score = 36.6 bits (84), Expect = 0.021
 Identities = 21/61 (34%), Positives = 36/61 (59%)

Query: 21  EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
           E +K+ N+ + K+++  EEE+ KK K      K+KK  KK K      +E  E+LK++K+
Sbjct: 520 EADKDVNKSKNKKRKVDEEEEEKKLKMIMMSNKQKKLYKKMKYSNAKKEEQAENLKKKKK 579

Query: 81  K 81
           +
Sbjct: 580 Q 580



 Score = 32.7 bits (74), Expect = 0.33
 Identities = 19/72 (26%), Positives = 37/72 (51%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E  K + K   ++ K + E+EEK+ + +     +KK  KK K    KK+++ +  ++  K
Sbjct: 520 EADKDVNKSKNKKRKVDEEEEEKKLKMIMMSNKQKKLYKKMKYSNAKKEEQAENLKKKKK 579

Query: 70  EIEESLKERKEK 81
           +I +  K   +K
Sbjct: 580 QIAKQKKLDSKK 591


>gnl|CDD|227925 COG5638, COG5638, Uncharacterized conserved protein [Function
           unknown].
          Length = 622

 Score = 37.8 bits (87), Expect = 0.008
 Identities = 19/77 (24%), Positives = 38/77 (49%), Gaps = 6/77 (7%)

Query: 21  EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKK------KKKKKKKKKKKEEQSVKEIEES 74
            ++K E   E+ +++  E ++N+K+++ KK K       K KK+   KK ++    IE +
Sbjct: 426 SDDKLETTIEKLDRKLRERQENRKERQLKKTKDDSDVDLKDKKESINKKNKKGKHAIERT 485

Query: 75  LKERKEKTLTEDSQCDT 91
              ++E  L +    D 
Sbjct: 486 AASKEELELIKADDEDD 502



 Score = 34.0 bits (77), Expect = 0.14
 Identities = 19/68 (27%), Positives = 32/68 (47%), Gaps = 2/68 (2%)

Query: 8   AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKK--KKKKKKKEE 65
             EK  R  +  +E  KE   ++ K+  DV+ +  K+   KK KK K    +    K+E 
Sbjct: 433 TIEKLDRKLRERQENRKERQLKKTKDDSDVDLKDKKESINKKNKKGKHAIERTAASKEEL 492

Query: 66  QSVKEIEE 73
           + +K  +E
Sbjct: 493 ELIKADDE 500



 Score = 32.8 bits (74), Expect = 0.32
 Identities = 13/62 (20%), Positives = 32/62 (51%), Gaps = 2/62 (3%)

Query: 27  NEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTED 86
           NE E  +K +   EK  +K +++++ +K+++ KK K  + S  ++++  +   +K     
Sbjct: 422 NESESDDKLETTIEKLDRKLRERQENRKERQLKKTK--DDSDVDLKDKKESINKKNKKGK 479

Query: 87  SQ 88
             
Sbjct: 480 HA 481



 Score = 31.7 bits (71), Expect = 0.67
 Identities = 30/100 (30%), Positives = 45/100 (45%), Gaps = 11/100 (11%)

Query: 10  EKKKRIEKRNKEEE---------KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
           +KK+ I K+NK+ +         KEE E  + + ED E+  +   K   K +K KK +K 
Sbjct: 466 DKKESINKKNKKGKHAIERTAASKEELELIKADDEDDEQLDHFDMKSILKAEKFKKNRKL 525

Query: 61  KKKEEQSVKEIEESLKERKEKTLTEDSQ--CDTMIPVFKK 98
           KKK     +      K+ +   + ED     D   P FKK
Sbjct: 526 KKKASNLEEGFVFDPKDPRFVAIFEDHNFAIDPTHPEFKK 565



 Score = 28.2 bits (62), Expect = 8.6
 Identities = 18/90 (20%), Positives = 45/90 (50%), Gaps = 9/90 (10%)

Query: 3   KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
           K      E+++  ++R  ++ K++++ + K+K++   +KNKK K   ++    K++ +  
Sbjct: 436 KLDRKLRERQENRKERQLKKTKDDSDVDLKDKKESINKKNKKGKHAIERTAASKEELELI 495

Query: 63  KEEQ---------SVKEIEESLKERKEKTL 83
           K +           +K I ++ K +K + L
Sbjct: 496 KADDEDDEQLDHFDMKSILKAEKFKKNRKL 525


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 37.7 bits (87), Expect = 0.008
 Identities = 15/48 (31%), Positives = 33/48 (68%)

Query: 13  KRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
           K  +KR +  EK + E E+K +E+ E EK K+K+++++++++ ++  K
Sbjct: 576 KLAKKREEAVEKAKREAEQKAREEREREKEKEKEREREREREAERAAK 623



 Score = 35.8 bits (82), Expect = 0.041
 Identities = 17/46 (36%), Positives = 28/46 (60%), Gaps = 1/46 (2%)

Query: 3   KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKK 48
           K+R  A EK KR  ++   EE+E  +++EKE+E  E E+  ++  K
Sbjct: 579 KKREEAVEKAKREAEQKAREEREREKEKEKERER-EREREAERAAK 623



 Score = 33.9 bits (77), Expect = 0.14
 Identities = 17/65 (26%), Positives = 36/65 (55%), Gaps = 2/65 (3%)

Query: 6   GGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
             +   KKR E    E+ K E EQ+ +E+ + E+EK K+++++++++ ++  K      E
Sbjct: 573 ASSKLAKKREEA--VEKAKREAEQKAREEREREKEKEKEREREREREAERAAKASSSSHE 630

Query: 66  QSVKE 70
             + E
Sbjct: 631 SRMSE 635


>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
          Length = 1437

 Score = 37.9 bits (89), Expect = 0.009
 Identities = 17/59 (28%), Positives = 31/59 (52%), Gaps = 3/59 (5%)

Query: 19  NKEEEKEENEQEEKEKEDVEEEKN---KKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES 74
           +  +E+ E  + +KE+ED +  K      KK + +KKK+ K    K+   Q  K+I++ 
Sbjct: 166 DDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPVQIGKKIDKE 224



 Score = 36.4 bits (85), Expect = 0.026
 Identities = 16/51 (31%), Positives = 25/51 (49%)

Query: 20  KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
             E  +  E+ EK +   EEE  K  K+  +  KK + +KKK+ +    KE
Sbjct: 162 DFEIDDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKE 212



 Score = 36.0 bits (84), Expect = 0.039
 Identities = 22/71 (30%), Positives = 33/71 (46%), Gaps = 7/71 (9%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK-------KK 62
           E     E+  K E ++E E E+  KE +E  K  + +KKK+ K    K+          K
Sbjct: 164 EIDDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPVQIGKKIDK 223

Query: 63  KEEQSVKEIEE 73
           +E   +KEI E
Sbjct: 224 EEITPMKEINE 234



 Score = 33.3 bits (77), Expect = 0.21
 Identities = 14/63 (22%), Positives = 32/63 (50%), Gaps = 2/63 (3%)

Query: 20  KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKK--KKKKKKKKKKKKEEQSVKEIEESLKE 77
             +   E +  ++E E  E +K ++ +K  K+  +  KK + +KKK+ ++    E  ++ 
Sbjct: 158 ILKIDFEIDDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPVQI 217

Query: 78  RKE 80
            K+
Sbjct: 218 GKK 220


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 37.7 bits (87), Expect = 0.009
 Identities = 18/39 (46%), Positives = 28/39 (71%)

Query: 6   GGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKK 44
           GG G      E+  +EEE+EE E+EE+E+E+ EEE+N++
Sbjct: 854 GGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892



 Score = 33.8 bits (77), Expect = 0.14
 Identities = 13/31 (41%), Positives = 24/31 (77%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKK 46
           +   +EEE+EE E+EE+E+E+ EEE+ + ++
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892



 Score = 33.4 bits (76), Expect = 0.19
 Identities = 13/30 (43%), Positives = 26/30 (86%)

Query: 19  NKEEEKEENEQEEKEKEDVEEEKNKKKKKK 48
           ++EEE+EE E+EE+E+E+ EEE+ +++ ++
Sbjct: 863 SEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892



 Score = 32.7 bits (74), Expect = 0.36
 Identities = 13/31 (41%), Positives = 26/31 (83%)

Query: 19  NKEEEKEENEQEEKEKEDVEEEKNKKKKKKK 49
           + EEE+EE E+EE+E+E+ EEE+ ++++ ++
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892



 Score = 31.5 bits (71), Expect = 0.83
 Identities = 25/97 (25%), Positives = 46/97 (47%), Gaps = 1/97 (1%)

Query: 3   KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
           +R G  GE+    E  N EE   E EQE  E E   E +++ +   ++K +++ + + + 
Sbjct: 643 ERTGEEGERPTEAEGENGEESGGEAEQEG-ETETKGENESEGEIPAERKGEQEGEGEIEA 701

Query: 63  KEEQSVKEIEESLKERKEKTLTEDSQCDTMIPVFKKG 99
           KE     E E    E + +T  E ++ +  I   ++G
Sbjct: 702 KEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEG 738



 Score = 30.3 bits (68), Expect = 1.8
 Identities = 11/31 (35%), Positives = 26/31 (83%)

Query: 21  EEEKEENEQEEKEKEDVEEEKNKKKKKKKKK 51
           + E+EE E+EE+E+E+ EEE+ ++++++ ++
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892



 Score = 28.8 bits (64), Expect = 5.9
 Identities = 13/39 (33%), Positives = 27/39 (69%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKK 48
           E+++  E+  +EEE+EE E+EE+ +E +  E  + ++K+
Sbjct: 867 EEEEEEEEEEEEEEEEEEEEEEENEEPLSLEWPETRQKQ 905



 Score = 28.8 bits (64), Expect = 6.0
 Identities = 10/31 (32%), Positives = 26/31 (83%)

Query: 23  EKEENEQEEKEKEDVEEEKNKKKKKKKKKKK 53
           + EE E+EE+E+E+ EEE+ ++++++++ ++
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892



 Score = 28.4 bits (63), Expect = 8.4
 Identities = 17/73 (23%), Positives = 37/73 (50%), Gaps = 2/73 (2%)

Query: 21  EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS--VKEIEESLKER 78
           E +++E+E E +  ED E + ++  + K + + + +  +K + E QS    +  E   E 
Sbjct: 773 EGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDET 832

Query: 79  KEKTLTEDSQCDT 91
            E+ L  ++Q + 
Sbjct: 833 GEQELNAENQGEA 845


>gnl|CDD|221333 pfam11942, Spt5_N, Spt5 transcription elongation factor, acidic
          N-terminal.  This is the very acidic N-terminal region
          of the early transcription elongation factor Spt5. The
          Spt5-Spt4 complex regulates early transcription
          elongation by RNA polymerase II and has an imputed role
          in pre-mRNA processing via its physical association
          with mRNA capping enzymes. The actual function of this
          N-terminal domain is not known although it is
          dispensable for binding to Spt4.
          Length = 92

 Score = 34.7 bits (80), Expect = 0.009
 Identities = 17/68 (25%), Positives = 41/68 (60%), Gaps = 2/68 (2%)

Query: 22 EEKEENEQEEKEKEDVEEEKN--KKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERK 79
          +E+EE E+EE + ED+ +E     + + +  ++ ++  ++++K+EE+  +E+ E L++R 
Sbjct: 10 DEEEEEEEEEDDLEDLSDEDEFIDEAEAEDDRRHRRLDRRREKEEEEDAEELAEYLRKRY 69

Query: 80 EKTLTEDS 87
                D+
Sbjct: 70 GDEADADA 77


>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
          Length = 1066

 Score = 37.6 bits (87), Expect = 0.009
 Identities = 16/56 (28%), Positives = 29/56 (51%), Gaps = 2/56 (3%)

Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
          E+KK+  K  K +EKE  + +  +KE   + + ++        KK +KK +K+  E
Sbjct: 19 ERKKK--KEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKKSRKRDVE 72



 Score = 37.2 bits (86), Expect = 0.013
 Identities = 18/42 (42%), Positives = 27/42 (64%)

Query: 28 EQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
          E E ++K   EEE  +KKKK++K K+K+ KK K  ++E   K
Sbjct: 5  ESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAK 46



 Score = 37.2 bits (86), Expect = 0.014
 Identities = 20/62 (32%), Positives = 34/62 (54%)

Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERK 79
           E EK+   +EE E++  +EEK K+K+ KK K  +K+ K K + ++ S         E+K
Sbjct: 6  SEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKK 65

Query: 80 EK 81
           +
Sbjct: 66 SR 67



 Score = 36.0 bits (83), Expect = 0.035
 Identities = 20/82 (24%), Positives = 44/82 (53%), Gaps = 1/82 (1%)

Query: 13 KRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIE 72
           R E   +++   E E E K+K++ E+ K K+ KK K  +K+ K K + ++        +
Sbjct: 2  SRTESEAEKKILTEEELERKKKKE-EKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPK 60

Query: 73 ESLKERKEKTLTEDSQCDTMIP 94
          +S K+ +++ + +++  D + P
Sbjct: 61 KSEKKSRKRDVEDENPEDFIDP 82



 Score = 32.9 bits (75), Expect = 0.29
 Identities = 21/90 (23%), Positives = 41/90 (45%), Gaps = 9/90 (10%)

Query: 5  RGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKK----- 59
               E +K+I    + E K++ E++ KEK   E +K K  +K+ K K + ++       
Sbjct: 2  SRTESEAEKKILTEEELERKKKKEEKAKEK---ELKKLKAAQKEAKAKLQAQQASDGTNV 58

Query: 60 -KKKKEEQSVKEIEESLKERKEKTLTEDSQ 88
           KK +++   +++E+   E      T   Q
Sbjct: 59 PKKSEKKSRKRDVEDENPEDFIDPDTPFGQ 88


>gnl|CDD|240254 PTZ00069, PTZ00069, 60S ribosomal protein L5; Provisional.
          Length = 300

 Score = 37.0 bits (86), Expect = 0.010
 Identities = 18/42 (42%), Positives = 24/42 (57%)

Query: 40  EKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
            K KKKKKKKKK   KK K KK   +Q    ++    +R+E+
Sbjct: 253 SKVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKKAQRRER 294



 Score = 33.9 bits (78), Expect = 0.10
 Identities = 15/48 (31%), Positives = 31/48 (64%)

Query: 25  EENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIE 72
            + ++++K+K+ V  +K K KK   K++K + K KK ++ E+  K+I+
Sbjct: 253 SKVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKKAQRRERLQKKIK 300



 Score = 33.1 bits (76), Expect = 0.17
 Identities = 24/74 (32%), Positives = 31/74 (41%), Gaps = 6/74 (8%)

Query: 8   AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
           AG     +E   K+          K K      K KKKKKK   KK K KK   K+ +  
Sbjct: 230 AGVGPDSLEDMYKKAHAAIRANPSKVK------KKKKKKKKVVHKKYKTKKLTGKQRKAR 283

Query: 68  VKEIEESLKERKEK 81
           VK  +   +ER +K
Sbjct: 284 VKAKKAQRRERLQK 297



 Score = 29.7 bits (67), Expect = 2.2
 Identities = 20/79 (25%), Positives = 37/79 (46%), Gaps = 8/79 (10%)

Query: 13  KRIEKRNKEEEKEENEQEEKEKEDVE--EEKNKKKKKK------KKKKKKKKKKKKKKKE 64
           K++++ + ++ K++  +  K     +  E+  KK          K KKKKKKKKK   K+
Sbjct: 210 KQLKEEDPDKYKKQFSKYIKAGVGPDSLEDMYKKAHAAIRANPSKVKKKKKKKKKVVHKK 269

Query: 65  EQSVKEIEESLKERKEKTL 83
            ++ K   +  K R +   
Sbjct: 270 YKTKKLTGKQRKARVKAKK 288



 Score = 28.1 bits (63), Expect = 6.7
 Identities = 13/59 (22%), Positives = 32/59 (54%), Gaps = 3/59 (5%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKK---KKKKKKKKKKKKKKKKKKKEEQS 67
           KK          K + ++++K+K   ++ K KK   K++K + K KK +++++ +++  
Sbjct: 242 KKAHAAIRANPSKVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKKAQRRERLQKKIK 300


>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon. 
          Length = 431

 Score = 37.3 bits (86), Expect = 0.010
 Identities = 21/68 (30%), Positives = 44/68 (64%), Gaps = 1/68 (1%)

Query: 10  EKKKRIEKRNK-EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
            KKKR E+R   EEE++  +QEE +++  EEE+ ++ K++ ++++ +  +K++K  E  +
Sbjct: 219 LKKKREERRKVLEEEEQRRKQEEADRKSREEEEKRRLKEEIERRRAEAAEKRQKVPEDGL 278

Query: 69  KEIEESLK 76
            E ++  K
Sbjct: 279 SEDKKPFK 286



 Score = 36.9 bits (85), Expect = 0.015
 Identities = 18/69 (26%), Positives = 41/69 (59%), Gaps = 2/69 (2%)

Query: 10  EKKKRIEKRNKEEEKEENEQ--EEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
           E++++ E R + EE EE E   + ++K D  + +  +K++K+ + ++++K K+   EE +
Sbjct: 112 EEEEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKEPEPEEEEKPKRGSLEENN 171

Query: 68  VKEIEESLK 76
            + +   LK
Sbjct: 172 GEFMTHKLK 180



 Score = 36.2 bits (83), Expect = 0.024
 Identities = 32/126 (25%), Positives = 67/126 (53%), Gaps = 9/126 (7%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
           K++ ++   E E+ + ++EE+ K  +EEE+ ++K+++  +K +++++K++ KEE   +  
Sbjct: 206 KQKQQEAALELEELKKKREERRKV-LEEEEQRRKQEEADRKSREEEEKRRLKEEIERRRA 264

Query: 72  EESLKERKEKTLTEDSQCDTMIPVFKKGVLYKGIYCPSLTNSFREPSLEMSYQRYSHRQR 131
           E + K +K   + ED   +   P   K    KG    SL  + R   L  S Q+ S ++ 
Sbjct: 265 EAAEKRQK---VPEDGLSEDKKPF--KCFTPKG---SSLKITERAEFLNRSIQKSSVKKS 316

Query: 132 QKSLII 137
              L++
Sbjct: 317 HPPLVV 322



 Score = 34.2 bits (78), Expect = 0.090
 Identities = 20/68 (29%), Positives = 32/68 (47%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E   + E++N   + EE ++EEKE E  EEEK K+   ++   +    K K  +   S  
Sbjct: 130 EGVTKSEQKNDWRDAEECQKEEKEPEPEEEEKPKRGSLEENNGEFMTHKLKHTENTFSRG 189

Query: 70  EIEESLKE 77
             E +  E
Sbjct: 190 GAEGAQVE 197



 Score = 31.9 bits (72), Expect = 0.58
 Identities = 21/75 (28%), Positives = 43/75 (57%), Gaps = 1/75 (1%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
           E    E  KE  + ++K++E   E +  KKK+++++K  +++++++K+EE   K  EE  
Sbjct: 192 EGAQVEAGKEFEKLKQKQQEAALELEELKKKREERRKVLEEEEQRRKQEEADRKSREEEE 251

Query: 76  KER-KEKTLTEDSQC 89
           K R KE+     ++ 
Sbjct: 252 KRRLKEEIERRRAEA 266



 Score = 30.4 bits (68), Expect = 1.5
 Identities = 20/90 (22%), Positives = 40/90 (44%), Gaps = 18/90 (20%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKK--------------- 54
           +K    +    ++E++E E EE+EK      +    +    K K                
Sbjct: 137 QKNDWRDAEECQKEEKEPEPEEEEKPKRGSLEENNGEFMTHKLKHTENTFSRGGAEGAQV 196

Query: 55  ---KKKKKKKKKEEQSVKEIEESLKERKEK 81
              K+ +K K+K++++  E+EE  K+R+E+
Sbjct: 197 EAGKEFEKLKQKQQEAALELEELKKKREER 226



 Score = 29.2 bits (65), Expect = 3.8
 Identities = 19/79 (24%), Positives = 43/79 (54%), Gaps = 4/79 (5%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE---- 65
           E    +E+  K+ E+     EE+E+   +EE ++K +++++K++ K++ ++++ E     
Sbjct: 211 EAALELEELKKKREERRKVLEEEEQRRKQEEADRKSREEEEKRRLKEEIERRRAEAAEKR 270

Query: 66  QSVKEIEESLKERKEKTLT 84
           Q V E   S  ++  K  T
Sbjct: 271 QKVPEDGLSEDKKPFKCFT 289



 Score = 28.1 bits (62), Expect = 8.7
 Identities = 17/66 (25%), Positives = 33/66 (50%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
           E    E E  E E++E+ +E+ EE +  +   K ++K   +  ++ +KEE+  +  EE  
Sbjct: 102 EDSGAENETVEEEEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKEPEPEEEEK 161

Query: 76  KERKEK 81
            +R   
Sbjct: 162 PKRGSL 167


>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
          Length = 1094

 Score = 37.5 bits (87), Expect = 0.010
 Identities = 19/46 (41%), Positives = 30/46 (65%)

Query: 43  KKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQ 88
           KKKKKKK K+KKKKK ++  K+  +++ IEE ++    +  +E  Q
Sbjct: 685 KKKKKKKSKEKKKKKNREASKQIHALENIEEGIEGSNNEKSSETPQ 730



 Score = 35.2 bits (81), Expect = 0.059
 Identities = 22/53 (41%), Positives = 33/53 (62%), Gaps = 2/53 (3%)

Query: 48  KKKKKKKKKKKKKKKKEEQSVKEIE--ESLKERKEKTLTEDSQCDTMIPVFKK 98
           +KKKKKKK K+KKKKK  ++ K+I   E+++E  E +  E S     + + KK
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQIHALENIEEGIEGSNNEKSSETPQLKLEKK 736



 Score = 30.6 bits (69), Expect = 1.5
 Identities = 18/44 (40%), Positives = 29/44 (65%), Gaps = 1/44 (2%)

Query: 40  EKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE-IEESLKERKEKT 82
           +K KKKK K+KKKKK ++  K+    ++++E IE S  E+  +T
Sbjct: 685 KKKKKKKSKEKKKKKNREASKQIHALENIEEGIEGSNNEKSSET 728


>gnl|CDD|218771 pfam05835, Synaphin, Synaphin protein.  This family consists of
           several eukaryotic synaphin 1 and 2 proteins.
           Synaphin/complexin is a cytosolic protein that
           preferentially binds to syntaxin within the SNARE
           complex. Synaphin promotes SNAREs to form precomplexes
           that oligomerise into higher order structures. A peptide
           from the central, syntaxin binding domain of synaphin
           competitively inhibits these two proteins from
           interacting and prevents SNARE complexes from
           oligomerising. It is thought that oligomerisation of
           SNARE complexes into a higher order structure creates a
           SNARE scaffold for efficient, regulated fusion of
           synaptic vesicles. Synaphin promotes neuronal exocytosis
           by promoting interaction between the complementary
           syntaxin and synaptobrevin transmembrane regions that
           reside in opposing membranes prior to fusion.
          Length = 139

 Score = 35.6 bits (82), Expect = 0.011
 Identities = 26/87 (29%), Positives = 48/87 (55%), Gaps = 7/87 (8%)

Query: 6   GGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKK-------KKKKKKK 58
           G  G K+   ++ + EEE EE ++  +E E+  + K++K +++++        K   KKK
Sbjct: 18  GMLGGKEDEGDESDAEEEDEEIQEALREAEEERKAKHRKMEEEREVMRQGIRDKYGIKKK 77

Query: 59  KKKKKEEQSVKEIEESLKERKEKTLTE 85
           ++ ++E Q+  E EE    RK+KT  E
Sbjct: 78  EEDEEEPQAAAEEEEGRLGRKKKTPEE 104


>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
          Length = 1832

 Score = 37.8 bits (88), Expect = 0.011
 Identities = 24/79 (30%), Positives = 36/79 (45%), Gaps = 15/79 (18%)

Query: 9    GEKKKRIEKRNKEEEKEEN-EQEEKEKEDVEEEKNKKKKK--------KKKKK------K 53
             +KK  I    K+  + EN  QEEKE     E ++ K+KK         ++K       +
Sbjct: 1504 QKKKGFIPSNEKKSIEIENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAE 1563

Query: 54   KKKKKKKKKKEEQSVKEIE 72
               KK+K KK+ +S  E E
Sbjct: 1564 SDIKKRKNKKQYKSNTEAE 1582



 Score = 35.8 bits (83), Expect = 0.043
 Identities = 19/75 (25%), Positives = 38/75 (50%), Gaps = 9/75 (12%)

Query: 11   KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKK--------KKKKKKKKK 62
            +KK IE  N+ +E++E   + + + D E++ N +     ++K           KK+K KK
Sbjct: 1514 EKKSIEIENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAESDIKKRKNKK 1573

Query: 63   KEEQSV-KEIEESLK 76
            + + +   E++  LK
Sbjct: 1574 QYKSNTEAELDFFLK 1588



 Score = 34.7 bits (80), Expect = 0.092
 Identities = 17/36 (47%), Positives = 26/36 (72%)

Query: 30  EEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
            E +  D  EEK KKKKKK+KKK+++ K+++K + E
Sbjct: 733 AEFKISDSVEEKTKKKKKKEKKKEEEYKREEKARIE 768



 Score = 33.2 bits (76), Expect = 0.28
 Identities = 15/47 (31%), Positives = 30/47 (63%)

Query: 29  QEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
            ++ E +  +  + K KKKKKK+KKK+++ K+++K    + E  +S+
Sbjct: 730 GKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKARIEIAEAWDSI 776



 Score = 33.2 bits (76), Expect = 0.28
 Identities = 15/54 (27%), Positives = 26/54 (48%)

Query: 3    KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKK 56
            K   G GE +   EK+   E    N+++  E++  E +  K+K KK+ K   + 
Sbjct: 1528 KEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAESDIKKRKNKKQYKSNTEA 1581



 Score = 30.5 bits (69), Expect = 1.7
 Identities = 20/54 (37%), Positives = 28/54 (51%), Gaps = 4/54 (7%)

Query: 34  KEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDS 87
           +    ++   K     ++K KKKKKK+KKKEE+   + EE  K R E     DS
Sbjct: 726 RNWNGKDAEFKISDSVEEKTKKKKKKEKKKEEEY--KREE--KARIEIAEAWDS 775


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 37.3 bits (87), Expect = 0.011
 Identities = 13/75 (17%), Positives = 41/75 (54%), Gaps = 2/75 (2%)

Query: 8   AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE-EQ 66
              +  R+E++ K+  +E     E++ E++E +  + + K  +  ++  + ++K +E ++
Sbjct: 293 LANEISRLEQQ-KQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKE 351

Query: 67  SVKEIEESLKERKEK 81
            ++ +E  L+E + +
Sbjct: 352 ELESLEAELEELEAE 366



 Score = 30.4 bits (69), Expect = 1.8
 Identities = 12/69 (17%), Positives = 33/69 (47%)

Query: 11  KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
           + K  E   +  E EE  +E KE+ +  E + ++ + + ++ + + ++ +++ E    K 
Sbjct: 329 ESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKV 388

Query: 71  IEESLKERK 79
            +  L+   
Sbjct: 389 AQLELQIAS 397



 Score = 29.3 bits (66), Expect = 3.8
 Identities = 20/83 (24%), Positives = 36/83 (43%), Gaps = 9/83 (10%)

Query: 11  KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ---- 66
           K++R E   K E   EN       ED+  E  ++ K  +++ +K ++ K+ K E +    
Sbjct: 171 KERRKETERKLERTRENLDR---LEDILNELERQLKSLERQAEKAERYKELKAELRELEL 227

Query: 67  --SVKEIEESLKERKEKTLTEDS 87
              V  +EE  +E +E       
Sbjct: 228 ALLVLRLEELREELEELQEELKE 250



 Score = 28.5 bits (64), Expect = 7.0
 Identities = 15/81 (18%), Positives = 34/81 (41%), Gaps = 7/81 (8%)

Query: 8   AGEKKKRIE--KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK-----KKK 60
               ++R+E  +   EE  E+ E    E E++EE   + + + +    ++          
Sbjct: 833 IAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALL 892

Query: 61  KKKEEQSVKEIEESLKERKEK 81
           + + E+  +E+ E   +R E 
Sbjct: 893 RSELEELSEELRELESKRSEL 913



 Score = 28.5 bits (64), Expect = 7.8
 Identities = 24/93 (25%), Positives = 41/93 (44%), Gaps = 9/93 (9%)

Query: 2   RKRRGGAGEKKKRIEK-RNKEEEKEENEQE--------EKEKEDVEEEKNKKKKKKKKKK 52
            K      E+++ IE+   K EE EE   E         KE E++EEE  + +K+ ++  
Sbjct: 666 AKTNSSILERRREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELS 725

Query: 53  KKKKKKKKKKKEEQSVKEIEESLKERKEKTLTE 85
           ++    +K     ++  E  E    +  K LTE
Sbjct: 726 RQISALRKDLARLEAEVEQLEERIAQLSKELTE 758



 Score = 28.1 bits (63), Expect = 9.8
 Identities = 16/85 (18%), Positives = 42/85 (49%), Gaps = 6/85 (7%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK----KKKKKKEE 65
           E+ +   K  +EE +E   + ++ +E +EE + +  + +++ ++ +K+      +  + E
Sbjct: 242 EELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLE 301

Query: 66  QSVKEIEESLK--ERKEKTLTEDSQ 88
           Q  + + E L   ER+ + L    +
Sbjct: 302 QQKQILRERLANLERQLEELEAQLE 326


>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
          Length = 259

 Score = 36.6 bits (85), Expect = 0.011
 Identities = 15/66 (22%), Positives = 42/66 (63%), Gaps = 5/66 (7%)

Query: 26  ENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERK-----E 80
            ++Q+ ++++++     + ++ KKK++ + KKKK+ ++ +   K+I++ +KE++     +
Sbjct: 69  LSDQKPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKEKQLSSSLQ 128

Query: 81  KTLTED 86
             LTE+
Sbjct: 129 TKLTEE 134



 Score = 34.3 bits (79), Expect = 0.066
 Identities = 17/59 (28%), Positives = 34/59 (57%), Gaps = 2/59 (3%)

Query: 21  EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK--KEEQSVKEIEESLKE 77
           E+EKE +    + ++  ++E+ + KKKK+ ++ K  +KK  +  KE+Q    ++  L E
Sbjct: 75  EDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKEKQLSSSLQTKLTE 133



 Score = 30.1 bits (68), Expect = 1.6
 Identities = 17/62 (27%), Positives = 33/62 (53%), Gaps = 2/62 (3%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKK--KKKKKKKKKKKKKKEEQSVKEIEE 73
                E+ K++ E E K+K+++EE K  +KK  +  K+K+     + K  EE  +  I +
Sbjct: 83  SSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKEKQLSSSLQTKLTEEGLLITILD 142

Query: 74  SL 75
           ++
Sbjct: 143 NI 144



 Score = 28.9 bits (65), Expect = 3.8
 Identities = 12/36 (33%), Positives = 20/36 (55%)

Query: 20  KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKK 55
           KE      E E+ +K++  E K KK+ ++ K  +KK
Sbjct: 78  KELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKK 113


>gnl|CDD|237875 PRK14974, PRK14974, cell division protein FtsY; Provisional.
          Length = 336

 Score = 36.9 bits (86), Expect = 0.012
 Identities = 25/67 (37%), Positives = 40/67 (59%), Gaps = 3/67 (4%)

Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
          K+K  +   K EEK E E+EE+  E  EEE+ + +++KK+K     K K  + +E   K+
Sbjct: 6  KEKLSKFVEKVEEKIEEEEEEEAPEAEEEEEEEDEEEKKEKPGFFDKAKITEIKE---KD 62

Query: 71 IEESLKE 77
          IE+ L+E
Sbjct: 63 IEDLLEE 69



 Score = 36.5 bits (85), Expect = 0.015
 Identities = 26/67 (38%), Positives = 39/67 (58%), Gaps = 1/67 (1%)

Query: 12 KKRIEK-RNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
           K +EK   K EE+EE E  E E+E+ EE++ +KK+K     K K  + K+K  E  ++E
Sbjct: 10 SKFVEKVEEKIEEEEEEEAPEAEEEEEEEDEEEKKEKPGFFDKAKITEIKEKDIEDLLEE 69

Query: 71 IEESLKE 77
          +E  L E
Sbjct: 70 LELELLE 76


>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein. 
          Length = 198

 Score = 36.3 bits (84), Expect = 0.014
 Identities = 22/76 (28%), Positives = 37/76 (48%), Gaps = 7/76 (9%)

Query: 18 RNKEEEKEENEQEEKEKEDVE---EEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES 74
          R+  E  ++   ++   ED+E    ++ KKK+    +K    KK KKK +E   K  EE 
Sbjct: 25 RSTTESVDKVR-DDIFPEDIEIPSPKQPKKKRPTTPRKPATTKKSKKKDKE---KLTEEE 80

Query: 75 LKERKEKTLTEDSQCD 90
           K   +   TE+++ D
Sbjct: 81 KKPESDDDKTEENEND 96



 Score = 30.1 bits (68), Expect = 1.1
 Identities = 11/73 (15%), Positives = 32/73 (43%), Gaps = 4/73 (5%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
            ++     + KK+ +++  EEEK+    ++K     EE +N     ++    ++      
Sbjct: 59  PRKPATTKKSKKKDKEKLTEEEKKPESDDDKT----EENENDPDNNEESGDSQESASANS 114

Query: 62  KKEEQSVKEIEES 74
             +  +  ++++S
Sbjct: 115 LSDIDNEDDMDDS 127



 Score = 27.4 bits (61), Expect = 8.4
 Identities = 19/71 (26%), Positives = 30/71 (42%), Gaps = 8/71 (11%)

Query: 29  QEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK------KKEEQS--VKEIEESLKERKE 80
           Q +K++     +    KK KKK K+K  +++KK      K EE        EES   ++ 
Sbjct: 50  QPKKKRPTTPRKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENENDPDNNEESGDSQES 109

Query: 81  KTLTEDSQCDT 91
            +    S  D 
Sbjct: 110 ASANSLSDIDN 120


>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
           Genome duplication is precisely regulated by
           cyclin-dependent kinases CDKs, which bring about the
           onset of S phase by activating replication origins and
           then prevent relicensing of origins until mitosis is
           completed. The optimum sequence motif for CDK
           phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
           to have at least 11 potential phosphorylation sites.
           Drc1 is required for DNA synthesis and S-M replication
           checkpoint control. Drc1 associates with Cdc2 and is
           phosphorylated at the onset of S phase when Cdc2 is
           activated. Thus Cdc2 promotes DNA replication by
           phosphorylating Drc1 and regulating its association with
           Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
           substrates required for DNA replication.
          Length = 397

 Score = 36.7 bits (85), Expect = 0.014
 Identities = 17/52 (32%), Positives = 27/52 (51%), Gaps = 3/52 (5%)

Query: 10  EKKKRIEKRNKE---EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKK 58
           E  K  EK   E        +E +E E ++  +E+ +KK+K KKK +K+K  
Sbjct: 319 EIPKLDEKSLSEFLGYMGGIDEDDEDEDDEESKEEVEKKQKVKKKPRKRKVN 370



 Score = 33.6 bits (77), Expect = 0.13
 Identities = 18/51 (35%), Positives = 31/51 (60%), Gaps = 3/51 (5%)

Query: 16  EKRNKEEEKEENE---QEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
           E+  K +EK  +E         ED E+E +++ K++ +KK+K KKK +K+K
Sbjct: 318 EEIPKLDEKSLSEFLGYMGGIDEDDEDEDDEESKEEVEKKQKVKKKPRKRK 368



 Score = 33.6 bits (77), Expect = 0.16
 Identities = 18/77 (23%), Positives = 33/77 (42%), Gaps = 11/77 (14%)

Query: 20  KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK-----------KKKKKKEEQSV 68
           +E E EE      E+E V+ +    + ++  KKK +K+            K   +     
Sbjct: 254 REIEAEEAGIGPIEEEVVDSQAANDEPRRVFKKKGQKRTTRRVKMRPVRAKPSDEPSLPE 313

Query: 69  KEIEESLKERKEKTLTE 85
            +I E + +  EK+L+E
Sbjct: 314 SDIHEEIPKLDEKSLSE 330



 Score = 32.5 bits (74), Expect = 0.32
 Identities = 13/45 (28%), Positives = 26/45 (57%)

Query: 9   GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKK 53
               + +      +E +E+E +E+ KE+VE+++  KKK +K+K  
Sbjct: 326 KSLSEFLGYMGGIDEDDEDEDDEESKEEVEKKQKVKKKPRKRKVN 370


>gnl|CDD|225368 COG2811, NtpF, Archaeal/vacuolar-type H+-ATPase subunit H [Energy
          production and conversion].
          Length = 108

 Score = 34.7 bits (80), Expect = 0.015
 Identities = 17/77 (22%), Positives = 41/77 (53%), Gaps = 3/77 (3%)

Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE---QSV 68
           + IE+  +E E+   E  E+ +E +EE + + +K  ++  ++ +++ +++ EE   ++ 
Sbjct: 20 DEEIEEAKEEAEQIIKEAREEAREIIEEAEEEAEKLAQEILEEAREEAEEEAEEILAEAE 79

Query: 69 KEIEESLKERKEKTLTE 85
          KE    L +  E  + E
Sbjct: 80 KEASAILSKAAEGKVVE 96



 Score = 29.7 bits (67), Expect = 0.76
 Identities = 11/76 (14%), Positives = 34/76 (44%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E ++ I++  +E  +   E EE+ ++  +E   + +++ +++ ++   + +K+      K
Sbjct: 29  EAEQIIKEAREEAREIIEEAEEEAEKLAQEILEEAREEAEEEAEEILAEAEKEASAILSK 88

Query: 70  EIEESLKERKEKTLTE 85
             E  + E        
Sbjct: 89  AAEGKVVEAALSEFLA 104



 Score = 28.9 bits (65), Expect = 1.3
 Identities = 15/65 (23%), Positives = 36/65 (55%), Gaps = 4/65 (6%)

Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
           E K+     ++E E+ +EE     ++  K+ +++ ++  ++ EE++ K  +E L+E +E
Sbjct: 10 REIKKAEISADEEIEEAKEE----AEQIIKEAREEAREIIEEAEEEAEKLAQEILEEARE 65

Query: 81 KTLTE 85
          +   E
Sbjct: 66 EAEEE 70


>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
            biogenesis [Translation, ribosomal structure and
            biogenesis].
          Length = 1077

 Score = 37.0 bits (85), Expect = 0.015
 Identities = 16/49 (32%), Positives = 28/49 (57%), Gaps = 2/49 (4%)

Query: 17   KRNKEEEKEENEQEEKEKE--DVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
            K  +E+E+ E+ Q  KE+E    E+E+ ++ +K      K+  KK+ KK
Sbjct: 1027 KDQEEKERMESLQRAKEEEIGKKEKEREQRIRKTIHDNYKEMAKKRLKK 1075



 Score = 34.7 bits (79), Expect = 0.093
 Identities = 16/51 (31%), Positives = 26/51 (50%)

Query: 12   KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
            K + EK   E  +   E+E  +KE   E++ +K      K+  KK+ KKK+
Sbjct: 1027 KDQEEKERMESLQRAKEEEIGKKEKEREQRIRKTIHDNYKEMAKKRLKKKR 1077



 Score = 33.6 bits (76), Expect = 0.20
 Identities = 17/68 (25%), Positives = 41/68 (60%), Gaps = 4/68 (5%)

Query: 16   EKRNKEEEKE----ENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
            E R K E K+    E  ++++EKE +E  +  K+++  KK+K+++++ +K   +   +  
Sbjct: 1010 ECREKHEIKDRIVKERIKDQEEKERMESLQRAKEEEIGKKEKEREQRIRKTIHDNYKEMA 1069

Query: 72   EESLKERK 79
            ++ LK+++
Sbjct: 1070 KKRLKKKR 1077



 Score = 33.6 bits (76), Expect = 0.20
 Identities = 14/52 (26%), Positives = 31/52 (59%)

Query: 30   EEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
            E +EK ++++   K++ K +++K++ +  ++ K+EE   KE E   + RK  
Sbjct: 1010 ECREKHEIKDRIVKERIKDQEEKERMESLQRAKEEEIGKKEKEREQRIRKTI 1061



 Score = 32.4 bits (73), Expect = 0.49
 Identities = 15/55 (27%), Positives = 33/55 (60%)

Query: 3    KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK 57
            K R    E+K+R+E   + +E+E  ++E++ ++ + +  +   K+  KK+ KKK+
Sbjct: 1023 KERIKDQEEKERMESLQRAKEEEIGKKEKEREQRIRKTIHDNYKEMAKKRLKKKR 1077



 Score = 28.6 bits (63), Expect = 6.9
 Identities = 20/97 (20%), Positives = 41/97 (42%), Gaps = 8/97 (8%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKK-------KKKKKKKKKK 62
                +E+        ENE E  ++ D E E+ +  +KK++ +       +   +KK   
Sbjct: 613 SSDNEMEESRGSSVTAENE-ESADEVDYETEREENARKKEELRGNFELEERGDPEKKDVD 671

Query: 63  KEEQSVKEIEESLKERKEKTLTEDSQCDTMIPVFKKG 99
              +  ++IEE LK  + +  T   +   +I  ++ G
Sbjct: 672 WYTEEKRKIEEQLKINRSEFETMVPESRVVIEGYRAG 708



 Score = 28.2 bits (62), Expect = 8.9
 Identities = 19/75 (25%), Positives = 45/75 (60%), Gaps = 6/75 (8%)

Query: 13   KRIEKRNKEEEKEENE-QEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV--- 68
            +RIE     E +E++E ++   KE +++++  +K++ +  ++ K+++  KK++E+     
Sbjct: 1001 RRIELPVPPECREKHEIKDRIVKERIKDQE--EKERMESLQRAKEEEIGKKEKEREQRIR 1058

Query: 69   KEIEESLKERKEKTL 83
            K I ++ KE  +K L
Sbjct: 1059 KTIHDNYKEMAKKRL 1073


>gnl|CDD|237744 PRK14521, rpsP, 30S ribosomal protein S16; Provisional.
          Length = 186

 Score = 35.9 bits (83), Expect = 0.017
 Identities = 14/78 (17%), Positives = 30/78 (38%)

Query: 8   AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
             E K    K  KE +    + +  + +   ++   + +KK  + + +   +KK  E  +
Sbjct: 98  QAEAKFEAWKEEKEGKVNAKKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAAA 157

Query: 68  VKEIEESLKERKEKTLTE 85
           V   E +  E +E     
Sbjct: 158 VAAEEAAAAEEEEAEEAP 175



 Score = 32.8 bits (75), Expect = 0.16
 Identities = 21/57 (36%), Positives = 29/57 (50%), Gaps = 2/57 (3%)

Query: 26  ENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE-EQSVKE-IEESLKERKE 80
           E + E K +   EE++ K   KK K  K KK  KK   E E+ V E   E++ E+K 
Sbjct: 96  EAQAEAKFEAWKEEKEGKVNAKKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKA 152



 Score = 31.7 bits (72), Expect = 0.39
 Identities = 13/77 (16%), Positives = 33/77 (42%), Gaps = 1/77 (1%)

Query: 9   GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
            EK+ ++    K++  +  +  +K   + E++ N+ + +   +KK  +      +E  + 
Sbjct: 108 EEKEGKVNA-KKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAAAVAAEEAAAA 166

Query: 69  KEIEESLKERKEKTLTE 85
           +E E      +E    E
Sbjct: 167 EEEEAEEAPAEEAPAEE 183



 Score = 28.2 bits (63), Expect = 5.9
 Identities = 15/54 (27%), Positives = 27/54 (50%), Gaps = 3/54 (5%)

Query: 35  EDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK---EIEESLKERKEKTLTE 85
           E   E K +  K++K+ K   KK K  K ++ + K   E E+ + E + + + E
Sbjct: 96  EAQAEAKFEAWKEEKEGKVNAKKDKLSKAKKAAKKAALEAEKKVNEARAEAVAE 149


>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3.  This family
           of proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). Shr3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of Shr3, AAPs are retained in the ER.
          Length = 196

 Score = 35.8 bits (83), Expect = 0.017
 Identities = 13/39 (33%), Positives = 20/39 (51%)

Query: 22  EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
           E K+  ++EE   E+ +E      KK    +K + KKKK
Sbjct: 158 ERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196



 Score = 32.0 bits (73), Expect = 0.28
 Identities = 11/39 (28%), Positives = 21/39 (53%)

Query: 23  EKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
           E+++ +Q+E+   +  +E      KK    +K + KKKK
Sbjct: 158 ERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196



 Score = 30.4 bits (69), Expect = 0.99
 Identities = 10/39 (25%), Positives = 21/39 (53%)

Query: 19  NKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK 57
            +++ K++ E   +E+++      KK    +K + KKKK
Sbjct: 158 ERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196



 Score = 28.9 bits (65), Expect = 3.6
 Identities = 11/34 (32%), Positives = 17/34 (50%)

Query: 13  KRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKK 46
           K+ E+   EE KE      K+    ++ + KKKK
Sbjct: 163 KQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196



 Score = 28.5 bits (64), Expect = 4.1
 Identities = 10/40 (25%), Positives = 17/40 (42%)

Query: 9   GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKK 48
            E+K   +K     E+ +       K+    +K + KKKK
Sbjct: 157 AERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196



 Score = 28.1 bits (63), Expect = 5.5
 Identities = 13/39 (33%), Positives = 19/39 (48%), Gaps = 2/39 (5%)

Query: 27  NEQEEKEKEDV--EEEKNKKKKKKKKKKKKKKKKKKKKK 63
             ++ K+KE+   EE K       KK    +K + KKKK
Sbjct: 158 ERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196



 Score = 28.1 bits (63), Expect = 6.4
 Identities = 9/37 (24%), Positives = 19/37 (51%)

Query: 17  KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKK 53
           K  K++E+   E+ ++      ++    +K + KKKK
Sbjct: 160 KDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196


>gnl|CDD|220102 pfam09073, BUD22, BUD22.  BUD22 has been shown in yeast to be a
           nuclear protein involved in bud-site selection. It plays
           a role in positioning the proximal bud pole signal. More
           recently it has been shown to be involved in ribosome
           biogenesis.
          Length = 424

 Score = 36.4 bits (84), Expect = 0.018
 Identities = 19/45 (42%), Positives = 24/45 (53%)

Query: 42  NKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTED 86
            + K KK K KKK KK KKK+ +E S K+ EE  +   E    E 
Sbjct: 142 IETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEES 186



 Score = 35.6 bits (82), Expect = 0.039
 Identities = 22/89 (24%), Positives = 43/89 (48%), Gaps = 6/89 (6%)

Query: 11  KKKRIEKRNKEE--EKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
           K  +    +  E  +K    + + +K   +++  K KKK+ K+   K  +++ + E++S 
Sbjct: 123 KPVKNVLESIMEGLDKILGIETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESK 182

Query: 69  KEIEE---SLKERKEKTLTED-SQCDTMI 93
            E      S  E +E + +ED SQ D M+
Sbjct: 183 SEESAEDDSDDEEEEDSDSEDYSQYDGML 211



 Score = 30.2 bits (68), Expect = 1.6
 Identities = 19/93 (20%), Positives = 41/93 (44%), Gaps = 10/93 (10%)

Query: 2   RKRRGGAGEKKKRIEKR--------NKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKK 53
           RK R G   ++   EK+         KE EKE+ E+E ++ E   E +  K++    K  
Sbjct: 306 RKNRRGQRARQAIWEKKYGSGAKHVKKEREKEQKEREGRQSE--WEARQAKREGGDAKAG 363

Query: 54  KKKKKKKKKKEEQSVKEIEESLKERKEKTLTED 86
           +  +    + +++  +      K+ K+ ++ + 
Sbjct: 364 RAAEPTGSRTQQKGDRPKRGEKKKPKKPSVDKP 396



 Score = 30.2 bits (68), Expect = 1.9
 Identities = 12/56 (21%), Positives = 20/56 (35%)

Query: 9   GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
           GE+   I       E E +E + +  E      +++     KK K+KK        
Sbjct: 220 GEEAPSINYNEDTSESESDESDSEISESRSVSDSEESSPPSKKPKEKKTSSTFLPS 275



 Score = 29.0 bits (65), Expect = 3.8
 Identities = 17/71 (23%), Positives = 27/71 (38%), Gaps = 9/71 (12%)

Query: 2   RKRRGGAGEKK-KRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKK------ 54
             R+     ++ KR     K     E      +++    ++ +KKK KK    K      
Sbjct: 342 EGRQSEWEARQAKREGGDAKAGRAAEPTGSRTQQKGDRPKRGEKKKPKKPSVDKPLHPSW 401

Query: 55  --KKKKKKKKK 63
             KKK K+KK 
Sbjct: 402 EAKKKAKEKKA 412



 Score = 29.0 bits (65), Expect = 4.4
 Identities = 16/63 (25%), Positives = 28/63 (44%), Gaps = 7/63 (11%)

Query: 4   RRGGAGEKKKRIEKRN--KEEEKEENEQEEKEKEDVEEEKN-----KKKKKKKKKKKKKK 56
               AG   +    R   K +  +  E+++ +K  V++  +     KKK K+KK   K +
Sbjct: 358 GDAKAGRAAEPTGSRTQQKGDRPKRGEKKKPKKPSVDKPLHPSWEAKKKAKEKKANAKFQ 417

Query: 57  KKK 59
            KK
Sbjct: 418 GKK 420



 Score = 28.3 bits (63), Expect = 6.6
 Identities = 16/68 (23%), Positives = 24/68 (35%), Gaps = 16/68 (23%)

Query: 30  EEKEKEDVEEEKNKKKKKKKKKKKK----------------KKKKKKKKKEEQSVKEIEE 73
            E E +D E+    +  KK  K+K                    K  KK+ E+  KE E 
Sbjct: 284 SEDEDDDDEDIDPDQVVKKPVKRKNRRGQRARQAIWEKKYGSGAKHVKKEREKEQKEREG 343

Query: 74  SLKERKEK 81
              E + +
Sbjct: 344 RQSEWEAR 351


>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3.  This family of
           proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). SH3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of SH3, AAPs are retained in the ER.
          Length = 196

 Score = 35.8 bits (83), Expect = 0.019
 Identities = 11/39 (28%), Positives = 21/39 (53%)

Query: 25  EENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
           E  + +  E+   EE +     K+++  + +KK+ KKKK
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196



 Score = 35.0 bits (81), Expect = 0.031
 Identities = 13/39 (33%), Positives = 21/39 (53%)

Query: 22  EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
           E K+    EE   E+ E     K+++  + +KK+ KKKK
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196



 Score = 31.5 bits (72), Expect = 0.39
 Identities = 14/37 (37%), Positives = 19/37 (51%)

Query: 17  KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKK 53
           K  K  E+   E+ E      EEE  + +KK+ KKKK
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196



 Score = 31.1 bits (71), Expect = 0.64
 Identities = 14/40 (35%), Positives = 17/40 (42%)

Query: 9   GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKK 48
            E K          E+ E     KE+E  E EK + KKKK
Sbjct: 157 AEWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196



 Score = 30.8 bits (70), Expect = 0.73
 Identities = 11/39 (28%), Positives = 19/39 (48%)

Query: 19  NKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK 57
             ++ K   E   +E E     K ++  + +KK+ KKKK
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196



 Score = 30.4 bits (69), Expect = 1.0
 Identities = 12/38 (31%), Positives = 18/38 (47%)

Query: 18  RNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKK 55
               +  EE   EE E     +E+   + +KK+ KKKK
Sbjct: 159 WKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196



 Score = 30.4 bits (69), Expect = 1.0
 Identities = 14/35 (40%), Positives = 17/35 (48%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKK 50
            K  +E   EE E     KE+   E  KK+ KKKK
Sbjct: 162 AKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196



 Score = 30.4 bits (69), Expect = 1.2
 Identities = 10/39 (25%), Positives = 20/39 (51%)

Query: 26  ENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
           E +  +  +E   EE       K+++  + +KK+ KKK+
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196


>gnl|CDD|223616 COG0542, clpA, ATP-binding subunits of Clp protease and DnaK/DnaJ
           chaperones [Posttranslational modification, protein
           turnover, chaperones].
          Length = 786

 Score = 36.5 bits (85), Expect = 0.019
 Identities = 19/88 (21%), Positives = 40/88 (45%), Gaps = 19/88 (21%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK------------ 57
           E ++ + +   E+E  E EQ+EKEK+ ++E    K+ +  + +K+ +             
Sbjct: 405 ELERELAQLEIEKEALEREQDEKEKKLIDEIIKLKEGRIPELEKELEAEVDEDDIAEVVA 464

Query: 58  -------KKKKKKEEQSVKEIEESLKER 78
                   K  + E++ +  +E  LK+R
Sbjct: 465 RWTGIPVAKLLEDEKEKLLNLERRLKKR 492



 Score = 34.2 bits (79), Expect = 0.12
 Identities = 16/64 (25%), Positives = 34/64 (53%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           +K + +++  +E  + E E+E  E+E  E+EK    +  K K+ +  + +K+ + E    
Sbjct: 398 DKPEELDELERELAQLEIEKEALEREQDEKEKKLIDEIIKLKEGRIPELEKELEAEVDED 457

Query: 70  EIEE 73
           +I E
Sbjct: 458 DIAE 461


>gnl|CDD|184900 PRK14907, rplD, 50S ribosomal protein L4; Provisional.
          Length = 295

 Score = 36.1 bits (83), Expect = 0.019
 Identities = 22/97 (22%), Positives = 41/97 (42%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           EKK   +K    +E  + ++  K       +K  K KK K  K   KK   K ++ +SVK
Sbjct: 15  EKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKKVTVKFEKTESVK 74

Query: 70  EIEESLKERKEKTLTEDSQCDTMIPVFKKGVLYKGIY 106
           +   + K  K++ ++ +    +         L K ++
Sbjct: 75  KESVAKKTVKKEAVSAEVFEASNKLFKNTSKLPKKLF 111



 Score = 31.1 bits (70), Expect = 0.88
 Identities = 16/54 (29%), Positives = 25/54 (46%)

Query: 31 EKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLT 84
            +K+  EE+K   KK    K+  K KK  K    ++ K+  +  K +  KT T
Sbjct: 7  TTKKKTTEEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTT 60



 Score = 28.8 bits (64), Expect = 3.9
 Identities = 11/68 (16%), Positives = 31/68 (45%)

Query: 8   AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
           A +K  +++K    +   +    + EK +  ++++  KK  KK+    +  +   K  ++
Sbjct: 43  AAKKAAKVKKTKSVKTTTKKVTVKFEKTESVKKESVAKKTVKKEAVSAEVFEASNKLFKN 102

Query: 68  VKEIEESL 75
             ++ + L
Sbjct: 103 TSKLPKKL 110



 Score = 28.8 bits (64), Expect = 4.7
 Identities = 24/95 (25%), Positives = 34/95 (35%), Gaps = 18/95 (18%)

Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKK----KKKKKKKKKKKKKKEE 65
          E KK  +K+  EE+K        +K    +E  K KK  K    K  KK  K KK K  +
Sbjct: 3  ETKKTTKKKTTEEKKPA-----AKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVK 57

Query: 66 QSVK---------EIEESLKERKEKTLTEDSQCDT 91
           + K         E  +     K+    E    + 
Sbjct: 58 TTTKKVTVKFEKTESVKKESVAKKTVKKEAVSAEV 92



 Score = 28.4 bits (63), Expect = 5.6
 Identities = 20/83 (24%), Positives = 30/83 (36%), Gaps = 6/83 (7%)

Query: 24  KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKK------KKKKKEEQSVKEIEESLKE 77
             E ++  K+K   E++   KK    K+  K KK       K  KK  +  K        
Sbjct: 1   MAETKKTTKKKTTEEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTT 60

Query: 78  RKEKTLTEDSQCDTMIPVFKKGV 100
           +K     E ++      V KK V
Sbjct: 61  KKVTVKFEKTESVKKESVAKKTV 83


>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
           Transcription of the anti-viral guanylate-binding
           protein (GBP) is induced by interferon-gamma during
           macrophage induction. This family contains GBP1 and
           GPB2, both GTPases capable of binding GTP, GDP and GMP.
          Length = 297

 Score = 36.1 bits (84), Expect = 0.020
 Identities = 18/76 (23%), Positives = 41/76 (53%), Gaps = 3/76 (3%)

Query: 8   AGEKKKRIEKRNKEE---EKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
           A EK    E+   E    E+E   +++KE+E + E + +  ++  K+  +K + +++K  
Sbjct: 201 AKEKAIEAERAKAEAAEAEQELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEAEREKLL 260

Query: 65  EQSVKEIEESLKERKE 80
            +  + +E  L+E++E
Sbjct: 261 AEQERMLEHKLQEQEE 276



 Score = 31.5 bits (72), Expect = 0.65
 Identities = 15/64 (23%), Positives = 38/64 (59%), Gaps = 1/64 (1%)

Query: 22  EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
             KE+  + E+ K +  E + +  ++K+K++++  + +++  +E  VK++ E ++  +EK
Sbjct: 200 TAKEKAIEAERAKAEAAEAEQELLREKQKEEEQMMEAQERSYQEH-VKQLIEKMEAEREK 258

Query: 82  TLTE 85
            L E
Sbjct: 259 LLAE 262



 Score = 29.9 bits (68), Expect = 2.0
 Identities = 15/58 (25%), Positives = 33/58 (56%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
           E+  +E  K+  E+ E E+E +  E+ +  + K +++++  K+  K + E   KEI++
Sbjct: 238 ERSYQEHVKQLIEKMEAEREKLLAEQERMLEHKLQEQEELLKEGFKTEAESLQKEIQD 295


>gnl|CDD|113514 pfam04747, DUF612, Protein of unknown function, DUF612.  This
           family includes several uncharacterized proteins from
           Caenorhabditis elegans.
          Length = 517

 Score = 36.6 bits (83), Expect = 0.022
 Identities = 24/87 (27%), Positives = 50/87 (57%)

Query: 17  KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLK 76
           K+  E+E    E E K++   EEE  + K ++++ +K+++KK+   K+ Q+ K+ E+++K
Sbjct: 95  KKAAEKEARRAEAEAKKRAAQEEEHKQWKAEQERIQKEQEKKEADLKKLQAEKKKEKAVK 154

Query: 77  ERKEKTLTEDSQCDTMIPVFKKGVLYK 103
             K +   +  +  T  PV ++ V+ K
Sbjct: 155 AEKAEKAEKTKKASTPAPVEEEIVVKK 181



 Score = 34.7 bits (78), Expect = 0.084
 Identities = 21/76 (27%), Positives = 43/76 (56%)

Query: 9   GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
            +K    E R  E E ++   +E+E +  + E+ + +K+++KK+   KK + +KK+E++V
Sbjct: 94  AKKAAEKEARRAEAEAKKRAAQEEEHKQWKAEQERIQKEQEKKEADLKKLQAEKKKEKAV 153

Query: 69  KEIEESLKERKEKTLT 84
           K  +    E+ +K  T
Sbjct: 154 KAEKAEKAEKTKKAST 169



 Score = 34.3 bits (77), Expect = 0.093
 Identities = 24/79 (30%), Positives = 44/79 (55%), Gaps = 3/79 (3%)

Query: 4   RRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKK---KKKKKKK 60
           RR  A  KK+  ++   ++ K E E+ +KE+E  E +  K + +KKK+K    +K +K +
Sbjct: 103 RRAEAEAKKRAAQEEEHKQWKAEQERIQKEQEKKEADLKKLQAEKKKEKAVKAEKAEKAE 162

Query: 61  KKKEEQSVKEIEESLKERK 79
           K K+  +   +EE +  +K
Sbjct: 163 KTKKASTPAPVEEEIVVKK 181



 Score = 33.1 bits (74), Expect = 0.24
 Identities = 20/76 (26%), Positives = 43/76 (56%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E+ +++EK  K E+K+  +Q  K+ E  ++   KK  +K+ ++ + + KK+  +EE+  +
Sbjct: 62  EQPQQVEKVKKSEKKKAQKQIAKDHEAEQKVNAKKAAEKEARRAEAEAKKRAAQEEEHKQ 121

Query: 70  EIEESLKERKEKTLTE 85
              E  + +KE+   E
Sbjct: 122 WKAEQERIQKEQEKKE 137



 Score = 33.1 bits (74), Expect = 0.25
 Identities = 26/75 (34%), Positives = 43/75 (57%), Gaps = 7/75 (9%)

Query: 8   AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
           A EK+ R   R + E K+   QEE+ K+   E++  +K+++KK+   KK + +KKKE   
Sbjct: 97  AAEKEAR---RAEAEAKKRAAQEEEHKQWKAEQERIQKEQEKKEADLKKLQAEKKKE--- 150

Query: 68  VKEIEESLKERKEKT 82
            K ++    E+ EKT
Sbjct: 151 -KAVKAEKAEKAEKT 164



 Score = 30.0 bits (66), Expect = 2.0
 Identities = 23/67 (34%), Positives = 30/67 (44%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           EKK +  KR  E E      E   +  VE      + +KK KK KKK + +K  EE    
Sbjct: 256 EKKNKKNKRKSESENVPAASETPVEPVVETTPPASENQKKNKKDKKKSESEKVVEEPVQA 315

Query: 70  EIEESLK 76
           E  +S K
Sbjct: 316 EAPKSKK 322



 Score = 29.6 bits (65), Expect = 2.8
 Identities = 19/60 (31%), Positives = 34/60 (56%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
           K   E+  KE+EK+E + ++ + E  +E+  K +K +K +K KK       +EE  VK++
Sbjct: 123 KAEQERIQKEQEKKEADLKKLQAEKKKEKAVKAEKAEKAEKTKKASTPAPVEEEIVVKKV 182



 Score = 28.5 bits (62), Expect = 7.1
 Identities = 20/75 (26%), Positives = 32/75 (42%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
           EK+NK+ +++   +      +   E   +      + +KK KK KKK E + V E     
Sbjct: 256 EKKNKKNKRKSESENVPAASETPVEPVVETTPPASENQKKNKKDKKKSESEKVVEEPVQA 315

Query: 76  KERKEKTLTEDSQCD 90
           +  K K  T D   D
Sbjct: 316 EAPKSKKPTADDNMD 330


>gnl|CDD|219882 pfam08524, rRNA_processing, rRNA processing.  This is a family of
           proteins that are involved in rRNA processing. In a
           localisation study they were found to localise to the
           nucleus and nucleolus. The family also includes other
           metazoa members from plants to mammals where the protein
           has been named BR22 and is associated with TTF-1,
           thyroid transcription factor 1. In the lungs, the family
           binds TTF-1 to form a complex which influences the
           expression of the key lung surfactant protein-B (SP-B)
           and -C (SP-C), the small hydrophobic surfactant proteins
           that maintain surface tension in alveoli.
          Length = 150

 Score = 34.9 bits (80), Expect = 0.022
 Identities = 18/51 (35%), Positives = 35/51 (68%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
           EKK+  ++R +E+ ++E  + +KE E +E  K K+K++++++KK  KK K 
Sbjct: 76  EKKEIAKQRKREQREKELAKRQKELEKIELSKKKQKERERRRKKLTKKTKS 126



 Score = 34.5 bits (79), Expect = 0.031
 Identities = 22/74 (29%), Positives = 44/74 (59%), Gaps = 3/74 (4%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKK---KKKKKKKKKKKKKKKKKKKEEQSVKEIE 72
           E     E++   +Q +  KED + EK KK   KK+  K++K+++++K+  K ++ +++IE
Sbjct: 45  EGYAVPEKESAEKQVKSSKEDRKFEKKKKLDEKKEIAKQRKREQREKELAKRQKELEKIE 104

Query: 73  ESLKERKEKTLTED 86
            S K++KE+     
Sbjct: 105 LSKKKQKERERRRK 118



 Score = 34.1 bits (78), Expect = 0.046
 Identities = 22/75 (29%), Positives = 43/75 (57%), Gaps = 5/75 (6%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKED--VEEEKNKKKKKK---KKKKKKKKKKKKKKKEEQ 66
           KK   K  ++E     E+E  EK+    +E++  +KKKK   KK+  K++K+++++KE  
Sbjct: 35  KKEYLKLLEKEGYAVPEKESAEKQVKSSKEDRKFEKKKKLDEKKEIAKQRKREQREKELA 94

Query: 67  SVKEIEESLKERKEK 81
             ++  E ++  K+K
Sbjct: 95  KRQKELEKIELSKKK 109



 Score = 32.9 bits (75), Expect = 0.096
 Identities = 19/60 (31%), Positives = 40/60 (66%)

Query: 8   AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
             EKKK+++++ +  ++ + EQ EKE    ++E  K +  KKK+K++++++KK  K+ +S
Sbjct: 67  KFEKKKKLDEKKEIAKQRKREQREKELAKRQKELEKIELSKKKQKERERRRKKLTKKTKS 126



 Score = 31.4 bits (71), Expect = 0.37
 Identities = 18/59 (30%), Positives = 37/59 (62%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
           K   E R  E++K+ +E++E  K+   E++ K+  K++K+ +K +  KKK+KE +  ++
Sbjct: 60  KSSKEDRKFEKKKKLDEKKEIAKQRKREQREKELAKRQKELEKIELSKKKQKERERRRK 118



 Score = 30.2 bits (68), Expect = 0.95
 Identities = 18/64 (28%), Positives = 38/64 (59%)

Query: 20  KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERK 79
           K  EKE     EKE  + + + +K+ +K +KKKK  +KK+  K+ ++  +E E + ++++
Sbjct: 40  KLLEKEGYAVPEKESAEKQVKSSKEDRKFEKKKKLDEKKEIAKQRKREQREKELAKRQKE 99

Query: 80  EKTL 83
            + +
Sbjct: 100 LEKI 103



 Score = 28.7 bits (64), Expect = 2.8
 Identities = 17/60 (28%), Positives = 41/60 (68%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E +K  +K+  +E+KE  +Q ++E+ + E  K +K+ +K +  KKK+K++++++++ + K
Sbjct: 64  EDRKFEKKKKLDEKKEIAKQRKREQREKELAKRQKELEKIELSKKKQKERERRRKKLTKK 123


>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
           component YidC; Validated.
          Length = 429

 Score = 36.4 bits (84), Expect = 0.022
 Identities = 16/88 (18%), Positives = 32/88 (36%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
           +K   K  K+E  ++    E+E      ++      + + ++   K KKK   + S  E 
Sbjct: 339 EKNEAKARKKEIAQKRRAAEREINREARQERAAAMARARARRAAVKAKKKGLIDASPNED 398

Query: 72  EESLKERKEKTLTEDSQCDTMIPVFKKG 99
             S  E  + +  +     T  P  +  
Sbjct: 399 TPSENEESKGSPPQVEATTTAEPNREPS 426



 Score = 33.7 bits (77), Expect = 0.15
 Identities = 18/90 (20%), Positives = 35/90 (38%), Gaps = 7/90 (7%)

Query: 7   GAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKK-----KKKKKKKKKKKKKKK 61
            A  KK R  +  K E K   ++  +++   E E N++ +        + + ++   K K
Sbjct: 329 NAEIKKTRTAE--KNEAKARKKEIAQKRRAAEREINREARQERAAAMARARARRAAVKAK 386

Query: 62  KKEEQSVKEIEESLKERKEKTLTEDSQCDT 91
           KK        E++  E +E   +      T
Sbjct: 387 KKGLIDASPNEDTPSENEESKGSPPQVEAT 416



 Score = 33.3 bits (76), Expect = 0.22
 Identities = 9/79 (11%), Positives = 32/79 (40%), Gaps = 2/79 (2%)

Query: 14  RIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE--QSVKEI 71
            +   N E +K    ++ + K   +E   K++  +++  ++ ++++             +
Sbjct: 324 ELHAENAEIKKTRTAEKNEAKARKKEIAQKRRAAEREINREARQERAAAMARARARRAAV 383

Query: 72  EESLKERKEKTLTEDSQCD 90
           +   K   + +  ED+  +
Sbjct: 384 KAKKKGLIDASPNEDTPSE 402



 Score = 32.9 bits (75), Expect = 0.23
 Identities = 13/69 (18%), Positives = 26/69 (37%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
              +  A  +KK I ++ +  E+E N +  +E+         ++   K KKK        
Sbjct: 337 TAEKNEAKARKKEIAQKRRAAEREINREARQERAAAMARARARRAAVKAKKKGLIDASPN 396

Query: 62  KKEEQSVKE 70
           +      +E
Sbjct: 397 EDTPSENEE 405



 Score = 32.9 bits (75), Expect = 0.26
 Identities = 14/75 (18%), Positives = 28/75 (37%), Gaps = 1/75 (1%)

Query: 1   MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
           ++K R     + K  +K   ++ +    +  +E          + + ++   K KKK   
Sbjct: 332 IKKTRTAEKNEAKARKKEIAQKRRAAEREINREARQERAAAMARARARRAAVKAKKKGLI 391

Query: 61  KKKE-EQSVKEIEES 74
                E +  E EES
Sbjct: 392 DASPNEDTPSENEES 406


>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
           and conversion].
          Length = 161

 Score = 35.0 bits (81), Expect = 0.023
 Identities = 17/74 (22%), Positives = 39/74 (52%), Gaps = 6/74 (8%)

Query: 10  EKKKRIE------KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
           E++ +I       +R KEE +    + E+E E+  E+ ++  ++ KK+ ++  ++ K + 
Sbjct: 37  ERQAKIADDLAEAERLKEEAQALLAEYEQELEEAREQASEIIEQAKKEAEQIAEEIKAEA 96

Query: 64  EEQSVKEIEESLKE 77
           EE+  +  E +  E
Sbjct: 97  EEELERIKEAAEAE 110



 Score = 29.2 bits (66), Expect = 1.8
 Identities = 12/70 (17%), Positives = 36/70 (51%), Gaps = 1/70 (1%)

Query: 15  IEKRNKEEEKEENEQEEKEKE-DVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
           +++R  +   +  E E  ++E      + +++ ++ +++  +  ++ KK+ EQ  +EI+ 
Sbjct: 35  LDERQAKIADDLAEAERLKEEAQALLAEYEQELEEAREQASEIIEQAKKEAEQIAEEIKA 94

Query: 74  SLKERKEKTL 83
             +E  E+  
Sbjct: 95  EAEEELERIK 104



 Score = 28.4 bits (64), Expect = 3.7
 Identities = 12/77 (15%), Positives = 35/77 (45%), Gaps = 1/77 (1%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
            + +  A       E+  +E  ++ +E  E+ K++ E+   + K + +++ ++ K+  + 
Sbjct: 50  ERLKEEAQALLAEYEQELEEAREQASEIIEQAKKEAEQIAEEIKAEAEEELERIKEAAEA 109

Query: 62  KKEEQSVKEIEESLKER 78
           +  E   +   E L+  
Sbjct: 110 EI-EAEKERALEELRAE 125


>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200).  This
           family is found in eukaryotes. It is a coiled-coil
           domain of unknwon function.
          Length = 126

 Score = 34.5 bits (80), Expect = 0.023
 Identities = 24/83 (28%), Positives = 49/83 (59%), Gaps = 7/83 (8%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVE-------EEKNKKKKKKKKKKKKKKKKKKKK 62
           E+ +R E+  K+ E+E  ++EE+ +E +        E + K+++ +KK +++KK +K+K+
Sbjct: 21  EEFERREELLKQREEELEKKEEELQESLIKFDKFLKENEAKRRRAEKKAEEEKKLRKEKE 80

Query: 63  KEEQSVKEIEESLKERKEKTLTE 85
           +E + +K   E LK   EK   +
Sbjct: 81  EEIKELKAELEELKAEIEKLEEK 103



 Score = 28.0 bits (63), Expect = 4.0
 Identities = 19/74 (25%), Positives = 40/74 (54%), Gaps = 3/74 (4%)

Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKK---KKKKKKKKKKKKEEQSVKEIE 72
           KR + E +EE  ++ +E+ + +EE+ ++   K  K   + + K+++ +KK E+  K  +
Sbjct: 18 AKREEFERREELLKQREEELEKKEEELQESLIKFDKFLKENEAKRRRAEKKAEEEKKLRK 77

Query: 73 ESLKERKEKTLTED 86
          E  +E KE     +
Sbjct: 78 EKEEEIKELKAELE 91



 Score = 26.8 bits (60), Expect = 9.2
 Identities = 16/77 (20%), Positives = 47/77 (61%)

Query: 1   MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
           +++R     +K++ +++   + +K   E E K +   ++ + +KK +K+K+++ K+ K +
Sbjct: 30  LKQREEELEKKEEELQESLIKFDKFLKENEAKRRRAEKKAEEEKKLRKEKEEEIKELKAE 89

Query: 61  KKKEEQSVKEIEESLKE 77
            ++ +  ++++EE L+E
Sbjct: 90  LEELKAEIEKLEEKLEE 106


>gnl|CDD|218797 pfam05890, Ebp2, Eukaryotic rRNA processing protein EBP2.  This
           family consists of several Eukaryotic rRNA processing
           protein EBP2 sequences. Ebp2p is required for the
           maturation of 25S rRNA and 60S subunit assembly. Ebp2p
           may be one of the target proteins of Rrs1p for executing
           the signal to regulate ribosome biogenesis. This family
           also plays a role in chromosome segregation.
          Length = 270

 Score = 35.8 bits (83), Expect = 0.023
 Identities = 17/66 (25%), Positives = 31/66 (46%), Gaps = 3/66 (4%)

Query: 1   MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
           +++R   A EKK  +EK  K ++K +   +E +  D   +   K   KKK +K    + K
Sbjct: 155 LQER---AKEKKDMLEKIKKLKKKRKGGGDEGDLFDFLVDDAAKGGSKKKGRKGGAARGK 211

Query: 61  KKKEEQ 66
              + +
Sbjct: 212 PNAKRK 217



 Score = 30.0 bits (68), Expect = 1.8
 Identities = 20/88 (22%), Positives = 39/88 (44%), Gaps = 8/88 (9%)

Query: 1   MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKE---DVEEEKNKKKKK---KKKKKKK 54
           M+K R     +    +    EE +++ E ++  K+   +  +E+ K+KK    K KK KK
Sbjct: 117 MQKVRKKLLAEAAAKKAS--EEARKQRELKKFGKQVQVEKLQERAKEKKDMLEKIKKLKK 174

Query: 55  KKKKKKKKKEEQSVKEIEESLKERKEKT 82
           K+K    + +       + +    K+K 
Sbjct: 175 KRKGGGDEGDLFDFLVDDAAKGGSKKKG 202



 Score = 28.5 bits (64), Expect = 5.4
 Identities = 22/85 (25%), Positives = 39/85 (45%), Gaps = 14/85 (16%)

Query: 11  KKKR-IEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKK-------------KKKKKK 56
           +K+R ++K  K+ + E+ ++  KEK+D+ E+  K KKK+K                 K  
Sbjct: 138 RKQRELKKFGKQVQVEKLQERAKEKKDMLEKIKKLKKKRKGGGDEGDLFDFLVDDAAKGG 197

Query: 57  KKKKKKKEEQSVKEIEESLKERKEK 81
            KKK +K   +  +     K +  K
Sbjct: 198 SKKKGRKGGAARGKPNAKRKAKDAK 222


>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family.  This
           family includes gbp a protein from Soybean that binds to
           GAGA element dinucleotide repeat DNA. It seems likely
           that the this domain mediates DNA binding. This putative
           domain contains several conserved cysteines and a
           histidine suggesting this may be a zinc-binding DNA
           interaction domain.
          Length = 301

 Score = 36.0 bits (83), Expect = 0.024
 Identities = 15/46 (32%), Positives = 20/46 (43%)

Query: 23  EKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
           E    E         E ++ KK KK +  K  K  K KK K++ SV
Sbjct: 128 EMHHLEVPPISTAPPEAKEVKKPKKGQSPKVPKAPKPKKPKKKGSV 173



 Score = 34.1 bits (78), Expect = 0.097
 Identities = 15/50 (30%), Positives = 19/50 (38%)

Query: 21  EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
           E    E         + +E K  KK +  K  K  K KK KKK   S + 
Sbjct: 128 EMHHLEVPPISTAPPEAKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRS 177



 Score = 32.2 bits (73), Expect = 0.40
 Identities = 14/66 (21%), Positives = 23/66 (34%)

Query: 14  RIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
           R     +         E KE +  ++ ++ K  K  K KK KKK     +  +       
Sbjct: 127 REMHHLEVPPISTAPPEAKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPGIDPR 186

Query: 74  SLKERK 79
           S  + K
Sbjct: 187 SKPDWK 192



 Score = 29.9 bits (67), Expect = 2.0
 Identities = 13/73 (17%), Positives = 26/73 (35%)

Query: 23  EKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKT 82
                E +E +K    +     K  K KK KKK     +  +   +    +   + ++  
Sbjct: 138 STAPPEAKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPGIDPRSKPDWKSQDLV 197

Query: 83  LTEDSQCDTMIPV 95
           +   S   + +PV
Sbjct: 198 INGVSMDISGMPV 210


>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
           partitioning / Cytoskeleton].
          Length = 373

 Score = 35.8 bits (83), Expect = 0.027
 Identities = 13/71 (18%), Positives = 34/71 (47%), Gaps = 4/71 (5%)

Query: 1   MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
           +++       +++R  K+   E+  E E+  +E     E+   +++K+   K ++ +KK 
Sbjct: 303 LKEIHEARLNEEERELKKKFTEKIREKEKRLEEL----EQNLIEERKELNSKLEEIQKKL 358

Query: 61  KKKEEQSVKEI 71
           +  E++  K  
Sbjct: 359 EDLEKRLEKLK 369



 Score = 32.3 bits (74), Expect = 0.41
 Identities = 16/79 (20%), Positives = 39/79 (49%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
            K  G    K+    + N+EE + + +  EK +E  +  +  ++   +++K+   K ++ 
Sbjct: 295 LKNSGEPSLKEIHEARLNEEERELKKKFTEKIREKEKRLEELEQNLIEERKELNSKLEEI 354

Query: 62  KKEEQSVKEIEESLKERKE 80
           +K+ + +++  E LK  K 
Sbjct: 355 QKKLEDLEKRLEKLKSNKS 373



 Score = 31.9 bits (73), Expect = 0.49
 Identities = 17/75 (22%), Positives = 38/75 (50%), Gaps = 9/75 (12%)

Query: 11  KKKRIEKR-NKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           + +++    N  E   +   E +  E+  E K     KK  +K ++K+K+ ++ E+  ++
Sbjct: 288 RTEKLSGLKNSGEPSLKEIHEARLNEEERELK-----KKFTEKIREKEKRLEELEQNLIE 342

Query: 70  EIEE---SLKERKEK 81
           E +E    L+E ++K
Sbjct: 343 ERKELNSKLEEIQKK 357


>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein.  Rotavirus particles
          consist of three concentric proteinaceous capsid
          layers. The innermost capsid (core) is made of VP2. The
          genomic RNA and the two minor proteins VP1 and VP3 are
          encapsidated within this layer. The N-terminus of
          rotavirus VP2 is necessary for the encapsidation of VP1
          and VP3.
          Length = 887

 Score = 36.1 bits (83), Expect = 0.028
 Identities = 19/85 (22%), Positives = 45/85 (52%), Gaps = 8/85 (9%)

Query: 2  RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEE-KNKKKKKKKKKKKKKKKKKK 60
          R RR      +  I   ++ +EK++ +Q++K + +++E+  +KK++          K++ 
Sbjct: 4  RNRR------EANINNNDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQS 57

Query: 61 KKKEEQSVKEIEESLKERKEKTLTE 85
           ++  +   E+++S KE + K L E
Sbjct: 58 SQENLKIADEVKKSTKE-ESKQLLE 81



 Score = 34.9 bits (80), Expect = 0.072
 Identities = 7/48 (14%), Positives = 28/48 (58%)

Query: 43 KKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCD 90
           +++       + ++K  +K+++++  E++E + ++KE+ +T++    
Sbjct: 5  NRREANINNNDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSP 52



 Score = 32.2 bits (73), Expect = 0.51
 Identities = 24/105 (22%), Positives = 48/105 (45%), Gaps = 12/105 (11%)

Query: 2   RKRRGGAGEKKKRIEKRN-KEEEKEENEQEEK---EKEDVEEEKNKKKKKKKK-----KK 52
            +R        +  EK + K+++K   E +EK   +KE+V  +      K++      K 
Sbjct: 5   NRREANINNNDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQENLKI 64

Query: 53  KKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDTMIPVFK 97
             +  KK  K+E + + E+ ++ +E +++   E  Q    IP F+
Sbjct: 65  ADEV-KKSTKEESKQLLEVLKTKEEHQKEIQYEILQ--KTIPSFE 106


>gnl|CDD|218538 pfam05285, SDA1, SDA1.  This family consists of several SDA1
           protein homologues. SDA1 is a Saccharomyces cerevisiae
           protein which is involved in the control of the actin
           cytoskeleton. The protein is essential for cell
           viability and is localised in the nucleus.
          Length = 317

 Score = 35.8 bits (83), Expect = 0.029
 Identities = 15/62 (24%), Positives = 33/62 (53%), Gaps = 1/62 (1%)

Query: 21  EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
           +E +  + + +KE E  + E ++++K +  KK K+   ++  +E++     EE  +  KE
Sbjct: 114 DEGEWIDVESDKEIESSDSE-DEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKE 172

Query: 81  KT 82
           K 
Sbjct: 173 KA 174



 Score = 31.9 bits (73), Expect = 0.51
 Identities = 19/90 (21%), Positives = 37/90 (41%), Gaps = 19/90 (21%)

Query: 9   GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNK-------------------KKKKKK 49
            E  K+ ++ + EE  EE+E+E  E+E+ E EK K                   + + +K
Sbjct: 139 DEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEKASELATTRILTPADFAKIQELRLEK 198

Query: 50  KKKKKKKKKKKKKKEEQSVKEIEESLKERK 79
              K    K K++ ++   +  +E +    
Sbjct: 199 GVDKALGGKLKRRDKDAPERHSDELVDADD 228



 Score = 31.5 bits (72), Expect = 0.60
 Identities = 16/45 (35%), Positives = 27/45 (60%), Gaps = 1/45 (2%)

Query: 20  KEEEKEEN-EQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
           K++ KEE     ++ +ED E+  ++K KK K+ K    K+K +KK
Sbjct: 236 KKQTKEERIATAKEGREDREKFGSRKGKKDKEGKSTTNKEKARKK 280



 Score = 30.8 bits (70), Expect = 1.0
 Identities = 13/83 (15%), Positives = 39/83 (46%), Gaps = 10/83 (12%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKK----------KKKKEE 65
           E+   E  K+  E  ++E  + +EE+  ++++ + +K+K  +             K +E 
Sbjct: 135 EEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEKASELATTRILTPADFAKIQEL 194

Query: 66  QSVKEIEESLKERKEKTLTEDSQ 88
           +  K ++++L  + ++   +  +
Sbjct: 195 RLEKGVDKALGGKLKRRDKDAPE 217



 Score = 30.4 bits (69), Expect = 1.4
 Identities = 19/75 (25%), Positives = 37/75 (49%), Gaps = 7/75 (9%)

Query: 21  EEEKEENEQEEKEKEDVEEEKNKK-------KKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
           E E++E+  +E E  DVE +K  +       ++K +  KK K+   ++  EE   +  EE
Sbjct: 105 EVEEDEDSDDEGEWIDVESDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEE 164

Query: 74  SLKERKEKTLTEDSQ 88
              E +++  +E + 
Sbjct: 165 EEAEAEKEKASELAT 179



 Score = 29.2 bits (66), Expect = 2.9
 Identities = 15/66 (22%), Positives = 36/66 (54%), Gaps = 1/66 (1%)

Query: 19  NKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKER 78
           + E +KE    +  E E+ ++E  KK K+   ++  ++ +++  +EE++  E E++ +  
Sbjct: 120 DVESDKEIESSDS-EDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEKASELA 178

Query: 79  KEKTLT 84
             + LT
Sbjct: 179 TTRILT 184


>gnl|CDD|227880 COG5593, COG5593, Nucleic-acid-binding protein possibly involved in
           ribosomal biogenesis [Translation, ribosomal structure
           and biogenesis].
          Length = 821

 Score = 36.2 bits (83), Expect = 0.029
 Identities = 17/45 (37%), Positives = 29/45 (64%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
           E+ N +   E +E++E  +E+ EEE+NK+   K+ KKK++K   K
Sbjct: 760 EQDNSDSFAESSEEDESSEEEKEEEENKEVSAKRAKKKQRKNMLK 804


>gnl|CDD|216833 pfam01991, vATP-synt_E, ATP synthase (E/31 kDa) subunit.  This
          family includes the vacuolar ATP synthase E subunit, as
          well as the archaebacterial ATP synthase E subunit.
          Length = 195

 Score = 35.0 bits (81), Expect = 0.029
 Identities = 17/65 (26%), Positives = 35/65 (53%)

Query: 19 NKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKER 78
           +E E++  E   + +E+ E EK +  ++ +KK ++  +KK+K+ E +    I  +  E 
Sbjct: 3  RQEAEEKAEEIRAEAEEEFEIEKAEAVEEAEKKIEEIYEKKEKQAEMEKQIIISNAKNEA 62

Query: 79 KEKTL 83
          + K L
Sbjct: 63 RLKVL 67



 Score = 32.3 bits (74), Expect = 0.23
 Identities = 25/106 (23%), Positives = 50/106 (47%), Gaps = 4/106 (3%)

Query: 5   RGGAGEKKKRIEKRNKEE-EKEENEQEEKEKEDVEEEKNKKKKK---KKKKKKKKKKKKK 60
           R  A EK + I    +EE E E+ E  E+ ++ +EE   KK+K+   +K+      K + 
Sbjct: 3   RQEAEEKAEEIRAEAEEEFEIEKAEAVEEAEKKIEEIYEKKEKQAEMEKQIIISNAKNEA 62

Query: 61  KKKEEQSVKEIEESLKERKEKTLTEDSQCDTMIPVFKKGVLYKGIY 106
           + K   + +E+ +S+ E  ++ L   S+         K ++ + + 
Sbjct: 63  RLKVLNAREELLDSVFEEAKERLANLSEDKDEYKDLLKDLIKQALV 108


>gnl|CDD|205480 pfam13300, DUF4078, Domain of unknown function (DUF4078).  This
          family is found from fungi to humans, but its exact
          function is not known.
          Length = 88

 Score = 33.4 bits (77), Expect = 0.030
 Identities = 16/48 (33%), Positives = 37/48 (77%)

Query: 18 RNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
          +++EE KE+ E+ EK +E+ E E+ +++++K+K+K+  ++++KK +E 
Sbjct: 40 KDEEERKEQMEELEKAREETERERKEREERKEKRKRAIEERRKKIEER 87



 Score = 32.2 bits (74), Expect = 0.077
 Identities = 16/47 (34%), Positives = 35/47 (74%)

Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
          K+EE+ + + EE EK   E E+ +K+++++K+K+K+  ++++KK E+
Sbjct: 40 KDEEERKEQMEELEKAREETERERKEREERKEKRKRAIEERRKKIEE 86



 Score = 30.7 bits (70), Expect = 0.22
 Identities = 12/47 (25%), Positives = 34/47 (72%)

Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKK 56
          E++++ +    E+ +EE E+E KE+E+ +E++ +  ++++KK ++++
Sbjct: 42 EEERKEQMEELEKAREETERERKEREERKEKRKRAIEERRKKIEERR 88



 Score = 29.2 bits (66), Expect = 0.77
 Identities = 15/46 (32%), Positives = 33/46 (71%)

Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
          E+R ++ E+ E  +EE E+E  E E+ K+K+K+  ++++KK ++++
Sbjct: 43 EERKEQMEELEKAREETERERKEREERKEKRKRAIEERRKKIEERR 88



 Score = 28.4 bits (64), Expect = 1.4
 Identities = 12/44 (27%), Positives = 32/44 (72%)

Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
          ++E+ EE E+  +E E   +E+ ++K+K+K+  ++++KK ++++
Sbjct: 45 RKEQMEELEKAREETERERKEREERKEKRKRAIEERRKKIEERR 88


>gnl|CDD|234368 TIGR03835, termin_org_DnaJ, terminal organelle assembly protein
           TopJ.  This model describes TopJ (MG_200, CbpA), a DnaJ
           homolog and probable assembly protein of the Mycoplasma
           terminal organelle. The terminal organelle is involved
           in both cytadherence and gliding motility [Cellular
           processes, Chemotaxis and motility].
          Length = 871

 Score = 35.9 bits (82), Expect = 0.032
 Identities = 18/46 (39%), Positives = 20/46 (43%), Gaps = 2/46 (4%)

Query: 30  EEKEKEDVEEEKNKKKKKKKKKKKKKKKK--KKKKKEEQSVKEIEE 73
           EE  K    +         KKKKKKKKK     K K+E    E EE
Sbjct: 89  EEINKSGEFDNITDDDTPSKKKKKKKKKGWFWAKSKQESKTIETEE 134



 Score = 28.2 bits (62), Expect = 7.7
 Identities = 19/66 (28%), Positives = 34/66 (51%)

Query: 22  EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
           EE  ++ + +   +D    K KKKKKKK     K K++ K  E + + ++  S+ +  E 
Sbjct: 89  EEINKSGEFDNITDDDTPSKKKKKKKKKGWFWAKSKQESKTIETEEIIDVGASVNQANET 148

Query: 82  TLTEDS 87
            L +D+
Sbjct: 149 RLFDDT 154


>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
           (TAF4) is one of several TAFs that bind TBP and is
           involved in forming Transcription Factor IID (TFIID)
           complex.  The TATA Binding Protein (TBP) Associated
           Factor 4 (TAF4) is one of several TAFs that bind TBP and
           are involved in forming the Transcription Factor IID
           (TFIID) complex. TFIID is one of seven General
           Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
           TFIIF, and TFIID) that are involved in accurate
           initiation of transcription by RNA polymerase II in
           eukaryote. TFIID plays an important role in the
           recognition of promoter DNA and assembly of the
           pre-initiation complex. TFIID complex is composed of the
           TBP and at least 13 TAFs. TAFs from various species were
           originally named by their predicted molecular weight or
           their electrophoretic mobility in polyacrylamide gels. A
           new, unified nomenclature for the pol II TAFs has been
           suggested to show the relationship between TAF orthologs
           and paralogs. Several hypotheses are proposed for TAFs
           functions such as serving as activator-binding sites,
           core-promoter recognition or a role in essential
           catalytic activity. Each TAF, with the help of a
           specific activator, is required only for the expression
           of subset of genes and is not universally involved for
           transcription as are GTFs. In yeast and human cells,
           TAFs have been found as components of other complexes
           besides TFIID.   Several TAFs interact via histone-fold
           (HFD) motifs; HFD is the interaction motif involved in
           heterodimerization of the core histones and their
           assembly into nucleosome octamers. The minimal HFD
           contains three alpha-helices linked by two loops and is
           found in core histones, TAFS and many other
           transcription factors. TFIID has a histone octamer-like
           substructure. TAF4 domain interacts with TAF12 and makes
           a novel histone-like heterodimer that binds DNA and has
           a core promoter function of a subset of genes.
          Length = 212

 Score = 35.0 bits (81), Expect = 0.034
 Identities = 12/45 (26%), Positives = 29/45 (64%)

Query: 22  EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
           E+ E  E+E++++E+ E      K + ++ + K+K K+ +K+E++
Sbjct: 120 EQLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDE 164



 Score = 33.1 bits (76), Expect = 0.14
 Identities = 13/49 (26%), Positives = 29/49 (59%)

Query: 28  EQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLK 76
           EQ E+E+E+  +E+ +++  +  K + ++ + K+K +E   +E EE   
Sbjct: 120 EQLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEEMRH 168



 Score = 31.5 bits (72), Expect = 0.41
 Identities = 12/51 (23%), Positives = 29/51 (56%), Gaps = 1/51 (1%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
           E+ +R E+  ++EE+ E       K   E+ + K+K K+ +K++ ++ + +
Sbjct: 120 EQLEREEEEKRDEEERERL-LRAAKSRSEQSRLKQKAKEMQKEEDEEMRHR 169



 Score = 30.4 bits (69), Expect = 1.2
 Identities = 13/54 (24%), Positives = 28/54 (51%), Gaps = 9/54 (16%)

Query: 30  EEKEKED----VEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERK 79
           E+ E+E+     EEE+ +  +  K + ++ + K+K K+ ++     EE  + R 
Sbjct: 120 EQLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQK-----EEDEEMRH 168


>gnl|CDD|215595 PLN03130, PLN03130, ABC transporter C family member; Provisional.
          Length = 1622

 Score = 35.9 bits (83), Expect = 0.036
 Identities = 15/47 (31%), Positives = 20/47 (42%)

Query: 20  KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
            EE  EE + +   K       N  KK    KKK K+ K    K+E+
Sbjct: 847 VEENGEEEDDQTSSKPVANGNANNLKKDSSSKKKSKEGKSVLIKQEE 893



 Score = 32.0 bits (73), Expect = 0.56
 Identities = 19/57 (33%), Positives = 24/57 (42%), Gaps = 3/57 (5%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
           +K +E   K EE  E   E  E+ED +              KK    KKK KE +SV
Sbjct: 834 QKLMENAGKMEEYVE---ENGEEEDDQTSSKPVANGNANNLKKDSSSKKKSKEGKSV 887



 Score = 28.2 bits (63), Expect = 9.7
 Identities = 11/46 (23%), Positives = 24/46 (52%)

Query: 8   AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKK 53
           AG+ ++ +E+  +EE+ + + +          +K+   KKK K+ K
Sbjct: 840 AGKMEEYVEENGEEEDDQTSSKPVANGNANNLKKDSSSKKKSKEGK 885


>gnl|CDD|223683 COG0610, COG0610, Type I site-specific restriction-modification
           system, R (restriction) subunit and related helicases
           [Defense mechanisms].
          Length = 962

 Score = 35.8 bits (83), Expect = 0.038
 Identities = 20/86 (23%), Positives = 38/86 (44%), Gaps = 1/86 (1%)

Query: 1   MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
              + G     K+ IE+   +E  E+  Q++K  E + E  N+ + KK    +K ++   
Sbjct: 825 FNDKNGAYESLKELIERI-IKEWIEDLRQKKKLIERLIEAINQYRAKKLDTAEKLEELYI 883

Query: 61  KKKEEQSVKEIEESLKERKEKTLTED 86
             K+E+  K+  E     +E+    D
Sbjct: 884 LAKKEEEFKQFAEEEGLNEEELAFYD 909



 Score = 32.0 bits (73), Expect = 0.56
 Identities = 14/52 (26%), Positives = 23/52 (44%), Gaps = 2/52 (3%)

Query: 14  RIEKRNKEEEKEENEQE--EKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
           R+E    EE  +E  +E  E+ +E  E+   K KKK K  +       +  +
Sbjct: 452 RVELELIEESIKEEAEELDERIEEITEDILEKIKKKTKNLEFLAMLAVRLIR 503



 Score = 30.8 bits (70), Expect = 1.4
 Identities = 15/80 (18%), Positives = 31/80 (38%), Gaps = 9/80 (11%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKK---------KKKKKKKKKKKKKKKKEEQ 66
              +K    E  ++  +       E  ++KKK          + + KK    +K ++   
Sbjct: 824 TFNDKNGAYESLKELIERIIKEWIEDLRQKKKLIERLIEAINQYRAKKLDTAEKLEELYI 883

Query: 67  SVKEIEESLKERKEKTLTED 86
             K+ EE  +  +E+ L E+
Sbjct: 884 LAKKEEEFKQFAEEEGLNEE 903



 Score = 30.1 bits (68), Expect = 2.4
 Identities = 11/51 (21%), Positives = 23/51 (45%)

Query: 11  KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
            +  + + + +EE EE ++  +E  +   EK KKK K  +       +  +
Sbjct: 453 VELELIEESIKEEAEELDERIEEITEDILEKIKKKTKNLEFLAMLAVRLIR 503



 Score = 29.7 bits (67), Expect = 3.0
 Identities = 13/71 (18%), Positives = 28/71 (39%)

Query: 11  KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
           + K+++   K EE     ++E+E +   EE+   +++            K  +    + E
Sbjct: 868 RAKKLDTAEKLEELYILAKKEEEFKQFAEEEGLNEEELAFYDDLALNGGKLPENGTELVE 927

Query: 71  IEESLKERKEK 81
                K  +EK
Sbjct: 928 KLAKEKSLREK 938



 Score = 29.3 bits (66), Expect = 4.4
 Identities = 11/54 (20%), Positives = 23/54 (42%)

Query: 7   GAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
           G  E++            +  E   +  E + +EK+ ++K K   K K++ + K
Sbjct: 899 GLNEEELAFYDDLALNGGKLPENGTELVEKLAKEKSLREKNKDDWKAKEEVEAK 952



 Score = 28.9 bits (65), Expect = 5.8
 Identities = 18/70 (25%), Positives = 29/70 (41%), Gaps = 11/70 (15%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKK-----------KKKKKKKKKKKKKKK 60
           KK  E +   EE+  NE+E    +D+     K             K+K  ++K K   K 
Sbjct: 886 KKEEEFKQFAEEEGLNEEELAFYDDLALNGGKLPENGTELVEKLAKEKSLREKNKDDWKA 945

Query: 61  KKKEEQSVKE 70
           K++ E  +K 
Sbjct: 946 KEEVEAKLKA 955



 Score = 28.1 bits (63), Expect = 8.4
 Identities = 11/66 (16%), Positives = 24/66 (36%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
           + R K+ +  E  +E       EEE  +  +++   +++             + E    L
Sbjct: 866 QYRAKKLDTAEKLEELYILAKKEEEFKQFAEEEGLNEEELAFYDDLALNGGKLPENGTEL 925

Query: 76  KERKEK 81
            E+  K
Sbjct: 926 VEKLAK 931


>gnl|CDD|237631 PRK14162, PRK14162, heat shock protein GrpE; Provisional.
          Length = 194

 Score = 34.4 bits (79), Expect = 0.042
 Identities = 13/70 (18%), Positives = 39/70 (55%)

Query: 23 EKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKT 82
          ++E   +++  +E+  +E  KK+ K+  K++ ++K+   +  E+ + +++   K+ ++K 
Sbjct: 3  KEEFPSEKDLPQEETTDEAPKKEAKEAPKEEDQEKQNPVEDLEKEIADLKAKNKDLEDKY 62

Query: 83 LTEDSQCDTM 92
          L   ++   M
Sbjct: 63 LRSQAEIQNM 72



 Score = 27.5 bits (61), Expect = 8.9
 Identities = 14/53 (26%), Positives = 25/53 (47%), Gaps = 5/53 (9%)

Query: 19 NKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKK-----KKKKKKKEEQ 66
           K+  +EE   E  +KE  E  K + ++K+   +  +K     K K K  E++
Sbjct: 9  EKDLPQEETTDEAPKKEAKEAPKEEDQEKQNPVEDLEKEIADLKAKNKDLEDK 61


>gnl|CDD|235461 PRK05431, PRK05431, seryl-tRNA synthetase; Provisional.
          Length = 425

 Score = 35.4 bits (83), Expect = 0.045
 Identities = 17/81 (20%), Positives = 39/81 (48%), Gaps = 11/81 (13%)

Query: 12 KKRIEKRNKEEEKEE-NEQEEKEKE---DVEEEKNKKKKKKKK--KKKKKKKKKKKKKEE 65
          K+ + KR    + +E  E +E+ +E   ++EE + ++    K+  + K+K +  +    E
Sbjct: 15 KEALAKRGFPLDVDELLELDEERRELQTELEELQAERNALSKEIGQAKRKGEDAEALIAE 74

Query: 66 -----QSVKEIEESLKERKEK 81
               + +K +E  L E + +
Sbjct: 75 VKELKEEIKALEAELDELEAE 95


>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II).  Bone
           sialoprotein (BSP) is a major structural protein of the
           bone matrix that is specifically expressed by
           fully-differentiated osteoblasts. The expression of bone
           sialoprotein (BSP) is normally restricted to mineralised
           connective tissues of bones and teeth where it has been
           associated with mineral crystal formation. However, it
           has been found that ectopic expression of BSP occurs in
           various lesions, including oral and extraoral
           carcinomas, in which it has been associated with the
           formation of microcrystalline deposits and the
           metastasis of cancer cells to bone.
          Length = 291

 Score = 35.1 bits (80), Expect = 0.045
 Identities = 15/38 (39%), Positives = 23/38 (60%)

Query: 4   RRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEK 41
           ++ G   KK   E  + E+E+EE E+EE+E E  E E+
Sbjct: 120 KKAGNAGKKATKEDESDEDEEEEEEEEEEEAEVEENEQ 157



 Score = 32.4 bits (73), Expect = 0.31
 Identities = 16/41 (39%), Positives = 28/41 (68%), Gaps = 1/41 (2%)

Query: 3   KRRGGAGEKKKRIEKRNKEEEKEE-NEQEEKEKEDVEEEKN 42
           K+ G AG+K  + ++ +++EE+EE  E+EE E E+ E+  N
Sbjct: 120 KKAGNAGKKATKEDESDEDEEEEEEEEEEEAEVEENEQGTN 160


>gnl|CDD|218391 pfam05029, TIMELESS_C, Timeless protein C terminal region.  The
           timeless (tim) gene is essential for circadian function
           in Drosophila. Putative homologues of Drosophila tim
           have been identified in both mice and humans (mTim and
           hTIM, respectively). Mammalian TIM is not the true
           orthologue of Drosophila TIM, but is the likely
           orthologue of a fly gene, timeout (also called tim-2).
           mTim has been shown to be essential for embryonic
           development, but does not have substantiated circadian
           function. Some family members contain a SANT domain in
           this region.
          Length = 507

 Score = 35.5 bits (81), Expect = 0.046
 Identities = 12/76 (15%), Positives = 37/76 (48%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
           E+  ++E K++ + E+ ++ + E    +K  +++K      +++   +EEQ     ++ L
Sbjct: 382 EEEPEDELKDDVDGEQADESEHETLALRKNARQRKAGLASPEEEALGEEEQKAPPKKKQL 441

Query: 76  KERKEKTLTEDSQCDT 91
            ++ ++     +  D 
Sbjct: 442 NQKNKQQTGSGTNSDE 457



 Score = 29.3 bits (65), Expect = 3.7
 Identities = 16/73 (21%), Positives = 33/73 (45%), Gaps = 2/73 (2%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
             KKR +K   ++   E   E+  +ED EEE+     + K ++ +K+    + +    + 
Sbjct: 225 GGKKRRKKLKPKQPNGEESGEDDFQEDPEEEEQ--LPESKPEETEKRVSAFQVEGSTLIS 282

Query: 70  EIEESLKERKEKT 82
                 + ++EKT
Sbjct: 283 AENLRQQLKQEKT 295



 Score = 28.5 bits (63), Expect = 6.1
 Identities = 18/96 (18%), Positives = 43/96 (44%), Gaps = 10/96 (10%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKK------ 55
           R+    +GE+++  ++   + + E+ ++ E E   + +   ++K      +++       
Sbjct: 373 RRAASLSGEEEEPEDELKDDVDGEQADESEHETLALRKNARQRKAGLASPEEEALGEEEQ 432

Query: 56  ----KKKKKKKKEEQSVKEIEESLKERKEKTLTEDS 87
               KKK+  +K +Q       S +ER + +L ED 
Sbjct: 433 KAPPKKKQLNQKNKQQTGSGTNSDEERDDTSLDEDR 468


>gnl|CDD|235971 PRK07219, PRK07219, DNA topoisomerase I; Validated.
          Length = 822

 Score = 35.4 bits (82), Expect = 0.046
 Identities = 10/72 (13%), Positives = 24/72 (33%), Gaps = 4/72 (5%)

Query: 3   KRRGGAGEKKK----RIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKK 58
           + +GG G++            ++++KE+  + E E       K  +K K    +      
Sbjct: 745 RVKGGFGDELGCCNNPKCNYTEKQKKEKESKSELEALKGVGAKTAEKLKDAGVETVTDLT 804

Query: 59  KKKKKEEQSVKE 70
                   +  +
Sbjct: 805 AADPDAVAAKVD 816



 Score = 32.7 bits (75), Expect = 0.34
 Identities = 12/78 (15%), Positives = 27/78 (34%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
           R + G   E       +    EK++ E+E K + +  +    K  +K K    +      
Sbjct: 745 RVKGGFGDELGCCNNPKCNYTEKQKKEKESKSELEALKGVGAKTAEKLKDAGVETVTDLT 804

Query: 62  KKEEQSVKEIEESLKERK 79
             +  +V    + +   +
Sbjct: 805 AADPDAVAAKVDGVSADR 822



 Score = 30.0 bits (68), Expect = 2.1
 Identities = 8/56 (14%), Positives = 22/56 (39%), Gaps = 1/56 (1%)

Query: 28  EQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIE-ESLKERKEKT 82
              EK+K++ E +   +  K    K  +K K    +    +   + +++  + +  
Sbjct: 763 NYTEKQKKEKESKSELEALKGVGAKTAEKLKDAGVETVTDLTAADPDAVAAKVDGV 818


>gnl|CDD|221049 pfam11262, Tho2, Transcription factor/nuclear export subunit
          protein 2.  THO and TREX form a eukaryotic complex
          which functions in messenger ribonucleoprotein
          metabolism and plays a role in preventing the
          transcription-associated genetic instability. Tho2,
          along with four other subunits forms THO.
          Length = 296

 Score = 35.0 bits (81), Expect = 0.050
 Identities = 17/57 (29%), Positives = 28/57 (49%)

Query: 27 NEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTL 83
           E  +KE E +E++  +         KKKK+KK+ K   + ++E  +   E  EKT 
Sbjct: 26 KELYDKEIERLEKQIKELDSSSSGIDKKKKEKKRLKSLIKKLEEELKKHIEHNEKTK 82



 Score = 31.5 bits (72), Expect = 0.60
 Identities = 19/63 (30%), Positives = 35/63 (55%), Gaps = 1/63 (1%)

Query: 19 NKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKER 78
          +KE E+ E + +E +      +K KKK+KK+ K   KK +++ KK  +  ++ ++ L E 
Sbjct: 30 DKEIERLEKQIKELDSSSSGIDK-KKKEKKRLKSLIKKLEEELKKHIEHNEKTKKRLSEE 88

Query: 79 KEK 81
          K  
Sbjct: 89 KSS 91


>gnl|CDD|111875 pfam03032, Brevenin, Brevenin/esculentin/gaegurin/rugosin family.
           This family contains a number of defence peptides
          secreted from the skin of amphibians, including the
          opiate-like dermorphins and deltorphins, and the
          antimicrobial dermoseptins and temporins. The alignment
          for this family includes the signal peptide.
          Length = 46

 Score = 31.5 bits (72), Expect = 0.050
 Identities = 11/25 (44%), Positives = 19/25 (76%)

Query: 21 EEEKEENEQEEKEKEDVEEEKNKKK 45
          EEEK E+E+E +++E+ EE+   K+
Sbjct: 22 EEEKREDEEENEDEEEGEEQSEVKR 46



 Score = 26.9 bits (60), Expect = 2.2
 Identities = 10/23 (43%), Positives = 16/23 (69%)

Query: 19 NKEEEKEENEQEEKEKEDVEEEK 41
           K E++EENE EE+ +E  E ++
Sbjct: 24 EKREDEEENEDEEEGEEQSEVKR 46



 Score = 25.8 bits (57), Expect = 5.5
 Identities = 8/23 (34%), Positives = 19/23 (82%)

Query: 16 EKRNKEEEKEENEQEEKEKEDVE 38
          E++ ++EE+ E+E+E +E+ +V+
Sbjct: 23 EEKREDEEENEDEEEGEEQSEVK 45


>gnl|CDD|240578 cd12951, RRP7_Rrp7A, RRP7 domain ribosomal RNA-processing protein
          7 homolog A (Rrp7A) and similar proteins.  The family
          corresponds to the RRP7 domain of Rrp7A, also termed
          gastric cancer antigen Zg14, and similar proteins which
          are yeast ribosomal RNA-processing protein 7 (Rrp7p)
          homologs mainly found in Metazoans. The cellular
          function of Rrp7A remains unclear currently. Rrp7A
          harbors an N-terminal RNA recognition motif (RRM), also
          termed RBD (RNA binding domain) or RNP
          (ribonucleoprotein domain), and a C-terminal RRP7
          domain.
          Length = 129

 Score = 33.4 bits (77), Expect = 0.052
 Identities = 22/56 (39%), Positives = 31/56 (55%), Gaps = 13/56 (23%)

Query: 22 EEKEENEQEEKEKEDVEEE-------------KNKKKKKKKKKKKKKKKKKKKKKE 64
          +++EE E+EEKEKE   +E             K  +K+    K  +K+KKKKKKKE
Sbjct: 31 DKEEEEEKEEKEKEAEPDEDGWVTVTKKGRRPKTARKESVAAKAAEKEKKKKKKKE 86



 Score = 30.3 bits (69), Expect = 0.56
 Identities = 18/58 (31%), Positives = 32/58 (55%), Gaps = 6/58 (10%)

Query: 10 EKKKRIEKRNKEEEKEENE------QEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
          +K++  EK  KE+E E +E       ++  +     +++   K  +K+KKKKKKK+ K
Sbjct: 31 DKEEEEEKEEKEKEAEPDEDGWVTVTKKGRRPKTARKESVAAKAAEKEKKKKKKKELK 88


>gnl|CDD|236081 PRK07735, PRK07735, NADH dehydrogenase subunit C; Validated.
          Length = 430

 Score = 35.0 bits (80), Expect = 0.053
 Identities = 23/77 (29%), Positives = 32/77 (41%), Gaps = 1/77 (1%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDV-EEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
           E K+R     K +     +Q+ +  E+V EEEK K K K     K K     K+K E + 
Sbjct: 58  EAKRRAAAAAKAKAAALAKQKREGTEEVTEEEKAKAKAKAAAAAKAKAAALAKQKREGTE 117

Query: 69  KEIEESLKERKEKTLTE 85
           +  EE     K K    
Sbjct: 118 EVTEEEKAAAKAKAAAA 134



 Score = 33.8 bits (77), Expect = 0.12
 Identities = 24/79 (30%), Positives = 32/79 (40%), Gaps = 4/79 (5%)

Query: 3   KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
           K +  A  K K      ++ E  E   EE+E+ D  +EK K K     K K     K+K 
Sbjct: 128 KAKAAAAAKAKAAALAKQKREGTEEVTEEEEETD--KEKAKAKAAAAAKAKAAALAKQKA 185

Query: 63  KEEQSVKEIEESLKERKEK 81
            E       EE  +E K K
Sbjct: 186 AEAGE--GTEEVTEEEKAK 202



 Score = 32.6 bits (74), Expect = 0.32
 Identities = 31/105 (29%), Positives = 37/105 (35%), Gaps = 26/105 (24%)

Query: 3   KRRGGAGEKKKRIE-KRNKEEEKEENEQEEKEK-------------------------ED 36
           KRR  A  K K     + K E  EE  +EEK K                         E 
Sbjct: 60  KRRAAAAAKAKAAALAKQKREGTEEVTEEEKAKAKAKAAAAAKAKAAALAKQKREGTEEV 119

Query: 37  VEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
            EEEK   K K     K K     K+K E + +  EE  +  KEK
Sbjct: 120 TEEEKAAAKAKAAAAAKAKAAALAKQKREGTEEVTEEEEETDKEK 164



 Score = 30.7 bits (69), Expect = 1.3
 Identities = 19/74 (25%), Positives = 28/74 (37%)

Query: 8   AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
           A  K K       +      ++ E  +E  EEE+   K+K K K     K K     +Q 
Sbjct: 125 AAAKAKAAAAAKAKAAALAKQKREGTEEVTEEEEETDKEKAKAKAAAAAKAKAAALAKQK 184

Query: 68  VKEIEESLKERKEK 81
             E  E  +E  E+
Sbjct: 185 AAEAGEGTEEVTEE 198



 Score = 30.3 bits (68), Expect = 1.6
 Identities = 15/56 (26%), Positives = 33/56 (58%)

Query: 30 EEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTE 85
           EK+ ED+++E  ++ K++ +K+   K   +  K E+  +E E++L +  + T+ E
Sbjct: 3  PEKDLEDLKKEAARRAKEEARKRLVAKHGAEISKLEEENREKEKALPKNDDMTIEE 58



 Score = 30.3 bits (68), Expect = 1.6
 Identities = 18/68 (26%), Positives = 32/68 (47%)

Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
          K   E    EEE  E E+   + +D+  E+ K++     K K     K+K++  + V E 
Sbjct: 29 KHGAEISKLEEENREKEKALPKNDDMTIEEAKRRAAAAAKAKAAALAKQKREGTEEVTEE 88

Query: 72 EESLKERK 79
          E++  + K
Sbjct: 89 EKAKAKAK 96



 Score = 30.3 bits (68), Expect = 1.8
 Identities = 25/103 (24%), Positives = 40/103 (38%), Gaps = 21/103 (20%)

Query: 4   RRGGAGEKKKRIEKRNKEEEKEENEQEEKEK-------EDVEEEKNKKKKKKKKKKKKKK 56
           RR     +K+ + K   E  K E E  EKEK         +EE K +     K K     
Sbjct: 16  RRAKEEARKRLVAKHGAEISKLEEENREKEKALPKNDDMTIEEAKRRAAAAAKAKAAALA 75

Query: 57  KKKKKKKEEQSVKEIEE--------------SLKERKEKTLTE 85
           K+K++  EE + +E  +              +L ++K +   E
Sbjct: 76  KQKREGTEEVTEEEKAKAKAKAAAAAKAKAAALAKQKREGTEE 118


>gnl|CDD|217834 pfam03998, Utp11, Utp11 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 239

 Score = 34.7 bits (80), Expect = 0.055
 Identities = 19/86 (22%), Positives = 45/86 (52%)

Query: 21  EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
            + +E   +  + ++    ++ +KKK  KKK+K  K+ K++K+ E+ +K++E+ L+ ++E
Sbjct: 150 LDRRENRPRISQLEKTSLVDEKQKKKSAKKKRKLYKELKERKEREKKLKKVEQRLELQRE 209

Query: 81  KTLTEDSQCDTMIPVFKKGVLYKGIY 106
                  +   ++      V+YK   
Sbjct: 210 LMKKGKGKKKKIVKDKDGKVVYKWKK 235



 Score = 33.9 bits (78), Expect = 0.093
 Identities = 19/62 (30%), Positives = 33/62 (53%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
           +K+R    E K+R E+  K ++ E+  + ++E     + K KK  K K  K   K KK++
Sbjct: 178 KKKRKLYKELKERKEREKKLKKVEQRLELQRELMKKGKGKKKKIVKDKDGKVVYKWKKER 237

Query: 62  KK 63
           K+
Sbjct: 238 KR 239



 Score = 33.1 bits (76), Expect = 0.16
 Identities = 19/54 (35%), Positives = 29/54 (53%)

Query: 9   GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
             KKKR   +  +E KE  ++ +K ++ +E ++   KK K KKKK  K K  K 
Sbjct: 176 SAKKKRKLYKELKERKEREKKLKKVEQRLELQRELMKKGKGKKKKIVKDKDGKV 229



 Score = 32.7 bits (75), Expect = 0.19
 Identities = 19/53 (35%), Positives = 30/53 (56%)

Query: 11  KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
           KK   +KR   +E +E ++ EK+ + VE+    +++  KK K KKKK  K K 
Sbjct: 174 KKSAKKKRKLYKELKERKEREKKLKKVEQRLELQRELMKKGKGKKKKIVKDKD 226



 Score = 32.0 bits (73), Expect = 0.37
 Identities = 15/63 (23%), Positives = 32/63 (50%)

Query: 1   MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
            +  +      K+  E++ +E++ ++ EQ  + + ++ ++   KKKK  K K  K   K 
Sbjct: 174 KKSAKKKRKLYKELKERKEREKKLKKVEQRLELQRELMKKGKGKKKKIVKDKDGKVVYKW 233

Query: 61  KKK 63
           KK+
Sbjct: 234 KKE 236



 Score = 30.0 bits (68), Expect = 1.5
 Identities = 17/73 (23%), Positives = 38/73 (52%), Gaps = 3/73 (4%)

Query: 19  NKEEEKEEN--EQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE-SL 75
           ++EE+K  +  E  +   E ++  +N+ +  + +K     +K+KKK  ++  K  +E   
Sbjct: 130 DEEEQKSFDPAEYFDTTPELLDRRENRPRISQLEKTSLVDEKQKKKSAKKKRKLYKELKE 189

Query: 76  KERKEKTLTEDSQ 88
           ++ +EK L +  Q
Sbjct: 190 RKEREKKLKKVEQ 202


>gnl|CDD|179699 PRK03992, PRK03992, proteasome-activating nucleotidase;
          Provisional.
          Length = 389

 Score = 34.8 bits (81), Expect = 0.056
 Identities = 15/50 (30%), Positives = 30/50 (60%), Gaps = 1/50 (2%)

Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKK 59
          E+ + +E+RN  E +E+  Q E +  D+E E  K +++ ++ K + +K K
Sbjct: 1  ERLEALEERN-SELEEQIRQLELKLRDLEAENEKLERELERLKSELEKLK 49


>gnl|CDD|218337 pfam04939, RRS1, Ribosome biogenesis regulatory protein (RRS1).
           This family consists of several eukaryotic ribosome
           biogenesis regulatory (RRS1) proteins. RRS1 is a nuclear
           protein that is essential for the maturation of 25 S
           rRNA and the 60 S ribosomal subunit assembly in
           Saccharomyces cerevisiae.
          Length = 164

 Score = 33.8 bits (78), Expect = 0.057
 Identities = 23/77 (29%), Positives = 34/77 (44%), Gaps = 26/77 (33%)

Query: 11  KKKRIEKRNKE-----EEKEE----------NEQEEKE-----------KEDVEEEKNKK 44
           KKK I+KR K      E   E          N+  EK+           KED   +K ++
Sbjct: 88  KKKGIKKRKKSKLVYDEATGEWVPRWGYKRANKDLEKDWIIEVKPNDDPKEDPFAKKRRE 147

Query: 45  KKKKKKKKKKKKKKKKK 61
           KK++  K +K++ K KK
Sbjct: 148 KKERVAKNEKRELKNKK 164


>gnl|CDD|114629 pfam05917, DUF874, Helicobacter pylori protein of unknown function
           (DUF874).  This family consists of several hypothetical
           proteins specific to Helicobacter pylori. The function
           of this family is unknown.
          Length = 417

 Score = 34.9 bits (79), Expect = 0.057
 Identities = 21/73 (28%), Positives = 49/73 (67%), Gaps = 1/73 (1%)

Query: 20  KEEEKEENEQEEKEKEDVEEEKNKKK-KKKKKKKKKKKKKKKKKKEEQSVKEIEESLKER 78
           +EE+K E E+++ EKE +E   ++ K +++K+K +++K+K +++K++ S    + +++  
Sbjct: 153 QEEQKTEQEKQKTEKEGIELANSQIKAEQEKQKTEQEKQKTEQEKQKTSNIANKNAIELE 212

Query: 79  KEKTLTEDSQCDT 91
           +EK  TE+ + D 
Sbjct: 213 QEKQKTENEKQDL 225



 Score = 33.7 bits (76), Expect = 0.13
 Identities = 17/70 (24%), Positives = 43/70 (61%)

Query: 8   AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
           A  + K  +++ K E++++  ++EK+K      KN  + +++K+K + +K+   K+++  
Sbjct: 173 ANSQIKAEQEKQKTEQEKQKTEQEKQKTSNIANKNAIELEQEKQKTENEKQDLIKEQKDF 232

Query: 68  VKEIEESLKE 77
           +KE E++ +E
Sbjct: 233 IKEAEQNCQE 242



 Score = 32.9 bits (74), Expect = 0.25
 Identities = 20/70 (28%), Positives = 44/70 (62%), Gaps = 2/70 (2%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
           E++  E+EK++ E+E  E  + + +  ++K+K +++K+K +++K+K     +   IE  L
Sbjct: 154 EEQKTEQEKQKTEKEGIELANSQIKAEQEKQKTEQEKQKTEQEKQKTSNIANKNAIE--L 211

Query: 76  KERKEKTLTE 85
           ++ K+KT  E
Sbjct: 212 EQEKQKTENE 221



 Score = 31.0 bits (69), Expect = 1.1
 Identities = 20/77 (25%), Positives = 41/77 (53%), Gaps = 6/77 (7%)

Query: 21  EEEKEENEQEEKEKEDVEEEKNK------KKKKKKKKKKKKKKKKKKKKEEQSVKEIEES 74
           +++K E  Q +KE E+  +  NK      ++++K +++K+K +K+  +     +K  +E 
Sbjct: 124 QDKKIELAQAKKEAENARDRANKSGIELEQEEQKTEQEKQKTEKEGIELANSQIKAEQEK 183

Query: 75  LKERKEKTLTEDSQCDT 91
            K  +EK  TE  +  T
Sbjct: 184 QKTEQEKQKTEQEKQKT 200



 Score = 30.6 bits (68), Expect = 1.3
 Identities = 17/79 (21%), Positives = 42/79 (53%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E++K+  ++   E      + E+EK+  E+EK K +++K+K      K   + ++E+   
Sbjct: 159 EQEKQKTEKEGIELANSQIKAEQEKQKTEQEKQKTEQEKQKTSNIANKNAIELEQEKQKT 218

Query: 70  EIEESLKERKEKTLTEDSQ 88
           E E+    +++K   ++++
Sbjct: 219 ENEKQDLIKEQKDFIKEAE 237


>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1.  All
           proteins in this family for which functions are known
           are cyclin dependent protein kinases that are components
           of TFIIH, a complex that is involved in nucleotide
           excision repair and transcription initiation. Also known
           as MAT1 (menage a trois 1). This family is based on the
           phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
           Stanford University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 309

 Score = 34.8 bits (80), Expect = 0.059
 Identities = 26/113 (23%), Positives = 54/113 (47%), Gaps = 15/113 (13%)

Query: 10  EKKKRI-EKRNKE-----EEKEENEQEE-KEKEDVEEEKNKKKKK--KKKKKKKKKKKKK 60
           +KK    +K NK+     +EK   EQEE +E  + E+E+ ++++   +K++++++  K+K
Sbjct: 119 KKKIETYQKENKDVIQKNKEKSTREQEELEEALEFEKEEEEQRRLLLQKEEEEQQMNKRK 178

Query: 61  KKKE-----EQSVKEIEESLKERKEKTLTEDSQCDTMIPVFKKGVLYKGIYCP 108
            K+      E S     E + + K+ ++  + Q +      K      GI   
Sbjct: 179 NKQALLDELETSTLPAAELIAQHKKNSVKLEMQVEKP-KPEKPNTFSTGIKMG 230


>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
           [Function unknown].
          Length = 294

 Score = 34.7 bits (80), Expect = 0.060
 Identities = 20/85 (23%), Positives = 41/85 (48%), Gaps = 5/85 (5%)

Query: 1   MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
           +RK      E+     + +K+ ++   E    + E    E  KK K  + K+K  K+++K
Sbjct: 205 LRKEADELHEE---FVELSKKIDELHEEFRNLQNE--LRELEKKIKALRAKEKAAKRREK 259

Query: 61  KKKEEQSVKEIEESLKERKEKTLTE 85
           +++ ++  +EI E  K  ++ T  E
Sbjct: 260 REELKERAEEIYEKFKRGEKLTTEE 284



 Score = 33.5 bits (77), Expect = 0.13
 Identities = 20/89 (22%), Positives = 44/89 (49%), Gaps = 3/89 (3%)

Query: 3   KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
           +   G    K    +  + E+K++      E+E    +K K+ +K+ +  KK  ++ +K 
Sbjct: 101 EFNLGGRSIKSLEREIERLEKKQQTSVLTPEEERELVQKIKELRKELEDAKKALEENEKL 160

Query: 63  KEEQS-VKEIEESLKERKEK--TLTEDSQ 88
           KE ++ + E+++  +E  EK   L  ++Q
Sbjct: 161 KELKAEIDELKKKAREIHEKIQELANEAQ 189



 Score = 33.1 bits (76), Expect = 0.19
 Identities = 24/106 (22%), Positives = 54/106 (50%), Gaps = 9/106 (8%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQE------EKEKEDVEEEKNKKKKKKKKKKKKK 55
           +++R       + I+   +E E+ E +Q+      E+E+E V++ K  +K+ +  KK  +
Sbjct: 96  KEKRNEFNLGGRSIKSLEREIERLEKKQQTSVLTPEEERELVQKIKELRKELEDAKKALE 155

Query: 56  KKKKKK--KKEEQSVKEIEESLKERKEKTLTEDSQC-DTMIPVFKK 98
           + +K K  K E   +K+    + E+ ++   E  +  + MI +F++
Sbjct: 156 ENEKLKELKAEIDELKKKAREIHEKIQELANEAQEYHEEMIKLFEE 201



 Score = 30.8 bits (70), Expect = 0.90
 Identities = 17/71 (23%), Positives = 43/71 (60%), Gaps = 2/71 (2%)

Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
          E K++  K   EE KE+ ++  KE  ++ E++++   K ++ ++K ++ ++++ E    +
Sbjct: 12 ELKRKQLKEEIEELKEKRDELRKEASELAEKRDELNAKVRELREKAQELREERDE--INE 69

Query: 70 EIEESLKERKE 80
          E++E  ++R E
Sbjct: 70 EVQELKEKRDE 80



 Score = 30.4 bits (69), Expect = 1.2
 Identities = 17/73 (23%), Positives = 40/73 (54%), Gaps = 2/73 (2%)

Query: 10  EKKKRIEKRNK-EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
           E+++ ++K  +  +E E+ ++  +E E ++E K +  + KKK ++  +K ++   E Q  
Sbjct: 132 EERELVQKIKELRKELEDAKKALEENEKLKELKAEIDELKKKAREIHEKIQELANEAQEY 191

Query: 69  KE-IEESLKERKE 80
            E + +  +E  E
Sbjct: 192 HEEMIKLFEEADE 204



 Score = 29.3 bits (66), Expect = 2.8
 Identities = 17/77 (22%), Positives = 41/77 (53%), Gaps = 6/77 (7%)

Query: 10 EKKKRIEKR----NKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
           + KR + +      +E+++E  +E  E  +  +E N K ++ ++K ++ ++++ +  EE
Sbjct: 11 LELKRKQLKEEIEELKEKRDELRKEASELAEKRDELNAKVRELREKAQELREERDEINEE 70

Query: 66 QSVKEIEESLKERKEKT 82
            V+E++E   E   K 
Sbjct: 71 --VQELKEKRDEINAKL 85



 Score = 28.9 bits (65), Expect = 4.3
 Identities = 21/86 (24%), Positives = 45/86 (52%), Gaps = 2/86 (2%)

Query: 3   KRRGGAGEKKKRIEKRN--KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
           KR     + ++  EK    +EE  E NE+ ++ KE  +E   K ++ +K+ ++ K+K+ +
Sbjct: 42  KRDELNAKVRELREKAQELREERDEINEEVQELKEKRDEINAKLQELRKEYRELKEKRNE 101

Query: 61  KKKEEQSVKEIEESLKERKEKTLTED 86
                +S+K +E  ++  ++K  T  
Sbjct: 102 FNLGGRSIKSLEREIERLEKKQQTSV 127



 Score = 28.5 bits (64), Expect = 5.6
 Identities = 15/72 (20%), Positives = 40/72 (55%)

Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
          EK+  + K   E  ++ +E   K +E  E+ +  ++++ +  ++ ++ K+K+ +    ++
Sbjct: 27 EKRDELRKEASELAEKRDELNAKVRELREKAQELREERDEINEEVQELKEKRDEINAKLQ 86

Query: 70 EIEESLKERKEK 81
          E+ +  +E KEK
Sbjct: 87 ELRKEYRELKEK 98



 Score = 28.5 bits (64), Expect = 5.9
 Identities = 13/68 (19%), Positives = 38/68 (55%)

Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
          ++  E   + ++ KE++EE K K+ + +K+  +  +K+ +   + + ++E  + L+E ++
Sbjct: 6  DKLDELELKRKQLKEEIEELKEKRDELRKEASELAEKRDELNAKVRELREKAQELREERD 65

Query: 81 KTLTEDSQ 88
          +   E  +
Sbjct: 66 EINEEVQE 73



 Score = 28.1 bits (63), Expect = 6.3
 Identities = 17/79 (21%), Positives = 37/79 (46%)

Query: 3   KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
            R+     K+KR E        +  E+E +  E  ++      +++++  +K K+ +K+ 
Sbjct: 88  LRKEYRELKEKRNEFNLGGRSIKSLEREIERLEKKQQTSVLTPEEERELVQKIKELRKEL 147

Query: 63  KEEQSVKEIEESLKERKEK 81
           ++ +   E  E LKE K +
Sbjct: 148 EDAKKALEENEKLKELKAE 166



 Score = 27.7 bits (62), Expect = 8.5
 Identities = 19/68 (27%), Positives = 37/68 (54%), Gaps = 2/68 (2%)

Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE--QSVKEIEE 73
          E   K ++ +E  +E KEK D   ++  +  +K+ +   K ++ ++K +E  +   EI E
Sbjct: 10 ELELKRKQLKEEIEELKEKRDELRKEASELAEKRDELNAKVRELREKAQELREERDEINE 69

Query: 74 SLKERKEK 81
           ++E KEK
Sbjct: 70 EVQELKEK 77


>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain.  This
           family represents the C-terminus (approximately 300
           residues) of proteins that are involved as binding
           partners for Prp19 as part of the nuclear pore complex.
           The family in Drosophila is necessary for pre-mRNA
           splicing, and the human protein has been found in
           purifications of the spliceosome. In the past this
           family was thought, erroneously, to be associated with
           microfibrillin.
          Length = 277

 Score = 34.5 bits (79), Expect = 0.061
 Identities = 23/91 (25%), Positives = 49/91 (53%), Gaps = 21/91 (23%)

Query: 16  EKRNKEEEKEENEQEE-------------------KEKEDVEEEKNKKKKKKKKKKKKKK 56
           E  + EEE+EE+E+EE                   K++  ++E + +  K+K  +++ K+
Sbjct: 10  EDESGEEEEEESEEEEETDSEDDMEPRLKPVFTRKKDRITIQEREREAAKEKALEEEAKR 69

Query: 57  KKKKKKKEEQSV--KEIEESLKERKEKTLTE 85
           K +++K+E   +  +E+++ L+ +K  TL E
Sbjct: 70  KAEERKRETLKIVEEEVKKELELKKRNTLLE 100



 Score = 31.8 bits (72), Expect = 0.50
 Identities = 21/72 (29%), Positives = 42/72 (58%), Gaps = 4/72 (5%)

Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKK----KKKKKKKKEEQSVKEI 71
          E+ ++  E+EE E EE+E+ D E++   + K    +KK +    +++++  KE+   +E 
Sbjct: 8  EEEDESGEEEEEESEEEEETDSEDDMEPRLKPVFTRKKDRITIQEREREAAKEKALEEEA 67

Query: 72 EESLKERKEKTL 83
          +   +ERK +TL
Sbjct: 68 KRKAEERKRETL 79



 Score = 28.0 bits (62), Expect = 7.7
 Identities = 17/54 (31%), Positives = 28/54 (51%), Gaps = 7/54 (12%)

Query: 16  EKRNKEEEKEENEQEEKEKEDV-------EEEKNKKKKKKKKKKKKKKKKKKKK 62
           E +  + ++EE E+ E+EK ++       EEE+  + +K  K    K KK K K
Sbjct: 124 ELKRIKRDREEREEMEREKAEIEKMRNMTEEERRAELRKNPKVSTNKAKKGKYK 177


>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family.  This family of proteins includes
           MND1 from S. cerevisiae. The mnd1 protein forms a
           complex with hop2 to promote homologous chromosome
           pairing and meiotic double-strand break repair.
          Length = 188

 Score = 34.1 bits (79), Expect = 0.062
 Identities = 18/85 (21%), Positives = 44/85 (51%), Gaps = 6/85 (7%)

Query: 2   RKRRGGAGEKKKRIEK-RNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
            K +    + KK +E+ + +  E +   ++ K+  +  EE+ +  ++ K+ +K+ KK K 
Sbjct: 65  NKLKTRLEKLKKELEELKQRIAELQAQIEKLKKGREETEERTELLEELKQLEKELKKLKA 124

Query: 61  K-----KKEEQSVKEIEESLKERKE 80
           +     K + + +++++E  K  KE
Sbjct: 125 ELEKYEKNDPERIEKLKEETKVAKE 149



 Score = 29.1 bits (66), Expect = 2.7
 Identities = 21/64 (32%), Positives = 33/64 (51%), Gaps = 8/64 (12%)

Query: 10  EKKKRIEKRNKE-EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKK-------KKKKKKKK 61
           E + +IEK  K  EE EE  +  +E + +E+E  K K + +K +K       K K++ K 
Sbjct: 87  ELQAQIEKLKKGREETEERTELLEELKQLEKELKKLKAELEKYEKNDPERIEKLKEETKV 146

Query: 62  KKEE 65
            KE 
Sbjct: 147 AKEA 150



 Score = 28.7 bits (65), Expect = 2.9
 Identities = 11/65 (16%), Positives = 39/65 (60%), Gaps = 1/65 (1%)

Query: 12  KKRIEKRNKEEE-KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
           +   + + + E+ K+E E+ ++   +++ +  K KK +++ +++ +  ++ K+ E+ +K+
Sbjct: 62  QALNKLKTRLEKLKKELEELKQRIAELQAQIEKLKKGREETEERTELLEELKQLEKELKK 121

Query: 71  IEESL 75
           ++  L
Sbjct: 122 LKAEL 126



 Score = 28.0 bits (63), Expect = 5.6
 Identities = 15/72 (20%), Positives = 40/72 (55%), Gaps = 2/72 (2%)

Query: 21  EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
           +   +   + EK K+++EE K +  + + + +K KK +++ ++  + ++E+++  KE K+
Sbjct: 62  QALNKLKTRLEKLKKELEELKQRIAELQAQIEKLKKGREETEERTELLEELKQLEKELKK 121

Query: 81  --KTLTEDSQCD 90
               L +  + D
Sbjct: 122 LKAELEKYEKND 133


>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
           Provisional.
          Length = 695

 Score = 34.9 bits (81), Expect = 0.065
 Identities = 17/66 (25%), Positives = 25/66 (37%)

Query: 8   AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
           A E K R E R    E+E+  +E + K+  E    K K        + K KK    +   
Sbjct: 448 AEEAKARFEARQARLEREKAAREARHKKAAEARAAKDKDAVAAALARVKAKKAAATQPIV 507

Query: 68  VKEIEE 73
           +K    
Sbjct: 508 IKAGAR 513



 Score = 34.2 bits (79), Expect = 0.10
 Identities = 16/69 (23%), Positives = 36/69 (52%), Gaps = 4/69 (5%)

Query: 14  RIEK---RNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
           R  K   R  E+EK++ E E K + +  + + +++K  ++ + KK  + +  K++ +V  
Sbjct: 432 RQAKAEIRAIEQEKKKAE-EAKARFEARQARLEREKAAREARHKKAAEARAAKDKDAVAA 490

Query: 71  IEESLKERK 79
               +K +K
Sbjct: 491 ALARVKAKK 499



 Score = 29.1 bits (66), Expect = 3.7
 Identities = 17/79 (21%), Positives = 32/79 (40%), Gaps = 12/79 (15%)

Query: 10  EKKKRIEKRNKEEEKEEN-EQEEKEKEDVEEEKNKKKKKKKKKKKKK-------KKKKKK 61
           EKKK  E + + E ++   E+E+  +    E ++KK  + +  K K        + K KK
Sbjct: 444 EKKKAEEAKARFEARQARLEREKAAR----EARHKKAAEARAAKDKDAVAAALARVKAKK 499

Query: 62  KKEEQSVKEIEESLKERKE 80
               Q +     +  +   
Sbjct: 500 AAATQPIVIKAGARPDNSA 518


>gnl|CDD|224237 COG1318, COG1318, Predicted transcriptional regulators
           [Transcription].
          Length = 182

 Score = 34.1 bits (78), Expect = 0.066
 Identities = 21/81 (25%), Positives = 32/81 (39%), Gaps = 4/81 (4%)

Query: 4   RRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKK----KKKKKKK 59
           + GG    +  IEK  KE  K     E  +K   E     K+  ++K K    K +  ++
Sbjct: 100 KEGGLDAVEVEIEKLEKEGLKIRWAVEVLKKIKGEHFPMDKELLEEKLKGEVIKGEPLEE 159

Query: 60  KKKKEEQSVKEIEESLKERKE 80
              K E  V+   + L E K 
Sbjct: 160 VIDKLEYPVESPADLLHEIKI 180



 Score = 31.0 bits (70), Expect = 0.68
 Identities = 14/79 (17%), Positives = 36/79 (45%), Gaps = 5/79 (6%)

Query: 8   AGEKKKRIEKRNKEEEKEENEQEEKEKEDV-----EEEKNKKKKKKKKKKKKKKKKKKKK 62
             E  +++++   +  + E E+ EKE   +       +K K +     K+  ++K K + 
Sbjct: 92  VRETYEKLKEGGLDAVEVEIEKLEKEGLKIRWAVEVLKKIKGEHFPMDKELLEEKLKGEV 151

Query: 63  KEEQSVKEIEESLKERKEK 81
            + + ++E+ + L+   E 
Sbjct: 152 IKGEPLEEVIDKLEYPVES 170


>gnl|CDD|236048 PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated.
          Length = 859

 Score = 34.8 bits (81), Expect = 0.066
 Identities = 8/42 (19%), Positives = 15/42 (35%)

Query: 22  EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
            EK E  +   +    +    K   K  +K K++    +K  
Sbjct: 815 AEKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEKDG 856



 Score = 33.2 bits (77), Expect = 0.20
 Identities = 10/41 (24%), Positives = 16/41 (39%)

Query: 30  EEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
            EK ++         K    KK   K  +K K++   S K+
Sbjct: 815 AEKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEKD 855



 Score = 32.8 bits (76), Expect = 0.31
 Identities = 12/42 (28%), Positives = 17/42 (40%)

Query: 22  EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
           EE  E   E+ EK     +   K    KK   K  +K K++ 
Sbjct: 808 EEALELLAEKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQY 849



 Score = 32.5 bits (75), Expect = 0.37
 Identities = 9/40 (22%), Positives = 14/40 (35%)

Query: 33  EKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIE 72
           EK +          K    KK   K  +K K++    E +
Sbjct: 816 EKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEKD 855



 Score = 29.4 bits (67), Expect = 3.4
 Identities = 10/42 (23%), Positives = 17/42 (40%)

Query: 28  EQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E+ EK +   +        KK   K  +K K++    E+  K
Sbjct: 816 EKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEKDGK 857


>gnl|CDD|191249 pfam05279, Asp-B-Hydro_N, Aspartyl beta-hydroxylase N-terminal
           region.  This family includes the N-terminal regions of
           the junctin, junctate and aspartyl beta-hydroxylase
           proteins. Junctate is an integral ER/SR membrane calcium
           binding protein, which comes from an alternatively
           spliced form of the same gene that generates aspartyl
           beta-hydroxylase and junctin. Aspartyl beta-hydroxylase
           catalyzes the post-translational hydroxylation of
           aspartic acid or asparagine residues contained within
           epidermal growth factor (EGF) domains of proteins.
          Length = 240

 Score = 34.1 bits (78), Expect = 0.068
 Identities = 19/85 (22%), Positives = 34/85 (40%), Gaps = 10/85 (11%)

Query: 7   GAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKK---------- 56
            AG+  +  E     EE    + ++  KE   E++N+  K+  +K ++ K          
Sbjct: 151 EAGKVHEETEDSYHVEETASEQYKQDMKEKASEQENEDSKEPVEKAERTKAETDDVTEED 210

Query: 57  KKKKKKKEEQSVKEIEESLKERKEK 81
             ++    E S    EE  KE  E+
Sbjct: 211 YDEEDNPVEDSKAIKEELAKEPVEE 235



 Score = 34.1 bits (78), Expect = 0.080
 Identities = 18/77 (23%), Positives = 34/77 (44%)

Query: 9   GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
           G K+K   +     E+ E   EE+ +  V + K K +++ K++ +   +K    K+E+  
Sbjct: 66  GLKEKSTSEPTVPPEEAEPHAEEEGQLAVRKTKQKVEEEVKEQLQSLLEKIVVSKQEEDG 125

Query: 69  KEIEESLKERKEKTLTE 85
              E  L E K     +
Sbjct: 126 PGKEPQLDEDKFLLAED 142



 Score = 33.8 bits (77), Expect = 0.090
 Identities = 17/82 (20%), Positives = 34/82 (41%)

Query: 7   GAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
             G+   R  K+  EEE +E  Q   EK  V +++     K+ +  + K    +   + Q
Sbjct: 88  EEGQLAVRKTKQKVEEEVKEQLQSLLEKIVVSKQEEDGPGKEPQLDEDKFLLAEDSDDRQ 147

Query: 67  SVKEIEESLKERKEKTLTEDSQ 88
              E  +  +E ++    E++ 
Sbjct: 148 ETLEAGKVHEETEDSYHVEETA 169



 Score = 29.5 bits (66), Expect = 2.0
 Identities = 16/82 (19%), Positives = 35/82 (42%)

Query: 7   GAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
           G  EK         EE +   E+E +      ++K +++ K++ +   +K    K++E+ 
Sbjct: 66  GLKEKSTSEPTVPPEEAEPHAEEEGQLAVRKTKQKVEEEVKEQLQSLLEKIVVSKQEEDG 125

Query: 67  SVKEIEESLKERKEKTLTEDSQ 88
             KE +    +      ++D Q
Sbjct: 126 PGKEPQLDEDKFLLAEDSDDRQ 147



 Score = 29.1 bits (65), Expect = 3.4
 Identities = 15/71 (21%), Positives = 31/71 (43%), Gaps = 6/71 (8%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEE------EKNKKKKKKKKKKKKKKKKKKKKK 63
           E+    + +   +EK   ++ E  KE VE+      E +   ++   ++    +  K  K
Sbjct: 166 EETASEQYKQDMKEKASEQENEDSKEPVEKAERTKAETDDVTEEDYDEEDNPVEDSKAIK 225

Query: 64  EEQSVKEIEES 74
           EE + + +EE 
Sbjct: 226 EELAKEPVEEQ 236



 Score = 28.7 bits (64), Expect = 3.7
 Identities = 13/73 (17%), Positives = 33/73 (45%)

Query: 8   AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
             +K    E  +  +E  E  +  +E ED    +    ++ K+  K+K  +++ +  ++ 
Sbjct: 133 DEDKFLLAEDSDDRQETLEAGKVHEETEDSYHVEETASEQYKQDMKEKASEQENEDSKEP 192

Query: 68  VKEIEESLKERKE 80
           V++ E +  E  +
Sbjct: 193 VEKAERTKAETDD 205



 Score = 28.7 bits (64), Expect = 4.0
 Identities = 13/78 (16%), Positives = 40/78 (51%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
             +K +  + +E+   +  Q +++K  + E+ + +++  +  K  ++ +     EE + +
Sbjct: 112 LLEKIVVSKQEEDGPGKEPQLDEDKFLLAEDSDDRQETLEAGKVHEETEDSYHVEETASE 171

Query: 70  EIEESLKERKEKTLTEDS 87
           + ++ +KE+  +   EDS
Sbjct: 172 QYKQDMKEKASEQENEDS 189



 Score = 28.7 bits (64), Expect = 4.6
 Identities = 12/76 (15%), Positives = 35/76 (46%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E K++++   ++    + E++   KE   +E      +    +++  +  K  +E +   
Sbjct: 104 EVKEQLQSLLEKIVVSKQEEDGPGKEPQLDEDKFLLAEDSDDRQETLEAGKVHEETEDSY 163

Query: 70  EIEESLKERKEKTLTE 85
            +EE+  E+ ++ + E
Sbjct: 164 HVEETASEQYKQDMKE 179



 Score = 28.4 bits (63), Expect = 6.0
 Identities = 20/85 (23%), Positives = 37/85 (43%), Gaps = 1/85 (1%)

Query: 6   GGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
            G G++ +  E +    E  ++ QE  E   V EE       ++   ++ K+  K+K  E
Sbjct: 124 DGPGKEPQLDEDKFLLAEDSDDRQETLEAGKVHEETEDSYHVEETASEQYKQDMKEKASE 183

Query: 66  QSVKEIEESLKERKEKTLTEDSQCD 90
           Q  ++ +E   E+ E+T  E     
Sbjct: 184 QENEDSKEP-VEKAERTKAETDDVT 207



 Score = 28.0 bits (62), Expect = 6.6
 Identities = 14/67 (20%), Positives = 31/67 (46%)

Query: 15  IEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES 74
           +  + K   +     EE E    EE +   +K K+K +++ K++ +   E+  V + EE 
Sbjct: 65  LGLKEKSTSEPTVPPEEAEPHAEEEGQLAVRKTKQKVEEEVKEQLQSLLEKIVVSKQEED 124

Query: 75  LKERKEK 81
              ++ +
Sbjct: 125 GPGKEPQ 131


>gnl|CDD|235778 PRK06319, PRK06319, DNA topoisomerase I/SWI domain fusion protein;
           Validated.
          Length = 860

 Score = 34.8 bits (80), Expect = 0.068
 Identities = 17/55 (30%), Positives = 24/55 (43%), Gaps = 4/55 (7%)

Query: 39  EEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDTMI 93
           E+K K KKK    K K  K  KKK +     + +++ K+R     T       MI
Sbjct: 748 EKKTKAKKKSASTKGKAAKTVKKKSK----AKSKKTTKKRAGPLYTPSPALAAMI 798



 Score = 31.7 bits (72), Expect = 0.71
 Identities = 15/36 (41%), Positives = 19/36 (52%)

Query: 28  EQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
           E++ K K+     K K  K  KKK K K KK  KK+
Sbjct: 748 EKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKKR 783



 Score = 29.8 bits (67), Expect = 3.0
 Identities = 14/37 (37%), Positives = 19/37 (51%)

Query: 26  ENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
           E + + K+K    + K  K  KKK K K KK  KK+ 
Sbjct: 748 EKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKKRA 784



 Score = 28.2 bits (63), Expect = 7.7
 Identities = 14/37 (37%), Positives = 17/37 (45%)

Query: 30  EEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
           E+K K   +    K K  K  KKK K K KK  K+  
Sbjct: 748 EKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKKRA 784



 Score = 28.2 bits (63), Expect = 7.8
 Identities = 12/39 (30%), Positives = 21/39 (53%)

Query: 31  EKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           EK+ +  ++  + K K  K  KKK K K KK  ++++  
Sbjct: 748 EKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKKRAGP 786


>gnl|CDD|216403 pfam01271, Granin, Granin (chromogranin or secretogranin). 
          Length = 585

 Score = 35.0 bits (80), Expect = 0.068
 Identities = 14/66 (21%), Positives = 33/66 (50%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
             RR     ++K  ++ N+  E   N Q +   + V EE  K   + K+K+++  +++K 
Sbjct: 141 FPRRYEDNSEEKHSKRTNEIVEVFYNPQSQATLKSVFEEVGKLDGQSKQKRERSDEREKS 200

Query: 62  KKEEQS 67
            ++++ 
Sbjct: 201 YQDDED 206


>gnl|CDD|220611 pfam10169, Laps, Learning-associated protein.  This is a family of
           121-amino acid secretory proteins. Laps functions in the
           regulation of neuronal cell adhesion and/or movement and
           synapse attachment. Laps binds to the ApC/EBP (Aplysia
           CCAAT/enhancer binding protein) promoter and activates
           the transcription of ApC/EBP mRNA.
          Length = 124

 Score = 32.9 bits (75), Expect = 0.071
 Identities = 20/74 (27%), Positives = 38/74 (51%), Gaps = 11/74 (14%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKK-----------KKKKKKKKK 58
             ++  EK+ KE  + E + E+ E ++    +NKK  + +           + KK KKK+
Sbjct: 51  PAEEVKEKKKKEGTESEEDDEKMEVDNKAAVRNKKTLRDQHGQYPAWMNQRQAKKLKKKR 110

Query: 59  KKKKKEEQSVKEIE 72
           +KKKK+++  K + 
Sbjct: 111 EKKKKKKKGAKNLA 124


>gnl|CDD|222592 pfam14204, Ribosomal_L18_c, Ribosomal L18 C-terminal region.
          This domain is the C-terminal end of ribosomal L18/L5
          proteins.
          Length = 93

 Score = 32.1 bits (74), Expect = 0.071
 Identities = 12/39 (30%), Positives = 21/39 (53%), Gaps = 10/39 (25%)

Query: 43 KKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
          +KKK+KK+ K + K+   KK           +L++RK +
Sbjct: 61 RKKKEKKEVKAESKRYNAKK----------LTLEQRKAR 89


>gnl|CDD|227594 COG5269, ZUO1, Ribosome-associated chaperone zuotin [Translation,
           ribosomal structure and biogenesis / Posttranslational
           modification, protein turnover, chaperones].
          Length = 379

 Score = 34.6 bits (79), Expect = 0.072
 Identities = 16/72 (22%), Positives = 30/72 (41%), Gaps = 5/72 (6%)

Query: 17  KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKK-----KKKKKKKKEEQSVKEI 71
           K  KE+EKE  +  + E+E     K     K K + K K     +        ++  KE+
Sbjct: 233 KSFKEQEKEMKKIRKWEREAGARLKALAALKGKAEAKNKAEIEAEALASATAVKKKAKEV 292

Query: 72  EESLKERKEKTL 83
            +   + ++K +
Sbjct: 293 MKKALKMEKKAI 304



 Score = 33.5 bits (76), Expect = 0.17
 Identities = 17/61 (27%), Positives = 28/61 (45%), Gaps = 2/61 (3%)

Query: 1   MRKRRGGAGEKKKRIEK--RNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKK 58
           M+K R    E   R++     K + + +N+ E + +        KKK K+  KK  K +K
Sbjct: 242 MKKIRKWEREAGARLKALAALKGKAEAKNKAEIEAEALASATAVKKKAKEVMKKALKMEK 301

Query: 59  K 59
           K
Sbjct: 302 K 302



 Score = 33.5 bits (76), Expect = 0.18
 Identities = 17/74 (22%), Positives = 30/74 (40%), Gaps = 6/74 (8%)

Query: 10  EKKKRIEKRNKEE------EKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
           E++K ++K  K E       K     + K +   + E   +        KKK K+  KK 
Sbjct: 237 EQEKEMKKIRKWEREAGARLKALAALKGKAEAKNKAEIEAEALASATAVKKKAKEVMKKA 296

Query: 64  EEQSVKEIEESLKE 77
            +   K I+ + K+
Sbjct: 297 LKMEKKAIKNAAKD 310



 Score = 30.0 bits (67), Expect = 2.1
 Identities = 17/62 (27%), Positives = 34/62 (54%), Gaps = 3/62 (4%)

Query: 22  EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKK---KKKKEEQSVKEIEESLKER 78
           +E   ++ EE++++   E KN++K+ K K +   + K+     KK +  +K  +E  KE 
Sbjct: 183 DEDYPDDMEERDRKRYSEAKNREKRAKLKNQDNARLKRLVQIAKKRDPRIKSFKEQEKEM 242

Query: 79  KE 80
           K+
Sbjct: 243 KK 244



 Score = 29.2 bits (65), Expect = 3.4
 Identities = 16/66 (24%), Positives = 30/66 (45%), Gaps = 5/66 (7%)

Query: 28  EQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ-----SVKEIEESLKERKEKT 82
           EQE++ K+  + E+    + K     K K + K K E +     S   +++  KE  +K 
Sbjct: 237 EQEKEMKKIRKWEREAGARLKALAALKGKAEAKNKAEIEAEALASATAVKKKAKEVMKKA 296

Query: 83  LTEDSQ 88
           L  + +
Sbjct: 297 LKMEKK 302



 Score = 29.2 bits (65), Expect = 4.0
 Identities = 15/50 (30%), Positives = 18/50 (36%)

Query: 8   AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK 57
           A  K K   K   E E E        K+  +E   K  K +KK  K   K
Sbjct: 260 AALKGKAEAKNKAEIEAEALASATAVKKKAKEVMKKALKMEKKAIKNAAK 309


>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992).  This
           bacterial family of proteins has no known function.
           However, the cis-regulatory yjdF motif, just upstream
           from the gene encoding the proteins for this family, is
           a small non-coding RNA, Rfam:RF01764. The yjdF motif is
           found in many Firmicutes, including Bacillus subtilis.
           In most cases, it resides in potential 5' UTRs of
           homologues of the yjdF gene whose function is unknown.
           However, in Streptococcus thermophilus, a yjdF RNA motif
           is associated with an operon whose protein products
           synthesise nicotinamide adenine dinucleotide (NAD+).
           Also, the S. thermophilus yjdF RNA lacks typical yjdF
           motif consensus features downstream of and including the
           P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
           S. thermophilus RNAs might sense a distinct compound
           that structurally resembles the ligand bound by other
           yjdF RNAs. On the ohter hand, perhaps these RNAs have an
           alternative solution forming a similar binding site, as
           is observed with some SAM riboswitches.
          Length = 132

 Score = 33.0 bits (76), Expect = 0.076
 Identities = 17/52 (32%), Positives = 36/52 (69%)

Query: 11  KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
           KK  I  + ++  K E+E+ ++EK+   +EK +++K++K++ K++KKK K +
Sbjct: 79  KKPGISTKAQQALKLEHERNKQEKKKRSKEKKEEEKERKRQLKQQKKKAKHR 130



 Score = 29.9 bits (68), Expect = 0.96
 Identities = 13/29 (44%), Positives = 26/29 (89%)

Query: 38  EEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
           E E+NK++KKK+ K+KK+++K++K++ +Q
Sbjct: 94  EHERNKQEKKKRSKEKKEEEKERKRQLKQ 122



 Score = 29.9 bits (68), Expect = 1.0
 Identities = 12/32 (37%), Positives = 25/32 (78%)

Query: 34  KEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
           K + E  K +KKK+ K+KK+++K++K++ K++
Sbjct: 92  KLEHERNKQEKKKRSKEKKEEEKERKRQLKQQ 123



 Score = 28.0 bits (63), Expect = 3.8
 Identities = 11/32 (34%), Positives = 24/32 (75%)

Query: 39  EEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
           E   ++KKK+ K+KK+++K++K++ ++Q  K 
Sbjct: 96  ERNKQEKKKRSKEKKEEEKERKRQLKQQKKKA 127



 Score = 28.0 bits (63), Expect = 4.2
 Identities = 11/39 (28%), Positives = 29/39 (74%)

Query: 43  KKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
           K + ++ K++KKK+ K+KK++E++  +++++  K+ K +
Sbjct: 92  KLEHERNKQEKKKRSKEKKEEEKERKRQLKQQKKKAKHR 130



 Score = 26.8 bits (60), Expect = 9.7
 Identities = 14/47 (29%), Positives = 27/47 (57%), Gaps = 12/47 (25%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKK 56
           E+ K+ +K+  +E+KE            EE++ K++ K++KKK K +
Sbjct: 96  ERNKQEKKKRSKEKKE------------EEKERKRQLKQQKKKAKHR 130


>gnl|CDD|227818 COG5531, COG5531, SWIB-domain-containing proteins implicated in
           chromatin remodeling [Chromatin structure and dynamics].
          Length = 237

 Score = 34.0 bits (78), Expect = 0.077
 Identities = 14/68 (20%), Positives = 24/68 (35%), Gaps = 4/68 (5%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
           R  R       ++   +N   ++E++          E E    + K+  KKKK     K 
Sbjct: 63  RVLRKEKYNITRKTTGKNDLPKEEDS----SLPSSKETENGDTEGKETDKKKKSSTISKN 118

Query: 62  KKEEQSVK 69
               + VK
Sbjct: 119 SPSGEKVK 126


>gnl|CDD|218482 pfam05178, Kri1, KRI1-like family.  The yeast member of this
          family (Kri1p) is found to be required for 40S ribosome
          biogenesis in the nucleolus.
          Length = 99

 Score = 32.2 bits (74), Expect = 0.077
 Identities = 13/34 (38%), Positives = 23/34 (67%)

Query: 28 EQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
          E++E+EK   EEE  + K  K+++ ++K +K KK
Sbjct: 2  ERKEEEKAQREEELKRLKNLKREEIEEKLEKIKK 35



 Score = 31.1 bits (71), Expect = 0.21
 Identities = 12/36 (33%), Positives = 23/36 (63%)

Query: 29 QEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
          +E KE+E  + E+  K+ K  K+++ ++K +K KK 
Sbjct: 1  KERKEEEKAQREEELKRLKNLKREEIEEKLEKIKKV 36



 Score = 28.0 bits (63), Expect = 2.5
 Identities = 7/35 (20%), Positives = 27/35 (77%)

Query: 39 EEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
          +E+ +++K +++++ K+ K  K+++ E+ +++I++
Sbjct: 1  KERKEEEKAQREEELKRLKNLKREEIEEKLEKIKK 35



 Score = 26.8 bits (60), Expect = 6.6
 Identities = 13/52 (25%), Positives = 34/52 (65%), Gaps = 1/52 (1%)

Query: 22 EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
          +E++E E+ ++E+E ++  KN K+++ ++K +K KK    +  + S +++ +
Sbjct: 1  KERKEEEKAQREEE-LKRLKNLKREEIEEKLEKIKKVAGLRGADLSEEDLAD 51


>gnl|CDD|220818 pfam10595, UPF0564, Uncharacterized protein family UPF0564.  This
           family of proteins has no known function. However, one
           of the members is annotated as an EF-hand family
           protein.
          Length = 349

 Score = 34.4 bits (79), Expect = 0.079
 Identities = 15/72 (20%), Positives = 41/72 (56%)

Query: 11  KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
            +  ++    E +++E ++++++KE  + +K KK+   K K++ +     +K +E+  ++
Sbjct: 271 WESLVKFLRTERKEKEAKEQQEKKELEQRKKKKKEMAPKVKQRFEANDPAQKLQEERKEQ 330

Query: 71  IEESLKERKEKT 82
           + +  KE KE+ 
Sbjct: 331 LAKLRKEEKERE 342



 Score = 31.7 bits (72), Expect = 0.59
 Identities = 21/87 (24%), Positives = 51/87 (58%), Gaps = 10/87 (11%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK---- 57
           R++   + +K + + K  + E KE   +E KE+++ +E + +KKKKK+   K K++    
Sbjct: 261 RRKLPPSTKKWESLVKFLRTERKE---KEAKEQQEKKELEQRKKKKKEMAPKVKQRFEAN 317

Query: 58  ---KKKKKKEEQSVKEIEESLKERKEK 81
              +K +++ ++ + ++ +  KER+++
Sbjct: 318 DPAQKLQEERKEQLAKLRKEEKEREKE 344



 Score = 30.1 bits (68), Expect = 1.5
 Identities = 14/38 (36%), Positives = 25/38 (65%)

Query: 23 EKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
          EK++ E++   +  +EEE NK +KK+++ + KKK K  
Sbjct: 15 EKKKREKKSIRQSKLEEELNKLEKKEEEAECKKKFKAN 52



 Score = 29.4 bits (66), Expect = 3.1
 Identities = 20/88 (22%), Positives = 33/88 (37%), Gaps = 2/88 (2%)

Query: 3   KRRGGAGEKKKRIEKRNKEEEK--EENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
           +RR    EK K I   +++     E  EQ++            +++ KK K K   K   
Sbjct: 72  ERREEVREKSKAILLSSQKPFSFYEREEQKKAILPRKLRSSTSEREPKKFKAKPVPKSIY 131

Query: 61  KKKEEQSVKEIEESLKERKEKTLTEDSQ 88
               +  ++E E   K R +    E  Q
Sbjct: 132 IPLLKDKMQEEELKRKIRVQMRAQELLQ 159



 Score = 29.0 bits (65), Expect = 4.2
 Identities = 14/113 (12%), Positives = 38/113 (33%), Gaps = 13/113 (11%)

Query: 22  EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
              ++      E+      +      KK +   K  + ++K++E   ++ ++ L++RK+K
Sbjct: 243 TYLDQENISAGEENLKPTRRKLPPSTKKWESLVKFLRTERKEKEAKEQQEKKELEQRKKK 302

Query: 82  TLTEDSQCDTMIPVFKKGVLYKGIYCPSLTNSFREPSLEMSYQRY-SHRQRQK 133
                              + +           +E   E   +     ++R+K
Sbjct: 303 ------------KKEMAPKVKQRFEANDPAQKLQEERKEQLAKLRKEEKEREK 343



 Score = 28.2 bits (63), Expect = 7.5
 Identities = 15/71 (21%), Positives = 42/71 (59%), Gaps = 10/71 (14%)

Query: 11  KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKN----------KKKKKKKKKKKKKKKKKK 60
           + +R EK  KE+++++  ++ K+K+     K           +K ++++K++  K +K++
Sbjct: 279 RTERKEKEAKEQQEKKELEQRKKKKKEMAPKVKQRFEANDPAQKLQEERKEQLAKLRKEE 338

Query: 61  KKKEEQSVKEI 71
           K++E++  +E+
Sbjct: 339 KEREKEYEQEL 349



 Score = 27.8 bits (62), Expect = 9.0
 Identities = 20/73 (27%), Positives = 39/73 (53%), Gaps = 3/73 (4%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK-- 69
           K    K     +K E+  +    E  E+E  ++++KK+ +++KKKKK+   K +Q  +  
Sbjct: 258 KPTRRKLPPSTKKWESLVKFLRTERKEKEAKEQQEKKELEQRKKKKKEMAPKVKQRFEAN 317

Query: 70  -EIEESLKERKEK 81
              ++  +ERKE+
Sbjct: 318 DPAQKLQEERKEQ 330


>gnl|CDD|220267 pfam09494, Slx4, Slx4 endonuclease.  The Slx4 protein is a
           heteromeric structure-specific endonuclease found in
           fungi. Slx4 with Slx1 acts as a nuclease on branched DNA
           substrates, particularly simple-Y, 5'-flap, or
           replication fork structures by cleaving the strand
           bearing the 5' non-homologous arm at the branch junction
           and thus generating ligatable nicked products from
           5'-flap or replication fork substrates.
          Length = 627

 Score = 34.6 bits (79), Expect = 0.080
 Identities = 17/82 (20%), Positives = 35/82 (42%), Gaps = 8/82 (9%)

Query: 13  KRIEKRNKEEEKEENEQEEKEKEDVEEEKNK--------KKKKKKKKKKKKKKKKKKKKE 64
            R++   +EE +    ++   +    E  ++        K+  KKKK KK K +K+ K++
Sbjct: 30  GRLDDFEEEENQRLQLRQVISRFTSFESDDQANSGNVSGKRVPKKKKIKKPKLRKRTKRK 89

Query: 65  EQSVKEIEESLKERKEKTLTED 86
            + +K +    +E  E      
Sbjct: 90  NKKIKSLTAFNEENFETDRAPS 111


>gnl|CDD|241486 cd13332, FERM_C_JAK1, Janus kinase 1 FERM domain C-lobe.  JAK1 is a
           tyrosine kinase protein essential in signaling type I
           and type II cytokines. It interacts with the gamma chain
           of type I cytokine receptors to elicit signals from the
           IL-2 receptor family, the IL-4 receptor family, the
           gp130 receptor family, ciliary neurotrophic factor
           receptor (CNTF-R), neurotrophin-1 receptor (NNT-1R) and
           Leptin-R). It also is involved in transducing a signal
           by type I (IFN-alpha/beta) and type II (IFN-gamma)
           interferons, and members of the IL-10 family via type II
           cytokine receptors. JAK (also called Just Another
           Kinase) is a family of intracellular, non-receptor
           tyrosine kinases that transduce cytokine-mediated
           signals via the JAK-STAT pathway. The JAK family in
           mammals consists of 4 members: JAK1, JAK2, JAK3 and
           TYK2. JAKs are composed of seven JAK homology (JH)
           domains (JH1-JH7) . The C-terminal JH1 domain is the
           main catalytic domain, followed by JH2, which is often
           referred to as a pseudokinase domain, followed by
           JH3-JH4 which is homologous to the SH2 domain, and
           lastly JH5-JH7 which is a FERM domain.  Named after
           Janus, the two-faced Roman god of doorways, JAKs possess
           two near-identical phosphate-transferring domains; one
           which displays the kinase activity (JH1), while the
           other negatively regulates the kinase activity of the
           first (JH2). The FERM domain has a cloverleaf tripart
           structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
           C-lobe/A-lobe,A-lobe, B-lobe, C-lobe/F1, F2, F3). The
           C-lobe/F3 within the FERM domain is part of the PH
           domain family. The FERM domain is found in the
           cytoskeletal-associated proteins such as ezrin, moesin,
           radixin, 4.1R, and merlin. These proteins provide a link
           between the membrane and cytoskeleton and are involved
           in signal transduction pathways. The FERM domain is also
           found in protein tyrosine phosphatases (PTPs) , the
           tyrosine kinases FAK and JAK, in addition to other
           proteins involved in signaling. This domain is
           structurally similar to the PH and PTB domains and
           consequently is capable of binding to both peptides and
           phospholipids at different sites.
          Length = 198

 Score = 33.7 bits (77), Expect = 0.080
 Identities = 14/27 (51%), Positives = 17/27 (62%)

Query: 39  EEKNKKKKKKKKKKKKKKKKKKKKKEE 65
           E+K K K KK K K KK + KKK +E 
Sbjct: 90  EKKKKGKSKKNKLKGKKDEDKKKAREG 116



 Score = 33.3 bits (76), Expect = 0.13
 Identities = 15/29 (51%), Positives = 16/29 (55%)

Query: 37  VEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
              EK KK K KK K K KK + KKK  E
Sbjct: 87  TAVEKKKKGKSKKNKLKGKKDEDKKKARE 115



 Score = 32.9 bits (75), Expect = 0.15
 Identities = 13/28 (46%), Positives = 18/28 (64%)

Query: 37  VEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
           VE++K  K KK K K KK + KKK ++ 
Sbjct: 89  VEKKKKGKSKKNKLKGKKDEDKKKAREG 116



 Score = 30.2 bits (68), Expect = 1.1
 Identities = 13/25 (52%), Positives = 16/25 (64%)

Query: 43  KKKKKKKKKKKKKKKKKKKKKEEQS 67
           KKKK K KK K K KK + KK+ + 
Sbjct: 91  KKKKGKSKKNKLKGKKDEDKKKARE 115



 Score = 27.9 bits (62), Expect = 7.6
 Identities = 12/25 (48%), Positives = 17/25 (68%)

Query: 46  KKKKKKKKKKKKKKKKKKEEQSVKE 70
           KKKK K KK K K KK ++++  +E
Sbjct: 91  KKKKGKSKKNKLKGKKDEDKKKARE 115


>gnl|CDD|227780 COG5493, COG5493, Uncharacterized conserved protein containing a
           coiled-coil domain [Function unknown].
          Length = 231

 Score = 34.1 bits (78), Expect = 0.081
 Identities = 21/100 (21%), Positives = 46/100 (46%), Gaps = 16/100 (16%)

Query: 10  EKKKRIEKRNKEE----------------EKEENEQEEKEKEDVEEEKNKKKKKKKKKKK 53
           E+ KR+   N E                  K    Q+   K+DVEE + + ++++K+   
Sbjct: 5   EEIKRVLLENPEILVDVLTQRPEILYEVLAKLTPWQQLATKQDVEELRKETEQRQKELAD 64

Query: 54  KKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDTMI 93
           +K + +K+K  ++ +K ++   +E    T  +  + +T+I
Sbjct: 65  EKLEVRKQKATKEDLKLLQRFQEEEFRATKEDIKRLETII 104


>gnl|CDD|235505 PRK05563, PRK05563, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 559

 Score = 34.5 bits (80), Expect = 0.081
 Identities = 20/62 (32%), Positives = 34/62 (54%), Gaps = 1/62 (1%)

Query: 24  KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTL 83
            +  EQ E+E + ++ +       ++K+KKK+KKK KKKK +    +I + LKE   + L
Sbjct: 375 LQRVEQLEQELKQLKAQP-VGVAPEQKEKKKEKKKNKKKKYKVPRGKIYKVLKEATRQDL 433

Query: 84  TE 85
             
Sbjct: 434 EL 435



 Score = 33.7 bits (78), Expect = 0.14
 Identities = 20/67 (29%), Positives = 37/67 (55%), Gaps = 2/67 (2%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
            +R+E+   E+E ++ + +       ++EK K+KKK KKKK K  + K  K  +++ ++ 
Sbjct: 375 LQRVEQL--EQELKQLKAQPVGVAPEQKEKKKEKKKNKKKKYKVPRGKIYKVLKEATRQD 432

Query: 72  EESLKER 78
            E LK  
Sbjct: 433 LELLKNV 439


>gnl|CDD|225381 COG2825, HlpA, Outer membrane protein [Cell envelope biogenesis,
          outer membrane].
          Length = 170

 Score = 33.5 bits (77), Expect = 0.082
 Identities = 15/57 (26%), Positives = 33/57 (57%)

Query: 25 EENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
          +E+ Q +K   D+E E  K++K+ +K +K+ K K+ K +++  ++ + +  K   E 
Sbjct: 37 QESPQAKKVSADLESEFKKRQKELQKMQKELKAKEAKLQDDGKMEALSDRAKAEAEI 93



 Score = 31.2 bits (71), Expect = 0.52
 Identities = 17/83 (20%), Positives = 40/83 (48%)

Query: 3   KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
           + +  + + +   +KR KE +K + E + KE +  ++ K +    + K + + KK+K   
Sbjct: 41  QAKKVSADLESEFKKRQKELQKMQKELKAKEAKLQDDGKMEALSDRAKAEAEIKKEKLVN 100

Query: 63  KEEQSVKEIEESLKERKEKTLTE 85
              +  +E E+ L  R+ +   +
Sbjct: 101 AFNKKQQEYEKDLNRREAEEEQK 123



 Score = 27.4 bits (61), Expect = 8.3
 Identities = 15/76 (19%), Positives = 37/76 (48%), Gaps = 8/76 (10%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEE--------EKNKKKKKKKKKKKKKKKKKKK 61
           E KKR ++  K +++ + ++ + + +   E        E   KK+K      KK+++ +K
Sbjct: 52  EFKKRQKELQKMQKELKAKEAKLQDDGKMEALSDRAKAEAEIKKEKLVNAFNKKQQEYEK 111

Query: 62  KKEEQSVKEIEESLKE 77
               +  +E ++ L++
Sbjct: 112 DLNRREAEEEQKLLEK 127


>gnl|CDD|233352 TIGR01310, L7, 60S ribosomal protein L7, eukaryotic.  This model
          describes the eukaryotic 60S (cytosolic) ribosomal
          protein L7 and paralogs that may or may not also be L7.
          Human, Drosophila, and Arabidopsis all have both a
          typical L7 and an L7-related paralog. This family is
          designated subfamily rather than equivalog to reflect
          these uncharacterized paralogs. Members of this family
          average ~ 250 residues in length, somewhat longer than
          the archaeal L30P/L7E homolog (~ 155 residues) and much
          longer than the related bacterial/organellar form (~ 60
          residues).
          Length = 235

 Score = 33.9 bits (78), Expect = 0.082
 Identities = 16/55 (29%), Positives = 27/55 (49%)

Query: 26 ENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
          E   ++++       +  K+ K KKK  KKK+K   K+ E  VKE  ++ +E   
Sbjct: 1  ELLLKKRKARQELAVQVAKQAKAKKKANKKKRKIYFKRAESFVKEYRKAEREIIR 55


>gnl|CDD|220719 pfam10368, YkyA, Putative cell-wall binding lipoprotein.  YkyA is a
           family of proteins containing a lipoprotein signal and a
           hydrolase domain. It is similar to cell wall binding
           proteins and might also be recognisable by a host immune
           defence system. It is thus likely to belong to pathways
           important for pathogenicity.
          Length = 205

 Score = 33.9 bits (78), Expect = 0.085
 Identities = 25/79 (31%), Positives = 43/79 (54%), Gaps = 8/79 (10%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEE--EKNKKKKKKKKKKKKK------KKKKKK 61
           EK    EK  +EE+K+  E E+KE+E  ++  +  K   +  KK   +      K++K  
Sbjct: 26  EKAAEQEKPFEEEQKKLEELEKKEQELYDQIIQLGKDDNEAIKKLSDQALANVDKREKLL 85

Query: 62  KKEEQSVKEIEESLKERKE 80
           KKE++S+++ EE  K  K+
Sbjct: 86  KKEKESIEKSEEEFKSAKK 104



 Score = 30.0 bits (68), Expect = 1.3
 Identities = 17/74 (22%), Positives = 38/74 (51%), Gaps = 3/74 (4%)

Query: 7   GAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
           G  + +   +  ++     +  ++  +KE    EK++++ K  KK  +K + KK KK+  
Sbjct: 60  GKDDNEAIKKLSDQALANVDKREKLLKKEKESIEKSEEEFKSAKKYIEKIEDKKLKKK-- 117

Query: 67  SVKEIEESLKERKE 80
             K++ + +KER +
Sbjct: 118 -AKQLVKVMKERYK 130



 Score = 28.9 bits (65), Expect = 3.4
 Identities = 22/88 (25%), Positives = 43/88 (48%), Gaps = 5/88 (5%)

Query: 6  GGAGEKKKRIEKRNKEEEK----EENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
           G    KK  E   ++ EK    E+  +EE++K +  E+K ++   +  +  K   +  K
Sbjct: 9  TGCTTGKKPEESIYEQLEKAAEQEKPFEEEQKKLEELEKKEQELYDQIIQLGKDDNEAIK 68

Query: 62 KKEEQSVKEIEESLKE-RKEKTLTEDSQ 88
          K  +Q++  +++  K  +KEK   E S+
Sbjct: 69 KLSDQALANVDKREKLLKKEKESIEKSE 96



 Score = 27.3 bits (61), Expect = 9.9
 Identities = 15/77 (19%), Positives = 44/77 (57%), Gaps = 1/77 (1%)

Query: 10  EKKKRIEKRNKEEEKEENEQ-EEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
           +K++ +  +  +  K++NE  ++   + +     ++K  KK+K+  +K +++ K  ++ +
Sbjct: 47  KKEQELYDQIIQLGKDDNEAIKKLSDQALANVDKREKLLKKEKESIEKSEEEFKSAKKYI 106

Query: 69  KEIEESLKERKEKTLTE 85
           ++IE+   ++K K L +
Sbjct: 107 EKIEDKKLKKKAKQLVK 123


>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin.  Nucleoplasmins are also
           known as chromatin decondensation proteins. They bind to
           core histones and transfer DNA to them in a reaction
           that requires ATP. This is thought to play a role in the
           assembly of regular nucleosomal arrays.
          Length = 146

 Score = 33.1 bits (76), Expect = 0.087
 Identities = 11/30 (36%), Positives = 19/30 (63%)

Query: 21  EEEKEENEQEEKEKEDVEEEKNKKKKKKKK 50
           +EE EE E +E++ ++ E E+ +   KK K
Sbjct: 117 DEEDEEEEDDEEDDDEDESEEEESPVKKVK 146



 Score = 31.5 bits (72), Expect = 0.29
 Identities = 10/32 (31%), Positives = 23/32 (71%)

Query: 21  EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKK 52
           ++++E+ E+E+ E++D E+E  +++   KK K
Sbjct: 115 DDDEEDEEEEDDEEDDDEDESEEEESPVKKVK 146



 Score = 30.0 bits (68), Expect = 0.88
 Identities = 9/32 (28%), Positives = 22/32 (68%)

Query: 20  KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKK 51
            ++E++E E++++E +D +E + ++   KK K
Sbjct: 115 DDDEEDEEEEDDEEDDDEDESEEEESPVKKVK 146



 Score = 29.6 bits (67), Expect = 1.3
 Identities = 10/37 (27%), Positives = 27/37 (72%)

Query: 20  KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKK 56
           +E+E +++E++E+E++D E++   + ++++   KK K
Sbjct: 110 EEDESDDDEEDEEEEDDEEDDDEDESEEEESPVKKVK 146



 Score = 28.1 bits (63), Expect = 4.2
 Identities = 11/37 (29%), Positives = 28/37 (75%)

Query: 21  EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK 57
           EE++ ++++E++E+ED EE+ ++ + ++++   KK K
Sbjct: 110 EEDESDDDEEDEEEEDDEEDDDEDESEEEESPVKKVK 146


>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO.  Bacillus
           spores are protected by a protein shell consisting of
           over 50 different polypeptides, known as the coat. This
           family of proteins has an important morphogenetic role
           in coat assembly, it is involved in the assembly of at
           least 5 different coat proteins including CotB, CotG,
           CotS, CotSA and CotW. It is likely to act at a late
           stage of coat assembly.
          Length = 185

 Score = 33.7 bits (77), Expect = 0.087
 Identities = 17/81 (20%), Positives = 42/81 (51%), Gaps = 1/81 (1%)

Query: 17  KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLK 76
            +  +E++EE E  ++  +  EEE+  + ++ +K+K+  + +++   E+Q  +EI +  +
Sbjct: 33  IKKADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQE-E 91

Query: 77  ERKEKTLTEDSQCDTMIPVFK 97
           E++E+      Q        K
Sbjct: 92  EKEEEAEDVKQQEVFSFKRKK 112



 Score = 29.8 bits (67), Expect = 1.7
 Identities = 16/56 (28%), Positives = 32/56 (57%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK 57
           +K      EK+K   +  +E+  E+ E+EE  +E+ +EE+ +  K+++    K+KK
Sbjct: 57  QKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEEEKEEEAEDVKQQEVFSFKRKK 112



 Score = 28.3 bits (63), Expect = 4.8
 Identities = 14/77 (18%), Positives = 43/77 (55%), Gaps = 1/77 (1%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKK-KKKKKKKKKKKK 60
            + +  + E  K  E+  K E +E  +++E  + + E+   +++K++  ++++K+++ + 
Sbjct: 40  EEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEEEKEEEAED 99

Query: 61  KKKEEQSVKEIEESLKE 77
            K++E    + ++  KE
Sbjct: 100 VKQQEVFSFKRKKPFKE 116



 Score = 27.9 bits (62), Expect = 6.0
 Identities = 18/68 (26%), Positives = 41/68 (60%), Gaps = 2/68 (2%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E+K   E+  KE+E  E E+E+  ++  +EE  ++++K+++ +  K+++    K ++  K
Sbjct: 56  EQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEEEKEEEAEDVKQQEVFSFKRKKPFK 115

Query: 70  E--IEESL 75
           E  +EE +
Sbjct: 116 EMNLEEKI 123


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
           Rpc31.  RNA polymerase III contains seventeen subunits
           in yeasts and in human cells. Twelve of these are akin
           to RNA polymerase I or II and the other five are RNA pol
           III-specific, and form the functionally distinct groups
           (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
           Rpc34 and Rpc82 form a cluster of enzyme-specific
           subunits that contribute to transcription initiation in
           S.cerevisiae and H.sapiens. There is evidence that these
           subunits are anchored at or near the N-terminal Zn-fold
           of Rpc1, itself prolonged by a highly conserved but RNA
           polymerase III-specific domain.
          Length = 221

 Score = 34.0 bits (78), Expect = 0.091
 Identities = 12/31 (38%), Positives = 24/31 (77%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEE 40
           +K K +E  + +EE E++E+EE+E+E+ +E+
Sbjct: 158 KKLKELEAEDVDEEDEKDEEEEEEEEEEDED 188



 Score = 30.1 bits (68), Expect = 1.2
 Identities = 13/32 (40%), Positives = 23/32 (71%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEK 41
           EK   +EK+ KE E E+ ++E+++ E+ EEE+
Sbjct: 151 EKLSMLEKKLKELEAEDVDEEDEKDEEEEEEE 182



 Score = 30.1 bits (68), Expect = 1.3
 Identities = 10/40 (25%), Positives = 24/40 (60%)

Query: 3   KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKN 42
           K +    E     +++++EEE+EE E++E   +D +++ +
Sbjct: 159 KLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDDDD 198



 Score = 28.2 bits (63), Expect = 5.6
 Identities = 14/46 (30%), Positives = 23/46 (50%)

Query: 41  KNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTED 86
            NKK  KK    K K+K     +EE+ + E    L+++ ++   ED
Sbjct: 122 INKKAGKKLALSKFKRKVGLFTEEEEDIDEKLSMLEKKLKELEAED 167


>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
          Length = 880

 Score = 34.6 bits (80), Expect = 0.094
 Identities = 17/67 (25%), Positives = 41/67 (61%), Gaps = 1/67 (1%)

Query: 16  EKRNKEEE-KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES 74
           E R + EE + E E  E+E E+VEE   + +   + + + ++ +++++  E+ + E  E+
Sbjct: 472 EDRERVEELEAELEDLEEEVEEVEERLERAEDLVEAEDRIERLEERREDLEELIAERRET 531

Query: 75  LKERKEK 81
           ++E++E+
Sbjct: 532 IEEKRER 538



 Score = 32.3 bits (74), Expect = 0.48
 Identities = 19/67 (28%), Positives = 42/67 (62%), Gaps = 1/67 (1%)

Query: 21  EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
           EE++E  E+ E E ED+EEE  + +++ ++ +   + + + ++ E+  +++EE + ER+E
Sbjct: 471 EEDRERVEELEAELEDLEEEVEEVEERLERAEDLVEAEDRIERLEERREDLEELIAERRE 530

Query: 81  KTLTEDS 87
            T+ E  
Sbjct: 531 -TIEEKR 536



 Score = 31.5 bits (72), Expect = 0.76
 Identities = 18/77 (23%), Positives = 37/77 (48%), Gaps = 1/77 (1%)

Query: 4   RRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
           RR    EK++R E+  +   + E E EEK  E   E + + ++ +++  +   K  + K+
Sbjct: 528 RRETIEEKRERAEELRERAAELEAEAEEKR-EAAAEAEEEAEEAREEVAELNSKLAELKE 586

Query: 64  EEQSVKEIEESLKERKE 80
             +S++ I   L    +
Sbjct: 587 RIESLERIRTLLAAIAD 603


>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein.  This domain is found in
           cell division proteins which are required for
           kinetochore-spindle association.
          Length = 321

 Score = 33.8 bits (78), Expect = 0.098
 Identities = 13/76 (17%), Positives = 38/76 (50%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           +K + +++      +  +E    +  ++E+ + + +    K  +K+K+ ++ ++E Q + 
Sbjct: 182 KKLQALKEEIASLRQLADELNLCDPLELEKARQELRSLSVKISEKRKQLEELQQELQELT 241

Query: 70  EIEESLKERKEKTLTE 85
              E+L  +K + L E
Sbjct: 242 IAIEALTNKKSELLEE 257


>gnl|CDD|221122 pfam11490, DNA_pol3_alph_N, DNA polymerase III polC-type
           N-terminus.  This is an N-terminal domain of DNA
           polymerase III polC subunit A that is found only in
           Firmicutes. DNA polymerase polC-type III enzyme
           functions as the 'replicase' in low G + C Gram-positive
           bacteria. Purine asymmetry is a characteristic of
           organisms with a heterodimeric DNA polymerase III
           alpha-subunit constituted by polC which probably plays a
           direct role in the maintenance of strand-biased gene
           distribution; since, among prokaryotic genomes, the
           distribution of genes on the leading and lagging strands
           of the replication fork is known to be biased. The
           domain is associated with DNA_pol3_alpha pfam07733.
          Length = 180

 Score = 33.4 bits (77), Expect = 0.098
 Identities = 15/48 (31%), Positives = 29/48 (60%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKK 59
              +++    EE+ E  +E+KE+E+ +  +   +  KKK+ +KKKK+K
Sbjct: 132 DVEVDEDESSEEEIEEFEEQKEEEEAKLAEEALEALKKKEAEKKKKEK 179



 Score = 31.9 bits (73), Expect = 0.27
 Identities = 19/45 (42%), Positives = 28/45 (62%)

Query: 20  KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
            E+E  E E EE E++  EEE    ++  +  KKK+ +KKKK+KE
Sbjct: 136 DEDESSEEEIEEFEEQKEEEEAKLAEEALEALKKKEAEKKKKEKE 180


>gnl|CDD|240370 PTZ00342, PTZ00342, acyl-CoA synthetase; Provisional.
          Length = 746

 Score = 34.3 bits (79), Expect = 0.098
 Identities = 13/63 (20%), Positives = 27/63 (42%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           EK       N   +  + EQ+  +  +  E+ +    +  K+K +K K  K+K ++  + 
Sbjct: 222 EKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEYDKEKLEKIKDLKEKAKKLGIS 281

Query: 70  EIE 72
            I 
Sbjct: 282 IIL 284



 Score = 31.6 bits (72), Expect = 0.68
 Identities = 18/83 (21%), Positives = 30/83 (36%), Gaps = 5/83 (6%)

Query: 11  KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
           K K I    +E+    N      K + EE+K      + +       +  K+K E+    
Sbjct: 212 KSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEYDKEKLEKI--- 268

Query: 71  IEESLKERKEKTLTEDSQCDTMI 93
             + LKE+ +K        D M 
Sbjct: 269 --KDLKEKAKKLGISIILFDDMT 289



 Score = 28.1 bits (63), Expect = 9.5
 Identities = 16/75 (21%), Positives = 25/75 (33%), Gaps = 12/75 (16%)

Query: 3   KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKK--------- 53
                 G K  + E++  +   E  +      E  +E+  K K  K+K KK         
Sbjct: 227 SNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEYDKEKLEKIKDLKEKAKKLGISIILFD 286

Query: 54  ---KKKKKKKKKKEE 65
              K K    K + E
Sbjct: 287 DMTKNKTTNYKIQNE 301


>gnl|CDD|218769 pfam05833, FbpA, Fibronectin-binding protein A N-terminus (FbpA).
           This family consists of the N-terminal region of the
           prokaryotic fibronectin-binding protein. Fibronectin
           binding is considered to be an important virulence
           factor in streptococcal infections. Fibronectin is a
           dimeric glycoprotein that is present in a soluble form
           in plasma and extracellular fluids; it is also present
           in a fibrillar form on cell surfaces. Both the soluble
           and cellular forms of fibronectin may be incorporated
           into the extracellular tissue matrix. While fibronectin
           has critical roles in eukaryotic cellular processes,
           such as adhesion, migration and differentiation, it is
           also a substrate for the attachment of bacteria. The
           binding of pathogenic Streptococcus pyogenes and
           Staphylococcus aureus to epithelial cells via
           fibronectin facilitates their internalisation and
           systemic spread within the host.
          Length = 447

 Score = 34.1 bits (79), Expect = 0.100
 Identities = 20/49 (40%), Positives = 31/49 (63%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
           K+ IE     E + EN +  ++ E++ EE  ++   K+KK+KKKKKKKK
Sbjct: 399 KEEIEYLESVEAQLENAESLEDLEEIREELIEQGYLKEKKRKKKKKKKK 447



 Score = 31.8 bits (73), Expect = 0.52
 Identities = 21/66 (31%), Positives = 37/66 (56%), Gaps = 4/66 (6%)

Query: 2   RKRRGGAGEKKKRIEKRNKE----EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK 57
           +K +      +++IE+  +E    E  E   +  +  ED+EE + +  ++   K+KK+KK
Sbjct: 382 KKLKRAVEAVEEQIEETKEEIEYLESVEAQLENAESLEDLEEIREELIEQGYLKEKKRKK 441

Query: 58  KKKKKK 63
           KKKKKK
Sbjct: 442 KKKKKK 447



 Score = 28.0 bits (63), Expect = 9.1
 Identities = 11/55 (20%), Positives = 32/55 (58%)

Query: 22  EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLK 76
            +K E ++ ++++ D+E+    + +K +KK +K +K+ ++ +  ++ +   E L 
Sbjct: 282 SKKAERDRVKQKRSDLEKRVENELEKLEKKLEKLEKELEEAENAENYRLYGELLT 336


>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
          Length = 106

 Score = 32.2 bits (74), Expect = 0.10
 Identities = 7/21 (33%), Positives = 15/21 (71%)

Query: 20 KEEEKEENEQEEKEKEDVEEE 40
             E+++ E+EE+E+++  EE
Sbjct: 75 AAAEEKKEEEEEEEEKEESEE 95



 Score = 32.2 bits (74), Expect = 0.11
 Identities = 13/21 (61%), Positives = 17/21 (80%)

Query: 20 KEEEKEENEQEEKEKEDVEEE 40
            EEK+E E+EE+EKE+ EEE
Sbjct: 76 AAEEKKEEEEEEEEKEESEEE 96



 Score = 28.0 bits (63), Expect = 3.0
 Identities = 11/19 (57%), Positives = 14/19 (73%)

Query: 20 KEEEKEENEQEEKEKEDVE 38
          KEEE+EE E+EE E+E   
Sbjct: 81 KEEEEEEEEKEESEEEAAA 99



 Score = 27.2 bits (61), Expect = 4.9
 Identities = 8/20 (40%), Positives = 15/20 (75%)

Query: 20 KEEEKEENEQEEKEKEDVEE 39
          K+EE+EE E++E+ +E+   
Sbjct: 80 KKEEEEEEEEKEESEEEAAA 99


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 34.2 bits (79), Expect = 0.11
 Identities = 13/39 (33%), Positives = 28/39 (71%)

Query: 21  EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKK 59
           +EE++E +++  +++D EEE+ +K+KKKKK  +  + + 
Sbjct: 353 DEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSEL 391



 Score = 32.3 bits (74), Expect = 0.45
 Identities = 12/43 (27%), Positives = 30/43 (69%)

Query: 15  IEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK 57
           ++  ++EE++E+ + ++++ E+ EEE+ +KKKKK  +  + + 
Sbjct: 349 VDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSEL 391



 Score = 31.5 bits (72), Expect = 0.83
 Identities = 15/40 (37%), Positives = 27/40 (67%)

Query: 21  EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
            +E+E+ E E+ + ED EEE+ ++K+KKKKK  +  + + 
Sbjct: 352 SDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSEL 391



 Score = 31.1 bits (71), Expect = 1.2
 Identities = 14/48 (29%), Positives = 29/48 (60%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
           E+    +  +E E EE E  D E+++ +++++K+KKKKK  +  + + 
Sbjct: 344 EEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSEL 391



 Score = 28.8 bits (65), Expect = 5.0
 Identities = 14/44 (31%), Positives = 32/44 (72%)

Query: 21  EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
           E+    +E+E++E ED ++E ++++++++K+KKKKK  +  + E
Sbjct: 347 EDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSE 390



 Score = 28.8 bits (65), Expect = 5.2
 Identities = 13/50 (26%), Positives = 35/50 (70%)

Query: 20  KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
            EEE+E+ +  ++E+++ +E+ + +  +++++++K+KKKKK  +  +S  
Sbjct: 342 LEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSEL 391


>gnl|CDD|233787 TIGR02223, ftsN, cell division protein FtsN.  FtsN is a poorly
           conserved protein active in cell division in a number of
           Proteobacteria. The N-terminal 30 residue region tends
           to by Lys/Arg-rich, and is followed by a
           membrane-spanning region. This is followed by an acidic
           low-complexity region of variable length and a
           well-conserved C-terminal domain of two tandem regions
           matched by pfam05036 (Sporulation related repeat), found
           in several cell division and sporulation proteins. The
           role of FtsN as a suppressor for other cell division
           mutations is poorly understood; it may involve cell wall
           hydrolysis [Cellular processes, Cell division].
          Length = 298

 Score = 33.9 bits (77), Expect = 0.11
 Identities = 22/71 (30%), Positives = 33/71 (46%), Gaps = 1/71 (1%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS-V 68
           E+   +E R +  EK+  +    E +    E  K   K K+ K+K+K   K+  E QS  
Sbjct: 142 EQTVAVEARKQTAEKKPQKARTAEAQKTPVETEKIASKVKEAKQKQKALPKQTAETQSNS 201

Query: 69  KEIEESLKERK 79
           K IE + K  K
Sbjct: 202 KPIETAPKADK 212



 Score = 28.9 bits (64), Expect = 4.4
 Identities = 13/74 (17%), Positives = 30/74 (40%), Gaps = 6/74 (8%)

Query: 16  EKRNKEEEKEENEQE------EKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E+R   E+ + + +           E     + +K+  +KK +K +  + +K   E    
Sbjct: 117 EQRQLLEQMQADMRAAEKVLATAPSEQTVAVEARKQTAEKKPQKARTAEAQKTPVETEKI 176

Query: 70  EIEESLKERKEKTL 83
             +    ++K+K L
Sbjct: 177 ASKVKEAKQKQKAL 190


>gnl|CDD|215456 PLN02850, PLN02850, aspartate-tRNA ligase.
          Length = 530

 Score = 34.3 bits (79), Expect = 0.11
 Identities = 14/61 (22%), Positives = 26/61 (42%), Gaps = 2/61 (3%)

Query: 30 EEKEKEDVEEEKNKKKKKKK--KKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDS 87
            +E  +   EK  KK  KK   K +K +++   K    S+++ ++ L         E+ 
Sbjct: 1  SSQEAVEESGEKISKKAAKKAAAKAEKLRREATAKAAAASLEDEDDPLASNYGDVPLEEL 60

Query: 88 Q 88
          Q
Sbjct: 61 Q 61


>gnl|CDD|227458 COG5129, MAK16, Nuclear protein with HMG-like acidic region
           [General function prediction only].
          Length = 303

 Score = 33.9 bits (77), Expect = 0.11
 Identities = 18/62 (29%), Positives = 32/62 (51%), Gaps = 4/62 (6%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK----KKEEQSVKEI 71
           ++  +  E EE E  E E ++ E+E NK K +K+K    KK +K       ++E+  ++I
Sbjct: 238 DQSMETSESEEEESSESESDEDEDEDNKGKIRKRKTDDAKKSRKPHIHIEYEQERENEKI 297

Query: 72  EE 73
             
Sbjct: 298 PA 299



 Score = 30.0 bits (67), Expect = 1.7
 Identities = 23/81 (28%), Positives = 42/81 (51%), Gaps = 5/81 (6%)

Query: 8   AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
           A E++KR +     E++   E+EE+   ++E   +  +K+K KKK  +K     +  E S
Sbjct: 190 ATEREKRQD-----EKERYVEEEEESDTELEAVTDDSEKEKTKKKDLEKWLGSDQSMETS 244

Query: 68  VKEIEESLKERKEKTLTEDSQ 88
             E EES +   ++   ED++
Sbjct: 245 ESEEEESSESESDEDEDEDNK 265


>gnl|CDD|173534 PTZ00341, PTZ00341, Ring-infected erythrocyte surface antigen;
            Provisional.
          Length = 1136

 Score = 34.4 bits (78), Expect = 0.12
 Identities = 21/70 (30%), Positives = 44/70 (62%), Gaps = 1/70 (1%)

Query: 17   KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLK 76
            + N EE  EEN +E  E E+VEE   +  ++  ++  ++ ++  ++ +E++V+EIEE+ +
Sbjct: 992  EENVEENVEENIEENVE-ENVEENIEENVEEYDEENVEEVEENVEEYDEENVEEIEENAE 1050

Query: 77   ERKEKTLTED 86
            E  E+ + E+
Sbjct: 1051 ENVEENIEEN 1060



 Score = 31.3 bits (70), Expect = 0.94
 Identities = 19/68 (27%), Positives = 43/68 (63%), Gaps = 1/68 (1%)

Query: 21   EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
            EE  EEN +E  E E++EE   +  ++ ++  ++  ++  ++  E++V+EIEE+++E  E
Sbjct: 1046 EENAEENVEENIE-ENIEEYDEENVEEIEENIEENIEENVEENVEENVEEIEENVEENVE 1104

Query: 81   KTLTEDSQ 88
            +   E+++
Sbjct: 1105 ENAEENAE 1112



 Score = 30.9 bits (69), Expect = 1.2
 Identities = 24/93 (25%), Positives = 52/93 (55%), Gaps = 5/93 (5%)

Query: 1    MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEK---EDVEE--EKNKKKKKKKKKKKKK 55
            ++ +     E  K   + N EE+ EEN +E+ E+   E+VEE  E+N ++  ++  ++  
Sbjct: 928  LKNQNENVPEHLKEHAEANIEEDAEENVEEDAEENVEENVEENVEENVEENVEENVEENV 987

Query: 56   KKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQ 88
            ++  ++  EE   + IEE+++E  E+ + E+ +
Sbjct: 988  EENVEENVEENVEENIEENVEENVEENIEENVE 1020



 Score = 29.4 bits (65), Expect = 4.1
 Identities = 19/72 (26%), Positives = 42/72 (58%), Gaps = 1/72 (1%)

Query: 17   KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLK 76
            + N EE  EEN +E  E E+VEE   +  ++ ++  ++  ++  ++ EE + + +EE+++
Sbjct: 1000 EENIEENVEENVEENIE-ENVEEYDEENVEEVEENVEEYDEENVEEIEENAEENVEENIE 1058

Query: 77   ERKEKTLTEDSQ 88
            E  E+   E+ +
Sbjct: 1059 ENIEEYDEENVE 1070


>gnl|CDD|217020 pfam02403, Seryl_tRNA_N, Seryl-tRNA synthetase N-terminal domain.
           This domain is found associated with the Pfam tRNA
          synthetase class II domain (pfam00587) and represents
          the N-terminal domain of seryl-tRNA synthetase.
          Length = 108

 Score = 31.8 bits (73), Expect = 0.12
 Identities = 21/79 (26%), Positives = 41/79 (51%), Gaps = 6/79 (7%)

Query: 12 KKRIEKRNKEEE-----KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
          K+++ KR  +        E +E+  + + ++EE + ++ +  K+  K KKKK+  +    
Sbjct: 15 KEKLRKRGVDVLDVDELLELDEERRELQVELEELQAERNELSKEIGKAKKKKEDAEALIA 74

Query: 67 SVKEIEESLKERKEKTLTE 85
           VKE+++ LK   E  L E
Sbjct: 75 EVKELKDELKA-LEAELRE 92


>gnl|CDD|216337 pfam01159, Ribosomal_L6e, Ribosomal protein L6e. 
          Length = 108

 Score = 31.8 bits (73), Expect = 0.12
 Identities = 21/66 (31%), Positives = 33/66 (50%), Gaps = 12/66 (18%)

Query: 36 DVEEEKNKKKKKKKKKKKKKKKKK---KKKKEEQSVKEIEESLKERKEKTLTEDSQCDTM 92
           V E+ N +  K++KKKKKKK +     +KKE++ V E      +RK      D+    +
Sbjct: 27 KVPEKINDEYFKREKKKKKKKSEGEFFAEKKEKKEVSE------QRKADQKAVDAA---L 77

Query: 93 IPVFKK 98
          +   KK
Sbjct: 78 LAAIKK 83



 Score = 28.4 bits (64), Expect = 2.4
 Identities = 19/50 (38%), Positives = 30/50 (60%), Gaps = 9/50 (18%)

Query: 33 EKEDVEEEKNKKKKKKKKK-------KKKKKKKKKKKKEEQSVKEIEESL 75
          EK + E  K +KKKKKKK        KK+KK+  +++K +Q  K ++ +L
Sbjct: 30 EKINDEYFKREKKKKKKKSEGEFFAEKKEKKEVSEQRKADQ--KAVDAAL 77


>gnl|CDD|178669 PLN03122, PLN03122, Poly [ADP-ribose] polymerase; Provisional.
          Length = 815

 Score = 34.0 bits (78), Expect = 0.13
 Identities = 23/77 (29%), Positives = 36/77 (46%), Gaps = 1/77 (1%)

Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
          E + +      EE K    +++ E ++ E E++ KK KK+ KK+        K  E +VK
Sbjct: 4  ETRSQAHAPAAEEGKGGTRKQKAENKEHEGEQSPKKAKKE-KKQDDSGNGNGKSAEDAVK 62

Query: 70 EIEESLKERKEKTLTED 86
          E EE  K  +E    E 
Sbjct: 63 EFEEFCKAIEEHLSIEQ 79


>gnl|CDD|218601 pfam05477, SURF2, Surfeit locus protein 2 (SURF2).  Surfeit locus
           protein 2 is part of a group of at least six sequence
           unrelated genes (Surf-1 to Surf-6). The six Surfeit
           genes have been classified as housekeeping genes, being
           expressed in all tissue types tested and not containing
           a TATA box in their promoter region. The exact function
           of SURF2 is unknown.
          Length = 244

 Score = 33.4 bits (76), Expect = 0.13
 Identities = 19/63 (30%), Positives = 30/63 (47%), Gaps = 3/63 (4%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKK---KKKKKKKKKKKKKKEEQ 66
            K    E+   E++  E + E++ + +  E + K+ KK+     KK KK  KKK      
Sbjct: 182 LKNPGKEQNGDEDDDFETDDEDEMEVESPELQQKRSKKQSGSLTKKFKKNHKKKGPFRFS 241

Query: 67  SVK 69
           SVK
Sbjct: 242 SVK 244


>gnl|CDD|217450 pfam03247, Prothymosin, Prothymosin/parathymosin family.
          Prothymosin alpha and parathymosin are two ubiquitous
          small acidic nuclear proteins that are thought to be
          involved in cell cycle progression, proliferation, and
          cell differentiation.
          Length = 106

 Score = 31.9 bits (72), Expect = 0.13
 Identities = 15/72 (20%), Positives = 38/72 (52%), Gaps = 1/72 (1%)

Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKE-DVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
          EK+          E EEN  +E + E + EEE ++  ++++ + ++++ +++++ E  + 
Sbjct: 25 EKENGKNAPANGNENEENGAQEGDDEMEEEEEVDEDDEEEEGEGEEEEGEEEEETEGATG 84

Query: 69 KEIEESLKERKE 80
          K   E  ++  E
Sbjct: 85 KRAAEDEEDDAE 96



 Score = 31.1 bits (70), Expect = 0.24
 Identities = 16/52 (30%), Positives = 32/52 (61%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
           E+ ++++E+EE E EE+E E+ EE +    K+  + ++   + KK+K +E  
Sbjct: 55  EEVDEDDEEEEGEGEEEEGEEEEETEGATGKRAAEDEEDDAETKKQKTDEDD 106



 Score = 28.4 bits (63), Expect = 2.1
 Identities = 14/50 (28%), Positives = 32/50 (64%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKK 59
           E+++ +++ ++EEE E  E+E +E+E+ E    K+  + ++   + KK+K
Sbjct: 52  EEEEEVDEDDEEEEGEGEEEEGEEEEETEGATGKRAAEDEEDDAETKKQK 101



 Score = 28.0 bits (62), Expect = 3.5
 Identities = 13/50 (26%), Positives = 28/50 (56%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
           E+   E+++EE  + E+E+ + EEE      K+  + ++   + KK+K +
Sbjct: 54  EEEVDEDDEEEEGEGEEEEGEEEEETEGATGKRAAEDEEDDAETKKQKTD 103


>gnl|CDD|238288 cd00520, RRF, Ribosome recycling factor (RRF). Ribosome recycling
           factor dissociates the posttermination complex, composed
           of the ribosome, deacylated tRNA, and mRNA, after
           termination of translation.  Thus ribosomes are
           "recycled" and ready for another round of protein
           synthesis.  RRF is believed to bind the ribosome at the
           A-site in a manner that mimics tRNA, but the specific
           mechanisms remain unclear.  RRF is essential for
           bacterial growth.  It is not necessary for cell growth
           in archaea or eukaryotes, but is found in mitochondria
           or chloroplasts of some eukaryotic species.
          Length = 179

 Score = 33.0 bits (76), Expect = 0.14
 Identities = 20/66 (30%), Positives = 37/66 (56%), Gaps = 7/66 (10%)

Query: 24  KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE------QSVKEIEESLKE 77
           K+  E+ +    ++  + N K KK +K+K+  + + KK +E+      + +K+I+E LK 
Sbjct: 114 KKIAEEAKVAIRNIRRDANDKIKKLEKEKEISEDEVKKAEEDLQKLTDEYIKKIDELLKS 173

Query: 78  RKEKTL 83
            KEK L
Sbjct: 174 -KEKEL 178


>gnl|CDD|223626 COG0552, FtsY, Signal recognition particle GTPase [Intracellular
          trafficking and secretion].
          Length = 340

 Score = 33.8 bits (78), Expect = 0.14
 Identities = 21/71 (29%), Positives = 35/71 (49%), Gaps = 15/71 (21%)

Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKK---------------KKKKKKKKKKKE 64
          K EE+ E  + EKEK + E+E+ +K+   ++ K                   KK K+K +
Sbjct: 6  KLEEELEELEAEKEKIEEEDEEEEKEGWFERLKQGLSKTKKNFGKGIKGLFLKKIKEKLD 65

Query: 65 EQSVKEIEESL 75
          E  ++E+EE L
Sbjct: 66 EDLLEELEELL 76


>gnl|CDD|218734 pfam05758, Ycf1, Ycf1.  The chloroplast genomes of most higher
           plants contain two giant open reading frames designated
           ycf1 and ycf2. Although the function of Ycf1 is unknown,
           it is known to be an essential gene.
          Length = 832

 Score = 33.8 bits (78), Expect = 0.14
 Identities = 14/55 (25%), Positives = 24/55 (43%), Gaps = 2/55 (3%)

Query: 10  EKKKRIEKRN--KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
           E    IE  +  K  ++E+    E++     EEK    K +   K +  K+KK +
Sbjct: 238 ETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLEILKEKKDE 292



 Score = 33.4 bits (77), Expect = 0.21
 Identities = 13/71 (18%), Positives = 29/71 (40%)

Query: 11  KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
           KK +     +E E+E + + E   E    ++ ++   ++      ++K+   K E   K 
Sbjct: 224 KKLKETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKL 283

Query: 71  IEESLKERKEK 81
                K+ +E 
Sbjct: 284 EILKEKKDEEL 294



 Score = 33.1 bits (76), Expect = 0.25
 Identities = 14/69 (20%), Positives = 29/69 (42%), Gaps = 3/69 (4%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKK---KKKKKKKKKKKKKKKKKEEQ 66
           ++    E+R +E + E     E +    E+E + ++      ++K+   K +   K E  
Sbjct: 227 KETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLEIL 286

Query: 67  SVKEIEESL 75
             K+ EE  
Sbjct: 287 KEKKDEELF 295



 Score = 31.5 bits (72), Expect = 0.83
 Identities = 13/41 (31%), Positives = 26/41 (63%)

Query: 11  KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKK 51
           K K+I +   E++KE    +  E++  +EEK +K K+++K+
Sbjct: 638 KIKQIFRNWIEKKKEFKILDYTEEKTKKEEKKEKNKREEKE 678



 Score = 28.0 bits (63), Expect = 9.0
 Identities = 13/46 (28%), Positives = 29/46 (63%)

Query: 15  IEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
           I +  K ++   N  E+K++  + +   +K KK++KK+K K+++K+
Sbjct: 633 ISELMKIKQIFRNWIEKKKEFKILDYTEEKTKKEEKKEKNKREEKE 678


>gnl|CDD|225871 COG3334, COG3334, Uncharacterized conserved protein [Function
           unknown].
          Length = 192

 Score = 32.8 bits (75), Expect = 0.14
 Identities = 17/63 (26%), Positives = 25/63 (39%), Gaps = 4/63 (6%)

Query: 21  EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE----QSVKEIEESLK 76
           E++K E +  E+E+E +   K  +  K  K   K K        E    +    I   LK
Sbjct: 97  EKKKAELKDLEEEREGILRSKQAEDGKLVKIYSKMKPDAAAAILENLPDEEAAAILMKLK 156

Query: 77  ERK 79
            RK
Sbjct: 157 PRK 159



 Score = 27.4 bits (61), Expect = 8.3
 Identities = 14/59 (23%), Positives = 29/59 (49%), Gaps = 6/59 (10%)

Query: 24  KEENEQEEKEKE--DVEEEKNKKKKKKKKKKKKKKKKKK----KKKEEQSVKEIEESLK 76
             + E  EK K+  +V E     +KKK + K  +++++     K+ E+  + +I   +K
Sbjct: 74  ALQKELLEKLKDLAEVNERLKALEKKKAELKDLEEEREGILRSKQAEDGKLVKIYSKMK 132


>gnl|CDD|220838 pfam10659, Trypan_glycop_C, Trypanosome variant surface
          glycoprotein C-terminal domain.  The trypanosome
          parasite expresses these proteins to evade the immune
          response.
          Length = 98

 Score = 31.6 bits (72), Expect = 0.15
 Identities = 13/65 (20%), Positives = 27/65 (41%)

Query: 23 EKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKT 82
           K+   + E +++  + +K +   K K K+ K KK      +    +   E  K +K+K 
Sbjct: 2  NKKNKTKTECKEKGCKWDKKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKK 61

Query: 83 LTEDS 87
            +  
Sbjct: 62 DCKKG 66



 Score = 29.7 bits (67), Expect = 0.75
 Identities = 13/54 (24%), Positives = 21/54 (38%)

Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
          E   + + +  + +K      +    +   EK K KK KK  KK  K +    K
Sbjct: 22 EDDGKCKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKKDCKKGCKWEGNTCK 75



 Score = 28.9 bits (65), Expect = 1.1
 Identities = 12/47 (25%), Positives = 21/47 (44%), Gaps = 3/47 (6%)

Query: 19 NKEEEKEE---NEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
          +K+E+  +    E + K+      +    +   +K K KK KK  KK
Sbjct: 19 DKKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKKDCKK 65



 Score = 28.9 bits (65), Expect = 1.3
 Identities = 10/54 (18%), Positives = 18/54 (33%)

Query: 9  GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
           ++     K  + + K+      +        +  K KK KK  KK  K +   
Sbjct: 20 KKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKKDCKKGCKWEGNT 73



 Score = 28.6 bits (64), Expect = 1.6
 Identities = 12/43 (27%), Positives = 16/43 (37%)

Query: 22 EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
          E K +       +    E   +K K KK KK  KK  K +   
Sbjct: 31 EGKAKKNGAPVTQTAGTETTTEKCKGKKDKKDCKKGCKWEGNT 73



 Score = 28.6 bits (64), Expect = 1.7
 Identities = 13/51 (25%), Positives = 22/51 (43%), Gaps = 4/51 (7%)

Query: 19 NKEEEKE----ENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
           KE++ +    E + ++      +    +   +K K KK KK  KK  K E
Sbjct: 20 KKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKKDCKKGCKWE 70



 Score = 28.6 bits (64), Expect = 1.7
 Identities = 10/57 (17%), Positives = 22/57 (38%)

Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLK 76
           ++E +   + ++ K         +    +   +K K KK KK  ++  K    + K
Sbjct: 19 DKKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKKDCKKGCKWEGNTCK 75



 Score = 28.6 bits (64), Expect = 1.8
 Identities = 14/65 (21%), Positives = 27/65 (41%)

Query: 22 EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
           +K + + E KEK    ++K    K K K+ K KK      +   +    E+   ++ +K
Sbjct: 2  NKKNKTKTECKEKGCKWDKKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKK 61

Query: 82 TLTED 86
             + 
Sbjct: 62 DCKKG 66



 Score = 28.2 bits (63), Expect = 2.0
 Identities = 10/58 (17%), Positives = 27/58 (46%)

Query: 8  AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
            + K   +++  + +K+E++ + K KE   ++      +    +   +K K KK ++
Sbjct: 4  KNKTKTECKEKGCKWDKKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKK 61


>gnl|CDD|219911 pfam08572, PRP3, pre-mRNA processing factor 3 (PRP3).  Pre-mRNA
           processing factor 3 (PRP3) is a U4/U6-associated
           splicing factor. The human PRP3 has been implicated in
           autosomal retinitis pigmentosa.
          Length = 222

 Score = 33.1 bits (76), Expect = 0.16
 Identities = 24/100 (24%), Positives = 41/100 (41%), Gaps = 9/100 (9%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
           R+RRG    +K +  ++  EE + E + EE+E   + EE      K      +K  KK+ 
Sbjct: 17  RRRRGLQFIEKGKFSQQ-AEELRREAKLEEEEAR-ISEEAQNAGLKSATDLAEKIPKKEL 74

Query: 62  KKEEQSVKEIE----ESLKERKEKTLTEDSQCDTMIPVFK 97
             +   V +IE      L +     L ++S  D    +  
Sbjct: 75  PDD---VPDIEWWDMPILDDPDYNDLEDESDEDIDEEMSS 111


>gnl|CDD|219621 pfam07890, Rrp15p, Rrp15p.  Rrp15p is required for the formation
          of 60S ribosomal subunits.
          Length = 132

 Score = 31.9 bits (73), Expect = 0.16
 Identities = 15/54 (27%), Positives = 31/54 (57%), Gaps = 2/54 (3%)

Query: 31 EKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS-VKEIEESLKERKEKTL 83
           + K+ ++ +K  K +K +KK K++ + +K++  E+  VK +     E KE+ L
Sbjct: 25 SRSKKLLKAKKKLKSEKLEKKAKRQLRAEKRQALEKGRVKPVLPEDLE-KERRL 77



 Score = 26.9 bits (60), Expect = 8.3
 Identities = 16/43 (37%), Positives = 23/43 (53%)

Query: 44 KKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTED 86
          KK  K KKK K +K +KK K +   ++ +   K R +  L ED
Sbjct: 28 KKLLKAKKKLKSEKLEKKAKRQLRAEKRQALEKGRVKPVLPED 70


>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34.  This family represents
           herpes virus protein U79 and cytomegalovirus early
           phosphoprotein P34 (UL112).
          Length = 238

 Score = 32.9 bits (75), Expect = 0.17
 Identities = 17/79 (21%), Positives = 35/79 (44%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
            ++R G  ++K+R+E   K +E    +QEEK + D ++           +     K +  
Sbjct: 159 HRKRSGKQKEKRRVEDSQKHKEDRRKKQEEKRRNDEDKRPGGGGGSSGGQSGLSTKDEPP 218

Query: 62  KKEEQSVKEIEESLKERKE 80
           K++ Q   + E  L+ +  
Sbjct: 219 KEKRQKHHDPERRLEPQSH 237



 Score = 32.1 bits (73), Expect = 0.29
 Identities = 18/86 (20%), Positives = 36/86 (41%), Gaps = 10/86 (11%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK-KKK 60
           RK+      K+       K++EK   E  +K KED    + KK+++K++  + K+     
Sbjct: 152 RKKSDDEHRKRS-----GKQKEKRRVEDSQKHKED----RRKKQEEKRRNDEDKRPGGGG 202

Query: 61  KKKEEQSVKEIEESLKERKEKTLTED 86
                QS    ++   + K +   + 
Sbjct: 203 GSSGGQSGLSTKDEPPKEKRQKHHDP 228


>gnl|CDD|227038 COG4694, COG4694, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 758

 Score = 33.8 bits (77), Expect = 0.17
 Identities = 17/64 (26%), Positives = 34/64 (53%), Gaps = 1/64 (1%)

Query: 20  KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERK 79
           KE   ++NE++ K +  ++ +  +++K++K       +   KK EE+  KE  E+ K +K
Sbjct: 124 KESINKKNEKKIKNEASLQVKTQREEKEEKDFIADCWRNLYKKNEERF-KEYLENFKRKK 182

Query: 80  EKTL 83
            K  
Sbjct: 183 FKRK 186


>gnl|CDD|165026 PHA02644, PHA02644, hypothetical protein; Provisional.
          Length = 112

 Score = 31.9 bits (71), Expect = 0.17
 Identities = 24/73 (32%), Positives = 41/73 (56%), Gaps = 2/73 (2%)

Query: 11  KKKRIEKRNKEEEKEENEQEEKEKEDVE--EEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
           K  +I   N++E K  +E  +KE ED +  E+   + +KK +K + + +KK +K E +  
Sbjct: 40  KYDKIFISNEDEMKTTHEHIKKENEDEKKPEKPENEDEKKPEKPENEDEKKPEKPENEDE 99

Query: 69  KEIEESLKERKEK 81
           KE E+S    K+K
Sbjct: 100 KEFEDSDASDKKK 112



 Score = 26.9 bits (58), Expect = 8.4
 Identities = 19/55 (34%), Positives = 31/55 (56%), Gaps = 1/55 (1%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEK-EKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
           E  K+  +  K+ EK ENE E+K EK + E+EK  +K + + +K+ +      KK
Sbjct: 57  EHIKKENEDEKKPEKPENEDEKKPEKPENEDEKKPEKPENEDEKEFEDSDASDKK 111


>gnl|CDD|185611 PTZ00428, PTZ00428, 60S ribosomal protein L4; Provisional.
          Length = 381

 Score = 33.5 bits (77), Expect = 0.18
 Identities = 12/37 (32%), Positives = 21/37 (56%)

Query: 40  EKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLK 76
           EK   K  +K K ++ +KK KK +  +  K + +S+K
Sbjct: 344 EKATAKGAQKVKNRRARKKAKKARLAKVAKALYKSIK 380


>gnl|CDD|184287 PRK13735, PRK13735, conjugal transfer mating pair stabilization
           protein TraG; Provisional.
          Length = 942

 Score = 33.6 bits (77), Expect = 0.18
 Identities = 11/62 (17%), Positives = 24/62 (38%)

Query: 1   MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
           +R        +   ++  +K E   +N +  +EK   E        ++  K+ K+ + K 
Sbjct: 881 IRGEENTVKGQYSELQNHHKTEALSQNNKYNEEKSAQERMPGADSPEELMKRAKEYQDKH 940

Query: 61  KK 62
           K 
Sbjct: 941 KG 942



 Score = 29.3 bits (66), Expect = 3.5
 Identities = 7/44 (15%), Positives = 16/44 (36%)

Query: 20  KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
              + E   Q  K  E+   ++        ++  K+ K+ + K 
Sbjct: 897 NHHKTEALSQNNKYNEEKSAQERMPGADSPEELMKRAKEYQDKH 940


>gnl|CDD|221857 pfam12923, RRP7, Ribosomal RNA-processing protein 7 (RRP7).  RRP7
          is an essential protein in yeast that is involved in
          pre-rRNA processing and ribosome assembly. It is
          speculated to be required for correct assembly of rpS27
          into the pre-ribosomal particle.
          Length = 131

 Score = 31.8 bits (73), Expect = 0.19
 Identities = 21/63 (33%), Positives = 33/63 (52%), Gaps = 9/63 (14%)

Query: 10 EKKKRIEKRNKEEEKEENEQEE---------KEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
          EK  + E+  KEE K  +E +E           K      +NK  ++++K K+KKKKKKK
Sbjct: 28 EKYDKREEEAKEEAKARSEPDEDGWTTVTRGGRKRKAGASRNKAAEERRKLKEKKKKKKK 87

Query: 61 KKK 63
          + +
Sbjct: 88 ELE 90


>gnl|CDD|216249 pfam01025, GrpE, GrpE. 
          Length = 165

 Score = 32.2 bits (74), Expect = 0.19
 Identities = 17/69 (24%), Positives = 40/69 (57%), Gaps = 5/69 (7%)

Query: 10 EKKKRIEKRNKEEE-KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
          EK++  E  ++EE  +EE E+ E+E E++++    +  +   + +  +K+ ++++EE   
Sbjct: 2  EKEEEEELEDEEEALEEELEELEEEIEELKD----RLLRLLAEFENYRKRTEREREEAKK 57

Query: 69 KEIEESLKE 77
            IE+  K+
Sbjct: 58 YAIEKFAKD 66



 Score = 32.2 bits (74), Expect = 0.22
 Identities = 19/79 (24%), Positives = 42/79 (53%), Gaps = 10/79 (12%)

Query: 20 KEEEKEE--NEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKE 77
          +E+E+EE   ++EE  +E++EE + + ++ K +  +   + +  +K  +  +E EE+ K 
Sbjct: 1  EEKEEEEELEDEEEALEEELEELEEEIEELKDRLLRLLAEFENYRKRTE--REREEAKKY 58

Query: 78 RKEKTLTEDSQCDTMIPVF 96
            EK   +      ++PV 
Sbjct: 59 AIEKFAKD------LLPVL 71


>gnl|CDD|235850 PRK06669, fliH, flagellar assembly protein H; Validated.
          Length = 281

 Score = 33.1 bits (76), Expect = 0.19
 Identities = 20/67 (29%), Positives = 41/67 (61%)

Query: 15  IEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES 74
           IEK   + E+E+ E EE+ +  +EE K +  ++  +K +++  ++ ++  EQ  K IE+ 
Sbjct: 94  IEKLQMQIEREQEEWEEELERLIEEAKAEGYEEGYEKGREEGLEEVRELIEQLNKIIEKL 153

Query: 75  LKERKEK 81
           +K+R+E 
Sbjct: 154 IKKREEI 160



 Score = 28.4 bits (64), Expect = 5.2
 Identities = 15/72 (20%), Positives = 37/72 (51%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E+     K   EE +E+  +  +  E+  +E+  KK  +     +K + + ++++E+  +
Sbjct: 51  EEANDEAKEIIEEAEEDAFEIVEAAEEEAKEELLKKTDEASSIIEKLQMQIEREQEEWEE 110

Query: 70  EIEESLKERKEK 81
           E+E  ++E K +
Sbjct: 111 ELERLIEEAKAE 122



 Score = 28.4 bits (64), Expect = 5.2
 Identities = 16/78 (20%), Positives = 41/78 (52%), Gaps = 4/78 (5%)

Query: 8   AGEKKKRIEKRNKEEEKEENEQ----EEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
           A E  +  E+  KEE  ++ ++     EK +  +E E+ + +++ ++  ++ K +  ++ 
Sbjct: 68  AFEIVEAAEEEAKEELLKKTDEASSIIEKLQMQIEREQEEWEEELERLIEEAKAEGYEEG 127

Query: 64  EEQSVKEIEESLKERKEK 81
            E+  +E  E ++E  E+
Sbjct: 128 YEKGREEGLEEVRELIEQ 145


>gnl|CDD|221121 pfam11489, DUF3210, Protein of unknown function (DUF3210).  This is
           a family of proteins conserved in yeasts. The function
           is not known. The Schizosaccharomyces pombe member is
           SPBC18E5.07 and the Saccharomyces cerevisiae member is
           AIM21.
          Length = 671

 Score = 33.4 bits (76), Expect = 0.19
 Identities = 15/62 (24%), Positives = 29/62 (46%)

Query: 11  KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
           KK     +   +EK+E +++ KEK  + + +  + +   ++K       +K  E  SV  
Sbjct: 602 KKVLESPKEPSKEKKEEDEDTKEKAPLSDARKGRARGPARRKPATVAATEKLPEIPSVSI 661

Query: 71  IE 72
           IE
Sbjct: 662 IE 663



 Score = 30.7 bits (69), Expect = 1.3
 Identities = 12/58 (20%), Positives = 28/58 (48%)

Query: 17  KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES 74
           K+  E  KE ++++++E ED +E+      +K + +   ++K       + + EI   
Sbjct: 602 KKVLESPKEPSKEKKEEDEDTKEKAPLSDARKGRARGPARRKPATVAATEKLPEIPSV 659


>gnl|CDD|218896 pfam06098, Radial_spoke_3, Radial spoke protein 3.  This family
           consists of several radial spoke protein 3 (RSP3)
           sequences. Eukaryotic cilia and flagella present in
           diverse types of cells perform motile, sensory, and
           developmental functions in organisms from protists to
           humans. They are centred by precisely organised,
           microtubule-based structures, the axonemes. The axoneme
           consists of two central singlet microtubules, called the
           central pair, and nine outer doublet microtubules. These
           structures are well-conserved during evolution. The
           outer doublet microtubules, each composed of A and B
           sub-fibres, are connected to each other by nexin links,
           while the central pair is held at the centre of the
           axoneme by radial spokes. The radial spokes are T-shaped
           structures extending from the A-tubule of each outer
           doublet microtubule to the centre of the axoneme. Radial
           spoke protein 3 (RSP3), is present at the proximal end
           of the spoke stalk and helps in anchoring the radial
           spoke to the outer doublet. It is thought that radial
           spokes regulate the activity of inner arm dynein through
           protein phosphorylation and dephosphorylation.
          Length = 288

 Score = 33.1 bits (76), Expect = 0.19
 Identities = 15/51 (29%), Positives = 33/51 (64%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
           +++++ E+R   E  E    EE E+   EE++ +KK+ K++K+++K+  +K
Sbjct: 159 QQQRQFEQRRNAELAETQRLEEAERRRREEKERRKKQDKERKQREKETAEK 209


>gnl|CDD|146145 pfam03357, Snf7, Snf7.  This family of proteins are involved in
          protein sorting and transport from the endosome to the
          vacuole/lysosome in eukaryotic cells.
          Vacuoles/lysosomes play an important role in the
          degradation of both lipids and cellular proteins. In
          order to perform this degradative function,
          vacuoles/lysosomes contain numerous hydrolases which
          have been transported in the form of inactive
          precursors via the biosynthetic pathway and are
          proteolytically activated upon delivery to the
          vacuole/lysosome. The delivery of transmembrane
          proteins, such as activated cell surface receptors to
          the lumen of the vacuole/lysosome, either for
          degradation/downregulation, or in the case of
          hydrolases, for proper localisation, requires the
          formation of multivesicular bodies (MVBs). These late
          endosomal structures are formed by invaginating and
          budding of the limiting membrane into the lumen of the
          compartment. During this process, a subset of the
          endosomal membrane proteins is sorted into the forming
          vesicles. Mature MVBs fuse with the vacuole/lysosome,
          thereby releasing cargo containing vesicles into its
          hydrolytic lumen for degradation. Endosomal proteins
          that are not sorted into the intralumenal MVB vesicles
          are either recycled back to the plasma membrane or
          Golgi complex, or remain in the limiting membrane of
          the MVB and are thereby transported to the limiting
          membrane of the vacuole/lysosome as a consequence of
          fusion. Therefore, the MVB sorting pathway plays a
          critical role in the decision between recycling and
          degradation of membrane proteins. A few archaeal
          sequences are also present within this family.
          Length = 169

 Score = 32.2 bits (74), Expect = 0.19
 Identities = 16/58 (27%), Positives = 27/58 (46%), Gaps = 1/58 (1%)

Query: 19 NKEEEKEENEQEEKE-KEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
           K   + + +QE  E K    E + KK  KK  K       K+KK+ E+ + +++  L
Sbjct: 7  RKAIRELDKKQESLEKKIKKLEAEIKKLAKKGNKDAALILLKQKKRYEKQLDQLDGQL 64


>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
          Length = 742

 Score = 33.3 bits (76), Expect = 0.20
 Identities = 17/74 (22%), Positives = 32/74 (43%), Gaps = 3/74 (4%)

Query: 9   GEKKKRIEKRNKEEEKEENEQEEKEKEDV---EEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
            ++KK+I+K+   ++  +N  + K            +  +K K K +K KKK    K   
Sbjct: 77  LKQKKKIKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTEKLKKKITVNKSTN 136

Query: 66  QSVKEIEESLKERK 79
           +  K++  S  E  
Sbjct: 137 KKKKKVLSSKDELI 150



 Score = 32.9 bits (75), Expect = 0.33
 Identities = 18/73 (24%), Positives = 32/73 (43%), Gaps = 6/73 (8%)

Query: 15  IEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK------KKKKKKEEQSV 68
           I K+  E+  ++N++  K  +  + +K K K+KKK KKK             K   +Q  
Sbjct: 48  INKKLHEKLDKKNKKFNKTDDLKDSKKTKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFA 107

Query: 69  KEIEESLKERKEK 81
             +  SL  + + 
Sbjct: 108 GPLAISLMRKPKP 120



 Score = 32.1 bits (73), Expect = 0.46
 Identities = 13/45 (28%), Positives = 18/45 (40%)

Query: 31  EKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
            K K   E+ K K    K   KKKKK    K +  +      +S+
Sbjct: 116 RKPKPKTEKLKKKITVNKSTNKKKKKVLSSKDELIKYDNNKPKSI 160



 Score = 30.6 bits (69), Expect = 1.4
 Identities = 20/96 (20%), Positives = 34/96 (35%), Gaps = 9/96 (9%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKK-------KKKKKKKKKKKKE 64
            K    + + E   ++     +      EK  KK KK  K       KK K K+KKK K+
Sbjct: 26  LKHSSYKIRLESDIKDSLLNLDINKKLHEKLDKKNKKFNKTDDLKDSKKTKLKQKKKIKK 85

Query: 65  EQSVKEIEESLKERKEKT--LTEDSQCDTMIPVFKK 98
           +  + +  ++  + K  +           M     K
Sbjct: 86  KLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPK 121



 Score = 30.6 bits (69), Expect = 1.7
 Identities = 17/94 (18%), Positives = 38/94 (40%), Gaps = 3/94 (3%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           + K   ++           + + + E ++++    K   KKKKK    K +  K + +  
Sbjct: 98  DSKNNSKQFAGPLAISLMRKPKPKTEKLKKKITVNKSTNKKKKKVLSSKDELIKYDNNKP 157

Query: 70  E---IEESLKERKEKTLTEDSQCDTMIPVFKKGV 100
           +   I   L  ++  TL    + + +  +F KG+
Sbjct: 158 KSISIHSPLTIQELSTLLCIPETEIIKSLFLKGI 191


>gnl|CDD|217829 pfam03985, Paf1, Paf1.  Members of this family are components of
           the RNA polymerase II associated Paf1 complex. The Paf1
           complex functions during the elongation phase of
           transcription in conjunction with Spt4-Spt5 and
           Spt16-Pob3i.
          Length = 431

 Score = 33.2 bits (76), Expect = 0.20
 Identities = 13/52 (25%), Positives = 23/52 (44%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
           E+R+ E E+EE E  E+E     E+ + +         + K  K+   +  S
Sbjct: 379 EQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESASDSDS 430



 Score = 32.4 bits (74), Expect = 0.36
 Identities = 11/62 (17%), Positives = 29/62 (46%)

Query: 21  EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
            EE +E+E EE+E+   E E+ + +  +++  + ++    +   +       ++ KE   
Sbjct: 367 FEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESAS 426

Query: 81  KT 82
            +
Sbjct: 427 DS 428



 Score = 30.5 bits (69), Expect = 1.7
 Identities = 12/64 (18%), Positives = 27/64 (42%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E+    E   +E+  +E+E+EE E  + E  ++++    +         + K  +E +  
Sbjct: 368 EEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESASD 427

Query: 70  EIEE 73
              E
Sbjct: 428 SDSE 431



 Score = 29.7 bits (67), Expect = 2.7
 Identities = 9/60 (15%), Positives = 30/60 (50%)

Query: 15  IEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES 74
           I+    +E+++E E++  ++ + EE ++ +++  + ++    +       +   K  +ES
Sbjct: 365 IDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKES 424


>gnl|CDD|227468 COG5139, COG5139, Uncharacterized conserved protein [Function
           unknown].
          Length = 397

 Score = 33.1 bits (75), Expect = 0.20
 Identities = 13/69 (18%), Positives = 32/69 (46%), Gaps = 3/69 (4%)

Query: 22  EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
           + K+ +  E+ E+  ++E  N +  ++K+K             +   ++ ++SL+  +E 
Sbjct: 53  DTKDPDNGEKNEEAAIDENSNVEAAERKRKHISTDFSDMSLLRK---RKNDQSLQPTREP 109

Query: 82  TLTEDSQCD 90
             + DS  D
Sbjct: 110 MDSRDSGQD 118



 Score = 32.4 bits (73), Expect = 0.37
 Identities = 15/88 (17%), Positives = 36/88 (40%), Gaps = 3/88 (3%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNK---KKKKKKKKKKKKKKK 58
                     +KR   ++ +  +E  +  +  ++  E +  +      ++ K     + +
Sbjct: 85  STDFSDMSLLRKRKNDQSLQPTREPMDSRDSGQDFTEAQSGELGDTGDRQLKAPAASRAR 144

Query: 59  KKKKKEEQSVKEIEESLKERKEKTLTED 86
           +K+   EQ+V EI   LK+R +    +D
Sbjct: 145 RKEDLLEQTVDEISLRLKKRMQDAAKKD 172


>gnl|CDD|219748 pfam08214, KAT11, Histone acetylation protein.  Histone acetylation
           is required in many cellular processes including
           transcription, DNA repair, and chromatin assembly. This
           family contains the fungal KAT11 protein (previously
           known as RTT109) which is required for H3K56
           acetylation. Loss of KAT11 results in the loss of H3K56
           acetylation, both on bulk histone and on chromatin.
           KAT11 and H3K56 acetylation appear to correlate with
           actively transcribed genes and associate with the
           elongating form of Pol II in yeast. This family also
           incorporates the p300/CBP histone acetyltransferase
           domain which has different catalytic properties and
           cofactor regulation to KAT11.
          Length = 341

 Score = 33.2 bits (76), Expect = 0.21
 Identities = 15/68 (22%), Positives = 22/68 (32%), Gaps = 2/68 (2%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEK--NKKKKKKKKKKKKKKKKKKKKKEEQS 67
           E + +    ++  E+    QE      V          K    KK+ K+ KK    EE  
Sbjct: 238 EGRWKSVSLDQFWEELGFRQEFSLGRLVGFIGLVINPPKLVVSKKQYKRAKKAITGEEYD 297

Query: 68  VKEIEESL 75
                 SL
Sbjct: 298 TLSGAASL 305


>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P.  This model
          represents the L12P protein of the large (50S) subunit
          of the archaeal ribosome.
          Length = 105

 Score = 31.2 bits (71), Expect = 0.22
 Identities = 11/21 (52%), Positives = 16/21 (76%)

Query: 20 KEEEKEENEQEEKEKEDVEEE 40
            EE+EE E+EE+E+E+  EE
Sbjct: 74 AAEEEEEEEEEEEEEEEESEE 94



 Score = 31.2 bits (71), Expect = 0.23
 Identities = 13/20 (65%), Positives = 18/20 (90%)

Query: 21 EEEKEENEQEEKEKEDVEEE 40
          EEE+EE E+EE+E+E+ EEE
Sbjct: 76 EEEEEEEEEEEEEEEESEEE 95



 Score = 30.0 bits (68), Expect = 0.61
 Identities = 12/20 (60%), Positives = 17/20 (85%)

Query: 20 KEEEKEENEQEEKEKEDVEE 39
          +EEE+EE E+EE+E+E  EE
Sbjct: 76 EEEEEEEEEEEEEEEESEEE 95



 Score = 29.6 bits (67), Expect = 0.72
 Identities = 12/33 (36%), Positives = 18/33 (54%)

Query: 6  GGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVE 38
            A       E+  +EEE+EE E+EE E+E + 
Sbjct: 66 AAAAAAAAAAEEEEEEEEEEEEEEEESEEEAMA 98



 Score = 28.9 bits (65), Expect = 1.7
 Identities = 10/21 (47%), Positives = 15/21 (71%)

Query: 20 KEEEKEENEQEEKEKEDVEEE 40
             E+EE E+EE+E+E+ E E
Sbjct: 73 AAAEEEEEEEEEEEEEEEESE 93


>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa).  Members of this
           family of proteins are part of the yeast nuclear pore
           complex-associated pre-60S ribosomal subunit. The family
           functions as a highly conserved exonuclease that is
           required for the 5'-end maturation of 5.8S and 25S
           rRNAs, demonstrating that 5'-end processing also has a
           redundant pathway. Nop25 binds late pre-60S ribosomes,
           accompanying them from the nucleolus to the nuclear
           periphery; and there is evidence for both physical and
           functional links between late 60S subunit processing and
           export.
          Length = 134

 Score = 31.9 bits (73), Expect = 0.22
 Identities = 24/83 (28%), Positives = 48/83 (57%), Gaps = 6/83 (7%)

Query: 13  KRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIE 72
           KR ++R +++ +EE +++E+E E +EE K  ++++K++ +K+ K++K+  K      E E
Sbjct: 29  KRKQQR-RKKAQEEAKEKERE-ERIEERKRIREERKQELEKQLKERKEALKLL----EEE 82

Query: 73  ESLKERKEKTLTEDSQCDTMIPV 95
              +E  E   TED + D     
Sbjct: 83  NDDEEDAETEDTEDVEDDEWEGF 105



 Score = 30.0 bits (68), Expect = 0.99
 Identities = 20/56 (35%), Positives = 28/56 (50%), Gaps = 18/56 (32%)

Query: 43 KKKKKKKKKK------------------KKKKKKKKKKKEEQSVKEIEESLKERKE 80
          KKKK KKK                    K+K++++KK +EE   KE EE ++ERK 
Sbjct: 1  KKKKAKKKGVEEVVFDKEARKEYLTGFHKRKQQRRKKAQEEAKEKEREERIEERKR 56



 Score = 27.3 bits (61), Expect = 7.0
 Identities = 19/76 (25%), Positives = 36/76 (47%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E+K+  E+R +E EK+  E++E  K   EE  +++  + +  +  +  + +   E     
Sbjct: 53  ERKRIREERKQELEKQLKERKEALKLLEEENDDEEDAETEDTEDVEDDEWEGFPEPTVTD 112

Query: 70  EIEESLKERKEKTLTE 85
             EE + E K KT   
Sbjct: 113 YEEEYIDEDKYKTTVT 128


>gnl|CDD|214360 CHL00094, dnaK, heat shock protein 70.
          Length = 621

 Score = 33.2 bits (76), Expect = 0.22
 Identities = 24/80 (30%), Positives = 45/80 (56%), Gaps = 9/80 (11%)

Query: 22  EEKEENEQEEKEKEDVEEEKNK------KKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
           +E E+N  E+KEK +  + KN+      + +K+ K+ K K  ++KK+K E  +K++ ++L
Sbjct: 512 KEAEKNAAEDKEKREKIDLKNQAESLCYQAEKQLKELKDKISEEKKEKIENLIKKLRQAL 571

Query: 76  KE---RKEKTLTEDSQCDTM 92
           +       K+L E+ Q   M
Sbjct: 572 QNDNYESIKSLLEELQKALM 591


>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 265

 Score = 32.8 bits (75), Expect = 0.24
 Identities = 17/62 (27%), Positives = 36/62 (58%), Gaps = 3/62 (4%)

Query: 20  KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERK 79
           ++E+K    + E     +EE ++K  + +K+  + K + KK +KE   + E++E++ ER+
Sbjct: 44  QKEKKNIQNEIESLDNQIEEIQSKIDELQKEIDQSKAEIKKLQKE---IAELKENIVERQ 100

Query: 80  EK 81
           E 
Sbjct: 101 EL 102



 Score = 28.9 bits (65), Expect = 3.5
 Identities = 13/72 (18%), Positives = 27/72 (37%), Gaps = 1/72 (1%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           EK+  +E + +     +NE E +    +  +K +K         K+     +K   +  K
Sbjct: 162 EKQAALEDKLETLVALQNELETQLN-SLNSQKAEKNALIAALAAKEASALGEKAALEEQK 220

Query: 70  EIEESLKERKEK 81
            + E+      K
Sbjct: 221 ALAEAAAAEAAK 232



 Score = 28.1 bits (63), Expect = 7.0
 Identities = 14/72 (19%), Positives = 28/72 (38%), Gaps = 1/72 (1%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E KK +E++ +   +++ E     + ++E + N    +K +K         K+      K
Sbjct: 155 EDKKSLEEK-QAALEDKLETLVALQNELETQLNSLNSQKAEKNALIAALAAKEASALGEK 213

Query: 70  EIEESLKERKEK 81
              E  K   E 
Sbjct: 214 AALEEQKALAEA 225



 Score = 27.8 bits (62), Expect = 8.7
 Identities = 13/75 (17%), Positives = 33/75 (44%), Gaps = 6/75 (8%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK------KKEE 65
           KK +E++ ++++  E +Q   E +       + + + +      +K +K         +E
Sbjct: 147 KKILEQQKEDKKSLEEKQAALEDKLETLVALQNELETQLNSLNSQKAEKNALIAALAAKE 206

Query: 66  QSVKEIEESLKERKE 80
            S    + +L+E+K 
Sbjct: 207 ASALGEKAALEEQKA 221


>gnl|CDD|237867 PRK14953, PRK14953, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 486

 Score = 32.9 bits (75), Expect = 0.24
 Identities = 22/88 (25%), Positives = 43/88 (48%), Gaps = 5/88 (5%)

Query: 8   AGEKKKRIEKRNKEEEKEEN-----EQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
            G+    I K  + +E+E       E+ E++  D+E +  KK     + ++ KK+K+K+K
Sbjct: 399 EGKIISAILKNAEIKEEEGKITIKVEKSEEDTLDLEIKSIKKYFPFIEFEEVKKEKEKEK 458

Query: 63  KEEQSVKEIEESLKERKEKTLTEDSQCD 90
           +  + V+++ E  K  K  T  +    D
Sbjct: 459 ERPEFVEKVLELFKGSKIITYKKKKGKD 486



 Score = 28.3 bits (63), Expect = 6.8
 Identities = 21/113 (18%), Positives = 46/113 (40%), Gaps = 17/113 (15%)

Query: 9   GEKKKRIEKRNKEEEKEENEQEEK-----------------EKEDVEEEKNKKKKKKKKK 51
           G K++  E+  K+EE++E    +K                 +  +++EE+ K   K +K 
Sbjct: 367 GSKQETKEQPEKKEEEKEELDIDKIILQIIKNEGKIISAILKNAEIKEEEGKITIKVEKS 426

Query: 52  KKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDTMIPVFKKGVLYKG 104
           ++     + K  ++       E +K+ KEK        + ++ +FK   +   
Sbjct: 427 EEDTLDLEIKSIKKYFPFIEFEEVKKEKEKEKERPEFVEKVLELFKGSKIITY 479


>gnl|CDD|225887 COG3351, FlaD, Putative archaeal flagellar protein D/E [Cell
          motility and secretion].
          Length = 214

 Score = 32.5 bits (74), Expect = 0.25
 Identities = 17/71 (23%), Positives = 38/71 (53%)

Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
          KK  +E++ ++ EK      E+ KE +EE   + KK   +  ++  ++ ++ KE   +++
Sbjct: 1  KKPYLEEKIEKAEKVTAFALEELKEKIEELPIQAKKSDDELVEELPERYEQTKENSLIEK 60

Query: 71 IEESLKERKEK 81
          ++   +E  EK
Sbjct: 61 VDSIEEEISEK 71



 Score = 30.2 bits (68), Expect = 1.1
 Identities = 22/84 (26%), Positives = 39/84 (46%), Gaps = 6/84 (7%)

Query: 9   GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKK------KKKKKKKKKKKK 62
            E K++IE+   + +K ++E  E+  E  E+ K     +K         +K+K   +K K
Sbjct: 21  EELKEKIEELPIQAKKSDDELVEELPERYEQTKENSLIEKVDSIEEEISEKEKVMSEKLK 80

Query: 63  KEEQSVKEIEESLKERKEKTLTED 86
           +  Q     EE  K+ K + + ED
Sbjct: 81  EPAQMSSTSEEEEKKAKLEEIPED 104



 Score = 27.9 bits (62), Expect = 7.0
 Identities = 16/72 (22%), Positives = 35/72 (48%), Gaps = 1/72 (1%)

Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
            +++IEK  K       E +EK +E   + K    +  ++  ++ ++ K+    E+ V 
Sbjct: 4  YLEEKIEKAEKVTAFALEELKEKIEELPIQAKKSDDELVEELPERYEQTKENSLIEK-VD 62

Query: 70 EIEESLKERKEK 81
           IEE + E+++ 
Sbjct: 63 SIEEEISEKEKV 74


>gnl|CDD|224308 COG1390, NtpE, Archaeal/vacuolar-type H+-ATPase subunit E [Energy
          production and conversion].
          Length = 194

 Score = 32.3 bits (74), Expect = 0.25
 Identities = 11/79 (13%), Positives = 41/79 (51%), Gaps = 8/79 (10%)

Query: 8  AGEKKKRI----EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
          A E+ + I     +  ++ ++E   + E+  E++  +  K+ ++++++       + ++K
Sbjct: 15 AEEEAEEILEEAREEAEKIKEEAKREAEEAIEEILRKAEKEAERERQRIISSALLEARRK 74

Query: 64 E----EQSVKEIEESLKER 78
               E+ ++ + E+++E+
Sbjct: 75 LLEAKEEILESVFEAVEEK 93



 Score = 32.0 bits (73), Expect = 0.28
 Identities = 21/75 (28%), Positives = 43/75 (57%)

Query: 9  GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
           E +K I+K  +E E+E  E  E+ +E+ E+ K + K++ ++  ++  +K +K+ E +  
Sbjct: 2  MELEKLIKKILREAEEEAEEILEEAREEAEKIKEEAKREAEEAIEEILRKAEKEAERERQ 61

Query: 69 KEIEESLKERKEKTL 83
          + I  +L E + K L
Sbjct: 62 RIISSALLEARRKLL 76



 Score = 28.9 bits (65), Expect = 3.2
 Identities = 9/51 (17%), Positives = 32/51 (62%)

Query: 35 EDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTE 85
          E + ++  ++ +++ ++  ++ +++ +K +E++ +E EE+++E   K   E
Sbjct: 5  EKLIKKILREAEEEAEEILEEAREEAEKIKEEAKREAEEAIEEILRKAEKE 55


>gnl|CDD|235638 PRK05896, PRK05896, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 605

 Score = 32.9 bits (75), Expect = 0.25
 Identities = 22/79 (27%), Positives = 33/79 (41%), Gaps = 2/79 (2%)

Query: 11  KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKK--KKKKKKKKKKKKKKKKEEQSV 68
           +KK I        K+ N  EE  K  V  +   K K    KKK  +  K++  KKEE   
Sbjct: 380 EKKTINNSQIITTKKINIVEESNKNSVHFDTLYKTKIFYHKKKINQNNKEQDIKKEELLE 439

Query: 69  KEIEESLKERKEKTLTEDS 87
           KE  +  ++  +     D+
Sbjct: 440 KEFVKKSEKIPKNDELLDN 458



 Score = 32.5 bits (74), Expect = 0.39
 Identities = 21/79 (26%), Positives = 31/79 (39%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E KK       E++   N Q    K+    E++ K         K K    KKK  Q+ K
Sbjct: 369 ENKKIEISIKPEKKTINNSQIITTKKINIVEESNKNSVHFDTLYKTKIFYHKKKINQNNK 428

Query: 70  EIEESLKERKEKTLTEDSQ 88
           E +   +E  EK   + S+
Sbjct: 429 EQDIKKEELLEKEFVKKSE 447


>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6.  RPL6
           contains KOW motif that has an extra ribosomal role as
           an oncogenic. KOW domain is known as an RNA-binding
           motif that is shared so far among some families of
           ribosomal proteins, the essential bacterial
           transcriptional elongation factor NusG, the eukaryotic
           chromatin elongation factor Spt5, the higher eukaryotic
           KIN17 proteins and Mtr4. .
          Length = 152

 Score = 31.7 bits (73), Expect = 0.26
 Identities = 21/55 (38%), Positives = 31/55 (56%), Gaps = 9/55 (16%)

Query: 44  KKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDTMIPVFKK 98
           K+KKKKKKKKK+ +  ++KK++  V E      ERKE     D+    ++   KK
Sbjct: 82  KRKKKKKKKKKEGEFFEEKKKKYVVSE------ERKEDQKAVDAA---LLAAIKK 127



 Score = 29.1 bits (66), Expect = 2.1
 Identities = 22/62 (35%), Positives = 31/62 (50%), Gaps = 11/62 (17%)

Query: 33  EKEDVEEEKNKKKKKKKKK-------KKKKKKKKKKKKEEQSVKEIEESLKE--RKEKTL 83
            K +    K KKKKKKKKK       KKKK    +++KE+Q  K ++ +L    +K   L
Sbjct: 74  PKLNDAYFKRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQ--KAVDAALLAAIKKVPLL 131

Query: 84  TE 85
             
Sbjct: 132 KG 133



 Score = 28.7 bits (65), Expect = 3.2
 Identities = 10/43 (23%), Positives = 26/43 (60%)

Query: 30  EEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIE 72
           ++K+K+  +E +  ++KKKK    +++K+ +K  +   +  I+
Sbjct: 84  KKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKAVDAALLAAIK 126


>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein.  This family includes proteins
           related to Mpp10 (M phase phosphoprotein 10). The U3
           small nucleolar ribonucleoprotein (snoRNP) is required
           for three cleavage events that generate the mature 18S
           rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
           depletion of Mpp10, a U3 snoRNP-specific protein, halts
           18S rRNA production and impairs cleavage at the three U3
           snoRNP-dependent sites.
          Length = 613

 Score = 33.0 bits (75), Expect = 0.26
 Identities = 11/43 (25%), Positives = 24/43 (55%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKK 58
           E   ++   +E E +E++K + EE   +K+  ++  ++KK  K
Sbjct: 145 ESSLEDLSDDETEDDEEKKMEEEEAGEEKESVEQATREKKFDK 187



 Score = 31.5 bits (71), Expect = 0.68
 Identities = 15/81 (18%), Positives = 40/81 (49%), Gaps = 6/81 (7%)

Query: 7   GAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE- 65
           G  E +   + + K+++K+  +  E E    ++E +K+  KK+   K +++ ++  ++E 
Sbjct: 250 GNIEYEDFFDPKEKDKKKDAGDDAELE----DDEPDKEAVKKEADSKPEEEDEEDDEQED 305

Query: 66  -QSVKEIEESLKERKEKTLTE 85
            Q  +E  E+  ++ +     
Sbjct: 306 DQDEEEPPEAAMDKVKLDEPV 326



 Score = 31.1 bits (70), Expect = 1.0
 Identities = 14/64 (21%), Positives = 33/64 (51%)

Query: 21  EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
           EE  ++ +  EK K +++     ++++KKK+K++  KK   K +++     E+     K+
Sbjct: 538 EEIYKKMKAIEKSKTELDRTDKNRERRKKKRKRRAAKKAVTKAKKERKIGKEKVDGVAKK 597

Query: 81  KTLT 84
            +  
Sbjct: 598 SSSA 601



 Score = 30.3 bits (68), Expect = 1.8
 Identities = 19/72 (26%), Positives = 37/72 (51%), Gaps = 3/72 (4%)

Query: 11  KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
            KK  + + +EE++E++EQE+ +  D EE       K K  +   +    +  +E S  E
Sbjct: 285 VKKEADSKPEEEDEEDDEQEDDQ--DEEEPPEAAMDKVKLDEPVLEGVDLESPKELSSFE 342

Query: 71  IEES-LKERKEK 81
             ++ LK++ E+
Sbjct: 343 KRQAKLKQQIEQ 354



 Score = 30.0 bits (67), Expect = 2.4
 Identities = 18/80 (22%), Positives = 42/80 (52%), Gaps = 4/80 (5%)

Query: 6   GGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
               E     E  +  E+  ++E E+ E++ +EEE+  ++K+  ++  ++KK  K   ++
Sbjct: 133 DDEDEADLFNESESSLEDLSDDETEDDEEKKMEEEEAGEEKESVEQATREKKFDKSGVDD 192

Query: 66  Q--SVKEIEESLK--ERKEK 81
           +   + E+ E L+  E +E+
Sbjct: 193 KFFKLDEMNEFLEATEAEEE 212



 Score = 29.6 bits (66), Expect = 2.7
 Identities = 16/79 (20%), Positives = 39/79 (49%), Gaps = 3/79 (3%)

Query: 8   AGEKKKRIEKRNKEEEKEENEQEEKEK---EDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
           A  K +  ++ + E+E +++E+E  E    +   +E   +    +  K+    +K++ K 
Sbjct: 289 ADSKPEEEDEEDDEQEDDQDEEEPPEAAMDKVKLDEPVLEGVDLESPKELSSFEKRQAKL 348

Query: 65  EQSVKEIEESLKERKEKTL 83
           +Q ++++E+     K  TL
Sbjct: 349 KQQIEQLEKENLAPKSWTL 367



 Score = 29.2 bits (65), Expect = 3.6
 Identities = 17/62 (27%), Positives = 32/62 (51%), Gaps = 6/62 (9%)

Query: 22  EEKEENEQEEKEKEDVEEEKNKKKKKKK------KKKKKKKKKKKKKKEEQSVKEIEESL 75
           E++E++E+   E ED  + K K KKK        +  +  K+  KK+ + +  +E EE  
Sbjct: 242 EDEEDDEEGNIEYEDFFDPKEKDKKKDAGDDAELEDDEPDKEAVKKEADSKPEEEDEEDD 301

Query: 76  KE 77
           ++
Sbjct: 302 EQ 303



 Score = 29.2 bits (65), Expect = 3.6
 Identities = 14/76 (18%), Positives = 35/76 (46%), Gaps = 1/76 (1%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E  +  ++  K+E   + E+E++E +D +E+   +++  +    K K  +   +      
Sbjct: 276 EDDEPDKEAVKKEADSKPEEEDEE-DDEQEDDQDEEEPPEAAMDKVKLDEPVLEGVDLES 334

Query: 70  EIEESLKERKEKTLTE 85
             E S  E+++  L +
Sbjct: 335 PKELSSFEKRQAKLKQ 350



 Score = 29.2 bits (65), Expect = 4.4
 Identities = 15/66 (22%), Positives = 33/66 (50%)

Query: 21  EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
           E E    +  + E ED EE+K ++++  ++K+  ++  ++KK ++  V +    L E  E
Sbjct: 143 ESESSLEDLSDDETEDDEEKKMEEEEAGEEKESVEQATREKKFDKSGVDDKFFKLDEMNE 202

Query: 81  KTLTED 86
                +
Sbjct: 203 FLEATE 208



 Score = 28.4 bits (63), Expect = 7.2
 Identities = 19/69 (27%), Positives = 30/69 (43%), Gaps = 7/69 (10%)

Query: 38  EEEKNKKKKKKKKK------KKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQC-D 90
           EE   K K  +K K       K ++++KKK+K   + K + ++ KERK      D     
Sbjct: 538 EEIYKKMKAIEKSKTELDRTDKNRERRKKKRKRRAAKKAVTKAKKERKIGKEKVDGVAKK 597

Query: 91  TMIPVFKKG 99
           +    F K 
Sbjct: 598 SSSAFFAKL 606



 Score = 28.0 bits (62), Expect = 8.8
 Identities = 19/73 (26%), Positives = 34/73 (46%), Gaps = 1/73 (1%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           +K K IEK   E ++ +  +E ++K+       K   K KK++K  K+K     ++ S  
Sbjct: 542 KKMKAIEKSKTELDRTDKNRERRKKKRKRRAAKKAVTKAKKERKIGKEKVDGVAKKSSSA 601

Query: 70  EIEESLKERKEKT 82
              + L+E K   
Sbjct: 602 FFAK-LQETKAVE 613


>gnl|CDD|215618 PLN03184, PLN03184, chloroplast Hsp70; Provisional.
          Length = 673

 Score = 32.9 bits (75), Expect = 0.27
 Identities = 21/68 (30%), Positives = 36/68 (52%), Gaps = 3/68 (4%)

Query: 22  EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE-IEESLKERKE 80
           +E E+  +E+KEK D  + KN+      + +K+ K+   K   +  VKE +E  LKE K+
Sbjct: 549 QEAEKFAKEDKEKRDAVDTKNQADSVVYQTEKQLKELGDKVPAD--VKEKVEAKLKELKD 606

Query: 81  KTLTEDSQ 88
              +  +Q
Sbjct: 607 AIASGSTQ 614


>gnl|CDD|218611 pfam05499, DMAP1, DNA methyltransferase 1-associated protein 1
          (DMAP1).  DNA methylation can contribute to
          transcriptional silencing through several
          transcriptionally repressive complexes, which include
          methyl-CpG binding domain proteins (MBDs) and histone
          deacetylases (HDACs). The chief enzyme that maintains
          mammalian DNA methylation, DNMT1, can also establish a
          repressive transcription complex. The non-catalytic
          amino terminus of DNMT1 binds to HDAC2 and DMAP1 (for
          DNMT1 associated protein), and can mediate
          transcriptional repression. DMAP1 has intrinsic
          transcription repressive activity, and binds to the
          transcriptional co-repressor TSG101. DMAP1 is targeted
          to replication foci through interaction with the far N
          terminus of DNMT1 throughout S phase, whereas HDAC2
          joins DNMT1 and DMAP1 only during late S phase,
          providing a platform for how histones may become
          deacetylated in heterochromatin following replication.
          Length = 175

 Score = 32.1 bits (73), Expect = 0.27
 Identities = 18/65 (27%), Positives = 30/65 (46%), Gaps = 7/65 (10%)

Query: 13 KRIEKRNKEEEKEE-------NEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
          ++IE R KE EK+           +++       E  K++KK  KKK  +K    K+  +
Sbjct: 10 RKIEARKKEREKKAQDLQKLITAADQQAAGFSTAEARKREKKLPKKKIPQKPGPSKEDSK 69

Query: 66 QSVKE 70
           +V E
Sbjct: 70 VNVPE 74


>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6.  In yeast, 15 Apg proteins
           coordinate the formation of autophagosomes. Autophagy is
           a bulk degradation process induced by starvation in
           eukaryotic cells. Apg6/Vps30p has two distinct functions
           in the autophagic process, either associated with the
           membrane or in a retrieval step of the carboxypeptidase
           Y sorting pathway.
          Length = 356

 Score = 32.5 bits (74), Expect = 0.27
 Identities = 14/73 (19%), Positives = 36/73 (49%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
           E+   ++   + E +  E  + E   ++  + KK++++   + ++ +KE+  +      L
Sbjct: 44  ERDTYKQYLSKLESQNVEISNYEALDSELDELKKEEERLLDELEELEKEDDDLDGELVEL 103

Query: 76  KERKEKTLTEDSQ 88
           +E KE+   E+ Q
Sbjct: 104 QEEKEQLENEELQ 116


>gnl|CDD|234055 TIGR02907, spore_VI_D, stage VI sporulation protein D.  SpoVID, the
           stage VI sporulation protein D, is restricted to
           endospore-forming members of the bacteria, all of which
           are found among the Firmicutes. It is widely distributed
           but not quite universal in this group. Between
           well-conserved N-terminal and C-terminal domains is a
           poorly conserved, low-complexity region of variable
           length, rich enough in glutamic acid to cause spurious
           BLAST search results unless a filter is used. The seed
           alignment for this model was trimmed, in effect, by
           choosing member sequences in which these regions are
           relatively short. SpoVID is involved in spore coat
           assembly by the mother cell compartment late in the
           process of sporulation [Cellular processes, Sporulation
           and germination].
          Length = 338

 Score = 32.5 bits (74), Expect = 0.27
 Identities = 18/68 (26%), Positives = 27/68 (39%), Gaps = 7/68 (10%)

Query: 19  NKEEEKEENEQEEKEKE---DVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
             E   EE   EE E E   +VE+E+    ++ +  +++       KK      E EE  
Sbjct: 213 EVEASPEEENYEEYEDETELEVEDEEKALDEQTEDPQQEDALAGDAKKAL----EEEEEK 268

Query: 76  KERKEKTL 83
            ER E   
Sbjct: 269 GERPENAT 276



 Score = 32.5 bits (74), Expect = 0.33
 Identities = 12/58 (20%), Positives = 29/58 (50%)

Query: 8   AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
           A  +++  E+   E E E  ++E+   E  E+ + +       KK  +++++K ++ E
Sbjct: 216 ASPEEENYEEYEDETELEVEDEEKALDEQTEDPQQEDALAGDAKKALEEEEEKGERPE 273



 Score = 29.1 bits (65), Expect = 3.7
 Identities = 18/61 (29%), Positives = 30/61 (49%), Gaps = 1/61 (1%)

Query: 14  RIEKRNKEEEKEENEQE-EKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIE 72
            +E   +EE  EE E E E E ED E+  +++ +  +++       KK  +EE+   E  
Sbjct: 213 EVEASPEEENYEEYEDETELEVEDEEKALDEQTEDPQQEDALAGDAKKALEEEEEKGERP 272

Query: 73  E 73
           E
Sbjct: 273 E 273


>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain.  Axonemal
           dynein light chain proteins play a dynamic role in
           flagellar and cilia motility. Eukaryotic cilia and
           flagella are complex organelles consisting of a core
           structure, the axoneme, which is composed of nine
           microtubule doublets forming a cylinder that surrounds a
           pair of central singlet microtubules. This
           ultra-structural arrangement seems to be one of the most
           stable micro-tubular assemblies known and is responsible
           for the flagellar and ciliary movement of a large number
           of organisms ranging from protozoan to mammals. This
           light chain interacts directly with the N-terminal half
           of the heavy chains.
          Length = 189

 Score = 32.2 bits (74), Expect = 0.27
 Identities = 16/66 (24%), Positives = 36/66 (54%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E ++ I+K  +E+E+ E    E E +    EK ++++++ ++K+   +    KK+ Q +K
Sbjct: 124 ELEQEIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHADEIAFLKKQNQQLK 183

Query: 70  EIEESL 75
              E +
Sbjct: 184 SQLEQI 189



 Score = 30.3 bits (69), Expect = 0.95
 Identities = 14/53 (26%), Positives = 31/53 (58%), Gaps = 2/53 (3%)

Query: 29  QEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
           Q E+ K ++E+E   KK +++K++ +K+  + + K E   K  EE  +  +++
Sbjct: 117 QAEQGKSELEQEI--KKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKR 167


>gnl|CDD|225324 COG2604, COG2604, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 594

 Score = 32.7 bits (75), Expect = 0.27
 Identities = 13/57 (22%), Positives = 22/57 (38%), Gaps = 1/57 (1%)

Query: 31  EKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDS 87
           +K+K     +        ++K   K  +K KK  +   KE E    + +E     DS
Sbjct: 453 DKDKNKPFIKLILLSNNSQEKAVLKLLQKIKKNNDFI-KEFETDALKLQEILEKVDS 508


>gnl|CDD|235370 PRK05244, PRK05244, Der GTPase activator; Provisional.
          Length = 177

 Score = 31.8 bits (73), Expect = 0.28
 Identities = 19/63 (30%), Positives = 27/63 (42%), Gaps = 17/63 (26%)

Query: 10 EKKKRIEK------RNKEEEKEENEQEEKEKED-----------VEEEKNKKKKKKKKKK 52
           K+K+  K      R+ E   +   + + +K+D             EEK K KKKK K K
Sbjct: 31 RKRKKKHKGLKSGSRHNEGNTQSKGKGQAQKKDPRIGSKKPIPLGVEEKVKPKKKKPKSK 90

Query: 53 KKK 55
          K K
Sbjct: 91 KPK 93



 Score = 29.9 bits (68), Expect = 1.2
 Identities = 15/39 (38%), Positives = 21/39 (53%), Gaps = 1/39 (2%)

Query: 43 KKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
          KKKK   K+ K   K KKK +EE   +  E   +++K K
Sbjct: 1  KKKKSSPKRSKGMAKSKKKTREELDAEARERK-RKKKHK 38



 Score = 27.2 bits (61), Expect = 9.4
 Identities = 22/108 (20%), Positives = 42/108 (38%), Gaps = 37/108 (34%)

Query: 1  MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKK----------- 49
           +K+      K      ++K++ +EE + E +E         K+KKK K           
Sbjct: 1  KKKKSSPKRSKGMA---KSKKKTREELDAEARE--------RKRKKKHKGLKSGSRHNEG 49

Query: 50 --KKKKKKKKKKK------KKK-----EEQSVKEIEESLKERKEKTLT 84
            + K K + +KK      KK      EE+   + ++   + K+  L+
Sbjct: 50 NTQSKGKGQAQKKDPRIGSKKPIPLGVEEKVKPKKKK--PKSKKPKLS 95


>gnl|CDD|220172 pfam09309, FCP1_C, FCP1, C-terminal.  The C-terminal domain of
           FCP-1 is required for interaction with the carboxy
           terminal domain of RAP74. Interaction relies extensively
           on van der Waals contacts between hydrophobic residues
           situated within alpha-helices in both domains.
          Length = 263

 Score = 32.3 bits (73), Expect = 0.29
 Identities = 25/127 (19%), Positives = 49/127 (38%), Gaps = 36/127 (28%)

Query: 2   RKRRGGAGEK-------KKRIEKRNKE------EEKEENEQEEKEKEDVEEEKNKKKKKK 48
           RKR+    E        K+ +E  +KE      E  ++++ E+K+    EEE+ +  + +
Sbjct: 125 RKRQPSMSETMPLYTLCKEDLESMDKEVDDILGEGSDDSDSEKKKPPGNEEEQEQAPQPR 184

Query: 49  KKKKKKKKK-----------------------KKKKKKEEQSVKEIEESLKERKEKTLTE 85
           K++    ++                       K+K  +E++   E E S +   E     
Sbjct: 185 KQQPPGPRREPDEGTPSSSERSNIGGRGPRGHKRKLNEEDEEDAESESSFESSNEDEEGS 244

Query: 86  DSQCDTM 92
            S+ D M
Sbjct: 245 SSEADEM 251


>gnl|CDD|220624 pfam10187, Nefa_Nip30_N, N-terminal domain of NEFA-interacting
          nuclear protein NIP30.  This is a the N-terminal 100
          amino acids of a family of proteins conserved from
          plants to humans. The full-length protein has
          putatively been called NEFA-interacting nuclear protein
          NIP30, however no reference could be found to confirm
          this.
          Length = 99

 Score = 30.8 bits (70), Expect = 0.29
 Identities = 24/90 (26%), Positives = 41/90 (45%), Gaps = 28/90 (31%)

Query: 10 EKKKRIEKRNKEEE-KEENE------------QEEKEKEDVE-EEKNKKKKK-------- 47
           +K+R E+     + K E E            QE K+K+  E EEK K K +        
Sbjct: 10 ARKRRQEEVRAPRDPKAEPEEEYDGRSLYERLQENKDKKQEEFEEKFKLKNQFRGLDEDE 69

Query: 48 ----KKKKKKKKKKKKKKKKEEQSVKEIEE 73
               + ++ ++  +KK+K+EE   +E+EE
Sbjct: 70 VEFLDEVEESRRAAEKKRKREEA--EELEE 97


>gnl|CDD|235219 PRK04098, PRK04098, sec-independent translocase; Provisional.
          Length = 158

 Score = 31.6 bits (72), Expect = 0.30
 Identities = 9/69 (13%), Positives = 27/69 (39%)

Query: 13  KRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIE 72
           K   +   +  ++  +  +K  E+     +  ++   + K  ++    +  +E  +    
Sbjct: 90  KITAENEIKSIQDLLQDYKKSLEEDTIPNHLNEEVSNETKLTQESSSDESPKEVKLATKN 149

Query: 73  ESLKERKEK 81
           ++ K  KEK
Sbjct: 150 KTKKHDKEK 158



 Score = 31.2 bits (71), Expect = 0.43
 Identities = 12/62 (19%), Positives = 25/62 (40%)

Query: 9   GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
             K  +   ++ ++  EE+       E+V  E    ++    +  K+ K   K K ++  
Sbjct: 96  EIKSIQDLLQDYKKSLEEDTIPNHLNEEVSNETKLTQESSSDESPKEVKLATKNKTKKHD 155

Query: 69  KE 70
           KE
Sbjct: 156 KE 157



 Score = 28.9 bits (65), Expect = 2.2
 Identities = 14/64 (21%), Positives = 26/64 (40%), Gaps = 2/64 (3%)

Query: 21  EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI--EESLKER 78
           EE  +     E E + +++     KK  ++        ++   E +  +E   +ES KE 
Sbjct: 84  EELDDLKITAENEIKSIQDLLQDYKKSLEEDTIPNHLNEEVSNETKLTQESSSDESPKEV 143

Query: 79  KEKT 82
           K  T
Sbjct: 144 KLAT 147



 Score = 27.8 bits (62), Expect = 6.7
 Identities = 12/47 (25%), Positives = 19/47 (40%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKK 56
           E+       N+E   E    +E   ++  +E     K K KK  K+K
Sbjct: 112 EEDTIPNHLNEEVSNETKLTQESSSDESPKEVKLATKNKTKKHDKEK 158



 Score = 27.8 bits (62), Expect = 6.8
 Identities = 13/51 (25%), Positives = 18/51 (35%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
           KK +E+        E    E +           K+ K   K K KK  K+K
Sbjct: 108 KKSLEEDTIPNHLNEEVSNETKLTQESSSDESPKEVKLATKNKTKKHDKEK 158


>gnl|CDD|219947 pfam08639, SLD3, DNA replication regulator SLD3.  The SLD3 DNA
           replication regulator is required for loading and
           maintenance of Cdc45 on chromatin during DNA
           replication.
          Length = 437

 Score = 32.9 bits (75), Expect = 0.30
 Identities = 11/44 (25%), Positives = 27/44 (61%)

Query: 21  EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
           E ++    ++E +K+ ++  +N  + K++K++K+KK KK+    
Sbjct: 119 ELDRFTKFEKEYKKKLLKRSQNLDRSKRRKRRKRKKNKKQDLPV 162



 Score = 30.9 bits (70), Expect = 1.1
 Identities = 12/44 (27%), Positives = 24/44 (54%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKK 59
             R  + EKE  ++  K  ++++  K +K++K+KK KK+     
Sbjct: 120 LDRFTKFEKEYKKKLLKRSQNLDRSKRRKRRKRKKNKKQDLPVL 163



 Score = 29.4 bits (66), Expect = 3.0
 Identities = 12/45 (26%), Positives = 26/45 (57%), Gaps = 4/45 (8%)

Query: 29  QEEKEKEDVEEEKNKKKKKK----KKKKKKKKKKKKKKKEEQSVK 69
           + ++  +  +E K K  K+     + K++K++K+KK KK++  V 
Sbjct: 119 ELDRFTKFEKEYKKKLLKRSQNLDRSKRRKRRKRKKNKKQDLPVL 163



 Score = 29.4 bits (66), Expect = 3.4
 Identities = 11/45 (24%), Positives = 25/45 (55%), Gaps = 1/45 (2%)

Query: 23  EKEENEQEEKEKEDVEEEKNKK-KKKKKKKKKKKKKKKKKKKEEQ 66
           E +   + EKE +    ++++   + K++K++K+KK KK+     
Sbjct: 119 ELDRFTKFEKEYKKKLLKRSQNLDRSKRRKRRKRKKNKKQDLPVL 163



 Score = 28.6 bits (64), Expect = 5.4
 Identities = 13/52 (25%), Positives = 27/52 (51%), Gaps = 3/52 (5%)

Query: 30  EEKEKEDVEEEKNKKKKKKK---KKKKKKKKKKKKKKEEQSVKEIEESLKER 78
           E       E+E  KK  K+     + K++K++K+KK ++Q +  + +   +R
Sbjct: 119 ELDRFTKFEKEYKKKLLKRSQNLDRSKRRKRRKRKKNKKQDLPVLLDLHLDR 170


>gnl|CDD|237622 PRK14140, PRK14140, heat shock protein GrpE; Provisional.
          Length = 191

 Score = 31.9 bits (73), Expect = 0.30
 Identities = 21/74 (28%), Positives = 37/74 (50%), Gaps = 15/74 (20%)

Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
          EK +++E+  +E E EE  ++E E+E VEEE   +   +++ K               + 
Sbjct: 3  EKNEQVEEEVEETEVEEAVEDEVEEETVEEESEAELLDEEQAK---------------IA 47

Query: 70 EIEESLKERKEKTL 83
          E+E  L E +E+ L
Sbjct: 48 ELEAKLDELEERYL 61


>gnl|CDD|218738 pfam05766, NinG, Bacteriophage Lambda NinG protein.  NinG or Rap
          is involved in recombination. Rap (recombination adept
          with plasmid) increases lambda-by-plasmid recombination
          catalyzed by Escherichia coli's RecBCD pathway.
          Length = 188

 Score = 31.9 bits (73), Expect = 0.30
 Identities = 11/35 (31%), Positives = 22/35 (62%)

Query: 46 KKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
            K++K ++KK+K + + E + +K  +E LK R +
Sbjct: 33 ALKREKAQEKKRKAEAQAERRELKARKEKLKTRSD 67


>gnl|CDD|173966 cd08047, TAF7, TATA Binding Protein (TBP) Associated Factor 7
           (TAF7) is one of several TAFs that bind TBP and is
           involved in forming Transcription Factor IID (TFIID)
           complex.  The TATA Binding Protein (TBP) Associated
           Factor 7 (TAF7) is one of several TAFs that bind TBP and
           are involved in forming the Transcription Factor IID
           (TFIID) complex. TFIID is one of seven General
           Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
           TFIIF, and TFIID) that are involved in accurate
           initiation of transcription by RNA polymerase II in
           eukaryotes. TFIID plays an important role in the
           recognition of promoter DNA and assembly of the
           preinitiation complex. TFIID complex is composed of the
           TBP and at least 13 TAFs. TAFs are named after their
           electrophoretic mobility in polyacrylamide gels in
           different species. A new, unified nomenclature has been
           suggested for the pol II TAFs to show the relationship
           between TAF orthologs and paralogs. Several hypotheses
           are proposed for TAFs functions such as serving as
           activator-binding sites, core-promoter recognition or a
           role in essential catalytic activity. Each TAF, with the
           help of a specific activator, is required only for
           expression of subset of genes and is not universally
           involved for transcription as are GTFs. TAF7 is involved
           in the regulation of the transition from PIC assembly to
           initiation and elongation. In yeast and human cells,
           TAFs have been found as components of other complexes
           besides TFIID. Several TAFs interact via histone-fold
           (HFD) motifs; the HFD is the interaction motif involved
           in heterodimerization of the core histones and their
           assembly into nucleosome octamers.
          Length = 162

 Score = 31.9 bits (73), Expect = 0.30
 Identities = 17/61 (27%), Positives = 31/61 (50%), Gaps = 11/61 (18%)

Query: 28  EQEEKEKEDVEEEKNKKKKKKK-----------KKKKKKKKKKKKKKEEQSVKEIEESLK 76
           E ++ EKE  E + +KK K KK           K  +K++ +K   K+   ++E+E+ +K
Sbjct: 86  EPDDSEKEAKEAKLDKKDKPKKFEYPHGLTPPMKNVRKRRFRKTPSKKIAEIEEVEKEVK 145

Query: 77  E 77
            
Sbjct: 146 R 146



 Score = 27.2 bits (61), Expect = 8.5
 Identities = 13/74 (17%), Positives = 35/74 (47%), Gaps = 7/74 (9%)

Query: 15  IEKRNKEEEKEENEQEEKEKEDVEEEKNK-------KKKKKKKKKKKKKKKKKKKKEEQS 67
            E  + E+E +E + ++K+K    E  +         +K++ +K   KK  + ++ E++ 
Sbjct: 85  YEPDDSEKEAKEAKLDKKDKPKKFEYPHGLTPPMKNVRKRRFRKTPSKKIAEIEEVEKEV 144

Query: 68  VKEIEESLKERKEK 81
            + ++E  +    +
Sbjct: 145 KRLLKEDTEAVSVE 158


>gnl|CDD|227238 COG4901, COG4901, Ribosomal protein S25 [Translation, ribosomal
          structure and biogenesis].
          Length = 107

 Score = 31.0 bits (70), Expect = 0.31
 Identities = 19/51 (37%), Positives = 27/51 (52%), Gaps = 1/51 (1%)

Query: 32 KEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE-QSVKEIEESLKERKEK 81
            K  + +EK  +K K    K KKK  KKKKKEE +    ++E L ++  K
Sbjct: 2  APKSQLSKEKKAEKAKAGTAKDKKKWSKKKKKEEARRAVTVDEELLDKIRK 52



 Score = 30.3 bits (68), Expect = 0.49
 Identities = 15/48 (31%), Positives = 23/48 (47%)

Query: 38 EEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTE 85
          +  K KK +K K    K KKK  KKK+++  +      +E  +K   E
Sbjct: 6  QLSKEKKAEKAKAGTAKDKKKWSKKKKKEEARRAVTVDEELLDKIRKE 53


>gnl|CDD|234547 TIGR04330, cas_Cpf1, CRISPR-associated protein Cpf1, subtype
           PREFRAN.  This family is the long protein of a novel
           CRISPR subtype, PREFRAN, which is most common in
           Prevotella and Francisella, although widely distributed.
           The PREFRAN type has Cas1, Cas2, and Cas4, but lacks the
           helicase Cas3 and endonuclease Cas3-HD.
          Length = 1287

 Score = 32.9 bits (75), Expect = 0.31
 Identities = 19/53 (35%), Positives = 30/53 (56%), Gaps = 1/53 (1%)

Query: 30  EEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE-IEESLKERKEK 81
           EE  K+ +E  K K KK+K ++K KK  KK+     Q + + +E+ +KE   K
Sbjct: 378 EEYIKKQIEPRKGKSKKEKYEEKVKKVVKKQDYLSIQFINDLLEKYIKEFDSK 430


>gnl|CDD|232925 TIGR00335, primase_sml, DNA primase, eukaryotic-type, small
           subunit, putative.  Archaeal members differ
           substantially from eukaryotic members and should be
           considered putative pending experimental evidence. The
           protein is universal and single copy among completed
           archaeal and eukarotic genomes to date. DNA primase
           creates RNA primers needed for DNA replication.This
           model is named putative because the assignment is
           putative for archaeal proteins. Eukaryotic proteins
           scoring above the trusted cutoff can be considered
           authentic [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 297

 Score = 32.2 bits (73), Expect = 0.32
 Identities = 13/62 (20%), Positives = 25/62 (40%)

Query: 38  EEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDTMIPVFK 97
            EE+    K K      +K K  K+   +S K+   +L+   +K + +D      I + +
Sbjct: 201 LEEEILTSKLKILPNDLRKWKLIKEVIFKSEKKDYSALEIYIDKIVLDDKVTLDRIRLLR 260

Query: 98  KG 99
             
Sbjct: 261 HP 262


>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572).  Family of
           eukaryotic proteins with undetermined function.
          Length = 321

 Score = 32.4 bits (74), Expect = 0.32
 Identities = 18/63 (28%), Positives = 36/63 (57%), Gaps = 3/63 (4%)

Query: 18  RNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKE 77
           RN E +K + EQEE+ +++ EEE        KK + +    K++ +  + ++E++E L+ 
Sbjct: 106 RNYEADKLDEEQEERVEKEREEE--LAGDAMKKLENRTADSKREMEVLERLEELKE-LQS 162

Query: 78  RKE 80
           R+ 
Sbjct: 163 RRA 165



 Score = 30.1 bits (68), Expect = 1.9
 Identities = 27/116 (23%), Positives = 38/116 (32%), Gaps = 37/116 (31%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKE-------------------KEDVEEEKNKKKKKKKK 50
           E   R EK+ +EEE+EE+E   K                    ++D E+  N    K   
Sbjct: 174 EALFRREKKEEEEEEEEDEALIKSLSFGPETEEDRRRADDEDSEDDEEDNDNTPSPKSGS 233

Query: 51  KKKKK-----KKKKKKKKEEQSVKEIE-------------ESLKERKEKTLTEDSQ 88
               K     KK   K+ E  S  + +              SL  RK+      SQ
Sbjct: 234 SSPAKPTSILKKSAAKRSEAPSSSKAKKNSRGIPKPRDALSSLVVRKKAAPESTSQ 289


>gnl|CDD|216581 pfam01577, Peptidase_S30, Potyvirus P1 protease.  The potyviridae
           family positive stand RNA viruses with genome encoding a
           polyprotein. members include zucchini yellow mosaic
           virus, and turnip mosaic viruses which cause
           considerable losses of crops worldwide. This family
           consists of a C terminus region from various plant
           potyvirus P1 proteins (found at the N terminus of the
           polyprotein). The C terminus of P1 is a serine-type
           protease responsible for autocatalytic cleavage between
           P1 and the helper component protease pfam00851. The
           entire P1 protein may be involved in virus-host
           interactions.
          Length = 245

 Score = 32.3 bits (74), Expect = 0.32
 Identities = 24/133 (18%), Positives = 56/133 (42%), Gaps = 14/133 (10%)

Query: 20  KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERK 79
             E K       KE   +++EK  +   +K    +  KK++K K E+            +
Sbjct: 2   DLEAKVAERLLRKEMSKIKQEKKGRIILRKLSPAQVAKKREKLKREE----------REE 51

Query: 80  EKTLTEDSQCDTMIPVFKKGVLYKGIYCPSLTNSFREPSLEMSYQRYSHRQRQKSLIIVN 139
            + L    Q      V K   +       + + SFR P  + + ++   ++++K +++ +
Sbjct: 52  RQFL----QGAYASIVSKITPIGTDKVSKTESVSFRTPYYKRTTKKMKKKKKKKKVVMSD 107

Query: 140 LVDMVLKAVLTVA 152
            ++ +++ VL +A
Sbjct: 108 KINYLIRQVLKIA 120



 Score = 31.9 bits (73), Expect = 0.43
 Identities = 25/95 (26%), Positives = 38/95 (40%), Gaps = 14/95 (14%)

Query: 1   MRKRRGGAGEKKKRIEKRNKEEEKE--ENEQEEKE------------KEDVEEEKNKKKK 46
           +RK       KK+   KR + EE++  +                   K +    +    K
Sbjct: 29  LRKLSPAQVAKKREKLKREEREERQFLQGAYASIVSKITPIGTDKVSKTESVSFRTPYYK 88

Query: 47  KKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
           +  KK KKKKKKKK    ++    I + LK  K+K
Sbjct: 89  RTTKKMKKKKKKKKVVMSDKINYLIRQVLKIAKKK 123


>gnl|CDD|192718 pfam11176, DUF2962, Protein of unknown function (DUF2962).  This
          eukaryotic family of proteins has no known function.
          Length = 155

 Score = 31.5 bits (72), Expect = 0.33
 Identities = 11/48 (22%), Positives = 24/48 (50%)

Query: 43 KKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCD 90
             +  K +KKK ++ +K+  E   +   +E +KE +++  T +   D
Sbjct: 26 ATLRDDKLEKKKAERAEKRSLELDRLFFFQEQIKEDEKEPFTLEEIKD 73


>gnl|CDD|223587 COG0513, SrmB, Superfamily II DNA and RNA helicases [DNA
           replication, recombination, and repair / Transcription /
           Translation, ribosomal structure and biogenesis].
          Length = 513

 Score = 32.5 bits (74), Expect = 0.34
 Identities = 14/67 (20%), Positives = 27/67 (40%)

Query: 13  KRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIE 72
           KRIEKR + +          E ED +  K  +   +++     + KK K  ++  ++ + 
Sbjct: 387 KRIEKRLERKLPSAVLLPLDEPEDAKLLKTTRPGLEEESDISDEIKKLKSSKKALLRGLG 446

Query: 73  ESLKERK 79
                 K
Sbjct: 447 VRFTLSK 453


>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
           This family represents Cwf15/Cwc15 (from
           Schizosaccharomyces pombe and Saccharomyces cerevisiae
           respectively) and their homologues. The function of
           these proteins is unknown, but they form part of the
           spliceosome and are thus thought to be involved in mRNA
           splicing.
          Length = 241

 Score = 32.0 bits (73), Expect = 0.35
 Identities = 19/90 (21%), Positives = 41/90 (45%), Gaps = 26/90 (28%)

Query: 10  EKKKRIEK--------RNKEEEKEENEQEEKEKEDVEEEK-------NKKKKKKKKKKKK 54
            K+KRIE+         +      ++  ++ + +D E+E         K KK++ ++K++
Sbjct: 105 IKRKRIEEDARNSDADDSDSSSDSDSSDDDSDDDDSEDETAALLRELEKIKKERAEEKER 164

Query: 55  KKKKKKKKKEEQSVKEIEESLKERKEKTLT 84
           ++++K             E  K R+E+ LT
Sbjct: 165 EEEEKA-----------AEEEKAREEEILT 183



 Score = 27.8 bits (62), Expect = 8.8
 Identities = 14/82 (17%), Positives = 34/82 (41%), Gaps = 5/82 (6%)

Query: 1   MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKK-----KKK 55
            R+   G  ++ ++ + R + EE E   + +KE +   E+ +K        +      ++
Sbjct: 43  YRQPGQGTEDELRKRDLRAELEEAERAHKSKKENKLAIEDADKSTNLDASNEGDEDDDEE 102

Query: 56  KKKKKKKKEEQSVKEIEESLKE 77
            + K+K+ EE +     +    
Sbjct: 103 DEIKRKRIEEDARNSDADDSDS 124


>gnl|CDD|205206 pfam13025, DUF3886, Protein of unknown function (DUF3886).  This
          family of proteins is functionally uncharacterized.
          This family of proteins is found in bacteria. Proteins
          in this family are approximately 90 amino acids in
          length. There are two completely conserved L residues
          that may be functionally important.
          Length = 70

 Score = 29.6 bits (67), Expect = 0.35
 Identities = 15/33 (45%), Positives = 22/33 (66%)

Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNK 43
          K K+ E + +EE++EE E+  K +E  E EKNK
Sbjct: 23 KAKKKELKAEEEKREEEEEARKREERKEREKNK 55



 Score = 25.8 bits (57), Expect = 9.6
 Identities = 11/38 (28%), Positives = 27/38 (71%)

Query: 40 EKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKE 77
          +K  K +++K++++++ +K++++KE +  K  EE L E
Sbjct: 26 KKELKAEEEKREEEEEARKREERKEREKNKSFEELLNE 63


>gnl|CDD|177433 PHA02608, 67, prohead core protein; Provisional.
          Length = 80

 Score = 30.1 bits (68), Expect = 0.36
 Identities = 8/33 (24%), Positives = 13/33 (39%)

Query: 28 EQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
          E EE E +D +E+ +    K  K        + 
Sbjct: 46 EGEEPEDDDDDEDDDDDDDKDDKDDDDDDDDED 78



 Score = 29.0 bits (65), Expect = 0.76
 Identities = 5/31 (16%), Positives = 12/31 (38%)

Query: 31 EKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
          E+ ++D ++E +     K  K        + 
Sbjct: 48 EEPEDDDDDEDDDDDDDKDDKDDDDDDDDED 78



 Score = 28.2 bits (63), Expect = 1.8
 Identities = 11/52 (21%), Positives = 19/52 (36%)

Query: 14 RIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
          R E   +EE+ E       E E+ E++ + +       K  K        E+
Sbjct: 27 RTEALIEEEKVEIARSVMIEGEEPEDDDDDEDDDDDDDKDDKDDDDDDDDED 78


>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
           and chromosome partitioning].
          Length = 420

 Score = 32.4 bits (74), Expect = 0.36
 Identities = 9/61 (14%), Positives = 26/61 (42%), Gaps = 2/61 (3%)

Query: 22  EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
             + E   E+ E   +  E+  + ++ K  +  +++KK   +    +   ++ L+E +  
Sbjct: 175 AVRAEIAAEQAELTTLLSEQ--RAQQAKLAQLLEERKKTLAQLNSELSADQKKLEELRAN 232

Query: 82  T 82
            
Sbjct: 233 E 233



 Score = 31.6 bits (72), Expect = 0.60
 Identities = 10/61 (16%), Positives = 25/61 (40%), Gaps = 3/61 (4%)

Query: 20  KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI---EESLK 76
            E   E+ E      E   ++    +  +++KK   +   +   +++ ++E+   E  LK
Sbjct: 178 AEIAAEQAELTTLLSEQRAQQAKLAQLLEERKKTLAQLNSELSADQKKLEELRANESRLK 237

Query: 77  E 77
            
Sbjct: 238 N 238



 Score = 30.8 bits (70), Expect = 1.0
 Identities = 12/73 (16%), Positives = 29/73 (39%), Gaps = 2/73 (2%)

Query: 8   AGEKKKRIEKRNKEEEK--EENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
           A  + +   ++ +      E+  Q+ K  + +EE K    +   +    +KK ++ +  E
Sbjct: 174 AAVRAEIAAEQAELTTLLSEQRAQQAKLAQLLEERKKTLAQLNSELSADQKKLEELRANE 233

Query: 66  QSVKEIEESLKER 78
             +K    S +  
Sbjct: 234 SRLKNEIASAEAA 246


>gnl|CDD|220297 pfam09581, Spore_III_AF, Stage III sporulation protein AF
           (Spore_III_AF).  This family represents the stage III
           sporulation protein AF (Spore_III_AF) of the bacterial
           endospore formation program, which exists in some but
           not all members of the Firmicutes (formerly called
           low-GC Gram-positives). The C-terminal region of these
           proteins is poorly conserved.
          Length = 185

 Score = 31.5 bits (72), Expect = 0.36
 Identities = 23/80 (28%), Positives = 38/80 (47%), Gaps = 14/80 (17%)

Query: 12  KKRIEKRNKEEEK--------EENEQEEKEKEDVEE------EKNKKKKKKKKKKKKKKK 57
           +K++EK+ KEE          E +E  E    D++E      E++K+K+K K +      
Sbjct: 89  EKQVEKKLKEEYGVKVKDVEVEIDEDLESNNFDIKEVNVTLKEESKEKQKSKVEPVVIDT 148

Query: 58  KKKKKKEEQSVKEIEESLKE 77
           +  K KEE+   E  E +K 
Sbjct: 149 QTSKPKEEEEESEEAEKIKN 168


>gnl|CDD|234715 PRK00290, dnaK, molecular chaperone DnaK; Provisional.
          Length = 627

 Score = 32.4 bits (75), Expect = 0.37
 Identities = 23/75 (30%), Positives = 35/75 (46%), Gaps = 13/75 (17%)

Query: 8   AGEKKKR---IEKRNKEE------EKEENEQEEK----EKEDVEEEKNKKKKKKKKKKKK 54
           A E KKR   +E RN+ +      EK   E  +K    EKE +E    + K+  K + K+
Sbjct: 516 AEEDKKRKELVEARNQADSLIYQTEKTLKELGDKVPADEKEKIEAAIKELKEALKGEDKE 575

Query: 55  KKKKKKKKKEEQSVK 69
             K K ++  + S K
Sbjct: 576 AIKAKTEELTQASQK 590


>gnl|CDD|223618 COG0544, Tig, FKBP-type peptidyl-prolyl cis-trans isomerase
           (trigger factor) [Posttranslational modification,
           protein turnover, chaperones].
          Length = 441

 Score = 32.3 bits (74), Expect = 0.38
 Identities = 20/76 (26%), Positives = 41/76 (53%), Gaps = 5/76 (6%)

Query: 13  KRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK-----EEQS 67
           K ++KR   E  +E  ++  E++ +EE K K +K  +++ K+   +K+K++      E +
Sbjct: 238 KEVKKRELPELDDEFAKKLGEEDTLEELKEKLRKNLERELKEATLEKRKEQLLDALVEAN 297

Query: 68  VKEIEESLKERKEKTL 83
             ++ ESL E +   L
Sbjct: 298 DFDLPESLVEAEIDNL 313



 Score = 29.2 bits (66), Expect = 4.0
 Identities = 11/52 (21%), Positives = 21/52 (40%), Gaps = 4/52 (7%)

Query: 20  KEEEKEENEQEEKE----KEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
            EE  +     ++     K D+ EEK        KKK  +K+   ++   ++
Sbjct: 389 PEEVIKLYYNNQELLDALKADILEEKAVDLLLANKKKVTEKEVSFEELMNEA 440


>gnl|CDD|131972 TIGR02926, AhaH, ATP synthase archaeal, H subunit.  he A1/A0 ATP
          synthase is homologous to the V-type (V1/V0, vacuolar)
          ATPase, but functions in the ATP synthetic direction as
          does the F1/F0 ATPase of bacteria. The hydrophilic A1
          "stalk" complex (AhaABCDEFG) is the site of ATP
          generation and is coupled to the membrane-embedded
          proton translocating A0 complex. It is unclear
          precisely where AhaH fits into these complexes.
          Length = 85

 Score = 30.2 bits (68), Expect = 0.39
 Identities = 16/61 (26%), Positives = 40/61 (65%)

Query: 25 EENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLT 84
          EE ++ E++ E++ EE  +++K++  + +++ ++  ++ EE++ K  EE +KE +E+   
Sbjct: 2  EEIKKAEEDAEELIEEAEEERKQRIAEAREEARELLEEAEEEASKLGEEIIKEAEEEIEK 61

Query: 85 E 85
          E
Sbjct: 62 E 62



 Score = 29.0 bits (65), Expect = 1.1
 Identities = 17/66 (25%), Positives = 39/66 (59%)

Query: 8  AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
          A E++K+     +EE +E  E+ E+E   + EE  K+ +++ +K+ +K +++ +K+ E  
Sbjct: 18 AEEERKQRIAEAREEARELLEEAEEEASKLGEEIIKEAEEEIEKEAEKIREEGEKEIEAM 77

Query: 68 VKEIEE 73
            + +E
Sbjct: 78 KSKAKE 83



 Score = 27.5 bits (61), Expect = 3.1
 Identities = 16/75 (21%), Positives = 40/75 (53%)

Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
          E+ K+ E+  +E  +E  E+ ++   +  EE  +  ++ +++  K  ++  K+ EE+  K
Sbjct: 2  EEIKKAEEDAEELIEEAEEERKQRIAEAREEARELLEEAEEEASKLGEEIIKEAEEEIEK 61

Query: 70 EIEESLKERKEKTLT 84
          E E+  +E +++   
Sbjct: 62 EAEKIREEGEKEIEA 76


>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
          Length = 1036

 Score = 32.5 bits (74), Expect = 0.40
 Identities = 19/62 (30%), Positives = 36/62 (58%), Gaps = 3/62 (4%)

Query: 10  EKKKRIEKRNKEE---EKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
           EK++ +EK  KEE   E++  EQ  +E+E    E ++ + K + +K+++K +   KK  +
Sbjct: 253 EKRRELEKLAKEEAERERQAEEQRRREEEKAAMEADRAQAKAEVEKRREKLQNLLKKASR 312

Query: 67  SV 68
           S 
Sbjct: 313 SA 314



 Score = 30.2 bits (68), Expect = 2.2
 Identities = 12/51 (23%), Positives = 31/51 (60%)

Query: 15  IEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
           +E++ +E EK   E+ E+E++  E+ + +++K   +  + + K + +K+ E
Sbjct: 251 LEEKRRELEKLAKEEAERERQAEEQRRREEEKAAMEADRAQAKAEVEKRRE 301



 Score = 29.5 bits (66), Expect = 4.1
 Identities = 29/116 (25%), Positives = 48/116 (41%), Gaps = 30/116 (25%)

Query: 21  EEEKE---ENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI--EESL 75
           EEE +   + ++E + +E+      + K +K  + K + K+K  K    S K I   E L
Sbjct: 430 EEEHQIYRKLQEERRLREEAI----RAKAEKTARMKAEMKEKTLKMFLLSQKHIVYTEPL 485

Query: 76  KERKEKTLTEDSQCDTMIPVFKKGVLYKGIYCPSLT--NSFREPSLEMSYQRYSHR 129
           + +   T+T               VLY     P+ T  N   E     S+ R++HR
Sbjct: 486 EVQAGTTVT---------------VLYN----PANTVLNGKPEVWFRGSFNRWTHR 522


>gnl|CDD|226814 COG4379, COG4379, Mu-like prophage tail protein gpP [General
           function prediction only].
          Length = 386

 Score = 32.1 bits (73), Expect = 0.41
 Identities = 11/32 (34%), Positives = 15/32 (46%)

Query: 39  EEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
           E   K+K  +K    K KK  KK + E +  E
Sbjct: 355 EAARKRKGDRKGVSHKAKKGGKKNQAETAGFE 386


>gnl|CDD|203462 pfam06476, DUF1090, Protein of unknown function (DUF1090).  This
           family consists of several bacterial proteins of unknown
           function and is known as YqjC in E. coli.
          Length = 115

 Score = 30.7 bits (70), Expect = 0.41
 Identities = 18/52 (34%), Positives = 29/52 (55%), Gaps = 4/52 (7%)

Query: 24  KEENEQ--EEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
           + E +   EEKE+E  E E+  K+ + K    K +KK++K  E Q   E++E
Sbjct: 65  RAERQAKVEEKEQEVAEREQELKEAQAKGDADKIEKKQRKLAEAQ--AELQE 114



 Score = 29.9 bits (68), Expect = 0.64
 Identities = 14/50 (28%), Positives = 30/50 (60%), Gaps = 1/50 (2%)

Query: 17  KRNKEEEKEENEQEEKEKE-DVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
           +  ++ + EE EQE  E+E +++E + K    K +KK++K  + + + +E
Sbjct: 65  RAERQAKVEEKEQEVAEREQELKEAQAKGDADKIEKKQRKLAEAQAELQE 114



 Score = 27.6 bits (62), Expect = 4.1
 Identities = 17/79 (21%), Positives = 37/79 (46%), Gaps = 11/79 (13%)

Query: 7  GAGEKKKRIEKRNKEEEKEENEQE----EKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
          G   K++ IE + +  +   N+      EK  ++V+        + +++ K ++K+++  
Sbjct: 21 GCAAKRQAIETQIEYAKAHGNQHRIAGLEKALKEVQAHCTDAGLRAERQAKVEEKEQE-- 78

Query: 63 KEEQSVKEIEESLKERKEK 81
               V E E+ LKE + K
Sbjct: 79 -----VAEREQELKEAQAK 92



 Score = 27.2 bits (61), Expect = 5.6
 Identities = 9/48 (18%), Positives = 29/48 (60%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK 57
           E++ ++E++ +E  + E E +E + +   ++  KK++K  + + + ++
Sbjct: 67  ERQAKVEEKEQEVAEREQELKEAQAKGDADKIEKKQRKLAEAQAELQE 114



 Score = 27.2 bits (61), Expect = 5.9
 Identities = 18/57 (31%), Positives = 30/57 (52%), Gaps = 5/57 (8%)

Query: 8   AGEKKKRIEK-RNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
           AG + +R  K   KE+E  E EQE KE +     K    K +KK++K  + + + ++
Sbjct: 62  AGLRAERQAKVEEKEQEVAEREQELKEAQ----AKGDADKIEKKQRKLAEAQAELQE 114


>gnl|CDD|201540 pfam00992, Troponin, Troponin.  Troponin (Tn) contains three
           subunits, Ca2+ binding (TnC), inhibitory (TnI), and
           tropomyosin binding (TnT). this Pfam contains members of
           the TnT subunit. Troponin is a complex of three
           proteins, Ca2+ binding (TnC), inhibitory (TnI), and
           tropomyosin binding (TnT). The troponin complex
           regulates Ca++ induced muscle contraction. This family
           includes troponin T and troponin I. Troponin I binds to
           actin and troponin T binds to tropomyosin.
          Length = 131

 Score = 31.0 bits (71), Expect = 0.42
 Identities = 18/74 (24%), Positives = 33/74 (44%), Gaps = 9/74 (12%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKK----KKKKKK---KKKKKKKKKKKKKE 64
             RI++   +EE+ + E++  +K+   E+  KK    + K KK   KK +K      K  
Sbjct: 55  HARIDRL--DEERYDIEEKVAKKDKEIEDLKKKVNDLRGKFKKPTLKKVRKSADAMLKAL 112

Query: 65  EQSVKEIEESLKER 78
             S  ++   L+  
Sbjct: 113 LGSKHKVSMDLRAN 126



 Score = 28.3 bits (64), Expect = 3.3
 Identities = 16/54 (29%), Positives = 27/54 (50%), Gaps = 10/54 (18%)

Query: 21  EEEKEENEQEEKEKEDVEEEKNKKKK----KKKK---KKKKKKK---KKKKKKE 64
           ++     ++ ++E+ D+EE+  KK K     KKK    + K KK   KK +K  
Sbjct: 52  KKLHARIDRLDEERYDIEEKVAKKDKEIEDLKKKVNDLRGKFKKPTLKKVRKSA 105


>gnl|CDD|212055 cd11486, SLC5sbd_SGLT1, Na(+)/glucose cotransporter SGLT1;solute
           binding domain.  Human SGLT1 (hSGLT1) is a
           high-affinity/low-capacity glucose transporter, which
           can also transport galactose. In the transport
           mechanism, two Na+ ions first bind to the extracellular
           side of the transporter and induce a conformational
           change in the glucose binding site. This results in an
           increased affinity for glucose. A second conformational
           change in the transporter follows, bringing the Na+ and
           glucose binding sites to the inner surface of the
           membrane. Glucose is then released, followed by the Na+
           ions. In the process, hSGLT1 is also able to transport
           water and urea and may be a major pathway for transport
           of these across the intestinal brush-border membrane.
           hSGLT1 is encoded by the SLC5A1 gene and expressed
           mostly in the intestine, but also in the trachea,
           kidney, heart, brain, testis, and prostate. The
           WHO/UNICEF oral rehydration solution (ORS) for the
           treatment of secretory diarrhea contains salt and
           glucose. The glucose, along with sodium ions, is
           transported by hSGLT1 and water is either co-transported
           along with these or follows by osmosis. Mutations in
           SGLT1 are associated with intestinal glucose galactose
           malabsorption (GGM). Up-regulation of intestinal SGLT1
           may protect against enteric infections. SGLT1 is
           expressed in colorectal, head and neck, and prostate
           tumors. Epidermal growth factor receptor (EGFR)
           functions in cell survival by stabilizing SGLT1, and
           thereby maintaining intracellular glucose levels. SGLT1
           is predicted to have 14 membrane-spanning regions. This
           subgroup belongs to the solute carrier 5
           (SLC5)transporter family.
          Length = 635

 Score = 32.5 bits (74), Expect = 0.42
 Identities = 15/74 (20%), Positives = 33/74 (44%), Gaps = 8/74 (10%)

Query: 18  RNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKK--------KKKKKKKKEEQSVK 69
           RN  EE+ + + ++  +++ E E    +++KK    +K         + K  K  E+   
Sbjct: 538 RNSTEERIDLDADDWTEDEDENEMETDEERKKPGCCRKAYNWFCGFDQGKAPKLTEEEEA 597

Query: 70  EIEESLKERKEKTL 83
            ++  + +  EK L
Sbjct: 598 ALKMKMTDTSEKPL 611


>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
          Length = 330

 Score = 32.1 bits (74), Expect = 0.43
 Identities = 10/21 (47%), Positives = 15/21 (71%)

Query: 20  KEEEKEENEQEEKEKEDVEEE 40
              E+EE E+EE+E+E+  EE
Sbjct: 299 AAAEEEEEEEEEEEEEEPSEE 319



 Score = 31.8 bits (73), Expect = 0.50
 Identities = 12/21 (57%), Positives = 16/21 (76%)

Query: 20  KEEEKEENEQEEKEKEDVEEE 40
             EE+EE E+EE+E+E  EEE
Sbjct: 300 AAEEEEEEEEEEEEEEPSEEE 320


>gnl|CDD|234706 PRK00269, zipA, cell division protein ZipA; Reviewed.
          Length = 293

 Score = 32.1 bits (73), Expect = 0.44
 Identities = 19/100 (19%), Positives = 46/100 (46%), Gaps = 14/100 (14%)

Query: 1   MRKRRGGAGEKKKRIEKR--NKEEEKEENEQE-------EKEKEDVEEEKNK-----KKK 46
            R+ RGG G+ K R+++   N  +E+E + +        +  KE   +E +      + +
Sbjct: 25  WRRMRGGKGKLKFRLDRSYSNLPDEEEGSAELLGPARVLDTHKEPQLDEHDLPSMSARPR 84

Query: 47  KKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTED 86
           ++++  K  K++K+ +  E    ++   L E +    ++ 
Sbjct: 85  ERRRDTKTAKQQKRGRGSEPQQGDLNLDLDEVEPALFSDR 124


>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
           N-terminal.  SRP is a complex of six distinct
           polypeptides and a 7S RNA that is essential for
           transferring nascent polypeptide chains that are
           destined for export from the cell to the translocation
           apparatus of the endoplasmic reticulum (ER) membrane.
           SRP binds hydrophobic signal sequences as they emerge
           from the ribosome, and arrests translation.
          Length = 272

 Score = 32.0 bits (73), Expect = 0.44
 Identities = 11/85 (12%), Positives = 23/85 (27%), Gaps = 21/85 (24%)

Query: 1   MRKRRGGAGEKKKRIEKRNKEEEKEENEQ---------------------EEKEKEDVEE 39
           M +R+      K++  K+ +E                               +  +   +
Sbjct: 127 MIERKPKEPGLKRKQRKKAQESATSPESSPSSTPNSSRPSTPHLLKAKEGPSRRAKKAAK 186

Query: 40  EKNKKKKKKKKKKKKKKKKKKKKKE 64
             +      +K  K K   KK  K+
Sbjct: 187 LSSTASSGDEKSPKSKAAPKKAGKK 211



 Score = 31.3 bits (71), Expect = 0.77
 Identities = 11/44 (25%), Positives = 18/44 (40%), Gaps = 1/44 (2%)

Query: 20  KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
           K +E     + +K  +      +  +K  K K   KK  KK +K
Sbjct: 172 KAKEGPSR-RAKKAAKLSSTASSGDEKSPKSKAAPKKAGKKMRK 214



 Score = 27.8 bits (62), Expect = 7.9
 Identities = 15/60 (25%), Positives = 27/60 (45%)

Query: 18  RNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKE 77
           R  E+E ++  +  K     EE K  KK      ++K K+   K+K+ +  +E   S + 
Sbjct: 95  RELEKESKKQAKSPKAMRTFEESKKSKKTVDSMIERKPKEPGLKRKQRKKAQESATSPES 154


>gnl|CDD|115072 pfam06391, MAT1, CDK-activating kinase assembly factor MAT1.  MAT1
           is an assembly/targeting factor for cyclin-dependent
           kinase-activating kinase (CAK), which interacts with the
           transcription factor TFIIH. The domain found to the
           N-terminal side of this domain is a C3HC4 RING finger.
          Length = 200

 Score = 31.7 bits (72), Expect = 0.44
 Identities = 17/77 (22%), Positives = 42/77 (54%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
           + +     ++EE E+  +EEKE ++ +    +K+++++K  K+K K++   + E S    
Sbjct: 83  RNKRRLTREQEELEQALEEEKEMKEEKRLHLQKEEQEQKMAKEKDKQEIIDELETSNLPA 142

Query: 72  EESLKERKEKTLTEDSQ 88
              + + K+++   +SQ
Sbjct: 143 NVIIAQHKKQSKQLESQ 159



 Score = 29.0 bits (65), Expect = 3.5
 Identities = 24/100 (24%), Positives = 42/100 (42%), Gaps = 19/100 (19%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKK------------KKKKKKKK 57
           E+    EK  KEE++   ++EE+E     ++  K+K K++                 + K
Sbjct: 96  EQALEEEKEMKEEKRLHLQKEEQE-----QKMAKEKDKQEIIDELETSNLPANVIIAQHK 150

Query: 58  KKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDTMIPVFK 97
           K+ K+ E Q   E  E  K     T  +  +  + +PV K
Sbjct: 151 KQSKQLESQ--VEKLERKKRVTFSTGIKKGEHISFVPVIK 188


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
           Fas-binding protein) is thought to play a role in
           apoptosis, but precise role played by Daxx remains to be
           determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 32.2 bits (73), Expect = 0.44
 Identities = 21/107 (19%), Positives = 55/107 (51%), Gaps = 18/107 (16%)

Query: 1   MRKRRGGAGEKKKRIEK-------RNKEEEKEENE---------QEEKEKEDVEEEKNKK 44
           M++      E++KR E+       R+ +  K  +          QE +E+E VEEE+ ++
Sbjct: 393 MKQDDTEEEERRKRQERERQGTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEE 452

Query: 45  KKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDT 91
           +++++++++ ++++ + ++EE+ V+    S  E + +  +E      
Sbjct: 453 EEEEEEEQESEEEEGEDEEEEEEVEADNGS--EEEMEGSSEGDGDGE 497


>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342).  This
           family of bacterial proteins has no known function. The
           proteins are in the region of 500-600 amino acid
           residues in length.
          Length = 450

 Score = 32.2 bits (74), Expect = 0.44
 Identities = 16/72 (22%), Positives = 40/72 (55%), Gaps = 3/72 (4%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E K+ +++  +E ++ E E E+ +K     +K  KK + +   +K+++ +K  + ++ + 
Sbjct: 331 ELKEELKELEEELKELEEELEKIKKL---LKKLPKKARGQLPPEKREQLEKLLETKEKLS 387

Query: 70  EIEESLKERKEK 81
           E  E L+E  ++
Sbjct: 388 EELEELEEELKE 399



 Score = 31.5 bits (72), Expect = 0.82
 Identities = 26/93 (27%), Positives = 48/93 (51%), Gaps = 15/93 (16%)

Query: 19  NKEEEKEENEQEEKEKEDVEEE-------------KNKKKKKKKKKKKKKKKKKKKKKEE 65
           +  E KEE ++ E+E +++EEE             K + +   +K+++ +K  + K+K  
Sbjct: 328 DFPELKEELKELEEELKELEEELEKIKKLLKKLPKKARGQLPPEKREQLEKLLETKEKLS 387

Query: 66  QSVKEIEESLKERKEKTLTEDSQCDTMIPVFKK 98
           + ++E+EE LKE KE+   E    +  I V K 
Sbjct: 388 EELEELEEELKELKEE--LESLYSEGKISVNKT 418



 Score = 28.8 bits (65), Expect = 5.0
 Identities = 12/61 (19%), Positives = 33/61 (54%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
           KK ++K  K+   +   ++ ++ E + E K K  ++ ++ +++ K+ K++ +   S  +I
Sbjct: 354 KKLLKKLPKKARGQLPPEKREQLEKLLETKEKLSEELEELEEELKELKEELESLYSEGKI 413

Query: 72  E 72
            
Sbjct: 414 S 414


>gnl|CDD|233573 TIGR01781, Trep_dent_lipo, Treponema denticola clustered
          lipoprotein.  This model represents a family of six
          predicted lipoproteins from a region of about 20
          tandemly arranged genes in the Treponema denticola
          genome. Two other neighboring genes share the
          lipoprotein signal peptide region but do not show more
          extensive homology. The function of this locus is
          unknown.
          Length = 412

 Score = 32.2 bits (73), Expect = 0.45
 Identities = 16/55 (29%), Positives = 27/55 (49%), Gaps = 1/55 (1%)

Query: 9  GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
          G ++ R+EK +K EEK ENE   K + +   +  ++ K+   K      K +  K
Sbjct: 29 GLREMRVEKFSKMEEK-ENEFLSKPEYNTHVKSPEQIKELSNKISGVSGKGESDK 82


>gnl|CDD|220600 pfam10147, CR6_interact, Growth arrest and DNA-damage-inducible
           proteins-interacting protein 1.  Members of this family
           of proteins act as negative regulators of G1 to S cell
           cycle phase progression by inhibiting cyclin-dependent
           kinases. Inhibitory effects are additive with GADD45
           proteins but occur also in the absence of GADD45
           proteins. Furthermore, they act as a repressor of the
           orphan nuclear receptor NR4A1 by inhibiting AB
           domain-mediated transcriptional activity.
          Length = 217

 Score = 31.7 bits (72), Expect = 0.45
 Identities = 14/69 (20%), Positives = 35/69 (50%), Gaps = 10/69 (14%)

Query: 17  KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLK 76
           + N+E++KE+  + +  + ++ +   K  +     + +K+K+++K +            K
Sbjct: 106 EENREQQKEKEARRQAREAEIAKNMAKMPQMIADWRAQKRKREQKARAA----------K 155

Query: 77  ERKEKTLTE 85
           ERKE+ + E
Sbjct: 156 ERKERLVAE 164



 Score = 29.8 bits (67), Expect = 2.0
 Identities = 20/77 (25%), Positives = 34/77 (44%), Gaps = 16/77 (20%)

Query: 2   RKRRGGAGEKKKRIEKRNKE------------EEKEENEQEEKEKEDVEEEKNKKKKKKK 49
           RKR   A   K+R E+   E            + + +   ++KEKE    EK K K+ K+
Sbjct: 145 RKREQKARAAKERKERLVAEAREHFGYWVDPRDPRFQEMLQQKEKE----EKKKVKEAKR 200

Query: 50  KKKKKKKKKKKKKKEEQ 66
           ++K++K+         Q
Sbjct: 201 REKEEKRMAALVAAAAQ 217



 Score = 27.9 bits (62), Expect = 6.7
 Identities = 9/52 (17%), Positives = 28/52 (53%)

Query: 11  KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
           ++++ EK  + + +E    +   K        + +K+K+++K +  K++K++
Sbjct: 109 REQQKEKEARRQAREAEIAKNMAKMPQMIADWRAQKRKREQKARAAKERKER 160


>gnl|CDD|146486 pfam03879, Cgr1, Cgr1 family.  Members of this family are
          coiled-coil proteins that are involved in pre-rRNA
          processing.
          Length = 105

 Score = 30.4 bits (69), Expect = 0.47
 Identities = 18/70 (25%), Positives = 45/70 (64%), Gaps = 2/70 (2%)

Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
          +KR+EKR +++  +  E+E K++++ E ++  +  K+++  K++K++ +K   +   K++
Sbjct: 31 EKRMEKRLEQQAIKAREKELKDEKEAERQRRIQAIKERRAAKEEKERYEKMAAKMHAKKV 90

Query: 72 EESLKERKEK 81
          E    +R+EK
Sbjct: 91 ER--LKRREK 98



 Score = 29.7 bits (67), Expect = 0.78
 Identities = 14/39 (35%), Positives = 24/39 (61%)

Query: 42 NKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
           KK  + K K    +K+ +K+ E+Q++K  E+ LK+ KE
Sbjct: 17 EKKAFRPKSKLTSWEKRMEKRLEQQAIKAREKELKDEKE 55



 Score = 27.4 bits (61), Expect = 5.6
 Identities = 23/72 (31%), Positives = 34/72 (47%), Gaps = 8/72 (11%)

Query: 1   MRKRRGGAGEKKKRIE-KRNKEEEKEENEQEEKEKEDVEEEKN-------KKKKKKKKKK 52
           M KR      K +  E K  KE E++   Q  KE+   +EEK        K   KK ++ 
Sbjct: 34  MEKRLEQQAIKAREKELKDEKEAERQRRIQAIKERRAAKEEKERYEKMAAKMHAKKVERL 93

Query: 53  KKKKKKKKKKKE 64
           K+++K+ K  KE
Sbjct: 94  KRREKRNKLLKE 105


>gnl|CDD|234635 PRK00104, scpA, segregation and condensation protein A; Reviewed.
          Length = 242

 Score = 31.7 bits (73), Expect = 0.47
 Identities = 22/92 (23%), Positives = 41/92 (44%), Gaps = 14/92 (15%)

Query: 8   AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKK--------------KKKKKKKK 53
           A E K++ E+R +   K  ++ EE + E  E+E +                 +KK  K +
Sbjct: 105 AEELKEQEEERAQYFSKPPSDLEEFDAELDEDELDGDVTLLDLFLAFSKLMERKKANKPR 164

Query: 54  KKKKKKKKKKEEQSVKEIEESLKERKEKTLTE 85
             K ++ + K E  +  I E L+ +K  + +E
Sbjct: 165 HTKIERDEYKIEDRMLYILELLESKKRLSFSE 196


>gnl|CDD|114015 pfam05266, DUF724, Protein of unknown function (DUF724).  This
           family contains several uncharacterized proteins found
           in Arabidopsis thaliana and other plants. This region is
           often found associated with Agenet domains and may
           contain coiled-coil.
          Length = 190

 Score = 31.3 bits (71), Expect = 0.48
 Identities = 17/67 (25%), Positives = 31/67 (46%), Gaps = 1/67 (1%)

Query: 11  KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
           K  + +K  +E +  E E  EKE    E +    + ++K  + +++    K+K+E   KE
Sbjct: 102 KDDQTKKL-EERKGLEKEIAEKEISRQELDSEIAELERKILELQRQAALLKEKKEAEDKE 160

Query: 71  IEESLKE 77
           I     E
Sbjct: 161 IARLKSE 167



 Score = 30.1 bits (68), Expect = 1.4
 Identities = 18/88 (20%), Positives = 40/88 (45%), Gaps = 3/88 (3%)

Query: 1   MRKRRGGAGEKKKRIEKR--NKEEEKEENEQEEKE-KEDVEEEKNKKKKKKKKKKKKKKK 57
           ++  +    E++K +EK    KE  ++E + E  E +  + E + +    K+KK+ + K+
Sbjct: 101 LKDDQTKKLEERKGLEKEIAEKEISRQELDSEIAELERKILELQRQAALLKEKKEAEDKE 160

Query: 58  KKKKKKEEQSVKEIEESLKERKEKTLTE 85
             + K E   +K+     +   + T   
Sbjct: 161 IARLKSEASKIKQELVDAELEFQTTAAA 188


>gnl|CDD|145900 pfam02994, Transposase_22, L1 transposable element. 
          Length = 370

 Score = 31.9 bits (72), Expect = 0.49
 Identities = 22/97 (22%), Positives = 42/97 (43%), Gaps = 8/97 (8%)

Query: 4   RRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
           ++      ++ IE + KE +    E EE   + VE      +K  K+  + KKK ++ K+
Sbjct: 53  KKDNYSSLREDIETKGKEVQNFLKELEECITKQVEAHIENTEKCLKELMELKKKARELKE 112

Query: 64  EEQSVK--------EIEESLKERKEKTLTEDSQCDTM 92
           E +S+K        +IE   K +     +  ++   M
Sbjct: 113 ECRSIKKEHMETTLDIENQKKRQGVVDTSFTNRIQEM 149


>gnl|CDD|241262 cd01228, PH_BCR-related, Breakpoint Cluster Region-related
           pleckstrin homology (PH) domain.  The BCR gene is one of
           the two genes in the BCR-ABL complex, which is
           associated with the Philadelphia chromosome, a product
           of a reciprocal translocation between chromosomes 22 and
           9. BCR is a GTPase-activating protein (GAP) for RAC1
           (primarily) and CDC42. The Dbl region of BCR has the
           most RhoGEF activity for Cdc42, and less activity
           towards Rac and Rho. Since BCR possesses both GAP and
           GEF activities, it may function to temporally regulate
           the activity of these GTPases. It also displays
           serine/threonine kinase activity. The BCR protein
           contains multiple domains including an N-terminal kinase
           domain, a RhoGEF domain, a PH domain, a C1 domain, a C2
           domain, and a C-terminal RhoGAP domain. ABR, a related
           smaller protein, is structurally similar to BCR, but
           lacks the N-terminal kinase domain and has GAP activity
           for both Rac and Cdc42. PH domains have diverse
           functions, but in general are involved in targeting
           proteins to the appropriate cellular location or in the
           interaction with a binding partner. They share little
           sequence conservation, but all have a common fold, which
           is electrostatically polarized. Less than 10% of PH
           domains bind phosphoinositide phosphates (PIPs) with
           high affinity and specificity. PH domains are
           distinguished from other PIP-binding domains by their
           specific high-affinity binding to PIPs with two vicinal
           phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
           PtdIns(3,4,5)P3 which results in targeting some PH
           domain proteins to the plasma membrane. A few display
           strong specificity in lipid binding. Any specificity is
           usually determined by loop regions or insertions in the
           N-terminus of the domain, which are not conserved across
           all PH domains. PH domains are found in cellular
           signaling proteins such as serine/threonine kinase,
           tyrosine kinases, regulators of G-proteins, endocytotic
           GTPases, adaptors, as well as cytoskeletal associated
           molecules and in lipid associated enzymes.
          Length = 179

 Score = 31.2 bits (71), Expect = 0.50
 Identities = 9/48 (18%), Positives = 20/48 (41%)

Query: 21  EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
            EE  + ++     +   +  + ++KK      +  +K +KK  EQ  
Sbjct: 88  SEELVQMKKAISALKQKIKRDSAERKKAASSGSRAIEKLRKKLAEQEA 135



 Score = 29.6 bits (67), Expect = 1.9
 Identities = 18/65 (27%), Positives = 26/65 (40%), Gaps = 5/65 (7%)

Query: 22  EEKEENEQEEKEK-EDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
             K+E+E       +  EE    KK     K+K K+   ++KK   S     E L+    
Sbjct: 72  HPKDESEAAPDVVPKSSEELVQMKKAISALKQKIKRDSAERKKAASSGSRAIEKLR---- 127

Query: 81  KTLTE 85
           K L E
Sbjct: 128 KKLAE 132


>gnl|CDD|240577 cd12950, RRP7_Rrp7p, RRP7 domain ribosomal RNA-processing protein 7
           (Rrp7p) and similar proteins.  This CD corresponds to
           the RRP7 domain of Rrp7p. Rrp7p is encoded by YCL031C
           gene from Saccharomyces cerevisiae. It is an essential
           yeast protein involved in pre-rRNA processing and
           ribosome assembly. Rrp7p contains an N-terminal RNA
           recognition motif (RRM), also termed RBD (RNA binding
           domain) or RNP (ribonucleoprotein domain), and a
           C-terminal RRP7 domain.
          Length = 128

 Score = 30.7 bits (70), Expect = 0.50
 Identities = 31/97 (31%), Positives = 46/97 (47%), Gaps = 31/97 (31%)

Query: 16  EKRNKEEEKEENEQEEKEKED--------------VEEEKNKKKKKKKKKKKKKKKKKK- 60
           E R +E ++E      +  ED                EE  K  +++KK+K+KKKKKKK 
Sbjct: 31  EAREEERKRERKRLRNEPDEDGFVTVVRGGRKGPAAGEEAGKAAEEEKKEKEKKKKKKKE 90

Query: 61  -----------KKKEEQS--VKEIEES---LKERKEK 81
                      KKKEEQ+  +K+ EE    +++ KEK
Sbjct: 91  LEDFYRFQLREKKKEEQADLLKKFEEDKERVEKMKEK 127



 Score = 29.2 bits (66), Expect = 1.6
 Identities = 19/69 (27%), Positives = 38/69 (55%), Gaps = 7/69 (10%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKED-----VEEEKNKKKKKKKKKKKKKK 56
           RK      E  K      +++EKE+ ++++KE ED     + E+K +++    KK ++ K
Sbjct: 61  RKGPAAGEEAGKA--AEEEKKEKEKKKKKKKELEDFYRFQLREKKKEEQADLLKKFEEDK 118

Query: 57  KKKKKKKEE 65
           ++ +K KE+
Sbjct: 119 ERVEKMKEK 127



 Score = 27.2 bits (61), Expect = 6.4
 Identities = 15/65 (23%), Positives = 34/65 (52%), Gaps = 8/65 (12%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQE--------EKEKEDVEEEKNKKKKKKKKKKK 53
                 AG+  +  +K  ++++K++ E E        EK+KE+  +   K ++ K++ +K
Sbjct: 64  PAAGEEAGKAAEEEKKEKEKKKKKKKELEDFYRFQLREKKKEEQADLLKKFEEDKERVEK 123

Query: 54  KKKKK 58
            K+K+
Sbjct: 124 MKEKR 128


>gnl|CDD|169428 PRK08404, PRK08404, V-type ATP synthase subunit H; Validated.
          Length = 103

 Score = 30.1 bits (68), Expect = 0.50
 Identities = 25/72 (34%), Positives = 49/72 (68%), Gaps = 1/72 (1%)

Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
          E ++RIEK  KEE K+   + ++E + +EEE  KK +++ +K  +KKKK+ +++ ++ ++
Sbjct: 14 EAEERIEKA-KEEAKKIIRKAKEEAKKIEEEIIKKAEEEAQKLIEKKKKEGEEEAKKILE 72

Query: 70 EIEESLKERKEK 81
          E E+ ++E K K
Sbjct: 73 EGEKEIEELKVK 84


>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413).  This is
           a family of proteins conserved in fungi. The function is
           not known.
          Length = 436

 Score = 32.1 bits (73), Expect = 0.51
 Identities = 23/94 (24%), Positives = 36/94 (38%), Gaps = 8/94 (8%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKK--------KKKKKK 61
           EK    + +  +  K+  E +E   E ++E +  +K K  KK K+          KK  K
Sbjct: 11  EKAPTKKPKKGDASKDSTEDDEDILEFLDELEQSEKAKPPKKPKEASRPGTPRNPKKSSK 70

Query: 62  KKEEQSVKEIEESLKERKEKTLTEDSQCDTMIPV 95
             E  +    E+  K RK    T  S   +  P 
Sbjct: 71  PTESSAASSEEKPAKPRKSAESTRSSHPKSKAPS 104



 Score = 29.0 bits (65), Expect = 4.7
 Identities = 15/92 (16%), Positives = 38/92 (41%), Gaps = 6/92 (6%)

Query: 3   KRRGGAGEKKKRIEKRNKE--EEKEENEQEEKEKEDVEEEKNKK----KKKKKKKKKKKK 56
            ++   G+  K   + +++  E  +E EQ EK K   + ++  +    +  KK  K  + 
Sbjct: 15  TKKPKKGDASKDSTEDDEDILEFLDELEQSEKAKPPKKPKEASRPGTPRNPKKSSKPTES 74

Query: 57  KKKKKKKEEQSVKEIEESLKERKEKTLTEDSQ 88
                +++    ++  ES +    K+    ++
Sbjct: 75  SAASSEEKPAKPRKSAESTRSSHPKSKAPSTE 106


>gnl|CDD|129587 TIGR00496, frr, ribosome recycling factor.  This model finds only
           eubacterial proteins. Mitochondrial and/or chloroplast
           forms might be expected but are not currently known.
           This protein was previously called ribosome releasing
           factor. By releasing ribosomes from mRNA at the end of
           protein biosynthesis, it prevents inappropriate
           translation from 3-prime regions of the mRNA and frees
           the ribosome for new rounds of translation.
           EGAD|53116|YHR038W is part of the frr superfamily
           [Protein synthesis, Translation factors].
          Length = 176

 Score = 31.3 bits (71), Expect = 0.51
 Identities = 22/68 (32%), Positives = 39/68 (57%), Gaps = 7/68 (10%)

Query: 24  KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE------QSVKEIEESLKE 77
           K+  EQ +    +V  + N K KK +K K+  + ++++ +EE      + +K+I+E LK+
Sbjct: 109 KKIAEQAKVAVRNVRRDANDKVKKLEKDKEISEDEERRLQEEIQKLTDEYIKKIDEILKD 168

Query: 78  RKEKTLTE 85
            KEK L E
Sbjct: 169 -KEKELME 175


>gnl|CDD|232965 TIGR00414, serS, seryl-tRNA synthetase.  This model represents
          the seryl-tRNA synthetase found in most organisms. This
          protein is a class II tRNA synthetase, and is
          recognized by the pfam model tRNA-synt_2b. The
          seryl-tRNA synthetases of two archaeal species,
          Methanococcus jannaschii and Methanobacterium
          thermoautotrophicum, differ considerably and are
          included in a different model [Protein synthesis, tRNA
          aminoacylation].
          Length = 418

 Score = 31.9 bits (73), Expect = 0.51
 Identities = 19/79 (24%), Positives = 39/79 (49%)

Query: 2  RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
          + R        +++   + E +K  +E EE + +  E  K   K K +KK K ++ KK+ 
Sbjct: 19 KARGLSVDIDLEKLIALDDERKKLLSEIEELQAKRNELSKQIGKAKGQKKDKIEEIKKEL 78

Query: 62 KKEEQSVKEIEESLKERKE 80
          K+ ++ + E+  +LK  + 
Sbjct: 79 KELKEELTELSAALKALEA 97



 Score = 31.9 bits (73), Expect = 0.57
 Identities = 18/79 (22%), Positives = 40/79 (50%), Gaps = 1/79 (1%)

Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
            K+ ++ R    + +  +    + E  +     ++ + K+ +  K+  K K +++  ++
Sbjct: 13 LVKESLKARGLSVDIDLEKLIALDDERKKLLSEIEELQAKRNELSKQIGKAKGQKKDKIE 72

Query: 70 EIEESLKERKEKTLTEDSQ 88
          EI++ LKE KE+ LTE S 
Sbjct: 73 EIKKELKELKEE-LTELSA 90


>gnl|CDD|222812 PHA00727, PHA00727, hypothetical protein.
          Length = 278

 Score = 31.8 bits (72), Expect = 0.52
 Identities = 19/50 (38%), Positives = 31/50 (62%)

Query: 13 KRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
          K +++  K  EK E E ++++ E ++ E +KKKKK KK+K   + K  KK
Sbjct: 38 KTLKRLYKVYEKREFELKKQQFEQLKAELSKKKKKFKKEKVDVRVKVVKK 87


>gnl|CDD|240253 PTZ00068, PTZ00068, 60S ribosomal protein L13a; Provisional.
          Length = 202

 Score = 31.2 bits (71), Expect = 0.53
 Identities = 11/50 (22%), Positives = 23/50 (46%)

Query: 40  EKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQC 89
            K ++K+K++     KKK K +K  +++ K+    L +     L +    
Sbjct: 153 AKLEEKRKERAAAYYKKKVKLRKAWKEARKKALAKLPKAIVAVLKKFGYA 202



 Score = 29.6 bits (67), Expect = 1.6
 Identities = 10/43 (23%), Positives = 22/43 (51%)

Query: 38  EEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
           EE++ ++     KKK K +K  K+ +++   K  +  +   K+
Sbjct: 156 EEKRKERAAAYYKKKVKLRKAWKEARKKALAKLPKAIVAVLKK 198



 Score = 28.9 bits (65), Expect = 2.9
 Identities = 11/43 (25%), Positives = 25/43 (58%)

Query: 39  EEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
           EEK K++     KKK K +K  K+  ++++ ++ +++    +K
Sbjct: 156 EEKRKERAAAYYKKKVKLRKAWKEARKKALAKLPKAIVAVLKK 198


>gnl|CDD|192488 pfam10228, DUF2228, Uncharacterized conserved protein (DUF2228).
           This is a family of conserved proteins of approximately
           700 residues found from worms to humans.
          Length = 253

 Score = 31.6 bits (72), Expect = 0.54
 Identities = 14/50 (28%), Positives = 28/50 (56%)

Query: 35  EDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLT 84
           +  +E   KKKK    KK ++   +  K+   S+++  + +K+RK+K +T
Sbjct: 94  KKSKEIPFKKKKISALKKIQELLTEWAKELGLSLEQKTKVMKQRKKKVVT 143



 Score = 29.3 bits (66), Expect = 2.6
 Identities = 16/51 (31%), Positives = 27/51 (52%)

Query: 53  KKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDTMIPVFKKGVLYK 103
           KK K+   KKK+  ++K+I+E L E  ++      Q   ++   KK V+ K
Sbjct: 94  KKSKEIPFKKKKISALKKIQELLTEWAKELGLSLEQKTKVMKQRKKKVVTK 144


>gnl|CDD|224236 COG1317, FliH, Flagellar biosynthesis/type III secretory pathway
           protein [Cell motility and secretion / Intracellular
           trafficking and secretion].
          Length = 234

 Score = 31.6 bits (72), Expect = 0.55
 Identities = 20/81 (24%), Positives = 35/81 (43%), Gaps = 7/81 (8%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEK------EDVEEEKNK-KKKKKKKKKKKKKKKKKKK 62
           E++       KEEE E   QE +E       E  EE      ++  ++ +++ +  ++  
Sbjct: 39  EEELEQALEAKEEELESAAQELQEGIEEGAREGYEEGFQLGYEEGFEEGQEEGRVLERLA 98

Query: 63  KEEQSVKEIEESLKERKEKTL 83
           K     +   E+LKE  EK L
Sbjct: 99  KLIAEFQAELEALKEVVEKQL 119


>gnl|CDD|218258 pfam04774, HABP4_PAI-RBP1, Hyaluronan / mRNA binding family.
          This family includes the HABP4 family of
          hyaluronan-binding proteins, and the PAI-1 mRNA-binding
          protein, PAI-RBP1. HABP4 has been observed to bind
          hyaluronan (a glucosaminoglycan), but it is not known
          whether this is its primary role in vivo. It has also
          been observed to bind RNA, but with a lower affinity
          than that for hyaluronan. PAI-1 mRNA-binding protein
          specifically binds the mRNA of type-1 plasminogen
          activator inhibitor (PAI-1), and is thought to be
          involved in regulation of mRNA stability. However, in
          both cases, the sequence motifs predicted to be
          important for ligand binding are not conserved
          throughout the family, so it is not known whether
          members of this family share a common function.
          Length = 106

 Score = 30.1 bits (68), Expect = 0.56
 Identities = 21/82 (25%), Positives = 43/82 (52%), Gaps = 10/82 (12%)

Query: 4  RRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEE-EKNKKKKKKKKKKKKKKKKKKKK 62
          +R GAG       + N    K+E  +  +E+ + EE  + ++K+  +++  K+   ++++
Sbjct: 18 KREGAG-------RGNWGSVKDEIAELTEEQGEEEEKNEVEEKQAVEEEANKEGVVEEEE 70

Query: 63 KEEQSVKEI--EESLKERKEKT 82
           EE+  KE+  EE  K  +EK 
Sbjct: 71 VEEEEDKEMTLEEYEKALQEKR 92



 Score = 27.8 bits (62), Expect = 3.8
 Identities = 19/60 (31%), Positives = 31/60 (51%), Gaps = 5/60 (8%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNK-----KKKKKKKKKKKKKKKKKKKKE 64
           EK +  EK+  EEE  +    E+E+ + EE+K       +K  ++K+K     K +K KE
Sbjct: 46  EKNEVEEKQAVEEEANKEGVVEEEEVEEEEDKEMTLEEYEKALQEKRKALPALKVRKAKE 105


>gnl|CDD|221893 pfam13010, pRN1_helical, Primase helical domain.  This alpha
          helical domain is found in a set of bacterial plasmid
          replication proteins. The domain is found to the
          C-terminus of the primase/polymerase domain. Mutants of
          this domain are defective in template binding,
          dinucleotide formation and conformation change prior to
          DNA extension.
          Length = 135

 Score = 30.6 bits (69), Expect = 0.57
 Identities = 19/52 (36%), Positives = 32/52 (61%)

Query: 30 EEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
          E K++E+  EE  +K K++  K  + K K  +   E+  K+I++SL+E KEK
Sbjct: 1  EGKKEEEDSEEDFEKLKEEMAKYDRFKGKTVEAIREEVCKKIKKSLEELKEK 52



 Score = 28.2 bits (63), Expect = 3.6
 Identities = 19/68 (27%), Positives = 33/68 (48%), Gaps = 10/68 (14%)

Query: 22 EEKEENEQEEKEKEDVEEEKNKKKKKKKK----------KKKKKKKKKKKKKEEQSVKEI 71
          E K+E E  E++ E ++EE  K  + K K          KK KK  ++ K+K++  +   
Sbjct: 1  EGKKEEEDSEEDFEKLKEEMAKYDRFKGKTVEAIREEVCKKIKKSLEELKEKKKAILNTA 60

Query: 72 EESLKERK 79
          +  + E K
Sbjct: 61 KGVICEGK 68


>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
           fragment found in eukaryotic DNA topoisomerase (topo) IB
           proteins similar to the monomeric yeast and human topo I
           and heterodimeric topo I from Leishmania donvanni. Topo
           I enzymes are divided into:  topo type IA (bacterial)
           and type IB (eukaryotic). Topo I relaxes superhelical
           tension in duplex DNA by creating a single-strand nick,
           the broken strand can then rotate around the unbroken
           strand to remove DNA supercoils and, the nick is
           religated, liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit re-ligation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  In addition to differences in
           structure and some biochemical properties,
           Trypanosomatid parasite topo I differ from human topo I
           in their sensitivity to CPTs and other classical topo I
           inhibitors. Trypanosomatid topos I play putative roles
           in organizing the kinetoplast DNA network unique to
           these parasites.  This family may represent more than
           one structural domain.
          Length = 215

 Score = 31.5 bits (72), Expect = 0.57
 Identities = 19/55 (34%), Positives = 30/55 (54%), Gaps = 3/55 (5%)

Query: 20  KEEEKEENEQEEK-EKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
           K   KEE    +K  K D        +++K+KKK   K++KK  KEE+  +++EE
Sbjct: 69  KILTKEEKHIIKKLSKCDFTPIYQYFEEEKEKKKAMSKEEKKAIKEEK--EKLEE 121



 Score = 30.7 bits (70), Expect = 0.96
 Identities = 14/28 (50%), Positives = 20/28 (71%)

Query: 38  EEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
           EEEK KKK   K++KK  K++K+K +E 
Sbjct: 95  EEEKEKKKAMSKEEKKAIKEEKEKLEEP 122



 Score = 28.4 bits (64), Expect = 5.5
 Identities = 11/24 (45%), Positives = 15/24 (62%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKE 33
           EKKK + K  K+  KEE E+ E+ 
Sbjct: 99  EKKKAMSKEEKKAIKEEKEKLEEP 122


>gnl|CDD|220535 pfam10037, MRP-S27, Mitochondrial 28S ribosomal protein S27.
           Members of this family of small ribosomal proteins
           possess one of three conserved blocks of sequence found
           in proteins that stimulate the dissociation of guanine
           nucleotides from G-proteins, leaving open the
           possibility that MRP-S27 might be a functional partner
           of GTP-binding ribosomal proteins.
          Length = 417

 Score = 31.7 bits (72), Expect = 0.57
 Identities = 16/74 (21%), Positives = 33/74 (44%), Gaps = 8/74 (10%)

Query: 9   GEKKKRIEKRNKEE----EKEENEQEEKEKEDVEEEKNKKK----KKKKKKKKKKKKKKK 60
            E + +++   K E       E      EKED+E  + +++    + +K  ++KKK +++
Sbjct: 344 QEWRSKLQALAKVESRSLLSVEERLPTIEKEDLELYEQRQQLWFFENRKLWQRKKKLREQ 403

Query: 61  KKKEEQSVKEIEES 74
             +E Q       S
Sbjct: 404 ADEEYQQRHATRAS 417


>gnl|CDD|220654 pfam10254, Pacs-1, PACS-1 cytosolic sorting protein.  PACS-1 is a
           cytosolic sorting protein that directs the localisation
           of membrane proteins in the trans-Golgi network
           (TGN)/endosomal system. PACS-1 connects the clathrin
           adaptor AP-1 to acidic cluster sorting motifs contained
           in the cytoplasmic domain of cargo proteins such as
           furin, the cation-independent mannose-6-phosphate
           receptor and in viral proteins such as human
           immunodeficiency virus type 1 Nef.
          Length = 413

 Score = 31.7 bits (72), Expect = 0.58
 Identities = 17/62 (27%), Positives = 27/62 (43%), Gaps = 4/62 (6%)

Query: 44  KKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDTMIPVFKKGVLYK 103
            K+K KK       KK K+KE +S  +  E +     + +    Q  TM+ V   GV + 
Sbjct: 331 TKEKNKKVPVMFLGKKPKEKEVESKSQCIEGIS----RLICSAKQQQTMLRVSIDGVEWN 386

Query: 104 GI 105
            +
Sbjct: 387 DV 388



 Score = 28.2 bits (63), Expect = 7.3
 Identities = 11/30 (36%), Positives = 15/30 (50%)

Query: 43  KKKKKKKKKKKKKKKKKKKKKEEQSVKEIE 72
            K+K KK       KK K+K+ E   + IE
Sbjct: 331 TKEKNKKVPVMFLGKKPKEKEVESKSQCIE 360


>gnl|CDD|223542 COG0466, Lon, ATP-dependent Lon protease, bacterial type
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 782

 Score = 31.8 bits (73), Expect = 0.60
 Identities = 18/59 (30%), Positives = 36/59 (61%), Gaps = 4/59 (6%)

Query: 12  KKRIEKRNKE---EEKEENEQEE-KEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
           K+++EK  +E    E+ +  Q+E  E +D ++E  + ++K +K K  K+ K+K +KE +
Sbjct: 219 KEQMEKSQREYYLREQLKAIQKELGEDDDDKDEVEELREKIEKLKLPKEAKEKAEKELK 277


>gnl|CDD|215565 PLN03083, PLN03083, E3 UFM1-protein ligase 1 homolog; Provisional.
          Length = 803

 Score = 31.7 bits (72), Expect = 0.60
 Identities = 17/67 (25%), Positives = 32/67 (47%)

Query: 9   GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
           G KKK+ +  + +    E+  +++E    + +KN+KK + K  K     K   KKE    
Sbjct: 419 GSKKKKGKSTSTKGGTAESIPDDEEDAPKKGKKNQKKGRDKSSKVPSDSKAGGKKESVKS 478

Query: 69  KEIEESL 75
           +E   ++
Sbjct: 479 QEDNNNI 485



 Score = 29.8 bits (67), Expect = 2.9
 Identities = 14/51 (27%), Positives = 20/51 (39%)

Query: 31  EKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
           EK     E         K  KKKK K    K    +S+ + EE   ++ +K
Sbjct: 401 EKSLGSNESSPAASNSDKGSKKKKGKSTSTKGGTAESIPDDEEDAPKKGKK 451


>gnl|CDD|227701 COG5414, COG5414, TATA-binding protein-associated factor
           [Transcription].
          Length = 392

 Score = 31.6 bits (71), Expect = 0.62
 Identities = 13/76 (17%), Positives = 37/76 (48%), Gaps = 8/76 (10%)

Query: 6   GGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
              G+K+++ E+    E  +E  Q ++  E  EE        K++  + ++ ++  +   
Sbjct: 289 VSEGDKEQQQEEVENAEAHKEEVQSDRPDEIGEE--------KEEDDENEENERHTELLA 340

Query: 66  QSVKEIEESLKERKEK 81
             + E+E+ ++E++ +
Sbjct: 341 DELNELEKGIEEKRRQ 356



 Score = 30.8 bits (69), Expect = 1.1
 Identities = 17/73 (23%), Positives = 39/73 (53%), Gaps = 1/73 (1%)

Query: 15  IEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES 74
            E++ +++E EENE+  +   D   E  K  ++K+++ +       +K+ E  +  + + 
Sbjct: 320 GEEKEEDDENEENERHTELLADELNELEKGIEEKRRQMESATNPILQKRFESQLNVLLKE 379

Query: 75  LK-ERKEKTLTED 86
           L+ +RK+  + ED
Sbjct: 380 LELKRKQLEMEED 392



 Score = 29.3 bits (65), Expect = 3.9
 Identities = 14/68 (20%), Positives = 32/68 (47%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
             + + +    +E  E+++ED E E+N++  +    +  + +K  ++K  Q        L
Sbjct: 306 AHKEEVQSDRPDEIGEEKEEDDENEENERHTELLADELNELEKGIEEKRRQMESATNPIL 365

Query: 76  KERKEKTL 83
           ++R E  L
Sbjct: 366 QKRFESQL 373


>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
          includes archaeal L12p, the protein that is
          functionally equivalent to L7/L12 in bacteria and the
          P1 and P2 proteins in eukaryotes. L12p is homologous to
          P1 and P2 but is not homologous to bacterial L7/L12. It
          is located in the L12 stalk, with proteins L10, L11,
          and 23S rRNA. L12p is the only protein in the ribosome
          to occur as multimers, always appearing as sets of
          dimers. Recent data indicate that most archaeal species
          contain six copies of L12p (three homodimers), while
          eukaryotes have four copies (two heterodimers), and
          bacteria may have four or six copies (two or three
          homodimers), depending on the species. The organization
          of proteins within the stalk has been characterized
          primarily in bacteria, where L7/L12 forms either two or
          three homodimers and each homodimer binds to the
          extended C-terminal helix of L10. L7/L12 is attached to
          the ribosome through L10 and is the only ribosomal
          protein that does not directly interact with rRNA.
          Archaeal L12p is believed to function in a similar
          fashion. However, hybrid ribosomes containing the large
          subunit from E. coli with an archaeal stalk are able to
          bind archaeal and eukaryotic elongation factors but not
          bacterial elongation factors. In several mesophilic and
          thermophilic archaeal species, the binding of 23S rRNA
          to protein L11 and to the L10/L12p pentameric complex
          was found to be temperature-dependent and cooperative.
          Length = 106

 Score = 29.8 bits (67), Expect = 0.62
 Identities = 13/25 (52%), Positives = 18/25 (72%)

Query: 16 EKRNKEEEKEENEQEEKEKEDVEEE 40
              K EEKEE +++E+EKE+ EEE
Sbjct: 73 AAEEKAEEKEEEKKKEEEKEEEEEE 97



 Score = 28.2 bits (63), Expect = 2.3
 Identities = 11/27 (40%), Positives = 21/27 (77%)

Query: 49 KKKKKKKKKKKKKKKEEQSVKEIEESL 75
            ++K ++K+++KKKEE+  +E EE+L
Sbjct: 73 AAEEKAEEKEEEKKKEEEKEEEEEEAL 99



 Score = 28.2 bits (63), Expect = 2.5
 Identities = 10/31 (32%), Positives = 19/31 (61%)

Query: 8   AGEKKKRIEKRNKEEEKEENEQEEKEKEDVE 38
           A    +   +  +EE+K+E E+EE+E+E + 
Sbjct: 70  AAAAAEEKAEEKEEEKKKEEEKEEEEEEALA 100



 Score = 27.8 bits (62), Expect = 3.7
 Identities = 8/25 (32%), Positives = 23/25 (92%)

Query: 43 KKKKKKKKKKKKKKKKKKKKKEEQS 67
           ++K ++K+++KKK+++K+++EE++
Sbjct: 74 AEEKAEEKEEEKKKEEEKEEEEEEA 98


>gnl|CDD|226034 COG3503, COG3503, Predicted membrane protein [Function unknown].
          Length = 323

 Score = 31.6 bits (72), Expect = 0.63
 Identities = 26/103 (25%), Positives = 33/103 (32%), Gaps = 20/103 (19%)

Query: 180 LAMCLLGWW----RCFANNYLQWAAVGTWIL-LNTQGWWRCFANNYLQWAA------VGT 228
           LA   LG W    R  A+++L  A V   +        WR      L+ AA        T
Sbjct: 47  LATTALGLWRYFARLIASSFLFLAGVSLSLSHSRGLRRWRFLVKRGLKLAALALAITAVT 106

Query: 229 WILLNTQGFISQGIGLNNKEYLIWYILFIVFVTYAMLPLPLKW 271
           W       FI  GI        +  I     +  A L LP   
Sbjct: 107 WFAF-PDSFIFFGI--------LHAIGLASLLGAAFLWLPRAV 140


>gnl|CDD|220222 pfam09403, FadA, Adhesion protein FadA.  FadA (Fusobacterium
           adhesin A) is an adhesin which forms two alpha helices.
          Length = 126

 Score = 30.3 bits (68), Expect = 0.64
 Identities = 16/71 (22%), Positives = 41/71 (57%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E +   EK+ KE  ++E ++ ++ +   EE   K K++ K +  K + KK  K+ ++ + 
Sbjct: 44  EARFEEEKQEKETAEKEVQELKERQLGREELYAKLKEESKTRFHKDEYKKLLKRYDEYLN 103

Query: 70  EIEESLKERKE 80
           ++E+ + ++++
Sbjct: 104 KLEQKITDKEQ 114


>gnl|CDD|184536 PRK14145, PRK14145, heat shock protein GrpE; Provisional.
          Length = 196

 Score = 31.0 bits (70), Expect = 0.64
 Identities = 26/88 (29%), Positives = 47/88 (53%), Gaps = 14/88 (15%)

Query: 12 KKRIEKR-NKEEEK-----EENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
           + +EK  NKEEEK       NEQ E   ED + ++N+ +++   + ++ K+K ++K+ E
Sbjct: 1  MEEVEKEINKEEEKDVNNLSSNEQMEGPPEDEQAQQNQPQQQTVDEIEELKQKLQQKEVE 60

Query: 66 --------QSVKEIEESLKERKEKTLTE 85
                  Q +K   E+ ++R EK  +E
Sbjct: 61 AQEYLDIAQRLKAEFENYRKRTEKEKSE 88


>gnl|CDD|221250 pfam11831, Myb_Cef, pre-mRNA splicing factor component.  This
           family is a region of the Myb-Related Cdc5p/Cef1
           proteins, in fungi, and is part of the pre-mRNA splicing
           factor complex.
          Length = 363

 Score = 31.6 bits (72), Expect = 0.64
 Identities = 11/44 (25%), Positives = 28/44 (63%), Gaps = 2/44 (4%)

Query: 21  EEEKEENEQEEKEKEDVE--EEKNKKKKKKKKKKKKKKKKKKKK 62
             E+EE E EE E+E  E   +++ +K+  ++ K++++ +++ +
Sbjct: 149 LPEEEEEEPEEMEEELEEDAADRDARKRAAEEAKEQEELRRRSQ 192



 Score = 30.8 bits (70), Expect = 1.2
 Identities = 14/55 (25%), Positives = 30/55 (54%), Gaps = 1/55 (1%)

Query: 16  EKRNKEEEKEENEQEE-KEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
            K   E E  E E+EE +E E+  EE    +  +K+  ++ K++++ ++  Q ++
Sbjct: 141 PKNEFELELPEEEEEEPEEMEEELEEDAADRDARKRAAEEAKEQEELRRRSQVIQ 195


>gnl|CDD|215278 PLN02502, PLN02502, lysyl-tRNA synthetase.
          Length = 553

 Score = 31.5 bits (72), Expect = 0.65
 Identities = 11/70 (15%), Positives = 26/70 (37%), Gaps = 4/70 (5%)

Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES- 74
          E  +K   K+  + ++ E+E   +E+ K        K + +K      E     +   + 
Sbjct: 6  EPLSKNALKKRLKAKQAEEEKAAKEEAKAAAAAAAAKGRSRKSAAADDETMDPTQYRANR 65

Query: 75 ---LKERKEK 81
             ++  + K
Sbjct: 66 LKKVEALRAK 75


>gnl|CDD|117592 pfam09026, Cenp-B_dimeris, Centromere protein B dimerisation
          domain.  The centromere protein B (CENP-B) dimerisation
          domain is composed of two alpha-helices, which are
          folded into an antiparallel configuration. Dimerisation
          of CENP-B is mediated by this domain, in which monomers
          dimerise to form a symmetrical, antiparallel,
          four-helix bundle structure with a large hydrophobic
          patch in which 23 residues of one monomer form van der
          Waals contacts with the other monomer. This CENP-B
          dimer configuration may be suitable for capturing two
          distant CENP-B boxes during centromeric heterochromatin
          formation.
          Length = 101

 Score = 29.7 bits (66), Expect = 0.66
 Identities = 9/28 (32%), Positives = 19/28 (67%)

Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNK 43
          +  + EEE +++E EE + ED +E+ ++
Sbjct: 13 DSDSDEEEDDDDEDEEDDDEDDDEDDDE 40


>gnl|CDD|184064 PRK13461, PRK13461, F0F1 ATP synthase subunit B; Provisional.
          Length = 159

 Score = 30.8 bits (70), Expect = 0.67
 Identities = 24/93 (25%), Positives = 44/93 (47%), Gaps = 16/93 (17%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEE-----KNKKKKKKKKKKKKKKKKKKKKKE 64
            K +R  K  KEE K+  E+ + + E+V EE       +     ++ K + +++K+K + 
Sbjct: 60  LKNERELKNAKEEGKKIVEEYKSKAENVYEEIVKEAHEEADLIIERAKLEAQREKEKAEY 119

Query: 65  E-----------QSVKEIEESLKERKEKTLTED 86
           E            S K +EES+ E + + L +D
Sbjct: 120 EIKNQAVDLAVLLSSKALEESIDESEHRRLIKD 152



 Score = 29.2 bits (66), Expect = 1.7
 Identities = 20/70 (28%), Positives = 36/70 (51%), Gaps = 1/70 (1%)

Query: 12  KKRIEKRNKE-EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
           K  I+ R  E + K E   E+++K    + KN+++ K  K++ KK  ++ K K E   +E
Sbjct: 31  KAVIDSRQSEIDNKIEKADEDQKKARELKLKNERELKNAKEEGKKIVEEYKSKAENVYEE 90

Query: 71  IEESLKERKE 80
           I +   E  +
Sbjct: 91  IVKEAHEEAD 100


>gnl|CDD|150786 pfam10153, DUF2361, Uncharacterized conserved protein (DUF2361). 
          This is a region of 120 amino acids that is conserved
          in a family of proteins found from plants to fungi. The
          function is not known.
          Length = 114

 Score = 29.9 bits (68), Expect = 0.68
 Identities = 26/89 (29%), Positives = 42/89 (47%), Gaps = 16/89 (17%)

Query: 13 KRIEKRNK--EEEKEENEQE----EKEKEDVEEEKNKKKKKKK---------KKKKKKKK 57
          +R+ K+     + + E E+     +KE ED E  K +KK  KK         KK  ++ K
Sbjct: 10 ERLLKKKDLPADVRVEKERALEALKKELEDAERRKKRKKMAKKYHMVRFFERKKATRRLK 69

Query: 58 KKKKKKEEQSVKEIEESLKERKEKTLTED 86
          + +KK EE S ++  + LKE +      D
Sbjct: 70 QLEKKLEETSDEKEIKDLKE-QLSKAEVD 97


>gnl|CDD|187861 cd09730, Cas8a1_I-A, CRISPR/Cas system-associated protein Cas8a1.
           CRISPR (Clustered Regularly Interspaced Short
           Palindromic Repeats) and associated Cas proteins
           comprise a system for heritable host defense by
           prokaryotic cells against phage and other foreign DNA;
           Large proteins, some contain Zn-finger domain; signature
           gene for I-A subtype; also known as TM1802 family.
          Length = 579

 Score = 31.6 bits (72), Expect = 0.69
 Identities = 13/59 (22%), Positives = 24/59 (40%)

Query: 20  KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKER 78
               K      +K K+  E+     + KK+  K K  K   +K  E+ +K+I++     
Sbjct: 87  TYLLKTTEPLNKKIKKWFEKSYLLLESKKEILKLKNIKDVLEKNIEKILKDIQDLKPGE 145


>gnl|CDD|216652 pfam01698, FLO_LFY, Floricaula / Leafy protein.  This family
           consists of various plant development proteins which are
           homologues of floricaula (FLO) and Leafy (LFY) proteins
           which are floral meristem identity proteins. Mutations
           in the sequences of these proteins affect flower and
           leaf development.
          Length = 382

 Score = 31.5 bits (72), Expect = 0.69
 Identities = 10/35 (28%), Positives = 22/35 (62%)

Query: 40  EKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES 74
             +   +KKK++KK+++K+ K+ +E+    E E+ 
Sbjct: 176 GHSSDSEKKKQRKKQRRKRSKELREDDDDDEDEDD 210



 Score = 29.6 bits (67), Expect = 2.6
 Identities = 8/37 (21%), Positives = 25/37 (67%)

Query: 37  VEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
           V    +  +KKK++KK+++K+ K+ ++++   ++ ++
Sbjct: 174 VPGHSSDSEKKKQRKKQRRKRSKELREDDDDDEDEDD 210



 Score = 28.8 bits (65), Expect = 4.1
 Identities = 9/36 (25%), Positives = 20/36 (55%)

Query: 42  NKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKE 77
                  +KKK++KK+++K+ KE +   + +E   +
Sbjct: 175 PGHSSDSEKKKQRKKQRRKRSKELREDDDDDEDEDD 210


>gnl|CDD|220357 pfam09716, ETRAMP, Malarial early transcribed membrane protein
          (ETRAMP).  These sequences represent a family of
          proteins from the malaria parasite Plasmodium
          falciparum, several of which have been shown to be
          expressed specifically in the ring stage as well as the
          rodent parasite Plasmodium yoelii. A homologue from
          Plasmodium chabaudi was localized to the
          parasitophorous vacuole membrane. Members have an
          initial hydrophobic, Phe/Tyr-rich, stretch long enough
          to span the membrane, a highly charged region rich in
          Lys, a second putative transmembrane region and a
          second highly charged, low complexity sequence region.
          Some members have up to 100 residues of additional
          C-terminal sequence. These genes have been shown to be
          found in the sub-telomeric regions of both Plasmodium
          falciparum and P. yoelii chromosomes.
          Length = 82

 Score = 29.1 bits (66), Expect = 0.70
 Identities = 13/25 (52%), Positives = 14/25 (56%)

Query: 39 EEKNKKKKKKKKKKKKKKKKKKKKK 63
            K KK KK KK     +KKKK KK
Sbjct: 28 VAKKKKLKKLKKIDDDLEKKKKNKK 52



 Score = 28.8 bits (65), Expect = 0.91
 Identities = 14/28 (50%), Positives = 17/28 (60%)

Query: 36 DVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
          +V  +K K KK KK     +KKKK KKK
Sbjct: 26 NVVAKKKKLKKLKKIDDDLEKKKKNKKK 53


>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like).  This family
           includes outer membrane proteins such as OmpH among
           others. Skp (OmpH) has been characterized as a molecular
           chaperone that interacts with unfolded proteins as they
           emerge in the periplasm from the Sec translocation
           machinery.
          Length = 157

 Score = 30.7 bits (70), Expect = 0.71
 Identities = 14/64 (21%), Positives = 40/64 (62%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           + +  ++K+ KE +KEE + +++     EE +  K+++ ++K+++ ++K++  ++E   K
Sbjct: 46  KLQAELQKKEKELQKEEQKLQKQAATLSEEARKAKQQELQQKQQELQQKQQAAQQELQQK 105

Query: 70  EIEE 73
           + E 
Sbjct: 106 QQEL 109



 Score = 28.4 bits (64), Expect = 3.7
 Identities = 18/80 (22%), Positives = 47/80 (58%), Gaps = 1/80 (1%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKE-DVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
             K   ++  KE +K + E ++KEKE   EE+K +K+     ++ +K K+++ ++++Q +
Sbjct: 32  AGKAAQKQLEKEFKKLQAELQKKEKELQKEEQKLQKQAATLSEEARKAKQQELQQKQQEL 91

Query: 69  KEIEESLKERKEKTLTEDSQ 88
           ++ +++ ++  ++   E  Q
Sbjct: 92  QQKQQAAQQELQQKQQELLQ 111


>gnl|CDD|227447 COG5117, NOC3, Protein involved in the nuclear export of
           pre-ribosomes [Translation, ribosomal structure and
           biogenesis / Intracellular trafficking and secretion].
          Length = 657

 Score = 31.5 bits (71), Expect = 0.71
 Identities = 21/78 (26%), Positives = 32/78 (41%), Gaps = 2/78 (2%)

Query: 2   RKRRGGAGEKKKRIE-KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
           R       E   R+  K       + N  EE   E  E E + +     K +K+K  +++
Sbjct: 63  RVSSSWNNESYNRLPIKTKDNVVADVNNGEEFLSES-ESEASLEIDSDIKDEKQKSLEEQ 121

Query: 61  KKKEEQSVKEIEESLKER 78
           K   E  VK+  +S KER
Sbjct: 122 KIAPEIPVKQQIDSEKER 139



 Score = 28.1 bits (62), Expect = 10.0
 Identities = 16/61 (26%), Positives = 32/61 (52%), Gaps = 7/61 (11%)

Query: 27  NEQEEKEKEDVEEEKNKKKKKKKKK------KKKKKKKKKKKKEEQSVKEIEESLKERKE 80
           N   ++EK     E NK   K++KK      KK +K +K++ + +  +++  E ++E  E
Sbjct: 338 NSLRDEEKARKYWEANKPVSKREKKDIFHLSKKLRKIEKERLRIQSEMRD-AEDIEEAIE 396

Query: 81  K 81
           +
Sbjct: 397 E 397


>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
          Length = 619

 Score = 31.7 bits (73), Expect = 0.71
 Identities = 17/67 (25%), Positives = 32/67 (47%), Gaps = 9/67 (13%)

Query: 21  EEEKEENEQEEKEKEDVEEE---------KNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
           E+E+EE ++ +      E E         K   K+ KK +K ++KK + +  + +   ++
Sbjct: 196 EDEEEEEDENDDSLAADESELPEKVLEKFKALAKQYKKLRKAQEKKVEGRLAQHKKYAKL 255

Query: 72  EESLKER 78
            E LKE 
Sbjct: 256 REKLKEE 262



 Score = 31.3 bits (72), Expect = 0.81
 Identities = 15/82 (18%), Positives = 39/82 (47%), Gaps = 11/82 (13%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKK-----------KKKKKK 58
             ++       E E+ +++++E+E+ED  ++     + +  +K            KK +K
Sbjct: 177 NAEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADESELPEKVLEKFKALAKQYKKLRK 236

Query: 59  KKKKKEEQSVKEIEESLKERKE 80
            ++KK E  + + ++  K R++
Sbjct: 237 AQEKKVEGRLAQHKKYAKLREK 258


>gnl|CDD|206063 pfam13892, DBINO, DNA-binding domain.  DBINO is a DNA-binding
           domain found on global transcription activator SNF2L1
           proteins and chromatin re-modelling proteins.
          Length = 140

 Score = 30.3 bits (69), Expect = 0.72
 Identities = 14/37 (37%), Positives = 21/37 (56%), Gaps = 4/37 (10%)

Query: 13  KRIEK----RNKEEEKEENEQEEKEKEDVEEEKNKKK 45
           K+ EK      K  EKE  EQ +KE+E  E ++ ++K
Sbjct: 79  KKNEKEERELRKRAEKEALEQAKKEEELREAKRQQRK 115


>gnl|CDD|219953 pfam08648, DUF1777, Protein of unknown function (DUF1777).  This is
           a family of eukaryotic proteins of unknown function.
           Some of the proteins in this family are putative nucleic
           acid binding proteins.
          Length = 158

 Score = 30.6 bits (69), Expect = 0.74
 Identities = 19/92 (20%), Positives = 39/92 (42%), Gaps = 16/92 (17%)

Query: 2   RKRRGGAGEKKKRIEK-----RNKEEEKEENEQEEKEKEDVEEEKNKK---------KKK 47
           R RR G    + R E+     R++E ++    +             ++         +++
Sbjct: 11  RSRRRGRSRSRDRRERRRERSRSRERDRRRRSRSRSPHRSRRSRSPRRHRSRSRSPSRRR 70

Query: 48  KKKKKKKKKKKKKKKKEEQSVKEIEESLKERK 79
            +K+++ K  ++ KK+E Q  K I+E   E K
Sbjct: 71  DRKRERDKDAREPKKRERQ--KLIKEEDLEGK 100



 Score = 27.5 bits (61), Expect = 7.9
 Identities = 9/65 (13%), Positives = 33/65 (50%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
           R+ R  +  + +R     +   +  +    ++++   ++  ++ KK++++K  K++  + 
Sbjct: 40  RRSRSRSPHRSRRSRSPRRHRSRSRSPSRRRDRKRERDKDAREPKKRERQKLIKEEDLEG 99

Query: 62  KKEEQ 66
           K +E+
Sbjct: 100 KSDEE 104



 Score = 27.1 bits (60), Expect = 9.5
 Identities = 14/72 (19%), Positives = 34/72 (47%), Gaps = 2/72 (2%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
           R+RR  +    +    R+    +  +    + ++   E    K  ++ KK++++K  K++
Sbjct: 38  RRRRSRSRSPHRSRRSRSPRRHRSRSRSPSRRRDRKRERD--KDAREPKKRERQKLIKEE 95

Query: 62  KKEEQSVKEIEE 73
             E +S +E+E 
Sbjct: 96  DLEGKSDEEVEM 107


>gnl|CDD|219783 pfam08297, U3_snoRNA_assoc, U3 snoRNA associated.  This family of
          proteins is associated with U3 snoRNA. U3 snoRNA is
          required for nucleolar processing of pre-18S ribosomal
          RNA.
          Length = 87

 Score = 29.3 bits (66), Expect = 0.75
 Identities = 12/41 (29%), Positives = 24/41 (58%)

Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
          +E++      E  +E+++++K K K +  KK +K+ K  KK
Sbjct: 12 DEDRPPTPPPESYEEELKKQKKKNKLRFLKKLEKRPKDVKK 52



 Score = 28.2 bits (63), Expect = 2.2
 Identities = 13/43 (30%), Positives = 21/43 (48%)

Query: 27 NEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
             +E        E  +++ KK+KKK K +  KK +K  + VK
Sbjct: 9  EAIDEDRPPTPPPESYEEELKKQKKKNKLRFLKKLEKRPKDVK 51


>gnl|CDD|233605 TIGR01865, cas_Csn1, CRISPR-associated protein Cas9/Csn1, subtype
           II/NMEMI.  CRISPR loci appear to be mobile elements with
           a wide host range. This model represents a protein found
           only in CRISPR-containing species, near other
           CRISPR-associated proteins (cas), as part of the NMENI
           subtype of CRISPR/Cas locus. The species range so far
           for this protein is animal pathogens and commensals only
           [Mobile and extrachromosomal element functions, Other].
          Length = 805

 Score = 31.6 bits (72), Expect = 0.77
 Identities = 15/64 (23%), Positives = 26/64 (40%), Gaps = 4/64 (6%)

Query: 19  NKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKER 78
              E+ E     ++EK+++ +       KKKK   KK +K     E+   K +     + 
Sbjct: 301 IILEQGETKILSKEEKQELLDLL----FKKKKLTYKKLRKLLGLSEDAIFKGLRYEGLDN 356

Query: 79  KEKT 82
            EK 
Sbjct: 357 AEKA 360



 Score = 28.5 bits (64), Expect = 7.4
 Identities = 10/36 (27%), Positives = 18/36 (50%)

Query: 42  NKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKE 77
            +     K+  K++ KK + K +E +    +E LKE
Sbjct: 526 EQGTNFGKRNSKERYKKNEDKIKEFASALGKEILKE 561


>gnl|CDD|221643 pfam12572, DUF3752, Protein of unknown function (DUF3752).  This
           domain family is found in eukaryotes, and is typically
           between 140 and 163 amino acids in length.
          Length = 148

 Score = 30.5 bits (69), Expect = 0.78
 Identities = 20/71 (28%), Positives = 36/71 (50%), Gaps = 13/71 (18%)

Query: 10  EKKKR-------IEKRNKEEEKEENEQEEKEKEDVE--EEKNKKKKKK----KKKKKKKK 56
           EK KR       +E           +  +++KE     +E N+KK+ K    + +KK+KK
Sbjct: 41  EKAKRLQDEVLGVEAGASAPAAASAKASKRDKEMARKVKEYNEKKRGKSLVEQHQKKQKK 100

Query: 57  KKKKKKKEEQS 67
           KKK+++ ++ S
Sbjct: 101 KKKEEENDDPS 111


>gnl|CDD|217943 pfam04180, LTV, Low temperature viability protein.  The
           low-temperature viability protein LTV1 is involved in
           ribosome biogenesis 40S subunit production.
          Length = 426

 Score = 31.5 bits (71), Expect = 0.78
 Identities = 21/87 (24%), Positives = 34/87 (39%), Gaps = 4/87 (4%)

Query: 10  EKKKRIEKRNKEEEKEE---NEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
           E+ K+ EK+N + E      +E E   + D    +       K +K K  K K       
Sbjct: 255 ERFKKFEKKNDDVEIGAADNSELEGSIEVDSNRLQEVLNDYYKTEKAK-SKVKLGAMTPF 313

Query: 67  SVKEIEESLKERKEKTLTEDSQCDTMI 93
           S + +  S  +R EK  T D + +   
Sbjct: 314 SDQSMSSSALDRTEKLTTLDDRFEKAE 340


>gnl|CDD|163426 TIGR03714, secA2, accessory Sec system translocase SecA2.  Members
           of this protein family are homologous to SecA and part
           of the accessory Sec system. This system, including both
           five core proteins for export and a variable number of
           proteins for glycosylation, operates in certain
           Gram-positive pathogens for the maturation and delivery
           of serine-rich glycoproteins such as the cell surface
           glycoprotein GspB in Streptococcus gordonii [Protein
           fate, Protein and peptide secretion and trafficking].
          Length = 762

 Score = 31.5 bits (72), Expect = 0.79
 Identities = 15/66 (22%), Positives = 30/66 (45%), Gaps = 6/66 (9%)

Query: 26  ENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK------EIEESLKERK 79
              ++  +K  V++ K K     K++ +K  +K ++  E++         E EESL  ++
Sbjct: 547 SWLKKYYKKYSVKDSKLKPSALFKRRFRKIVEKAQRASEDKGESAREQTNEFEESLSIQR 606

Query: 80  EKTLTE 85
           E    E
Sbjct: 607 ENIYAE 612


>gnl|CDD|227509 COG5182, CUS1, Splicing factor 3b, subunit 2 [RNA processing and
          modification].
          Length = 429

 Score = 31.5 bits (71), Expect = 0.80
 Identities = 17/75 (22%), Positives = 30/75 (40%), Gaps = 1/75 (1%)

Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
          +K+    +NK      N+ E     D    + +KKK      K K  K    K E+  K+
Sbjct: 7  RKRSGNNQNKNASVVNNKAEIAAMIDARRLE-QKKKGGVTNSKGKTNKVVDAKLEKEFKD 65

Query: 71 IEESLKERKEKTLTE 85
          + +  + ++  T  E
Sbjct: 66 VLQRFQVQENDTPKE 80


>gnl|CDD|222417 pfam13851, GAS, Growth-arrest specific micro-tubule binding.  This
           family is the highly conserved central region of a
           number of metazoan proteins referred to as growth-arrest
           proteins. In mouse, Gas8 is predominantly a testicular
           protein, whose expression is developmentally regulated
           during puberty and spermatogenesis. In humans, it is
           absent in infertile males who lack the ability to
           generate gametes. The localisation of Gas8 in the
           motility apparatus of post-meiotic gametocytes and
           mature spermatozoa, together with the detection of Gas8
           also in cilia at the apical surfaces of epithelial cells
           lining the pulmonary bronchi and Fallopian tubes
           suggests that the Gas8 protein may have a role in the
           functioning of motile cellular appendages. Gas8 is a
           microtubule-binding protein localised to regions of
           dynein regulation in mammalian cells.
          Length = 201

 Score = 30.7 bits (70), Expect = 0.80
 Identities = 27/89 (30%), Positives = 45/89 (50%), Gaps = 13/89 (14%)

Query: 10  EKKKRIEKRNKEEEKE--ENEQE--------EKEKEDVEEEKNKKKKKKKKKKKKKKKKK 59
           E+   ++K  +  EK   E  QE        +K +E+VEE + K K  +K K+  K  K 
Sbjct: 34  EEIAEMKKNEEHNEKLMAEIAQENKRLVEPLKKAEEEVEELRKKLKDYEKDKQSLKNLKA 93

Query: 60  KKKKEEQSVKEIE---ESLKERKEKTLTE 85
           + K+ E+ +K ++   E L++R EK   E
Sbjct: 94  RLKELEKELKNLKWESEVLEQRFEKVERE 122



 Score = 28.0 bits (63), Expect = 7.0
 Identities = 15/62 (24%), Positives = 36/62 (58%), Gaps = 1/62 (1%)

Query: 18 RNKEEEKEENEQE-EKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLK 76
          + +  E ++NE+  EK   ++ +E  +  +  KK +++ ++ +KK K+ +  K+  ++LK
Sbjct: 33 KEEIAEMKKNEEHNEKLMAEIAQENKRLVEPLKKAEEEVEELRKKLKDYEKDKQSLKNLK 92

Query: 77 ER 78
           R
Sbjct: 93 AR 94


>gnl|CDD|204032 pfam08703, PLC-beta_C, PLC-beta C terminal.  This domain
           corresponds to the alpha helical C terminal domain of
           phospholipase C beta.
          Length = 181

 Score = 30.5 bits (69), Expect = 0.80
 Identities = 27/96 (28%), Positives = 50/96 (52%), Gaps = 18/96 (18%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKK----KKKKKKKKKKKEE-- 65
           K++I+K  +   +++  Q +K KE  E+EK + KKK  +K+    ++ K K+K  +E   
Sbjct: 36  KEQIQKLTELAREKQAAQLKKLKEISEKEKKELKKKLDRKRLERIQEAKTKEKHAQEREK 95

Query: 66  ------------QSVKEIEESLKERKEKTLTEDSQC 89
                       QS+K +EE+ K R+EK   + ++ 
Sbjct: 96  TEINRSHIQEVVQSIKRLEEAQKRRQEKLEEKQAEI 131


>gnl|CDD|223095 COG0016, PheS, Phenylalanyl-tRNA synthetase alpha subunit
          [Translation, ribosomal structure and biogenesis].
          Length = 335

 Score = 31.0 bits (71), Expect = 0.84
 Identities = 14/72 (19%), Positives = 29/72 (40%), Gaps = 1/72 (1%)

Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK- 69
           KK I       +  +  +E K K   ++       KK  K    +++K+       +K 
Sbjct: 10 AKKGIAAIELASDLLKALEELKVKYLGKKGVLTDLLKKLGKLSPLEERKEVGALINELKK 69

Query: 70 EIEESLKERKEK 81
          E+E+++ E   +
Sbjct: 70 EVEDAITELTPE 81


>gnl|CDD|223599 COG0525, ValS, Valyl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 877

 Score = 31.4 bits (72), Expect = 0.86
 Identities = 14/55 (25%), Positives = 26/55 (47%), Gaps = 1/55 (1%)

Query: 30  EEKEKEDVEEEKNKKKKKKKKKKKK-KKKKKKKKKEEQSVKEIEESLKERKEKTL 83
              E   +E+E  K +K+  + +KK   +    K  E+ V++ +E L E + K  
Sbjct: 812 LAAELARLEKELEKLEKEIDRIEKKLSNEGFVAKAPEEVVEKEKEKLAEYQVKLA 866



 Score = 31.1 bits (71), Expect = 1.2
 Identities = 17/63 (26%), Positives = 33/63 (52%), Gaps = 8/63 (12%)

Query: 26  ENEQEEKEKEDVEEEKNKKKKK--------KKKKKKKKKKKKKKKKEEQSVKEIEESLKE 77
           E  + EKE E +E+E ++ +KK        K  ++  +K+K+K  + +  + ++EE L  
Sbjct: 815 ELARLEKELEKLEKEIDRIEKKLSNEGFVAKAPEEVVEKEKEKLAEYQVKLAKLEERLAV 874

Query: 78  RKE 80
            K 
Sbjct: 875 LKA 877



 Score = 28.0 bits (63), Expect = 9.2
 Identities = 17/65 (26%), Positives = 30/65 (46%), Gaps = 4/65 (6%)

Query: 6   GGAGEKKKRIEKRNKEEEKEENEQEEKEK----EDVEEEKNKKKKKKKKKKKKKKKKKKK 61
            G  +    + +  KE EK E E +  EK    E    +  ++  +K+K+K  + + K  
Sbjct: 807 AGLIDLAAELARLEKELEKLEKEIDRIEKKLSNEGFVAKAPEEVVEKEKEKLAEYQVKLA 866

Query: 62  KKEEQ 66
           K EE+
Sbjct: 867 KLEER 871


>gnl|CDD|240334 PTZ00258, PTZ00258, GTP-binding protein; Provisional.
          Length = 390

 Score = 31.1 bits (71), Expect = 0.89
 Identities = 17/38 (44%), Positives = 26/38 (68%)

Query: 42  NKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERK 79
            K+  +  KK+KKKKKKK++K E   +K++ E L+E K
Sbjct: 160 EKRLDELTKKRKKKKKKKEEKVELDVLKKVLEWLEEGK 197


>gnl|CDD|184724 PRK14520, rpsP, 30S ribosomal protein S16; Provisional.
          Length = 155

 Score = 30.0 bits (68), Expect = 0.92
 Identities = 9/54 (16%), Positives = 15/54 (27%)

Query: 20  KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
           K  + E       E +     +    KKKK   +    +      E +     E
Sbjct: 99  KPSKLELFNAALAEADGGPTAEATTPKKKKAAAEAAAAEAAAPAAEAAAAAAAE 152



 Score = 28.9 bits (65), Expect = 2.8
 Identities = 6/48 (12%), Positives = 11/48 (22%)

Query: 30  EEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKE 77
                E       +    KKKK   +    +             + +E
Sbjct: 107 NAALAEADGGPTAEATTPKKKKAAAEAAAAEAAAPAAEAAAAAAAEEE 154


>gnl|CDD|234537 TIGR04313, aro_clust_Mycop, aromatic cluster surface protein.
           Members of this family are absolutely restricted to the
           Mollicutes (Mycoplasma and Ureaplasma). All have a
           signal peptide, usually of the lipoprotein type,
           suggesting surface expression. Most members have lengths
           of about 280 residues but some members have a nearly
           full-length duplication. The mostly nearly invariant
           residue, a Trp,is part of a strongly conserved 9-residue
           motif, [ND]-W-[LY]-[WF]-X-[LF]-X-N-[LI], where X usually
           is hydrophobic. Because the hydrophobic six-residue core
           of this motif almost always contains three to four
           aromatic residues, we name this family aromatic cluster
           surface protein. Multiple paralogs may occur in a given
           Mycoplasma, usually clustered on the genome.
          Length = 297

 Score = 30.8 bits (70), Expect = 0.94
 Identities = 9/76 (11%), Positives = 29/76 (38%), Gaps = 6/76 (7%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEE------EKNKKKKKKKKKKKKKKKKKKKKK 63
           +K  + +   K+++K++  +     + +        + ++ +K K  K +K    +   K
Sbjct: 33  KKLIKNKFDVKKDKKDKKWKVFLNNKYINNLLNLVFKNDEDEKNKYIKSQKNINSELYLK 92

Query: 64  EEQSVKEIEESLKERK 79
             +   E   ++    
Sbjct: 93  LLKLSLEYLNNINNSS 108


>gnl|CDD|223966 COG1035, FrhB, Coenzyme F420-reducing hydrogenase, beta subunit
           [Energy production and conversion].
          Length = 332

 Score = 30.8 bits (70), Expect = 0.94
 Identities = 12/36 (33%), Positives = 19/36 (52%)

Query: 30  EEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
           E KE E+V       +K  KKK K+  K+ +++ E 
Sbjct: 289 EVKEIEEVALGLRLVEKLAKKKLKRAAKEPEERVEF 324


>gnl|CDD|221586 pfam12457, TIP_N, Tuftelin interacting protein N terminal.  This
           domain family is found in eukaryotes, and is typically
           between 99 and 114 amino acids in length. The family is
           found in association with pfam08697, pfam01585. There
           are two completely conserved residues (G and F) that may
           be functionally important. TIP is involved in enamel
           assembly by interacting with one of the major proteins
           responsible for biomineralisation of enamel - tuftelin.
          Length = 106

 Score = 29.3 bits (66), Expect = 0.96
 Identities = 12/53 (22%), Positives = 26/53 (49%), Gaps = 11/53 (20%)

Query: 2   RKRRGGAGEKKKR-----------IEKRNKEEEKEENEQEEKEKEDVEEEKNK 43
           R R     +KKK            I++  KE E+++ E E ++ ++ +E+ ++
Sbjct: 52  RGRGRRRSKKKKDYSKPVAFVSGGIKQAAKEPEEDDKEDESEDDDESDEDDDE 104


>gnl|CDD|219186 pfam06810, Phage_GP20, Phage minor structural protein GP20.  This
          family consists of several phage minor structural
          protein GP20 sequences of around 180 residues in
          length. The function of this family is unknown.
          Length = 156

 Score = 30.0 bits (68), Expect = 0.98
 Identities = 19/65 (29%), Positives = 36/65 (55%), Gaps = 3/65 (4%)

Query: 20 KEEEKEENEQEEKEKE---DVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLK 76
          K++ K  ++Q E  K+   D EE + K +K K++ K  K++ + K  + +    IE +LK
Sbjct: 33 KKQLKNRDKQIEDLKKKVKDNEELQKKIEKLKQQNKTAKEEYEAKLADTKLNNAIELALK 92

Query: 77 ERKEK 81
          + K +
Sbjct: 93 KAKAR 97


>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
           unknown].
          Length = 652

 Score = 31.2 bits (71), Expect = 0.98
 Identities = 18/71 (25%), Positives = 43/71 (60%), Gaps = 3/71 (4%)

Query: 12  KKRIEKRNKEEEKEENEQEE-KEKEDVEEEKNKKKKKKKKK--KKKKKKKKKKKKEEQSV 68
           K+ +E+  +E EK E+E E  + +   +  K+++ + + ++  + +K+ ++KKK+ E+  
Sbjct: 442 KRELEELKREIEKLESELERFRREVRDKVRKDREIRARDRRIERLEKELEEKKKRVEELE 501

Query: 69  KEIEESLKERK 79
           +++ E  K RK
Sbjct: 502 RKLAELRKMRK 512



 Score = 30.8 bits (70), Expect = 1.2
 Identities = 22/84 (26%), Positives = 43/84 (51%), Gaps = 5/84 (5%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKK----KKKKKKK 57
            + R      +KRI+K  +  E+ E E  E ++E +EE K + +K + +    +++ + K
Sbjct: 411 EEERREITVYEKRIKKLEETVERLEEENSELKRE-LEELKREIEKLESELERFRREVRDK 469

Query: 58  KKKKKKEEQSVKEIEESLKERKEK 81
            +K ++     + IE   KE +EK
Sbjct: 470 VRKDREIRARDRRIERLEKELEEK 493



 Score = 30.8 bits (70), Expect = 1.2
 Identities = 20/82 (24%), Positives = 39/82 (47%), Gaps = 6/82 (7%)

Query: 9   GEKKKRIEKRNK------EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
            E  +R+E+ N       EE K E E+ E E E    E   K +K ++ + + ++ ++ +
Sbjct: 428 EETVERLEEENSELKRELEELKREIEKLESELERFRREVRDKVRKDREIRARDRRIERLE 487

Query: 63  KEEQSVKEIEESLKERKEKTLT 84
           KE +  K+  E L+ +  +   
Sbjct: 488 KELEEKKKRVEELERKLAELRK 509



 Score = 30.4 bits (69), Expect = 1.6
 Identities = 13/80 (16%), Positives = 38/80 (47%), Gaps = 12/80 (15%)

Query: 13  KRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKK------------KKKKKKKKKK 60
           K  E+    E++   E+E +E    E+   K ++  ++            ++ K++ +K 
Sbjct: 396 KVKEEERPREKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELEELKREIEKL 455

Query: 61  KKKEEQSVKEIEESLKERKE 80
           + + E+  +E+ + +++ +E
Sbjct: 456 ESELERFRREVRDKVRKDRE 475



 Score = 28.9 bits (65), Expect = 4.7
 Identities = 19/61 (31%), Positives = 31/61 (50%), Gaps = 1/61 (1%)

Query: 21  EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
            + KEE    EKE    EEE+ +    +K+ KK ++  ++ ++E   +K   E LK   E
Sbjct: 395 SKVKEEERPREKE-GTEEEERREITVYEKRIKKLEETVERLEEENSELKRELEELKREIE 453

Query: 81  K 81
           K
Sbjct: 454 K 454


>gnl|CDD|223731 COG0658, ComEC, Predicted membrane metal-binding protein [General
           function prediction only].
          Length = 453

 Score = 30.9 bits (70), Expect = 0.99
 Identities = 28/128 (21%), Positives = 43/128 (33%), Gaps = 14/128 (10%)

Query: 175 TMMSNLAM-CLLGWWRCFANNYLQWAAVGTWILLNTQGWWRCFANNYLQWAAVGTWILLN 233
             +S LA+  L+ W + F    L W    T  LL T           L        +LLN
Sbjct: 338 FWLSFLAVAALILWSQLFRLLGLSWPLPITLALLATLPLL-ILQFGRLGL----LGLLLN 392

Query: 234 TQGFISQGIGLNNKEYLIWYILFIVFVTYAMLPLPLKWCFIGGCTTSVLHVIITAQIKLH 293
               +S  I L +   L+   L    V++ ++PL L              +I  A  ++ 
Sbjct: 393 --LIVSPLISLISLPGLLANFLLSPLVSFLIVPLILLG------LLLAPLLIGAALARVR 444

Query: 294 RGPLSPCS 301
            G      
Sbjct: 445 PGVFGLIQ 452


>gnl|CDD|220303 pfam09592, DUF2031, Protein of unknown function (DUF2031).  This
           protein is expressed in Plasmodium; its function is
           unknown. It may be the product of gene family pyst-b.
          Length = 228

 Score = 30.4 bits (69), Expect = 1.0
 Identities = 20/60 (33%), Positives = 26/60 (43%), Gaps = 11/60 (18%)

Query: 13  KRIEKRN-----------KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
           KRI K++           K+ E EEN  E +  E       KK K  +K KK +KK  K 
Sbjct: 144 KRIIKKDENSSVSEHEDFKQLENEENFLETEYNEITSSNNYKKLKINRKLKKAEKKLIKS 203


>gnl|CDD|221712 pfam12687, DUF3801, Protein of unknown function (DUF3801).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 158 and 187 amino
           acids in length. This family includes the PcfB protein.
          Length = 137

 Score = 29.9 bits (68), Expect = 1.0
 Identities = 14/37 (37%), Positives = 25/37 (67%)

Query: 28  EQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
           ++E+ EK  V ++  K K+++  KKK++K K K KK+
Sbjct: 101 KKEQDEKPSVRKKLKKFKEEQAAKKKERKDKVKNKKK 137



 Score = 28.8 bits (65), Expect = 2.4
 Identities = 15/45 (33%), Positives = 26/45 (57%)

Query: 13  KRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK 57
           K    +  ++E++E     K+ +  +EE+  KKK++K K K KKK
Sbjct: 93  KEFTAKKLKKEQDEKPSVRKKLKKFKEEQAAKKKERKDKVKNKKK 137


>gnl|CDD|223562 COG0488, Uup, ATPase components of ABC transporters with duplicated
           ATPase domains [General function prediction only].
          Length = 530

 Score = 31.1 bits (71), Expect = 1.0
 Identities = 16/85 (18%), Positives = 32/85 (37%), Gaps = 2/85 (2%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
           +++ E+  +E      E+++KE    +E   + K    K KK K + K+ +K E  + E 
Sbjct: 240 EQKAERLRQEAA--AYEKQQKELAKEQEWIRRGKAAASKAKKAKSRIKRLEKLEARLAEE 297

Query: 72  EESLKERKEKTLTEDSQCDTMIPVF 96
               + +                V 
Sbjct: 298 RPVEEGKPLAFRFPPPGKRLGKLVL 322



 Score = 30.7 bits (70), Expect = 1.3
 Identities = 13/62 (20%), Positives = 27/62 (43%), Gaps = 8/62 (12%)

Query: 28  EQEEKEKEDVEEEKNKKKKKKKKK--------KKKKKKKKKKKKEEQSVKEIEESLKERK 79
            +++ E+   E    +K++K+  K        K    K KK K   + ++++E  L E +
Sbjct: 239 LEQKAERLRQEAAAYEKQQKELAKEQEWIRRGKAAASKAKKAKSRIKRLEKLEARLAEER 298

Query: 80  EK 81
             
Sbjct: 299 PV 300


>gnl|CDD|224427 COG1510, COG1510, Predicted transcriptional regulators
           [Transcription].
          Length = 177

 Score = 30.4 bits (69), Expect = 1.0
 Identities = 15/73 (20%), Positives = 29/73 (39%), Gaps = 3/73 (4%)

Query: 3   KRRGGAGEKKKRIEKRNKE---EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKK 59
             R    EK KR     KE   +  EE  ++  +++  E  K  K K ++  K  +   +
Sbjct: 93  IFRTLFEEKWKREIDPTKEALKKLLEELNEDLDDRDLTERIKEIKSKLERLLKWSEDYYE 152

Query: 60  KKKKEEQSVKEIE 72
              +  + ++  E
Sbjct: 153 LLTRLLEFLESEE 165


>gnl|CDD|223619 COG0545, FkpA, FKBP-type peptidyl-prolyl cis-trans isomerases 1
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 205

 Score = 30.5 bits (69), Expect = 1.0
 Identities = 9/51 (17%), Positives = 18/51 (35%)

Query: 17  KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
           K    E  +  EQ  ++    +  K+      + K   +K  K+K  +   
Sbjct: 50  KEAALEALQRAEQRLQQAAKAKRAKDAAANAAEGKAFLEKNAKEKGVKTLP 100



 Score = 29.3 bits (66), Expect = 2.4
 Identities = 9/54 (16%), Positives = 21/54 (38%), Gaps = 1/54 (1%)

Query: 8  AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
          A  +K+   +  +  E+   +  + ++       N  + K   +K  K+K  K 
Sbjct: 46 ALVQKEAALEALQRAEQRLQQAAKAKRAKDAAA-NAAEGKAFLEKNAKEKGVKT 98


>gnl|CDD|223250 COG0172, SerS, Seryl-tRNA synthetase [Translation, ribosomal
          structure and biogenesis].
          Length = 429

 Score = 31.0 bits (71), Expect = 1.1
 Identities = 16/76 (21%), Positives = 42/76 (55%), Gaps = 6/76 (7%)

Query: 12 KKRIEKRNKEEEKEEN--EQEEKEKE---DVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
          +++++KR  +    +   E +E+ ++   ++EE + ++ +  K+  +  K+ +   +E  
Sbjct: 15 REKLKKRGGDALDVDKLLELDEERRKLLRELEELQAERNELSKEIGRALKRGEDDAEELI 74

Query: 67 S-VKEIEESLKERKEK 81
          + VKE++E LKE +  
Sbjct: 75 AEVKELKEKLKELEAA 90



 Score = 28.3 bits (64), Expect = 7.3
 Identities = 12/67 (17%), Positives = 34/67 (50%), Gaps = 3/67 (4%)

Query: 15 IEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES 74
          +++  ++  +E  E + +  E  +E     K+ +   ++   + K+ K++   +KE+E +
Sbjct: 34 LDEERRKLLRELEELQAERNELSKEIGRALKRGEDDAEELIAEVKELKEK---LKELEAA 90

Query: 75 LKERKEK 81
          L E + +
Sbjct: 91 LDELEAE 97


>gnl|CDD|236277 PRK08493, PRK08493, NADH dehydrogenase subunit G; Validated.
          Length = 819

 Score = 31.2 bits (71), Expect = 1.1
 Identities = 14/53 (26%), Positives = 26/53 (49%)

Query: 29  QEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
           +EE   + +EE K    K+      ++ ++K  ++ EQ  +  EE  KE  +K
Sbjct: 444 EEEAILKSLEEFKQSIVKEAALSILEEIREKVLEQAEQGCENQEEVKKEVPKK 496



 Score = 29.7 bits (67), Expect = 3.0
 Identities = 16/75 (21%), Positives = 35/75 (46%), Gaps = 3/75 (4%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK---EEQSV 68
            K       EE +E+  ++ ++  + +EE  K+  KK KK  +       ++    E++ 
Sbjct: 460 VKEAALSILEEIREKVLEQAEQGCENQEEVKKEVPKKVKKIPEVDTYLLLEELGINEETY 519

Query: 69  KEIEESLKERKEKTL 83
           +++E  L ++   TL
Sbjct: 520 EKLEALLAKKNNFTL 534



 Score = 29.3 bits (66), Expect = 3.6
 Identities = 10/59 (16%), Positives = 27/59 (45%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES 74
           E   K  E+ +    ++    + EE  +K  ++ ++  + +++ KK+  ++  K  E  
Sbjct: 446 EAILKSLEEFKQSIVKEAALSILEEIREKVLEQAEQGCENQEEVKKEVPKKVKKIPEVD 504


>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
           binding fragment.  Topoisomerase I promotes the
           relaxation of DNA superhelical tension by introducing a
           transient single-stranded break in duplex DNA and are
           vital for the processes of replication, transcription,
           and recombination. This family may be more than one
           structural domain.
          Length = 215

 Score = 30.2 bits (69), Expect = 1.1
 Identities = 14/32 (43%), Positives = 19/32 (59%)

Query: 34  KEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
            E  E EK KKK   K++KK  K++K K +E 
Sbjct: 92  YEYFEAEKEKKKAMSKEEKKAIKEEKDKLEEP 123



 Score = 28.7 bits (65), Expect = 4.4
 Identities = 10/24 (41%), Positives = 15/24 (62%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKE 33
           EKKK + K  K+  KEE ++ E+ 
Sbjct: 100 EKKKAMSKEEKKAIKEEKDKLEEP 123



 Score = 27.9 bits (63), Expect = 6.6
 Identities = 12/27 (44%), Positives = 18/27 (66%)

Query: 40  EKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
           E  K+KKK   K++KK  K++K K E+
Sbjct: 96  EAEKEKKKAMSKEEKKAIKEEKDKLEE 122



 Score = 27.9 bits (63), Expect = 7.5
 Identities = 16/56 (28%), Positives = 29/56 (51%), Gaps = 2/56 (3%)

Query: 18  RNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
           R   +++     ++ EK D        + +K+KKK   K++KK  KEE+   ++EE
Sbjct: 69  RKILKKEGGAIIKDLEKCDFTPIYEYFEAEKEKKKAMSKEEKKAIKEEK--DKLEE 122


>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like).  This
          family includes outer membrane proteins such as OmpH
          among others. Skp (OmpH) has been characterized as a
          molecular chaperone that interacts with unfolded
          proteins as they emerge in the periplasm from the Sec
          translocation machinery.
          Length = 140

 Score = 29.9 bits (68), Expect = 1.1
 Identities = 18/67 (26%), Positives = 41/67 (61%), Gaps = 3/67 (4%)

Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDV---EEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
          E KKR  +  K E++ +  +E+ +K+     E  + KK+K+ +KK ++ ++K++K +++ 
Sbjct: 26 EFKKRQAELEKLEKELQKLKEKLQKDAATLSEAAREKKEKELQKKVQEFQRKQQKLQQDL 85

Query: 67 SVKEIEE 73
            ++ EE
Sbjct: 86 QKRQQEE 92



 Score = 28.7 bits (65), Expect = 2.8
 Identities = 18/72 (25%), Positives = 46/72 (63%), Gaps = 2/72 (2%)

Query: 12 KKRIEKRNKEEEKE-ENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS-VK 69
          +K++EK  K+ + E E  ++E +K   + +K+     +  ++KK+K+ +KK +E Q   +
Sbjct: 20 QKQLEKEFKKRQAELEKLEKELQKLKEKLQKDAATLSEAAREKKEKELQKKVQEFQRKQQ 79

Query: 70 EIEESLKERKEK 81
          ++++ L++R+++
Sbjct: 80 KLQQDLQKRQQE 91



 Score = 27.5 bits (62), Expect = 6.3
 Identities = 17/57 (29%), Positives = 29/57 (50%)

Query: 24 KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
          K   +Q EKE +  + E  K +K+ +K K+K +K      E    K+ +E  K+ +E
Sbjct: 17 KAAQKQLEKEFKKRQAELEKLEKELQKLKEKLQKDAATLSEAAREKKEKELQKKVQE 73


>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457).  This
          is a family of uncharacterized proteins.
          Length = 449

 Score = 30.7 bits (69), Expect = 1.1
 Identities = 8/40 (20%), Positives = 22/40 (55%)

Query: 51 KKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCD 90
          KK+   +K  K+ EE++++E ++  ++  +    ++   D
Sbjct: 33 KKENAIRKLGKEAEEEAMEEEDDDEEDDDDDDDEDEDDDD 72



 Score = 30.7 bits (69), Expect = 1.3
 Identities = 11/32 (34%), Positives = 21/32 (65%)

Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKN 42
          K+  I K  KE E+E  E+E+ ++ED +++ +
Sbjct: 34 KENAIRKLGKEAEEEAMEEEDDDEEDDDDDDD 65



 Score = 30.3 bits (68), Expect = 1.7
 Identities = 8/47 (17%), Positives = 25/47 (53%)

Query: 44 KKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCD 90
          KK+   +K  K+ +++  ++E+   ++ ++   E ++    +D + D
Sbjct: 33 KKENAIRKLGKEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDD 79


>gnl|CDD|218437 pfam05107, DUF694, Family of unknown function (DUF694).  Family of
           hypothetical bacterial proteins.
          Length = 278

 Score = 30.7 bits (70), Expect = 1.1
 Identities = 21/43 (48%), Positives = 30/43 (69%), Gaps = 1/43 (2%)

Query: 20  KEEEKEENEQEE-KEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
           +E+EK  +E  E  E+ D  EE  K+KKKKKKKKKKK++ ++ 
Sbjct: 63  QEKEKLNDEHGELYERADRAEELLKEKKKKKKKKKKKEELREW 105



 Score = 29.9 bits (68), Expect = 1.8
 Identities = 16/43 (37%), Positives = 28/43 (65%)

Query: 18  RNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
           + KE+  +E+ +  +  +  EE   +KKKKKKKKKKK++ ++ 
Sbjct: 63  QEKEKLNDEHGELYERADRAEELLKEKKKKKKKKKKKEELREW 105



 Score = 29.5 bits (67), Expect = 2.3
 Identities = 16/42 (38%), Positives = 25/42 (59%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK 57
           EK    +E  E  +     E++ +EK KKKKKKKKK++ ++ 
Sbjct: 64  EKEKLNDEHGELYERADRAEELLKEKKKKKKKKKKKEELREW 105



 Score = 29.5 bits (67), Expect = 2.6
 Identities = 14/38 (36%), Positives = 24/38 (63%)

Query: 26  ENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
            N++  +  E  +  +   K+KKKKKKKKKKK++ ++ 
Sbjct: 68  LNDEHGELYERADRAEELLKEKKKKKKKKKKKEELREW 105



 Score = 29.1 bits (66), Expect = 3.1
 Identities = 15/41 (36%), Positives = 26/41 (63%)

Query: 31  EKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
           EK  ++  E   +  + ++  K+KKKKKKKKKK+E+  + +
Sbjct: 66  EKLNDEHGELYERADRAEELLKEKKKKKKKKKKKEELREWM 106



 Score = 28.7 bits (65), Expect = 4.3
 Identities = 19/38 (50%), Positives = 25/38 (65%), Gaps = 3/38 (7%)

Query: 31  EKEKEDVEEEK---NKKKKKKKKKKKKKKKKKKKKKEE 65
           EKEK + E  +      + ++  K+KKKKKKKKKKKEE
Sbjct: 64  EKEKLNDEHGELYERADRAEELLKEKKKKKKKKKKKEE 101


>gnl|CDD|227577 COG5252, COG5252, Uncharacterized conserved protein, contains
          CCCH-type Zn-finger protein [General function
          prediction only].
          Length = 299

 Score = 30.8 bits (69), Expect = 1.1
 Identities = 16/73 (21%), Positives = 41/73 (56%), Gaps = 4/73 (5%)

Query: 1  MRKRRGGAGEKKKRIEKRNKEEE----KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKK 56
          M K++  +G+K  R  ++  E++    K +N   + +    + E    K++ +KK+K + 
Sbjct: 6  MAKKQQESGKKATRDMRKELEDKTFGLKNKNRSTKVQAIIKQIETLNLKEQLEKKEKMRM 65

Query: 57 KKKKKKKEEQSVK 69
          ++K+++ E+Q ++
Sbjct: 66 EEKRREPEKQVIR 78



 Score = 29.3 bits (65), Expect = 3.0
 Identities = 24/103 (23%), Positives = 40/103 (38%), Gaps = 5/103 (4%)

Query: 11  KKKRIEKRNKEEEKEENEQEEKEKEDV-----EEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
           +K R+E++ +E EK+        K  V      +   K    K    K++ +K +K    
Sbjct: 61  EKMRMEEKRREPEKQVIRAGVDPKTVVCALFLNKTCAKGDACKFAHGKEEARKTEKPDLY 120

Query: 66  QSVKEIEESLKERKEKTLTEDSQCDTMIPVFKKGVLYKGIYCP 108
             V++ EE +   K   +  D  C   I   + G    G  CP
Sbjct: 121 SDVRDKEEDVPLGKRPWINTDRVCKFFIEAMESGKYGWGWTCP 163


>gnl|CDD|237028 PRK12267, PRK12267, methionyl-tRNA synthetase; Reviewed.
          Length = 648

 Score = 30.9 bits (71), Expect = 1.1
 Identities = 11/34 (32%), Positives = 20/34 (58%)

Query: 36  DVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           DVEEE    K++ +    K+ ++K+KK E+  + 
Sbjct: 511 DVEEEIAYIKEQMEGSAPKEPEEKEKKPEKPEIT 544



 Score = 28.2 bits (64), Expect = 7.6
 Identities = 7/30 (23%), Positives = 17/30 (56%)

Query: 33  EKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
            +E++   K + +    K+ ++K+KK +K 
Sbjct: 512 VEEEIAYIKEQMEGSAPKEPEEKEKKPEKP 541



 Score = 28.2 bits (64), Expect = 9.1
 Identities = 8/31 (25%), Positives = 17/31 (54%)

Query: 21  EEEKEENEQEEKEKEDVEEEKNKKKKKKKKK 51
           + E+E    +E+ +    +E  +K+KK +K 
Sbjct: 511 DVEEEIAYIKEQMEGSAPKEPEEKEKKPEKP 541


>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family.  This family of
           proteins contain a band 4.1 domain (pfam00373), at their
           amino terminus. This family represents the rest of these
           proteins.
          Length = 244

 Score = 30.5 bits (69), Expect = 1.1
 Identities = 17/76 (22%), Positives = 43/76 (56%), Gaps = 1/76 (1%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE-EQSV 68
           EKK    +      +EE    E+E+E +E E ++   +  K +++++KK+ + ++ +Q +
Sbjct: 53  EKKADELEEENRRLEEEAAASEEERERLEAEVDEATAEVAKLEEEREKKEAETRQLQQEL 112

Query: 69  KEIEESLKERKEKTLT 84
           +E +E+ +  +++ L 
Sbjct: 113 REAQEAHERARQELLE 128



 Score = 29.7 bits (67), Expect = 1.9
 Identities = 13/75 (17%), Positives = 40/75 (53%), Gaps = 2/75 (2%)

Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
          ++++ E   + E+ EE     + ++++EE +    + ++K K+++++ +  +K+   ++E
Sbjct: 4  EREQQELEERMEQMEE--DMRRAQKELEEYEETALELEEKLKQEEEEAQLLEKKADELEE 61

Query: 71 IEESLKERKEKTLTE 85
              L+E    +  E
Sbjct: 62 ENRRLEEEAAASEEE 76


>gnl|CDD|218688 pfam05680, ATP-synt_E, ATP synthase E chain.  This family
          consists of several ATP synthase E chain sequences
          which are components of the CF(0) subunit.
          Length = 83

 Score = 28.6 bits (64), Expect = 1.1
 Identities = 13/33 (39%), Positives = 19/33 (57%)

Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKK 52
          KEE+  E E +EK  E  +    KKK+ K++ K
Sbjct: 35 KEEKIREYEAQEKLIEKAKAAYAKKKQAKQEAK 67



 Score = 28.6 bits (64), Expect = 1.3
 Identities = 13/36 (36%), Positives = 23/36 (63%), Gaps = 1/36 (2%)

Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKK 55
          K++E++  E E +EK  +E+ K    KKK+ K++ K
Sbjct: 33 KKKEEKIREYEAQEKL-IEKAKAAYAKKKQAKQEAK 67



 Score = 26.7 bits (59), Expect = 5.6
 Identities = 13/50 (26%), Positives = 27/50 (54%), Gaps = 4/50 (8%)

Query: 31 EKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ----SVKEIEESLK 76
          +K++E + E + ++K  +K K    KKK+ K++ +     S    +E+ K
Sbjct: 33 KKKEEKIREYEAQEKLIEKAKAAYAKKKQAKQEAKSLAEGSSAPFDETPK 82


>gnl|CDD|237496 PRK13766, PRK13766, Hef nuclease; Provisional.
          Length = 773

 Score = 31.0 bits (71), Expect = 1.1
 Identities = 17/73 (23%), Positives = 41/73 (56%), Gaps = 4/73 (5%)

Query: 13  KRIEKRNKEE----EKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
           +R EK+ KEE    +   N++ ++  E+ + E+ +K ++       K K K++++EE+  
Sbjct: 490 RRKEKKMKEELKNLKGILNKKLQELDEEQKGEEEEKDEQLSLDDFVKSKGKEEEEEEEKE 549

Query: 69  KEIEESLKERKEK 81
           ++ +E+ ++  E 
Sbjct: 550 EKDKETEEDEPEG 562


>gnl|CDD|220774 pfam10477, EIF4E-T, Nucleocytoplasmic shuttling protein for mRNA
           cap-binding EIF4E.  EIF4E-T is the transporter protein
           for shuttling the mRNA cap-binding protein EIF4E
           protein, targeting it for nuclear import. EIF4E-T
           contains several key binding domains including two
           functional leucine-rich NESs (nuclear export signals)
           between residues 438-447 and 613-638 in the human
           protein. The other two binding domains are an
           EIF4E-binding site, between residues 27-42 in Q9EST3,
           and a bipartite NLS (nuclear localisation signals)
           between 194-211, and these lie in family EIF4E-T_N.
           EIF4E is the eukaryotic translation initiation factor 4E
           that is the rate-limiting factor for cap-dependent
           translation initiation.
          Length = 520

 Score = 31.1 bits (69), Expect = 1.1
 Identities = 17/62 (27%), Positives = 28/62 (45%)

Query: 33  EKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDTM 92
           +K   E++K      K+K +K+ KK+    KE         SL+   E  L E++  D  
Sbjct: 224 DKILEEDDKTSNGDGKQKGRKRTKKRTASVKEGHVECNGGVSLEPHDEVGLEEENAADQE 283

Query: 93  IP 94
           +P
Sbjct: 284 VP 285


>gnl|CDD|220376 pfam09745, DUF2040, Coiled-coil domain-containing protein 55
          (DUF2040).  This entry is a conserved domain of
          approximately 130 residues of proteins conserved from
          fungi to humans. The proteins do contain a coiled-coil
          domain, but the function is unknown.
          Length = 128

 Score = 29.7 bits (67), Expect = 1.2
 Identities = 15/72 (20%), Positives = 40/72 (55%), Gaps = 1/72 (1%)

Query: 18 RNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKE 77
          +   +E ++  +E+    D +E  +  K  K++KK+ K  ++K+ ++ + +  + E+ + 
Sbjct: 1  KKARKEAQKALEEDPSIYDYDEVYDSIKAAKEEKKQAKLSERKENRKPKYIGSLLEAAER 60

Query: 78 RK-EKTLTEDSQ 88
          RK E+ + E+ +
Sbjct: 61 RKREREIAEERK 72



 Score = 29.3 bits (66), Expect = 1.3
 Identities = 13/31 (41%), Positives = 21/31 (67%)

Query: 18  RNKEEEKEENEQEEKEKEDVEEEKNKKKKKK 48
           + + EE  + E+EEKE+E++EEE +  K K 
Sbjct: 96  KKQLEENRKLEEEEKEREELEEENDVTKGKD 126



 Score = 28.9 bits (65), Expect = 1.8
 Identities = 22/89 (24%), Positives = 40/89 (44%), Gaps = 19/89 (21%)

Query: 11  KKKRIEKRNKEEEKEEN--------------EQEEKEKEDVEEEKNKKKKKKKKKKKKKK 56
           K  + EK+  +  + +               E+ ++E+E  EE K +K+++K+  +   K
Sbjct: 28  KAAKEEKKQAKLSERKENRKPKYIGSLLEAAERRKREREIAEERKLQKEREKEGDEFADK 87

Query: 57  KK-----KKKKKEEQSVKEIEESLKERKE 80
           +K      KK+ EE    E EE  +E  E
Sbjct: 88  EKFVTSAYKKQLEENRKLEEEEKEREELE 116



 Score = 27.8 bits (62), Expect = 4.4
 Identities = 22/74 (29%), Positives = 40/74 (54%), Gaps = 11/74 (14%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKK------KKKKKKKKKKKKKKKKKEEQSVK 69
           E+R +E E  E  + +KE+E   +E   K+K      KK+ ++ +K ++++K++E     
Sbjct: 59  ERRKREREIAEERKLQKEREKEGDEFADKEKFVTSAYKKQLEENRKLEEEEKERE----- 113

Query: 70  EIEESLKERKEKTL 83
           E+EE     K K L
Sbjct: 114 ELEEENDVTKGKDL 127


>gnl|CDD|151173 pfam10669, Phage_Gp23, Protein gp23 (Bacteriophage A118).  This is
           the highly conserved family of the major tail subunit
           protein.
          Length = 121

 Score = 29.3 bits (65), Expect = 1.2
 Identities = 16/68 (23%), Positives = 37/68 (54%), Gaps = 1/68 (1%)

Query: 14  RIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
           R+E + +E +K E E+E+++KE  EE         ++++   K+     +++QS+  + +
Sbjct: 45  RVEMK-EERDKMETEREKRDKESKEERDKFISTMNEQQRLMDKQNDMMGQQQQSIDSLSK 103

Query: 74  SLKERKEK 81
           S+ +   K
Sbjct: 104 SVGKLAHK 111


>gnl|CDD|221007 pfam11158, DUF2938, Protein of unknown function (DUF2938).  This
           bacterial family of proteins has no known function. Some
           members are thought to be membrane proteins however this
           cannot be confirmed.
          Length = 150

 Score = 29.9 bits (68), Expect = 1.2
 Identities = 16/61 (26%), Positives = 25/61 (40%), Gaps = 5/61 (8%)

Query: 214 RCFANNYLQWAAVGTWILLNTQG-FISQGIG----LNNKEYLIWYILFIVFVTYAMLPLP 268
           R F    L +A VG W+    +G F    I     +  +  L W   + + V +A L L 
Sbjct: 20  RLFGVPSLNYAMVGRWLGHLPRGRFFHDSIAAAPPVPGERALGWLAHYAIGVLFAALLLL 79

Query: 269 L 269
           +
Sbjct: 80  I 80


>gnl|CDD|215124 PLN02195, PLN02195, cellulose synthase A.
          Length = 977

 Score = 31.1 bits (70), Expect = 1.2
 Identities = 16/33 (48%), Positives = 20/33 (60%)

Query: 34  KEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
           K  VE  K+KK KKKK  KKK+  K +   E+Q
Sbjct: 115 KNRVESWKDKKNKKKKSAKKKEAHKAQIPPEQQ 147



 Score = 29.2 bits (65), Expect = 4.3
 Identities = 12/28 (42%), Positives = 17/28 (60%)

Query: 41  KNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
           KN+ +  K KK KKKK  KKK+  +  +
Sbjct: 115 KNRVESWKDKKNKKKKSAKKKEAHKAQI 142


>gnl|CDD|219461 pfam07543, PGA2, Protein trafficking PGA2.  A Saccharomyces
           cerevisiae member of this family (PGA2) is an ER protein
           which has been implicated in protein trafficking.
          Length = 139

 Score = 29.8 bits (67), Expect = 1.2
 Identities = 13/63 (20%), Positives = 32/63 (50%), Gaps = 5/63 (7%)

Query: 20  KEEEKEENEQEEKEKEDVEE-----EKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES 74
                EE+  +E+++ED        +  KK +K+++K  +K  + +++  E    + +E 
Sbjct: 73  TAGHGEEDTDDEEDEEDFATPSAVPQWGKKARKRQRKVIRKLLEAEEQLREDQYDDEDED 132

Query: 75  LKE 77
           ++E
Sbjct: 133 IEE 135


>gnl|CDD|217935 pfam04158, Sof1, Sof1-like domain.  Sof1 is essential for cell
          growth and is a component of the nucleolar rRNA
          processing machinery.
          Length = 88

 Score = 28.8 bits (65), Expect = 1.2
 Identities = 12/41 (29%), Positives = 23/41 (56%)

Query: 43 KKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTL 83
          KK +K K++ K+ KK+K++ + + S         ERK+  +
Sbjct: 48 KKAQKIKREMKEAKKRKEENRRKHSKPGSVPPKPERKKHVV 88


>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
          Length = 646

 Score = 30.7 bits (70), Expect = 1.2
 Identities = 19/80 (23%), Positives = 37/80 (46%), Gaps = 6/80 (7%)

Query: 7   GAGEKKKRIEKRNKEEEKEENEQEEKEK-----EDVEEEKNKKKKKKKKKKKK-KKKKKK 60
              E    +E+  K+   E  E EE+        +++EE  + +K+++   ++ K+  KK
Sbjct: 185 VLKELSDEVEEELKKLGFERLELEEEGTPSELIREIKEELEEIEKERESLLEELKELAKK 244

Query: 61  KKKEEQSVKEIEESLKERKE 80
             +E  ++ E  E   ER E
Sbjct: 245 YLEELLALYEYLEIELERAE 264



 Score = 30.7 bits (70), Expect = 1.3
 Identities = 9/77 (11%), Positives = 33/77 (42%)

Query: 8   AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
               ++  +K + +  +E  +  E+E E +E+E  + +++  + + + K+ +++ +  + 
Sbjct: 69  LNPLREEKKKVSVKSLEELIKDVEEELEKIEKEIKELEEEISELENEIKELEQEIERLEP 128

Query: 68  VKEIEESLKERKEKTLT 84
               +  L         
Sbjct: 129 WGNFDLDLSLLLGFKYV 145


>gnl|CDD|237748 PRK14537, PRK14537, 50S ribosomal protein L20/unknown domain fusion
           protein; Provisional.
          Length = 230

 Score = 30.3 bits (68), Expect = 1.2
 Identities = 14/87 (16%), Positives = 35/87 (40%), Gaps = 4/87 (4%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
           + +     K ++   NE  +++  +  EEK   + +K    +  ++K   + E Q  K  
Sbjct: 119 QFQSTTITKPKKVLINEVLQEKTINQNEEKTSLQNEKVLSPELSEEKSDSELETQPQK-- 176

Query: 72  EESLKERKEKTLTEDSQCDTMIPVFKK 98
             +  + K+ ++        ++   KK
Sbjct: 177 --TQLKEKKPSIEHIDLSKMLLKELKK 201


>gnl|CDD|223957 COG1026, COG1026, Predicted Zn-dependent peptidases,
           insulinase-like [General function prediction only].
          Length = 978

 Score = 30.8 bits (70), Expect = 1.2
 Identities = 11/56 (19%), Positives = 29/56 (51%), Gaps = 4/56 (7%)

Query: 32  KEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDS 87
               ++EE+  K++++  +K+  +   +  +K  +  K     LKER+++  +E+ 
Sbjct: 456 LPSPELEEKLEKEERELLQKRSSELTDEDLEKIIKDSK----KLKERQDQPDSEED 507



 Score = 30.0 bits (68), Expect = 2.6
 Identities = 18/96 (18%), Positives = 33/96 (34%), Gaps = 21/96 (21%)

Query: 31  EKEKEDVEEEKNKKKKKKKKKKKKKK--KKKKKKKEEQSVKEIEESLK----------ER 78
           E++ E  E E  +K+  +   +  +K  K  KK KE Q   + EE L             
Sbjct: 462 EEKLEKEERELLQKRSSELTDEDLEKIIKDSKKLKERQDQPDSEEDLATLPTLKLGDVPD 521

Query: 79  KEKTLTEDSQCDTMIPVFKKGVLYKGIYCPSLTNSF 114
             +  + +++      V         ++    TN  
Sbjct: 522 PIEKTSLETEVSNEAKV---------LHHDLFTNGI 548


>gnl|CDD|226138 COG3611, DnaB, Replication initiation/membrane attachment protein
           [DNA replication, recombination, and repair].
          Length = 417

 Score = 30.8 bits (70), Expect = 1.2
 Identities = 12/53 (22%), Positives = 26/53 (49%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
             KK+     ++E   +    E +K   +EE+ K +  K ++ K+ +  K+K+
Sbjct: 365 LAKKQSSTTKRKENLPKWSNPEYKKTTSQEEQAKLEFIKLEELKRLENGKQKR 417



 Score = 30.4 bits (69), Expect = 1.3
 Identities = 16/75 (21%), Positives = 29/75 (38%), Gaps = 12/75 (16%)

Query: 1   MRKRRGGAGEKKKRIEKRN------------KEEEKEENEQEEKEKEDVEEEKNKKKKKK 48
            RK+   A E  K   KR             K +E        + K+   +E+  K +  
Sbjct: 343 ARKKVKTAYEAIKENRKRKELALAKKQSSTTKRKENLPKWSNPEYKKTTSQEEQAKLEFI 402

Query: 49  KKKKKKKKKKKKKKK 63
           K ++ K+ +  K+K+
Sbjct: 403 KLEELKRLENGKQKR 417


>gnl|CDD|215364 PLN02678, PLN02678, seryl-tRNA synthetase.
          Length = 448

 Score = 30.8 bits (70), Expect = 1.3
 Identities = 17/66 (25%), Positives = 30/66 (45%), Gaps = 5/66 (7%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
           R+R+      +K   K NKE  K +       KED  E   + K+ KK+  +K+ + ++ 
Sbjct: 43  RQRQFELDSLRKEFNKLNKEVAKLK-----IAKEDATELIAETKELKKEITEKEAEVQEA 97

Query: 62  KKEEQS 67
           K    +
Sbjct: 98  KAALDA 103


>gnl|CDD|226115 COG3587, COG3587, Restriction endonuclease [Defense mechanisms].
          Length = 985

 Score = 30.9 bits (70), Expect = 1.3
 Identities = 17/79 (21%), Positives = 33/79 (41%), Gaps = 1/79 (1%)

Query: 9   GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
             K K + ++  +++  E  ++E ++    +          K       K  K+ +E   
Sbjct: 405 PAKLKGLFEKLYKQKYAEVLKKELDRSFYRDYLQSTDDDILKVHGGYFSKDNKEGDEAIE 464

Query: 69  KEIEESLKERKEKTLTEDS 87
            EI+  LK+ KEK L+ D 
Sbjct: 465 DEIDLILKD-KEKLLSFDE 482


>gnl|CDD|235551 PRK05667, dnaG, DNA primase; Validated.
          Length = 580

 Score = 30.6 bits (70), Expect = 1.3
 Identities = 8/48 (16%), Positives = 21/48 (43%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK 57
           +  +++  +  EE  EE   +E+  E       ++ +  +   + K+K
Sbjct: 533 DALEKLRDQLLEERLEELIAKERLLEGHGLSSEERLELLQLLIELKRK 580


>gnl|CDD|218010 pfam04293, SpoVR, SpoVR like protein.  Bacillus subtilis stage V
           sporulation protein R is involved in spore cortex
           formation. Little is known about cortex biosynthesis,
           except that it depends on several sigma E controlled
           genes, including spoVR.
          Length = 427

 Score = 30.6 bits (70), Expect = 1.3
 Identities = 7/47 (14%), Positives = 23/47 (48%)

Query: 11  KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK 57
             K   +  K  ++E  E  + +  D+      +++K ++++ ++K+
Sbjct: 172 PIKLSLEEEKARQEEREEYLQSQVNDLWRTIPPQREKLEEERARRKR 218



 Score = 29.5 bits (67), Expect = 2.9
 Identities = 12/55 (21%), Positives = 31/55 (56%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
           K+ I+   +EE+  + E+EE  +  V +       +++K ++++ ++K+  +E Q
Sbjct: 170 KRPIKLSLEEEKARQEEREEYLQSQVNDLWRTIPPQREKLEEERARRKRFPEEPQ 224


>gnl|CDD|217011 pfam02388, FemAB, FemAB family.  The femAB operon codes for two
           nearly identical approximately 50-kDa proteins involved
           in the formation of the Staphylococcal pentaglycine
           interpeptide bridge in peptidoglycan. These proteins are
           also considered as a factor influencing the level of
           methicillin resistance.
          Length = 408

 Score = 30.7 bits (70), Expect = 1.3
 Identities = 11/59 (18%), Positives = 31/59 (52%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
           K+ +++  ++ EK E +  + E++  +   +KKK K  + +++    +K+  E + +  
Sbjct: 241 KEYLDELQEKLEKLEKDLAKLEEKLEKNPNSKKKNKLAELEQQLASLEKRIDEAKELIA 299



 Score = 30.7 bits (70), Expect = 1.3
 Identities = 15/59 (25%), Positives = 29/59 (49%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES 74
           E  ++ +EK E  +++  K + + EKN   KKK K  + +++    +K     KE+   
Sbjct: 242 EYLDELQEKLEKLEKDLAKLEEKLEKNPNSKKKNKLAELEQQLASLEKRIDEAKELIAK 300



 Score = 29.5 bits (67), Expect = 3.1
 Identities = 16/60 (26%), Positives = 32/60 (53%), Gaps = 2/60 (3%)

Query: 20  KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE--EQSVKEIEESLKE 77
           KE   E  E+ EK ++D+ + + K +K    KKK K  + +++    E+ + E +E + +
Sbjct: 241 KEYLDELQEKLEKLEKDLAKLEEKLEKNPNSKKKNKLAELEQQLASLEKRIDEAKELIAK 300



 Score = 28.0 bits (63), Expect = 7.9
 Identities = 19/61 (31%), Positives = 31/61 (50%), Gaps = 5/61 (8%)

Query: 21  EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
           +E +E+ E+ EK+   +EE    K +K    KKK K  + +++     K I+E  KE   
Sbjct: 245 DELQEKLEKLEKDLAKLEE----KLEKNPNSKKKNKLAELEQQLASLEKRIDE-AKELIA 299

Query: 81  K 81
           K
Sbjct: 300 K 300


>gnl|CDD|129200 TIGR00092, TIGR00092, GTP-binding protein YchF.  This predicted
           GTP-binding protein is found in a single copy in every
           complete bacterial genome, and is found in Eukaryotes. A
           more distantly related protein, separated from this
           model, is found in the archaea. It is known to bind GTP
           and double-stranded nucleic acid. It is suggested to
           belong to a nucleoprotein complex and act as a
           translation factor [Unknown function, General].
          Length = 368

 Score = 30.5 bits (69), Expect = 1.3
 Identities = 16/55 (29%), Positives = 21/55 (38%), Gaps = 5/55 (9%)

Query: 25  EENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERK 79
            E   EE  K D    +     K+  + KK  +  K KKEE  + EI   L    
Sbjct: 126 FEIIDEELLKADEFLVE-----KRIGRSKKSAEGGKDKKEELLLLEIILPLLNGG 175


>gnl|CDD|218581 pfam05416, Peptidase_C37, Southampton virus-type processing
           peptidase.  Corresponds to Merops family C37.
           Norwalk-like viruses (NLVs), including the Southampton
           virus, cause acute non-bacterial gastroenteritis in
           humans. The NLV genome encodes three open reading frames
           (ORFs). ORF1 encodes a polyprotein, which is processed
           by the viral protease into six proteins.
          Length = 535

 Score = 30.6 bits (69), Expect = 1.3
 Identities = 17/65 (26%), Positives = 27/65 (41%), Gaps = 5/65 (7%)

Query: 22  EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK-----EIEESLK 76
            E    +  E E++D       +    + KK K KK + KK    S +     E +E  K
Sbjct: 204 XEVTYYDGCESEEDDEGWYIEPQDATPEGKKGKNKKGRGKKHNAFSRRGLSDEEYDEYKK 263

Query: 77  ERKEK 81
            R+E+
Sbjct: 264 IREER 268



 Score = 30.2 bits (68), Expect = 2.1
 Identities = 25/109 (22%), Positives = 46/109 (42%), Gaps = 19/109 (17%)

Query: 15  IEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKK-----------KKKKKKKKKK 63
           +   +  E +E++E    E +D   E  K K KK + KK           ++  + KK +
Sbjct: 206 VTYYDGCESEEDDEGWYIEPQDATPEGKKGKNKKGRGKKHNAFSRRGLSDEEYDEYKKIR 265

Query: 64  EEQSVK-EIEESLKERK-------EKTLTEDSQCDTMIPVFKKGVLYKG 104
           EE+  K  I+E L++R+       E+  TE   C+      ++ +    
Sbjct: 266 EERGGKYSIQEYLEDRERYEEELAERQATEADFCEEEEAKIRQRIFGLR 314



 Score = 28.7 bits (64), Expect = 5.2
 Identities = 16/63 (25%), Positives = 34/63 (53%)

Query: 1   MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
           +R+ RGG    ++ +E R + EE+    Q  +     EEE   +++    +K +K++K++
Sbjct: 264 IREERGGKYSIQEYLEDRERYEEELAERQATEADFCEEEEAKIRQRIFGLRKTRKQRKEE 323

Query: 61  KKK 63
           + K
Sbjct: 324 RAK 326


>gnl|CDD|115185 pfam06513, DUF1103, Repeat of unknown function (DUF1103).  This
          family consists of several repeats of around 30
          residues in length which are found specifically in
          mature-parasite-infected erythrocyte surface antigen
          proteins from Plasmodium falciparum. This family often
          found in conjunction with pfam00226.
          Length = 215

 Score = 30.2 bits (67), Expect = 1.3
 Identities = 25/85 (29%), Positives = 47/85 (55%), Gaps = 7/85 (8%)

Query: 9  GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKK-------KK 61
          GE  K   K   + +K++    EK +++++E+  K+ K+K KKK KK  K+       K 
Sbjct: 4  GEGDKEDVKEKNDGKKDKVIGSEKTQKEIKEKVEKRVKEKCKKKVKKGIKENDTEGNDKV 63

Query: 62 KKEEQSVKEIEESLKERKEKTLTED 86
          K  E  ++E++E +K++ E  + E+
Sbjct: 64 KGPEIIIEEVKEEIKKQVEDGIKEN 88


>gnl|CDD|240317 PTZ00217, PTZ00217, flap endonuclease-1; Provisional.
          Length = 393

 Score = 30.7 bits (70), Expect = 1.3
 Identities = 14/39 (35%), Positives = 16/39 (41%)

Query: 43  KKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
           K  KK   K K KKKKKK         E  +  K   +K
Sbjct: 351 KPIKKSNSKAKLKKKKKKAGASAVPKSETSQEAKSSGKK 389



 Score = 30.0 bits (68), Expect = 1.8
 Identities = 26/73 (35%), Positives = 33/73 (45%), Gaps = 13/73 (17%)

Query: 22  EEKEENEQE-EKEKEDVEEEKNKK------------KKKKKKKKKKKKKKKKKKKEEQSV 68
           +EK  NE+  EK  E +++ K KK            KK  KK   K K KKKKKK   S 
Sbjct: 314 KEKNFNEERVEKYIERLKKAKTKKTQTRLDSFFTATKKPIKKSNSKAKLKKKKKKAGASA 373

Query: 69  KEIEESLKERKEK 81
               E+ +E K  
Sbjct: 374 VPKSETSQEAKSS 386



 Score = 28.8 bits (65), Expect = 5.4
 Identities = 14/41 (34%), Positives = 17/41 (41%)

Query: 41  KNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
           K  KK   K K KKKKKK       +S    E     +K+ 
Sbjct: 351 KPIKKSNSKAKLKKKKKKAGASAVPKSETSQEAKSSGKKKV 391


>gnl|CDD|215656 pfam00012, HSP70, Hsp70 protein.  Hsp70 chaperones help to fold
           many proteins. Hsp70 assisted folding involves repeated
           cycles of substrate binding and release. Hsp70 activity
           is ATP dependent. Hsp70 proteins are made up of two
           regions: the amino terminus is the ATPase domain and the
           carboxyl terminus is the substrate binding region.
          Length = 598

 Score = 30.7 bits (70), Expect = 1.4
 Identities = 21/90 (23%), Positives = 37/90 (41%), Gaps = 20/90 (22%)

Query: 6   GGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKN----------KKKKKKKKKKKKK 55
             +G     IE+  K+ E+     E+K++++  E KN          K  K++  K  + 
Sbjct: 497 ASSGLSDDEIERMVKDAEE--YAAEDKKRKERIEAKNEAEEYVYSLEKSLKEEGDKLPEA 554

Query: 56  KKKKKKKKEEQ--------SVKEIEESLKE 77
            KKK ++  E           +EIE   +E
Sbjct: 555 DKKKVEEAIEWLKEELEGEDKEEIEAKTEE 584



 Score = 30.3 bits (69), Expect = 1.8
 Identities = 17/75 (22%), Positives = 41/75 (54%), Gaps = 9/75 (12%)

Query: 10  EKKKRIEKRNKEE------EKEENEQEEKEKEDVEEEKNKKK---KKKKKKKKKKKKKKK 60
           ++K+RIE +N+ E      EK   E+ +K  E  +++  +     K++ + + K++ + K
Sbjct: 522 KRKERIEAKNEAEEYVYSLEKSLKEEGDKLPEADKKKVEEAIEWLKEELEGEDKEEIEAK 581

Query: 61  KKKEEQSVKEIEESL 75
            ++ ++ V+ I E +
Sbjct: 582 TEELQKVVQPIGERM 596



 Score = 29.5 bits (67), Expect = 3.1
 Identities = 17/76 (22%), Positives = 36/76 (47%), Gaps = 4/76 (5%)

Query: 15  IEKRNKEEEKEENEQEEKE----KEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
           +  ++K   KE+            +++E      ++   + KK+K++ + K + E+ V  
Sbjct: 480 VSAKDKGTGKEQKITITASSGLSDDEIERMVKDAEEYAAEDKKRKERIEAKNEAEEYVYS 539

Query: 71  IEESLKERKEKTLTED 86
           +E+SLKE  +K    D
Sbjct: 540 LEKSLKEEGDKLPEAD 555


>gnl|CDD|225816 COG3277, GAR1, RNA-binding protein involved in rRNA processing
          [Translation, ribosomal structure and biogenesis].
          Length = 98

 Score = 28.9 bits (65), Expect = 1.4
 Identities = 9/23 (39%), Positives = 14/23 (60%)

Query: 41 KNKKKKKKKKKKKKKKKKKKKKK 63
             K  +KK+K  +KK++ KK K
Sbjct: 76 PPDKLIRKKRKLPRKKRRPKKPK 98


>gnl|CDD|198139 smart01071, CDC37_N, Cdc37 N terminal kinase binding.  Cdc37 is a
           molecular chaperone required for the activity of
           numerous eukaryotic protein kinases. This domain
           corresponds to the N terminal domain which binds
           predominantly to protein kinases.and is found N terminal
           to the Hsp (Heat shocked protein) 90-binding domain.
           Expression of a construct consisting of only the
           N-terminal domain of Saccharomyces pombe Cdc37 results
           in cellular viability. This indicates that interactions
           with the cochaperone Hsp90 may not be essential for
           Cdc37 function.
          Length = 154

 Score = 29.7 bits (67), Expect = 1.4
 Identities = 14/39 (35%), Positives = 23/39 (58%)

Query: 8   AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKK 46
            G+ +  +E+  K  +K + EQ+E  K+  E EK +KKK
Sbjct: 103 NGDSEGLLEELKKHRDKLKKEQKELRKKLDELEKEEKKK 141



 Score = 28.5 bits (64), Expect = 2.9
 Identities = 17/83 (20%), Positives = 41/83 (49%), Gaps = 9/83 (10%)

Query: 13  KRIEKRNKEEEKEENEQE--------EKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK-KK 63
           KRI+K  K   +EE   E         + ++ +++E  +     +   ++ KK + K KK
Sbjct: 63  KRIDKLLKGLREEELSPETPTYNEMLAELQDQLKKELEEANGDSEGLLEELKKHRDKLKK 122

Query: 64  EEQSVKEIEESLKERKEKTLTED 86
           E++ +++  + L++ ++K +   
Sbjct: 123 EQKELRKKLDELEKEEKKKIWSV 145



 Score = 28.5 bits (64), Expect = 3.8
 Identities = 14/46 (30%), Positives = 27/46 (58%), Gaps = 2/46 (4%)

Query: 12  KKRIEKRNKEEEK--EENEQEEKEKEDVEEEKNKKKKKKKKKKKKK 55
           KK +E+ N + E   EE ++   + +  ++E  KK  + +K++KKK
Sbjct: 96  KKELEEANGDSEGLLEELKKHRDKLKKEQKELRKKLDELEKEEKKK 141



 Score = 27.4 bits (61), Expect = 8.8
 Identities = 15/54 (27%), Positives = 34/54 (62%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
           E    ++ + K+E +E N   E   E++++ ++K KK++K+ +KK  + +K++K
Sbjct: 86  EMLAELQDQLKKELEEANGDSEGLLEELKKHRDKLKKEQKELRKKLDELEKEEK 139


>gnl|CDD|240292 PTZ00148, PTZ00148, 40S ribosomal protein S8; Provisional.
          Length = 205

 Score = 30.0 bits (68), Expect = 1.4
 Identities = 19/40 (47%), Positives = 25/40 (62%), Gaps = 1/40 (2%)

Query: 47  KKKKKKKKKKKKKKKKKEEQSVKEIEESLKERK-EKTLTE 85
           KKKK KK+++ KK KKK E  V ++E   K  K +K L E
Sbjct: 123 KKKKYKKEEENKKAKKKSESLVAKLEVRKKNHKIDKALLE 162



 Score = 29.2 bits (66), Expect = 2.6
 Identities = 14/39 (35%), Positives = 22/39 (56%)

Query: 32  KEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
           K+K   EEE  K KKK +    K + +KK  K ++++ E
Sbjct: 124 KKKYKKEEENKKAKKKSESLVAKLEVRKKNHKIDKALLE 162


>gnl|CDD|234707 PRK00270, rpsU, 30S ribosomal protein S21; Reviewed.
          Length = 64

 Score = 27.9 bits (63), Expect = 1.4
 Identities = 11/26 (42%), Positives = 19/26 (73%)

Query: 40 EKNKKKKKKKKKKKKKKKKKKKKKEE 65
          EK  +K+K+KK   +K+++KK  +EE
Sbjct: 39 EKPSEKRKRKKAAARKRRRKKLAREE 64



 Score = 26.4 bits (59), Expect = 5.2
 Identities = 8/21 (38%), Positives = 15/21 (71%)

Query: 46 KKKKKKKKKKKKKKKKKKEEQ 66
          K+K+KK   +K+++KK   E+
Sbjct: 44 KRKRKKAAARKRRRKKLAREE 64



 Score = 26.0 bits (58), Expect = 7.0
 Identities = 8/25 (32%), Positives = 16/25 (64%)

Query: 43 KKKKKKKKKKKKKKKKKKKKKEEQS 67
          K  +K+K+KK   +K+++KK   + 
Sbjct: 40 KPSEKRKRKKAAARKRRRKKLAREE 64


>gnl|CDD|216298 pfam01101, HMG14_17, HMG14 and HMG17. 
          Length = 93

 Score = 28.6 bits (64), Expect = 1.4
 Identities = 22/68 (32%), Positives = 31/68 (45%), Gaps = 8/68 (11%)

Query: 11 KKKRIEKRNKEEEKEENEQEE-----KEKEDVEEEKNKKKKKKKKK---KKKKKKKKKKK 62
          K+K   K   +  K+E ++       K      E K KK   KKKK   K KK  K KK+
Sbjct: 2  KRKGEAKDAAKVTKQEPKRRSARLSAKPAPPKPEPKPKKAAAKKKKPAVKGKKGAKGKKE 61

Query: 63 KEEQSVKE 70
           +++  KE
Sbjct: 62 TKQEEAKE 69


>gnl|CDD|227441 COG5110, RPN1, 26S proteasome regulatory complex component
          [Posttranslational modification, protein turnover,
          chaperones].
          Length = 881

 Score = 30.7 bits (69), Expect = 1.4
 Identities = 11/56 (19%), Positives = 31/56 (55%)

Query: 22 EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKE 77
          +E ++ +Q   E+  +  EK    KK KKK+++++  ++    +  ++ + E +++
Sbjct: 3  DESDKKQQTIDEQSQISPEKQTPNKKDKKKEEEEQLSEEDAMLKGDLELLVERIQD 58



 Score = 29.5 bits (66), Expect = 3.1
 Identities = 14/61 (22%), Positives = 30/61 (49%), Gaps = 3/61 (4%)

Query: 32 KEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE-KTLTEDSQCD 90
           ++ D +++   ++ +   +K+   KK KKK+EE+ +   EE    + + + L E  Q  
Sbjct: 2  VDESDKKQQTIDEQSQISPEKQTPNKKDKKKEEEEQLS--EEDAMLKGDLELLVERIQDP 59

Query: 91 T 91
           
Sbjct: 60 D 60


>gnl|CDD|148491 pfam06898, YqfD, Putative stage IV sporulation protein YqfD.  This
           family consists of several putative bacterial stage IV
           sporulation (SpoIV) proteins. YqfD of Bacillus subtilis
           is known to be essential for efficient sporulation
           although its exact function is unknown.
          Length = 383

 Score = 30.4 bits (69), Expect = 1.4
 Identities = 26/91 (28%), Positives = 43/91 (47%), Gaps = 15/91 (16%)

Query: 8   AGEKKKRIEKRNKEEEKEENEQEEKE------------KEDVEEEKNKKKKKKKKKKKKK 55
            G KK  ++ R  E EK + E++EK             +E   E ++K  K  K++  + 
Sbjct: 277 IGGKKLPLKFRKNEFEKYDKEEDEKPLFLKWKLPISIVRETYYEVQDKVVKLTKEEAVE- 335

Query: 56  KKKKKKKKEEQSVKEIEESLKERKEKTLTED 86
             K KK  +++  KEI++  K + EK L E 
Sbjct: 336 --KGKKLAKKKLEKEIDKDAKIKDEKVLHER 364


>gnl|CDD|226096 COG3566, COG3566, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 379

 Score = 30.3 bits (68), Expect = 1.4
 Identities = 15/83 (18%), Positives = 32/83 (38%), Gaps = 4/83 (4%)

Query: 3   KRRGGAGEKKKRIEKRNKE-EEKEENEQEEKEKEDVEEEK---NKKKKKKKKKKKKKKKK 58
           +R G  G      E +  E E    + +   EK D  E+     + K    +   K++  
Sbjct: 188 RRIGVDGISLSLEETKASEVEHLSASLKTATEKVDALEKDLHAAQAKLDSGQALTKEELD 247

Query: 59  KKKKKEEQSVKEIEESLKERKEK 81
            KK +  +++  +E +    ++ 
Sbjct: 248 AKKAELSKALAALEAANAADEDP 270


>gnl|CDD|151665 pfam11223, DUF3020, Protein of unknown function (DUF3020).  This
          family of fungal proteins is conserved towards the
          C-terminus of HMG domains. The function is not known.
          Length = 49

 Score = 27.6 bits (61), Expect = 1.4
 Identities = 13/42 (30%), Positives = 20/42 (47%)

Query: 24 KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
          K      E+ K++    + KKK KKK  K+  K+K     +E
Sbjct: 7  KWREANSERNKDNDLRSRVKKKAKKKFGKEDSKEKDAYIDDE 48



 Score = 26.8 bits (59), Expect = 2.6
 Identities = 13/47 (27%), Positives = 25/47 (53%)

Query: 27 NEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
          N + +K+  +   E+NK    + + KKK KKK  K+  ++    I++
Sbjct: 1  NRERKKKWREANSERNKDNDLRSRVKKKAKKKFGKEDSKEKDAYIDD 47


>gnl|CDD|241471 cd13317, PH_PLEKHO1_PLEKHO2, Pleckstrin homology
          domain-containing family O Pleckstrin homology domain. 
          The PLEKHO family members are PLEKHO1 (also called
          CKIP-1/Casein kinase 2-interacting protein
          1/CK2-interacting protein 1) and PLEKHO2 (PLEKHQ1/PH
          domain-containing family Q member 1). They both contain
          a single PH domain. PLEKHO1 acts as a scaffold protein
          that functions in plasma membrane recruitment,
          transcriptional activity modulation, and
          posttranscriptional modification regulation. As an
          adaptor protein it is involved in signaling pathways,
          apoptosis, differentiation, cytoskeleton, and bone
          formation. Not much is know about PLEKHO2. PH domains
          have diverse functions, but in general are involved in
          targeting proteins to the appropriate cellular location
          or in the interaction with a binding partner. They
          share little sequence conservation, but all have a
          common fold, which is electrostatically polarized. Less
          than 10% of PH domains bind phosphoinositide phosphates
          (PIPs) with high affinity and specificity. PH domains
          are distinguished from other PIP-binding domains by
          their specific high-affinity binding to PIPs with two
          vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2
          or PtdIns(3,4,5)P3 which results in targeting some PH
          domain proteins to the plasma membrane. A few display
          strong specificity in lipid binding. Any specificity is
          usually determined by loop regions or insertions in the
          N-terminus of the domain, which are not conserved
          across all PH domains. PH domains are found in cellular
          signaling proteins such as serine/threonine kinase,
          tyrosine kinases, regulators of G-proteins, endocytotic
          GTPases, adaptors, as well as cytoskeletal associated
          molecules and in lipid associated enzymes.
          Length = 101

 Score = 28.6 bits (64), Expect = 1.5
 Identities = 12/32 (37%), Positives = 19/32 (59%)

Query: 26 ENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK 57
          E EQE  + ED E  +  + +K + KKK+K +
Sbjct: 38 EKEQEVFDLEDYELCEYLRCQKSRSKKKRKSR 69


>gnl|CDD|178867 PRK00106, PRK00106, hypothetical protein; Provisional.
          Length = 535

 Score = 30.6 bits (69), Expect = 1.5
 Identities = 25/83 (30%), Positives = 39/83 (46%), Gaps = 18/83 (21%)

Query: 24  KEENEQEEKEKEDVEEEKNKKKK-----KKKKKKKKKKKKKKKKKEEQSVKEIEESLKER 78
           K E + E  +K    E K  KK+     K++ +K +++ +++ K E Q +K+IE  L ER
Sbjct: 50  KAERDAEHIKKTAKRESKALKKELLLEAKEEARKYREEIEQEFKSERQELKQIESRLTER 109

Query: 79  -------------KEKTLTEDSQ 88
                        KEKTL    Q
Sbjct: 110 ATSLDRKDENLSSKEKTLESKEQ 132



 Score = 29.1 bits (65), Expect = 4.0
 Identities = 24/93 (25%), Positives = 53/93 (56%), Gaps = 10/93 (10%)

Query: 5   RGGAGEKKKRIEKRNKEEEK--------EENEQEEKEKEDVEEE-KNKKKKKKKKKKKKK 55
           RG A    + I+K  K E K        E  E+  K +E++E+E K+++++ K+ + +  
Sbjct: 48  RGKAERDAEHIKKTAKRESKALKKELLLEAKEEARKYREEIEQEFKSERQELKQIESRLT 107

Query: 56  KKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQ 88
           ++     ++++++   E++L E KE++LT+ S+
Sbjct: 108 ERATSLDRKDENLSSKEKTL-ESKEQSLTDKSK 139


>gnl|CDD|219761 pfam08243, SPT2, SPT2 chromatin protein.  This family includes the
           Saccharomyces cerevisiae protein SPT2 which is a
           chromatin protein involved in transcriptional
           regulation.
          Length = 116

 Score = 29.1 bits (65), Expect = 1.5
 Identities = 12/32 (37%), Positives = 20/32 (62%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKK 47
           E+R+    + E+E+E   +E+ E+ K KKK K
Sbjct: 85  ERRSARMARLEDERELAREEEEEKRKKKKKNK 116



 Score = 27.9 bits (62), Expect = 4.0
 Identities = 12/39 (30%), Positives = 21/39 (53%), Gaps = 3/39 (7%)

Query: 21  EEEKEENEQEEK---EKEDVEEEKNKKKKKKKKKKKKKK 56
             E ++ E+        ED  E   +++++K+KKKKK K
Sbjct: 78  FMEIQKEERRSARMARLEDERELAREEEEEKRKKKKKNK 116



 Score = 27.5 bits (61), Expect = 4.3
 Identities = 14/37 (37%), Positives = 21/37 (56%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKK 52
           E + +E       + E E+E   EE+ +K+KKKKK K
Sbjct: 80  EIQKEERRSARMARLEDERELAREEEEEKRKKKKKNK 116


>gnl|CDD|232933 TIGR00348, hsdR, type I site-specific deoxyribonuclease, HsdR
           family.  This gene is part of the type I restriction and
           modification system which is composed of three
           polypeptides R (restriction endonuclease), M
           (modification) and S (specificity). This group of
           enzymes recognize specific short DNA sequences and have
           an absolute requirement for ATP (or dATP) and
           S-adenosyl-L-methionine. They also catalyse the
           reactions of EC 2.1.1.72 and EC 2.1.1.73, with similar
           site specificity.(J. Mol. Biol. 271 (3), 342-348
           (1997)). Members of this family are assumed to differ
           from each other in DNA site specificity [DNA metabolism,
           Restriction/modification].
          Length = 667

 Score = 30.4 bits (69), Expect = 1.5
 Identities = 16/69 (23%), Positives = 33/69 (47%), Gaps = 3/69 (4%)

Query: 14  RIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKK--KKKKKKKKKKKEEQSVKEI 71
           R+ + + +++K +    ++  E + E   +  K+  K+K  K KK    + + E   K+I
Sbjct: 444 RLPEDHLDKKKLDAF-FDEIFELLPERIREITKESLKEKLQKTKKILFNEDRLESIAKDI 502

Query: 72  EESLKERKE 80
            E   + KE
Sbjct: 503 AEHYAKFKE 511


>gnl|CDD|129660 TIGR00569, ccl1, cyclin ccl1.  All proteins in this family for
           which functions are known are cyclins that are
           components of TFIIH, a complex that is involved in
           nucleotide excision repair and transcription initiation.
           This family is based on the phylogenomic analysis of JA
           Eisen (1999, Ph.D. Thesis, StanfordUniversity) [DNA
           metabolism, DNA replication, recombination, and repair].
          Length = 305

 Score = 30.2 bits (68), Expect = 1.5
 Identities = 19/62 (30%), Positives = 31/62 (50%), Gaps = 1/62 (1%)

Query: 26  ENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE-QSVKEIEESLKERKEKTLT 84
            N +E  +  D+  E     KK ++ + +K    KKK +E  S++E   S+K RK K   
Sbjct: 236 GNREELPQLIDIMRELRILVKKYEEPRSEKVAALKKKLDECHSIEENLSSVKIRKRKGYE 295

Query: 85  ED 86
           +D
Sbjct: 296 DD 297


>gnl|CDD|233119 TIGR00763, lon, ATP-dependent protease La.  This protein is induced
           by heat shock and other stresses in E. coli, B.
           subtilis, and other species. The yeast member,
           designated PIM1, is located in the mitochondrial matrix,
           required for mitochondrial function, and also induced by
           heat shock [Protein fate, Degradation of proteins,
           peptides, and glycopeptides].
          Length = 775

 Score = 30.7 bits (70), Expect = 1.5
 Identities = 16/60 (26%), Positives = 35/60 (58%), Gaps = 8/60 (13%)

Query: 12  KKRIEKRNKE----EE----KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
           ++++EK  +E    E+    K+E   E+ +K+++E+ K K ++ K  ++ KK  +K+  K
Sbjct: 216 EEKMEKTQREYYLREQLKAIKKELGIEKDDKDELEKLKEKLEELKLPEEVKKVIEKELTK 275


>gnl|CDD|184416 PRK13955, mscL, large-conductance mechanosensitive channel;
           Provisional.
          Length = 130

 Score = 29.4 bits (66), Expect = 1.5
 Identities = 11/34 (32%), Positives = 23/34 (67%)

Query: 45  KKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKER 78
           K   KK+++K+++  +  KEE+ + EI + LK++
Sbjct: 92  KLTSKKEEEKEEEIPEPTKEEELLGEIRDLLKQQ 125


>gnl|CDD|206035 pfam13864, Enkurin, Calmodulin-binding.  This is a family of
          apparent calmodulin-binding proteins found at high
          levels in the testis and vomeronasal organ and at lower
          levels in certain other tissues. Enkurin is a scaffold
          protein that binds PI3 kinase to sperm transient
          receptor potential (canonical) (TRPC) channels. The
          mammalian transient receptor potential (canonical)
          channels are the primary candidates for the Ca(2+)
          entry pathway activated by the hormones, growth
          factors, and neurotransmitters that exert their effect
          through activation of PLC. Calmodulin binds to the
          C-terminus of all TRPC channels, and dissociation of
          calmodulin from TRPC4 results in profound activation of
          the channel.
          Length = 98

 Score = 28.7 bits (65), Expect = 1.5
 Identities = 21/85 (24%), Positives = 39/85 (45%), Gaps = 23/85 (27%)

Query: 15 IEKRNKEEEKEENEQEEKEKEDV---------EEEKNKKKKKKKKKKKKK---------- 55
          + KR +E  +E+ E+E +E++           EEE+ +     KK   +           
Sbjct: 7  LLKRKEEIAEEKEERERQEEDPDCPPGHRLLSEEERLELLNGLKKNWDELNKEYQKLPVV 66

Query: 56 ----KKKKKKKKEEQSVKEIEESLK 76
               KK++K++ E+ + EIEE +K
Sbjct: 67 IDTPSKKRRKEELEKELAEIEEDIK 91


>gnl|CDD|237276 PRK13024, PRK13024, bifunctional preprotein translocase subunit
           SecD/SecF; Reviewed.
          Length = 755

 Score = 30.6 bits (70), Expect = 1.5
 Identities = 12/23 (52%), Positives = 18/23 (78%)

Query: 43  KKKKKKKKKKKKKKKKKKKKKEE 65
           +K++ KKKKK+KK KK + +K E
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEKIE 755



 Score = 29.4 bits (67), Expect = 3.9
 Identities = 11/22 (50%), Positives = 17/22 (77%)

Query: 47  KKKKKKKKKKKKKKKKKEEQSV 68
           +K++ KKKKK+KK KK E + +
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEKI 754



 Score = 29.0 bits (66), Expect = 4.2
 Identities = 10/21 (47%), Positives = 16/21 (76%)

Query: 41  KNKKKKKKKKKKKKKKKKKKK 61
           + ++ KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753



 Score = 29.0 bits (66), Expect = 4.7
 Identities = 12/21 (57%), Positives = 16/21 (76%)

Query: 40  EKNKKKKKKKKKKKKKKKKKK 60
           EK + KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753



 Score = 28.7 bits (65), Expect = 6.5
 Identities = 11/23 (47%), Positives = 17/23 (73%)

Query: 45  KKKKKKKKKKKKKKKKKKKEEQS 67
           +K++ KKKKK+KK KK + E+  
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEKIE 755


>gnl|CDD|214697 smart00498, FH2, Formin Homology 2 Domain.  FH proteins control
           rearrangements of the actin cytoskeleton, especially in
           the context of cytokinesis and cell polarisation.
           Members of this family have been found to interact with
           Rho-GTPases, profilin and other actin-assoziated
           proteins. These interactions are mediated by the
           proline-rich FH1 domain, usually located in front of FH2
           (but not listed in SMART). Despite this cytosolic
           function, vertebrate formins have been assigned
           functions within the nucleus. A set of Formin-Binding
           Proteins (FBPs) has been shown to bind FH1 with their WW
           domain.
          Length = 392

 Score = 30.4 bits (69), Expect = 1.6
 Identities = 22/83 (26%), Positives = 34/83 (40%), Gaps = 20/83 (24%)

Query: 24  KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV--------------- 68
            EE+E +  E E++   K K K   K   +KK   KKK  +E  +               
Sbjct: 32  DEESEGDLDELEELFSAKEKTKSASKDVSEKKSILKKKASQEFKILDPKRSQNLAILLRK 91

Query: 69  -----KEIEESLKERKEKTLTED 86
                +EI+E++ E  E  L+ D
Sbjct: 92  LHMSYEEIKEAILEGDEDVLSVD 114



 Score = 28.9 bits (65), Expect = 4.3
 Identities = 13/46 (28%), Positives = 25/46 (54%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKK 55
           E  K   K  +E  K+E E+EE+ K+ V+E    ++   ++K++  
Sbjct: 325 EFLKEFSKAAEENIKKEEEEEERRKKLVKETTEYEQSSSRQKERNP 370


>gnl|CDD|221466 pfam12220, U1snRNP70_N, U1 small nuclear ribonucleoprotein of
          70kDa MW N terminal.  This domain is found in
          eukaryotes. This domain is about 90 amino acids in
          length. This domain is found associated with pfam00076.
          This domain is part of U1 snRNP, which is the pre-mRNA
          binding protein of the penta-snRNP spliceosome complex.
          It extends over a distance of 180 A from its RNA
          binding domain, wraps around the core domain of U1
          snRNP consisting of the seven Sm proteins and finally
          contacts U1-C, which is crucial for 5'-splice-site
          recognition.
          Length = 94

 Score = 28.4 bits (64), Expect = 1.6
 Identities = 11/41 (26%), Positives = 28/41 (68%)

Query: 37 VEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKE 77
           ++ K++   +  +   +K++++K++K+E+  K++EE LKE
Sbjct: 47 FKDYKDEPPPEPTETWLEKREREKREKKEKLEKKLEEELKE 87



 Score = 28.4 bits (64), Expect = 1.8
 Identities = 11/40 (27%), Positives = 27/40 (67%)

Query: 26 ENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
          ++ ++E   E  E    K++++K++KK+K +KK +++ +E
Sbjct: 48 KDYKDEPPPEPTETWLEKREREKREKKEKLEKKLEEELKE 87



 Score = 27.6 bits (62), Expect = 3.3
 Identities = 11/43 (25%), Positives = 25/43 (58%)

Query: 25 EENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
           E +  + E      E   +K++++K++KK+K +KK ++E + 
Sbjct: 45 SEFKDYKDEPPPEPTETWLEKREREKREKKEKLEKKLEEELKE 87



 Score = 27.6 bits (62), Expect = 3.3
 Identities = 9/36 (25%), Positives = 25/36 (69%)

Query: 35 EDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
          +D ++E   +  +   +K++++K++KK+K E+ ++E
Sbjct: 48 KDYKDEPPPEPTETWLEKREREKREKKEKLEKKLEE 83



 Score = 27.2 bits (61), Expect = 4.1
 Identities = 11/40 (27%), Positives = 26/40 (65%)

Query: 24 KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
          K+  ++   E  +   EK +++K++KK+K +KK +++ K+
Sbjct: 48 KDYKDEPPPEPTETWLEKREREKREKKEKLEKKLEEELKE 87



 Score = 26.5 bits (59), Expect = 7.6
 Identities = 10/40 (25%), Positives = 26/40 (65%)

Query: 23 EKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
          +  ++E   +  E   E++ ++K++KK+K +KK +++ K+
Sbjct: 48 KDYKDEPPPEPTETWLEKREREKREKKEKLEKKLEEELKE 87


>gnl|CDD|148679 pfam07218, RAP1, Rhoptry-associated protein 1 (RAP-1).  This family
           consists of several rhoptry-associated protein 1 (RAP-1)
           sequences which appear to be specific to Plasmodium
           falciparum.
          Length = 790

 Score = 30.4 bits (68), Expect = 1.6
 Identities = 20/73 (27%), Positives = 41/73 (56%), Gaps = 5/73 (6%)

Query: 12  KKRIEKRNKEEEKEENE-----QEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
           K  +EK  +E +  +N+     +EE+ + D E E+ ++ KK+  +K   +  ++K K ++
Sbjct: 220 KHPLEKLEEEIDILKNDGDKVAEEEEFELDEEHEEAEEDKKEALEKIGAEGDEEKFKFDE 279

Query: 67  SVKEIEESLKERK 79
            +K IE  +K+R 
Sbjct: 280 EIKFIEHDVKDRN 292


>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32.  This family
           consists of several mammalian specific proacrosin
           binding protein sp32 sequences. sp32 is a sperm specific
           protein which is known to bind with with 55- and 53-kDa
           proacrosins and the 49-kDa acrosin intermediate. The
           exact function of sp32 is unclear, it is thought however
           that the binding of sp32 to proacrosin may be involved
           in packaging the acrosin zymogen into the acrosomal
           matrix.
          Length = 243

 Score = 30.0 bits (67), Expect = 1.6
 Identities = 11/27 (40%), Positives = 19/27 (70%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKED 36
           E+K+  E++ +EE +EE +QEE +  D
Sbjct: 216 EEKQPQEEQEEEEVEEEAKQEEGQGTD 242



 Score = 28.9 bits (64), Expect = 3.9
 Identities = 15/75 (20%), Positives = 34/75 (45%), Gaps = 12/75 (16%)

Query: 20  KEEEKEENEQEE--------KEKEDVEEEKNKKK----KKKKKKKKKKKKKKKKKKEEQS 67
              E+  N  EE             V+  K K++    K ++  ++ K ++K+ ++E++ 
Sbjct: 167 PWPERLHNNVEELLQSSLSLGGSVQVKAPKPKQEQLLSKLQEYLQEHKTEEKQPQEEQEE 226

Query: 68  VKEIEESLKERKEKT 82
            +  EE+ +E  + T
Sbjct: 227 EEVEEEAKQEEGQGT 241



 Score = 28.5 bits (63), Expect = 5.4
 Identities = 11/62 (17%), Positives = 34/62 (54%), Gaps = 5/62 (8%)

Query: 9   GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
           G   +    + K+E+     QE      ++E K ++K+ ++++++++ +++ K++E Q  
Sbjct: 187 GGSVQVKAPKPKQEQLLSKLQEY-----LQEHKTEEKQPQEEQEEEEVEEEAKQEEGQGT 241

Query: 69  KE 70
            +
Sbjct: 242 DD 243


>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584).  This
           protein is found in bacteria and eukaryotes. Proteins in
           this family are typically between 943 to 1234 amino
           acids in length. This family contains a P-loop motif
           suggesting it is a nucleotide binding protein. It may be
           involved in replication.
          Length = 1198

 Score = 30.4 bits (69), Expect = 1.6
 Identities = 17/83 (20%), Positives = 39/83 (46%), Gaps = 8/83 (9%)

Query: 8   AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKK--------KKKKKKKKKKKKKKK 59
           A  K+K+ E++  +   E  EQ+  E E     K  +        +++  K K +    +
Sbjct: 619 AVAKQKQAEEQLVQANAELEEQKRAEAEARTALKQARLDLQRLQNEQQSLKDKLELAIAE 678

Query: 60  KKKKEEQSVKEIEESLKERKEKT 82
           +K++ E  +++++  LK+  E+ 
Sbjct: 679 RKQQAETQLRQLDAQLKQLLEQQ 701


>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein.  This
           family includes the radial spoke head proteins RSP4 and
           RSP6 from Chlamydomonas reinhardtii, and several
           eukaryotic homologues, including mammalian RSHL1, the
           protein product of a familial ciliary dyskinesia
           candidate gene.
          Length = 481

 Score = 30.4 bits (69), Expect = 1.6
 Identities = 12/33 (36%), Positives = 22/33 (66%)

Query: 9   GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEK 41
           G       ++  EEE++E+E+EE+E+E+ EE +
Sbjct: 337 GRCTWVNPEQKDEEEEQEDEEEEEEEEEPEEPE 369


>gnl|CDD|215525 PLN02972, PLN02972, Histidyl-tRNA synthetase.
          Length = 763

 Score = 30.6 bits (69), Expect = 1.6
 Identities = 15/31 (48%), Positives = 17/31 (54%)

Query: 25  EENEQEEKEKEDVEEEKNKKKKKKKKKKKKK 55
           E  E  EK + + E E  K  KK KK KKKK
Sbjct: 228 EGGELIEKPQAEAENEAGKSDKKDKKSKKKK 258



 Score = 28.7 bits (64), Expect = 5.4
 Identities = 13/25 (52%), Positives = 13/25 (52%)

Query: 33  EKEDVEEEKNKKKKKKKKKKKKKKK 57
           EK   E E    K  KK KK KKKK
Sbjct: 234 EKPQAEAENEAGKSDKKDKKSKKKK 258


>gnl|CDD|224234 COG1315, COG1315, Uncharacterized conserved protein [Function
           unknown].
          Length = 543

 Score = 30.5 bits (69), Expect = 1.7
 Identities = 19/105 (18%), Positives = 36/105 (34%), Gaps = 10/105 (9%)

Query: 7   GAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE- 65
           G   +     K   EE     E+ +K  + +      K + KK      K+         
Sbjct: 403 GMDPEIVERLKELTEEISLHEERLKKLTKLLVALVKVKIESKKNILPPDKESLLTAVNNT 462

Query: 66  -----QSVKEIEESLKERKEKTLTEDSQCDTMIPVFKKGVLYKGI 105
                 S+++I+  L+  +E+ L       +   V  K  +Y G+
Sbjct: 463 KITLRNSIEKIKAELEGLQEE-LEVVGIEGS---VSVKNQVYSGV 503


>gnl|CDD|153331 cd07647, F-BAR_PSTPIP, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Proline-Serine-Threonine
           Phosphatase-Interacting Proteins.  F-BAR domains are
           dimerization modules that bind and bend membranes and
           are found in proteins involved in membrane dynamics and
           actin reorganization. Vetebrates contain two
           Proline-Serine-Threonine Phosphatase-Interacting
           Proteins (PSTPIPs), PSTPIP1 and PSTPIP2. PSTPIPs are
           mainly expressed in hematopoietic cells and are involved
           in the regulation of cell adhesion and motility.
           Mutations in PSTPIPs have been shown to cause
           autoinflammatory disorders. PSTPIP1 contains an
           N-terminal F-BAR domain, PEST motifs, and a C-terminal
           SH3 domain, while PSTPIP2 contains only the N-terminal
           F-BAR domain. F-BAR domains form banana-shaped dimers
           with a positively-charged concave surface that binds to
           negatively-charged lipid membranes. They can induce
           membrane deformation in the form of long tubules.
          Length = 239

 Score = 29.8 bits (67), Expect = 1.7
 Identities = 21/63 (33%), Positives = 34/63 (53%), Gaps = 5/63 (7%)

Query: 11  KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
           +  R E    EE +E+ ++E K+ ED+      K+ +K KK+  KK  K KK  EQ  +E
Sbjct: 81  QSLREEAEKLEEFREKQKEERKKTEDI-----MKRSQKNKKELYKKTMKAKKSYEQKCRE 135

Query: 71  IEE 73
            ++
Sbjct: 136 KDK 138



 Score = 29.0 bits (65), Expect = 3.1
 Identities = 14/59 (23%), Positives = 29/59 (49%)

Query: 11  KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           K K+   +   + K+  EQ+ +EK+  E+   K     + K+ +K KKK  + +  + +
Sbjct: 113 KNKKELYKKTMKAKKSYEQKCREKDKAEQAYEKSSSGAQPKEAEKLKKKAAQCKTSAEE 171


>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
          chain; Provisional.
          Length = 1033

 Score = 30.5 bits (69), Expect = 1.7
 Identities = 17/55 (30%), Positives = 32/55 (58%)

Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
             + +E   E+E E++E ++  E   K +  K++K + K+ KK+KK+E Q + E
Sbjct: 27 SDSDDDEVPAEDEDEDEEDDEEAESPAKAEISKREKARLKELKKQKKQEIQKILE 81



 Score = 28.6 bits (64), Expect = 6.2
 Identities = 17/58 (29%), Positives = 35/58 (60%)

Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKER 78
          E+E E+ E +E+ +   + E +K++K + K+ KK+KK++ +K  EQ    I+  +  +
Sbjct: 37 EDEDEDEEDDEEAESPAKAEISKREKARLKELKKQKKQEIQKILEQQNAAIDADMNNK 94



 Score = 28.2 bits (63), Expect = 9.8
 Identities = 15/54 (27%), Positives = 32/54 (59%)

Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES 74
           ++ E   ++E E E+ +EE     K +  K++K + K+ KK+++Q +++I E 
Sbjct: 29 SDDDEVPAEDEDEDEEDDEEAESPAKAEISKREKARLKELKKQKKQEIQKILEQ 82


>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
           [Transcription / Chromatin structure and dynamics].
          Length = 279

 Score = 30.0 bits (67), Expect = 1.7
 Identities = 12/75 (16%), Positives = 38/75 (50%), Gaps = 1/75 (1%)

Query: 9   GEKKKRIEKRNKEEEK-EENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
             +K R+ + N   +  E+N++    + DV+ E+ ++ ++   +++++ ++       + 
Sbjct: 145 LAEKPRVTRFNIVWDNDEDNDEAPPAQPDVDNEEEERLEESDGREEEEDEEVGSDSYGEG 204

Query: 68  VKEIEESLKERKEKT 82
            +E+ E  +E  E +
Sbjct: 205 NRELNEEEEEEAEGS 219



 Score = 29.2 bits (65), Expect = 2.8
 Identities = 14/68 (20%), Positives = 33/68 (48%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E+++R+E+ +  EE+E+ E       +   E N++++++ +     +     + E    K
Sbjct: 177 EEEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEEEAEGSDDGEDVVDYEGERIDKK 236

Query: 70  EIEESLKE 77
           + EE   E
Sbjct: 237 QGEEEEME 244



 Score = 28.0 bits (62), Expect = 7.8
 Identities = 14/75 (18%), Positives = 37/75 (49%), Gaps = 8/75 (10%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKK--------KKKKKKKKKEEQS 67
            +   EEE+EE E  +  ++ V+ E  +  KK+ ++++         + + +++   E+ 
Sbjct: 205 NRELNEEEEEEAEGSDDGEDVVDYEGERIDKKQGEEEEMEEEVINLFEIEWEEESPSEEV 264

Query: 68  VKEIEESLKERKEKT 82
            +  EES  ++++  
Sbjct: 265 PRNNEESPAKKQKVE 279


>gnl|CDD|233936 TIGR02578, cas_TM1811_Csm1, CRISPR-associated protein Cas10/Csm1,
           subtype III-A/MTUBE.  The family is designated Csm2, for
           CRISPR/Cas Subtype Mtube Protein 2. A typical example is
           TM1811 from Thermotoga maritima. CRISPR are Clustered
           Regularly Interspaced Short Palindromic Repeats. This
           protein family belongs to a conserved gene cluster
           regularly found near CRISPR repeats [Mobile and
           extrachromosomal element functions, Other].
          Length = 648

 Score = 30.4 bits (69), Expect = 1.7
 Identities = 22/122 (18%), Positives = 43/122 (35%), Gaps = 13/122 (10%)

Query: 9   GEKKKRIEKRNKEEEKEENEQEEK----------EKEDVEEEKNKKKKKKKKKKKKKKKK 58
            E   R+E  +  EE+E   +EE                     K K   K   +    +
Sbjct: 82  LEIIDRLEAGDDREEEELGNEEEPLVSPFPKIPLGNSLRNAGGFKPKYPLKSLNRAVPPE 141

Query: 59  KKKKKEEQSVKEIEESLKERKEKTL-TEDSQCDTMIPVFKKGVLYKGIYCPSLTNSFREP 117
              K  +++ K + E      EK   +   + + ++ +F+K  L+  +   +   S  + 
Sbjct: 142 NDNKASQENYKRLMEKFLGDLEKLKPSYPDKIELLLTLFEK--LWSFVPAATNGGSKPDV 199

Query: 118 SL 119
           SL
Sbjct: 200 SL 201


>gnl|CDD|153367 cd07683, F-BAR_srGAP1, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Slit-Robo GTPase
           Activating Protein 1.  F-BAR domains are dimerization
           modules that bind and bend membranes and are found in
           proteins involved in membrane dynamics and actin
           reorganization. Slit-Robo GTPase Activating Proteins
           (srGAPs) are Rho GAPs that interact with Robo1, the
           transmembrane receptor of Slit proteins. Slit proteins
           are secreted proteins that control axon guidance and the
           migration of neurons and leukocytes. Vertebrates contain
           three isoforms of srGAPs. srGAP1, also called Rho
           GTPase-Activating Protein 13 (ARHGAP13), is a Cdc42- and
           RhoA-specific GAP and is expressed later in the
           development of CNS (central nervous system) tissues. It
           is an important downstream signaling molecule of Robo1.
           srGAP1 contains an N-terminal F-BAR domain, a Rho GAP
           domain, and a C-terminal SH3 domain. F-BAR domains form
           banana-shaped dimers with a positively-charged concave
           surface that binds to negatively-charged lipid
           membranes. They can induce membrane deformation in the
           form of long tubules.
          Length = 253

 Score = 30.0 bits (67), Expect = 1.7
 Identities = 20/69 (28%), Positives = 40/69 (57%), Gaps = 1/69 (1%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
           E + KE EK+E +Q  +  + V   + + + +++   KK +K K+K++ + S  ++ +S+
Sbjct: 154 ESKLKEAEKQEEKQIGRSGDPVFHIRLEDRHQRRSSVKKIEKMKEKRQAKYSENKL-KSI 212

Query: 76  KERKEKTLT 84
           K R E  LT
Sbjct: 213 KARNEYLLT 221


>gnl|CDD|218481 pfam05177, RCSD, RCSD region.  Proteins contain this region include
           C.elegans UNC-89. This region is found repeated in
           UNC-89 and shows conservation in prolines, lysines and
           glutamic acids. Proteins with RCSD are involved in
           muscle M-line assembly, but the function of this region
           RCSD is not clear.
          Length = 101

 Score = 28.7 bits (63), Expect = 1.7
 Identities = 27/93 (29%), Positives = 41/93 (44%), Gaps = 3/93 (3%)

Query: 11  KKKRIEKRNKEEEKEENEQEEKEKED--VEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
             K   K+ K   +    + EK +ED   EE K+ +KK+K  +K   K K   KKE+   
Sbjct: 1   GVKSPGKKEKSPLRRSPSRTEKTEEDRATEEAKSPEKKEKSPEKVDGKPKSPTKKEKSPE 60

Query: 69  KEIEESLKE-RKEKTLTEDSQCDTMIPVFKKGV 100
           K   E +K   K++   E  +     P  K+ V
Sbjct: 61  KSATEEVKSPTKKEKSPEKVEEKPASPTKKERV 93



 Score = 28.3 bits (62), Expect = 2.6
 Identities = 19/61 (31%), Positives = 32/61 (52%)

Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
          +K+K  EK + + +    +++  EK   EE K+  KK+K  +K ++K     KKE    K
Sbjct: 37 KKEKSPEKVDGKPKSPTKKEKSPEKSATEEVKSPTKKEKSPEKVEEKPASPTKKERVQKK 96

Query: 70 E 70
          E
Sbjct: 97 E 97



 Score = 27.9 bits (61), Expect = 3.2
 Identities = 19/72 (26%), Positives = 35/72 (48%)

Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
          E+ +  E+    E+KE++ ++   K     +K K  +K   ++ K   KK+K  E+   K
Sbjct: 24 EEDRATEEAKSPEKKEKSPEKVDGKPKSPTKKEKSPEKSATEEVKSPTKKEKSPEKVEEK 83

Query: 70 EIEESLKERKEK 81
              + KER +K
Sbjct: 84 PASPTKKERVQK 95


>gnl|CDD|218637 pfam05558, DREPP, DREPP plasma membrane polypeptide.  This family
           contains several plant plasma membrane proteins termed
           DREPPs as they are developmentally regulated plasma
           membrane polypeptides.
          Length = 201

 Score = 29.6 bits (66), Expect = 1.7
 Identities = 21/68 (30%), Positives = 27/68 (39%), Gaps = 3/68 (4%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           EKK   E    E EK+      KEKE V EE+ K+++          K  + K EE   K
Sbjct: 133 EKKPEEEAPAAETEKKP---AVKEKEIVVEEEKKEEEAVPVAAAPSTKAPETKVEETPAK 189

Query: 70  EIEESLKE 77
                  E
Sbjct: 190 APAAPAAE 197


>gnl|CDD|138835 PRK12279, PRK12279, 50S ribosomal protein L22/unknown domain fusion
           protein; Provisional.
          Length = 311

 Score = 30.0 bits (67), Expect = 1.7
 Identities = 16/74 (21%), Positives = 40/74 (54%), Gaps = 1/74 (1%)

Query: 10  EKKKRIEK-RNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
           E++K +   + K+ +K    + + + E  +  K  K + K  +K +    +  K+E+Q +
Sbjct: 114 EREKELAAIKAKKSKKPLPVEPKAKVETKKVAKPSKVEVKPVEKDENVDPELLKREQQVL 173

Query: 69  KEIEESLKERKEKT 82
           K +E++  +++E+T
Sbjct: 174 KVVEKTASQKEEET 187



 Score = 29.3 bits (65), Expect = 3.5
 Identities = 15/60 (25%), Positives = 30/60 (50%), Gaps = 5/60 (8%)

Query: 29  QEEKEKEDVEEEKNKKKK-----KKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTL 83
           + EKE   ++ +K+KK        K + KK  K  K + K  +  + ++  L +R+++ L
Sbjct: 114 EREKELAAIKAKKSKKPLPVEPKAKVETKKVAKPSKVEVKPVEKDENVDPELLKREQQVL 173


>gnl|CDD|217838 pfam04004, Leo1, Leo1-like protein.  Members of this family are
           part of the Paf1/RNA polymerase II complex. The Paf1
           complex probably functions during the elongation phase
           of transcription. The Leo1 subunit of the yeast
           Paf1-complex binds RNA and contributes to complex
           recruitment. The subunit acts by co-ordinating
           co-transcriptional chromain modifications and helping
           recruitment of mRNA 3prime-end processing factors.
          Length = 312

 Score = 30.2 bits (68), Expect = 1.7
 Identities = 19/91 (20%), Positives = 38/91 (41%), Gaps = 7/91 (7%)

Query: 1   MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
           +  R+  +G  K  I  ++ E EK E E++E       E+K + +++++ ++K K K   
Sbjct: 216 LATRQRKSGGVKITIVGKDPEHEKREREKKE-------EQKLRARRRRQNREKMKNKPPN 268

Query: 61  KKKEEQSVKEIEESLKERKEKTLTEDSQCDT 91
           +                   +   E S C+T
Sbjct: 269 RPGHGSGSDSNVAKAATTYSEDEDEGSDCET 299


>gnl|CDD|236278 PRK08506, PRK08506, replicative DNA helicase; Provisional.
          Length = 472

 Score = 30.4 bits (69), Expect = 1.7
 Identities = 13/35 (37%), Positives = 22/35 (62%), Gaps = 3/35 (8%)

Query: 34  KEDV---EEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
           ++DV    EEK K+KK KK+ K++++   + K  E
Sbjct: 386 RDDVYKEREEKEKEKKAKKEGKEERRIHFQNKSIE 420



 Score = 29.2 bits (66), Expect = 4.0
 Identities = 15/38 (39%), Positives = 22/38 (57%), Gaps = 8/38 (21%)

Query: 28  EQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
           E+EEKEKE        KK KK+ K++++   + K  EE
Sbjct: 392 EREEKEKE--------KKAKKEGKEERRIHFQNKSIEE 421


>gnl|CDD|220605 pfam10156, Med17, Subunit 17 of Mediator complex.  This Mediator
           complex subunit was formerly known as Srb4 in yeasts or
           Trap80 in Drosophila and human. The Med17 subunit is
           located within the head domain and is essential for cell
           viability to the extent that a mutant strain of
           cerevisiae lacking it shows all RNA polymerase
           II-dependent transcription ceasing at non-permissive
           temperatures.
          Length = 454

 Score = 30.4 bits (69), Expect = 1.8
 Identities = 15/66 (22%), Positives = 32/66 (48%)

Query: 15  IEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES 74
           IE+ N+E     +  EE  +E++ +E  K    ++  +++  ++      E++   +EE 
Sbjct: 36  IERINQERGSFRDLTEESLREEIAKEAAKIDFSEESDEEEDDEEDDNDDSEENKDTVEEF 95

Query: 75  LKERKE 80
            K R E
Sbjct: 96  PKARDE 101


>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
          Length = 1463

 Score = 30.4 bits (69), Expect = 1.8
 Identities = 15/75 (20%), Positives = 28/75 (37%), Gaps = 2/75 (2%)

Query: 10   EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK--KKKKKKEEQS 67
            E   R++K     + EE    E  K     + ++ + K K+  ++ +   KK      + 
Sbjct: 928  ELIARLKKLLNNIDLEEGPSIEYVKLPELNKLHEVESKLKETSEEYEDLLKKSTILVREG 987

Query: 68   VKEIEESLKERKEKT 82
             K   E    +KE  
Sbjct: 988  NKANSELKNFKKELA 1002


>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein.  This entry is a highly
           conserved protein present in eukaryotes.
          Length = 680

 Score = 30.3 bits (68), Expect = 1.8
 Identities = 25/97 (25%), Positives = 42/97 (43%), Gaps = 12/97 (12%)

Query: 1   MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKN---------KKKKKKKKK 51
           M KR     + +   EK+  EE+K + E+EE       +            K+ K+  + 
Sbjct: 481 MEKRLKSEADSRVNAEKQLAEEKKRKKEEEETAARAAAQAAASREECAESLKQAKQDLEM 540

Query: 52  KKKKKKKKKKKKEEQSV---KEIEESLKERKEKTLTE 85
           + KK +   K KEE+     KE +E  K ++ +  TE
Sbjct: 541 EIKKLEHDLKLKEEECRMLEKEAQELRKYQESEKETE 577



 Score = 29.9 bits (67), Expect = 2.7
 Identities = 22/90 (24%), Positives = 44/90 (48%), Gaps = 7/90 (7%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
           +KR+    E   R   +     +E  E  ++ K+D+E E  K +   K K+++ +  +K+
Sbjct: 503 KKRKKEEEETAARAAAQAAASREECAESLKQAKQDLEMEIKKLEHDLKLKEEECRMLEKE 562

Query: 62  ----KKEEQSVKEIE---ESLKERKEKTLT 84
               +K ++S KE E    +L+  ++K L 
Sbjct: 563 AQELRKYQESEKETEVLMSALQAMQDKNLM 592


>gnl|CDD|203820 pfam07956, DUF1690, Protein of Unknown function (DUF1690).  Family
           of uncharacterized fungal proteins.
          Length = 142

 Score = 29.4 bits (66), Expect = 1.8
 Identities = 25/97 (25%), Positives = 42/97 (43%), Gaps = 9/97 (9%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDV-----EEEKNKKKKKKKKKKKKKKKKKKKKKE 64
           E  K+ E         ++ ++ +          + E   +K ++  K ++K+K KK K+ 
Sbjct: 50  ETLKKFEDTLNSSLLSDDSKDNENGLSSNLLNEKIESLTEKLEEFSKLEEKEKSKKLKEV 109

Query: 65  EQSVKEIEESLKERKEKTLTEDSQCDTMIPVFKKGVL 101
           E +  E+ E L E K K L     C   +  FKK VL
Sbjct: 110 EDARSELTECLLENKGKPLN----CYDEVEAFKKLVL 142


>gnl|CDD|153271 cd07307, BAR, The Bin/Amphiphysin/Rvs (BAR) domain, a dimerization
           module that binds membranes and detects membrane
           curvature.  BAR domains are dimerization, lipid binding
           and curvature sensing modules found in many different
           proteins with diverse functions including organelle
           biogenesis, membrane trafficking or remodeling, and cell
           division and migration. Mutations in BAR containing
           proteins have been linked to diseases and their
           inactivation in cells leads to altered membrane
           dynamics. A BAR domain with an additional N-terminal
           amphipathic helix (an N-BAR) can drive membrane
           curvature. These N-BAR domains are found in amphiphysins
           and endophilins, among others. BAR domains are also
           frequently found alongside domains that determine lipid
           specificity, such as the Pleckstrin Homology (PH) and
           Phox Homology (PX) domains which are present in beta
           centaurins (ACAPs and ASAPs) and sorting nexins,
           respectively. A FES-CIP4 Homology (FCH) domain together
           with a coiled coil region is called the F-BAR domain and
           is present in Pombe/Cdc15 homology (PCH) family
           proteins, which include Fes/Fes tyrosine kinases, PACSIN
           or syndapin, CIP4-like proteins, and srGAPs, among
           others. The Inverse (I)-BAR or IRSp53/MIM homology
           Domain (IMD) is found in multi-domain proteins, such as
           IRSp53 and MIM, that act as scaffolding proteins and
           transducers of a variety of signaling pathways that link
           membrane dynamics and the underlying actin cytoskeleton.
           BAR domains form dimers that bind to membranes, induce
           membrane bending and curvature, and may also be involved
           in protein-protein interactions. The I-BAR domain
           induces membrane protrusions in the opposite direction
           compared to classical BAR and F-BAR domains, which
           produce membrane invaginations. BAR domains that also
           serve as protein interaction domains include those of
           arfaptin and OPHN1-like proteins, among others, which
           bind to Rac and Rho GAP domains, respectively.
          Length = 194

 Score = 29.7 bits (67), Expect = 1.8
 Identities = 19/80 (23%), Positives = 42/80 (52%), Gaps = 1/80 (1%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           +  K I+KR K+ +K   + +   +E +++ + KKK   K  + +++ ++ K+K E+  +
Sbjct: 91  KDLKEIKKRRKKLDKARLDYDAA-REKLKKLRKKKKDSSKLAEAEEELQEAKEKYEELRE 149

Query: 70  EIEESLKERKEKTLTEDSQC 89
           E+ E L + +EK        
Sbjct: 150 ELIEDLNKLEEKRKELFLSL 169


>gnl|CDD|222918 PHA02687, PHA02687, ORF061 late transcription factor VLTF-4;
           Provisional.
          Length = 231

 Score = 30.0 bits (67), Expect = 1.8
 Identities = 20/69 (28%), Positives = 35/69 (50%), Gaps = 1/69 (1%)

Query: 21  EEEKEENEQEE-KEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERK 79
           E  ++E +QE+    E V     K  K++ KKK K  K +K  K  + +   ++   + +
Sbjct: 61  EPSEQECQQEQLPVPESVPPAPVKTPKRRTKKKAKADKPEKSPKAVEKLCPPDDRDDKNE 120

Query: 80  EKTLTEDSQ 88
           EK  TE++Q
Sbjct: 121 EKEPTEEAQ 129


>gnl|CDD|218550 pfam05311, Baculo_PP31, Baculovirus 33KDa late protein (PP31).
           Autographa californica nuclear polyhedrosis virus
           (AcMNPV) pp31 is a nuclear phosphoprotein that
           accumulates in the virogenic stroma, which is the viral
           replication centre in the infected-cell nucleus, binds
           to DNA, and serves as a late expression factor.
          Length = 267

 Score = 30.0 bits (68), Expect = 1.9
 Identities = 12/56 (21%), Positives = 21/56 (37%)

Query: 13  KRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
           K I  +  + +K                 ++K+K  KK  KK    KK+K++    
Sbjct: 195 KNILLKLNDNDKTAASSASVVPTPQSSSPSRKRKAPKKVAKKVAAAKKRKQKRSPP 250


>gnl|CDD|220184 pfam09332, Mcm10, Mcm10 replication factor.  Mcm10 is a eukaryotic
           DNA replication factor that regulates the stability and
           chromatin association of DNA polymerase alpha.
          Length = 346

 Score = 29.9 bits (67), Expect = 1.9
 Identities = 18/89 (20%), Positives = 37/89 (41%), Gaps = 4/89 (4%)

Query: 3   KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
           KR+     K++ I++R ++       Q    K   +EE   KK++  + ++ +K    K 
Sbjct: 158 KRKRSESGKRRAIDERVEKNASSAKRQ----KSPEDEEPAMKKRRYLESEEFQKILAAKS 213

Query: 63  KEEQSVKEIEESLKERKEKTLTEDSQCDT 91
                ++E E   +E     L +  Q + 
Sbjct: 214 SHTGLLEEAEREAQEEYFNPLEKKEQMEE 242


>gnl|CDD|227935 COG5648, NHP6B, Chromatin-associated proteins containing the HMG
           domain [Chromatin structure and dynamics].
          Length = 211

 Score = 29.4 bits (66), Expect = 1.9
 Identities = 20/77 (25%), Positives = 31/77 (40%), Gaps = 2/77 (2%)

Query: 9   GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
           GE  K + ++ KE   EE E   KE     E   ++K++  KK   K       + E  +
Sbjct: 99  GEVGKLLSEKWKELTDEEKEPYYKEANSDRERYQREKEEYNKKLPNKAPIGPFIENEPKI 158

Query: 69  KEIEESLKERKEKTLTE 85
           +   E      +K L E
Sbjct: 159 RPKVE--GPSPDKALVE 173



 Score = 28.7 bits (64), Expect = 4.4
 Identities = 20/95 (21%), Positives = 32/95 (33%), Gaps = 31/95 (32%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEK--------EDVEEEKNKKKKKKKK----------- 50
           EK+   ++ N + E+ + E+EE  K            E + K + K +            
Sbjct: 116 EKEPYYKEANSDRERYQREKEEYNKKLPNKAPIGPFIENEPKIRPKVEGPSPDKALVEET 175

Query: 51  ------------KKKKKKKKKKKKKEEQSVKEIEE 73
                        KKKK   K KK +E+      E
Sbjct: 176 KIISKAWSELDESKKKKYIDKYKKLKEEYDSFYPE 210


>gnl|CDD|233160 TIGR00869, sec62, protein translocation protein, Sec62 family.
           Members of the NSCC2 family have been sequenced from
           various yeast, fungal and animals species including
           Saccharomyces cerevisiae, Drosophila melanogaster and
           Homo sapiens. These proteins are the Sec62 proteins,
           believed to be associated with the Sec61 and Sec63
           constituents of the general protein secretary systems of
           yeast microsomes. They are also the non-selective cation
           (NS) channels of the mammalian cytoplasmic membrane. The
           yeast Sec62 protein has been shown to be essential for
           cell growth. The mammalian NS channel proteins has been
           implicated in platelet derived growth factor(PGDF)
           dependent single channel current in fibroblasts. These
           channels are essentially closed in serum deprived
           tissue-culture cells and are specifically opened by
           exposure to PDGF. These channels are reported to exhibit
           equal selectivity for Na+, K+ and Cs+ with low
           permeability to Ca2+, and no permeability to anions
           [Transport and binding proteins, Amino acids, peptides
           and amines].
          Length = 232

 Score = 29.9 bits (67), Expect = 2.0
 Identities = 13/26 (50%), Positives = 15/26 (57%)

Query: 42  NKKKKKKKKKKKKKKKKKKKKKEEQS 67
            K K   KKK K KK KKK+ K E +
Sbjct: 207 EKDKYSYKKKLKSKKLKKKQAKREAN 232



 Score = 28.7 bits (64), Expect = 4.3
 Identities = 13/23 (56%), Positives = 16/23 (69%)

Query: 40  EKNKKKKKKKKKKKKKKKKKKKK 62
           EK+K   KKK K KK KKK+ K+
Sbjct: 207 EKDKYSYKKKLKSKKLKKKQAKR 229



 Score = 27.6 bits (61), Expect = 9.5
 Identities = 12/23 (52%), Positives = 16/23 (69%)

Query: 38  EEEKNKKKKKKKKKKKKKKKKKK 60
           E++K   KKK K KK KKK+ K+
Sbjct: 207 EKDKYSYKKKLKSKKLKKKQAKR 229


>gnl|CDD|222855 PHA02529, O, capsid-scaffolding protein; Provisional.
          Length = 278

 Score = 29.7 bits (67), Expect = 2.0
 Identities = 20/83 (24%), Positives = 38/83 (45%), Gaps = 4/83 (4%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVE---EEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
           EK K++  R KE++   +EQ     E VE   E+ +    +    +  +  ++ K  EE 
Sbjct: 176 EKVKKLFSRKKEDDAMTDEQFADVHEAVEGVAEKIDHTSAQVATTETAEPPQEGKDTEEN 235

Query: 67  SVKEIEES-LKERKEKTLTEDSQ 88
                + S LKE  +K  ++ ++
Sbjct: 236 GETAEQFSALKETLDKLASKFNE 258


>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740).  This
           family consists of several uncharacterized plant
           proteins of unknown function.
          Length = 565

 Score = 30.0 bits (67), Expect = 2.0
 Identities = 19/74 (25%), Positives = 36/74 (48%), Gaps = 6/74 (8%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK------KK 63
           E +K I      EE+EE E EE E+   +E      +K +++++++ K  K       + 
Sbjct: 129 EPRKPIVPDLVLEEEEEVEMEEDEEYYEKEPGKVVDEKSEEEEEEELKTMKDFIDLESQT 188

Query: 64  EEQSVKEIEESLKE 77
           ++ SVK+  +S   
Sbjct: 189 KKPSVKDNGKSFWS 202


>gnl|CDD|225664 COG3122, COG3122, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 215

 Score = 29.5 bits (66), Expect = 2.0
 Identities = 11/51 (21%), Positives = 29/51 (56%), Gaps = 2/51 (3%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
            KK R++ R  E  +   E+ + E+ + +++ ++++K+    K+ K + K+
Sbjct: 67  AKKSRVDAR--EAYRIAAEENKAEQLERDKQLSEQQKQAALAKEYKAQVKQ 115


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 30.4 bits (68), Expect = 2.0
 Identities = 20/100 (20%), Positives = 41/100 (41%), Gaps = 15/100 (15%)

Query: 8    AGEKKKRIEKRNKEEEKEENEQEEKEKE---------DVEEEKNKKKKKKKKKKKKKKKK 58
              E     E+++ E+    NE +   KE         D +E++++++         + + 
Sbjct: 3912 NEEDLLETEQKSNEQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSDDVGIDDEIQP 3971

Query: 59   KKKKKEEQSVKEIE-----ESLK-ERKEKTLTEDSQCDTM 92
              ++   Q   E E     E LK + KE  +++DS  + M
Sbjct: 3972 DIQENNSQPPPENEDLDLPEDLKLDEKEGDVSKDSDLEDM 4011



 Score = 28.4 bits (63), Expect = 8.2
 Identities = 16/76 (21%), Positives = 35/76 (46%), Gaps = 3/76 (3%)

Query: 8    AGEKKKRIEKRNKEEEKEENEQE-EKEKEDVEEEKNKKKKKK--KKKKKKKKKKKKKKKE 64
             G  K  +   N+  + EE  Q+ ++E  D+ E+ +    +K   +  ++   + ++K  
Sbjct: 3865 NGVTKDSVVSENENSDSEEENQDLDEEVNDIPEDLSNSLNEKLWDEPNEEDLLETEQKSN 3924

Query: 65   EQSVKEIEESLKERKE 80
            EQS    E  L  +++
Sbjct: 3925 EQSAANNESDLVSKED 3940


>gnl|CDD|221429 pfam12118, SprA-related, SprA-related family.  This protein is
           found in bacteria. Proteins in this family are typically
           between 234 to 465 amino acids in length. There is a
           conserved GEV sequence motif.Most members are annotated
           as being SprA-related.
          Length = 261

 Score = 29.8 bits (67), Expect = 2.1
 Identities = 16/58 (27%), Positives = 36/58 (62%), Gaps = 4/58 (6%)

Query: 21  EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKER 78
            +E ++   E  ++E V EE+ +K + +  K+  +K+++++++EE+ V+E    LK R
Sbjct: 66  SQELQKQASESNDEEVVGEEEPEKDQAEAGKQLSQKQQEQQQQEEKQVRE----LKAR 119


>gnl|CDD|218675 pfam05648, PEX11, Peroxisomal biogenesis factor 11 (PEX11).  This
           family consists of several peroxisomal biogenesis factor
           11 (PEX11) proteins from several eukaryotic species. The
           PEX11 peroxisomal membrane proteins promote peroxisome
           division in multiple eukaryotes.
          Length = 225

 Score = 29.5 bits (67), Expect = 2.1
 Identities = 14/50 (28%), Positives = 27/50 (54%), Gaps = 3/50 (6%)

Query: 48  KKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKT--LTEDSQCDTMIPV 95
           ++K+KK KK+K  + +     K +++ LK+R      L ++  CD  I +
Sbjct: 148 QEKEKKLKKEKDSEGESTSERKLLKKILKKRPAALLDLVKNL-CDLFIAL 196


>gnl|CDD|218274 pfam04801, Sin_N, Sin-like protein conserved region.  Family of
           higher eukaryotic proteins. SIN was identified as a
           protein that interacts specifically with SXL (sex
           lethal) in a yeast two-hybrid assay. The interaction is
           mediated by one of the SXL RNA binding domains.
          Length = 422

 Score = 30.1 bits (68), Expect = 2.1
 Identities = 15/59 (25%), Positives = 36/59 (61%)

Query: 19  NKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKE 77
           +K++++++ E    E ED +EE  ++ K+   K  + + +K++K+ EQS   +++ + E
Sbjct: 140 DKKDKRKKEEDTADEDEDPDEEAEEELKQVTVKFSRPETEKQRKRREQSYNFLQKKIAE 198



 Score = 28.9 bits (65), Expect = 4.7
 Identities = 18/64 (28%), Positives = 33/64 (51%), Gaps = 1/64 (1%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
           +KR KEE+  + +++  E E  EE K    K  + + +K++K++++       K  EE  
Sbjct: 143 DKRKKEEDTADEDEDPDE-EAEEELKQVTVKFSRPETEKQRKRREQSYNFLQKKIAEEPW 201

Query: 76  KERK 79
            E K
Sbjct: 202 IELK 205


>gnl|CDD|177400 PHA02571, a-gt.4, hypothetical protein; Provisional.
          Length = 109

 Score = 28.5 bits (64), Expect = 2.1
 Identities = 10/43 (23%), Positives = 24/43 (55%)

Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
           + +E  ++E +E     + +N+ + +KK  K  KK +++ K+
Sbjct: 6  TDVEELTDEEVEELLSELQARNEAEAEKKAAKILKKNRREIKR 48


>gnl|CDD|217023 pfam02414, Borrelia_orfA, Borrelia ORF-A.  This protein is encoded
           by an open reading frame in plasmid borne DNA repeats of
           Borrelia species. This protein is known as ORF-A. The
           function of this putative protein is unknown.
          Length = 285

 Score = 29.7 bits (67), Expect = 2.2
 Identities = 16/78 (20%), Positives = 38/78 (48%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
            K+K ++      + +E E++  +K+ +E+   K  +K K K     +K K K   + + 
Sbjct: 200 NKQKTLKNILYNLKDKELEKKGYKKKQIEKFFQKIYEKYKNKPHFIIEKNKYKDLNKIIG 259

Query: 70  EIEESLKERKEKTLTEDS 87
           + ++S K++K+ +     
Sbjct: 260 KFKKSFKKKKKNSKKNYE 277


>gnl|CDD|233830 TIGR02350, prok_dnaK, chaperone protein DnaK.  Members of this
           family are the chaperone DnaK, of the DnaK-DnaJ-GrpE
           chaperone system. All members of the seed alignment were
           taken from completely sequenced bacterial or archaeal
           genomes and (except for Mycoplasma sequence) found
           clustered with other genes of this systems. This model
           excludes DnaK homologs that are not DnaK itself, such as
           the heat shock cognate protein HscA (TIGR01991).
           However, it is not designed to distinguish among DnaK
           paralogs in eukaryotes. Note that a number of dnaK genes
           have shadow ORFs in the same reverse (relative to dnaK)
           reading frame, a few of which have been assigned
           glutamate dehydrogenase activity. The significance of
           this observation is unclear; lengths of such shadow ORFs
           are highly variable as if the presumptive protein
           product is not conserved [Protein fate, Protein folding
           and stabilization].
          Length = 595

 Score = 30.0 bits (68), Expect = 2.2
 Identities = 18/70 (25%), Positives = 34/70 (48%), Gaps = 10/70 (14%)

Query: 10  EKKKRIEKRNKEE------EKEENEQEEK----EKEDVEEEKNKKKKKKKKKKKKKKKKK 59
           ++K+ IE RN  +      EK   E  +K    EKE +E+   + K+  K +  ++ K K
Sbjct: 519 KRKEEIEARNNADSLAYQAEKTLKEAGDKLPAEEKEKIEKAVAELKEALKGEDVEEIKAK 578

Query: 60  KKKKEEQSVK 69
            ++ ++   K
Sbjct: 579 TEELQQALQK 588



 Score = 29.6 bits (67), Expect = 3.4
 Identities = 26/92 (28%), Positives = 50/92 (54%), Gaps = 11/92 (11%)

Query: 6   GGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNK------KKKKKKKKKKKKKKKK 59
             +G  ++ IE+  KE   E N +E+K++++  E +N       + +K  K+   K   +
Sbjct: 494 ASSGLSEEEIERMVKE--AEANAEEDKKRKEEIEARNNADSLAYQAEKTLKEAGDKLPAE 551

Query: 60  KKKKEEQSVKEIEESLKER---KEKTLTEDSQ 88
           +K+K E++V E++E+LK     + K  TE+ Q
Sbjct: 552 EKEKIEKAVAELKEALKGEDVEEIKAKTEELQ 583


>gnl|CDD|221821 pfam12871, PRP38_assoc, Pre-mRNA-splicing factor 38-associated
          hydrophilic C-term.  This domain is a hydrophilic
          region found at the C-terminus of plant and metazoan
          pre-mRNA-splicing factor 38 proteins. The function is
          not known.
          Length = 97

 Score = 28.2 bits (63), Expect = 2.2
 Identities = 9/47 (19%), Positives = 31/47 (65%)

Query: 19 NKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
          ++EEE EE E +E+ +   E + ++ ++  +++ +++ +++K+ ++ 
Sbjct: 11 DEEEESEEEEDDEEIRRKAERDVDRGRRSPRRRTRRRSRRRKRSRKR 57


>gnl|CDD|215590 PLN03123, PLN03123, poly [ADP-ribose] polymerase; Provisional.
          Length = 981

 Score = 30.1 bits (68), Expect = 2.3
 Identities = 21/71 (29%), Positives = 36/71 (50%), Gaps = 13/71 (18%)

Query: 20  KEEEKEENEQEEK-----EKEDVEEEKNKKKKKKKK------KKKKKKKKKKKKKEEQS- 67
           KEE+ EE +QE K     +K+   ++K+KK K  +         +KK    + K E QS 
Sbjct: 191 KEEKAEERKQESKKGAKRKKDASGDDKSKKAKTDRDVSTSTAASQKKSSDLESKLEAQSK 250

Query: 68  -VKEIEESLKE 77
            +  +++ LK+
Sbjct: 251 ELWSLKDDLKK 261



 Score = 29.0 bits (65), Expect = 5.0
 Identities = 20/82 (24%), Positives = 37/82 (45%), Gaps = 9/82 (10%)

Query: 11  KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKK--------KKKKKKKKKK 62
           KK   E + ++ E+ + E ++  K   +   + K KK K  +         +KK    + 
Sbjct: 184 KKSPSEAKEEKAEERKQESKKGAKRKKDASGDDKSKKAKTDRDVSTSTAASQKKSSDLES 243

Query: 63  KEEQSVKEIEESLKERKEKTLT 84
           K E   KE+  SLK+  +K ++
Sbjct: 244 KLEAQSKEL-WSLKDDLKKHVS 264


>gnl|CDD|217145 pfam02616, ScpA_ScpB, ScpA/B protein.  ScpA and ScpB participate in
           chromosomal partition during cell division. It may act
           via the formation of a condensin-like complex containing
           smc that pull DNA away from mid-cell into both cell
           halves. These proteins are part of the Kleisin
           superfamily.
          Length = 177

 Score = 29.3 bits (66), Expect = 2.3
 Identities = 16/80 (20%), Positives = 39/80 (48%), Gaps = 6/80 (7%)

Query: 12  KKRIEKRNKEEE--KEENEQEEKEKEDVEEEKNKK----KKKKKKKKKKKKKKKKKKKEE 65
           +   +K    +E  K   E+  + ++ V +  +      K+K+K + +KK + + +   E
Sbjct: 56  RSFSKKPVTLDEFIKTLEEELVEVEKSVLDLIHAFNKLVKRKRKNRLRKKNETELEVSVE 115

Query: 66  QSVKEIEESLKERKEKTLTE 85
           + ++E+   L  R ++ LT+
Sbjct: 116 EVLEELLAFLGVRLKEVLTK 135


>gnl|CDD|219083 pfam06548, Kinesin-related, Kinesin-related.  This family
           represents a region within kinesin-related proteins from
           higher plants. Many family members also contain the
           pfam00225 domain. Kinesins are ATP-driven microtubule
           motor proteins that produce directed force. Some family
           members are associated with the phragmoplast, a
           structure composed mainly of microtubules that executes
           cytokinesis in higher plants.
          Length = 488

 Score = 29.9 bits (67), Expect = 2.3
 Identities = 13/35 (37%), Positives = 19/35 (54%), Gaps = 1/35 (2%)

Query: 23  EKEENEQEEKEK-EDVEEEKNKKKKKKKKKKKKKK 56
           E EE     +E+    E+E  K KK+ +K K+K K
Sbjct: 438 EAEEAATLAQERAMLAEQENEKLKKQIEKLKRKHK 472


>gnl|CDD|224559 COG1645, COG1645, Uncharacterized Zn-finger containing protein
           [General function prediction only].
          Length = 131

 Score = 28.6 bits (64), Expect = 2.3
 Identities = 10/58 (17%), Positives = 27/58 (46%)

Query: 23  EKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
              E   EE+E+E   E + + ++ + +      + KK+  E+ + +++   ++  K 
Sbjct: 51  GYREVVVEEEEEEVEAEVQEQLRRSRPELPDDSDELKKEIDEDPASEKLLSVIETLKR 108


>gnl|CDD|214818 smart00784, SPT2, SPT2 chromatin protein.  This entry includes the
           Saccharomyces cerevisiae protein SPT2 which is a
           chromatin protein involved in transcriptional
           regulation.
          Length = 106

 Score = 28.1 bits (63), Expect = 2.4
 Identities = 10/32 (31%), Positives = 21/32 (65%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKK 47
           E+R+    + E+ +EE+ +++ E EK  +K+K
Sbjct: 75  ERRSARLARLEDREEERLEKEEEREKRARKRK 106



 Score = 27.3 bits (61), Expect = 4.7
 Identities = 11/34 (32%), Positives = 23/34 (67%), Gaps = 2/34 (5%)

Query: 25  EENEQEEK--EKEDVEEEKNKKKKKKKKKKKKKK 56
           EE  +  +    ED EEE+ +K+++++K+ +K+K
Sbjct: 73  EEERRSARLARLEDREEERLEKEEEREKRARKRK 106


>gnl|CDD|218597 pfam05466, BASP1, Brain acid soluble protein 1 (BASP1 protein).
          This family consists of several brain acid soluble
          protein 1 (BASP1) or neuronal axonal membrane protein
          NAP-22. The BASP1 is a neuron enriched Ca(2+)-dependent
          calmodulin-binding protein of unknown function.
          Length = 233

 Score = 29.4 bits (65), Expect = 2.4
 Identities = 25/95 (26%), Positives = 43/95 (45%), Gaps = 12/95 (12%)

Query: 6  GGAGEKKKRIEKRNKEEEKEEN--------EQEEKEKEDVEE----EKNKKKKKKKKKKK 53
          GG   KKK+    N E+ K+++        E+E   KE+ E     E  + K+ K++K  
Sbjct: 1  GGKLSKKKKGYNVNDEKAKDKDKKAEGAATEEEGTPKENEEAQAAAETTEVKEAKEEKPD 60

Query: 54 KKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQ 88
          K  +    K EE+  ++   + KE   K   E ++
Sbjct: 61 KDAQDTANKTEEKEGEKEAAAAKEEAPKAEPEKTE 95



 Score = 29.4 bits (65), Expect = 2.4
 Identities = 18/72 (25%), Positives = 37/72 (51%)

Query: 3  KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
          K+  GA  +++   K N+E +      E KE ++ + +K+ +    K ++K+ +K+    
Sbjct: 23 KKAEGAATEEEGTPKENEEAQAAAETTEVKEAKEEKPDKDAQDTANKTEEKEGEKEAAAA 82

Query: 63 KEEQSVKEIEES 74
          KEE    E E++
Sbjct: 83 KEEAPKAEPEKT 94


>gnl|CDD|221408 pfam12072, DUF3552, Domain of unknown function (DUF3552).  This
           presumed domain is functionally uncharacterized. This
           domain is found in bacteria, archaea and eukaryotes.
           This domain is about 200 amino acids in length. This
           domain is found associated with pfam00013, pfam01966.
           This domain has a single completely conserved residue A
           that may be functionally important.
          Length = 201

 Score = 29.4 bits (67), Expect = 2.4
 Identities = 25/93 (26%), Positives = 46/93 (49%), Gaps = 15/93 (16%)

Query: 8   AGEKKKRIEKRNKEE---EKEENEQEEKEKED---------VEEEKNKKKKK---KKKKK 52
           A   KK      KEE    + E E+E KE+ +         +++E+   +K    +KK++
Sbjct: 47  AEALKKEALLEAKEEIHKLRAEAERELKERRNELQRQEKRLLQKEETLDRKDESLEKKEE 106

Query: 53  KKKKKKKKKKKEEQSVKEIEESLKERKEKTLTE 85
             ++K+K+    +Q ++E EE L+E  E+   E
Sbjct: 107 SLEEKEKELAARQQQLEEKEEELEELIEEQQQE 139



 Score = 28.7 bits (65), Expect = 3.5
 Identities = 21/92 (22%), Positives = 49/92 (53%), Gaps = 8/92 (8%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKK--------KKKKKKK 61
           E KK  E   KE   E  E+  K + + E E  +++ + ++++K+         +K +  
Sbjct: 42  EAKKEAEALKKEALLEAKEEIHKLRAEAERELKERRNELQRQEKRLLQKEETLDRKDESL 101

Query: 62  KKEEQSVKEIEESLKERKEKTLTEDSQCDTMI 93
           +K+E+S++E E+ L  R+++   ++ + + +I
Sbjct: 102 EKKEESLEEKEKELAARQQQLEEKEEELEELI 133


>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
          [Translation, ribosomal structure and biogenesis].
          Length = 109

 Score = 28.1 bits (63), Expect = 2.4
 Identities = 10/21 (47%), Positives = 13/21 (61%)

Query: 20 KEEEKEENEQEEKEKEDVEEE 40
             E +E E+EEKE+E  EE 
Sbjct: 79 AAAEADEAEEEEKEEEAEEES 99



 Score = 28.1 bits (63), Expect = 2.7
 Identities = 10/29 (34%), Positives = 15/29 (51%)

Query: 8   AGEKKKRIEKRNKEEEKEENEQEEKEKED 36
           AG +        +EEEKEE  +EE + + 
Sbjct: 75  AGAEAAAEADEAEEEEKEEEAEEESDDDM 103


>gnl|CDD|227479 COG5150, COG5150, Class 2 transcription repressor NC2, beta subunit
           (Dr1) [Transcription].
          Length = 148

 Score = 28.8 bits (64), Expect = 2.4
 Identities = 23/80 (28%), Positives = 38/80 (47%), Gaps = 3/80 (3%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E KK I   +  +  E  E EE  +  +EE +N K  +K+K+ K  K K      E+ ++
Sbjct: 61  EAKKTIAYEHVIKALENLEFEEYIESCMEEHENYKSYQKQKESKISKFKDSGLSMEELLR 120

Query: 70  EIEESL---KERKEKTLTED 86
           + EE     + R E +  +D
Sbjct: 121 QQEELFQNARLRFEHSFEDD 140


>gnl|CDD|193409 pfam12936, Kri1_C, KRI1-like family C-terminal.  The yeast member
          of this family (Kri1p) is found to be required for 40S
          ribosome biogenesis in the nucleolus. This is the
          C-terminal domain of the family.
          Length = 93

 Score = 27.9 bits (63), Expect = 2.4
 Identities = 10/26 (38%), Positives = 17/26 (65%)

Query: 38 EEEKNKKKKKKKKKKKKKKKKKKKKK 63
          ++E+ +K KKK  KK + ++ KKK  
Sbjct: 68 DKEERRKDKKKYGKKARLREWKKKVF 93


>gnl|CDD|236544 PRK09506, mrcB, bifunctional glycosyl transferase/transpeptidase;
          Reviewed.
          Length = 830

 Score = 30.1 bits (68), Expect = 2.4
 Identities = 9/44 (20%), Positives = 24/44 (54%)

Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKK 54
           K+++ +R   ++ + ++ ++ E E+    K K K +K + K+ 
Sbjct: 19 VKQKVSRRRYRDDDDYDDYDDYEDEEPMPRKGKGKGRKPRGKRG 62


>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
           [Intracellular trafficking and secretion].
          Length = 776

 Score = 30.0 bits (67), Expect = 2.4
 Identities = 16/49 (32%), Positives = 22/49 (44%), Gaps = 8/49 (16%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKE--------DVEEEKNKKKKKKKKKK 52
           K  IE RN EE + E E E+ E E        D +E +N  +   +K  
Sbjct: 372 KMEIENRNPEESEHEEEVEDYEDENDHSKRICDDDELENHFRAADEKNS 420


>gnl|CDD|215038 PLN00040, PLN00040, Protein MAK16 homolog; Provisional.
          Length = 233

 Score = 29.4 bits (66), Expect = 2.4
 Identities = 9/37 (24%), Positives = 16/37 (43%)

Query: 20  KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKK 56
           K   K    +E +E E+   + +K    KKK +   +
Sbjct: 183 KSYNKVLEMEEVEEAEEELPKSDKNPNSKKKSRVHVE 219


>gnl|CDD|222791 PHA00431, PHA00431, internal virion protein C.
          Length = 746

 Score = 29.8 bits (67), Expect = 2.4
 Identities = 12/57 (21%), Positives = 31/57 (54%), Gaps = 2/57 (3%)

Query: 9   GEKKKRIEKRNKEEEKEENEQ--EEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
           G+ + R +K    +  +EN+Q   E+ +E   ++ NK+       + +++ ++K+K+
Sbjct: 679 GQVRVRYDKELLSKMYQENQQKLAEEAREKALKDANKRAPIVAATQAREEVREKRKQ 735



 Score = 29.0 bits (65), Expect = 4.3
 Identities = 6/48 (12%), Positives = 21/48 (43%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKK 59
            K  ++  ++  +E  E+  K+           + +++ ++K+K+   
Sbjct: 691 SKMYQENQQKLAEEAREKALKDANKRAPIVAATQAREEVREKRKQTSG 738



 Score = 29.0 bits (65), Expect = 4.5
 Identities = 7/43 (16%), Positives = 20/43 (46%)

Query: 20  KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
           + ++K   E  EK  +D  +        + +++ ++K+K+   
Sbjct: 696 ENQQKLAEEAREKALKDANKRAPIVAATQAREEVREKRKQTSG 738


>gnl|CDD|224337 COG1419, FlhF, Flagellar GTP-binding protein [Cell motility and
           secretion].
          Length = 407

 Score = 29.6 bits (67), Expect = 2.4
 Identities = 15/78 (19%), Positives = 28/78 (35%), Gaps = 5/78 (6%)

Query: 12  KKRIEK----RNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
           KK             E  +    + E    +  +  K K  + +  K   +KK+++ +  
Sbjct: 35  KKGGFLGLLFSKTAVEVTKLAAVDSELRKFQTREAAKVKDAEAQLLKDPAEKKREERKA- 93

Query: 68  VKEIEESLKERKEKTLTE 85
            K+IE S     E+   E
Sbjct: 94  AKKIERSTPSLIERKTQE 111



 Score = 28.1 bits (63), Expect = 7.4
 Identities = 10/77 (12%), Positives = 31/77 (40%), Gaps = 3/77 (3%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           EKK+   K  K+ E+      E++    +E K+  ++  +  + +K   ++ ++      
Sbjct: 85  EKKREERKAAKKIERSTPSLIERK---TQEVKDSGEEIAEMMRDEKVPIRELEEIPPEFV 141

Query: 70  EIEESLKERKEKTLTED 86
            + +   +   +    +
Sbjct: 142 ALYKQEIQSPTRLNLIN 158


>gnl|CDD|219174 pfam06780, Erp_C, Erp protein C-terminus.  This family represents
           the C-terminus of bacterial Erp proteins that seem to be
           specific to Borrelia burgdorferi (a causative agent of
           Lyme disease). Borrelia Erp proteins are particularly
           heterogeneous, which might enable them to interact with
           a wide variety of host components.
          Length = 141

 Score = 28.9 bits (65), Expect = 2.5
 Identities = 18/60 (30%), Positives = 30/60 (50%)

Query: 22  EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
           EE+E++   +  KE  +   + + K  +  K   K  + K KE   V EI+E L++ K K
Sbjct: 58  EEEEDSGLGKLLKELEDARDSLRTKLNEGNKPYTKDNEPKLKENVKVSEIKEDLEKLKSK 117


>gnl|CDD|218584 pfam05422, SIN1, Stress-activated map kinase interacting protein 1
           (SIN1).  This family consists of several
           stress-activated map kinase interacting protein 1
           (MAPKAP1 OR SIN1) sequences. The fission yeast Sty1/Spc1
           mitogen-activated protein (MAP) kinase is a member of
           the eukaryotic stress-activated MAP kinase (SAPK)
           family. Sin1 interacts with Sty1/Spc1. Cells lacking
           Sin1 display many, but not all, of the phenotypes of
           cells lacking the Sty1/Spc1 MAP kinase including
           sterility, multiple stress sensitivity and a cell-cycle
           delay. Sin1 is phosphorylated after stress but this is
           not Sty1/Spc1-dependent.
          Length = 482

 Score = 29.6 bits (66), Expect = 2.5
 Identities = 11/90 (12%), Positives = 30/90 (33%), Gaps = 9/90 (10%)

Query: 1   MRKRRGGAGEKKK-------RIEKRNKEEEKEENEQEEKEKEDVEEEKN--KKKKKKKKK 51
           + +R   A E ++          K++  + K     +  +   + E      KK+     
Sbjct: 93  IHRRSNTAQELERLDQAVNLTSAKQSAIKIKSSVSTDYDDLRSISELDFLFSKKELPLTT 152

Query: 52  KKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
                K +     +  +  ++  L+ + E+
Sbjct: 153 HNTVNKARSVSNAKAPISGLQSLLEHKLEE 182


>gnl|CDD|227674 COG5384, Mpp10, U3 small nucleolar ribonucleoprotein component
           [Translation, ribosomal structure and biogenesis].
          Length = 569

 Score = 29.7 bits (66), Expect = 2.5
 Identities = 17/78 (21%), Positives = 38/78 (48%), Gaps = 5/78 (6%)

Query: 11  KKKRIEKRNKEEEKEE-NEQEEKEKEDV----EEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
              +  K ++E ++EE +   +K K D+    E+E N +   +   K     +K++ + +
Sbjct: 226 SDVKDPKEDEELDEEEHDSAMDKVKLDLFADEEDEPNAEGVGEASDKNLSSFEKQQIEMD 285

Query: 66  QSVKEIEESLKERKEKTL 83
           + ++E+E+ L   KE   
Sbjct: 286 EQIEELEKELVAPKEWKY 303



 Score = 28.1 bits (62), Expect = 8.9
 Identities = 22/84 (26%), Positives = 34/84 (40%), Gaps = 19/84 (22%)

Query: 15  IEKRNK-----EEEKEENEQEEKE--------KEDVEEEKNK------KKKKKKKKKKKK 55
           +EK N+     E+  E +E  E E          D EEE+         K  K+  KK  
Sbjct: 167 LEKFNRDTLAAEDSNEASEGSEDEDIDYFQDMPSDDEEEEAIYYEDFFDKPTKEPVKKHS 226

Query: 56  KKKKKKKKEEQSVKEIEESLKERK 79
             K  K+ EE   +E + ++ + K
Sbjct: 227 DVKDPKEDEELDEEEHDSAMDKVK 250


>gnl|CDD|222649 pfam14284, PcfJ, PcfJ-like protein.  The PcfJ-like protein family
          includes the E. faecalis PcfJ protein, which is
          functionally uncharacterized. It is found in bacteria
          and viruses, and is typically between 159 and 170 amino
          acids in length. There is a conserved HCV sequence
          motif.
          Length = 169

 Score = 28.8 bits (65), Expect = 2.5
 Identities = 9/35 (25%), Positives = 19/35 (54%)

Query: 13 KRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKK 47
          + + K   ++EKE+ E+  K   ++E+   + K K
Sbjct: 33 RAVAKYQAKKEKEKAEKMRKRYPELEKIFRELKSK 67



 Score = 28.8 bits (65), Expect = 2.8
 Identities = 8/48 (16%), Positives = 25/48 (52%)

Query: 24 KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
          K  +++   + +  +E++  +K +K+  + +K  ++ K K E +  + 
Sbjct: 28 KAAHDRAVAKYQAKKEKEKAEKMRKRYPELEKIFRELKSKFEFTDGDY 75



 Score = 27.3 bits (61), Expect = 9.1
 Identities = 10/42 (23%), Positives = 21/42 (50%)

Query: 23 EKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
          +   +    K +   E+EK +K +K+  + +K  ++ K K E
Sbjct: 28 KAAHDRAVAKYQAKKEKEKAEKMRKRYPELEKIFRELKSKFE 69


>gnl|CDD|226369 COG3851, UhpB, Signal transduction histidine kinase,
           glucose-6-phosphate specific [Signal transduction
           mechanisms].
          Length = 497

 Score = 29.7 bits (67), Expect = 2.6
 Identities = 14/41 (34%), Positives = 19/41 (46%), Gaps = 2/41 (4%)

Query: 228 TWILLNTQGFISQGIGLNNKEYLIWYILFIVFVTYAMLPLP 268
           TW+ L     I   I L  K +L+WY+L  +      L LP
Sbjct: 162 TWLPLGPS-LIHHPINLRGK-HLVWYLLLFIIGLLLQLGLP 200


>gnl|CDD|217970 pfam04220, YihI, Der GTPase activator (YihI).  YihI activates the
          GTPase activity of Der, a 50S ribosomal subunit
          stability factor. The stimulation is specific to Der as
          YihI does not stimulate the GTPase activity of Era or
          ObgE. The interaction of YihI with Der requires only
          the C-terminal 78 amino acids of YihI. A yihI deletion
          mutant is viable and shows a shorter lag period, but
          the same post-lag growth rate as a wild-type strain.
          yihI is expressed during the lag period. Overexpression
          of yihI inhibits cell growth and biogenesis of the 50S
          ribosomal subunit. YihI is an unusual, highly
          hydrophilic protein with an uneven distribution of
          charged residues, resulting in an N-terminal region
          with high pI and a C-terminal region with low pI.
          Length = 169

 Score = 28.9 bits (65), Expect = 2.6
 Identities = 19/66 (28%), Positives = 31/66 (46%), Gaps = 17/66 (25%)

Query: 10 EKKKRIEK------RNKEEEKEENEQEEKEKED-----------VEEEKNKKKKKKKKKK 52
           K+K+  K      R+ EE + + ++   +K+D           + E K K +K+  KKK
Sbjct: 29 RKRKKKRKGLKSGSRHNEESESQKQKGAAQKKDPRIGSKKPVPLIVEFKVKPEKQAPKKK 88

Query: 53 KKKKKK 58
          K KK K
Sbjct: 89 KVKKPK 94


>gnl|CDD|217452 pfam03250, Tropomodulin, Tropomodulin.  Tropomodulin is a novel
           tropomyosin regulatory protein that binds to the end of
           erythrocyte tropomyosin and blocks head-to-tail
           association of tropomyosin along actin filaments.
           Limited proteolysis shows this protein is composed of
           two domains. The amino terminal domain contains the
           tropomyosin binding function.
          Length = 147

 Score = 28.9 bits (65), Expect = 2.7
 Identities = 16/60 (26%), Positives = 29/60 (48%), Gaps = 2/60 (3%)

Query: 28  EQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV--KEIEESLKERKEKTLTE 85
           E++  E +D E+      +K+ K    K+K ++   EEQ     E+EE+L    +  L +
Sbjct: 74  EKQALEHKDREDLVPFTGEKRGKVFVPKQKPRELPAEEQITLDPELEEALANATDAELCD 133


>gnl|CDD|217417 pfam03194, LUC7, LUC7 N_terminus.  This family contains the N
           terminal region of several LUC7 protein homologues and
           only contains eukaryotic proteins. LUC7 has been shown
           to be a U1 snRNA associated protein with a role in
           splice site recognition. The family also contains human
           and mouse LUC7 like (LUC7L) proteins and human cisplatin
           resistance-associated overexpressed protein (CROP).
          Length = 252

 Score = 29.3 bits (66), Expect = 2.7
 Identities = 17/85 (20%), Positives = 40/85 (47%), Gaps = 4/85 (4%)

Query: 10  EKKKRIEKRNKEEEKEE--NEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
           ++K    K+  E  +EE      + + E++  E +++  K   + +   ++ K  +  + 
Sbjct: 96  DRKIDKAKQRLELTQEEQTKIAADSKAEEL-AELDEEIGKLLAEAEALGEEGKVDEAMKL 154

Query: 68  VKEIEESLKERKEKTLTEDSQCDTM 92
           +KE+EE LK +K++    D   +  
Sbjct: 155 MKEVEE-LKAKKKELEDSDEVRNAA 178


>gnl|CDD|220121 pfam09135, Alb1, Alb1.  Alb1 is a nuclear shuttling factor
          involved in ribosome biogenesis.
          Length = 112

 Score = 28.2 bits (63), Expect = 2.7
 Identities = 11/26 (42%), Positives = 21/26 (80%)

Query: 39 EEKNKKKKKKKKKKKKKKKKKKKKKE 64
          E  +KKK++KKK K+  +K++K++K+
Sbjct: 44 EGVSKKKQRKKKPKRLTRKQRKRQKK 69



 Score = 28.2 bits (63), Expect = 3.0
 Identities = 11/30 (36%), Positives = 19/30 (63%)

Query: 34 KEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
              E    KK++KKK K+  +K++K++KK
Sbjct: 40 ASKNEGVSKKKQRKKKPKRLTRKQRKRQKK 69



 Score = 27.5 bits (61), Expect = 5.1
 Identities = 9/26 (34%), Positives = 19/26 (73%)

Query: 42 NKKKKKKKKKKKKKKKKKKKKKEEQS 67
             KKK++KKK K+  +K++K++++ 
Sbjct: 45 GVSKKKQRKKKPKRLTRKQRKRQKKG 70



 Score = 27.1 bits (60), Expect = 7.6
 Identities = 10/25 (40%), Positives = 19/25 (76%)

Query: 40 EKNKKKKKKKKKKKKKKKKKKKKKE 64
          +K ++KKK K+  +K++K++KK  E
Sbjct: 48 KKKQRKKKPKRLTRKQRKRQKKGLE 72



 Score = 27.1 bits (60), Expect = 7.9
 Identities = 11/28 (39%), Positives = 21/28 (75%)

Query: 41 KNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
          K K++KKK K+  +K++K++KK  E++ 
Sbjct: 48 KKKQRKKKPKRLTRKQRKRQKKGLERAE 75


>gnl|CDD|219355 pfam07267, Nucleo_P87, Nucleopolyhedrovirus capsid protein P87.
           This family consists of several Nucleopolyhedrovirus
           capsid protein P87 sequences. P87 is expressed late in
           infection and concentrated in infected cell nuclei.
          Length = 606

 Score = 29.9 bits (67), Expect = 2.7
 Identities = 18/75 (24%), Positives = 31/75 (41%), Gaps = 8/75 (10%)

Query: 7   GAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
           G   K+KR             E    E ED  +E     +K++K+++++ K   + K  +
Sbjct: 353 GPTRKRKRRRVPP------LPEYSSDEDEDDSDEDEVDYEKERKRRREEDKNFLRLKALE 406

Query: 67  SVKEIEESLKERKEK 81
             K     + ER EK
Sbjct: 407 LSKYA--GVNERMEK 419


>gnl|CDD|221177 pfam11708, Slu7, Pre-mRNA splicing Prp18-interacting factor.  The
           spliceosome, an assembly of snRNAs (U1, U2, U4/U6, and
           U5) and proteins, catalyzes the excision of introns from
           pre-mRNAs in two successive trans-esterification
           reactions. Step 2 depends upon integral spliceosome
           constituents such as U5 snRNA and Prp8 and
           non-spliceosomal proteins Prp16, Slu7, Prp18, and Prp22.
           ATP hydrolysis by the DEAH-box enzyme Prp16 promotes a
           conformational change in the spliceosome that leads to
           protection of the 3'ss from targeted RNase H cleavage.
           This change, which probably reflects binding of the 3'ss
           PyAG in the catalytic centre of the spliceosome,
           requires the ordered recruitment of Slu7, Prp18, and
           Prp22 to the spliceosome. There is a close functional
           relationship between Prp8, Prp18, and Slu7, and Prp18
           interacts with Slu7, so that together they recruit Prp22
           to the spliceosome. Most members of the family carry a
           zinc-finger of the CCHC-type upstream of this domain.
          Length = 236

 Score = 29.3 bits (66), Expect = 2.8
 Identities = 18/67 (26%), Positives = 28/67 (41%), Gaps = 9/67 (13%)

Query: 8   AGEKKKRIEKRNKEEEKEENEQ---------EEKEKEDVEEEKNKKKKKKKKKKKKKKKK 58
             E  ++ EK  KE+ K + +Q         E  +K   E    + +   +  +  KKKK
Sbjct: 157 KLELLRKKEKEKKEQLKIQKKQSLLEKYGGEEHLDKPPKELLLGQSEDYVEYDRAGKKKK 216

Query: 59  KKKKKEE 65
            K K EE
Sbjct: 217 AKSKYEE 223



 Score = 27.8 bits (62), Expect = 9.2
 Identities = 15/47 (31%), Positives = 24/47 (51%)

Query: 41  KNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDS 87
           K K+KK++ K +KK+   +K   EE   K  +E L  + E  +  D 
Sbjct: 164 KEKEKKEQLKIQKKQSLLEKYGGEEHLDKPPKELLLGQSEDYVEYDR 210


>gnl|CDD|193580 cd09891, NGN_Bact_1, Bacterial N-Utilization Substance G (NusG)
          N-terminal (NGN) domain, subgroup 1.  The N-Utilization
          Substance G (NusG) protein is involved in transcription
          elongation and termination in bacteria. NusG is
          essential in Escherichia coli and associates with RNA
          polymerase elongation and Rho-termination. Homologs of
          the NusG gene exist in all bacteria. The NusG
          N-terminal domain (NGN) is similar in all NusG
          homologs, but its C-terminal domain and the linker that
          separates these two domains are different. The domain
          organization of NusG suggests that the common
          properties of NusG and its homologs are due to their
          similar NGN domains.
          Length = 107

 Score = 28.2 bits (64), Expect = 2.9
 Identities = 17/41 (41%), Positives = 24/41 (58%), Gaps = 1/41 (2%)

Query: 12 KKRIEKRNKEEEKEENEQE-EKEKEDVEEEKNKKKKKKKKK 51
          K+ +EKR + E  E+   E     E+V E KN KKK K++K
Sbjct: 16 KENLEKRIESEGLEDYIGEVLVPTEEVVEVKNGKKKVKERK 56


>gnl|CDD|180161 PRK05609, nusG, transcription antitermination protein NusG;
          Validated.
          Length = 181

 Score = 29.0 bits (66), Expect = 2.9
 Identities = 15/41 (36%), Positives = 22/41 (53%), Gaps = 1/41 (2%)

Query: 12 KKRIEKRNKEEEKEENEQE-EKEKEDVEEEKNKKKKKKKKK 51
          K+ +E R +    E+   E     E+V E KN KKKK ++K
Sbjct: 22 KENLENRIETLGMEDLIGEVLVPTEEVVEVKNGKKKKVERK 62


>gnl|CDD|235943 PRK07133, PRK07133, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 725

 Score = 29.8 bits (67), Expect = 2.9
 Identities = 17/78 (21%), Positives = 33/78 (42%), Gaps = 3/78 (3%)

Query: 13  KRIEKRNKEEEKEENEQEEKEKE-DVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
           K +     EEE E   + +K +E  ++    K+KK + +   + K   K  +E    K+ 
Sbjct: 351 KLLALSELEEEDENEIKFKKIEENSIDNLDIKEKKIENENDIEGKSDTKNLEEGFETKDN 410

Query: 72  EE--SLKERKEKTLTEDS 87
           +   S    K + +  +S
Sbjct: 411 KNKNSSFINKTENILTNS 428



 Score = 28.6 bits (64), Expect = 6.5
 Identities = 15/63 (23%), Positives = 27/63 (42%), Gaps = 3/63 (4%)

Query: 15  IEKRNKEEEKEENEQEEKEKED---VEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
           I+  + +E+K ENE + + K D   +EE    K  K K      K +          + +
Sbjct: 376 IDNLDIKEKKIENENDIEGKSDTKNLEEGFETKDNKNKNSSFINKTENILTNSPLKDELL 435

Query: 72  EES 74
           E++
Sbjct: 436 EKT 438


>gnl|CDD|148790 pfam07376, Prosystemin, Prosystemin.  This family consists of
          several plant specific prosystemin proteins.
          Prosystemin is the precursor protein of the 18 amino
          acid wound signal systemin which activates systemic
          defence in plant leaves against insect herbivores.
          Length = 193

 Score = 28.9 bits (63), Expect = 2.9
 Identities = 19/91 (20%), Positives = 47/91 (51%), Gaps = 4/91 (4%)

Query: 3  KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKK----KKKK 58
          K +G   ++K ++E     +EKE+  ++E   +D++ + +  ++  K + ++    K+K 
Sbjct: 9  KNKGDDVQEKTKLEHEKGGDEKEKIIEKETPSQDIKNKDDDAQEIPKVEHEEGGDEKEKI 68

Query: 59 KKKKKEEQSVKEIEESLKERKEKTLTEDSQC 89
           +K+   + + +IE    + K K   E+ +C
Sbjct: 69 IEKETISKCIIKIEGDDAQEKIKVEYEEEEC 99



 Score = 27.3 bits (59), Expect = 9.7
 Identities = 21/87 (24%), Positives = 43/87 (49%), Gaps = 6/87 (6%)

Query: 3   KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVE------EEKNKKKKKKKKKKKKKK 56
           K  G   ++K ++E   +E EKE+  ++E   +D+       +EK K + ++   +K+  
Sbjct: 80  KIEGDDAQEKIKVEYEEEECEKEKIVEKETPSQDIGNKGDDAQEKPKVEHEEDGDEKETP 139

Query: 57  KKKKKKKEEQSVKEIEESLKERKEKTL 83
            +   K E +  +EI +   E +EK +
Sbjct: 140 SQDIIKIEGEDAQEIPKVECEEREKIV 166


>gnl|CDD|223831 COG0760, SurA, Parvulin-like peptidyl-prolyl isomerase
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 320

 Score = 29.3 bits (65), Expect = 3.0
 Identities = 9/53 (16%), Positives = 18/53 (33%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
              K      K  ++   +QE  +K  +E+    ++K   K  K+        
Sbjct: 263 RDAKLDFPEVKAVKEISLKQELLQKAALEKLLKGEQKSFDKLLKEAAIDILSA 315


>gnl|CDD|225048 COG2137, OraA, Uncharacterized protein conserved in bacteria
           [General function prediction only].
          Length = 174

 Score = 28.8 bits (65), Expect = 3.0
 Identities = 17/64 (26%), Positives = 31/64 (48%), Gaps = 3/64 (4%)

Query: 2   RKRRGGAGEKK---KRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKK 58
            + R G G  +   +  +K   +E  EE  +   E+++ E  +   +KK K++ K   KK
Sbjct: 79  SRSRKGKGPARLKQELKQKGIDDEIIEEALELIDEEDEQERARKVLRKKFKRENKPPDKK 138

Query: 59  KKKK 62
           +K K
Sbjct: 139 EKAK 142


>gnl|CDD|215822 pfam00257, Dehydrin, Dehydrin. 
          Length = 137

 Score = 28.6 bits (64), Expect = 3.0
 Identities = 14/58 (24%), Positives = 22/58 (37%), Gaps = 5/58 (8%)

Query: 6   GGAGEKKKRIEKRNKE-----EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKK 58
           G  GE+KK I+++ KE        +                    ++KK K  K K+K
Sbjct: 73  GQGGERKKGIKEKIKEKLPGGHGGKGQATTTTTTGTGGGYGQHGHEEKKGKMDKIKEK 130


>gnl|CDD|233223 TIGR00990, 3a0801s09, mitochondrial precursor proteins import
          receptor (72 kDa mitochondrial outermembrane protein)
          (mitochondrial import receptor for the ADP/ATP carrier)
          (translocase of outermembrane tom70).  [Transport and
          binding proteins, Amino acids, peptides and amines].
          Length = 615

 Score = 29.6 bits (66), Expect = 3.0
 Identities = 12/41 (29%), Positives = 26/41 (63%)

Query: 42 NKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKT 82
           ++K +++++ K K  KK+++K +Q+ KE E   +E+K   
Sbjct: 55 YQRKGQQQRESKPKISKKERRKRKQAEKETEGKTEEKKSTA 95


>gnl|CDD|234402 TIGR03928, T7_EssCb_Firm, type VII secretion protein EssC,
           C-terminal domain.  This model describes the C-terminal
           domain, or longer subunit, of the Firmicutes type VII
           secretion protein EssC. This protein (homologous to EccC
           in Actinobacteria) and the WXG100 target proteins are
           the only homologous parts of type VII secretion between
           Firmicutes and Actinobacteria [Protein fate, Protein and
           peptide secretion and trafficking].
          Length = 1296

 Score = 29.6 bits (67), Expect = 3.1
 Identities = 9/36 (25%), Positives = 17/36 (47%)

Query: 46  KKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
           ++KKK KK  +K+ +  +     K  E      K++
Sbjct: 84  REKKKYKKDVEKRNRSYRLYLDKKRKELQALSEKQR 119



 Score = 29.6 bits (67), Expect = 3.2
 Identities = 11/37 (29%), Positives = 21/37 (56%), Gaps = 1/37 (2%)

Query: 44  KKKKKKKKKKKKKKKKKKKKEEQSVKEI-EESLKERK 79
           ++KKK KK  +K+ +  +   ++  KE+   S K+R 
Sbjct: 84  REKKKYKKDVEKRNRSYRLYLDKKRKELQALSEKQRH 120


>gnl|CDD|216985 pfam02349, MSG, Major surface glycoprotein.  This is a novel
          repeat in Pneumocystis carinii Major surface
          glycoprotein (MSG) some members of the alignment have
          up to nine repeats of this family, the repeats
          containing several conserved cysteines. The MSG of P.
          carinii is an important protein in host-pathogen
          interactions. Surface glycoprotein A from Pneumocystis
          carinii is a main target for the host immune system,
          this protein is implicated in the attachment of
          Pneumocystis carinii to the host alveolar epithelial
          cells, alveolar macrophages, host surfactant and
          possibly accounts in part for the hypoxia seen in
          Pneumocystis carinii pneumonia (PCP).
          Length = 81

 Score = 27.4 bits (61), Expect = 3.1
 Identities = 21/63 (33%), Positives = 36/63 (57%)

Query: 18 RNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKE 77
          ++ + E++  E EE  +   E     K+K +K K+K  K KK  K EE  +KE++  LK+
Sbjct: 1  KDFKLEEKCKELEEYCEFLEEACPLLKEKCEKLKEKCYKLKKDLKLEELLLKELKGELKK 60

Query: 78 RKE 80
          +K+
Sbjct: 61 KKK 63


>gnl|CDD|129694 TIGR00606, rad50, rad50.  All proteins in this family for which
           functions are known are involvedin recombination,
           recombinational repair, and/or non-homologous end
           joining.They are components of an exonuclease complex
           with MRE11 homologs. This family is distantly related to
           the SbcC family of bacterial proteins.This family is
           based on the phylogenomic analysis of JA Eisen (1999,
           Ph.D. Thesis, Stanford University).
          Length = 1311

 Score = 29.6 bits (66), Expect = 3.1
 Identities = 13/71 (18%), Positives = 37/71 (52%)

Query: 8   AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
             + ++++ + + E +    E ++ +++D   E   +K +++K++    K+   KK +  
Sbjct: 883 RQQFEEQLVELSTEVQSLIREIKDAKEQDSPLETFLEKDQQEKEELISSKETSNKKAQDK 942

Query: 68  VKEIEESLKER 78
           V +I+E +K  
Sbjct: 943 VNDIKEKVKNI 953


>gnl|CDD|185507 PTZ00191, PTZ00191, 60S ribosomal protein L23a; Provisional.
          Length = 145

 Score = 28.5 bits (64), Expect = 3.1
 Identities = 13/23 (56%), Positives = 14/23 (60%)

Query: 41 KNKKKKKKKKKKKKKKKKKKKKK 63
          K KKK KK  K  KK  K KK+K
Sbjct: 6  KAKKKAKKAAKAAKKGVKVKKRK 28



 Score = 28.5 bits (64), Expect = 3.5
 Identities = 12/24 (50%), Positives = 13/24 (54%)

Query: 40 EKNKKKKKKKKKKKKKKKKKKKKK 63
           K  KKK KK  K  KK  K KK+
Sbjct: 4  AKKAKKKAKKAAKAAKKGVKVKKR 27



 Score = 27.0 bits (60), Expect = 8.9
 Identities = 15/44 (34%), Positives = 24/44 (54%), Gaps = 3/44 (6%)

Query: 40 EKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTL 83
          +   KK KKK KK  K  KK  K ++   +++  S++  + KTL
Sbjct: 1  KAPAKKAKKKAKKAAKAAKKGVKVKK---RKVRTSVRFFRPKTL 41


>gnl|CDD|240272 PTZ00109, PTZ00109, DNA gyrase subunit b; Provisional.
          Length = 903

 Score = 29.5 bits (66), Expect = 3.2
 Identities = 14/44 (31%), Positives = 22/44 (50%)

Query: 24  KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
           K ++   +  + D  E+KN  K  KK K    K  K+ K++E S
Sbjct: 197 KFQDTFPKNSRSDKSEDKNDTKSSKKGKSSHVKGPKEAKEKESS 240


>gnl|CDD|135898 PRK06397, PRK06397, V-type ATP synthase subunit H; Validated.
          Length = 111

 Score = 28.0 bits (62), Expect = 3.2
 Identities = 16/58 (27%), Positives = 33/58 (56%)

Query: 24 KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
          KE+ E  +KE  +++ E+  + K+ K K ++K KK +++        + E+ KE ++K
Sbjct: 16 KEKEESIDKEIANIKNEQENEIKEAKSKYEEKAKKTEEESLNMYNAALMEARKEAEKK 73



 Score = 26.4 bits (58), Expect = 9.9
 Identities = 17/74 (22%), Positives = 30/74 (40%)

Query: 17 KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLK 76
           +     K E E E KE +   EEK KK +++          + +K+ E+   EI    K
Sbjct: 23 DKEIANIKNEQENEIKEAKSKYEEKAKKTEEESLNMYNAALMEARKEAEKKAVEIINKAK 82

Query: 77 ERKEKTLTEDSQCD 90
          +       + S+ +
Sbjct: 83 QEASLIKLKISEDE 96


>gnl|CDD|217787 pfam03910, Adeno_PV, Adenovirus minor core protein PV. 
          Length = 336

 Score = 29.4 bits (66), Expect = 3.2
 Identities = 10/32 (31%), Positives = 22/32 (68%)

Query: 41 KNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIE 72
          +  K + K +K K+ KK+KK++++++   E+E
Sbjct: 22 RPVKDEAKPRKIKRVKKRKKREEKDELDDEVE 53



 Score = 28.6 bits (64), Expect = 5.8
 Identities = 11/38 (28%), Positives = 23/38 (60%)

Query: 41 KNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKER 78
          K++ K +K K+ KK+KK+++K + +  V+ +      R
Sbjct: 25 KDEAKPRKIKRVKKRKKREEKDELDDEVEFVRSFAPRR 62



 Score = 28.6 bits (64), Expect = 5.9
 Identities = 9/39 (23%), Positives = 21/39 (53%)

Query: 35 EDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
          + +  E    +  K + K +K K+ KK+K+ +   E+++
Sbjct: 12 QILAPEIYGPRPVKDEAKPRKIKRVKKRKKREEKDELDD 50


>gnl|CDD|217483 pfam03305, Lipoprotein_X, Mycoplasma MG185/MG260 protein.  Most of
           the aligned regions in this family are found towards the
           middle of the member proteins.
          Length = 237

 Score = 28.9 bits (65), Expect = 3.2
 Identities = 10/75 (13%), Positives = 27/75 (36%), Gaps = 6/75 (8%)

Query: 13  KRIEKRNKEEEKEENEQEEKEKED------VEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
           K ++   K  +K         K +         +KN     + K  K+  K +  + + +
Sbjct: 153 KDLKAPTKNNQKGGALGSVTSKGNANLLGKFTLDKNSLTSPEVKAFKEADKSEDPQTDSK 212

Query: 67  SVKEIEESLKERKEK 81
           +++  ++ +   K  
Sbjct: 213 TIELYKDKIPTDKNA 227


>gnl|CDD|218657 pfam05615, THOC7, Tho complex subunit 7.  The Tho complex is
           involved in transcription elongation and mRNA export
           from the nucleus.
          Length = 132

 Score = 28.4 bits (64), Expect = 3.2
 Identities = 19/66 (28%), Positives = 37/66 (56%), Gaps = 1/66 (1%)

Query: 18  RNKEEEKEENEQEEKEKE-DVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLK 76
              E E+E  EQE++E E  +E  K++ ++ KK+ ++ K+ +K +K+ ++  K+I     
Sbjct: 63  EMNERERENYEQEKEEIETTIENAKDEIEELKKELEEAKRIRKNRKEYDELAKKITSLPV 122

Query: 77  ERKEKT 82
            R E  
Sbjct: 123 TRAETE 128


>gnl|CDD|148701 pfam07246, Phlebovirus_NSM, Phlebovirus nonstructural protein NS-M.
            This family consists of several Phlebovirus
           nonstructural NS-M proteins which represent the
           N-terminal region of the M polyprotein precursor. The
           function of this family is unknown.
          Length = 264

 Score = 29.1 bits (64), Expect = 3.3
 Identities = 21/97 (21%), Positives = 49/97 (50%), Gaps = 3/97 (3%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
            K +  A EK   I+++ KE E++   Q    +  ++++KN+ +K +   K+ ++ +  K
Sbjct: 144 EKLKKNAEEKDATIQRKTKEMEEDSRNQIAHHE--IQQKKNEIQKLRNDLKRGQEHRDAK 201

Query: 62  KKEEQSVKEIEESLKERKEKTLTEDSQCDTMIPVFKK 98
            + E+S+  ++  LKE  ++   E     + +   +K
Sbjct: 202 LRVEESMARVKG-LKEELKQLTDELQHAKSEMQSLRK 237


>gnl|CDD|221805 pfam12848, ABC_tran_2, ABC transporter.  This domain is related
          to pfam00005.
          Length = 85

 Score = 27.5 bits (62), Expect = 3.3
 Identities = 14/67 (20%), Positives = 36/67 (53%), Gaps = 4/67 (5%)

Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEE--EKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
          +++ E+  ++E+  E +Q+E +K   EE  ++ + K  K K+ + + K  +K +  +   
Sbjct: 20 EQKEERLEQQEKAYEKQQKEIKKL--EEFIDRFRAKASKAKQAQSRIKALEKMERIEKPV 77

Query: 70 EIEESLK 76
          E +  ++
Sbjct: 78 EKKPKIR 84


>gnl|CDD|218078 pfam04419, 4F5, 4F5 protein family.  Members of this family are
          short proteins that are rich in aspartate, glutamate,
          lysine and arginine. Although the function of these
          proteins is unknown, they are found to be ubiquitously
          expressed.
          Length = 38

 Score = 26.0 bits (58), Expect = 3.3
 Identities = 12/24 (50%), Positives = 14/24 (58%)

Query: 40 EKNKKKKKKKKKKKKKKKKKKKKK 63
          E+N KK  KKKKK  K + K   K
Sbjct: 12 ERNAKKAAKKKKKGAKSQLKAAAK 35


>gnl|CDD|172341 PRK13808, PRK13808, adenylate kinase; Provisional.
          Length = 333

 Score = 29.1 bits (65), Expect = 3.3
 Identities = 17/79 (21%), Positives = 34/79 (43%), Gaps = 4/79 (5%)

Query: 4   RRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
           ++       K   K+   + K   ++  K+K      K     KK  K   K  KK KK 
Sbjct: 200 KKAAKTPAAKSGAKKASAKAKSAAKKVSKKKAA----KTAVSAKKAAKTAAKAAKKAKKT 255

Query: 64  EEQSVKEIEESLKERKEKT 82
            ++++K+  +++K+  +K 
Sbjct: 256 AKKALKKAAKAVKKAAKKA 274



 Score = 29.1 bits (65), Expect = 3.7
 Identities = 20/96 (20%), Positives = 33/96 (34%), Gaps = 3/96 (3%)

Query: 8   AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
           A +     +K  K   K   + ++  K+ +   K   K  KK  KK  K   K  K    
Sbjct: 231 AAKTAVSAKKAAKTAAKAAKKAKKTAKKAL---KKAAKAVKKAAKKAAKAAAKAAKGAAK 287

Query: 68  VKEIEESLKERKEKTLTEDSQCDTMIPVFKKGVLYK 103
             + +   K++  K     S+        K+G   K
Sbjct: 288 ATKGKAKAKKKAGKKAAAGSKAKATAKAPKRGAKGK 323



 Score = 28.3 bits (63), Expect = 6.3
 Identities = 14/86 (16%), Positives = 31/86 (36%)

Query: 3   KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
           K+     +   +   + K  +   + ++  +      +K KK  KK  KK  K  KK  K
Sbjct: 213 KKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAVKKAAK 272

Query: 63  KEEQSVKEIEESLKERKEKTLTEDSQ 88
           K  ++  +  +   +  +       +
Sbjct: 273 KAAKAAAKAAKGAAKATKGKAKAKKK 298



 Score = 27.9 bits (62), Expect = 9.0
 Identities = 19/80 (23%), Positives = 32/80 (40%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
           +K    A    K++ K+   +     ++  K      ++  K  KK  KK  K  KK  K
Sbjct: 213 KKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAVKKAAK 272

Query: 62  KKEEQSVKEIEESLKERKEK 81
           K  + + K  + + K  K K
Sbjct: 273 KAAKAAAKAAKGAAKATKGK 292


>gnl|CDD|219289 pfam07083, DUF1351, Protein of unknown function (DUF1351).  This
           family consists of several bacterial and phage proteins
           of around 230 residues in length. The function of this
           family is unknown.
          Length = 220

 Score = 28.9 bits (65), Expect = 3.3
 Identities = 19/80 (23%), Positives = 33/80 (41%), Gaps = 8/80 (10%)

Query: 10  EKKKRIEKRNKEEEKEEN--------EQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
              K  E++ KE  ++          E+   +K D EE+ +   KK K   K    KK +
Sbjct: 96  SGIKEFEEKQKELRRDTVKEYLSEKAEEYGLDKRDFEEKYDDYVKKSKWLNKSVSLKKIE 155

Query: 62  KKEEQSVKEIEESLKERKEK 81
           +  +  +    + L+ERK  
Sbjct: 156 ESIDTLILSEAQKLEERKAA 175


>gnl|CDD|219106 pfam06614, Neuromodulin, Neuromodulin.  This family consists of
           several neuromodulin (Axonal membrane protein GAP-43)
           sequences and is found in conjunction with pfam00612.
           GAP-43 is a neuronal calmodulin-binding phosphoprotein
           that is concentrated in growth cones and pre-synaptic
           terminals.
          Length = 174

 Score = 28.7 bits (63), Expect = 3.3
 Identities = 13/38 (34%), Positives = 26/38 (68%)

Query: 20  KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK 57
            E+E  E+ QEE++K+ VEE K  +  ++++ K+++ K
Sbjct: 131 PEQETAESSQEEEKKDAVEETKPSESAQQEEAKEEEAK 168


>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
           potential nuclease [General function prediction only].
          Length = 290

 Score = 29.1 bits (65), Expect = 3.4
 Identities = 19/86 (22%), Positives = 45/86 (52%), Gaps = 2/86 (2%)

Query: 1   MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEE-KNKKKKKKKKKKKKKKKKK 59
           +++      EK + ++K  KEE  +E E+ E E E+V+E  K  + +  + ++  KK   
Sbjct: 133 LKEDYEELKEKLEELQKE-KEELLKELEELEAEYEEVQERLKRLEVENSRLEEMLKKLPG 191

Query: 60  KKKKEEQSVKEIEESLKERKEKTLTE 85
           +    ++   E+E  ++  +E+ +++
Sbjct: 192 EVYDLKKRWDELEPGVELPEEELISD 217


>gnl|CDD|212057 cd11488, SLC5sbd_SGLT4, Na(+)/glucose cotransporter SGLT4 and
           related proteins; solute-binding domain.  Human SGLT4
           (hSGLT4) has been reported to be a low-affinity glucose
           transporter with unusual sugar selectivity: it
           transports D-mannose but not galactose or
           3-O-methyl-D-glucoside. It is encoded by the SLC5A9 gene
           and is expressed in intestine, kidney, liver, brain,
           lung, trachea, uterus, and pancreas. hSLGT4 is predicted
           to contain 14 membrane-spanning regions. This subgroup
           belongs to the solute carrier 5 (SLC5 )transporter
           family.
          Length = 605

 Score = 29.5 bits (66), Expect = 3.4
 Identities = 17/70 (24%), Positives = 35/70 (50%), Gaps = 6/70 (8%)

Query: 91  TMIPVFKKGVLYKGIYCP---SLTNSFREPSLEMS---YQRYSHRQRQKSLIIVNLVDMV 144
            ++PV  +G++   I      SLT+ F   S   +   +QR      ++ L++V  V ++
Sbjct: 340 ELMPVGLRGLMIAVIMAALMSSLTSIFNSSSTLFTIDVWQRIRRNASEQELMVVGRVFIL 399

Query: 145 LKAVLTVAWI 154
           +  V+++ WI
Sbjct: 400 ILVVISILWI 409


>gnl|CDD|227602 COG5277, COG5277, Actin and related proteins [Cytoskeleton].
          Length = 444

 Score = 29.3 bits (66), Expect = 3.5
 Identities = 19/76 (25%), Positives = 32/76 (42%), Gaps = 7/76 (9%)

Query: 11  KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKK-------KKKKK 63
               I    KEE  E +++      D EEE  ++++K  +K  +   +        K+ K
Sbjct: 220 YSSEIVNEIKEEVCETDDESAYVSLDAEEEFEEEEEKPAEKSTESTFQLSKETSIAKESK 279

Query: 64  EEQSVKEIEESLKERK 79
           E    +EIE   +ER 
Sbjct: 280 ELPDGEEIEFGNEERF 295


>gnl|CDD|218332 pfam04929, Herpes_DNAp_acc, Herpes DNA replication accessory
           factor.  Replicative DNA polymerases are capable of
           polymerising tens of thousands of nucleotides without
           dissociating from their DNA templates. The high
           processivity of these polymerases is dependent upon
           accessory proteins that bind to the catalytic subunit of
           the polymerase or to the substrate. The Epstein-Barr
           virus (EBV) BMRF1 protein is an essential component of
           the viral DNA polymerase and is absolutely required for
           lytic virus replication. BMRF1 is also a transactivator.
           This family is predicted to have a UL42 like structure.
          Length = 381

 Score = 29.3 bits (66), Expect = 3.5
 Identities = 10/34 (29%), Positives = 17/34 (50%)

Query: 27  NEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
           +      +  +     +K++  KK+KKKK KK K
Sbjct: 341 HSPPAAFERPLALSPKRKREGDKKQKKKKSKKLK 374



 Score = 28.9 bits (65), Expect = 4.1
 Identities = 12/45 (26%), Positives = 18/45 (40%), Gaps = 4/45 (8%)

Query: 22  EEKEENEQEEKEKEDVEEEKNK----KKKKKKKKKKKKKKKKKKK 62
             +               E+      K+K++  KK+KKKK KK K
Sbjct: 330 STETPPPASLSHSPPAAFERPLALSPKRKREGDKKQKKKKSKKLK 374


>gnl|CDD|219419 pfam07462, MSP1_C, Merozoite surface protein 1 (MSP1) C-terminus.
           This family represents the C-terminal region of
           merozoite surface protein 1 (MSP1) which are found in a
           number of Plasmodium species. MSP-1 is a 200-kDa protein
           expressed on the surface of the P. vivax merozoite.
           MSP-1 of Plasmodium species is synthesised as a
           high-molecular-weight precursor and then processed into
           several fragments. At the time of red cell invasion by
           the merozoite, only the 19-kDa C-terminal fragment
           (MSP-119), which contains two epidermal growth
           factor-like domains, remains on the surface. Antibodies
           against MSP-119 inhibit merozoite entry into red cells,
           and immunisation with MSP-119 protects monkeys from
           challenging infections. Hence, MSP-119 is considered a
           promising vaccine candidate.
          Length = 574

 Score = 29.5 bits (66), Expect = 3.5
 Identities = 17/71 (23%), Positives = 34/71 (47%), Gaps = 1/71 (1%)

Query: 11  KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
           KK+ I    + +      +    KE ++   ++  K K K ++   KK++  + +  +K+
Sbjct: 44  KKELITALTETKVNALYLEIAHLKELLQHSYDRYYKYKLKLERLYNKKEQIGQSKMQIKK 103

Query: 71  IEESLKERKEK 81
           +   LKER EK
Sbjct: 104 L-TLLKERLEK 113


>gnl|CDD|235841 PRK06606, PRK06606, branched-chain amino acid aminotransferase;
           Validated.
          Length = 306

 Score = 29.0 bits (66), Expect = 3.6
 Identities = 10/29 (34%), Positives = 17/29 (58%), Gaps = 9/29 (31%)

Query: 230 ILLNTQGFISQGIGLNNKEYLIWYILFIV 258
           +LL+ +G++S+G G N         +FIV
Sbjct: 183 LLLDVEGYVSEGSGEN---------IFIV 202


>gnl|CDD|223327 COG0249, MutS, Mismatch repair ATPase (MutS family) [DNA
           replication, recombination, and repair].
          Length = 843

 Score = 29.6 bits (67), Expect = 3.6
 Identities = 20/89 (22%), Positives = 33/89 (37%), Gaps = 15/89 (16%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKK----KKKKKKKKKKKKKEEQSVK-- 69
              N +E   + E EE+E+  ++  K K  K         K   K         Q++K  
Sbjct: 439 LLNNAKEWIAKLELEERERTGIKSLKIKYNKVYGYYIEVTKSNAKLVPDDYIRRQTLKNA 498

Query: 70  ---------EIEESLKERKEKTLTEDSQC 89
                    E+EE L + +EK L  + + 
Sbjct: 499 ERFTTPELKELEEKLLDAEEKILALEYEL 527


>gnl|CDD|217010 pfam02387, IncFII_repA, IncFII RepA protein family.  This protein
           is plasmid encoded and found to be essential for plasmid
           replication.
          Length = 279

 Score = 28.9 bits (65), Expect = 3.6
 Identities = 15/62 (24%), Positives = 27/62 (43%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E KK  +K       +E  +  KE         + ++K   KK+++ +K  K  E+   K
Sbjct: 179 ENKKLKKKGLIPLTLDEARRRAKEFHIERAFSYRTERKAFGKKRRRARKLAKLDEKDIRK 238

Query: 70  EI 71
           +I
Sbjct: 239 KI 240


>gnl|CDD|216919 pfam02181, FH2, Formin Homology 2 Domain. 
          Length = 372

 Score = 29.2 bits (66), Expect = 3.6
 Identities = 19/42 (45%), Positives = 26/42 (61%)

Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
          +EE  E + +  E E++   K KKKK KK +KK   KKKKK+
Sbjct: 33 DEESFEKDLDLSELEELFSAKAKKKKSKKSEKKSSSKKKKKE 74


>gnl|CDD|223839 COG0768, FtsI, Cell division protein FtsI/penicillin-binding
           protein 2 [Cell envelope biogenesis, outer membrane].
          Length = 599

 Score = 29.3 bits (66), Expect = 3.6
 Identities = 13/77 (16%), Positives = 28/77 (36%), Gaps = 3/77 (3%)

Query: 15  IEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE---EQSVKEI 71
             K   E  +   ++   +   + E+ +K+   K+ K K +      +K     +    I
Sbjct: 88  PRKITDEPLEATKDELAAKLLALLEDLDKRALLKELKSKSRLFTPLARKTNLTPEQAARI 147

Query: 72  EESLKERKEKTLTEDSQ 88
            E L +     +  DS+
Sbjct: 148 AELLLKLPGVDVETDSR 164


>gnl|CDD|220223 pfam09405, Btz, CASC3/Barentsz eIF4AIII binding.  This domain is
          found on CASC3 (cancer susceptibility candidate gene 3
          protein) which is also known as Barentsz (Btz). CASC3
          is a component of the EJC (exon junction complex) which
          is a complex that is involved in post-transcriptional
          regulation of mRNA in metazoa. The complex is formed by
          the association of four proteins (eIF4AIII, Barentsz,
          Mago, and Y14), mRNA, and ATP. This domain wraps around
          eIF4AIII and stacks against the 5' nucleotide.
          Length = 116

 Score = 27.8 bits (62), Expect = 3.7
 Identities = 7/39 (17%), Positives = 19/39 (48%)

Query: 19 NKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK 57
            E E++       E  + +E+K +K+++ +K   +  +
Sbjct: 1  KVESERQSGRTPSAEPTEPKEDKERKRREHEKYDDEDDE 39


>gnl|CDD|181988 PRK09609, PRK09609, hypothetical protein; Provisional.
          Length = 312

 Score = 29.2 bits (66), Expect = 3.7
 Identities = 20/65 (30%), Positives = 33/65 (50%), Gaps = 5/65 (7%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKK-----KKKKKKKKKKEEQ 66
           KK   K ++ +  +    ++KE+ D   E    +K +K K+K     KKKKK +K  EE+
Sbjct: 105 KKFFGKESRIKRYDNKIFKQKEQYDFALENPNSEKIQKIKQKIILLEKKKKKLEKTNEEK 164

Query: 67  SVKEI 71
           S+   
Sbjct: 165 SLLNF 169


>gnl|CDD|214659 smart00433, TOP2c, TopoisomeraseII.  Eukaryotic DNA topoisomerase
           II, GyrB, ParE.
          Length = 594

 Score = 29.4 bits (67), Expect = 3.7
 Identities = 13/55 (23%), Positives = 22/55 (40%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
           E +  +EK   E      E+   E   + E+     K +   KK ++  +KKK  
Sbjct: 309 EVRFGVEKIVSECLLSFLEENPVEASKIVEKVLLAAKARAAAKKARELTRKKKLS 363


>gnl|CDD|192292 pfam09429, Wbp11, WW domain binding protein 11.  The WW domain is
          a small protein module with a triple-stranded
          beta-sheet fold. This is a family of WW domain binding
          proteins.
          Length = 78

 Score = 27.2 bits (61), Expect = 3.8
 Identities = 12/50 (24%), Positives = 26/50 (52%)

Query: 39 EEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQ 88
          E+K K+ KK K +++ +++ K  KK    ++   + LK ++E       +
Sbjct: 13 EQKKKELKKNKAERQARREAKLAKKNPDRLQRELDKLKAQEESGKLRPVE 62


>gnl|CDD|217443 pfam03234, CDC37_N, Cdc37 N terminal kinase binding.  Cdc37 is a
           molecular chaperone required for the activity of
           numerous eukaryotic protein kinases. This domain
           corresponds to the N terminal domain which binds
           predominantly to protein kinases and is found N terminal
           to the Hsp (Heat shocked protein) 90-binding domain
           pfam08565. Expression of a construct consisting of only
           the N-terminal domain of Saccharomyces pombe Cdc37
           results in cellular viability. This indicates that
           interactions with the cochaperone Hsp90 may not be
           essential for Cdc37 function.
          Length = 172

 Score = 28.6 bits (64), Expect = 3.8
 Identities = 21/72 (29%), Positives = 42/72 (58%), Gaps = 1/72 (1%)

Query: 16  EKRNKEEEKEE-NEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES 74
           +K N E E+   NE  E   + V++E ++K      ++ +K + K KK++++ +K+++E 
Sbjct: 93  DKENVEPEQPTYNEMVEDLFDQVKDEVDEKNGAALIEELQKHRDKLKKEQKELLKKLDEL 152

Query: 75  LKERKEKTLTED 86
            KE K+K  + D
Sbjct: 153 EKEEKKKIWSVD 164


>gnl|CDD|227570 COG5245, DYN1, Dynein, heavy chain [Cytoskeleton].
          Length = 3164

 Score = 29.6 bits (66), Expect = 3.8
 Identities = 16/85 (18%), Positives = 32/85 (37%)

Query: 1    MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
            +       G K+K  +        E  E+E K              +++ +K+K    K 
Sbjct: 2118 LSNTVVILGVKEKNADDALSGTPGERLEREVKSVFVEAPRDMLFLLEEEVRKRKGSVMKF 2177

Query: 61   KKKEEQSVKEIEESLKERKEKTLTE 85
            K  ++ +V E    + + K+ +L E
Sbjct: 2178 KSSKKPAVLEAVLFVYKIKKASLRE 2202


>gnl|CDD|232945 TIGR00376, TIGR00376, DNA helicase, putative.  The gene product may
           represent a DNA helicase. Eukaryotic members of this
           family have been characterized as binding certain
           single-stranded G-rich DNA sequences (GGGGT and GGGCT).
           A number of related proteins are characterized as
           helicases [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 637

 Score = 29.4 bits (66), Expect = 3.8
 Identities = 16/94 (17%), Positives = 43/94 (45%), Gaps = 1/94 (1%)

Query: 14  RIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
           R+ K NK+   +   +   + + V + + K  +  +++ KK K   +K++    +K + +
Sbjct: 236 RLLKSNKQHSLDYLIENHPKYQIVADIREKIDELIEERNKKLKPSPQKRRGLSDIKILRK 295

Query: 74  SLKERKEKTLTEDSQCDTMIPVFKKGVLYKGIYC 107
           +LK+R+ + + E  +  +M    +       +  
Sbjct: 296 ALKKREARGI-ESLKIASMAEWIETNKSIDRLLK 328


>gnl|CDD|215329 PLN02610, PLN02610, probable methionyl-tRNA synthetase.
          Length = 801

 Score = 29.4 bits (66), Expect = 3.8
 Identities = 11/49 (22%), Positives = 22/49 (44%)

Query: 23  EKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
           ++    +  + K+  ++ K K      KKK+ KK     K +  + +EI
Sbjct: 590 DRAARAEAAEAKKLAKQLKKKALSDGGKKKQGKKAGGGGKSKAAAEREI 638



 Score = 29.4 bits (66), Expect = 3.8
 Identities = 11/48 (22%), Positives = 20/48 (41%)

Query: 25  EENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIE 72
           +   + E  +     ++ KKK      KKK+ KK     + ++  E E
Sbjct: 590 DRAARAEAAEAKKLAKQLKKKALSDGGKKKQGKKAGGGGKSKAAAERE 637



 Score = 29.0 bits (65), Expect = 4.3
 Identities = 17/65 (26%), Positives = 26/65 (40%), Gaps = 5/65 (7%)

Query: 4   RRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
           R   AG +  R  +    E K+  +Q +K+        +  KKK+ KK     K K   +
Sbjct: 581 REKFAGSQADRAARAEAAEAKKLAKQLKKKAL-----SDGGKKKQGKKAGGGGKSKAAAE 635

Query: 64  EEQSV 68
            E  V
Sbjct: 636 REIDV 640



 Score = 28.2 bits (63), Expect = 7.6
 Identities = 15/51 (29%), Positives = 23/51 (45%), Gaps = 7/51 (13%)

Query: 20  KEEEKEE-------NEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
           K+EE E        ++ +   + +  E K   K+ KKK      KKK+ KK
Sbjct: 573 KDEEVEAYREKFAGSQADRAARAEAAEAKKLAKQLKKKALSDGGKKKQGKK 623


>gnl|CDD|177973 PLN02339, PLN02339, NAD+ synthase (glutamine-hydrolysing).
          Length = 700

 Score = 29.3 bits (66), Expect = 3.9
 Identities = 12/30 (40%), Positives = 14/30 (46%)

Query: 73  ESLKERKEKTLTEDSQCDTMIPVFKKGVLY 102
           E L E     LT+   CD  +PV   GV Y
Sbjct: 67  ECLAEILVGDLTDGILCDIGMPVIHGGVRY 96


>gnl|CDD|234533 TIGR04285, nucleoid_noc, nucleoid occlusion protein.  This model
           describes nucleoid occlusion protein, a close homolog to
           ParB chromosome partitioning proteins including Spo0J in
           Bacillus subtilis. Its gene often is located near the
           gene for the Spo0J ortholog. This protein bind a
           specific DNA sequence and blocks cytokinesis from
           happening until chromosome segregation is complete.
          Length = 255

 Score = 28.6 bits (65), Expect = 3.9
 Identities = 17/45 (37%), Positives = 26/45 (57%), Gaps = 9/45 (20%)

Query: 39  EEKNKKKKKKKKKKKKKKKKKKKKKE---------EQSVKEIEES 74
           EE  KK  +K +K KKKKK++K   +         +QSVK I+++
Sbjct: 188 EELIKKLLEKPEKPKKKKKRRKGFSKDVRIAVNTIKQSVKMIKKT 232


>gnl|CDD|216597 pfam01601, Corona_S2, Coronavirus S2 glycoprotein.  The coronavirus
           spike glycoprotein forms the characteristic 'corona'
           after which the group is named. The Spike glycoprotein
           is translated as a large polypeptide that is
           subsequently cleaved to S1 pfam01600 and S2.
          Length = 609

 Score = 29.2 bits (66), Expect = 4.0
 Identities = 14/43 (32%), Positives = 17/43 (39%), Gaps = 9/43 (20%)

Query: 244 LNNKEYLI---WYI-LFIVFVTYAMLPLPLKWCFI----GGCT 278
           LN  E  I   WY+ L I     A + L L +C       GC 
Sbjct: 536 LNRYETYIKWPWYVWLAIGAALIAFVLL-LLFCCCMTGCCGCC 577


>gnl|CDD|234032 TIGR02855, spore_yabG, sporulation peptidase YabG.  Members of
          this family are the protein YabG, demonstrated for
          Bacillus subtilis to be an endopeptidase able to
          release N-terminal peptides from a number of
          sporulation proteins, including CotT, CotF, and SpoIVA.
          It appears to be expressed under control of sigma-K
          [Cellular processes, Sporulation and germination].
          Length = 283

 Score = 28.9 bits (65), Expect = 4.0
 Identities = 11/36 (30%), Positives = 21/36 (58%)

Query: 46 KKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
          +   + ++KK+KK+ K++ E S +   +  K  KEK
Sbjct: 50 ELVDESERKKRKKRFKERMETSYRLFRQDYKLMKEK 85


>gnl|CDD|218517 pfam05236, TAF4, Transcription initiation factor TFIID component
           TAF4 family.  This region of similarity is found in
           Transcription initiation factor TFIID component TAF4.
          Length = 255

 Score = 28.9 bits (65), Expect = 4.0
 Identities = 12/40 (30%), Positives = 24/40 (60%), Gaps = 1/40 (2%)

Query: 13  KRIEKRNKEE-EKEENEQEEKEKEDVEEEKNKKKKKKKKK 51
           + + ++ KEE E+    + E   ED E+ + K+K K+++K
Sbjct: 116 RFLAQKQKEEEERRVERRRELGLEDPEQLRLKQKAKEEQK 155



 Score = 28.1 bits (63), Expect = 6.8
 Identities = 10/43 (23%), Positives = 24/43 (55%), Gaps = 1/43 (2%)

Query: 25  EENEQEEKEKEDVEE-EKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
            + ++EE+E+      E   +  ++ + K+K K+++K + EE 
Sbjct: 119 AQKQKEEEERRVERRRELGLEDPEQLRLKQKAKEEQKAESEET 161


>gnl|CDD|221919 pfam13094, CENP-Q, CENP-A-nucleosome distal (CAD) centromere
           subunit.  CENP-Q is one of the components that assembles
           onto the CENP-A-nucleosome distal (CAD) centromere. The
           centromere, which is the basic element of chromosome
           inheritance, is epigenetically determined in mammals.
           CENP-A, the centromere-specific histone H3 variant,
           assembles an array of nucleosomes and it is this that
           seems to be the prime candidate for specifying
           centromere identity. CENP-A nucleosomes directly recruit
           a proximal CENP-A nucleosome associated complex (NAC)
           comprised of CENP-M, CENP-N and CENP-T, CENP-U(50),
           CENP-C and CENP-H. Assembly of the CENP-A NAC at
           centromeres is dependent on CENP-M, CENP-N and CENP-T.
           Additionally, there are seven other subunits which make
           up the CENP-A-nucleosome distal (CAD) centromere,
           CENP-K, CENP-L, CENP-O, CENP-P, CENP-Q, CENP-R and
           CENP-S, also assembling on the CENP-A NAC. Fta7 is the
           equivalent component of the fission yeast Sim4 complex.
          Length = 159

 Score = 28.5 bits (64), Expect = 4.1
 Identities = 22/77 (28%), Positives = 40/77 (51%), Gaps = 3/77 (3%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKEDVEE-EKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
            K++E   +E  +EE E E K+ E ++E EKN K  +++++++ K      +  E  V E
Sbjct: 40  LKQLELLEEEIRREEAELE-KDLEYLQELEKNAKALEREREEESKNLHPVLRLLESEVLE 98

Query: 71  IEESLKERKEKTLTEDS 87
             E L +     L+E +
Sbjct: 99  ENELL-QDSLLELSERN 114


>gnl|CDD|225350 COG2740, COG2740, Predicted nucleic-acid-binding protein
          implicated in transcription termination
          [Transcription].
          Length = 95

 Score = 27.3 bits (61), Expect = 4.1
 Identities = 10/43 (23%), Positives = 19/43 (44%)

Query: 41 KNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTL 83
           +++  +K KKKK   +  K K  +    E+ + L E +    
Sbjct: 51 PDRECIEKAKKKKLFSRALKAKVSDSLYDELIQLLAELEPPRN 93


>gnl|CDD|225123 COG2213, MtlA, Phosphotransferase system, mannitol-specific IIBC
           component [Carbohydrate transport and metabolism].
          Length = 472

 Score = 29.2 bits (66), Expect = 4.1
 Identities = 10/44 (22%), Positives = 21/44 (47%), Gaps = 1/44 (2%)

Query: 29  QEEKEKEDVEEEKNK-KKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
            + + ++D+E    K K+ K + K +        K +  +VK+I
Sbjct: 338 SKTESEDDLEAATAKMKEMKGESKGQSSALAADVKNDLSNVKKI 381


>gnl|CDD|191868 pfam07839, CaM_binding, Plant calmodulin-binding domain.  The
          sequences featured in this family are found repeated in
          a number of plant calmodulin-binding proteins, and are
          thought to constitute the calmodulin-binding domains.
          Binding of the proteins to calmodulin depends on the
          presence of calcium ions. These proteins are thought to
          be involved in various processes, such as plant defence
          responses and stolonisation or tuberization.
          Length = 115

 Score = 27.7 bits (62), Expect = 4.1
 Identities = 15/52 (28%), Positives = 25/52 (48%), Gaps = 2/52 (3%)

Query: 33 EKEDVEEEKNKKKKKKKKKKKKK--KKKKKKKKEEQSVKEIEESLKERKEKT 82
           +E  EE ++KK+  K +K+K K     KK    ++ VK +E+  K      
Sbjct: 1  TEEKKEERRSKKEDPKPEKEKPKSWSNLKKVILLKRFVKSLEKVRKFNPRGP 52


>gnl|CDD|234306 TIGR03674, fen_arch, flap structure-specific endonuclease.
           Endonuclease that cleaves the 5'-overhanging flap
           structure that is generated by displacement synthesis
           when DNA polymerase encounters the 5'-end of a
           downstream Okazaki fragment. Has 5'-endo-/exonuclease
           and 5'-pseudo-Y-endonuclease activities. Cleaves the
           junction between single and double-stranded regions of
           flap DNA.
          Length = 338

 Score = 28.8 bits (65), Expect = 4.1
 Identities = 12/30 (40%), Positives = 19/30 (63%), Gaps = 1/30 (3%)

Query: 11  KKKRIEKRNKEEEKEENEQEE-KEKEDVEE 39
           K + +E+R +  E+ E + EE  EK D+EE
Sbjct: 87  KAETLEERREIREEAEEKWEEALEKGDLEE 116


>gnl|CDD|219572 pfam07780, Spb1_C, Spb1 C-terminal domain.  This presumed domain is
           found at the C-terminus of a family of FtsJ-like
           methyltransferases. Members of this family are involved
           in 60S ribosomal biogenesis.
          Length = 212

 Score = 28.4 bits (64), Expect = 4.1
 Identities = 26/69 (37%), Positives = 35/69 (50%), Gaps = 14/69 (20%)

Query: 9   GEKKKRIEKR-NKEEEKEE--NEQEEK-EKEDVEE-----EKNKKKKKKKKK-----KKK 54
             KK R  KR  K ++K E   E E+  E+E  ++     +K KKKKKK+KK     KK 
Sbjct: 118 ARKKMRALKRLEKAKKKAEAIAEDEDISEREKAKQIKKLYKKAKKKKKKRKKTYVVAKKG 177

Query: 55  KKKKKKKKK 63
            K K  + K
Sbjct: 178 NKGKAGRPK 186


>gnl|CDD|191634 pfam06886, TPX2, Targeting protein for Xklp2 (TPX2).  This family
          represents a conserved region approximately 60 residues
          long within the eukaryotic targeting protein for Xklp2
          (TPX2). Xklp2 is a kinesin-like protein localised on
          centrosomes throughout the cell cycle and on spindle
          pole microtubules during metaphase. In Xenopus, it has
          been shown that Xklp2 protein is required for
          centrosome separation and maintenance of spindle
          bi-polarity. TPX2 is a microtubule-associated protein
          that mediates the binding of the C-terminal domain of
          Xklp2 to microtubules. It is phosphorylated during
          mitosis in a microtubule-dependent way.
          Length = 57

 Score = 26.6 bits (59), Expect = 4.2
 Identities = 12/34 (35%), Positives = 22/34 (64%)

Query: 13 KRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKK 46
          K++E++ K  E E+ E E ++KE+ EE   + +K
Sbjct: 15 KKLEEKEKALEAEKEEAEARQKEEEEEAIKQLRK 48


>gnl|CDD|149587 pfam08586, Rsc14, RSC complex, Rsc14/Ldb7 subunit.  RSC is an
          ATP-dependent chromatin remodelling complex found in
          yeast. The RSC components Rsc7/Npl6 and Rsc14/Ldb7
          interact physically and/or functionally with Rsc3,
          Rsc30, and Htl1 to form a module important for a broad
          range of RSC functions.
          Length = 101

 Score = 27.5 bits (61), Expect = 4.2
 Identities = 10/33 (30%), Positives = 21/33 (63%)

Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKK 52
          KEE +E  +QE+ +++D +  + K+   +K K+
Sbjct: 23 KEELRELTQQEDDQRKDRDLPELKRNDTEKGKR 55


>gnl|CDD|224911 COG2000, COG2000, Predicted Fe-S protein [General function
           prediction only].
          Length = 226

 Score = 28.7 bits (64), Expect = 4.2
 Identities = 18/90 (20%), Positives = 34/90 (37%)

Query: 17  KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLK 76
            R  E  +   ++  K  +    E  +  +   + K   K+  K +KE + V  ++    
Sbjct: 26  SRCDEFAEALLKKLVKIDDCPFLELERFAENYDRLKAILKELGKAEKEMKIVGVLDGYEA 85

Query: 77  ERKEKTLTEDSQCDTMIPVFKKGVLYKGIY 106
           +   K L  +  C  ++  FK   L KG  
Sbjct: 86  DFVLKPLPNECSCREILFPFKSKELEKGDI 115


>gnl|CDD|227463 COG5134, COG5134, Uncharacterized conserved protein [Function
           unknown].
          Length = 272

 Score = 28.9 bits (64), Expect = 4.3
 Identities = 15/96 (15%), Positives = 31/96 (32%), Gaps = 4/96 (4%)

Query: 17  KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLK 76
             N E+  E +  E  EK+  +++  K          +  K+        S +  ++  +
Sbjct: 122 AENVEKVPESDAIEALEKQLTQQKSEKHNSSAINFIDELNKRLWSDPFVSSQRLRKQFRE 181

Query: 77  ERKEKTLTEDSQCDTMIPVFKKGVLYKGIYCPSLTN 112
            +K     E  Q    + +  +  L   I   S   
Sbjct: 182 RKK----IEKKQEAKDLSLKNRAALDIDILPSSSDK 213


>gnl|CDD|176075 cd08693, C2_PI3K_class_I_beta_delta, C2 domain present in class I
           beta and delta phosphatidylinositol 3-kinases (PI3Ks).
           PI3Ks (AKA phosphatidylinositol (PtdIns) 3-kinases)
           regulate cell processes such as cell growth,
           differentiation, proliferation, and motility.  PI3Ks
           work on phosphorylation of phosphatidylinositol,
           phosphatidylinositide (4)P (PtdIns (4)P),2 or
           PtdIns(4,5)P2. Specifically they phosphorylate the D3
           hydroxyl group of phosphoinositol lipids on the inositol
           ring. There are 3 classes of PI3Ks based on structure,
           regulation, and specificity. All classes contain a C2
           domain, a PIK domain, and a kinase catalytic domain.
           The members here are class I, beta and delta isoforms of
           PI3Ks and contain both a Ras-binding domain and a
           p85-binding domain.  Class II PI3Ks contain both of
           these as well as a PX domain, and a C-terminal C2 domain
           containing a nuclear localization signal.  C2 domains
           fold into an 8-standed beta-sandwich that can adopt 2
           structural arrangements: Type I and Type II,
           distinguished by a circular permutation involving their
           N- and C-terminal beta strands. Many C2 domains are
           Ca2+-dependent membrane-targeting modules that bind a
           wide variety of substances including bind phospholipids,
           inositol polyphosphates, and intracellular proteins.
           Most C2 domain proteins are either signal transduction
           enzymes that contain a single C2 domain, such as protein
           kinase C, or membrane trafficking proteins which contain
           at least two C2 domains, such as synaptotagmin 1.
           However, there are a few exceptions to this including
           RIM isoforms and some splice variants of piccolo/aczonin
           and intersectin which only have a single C2 domain.  C2
           domains with a calcium binding region have negatively
           charged residues, primarily aspartates, that serve as
           ligands for calcium ions.  Members have a type-I
           topology.
          Length = 173

 Score = 28.4 bits (64), Expect = 4.4
 Identities = 11/21 (52%), Positives = 14/21 (66%)

Query: 42  NKKKKKKKKKKKKKKKKKKKK 62
             KK K K+ +K + KKKKKK
Sbjct: 85  VSKKAKGKRSRKNQTKKKKKK 105



 Score = 28.0 bits (63), Expect = 5.0
 Identities = 12/22 (54%), Positives = 16/22 (72%)

Query: 40  EKNKKKKKKKKKKKKKKKKKKK 61
           E +KK K K+ +K + KKKKKK
Sbjct: 84  EVSKKAKGKRSRKNQTKKKKKK 105



 Score = 27.7 bits (62), Expect = 6.3
 Identities = 11/21 (52%), Positives = 14/21 (66%)

Query: 43  KKKKKKKKKKKKKKKKKKKKK 63
             KK K K+ +K + KKKKKK
Sbjct: 85  VSKKAKGKRSRKNQTKKKKKK 105


>gnl|CDD|236413 PRK09210, PRK09210, RNA polymerase sigma factor RpoD; Validated.
          Length = 367

 Score = 28.8 bits (65), Expect = 4.4
 Identities = 19/42 (45%), Positives = 24/42 (57%), Gaps = 2/42 (4%)

Query: 54 KKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDTMIPV 95
          KK KKK+    EQ VKE+ E  K +K  TLT D   + +IP 
Sbjct: 1  KKTKKKETTTLEQVVKELIE--KGKKRGTLTYDEIAEKLIPF 40



 Score = 28.4 bits (64), Expect = 6.5
 Identities = 10/38 (26%), Positives = 18/38 (47%)

Query: 40 EKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKE 77
          +  KK+    ++  K+  +K KK+   +  EI E L  
Sbjct: 2  KTKKKETTTLEQVVKELIEKGKKRGTLTYDEIAEKLIP 39


>gnl|CDD|233224 TIGR00993, 3a0901s04IAP86, chloroplast protein import component
           Toc86/159, G and M domains.  The long precursor of the
           86K protein originally described is proposed to have
           three domains. The N-terminal A-domain is acidic,
           repetitive, weakly conserved, readily removed by
           proteolysis during chloroplast isolation, and not
           required for protein translocation. The other domains
           are designated G (GTPase) and M (membrane anchor); this
           family includes most of the G domain and all of M
           [Transport and binding proteins, Amino acids, peptides
           and amines].
          Length = 763

 Score = 29.2 bits (65), Expect = 4.4
 Identities = 14/64 (21%), Positives = 35/64 (54%)

Query: 11  KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
            K ++ K +KE+ K   E+ +   + +++++ +++ K+ K  KK  K+  +  +  S + 
Sbjct: 417 TKAQMAKLSKEQRKAYLEEYDYRVKLLQKKQWREELKRMKMMKKFGKEIGELPDGYSEEV 476

Query: 71  IEES 74
            EE+
Sbjct: 477 DEEN 480


>gnl|CDD|227666 COG5374, COG5374, Uncharacterized conserved protein [Function
           unknown].
          Length = 192

 Score = 28.6 bits (64), Expect = 4.4
 Identities = 13/45 (28%), Positives = 23/45 (51%), Gaps = 2/45 (4%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKK 56
           K R+ K     E  +  QEE  K    ++ N+ +++ +K+  KKK
Sbjct: 149 KARLRKAQILLEGLQKNQEELFKL--LDKYNELREQVQKESSKKK 191


>gnl|CDD|216095 pfam00748, Calpain_inhib, Calpain inhibitor.  This region is found
           multiple times in calpain inhibitor proteins.
          Length = 131

 Score = 27.8 bits (62), Expect = 4.5
 Identities = 17/53 (32%), Positives = 29/53 (54%), Gaps = 7/53 (13%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKED-------VEEEKNKKKKKKKKKKKKK 55
           E+KK +E + KE+ KEE  ++  E+ED       + E K+K  K    K +++
Sbjct: 79  EEKKPVEDKVKEKAKEEKLEKLGEREDTIPPEYRLLEAKDKDGKPLLPKPEEE 131



 Score = 27.8 bits (62), Expect = 5.2
 Identities = 15/59 (25%), Positives = 26/59 (44%), Gaps = 14/59 (23%)

Query: 23  EKEENEQEEKEKEDVEEEKNKKKKKKK--------------KKKKKKKKKKKKKKEEQS 67
           ++E + +E+K  ED  +EK K++K +K               + K K  K    K E+ 
Sbjct: 73  QREPDPEEKKPVEDKVKEKAKEEKLEKLGEREDTIPPEYRLLEAKDKDGKPLLPKPEEE 131


>gnl|CDD|223403 COG0326, HtpG, Molecular chaperone, HSP90 family [Posttranslational
           modification, protein turnover, chaperones].
          Length = 623

 Score = 29.2 bits (66), Expect = 4.5
 Identities = 17/78 (21%), Positives = 33/78 (42%), Gaps = 6/78 (7%)

Query: 21  EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
           EE++ ++E+E+KE + + E   + K+    K K  +   +       +      L  + E
Sbjct: 491 EEDEADSEEEKKEFKPLLE---RVKEILGDKVKDVRLSHRLTDSPACLTTDGADLSTQME 547

Query: 81  KTLTEDSQCDTMIPVFKK 98
           + L    Q    +P  KK
Sbjct: 548 RLLKAQGQ---EVPESKK 562


>gnl|CDD|218370 pfam04997, RNA_pol_Rpb1_1, RNA polymerase Rpb1, domain 1.  RNA
           polymerases catalyze the DNA dependent polymerisation of
           RNA. Prokaryotes contain a single RNA polymerase
           compared to three in eukaryotes (not including
           mitochondrial. and chloroplast polymerases). This
           domain, domain 1, represents the clamp domain, which a
           mobile domain involved in positioning the DNA,
           maintenance of the transcription bubble and positioning
           of the nascent RNA strand.
          Length = 330

 Score = 28.8 bits (65), Expect = 4.5
 Identities = 14/52 (26%), Positives = 20/52 (38%), Gaps = 5/52 (9%)

Query: 26  ENEQEEKEKEDVEEEKNKKKKKKKKKK-----KKKKKKKKKKKEEQSVKEIE 72
            NE  +     V  +   K  KK+ KK     KKK    K  ++   +K  E
Sbjct: 105 LNESVKYFFLKVVIDPKGKNSKKRLKKINNLCKKKSICSKCGEDNGGLKAFE 156


>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein.  CDC45 is an essential gene
           required for initiation of DNA replication in S.
           cerevisiae, forming a complex with MCM5/CDC46.
           Homologues of CDC45 have been identified in human, mouse
           and smut fungus among others.
          Length = 583

 Score = 28.8 bits (65), Expect = 4.6
 Identities = 7/44 (15%), Positives = 30/44 (68%)

Query: 21  EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
           +EE  ++E +E + +D +++    +++  ++++++++ ++K+ E
Sbjct: 137 DEESSKSEDDEDDDDDDDDDDIATRERSLERRRRRREWEEKRAE 180



 Score = 28.4 bits (64), Expect = 7.0
 Identities = 6/46 (13%), Positives = 30/46 (65%)

Query: 21  EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
           +EE EE+ + E +++D +++ +     +++  ++++++++ +++  
Sbjct: 134 DEEDEESSKSEDDEDDDDDDDDDDIATRERSLERRRRRREWEEKRA 179



 Score = 28.0 bits (63), Expect = 8.5
 Identities = 10/53 (18%), Positives = 31/53 (58%)

Query: 21  EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
           EE  EE+E+  K ++D +++ +        +++  ++++++++ E+   E+E 
Sbjct: 131 EESDEEDEESSKSEDDEDDDDDDDDDDIATRERSLERRRRRREWEEKRAELEF 183


>gnl|CDD|240413 PTZ00423, PTZ00423, glideosome-associated protein 45;
          Provisional.
          Length = 193

 Score = 28.5 bits (63), Expect = 4.7
 Identities = 18/70 (25%), Positives = 40/70 (57%), Gaps = 3/70 (4%)

Query: 4  RRGGAGEKKKRIEKRNKEEEKEENEQE---EKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
          R+  A E K+R      E EK + E E   E++ ED+ EE   + ++  +++++ +++K 
Sbjct: 6  RKNKAKEPKRRDIDELAEREKLKKEVEEIPEQKPEDIVEELEDQPEEPPEQEEENEEQKP 65

Query: 61 KKKEEQSVKE 70
          K++ +  ++E
Sbjct: 66 KEEIDYPIQE 75


>gnl|CDD|224530 COG1614, CdhC, CO dehydrogenase/acetyl-CoA synthase beta subunit
           [Energy production and conversion].
          Length = 470

 Score = 29.0 bits (65), Expect = 4.7
 Identities = 13/31 (41%), Positives = 19/31 (61%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEE 40
           EK   + +R  EEE+EE E+EE+E  + E  
Sbjct: 392 EKGHPVVERWAEEEEEEEEEEEEEAAEAEAP 422


>gnl|CDD|216447 pfam01346, FKBP_N, Domain amino terminal to FKBP-type
           peptidyl-prolyl isomerase.  This family is only found at
           the amino terminus of pfam00254. This domain is of
           unknown function.
          Length = 124

 Score = 27.6 bits (62), Expect = 4.7
 Identities = 9/44 (20%), Positives = 23/44 (52%)

Query: 19  NKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
             +E K  +E+ +K  +  +++   K++ K +KK ++ K   + 
Sbjct: 59  AGKELKLTDEEIQKALQAFQKKLQAKQQAKAEKKAEENKAAGEA 102



 Score = 27.2 bits (61), Expect = 7.0
 Identities = 14/44 (31%), Positives = 24/44 (54%)

Query: 38  EEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
           +EE  K  +  +KK + K++ K +KK E++    E  L E  +K
Sbjct: 67  DEEIQKALQAFQKKLQAKQQAKAEKKAEENKAAGEAFLAENAKK 110


>gnl|CDD|211398 cd11560, W2_eIF5C_like, C-terminal W2 domain of the eukaryotic
          translation initiation factor 5C and similar proteins. 
          eIF5C appears to be essential for the initiation of
          protein translation; its actual function, and
          specifically that of the C-terminal W2 domain, are not
          well understood. The Drosophila ortholog, kra
          (krasavietz) or exba (extra bases), may be involved in
          translational inhibition in neural development. The
          structure of this C-terminal domain resembles that of a
          set of concatenated HEAT repeats.
          Length = 194

 Score = 28.3 bits (64), Expect = 4.8
 Identities = 19/65 (29%), Positives = 40/65 (61%), Gaps = 2/65 (3%)

Query: 25 EENEQEEKEKEDVEE--EKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKT 82
          EE+  E  ++E ++E  E  +K+  ++ KK+ +++ K+   EE+ VKEI  ++KE+ +K+
Sbjct: 10 EEHFAEHFKEEGLDELVEFYRKQASQEIKKELQQELKEMIAEEEPVKEIIAAVKEQMKKS 69

Query: 83 LTEDS 87
             + 
Sbjct: 70 SLPEH 74


>gnl|CDD|220265 pfam09491, RE_AlwI, AlwI restriction endonuclease.  This family
           includes the AlwI (recognises GGATC), Bsp6I (recognises
           GC^NGC), BstNBI (recognises GASTC), PleI(recognises
           GAGTC) and MlyI (recognises GAGTC) restriction
           endonucleases.
          Length = 429

 Score = 28.9 bits (65), Expect = 4.8
 Identities = 11/51 (21%), Positives = 25/51 (49%)

Query: 31  EKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
           + + +++     K +K++ K K   KK  K+  E++     + S K++ E 
Sbjct: 92  DNDIDEIVNRILKFRKERLKNKDNIKKLSKEYFEKECYSISDSSSKKKFET 142


>gnl|CDD|181784 PRK09334, PRK09334, 30S ribosomal protein S25e; Provisional.
          Length = 86

 Score = 26.9 bits (60), Expect = 4.9
 Identities = 12/36 (33%), Positives = 24/36 (66%)

Query: 48 KKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTL 83
          ++KKKKKK+  K +K+ + ++  ++E L +R  K +
Sbjct: 1  QQKKKKKKEGSKTEKEIKSTIITLDEELLKRVAKEV 36



 Score = 26.5 bits (59), Expect = 7.3
 Identities = 16/45 (35%), Positives = 28/45 (62%), Gaps = 5/45 (11%)

Query: 43 KKKKKKKKKKKKKKKKKKKKK---EEQSVKEIEESLKERKEKTLT 84
          ++KKKKKK+  K +K+ K      +E+ +K + + +K  KEK +T
Sbjct: 1  QQKKKKKKEGSKTEKEIKSTIITLDEELLKRVAKEVK--KEKIVT 43


>gnl|CDD|222346 pfam13728, TraF, F plasmid transfer operon protein.  TraF protein
          undergoes proteolytic processing associated with
          export. The 19 amino acids at the amino terminus of the
          polypeptides appear to constitute a typical membrane
          leader peptide - not included in this family, while the
          remainder of the molecule is predicted to be primarily
          hydrophilic in character. F plasmid TraF and TraH are
          required for F pilus assembly and F plasmid transfer,
          and they are both localised to the outer membrane in
          the presence of the complete F transfer region,
          especially TraV, the putative anchor.
          Length = 215

 Score = 28.3 bits (64), Expect = 4.9
 Identities = 6/39 (15%), Positives = 14/39 (35%)

Query: 43 KKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
            + K+ +K        +     + +  + E L+E   K
Sbjct: 8  PPEPKEPEKAPAPAAPAQSPSATEQMAWLREYLEEALAK 46


>gnl|CDD|221630 pfam12542, CWC25, Pre-mRNA splicing factor.  This domain family
          is found in eukaryotes, and is approximately 100 amino
          acids in length. The family is found in association
          with pfam10197. There is a single completely conserved
          residue Y that may be functionally important. Cwc25 has
          been identified to associate with pre-mRNA splicing
          factor Cef1/Ntc85, a component of the Prp19-associated
          complex (NTC) involved in spliceosome activation. Cwc25
          is neither tightly associated with NTC nor required for
          spliceosome activation, but is required for the first
          catalytic reaction.
          Length = 95

 Score = 27.0 bits (60), Expect = 4.9
 Identities = 13/64 (20%), Positives = 28/64 (43%), Gaps = 9/64 (14%)

Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKK---------KKKKKKKKKK 62
          KKR++K  +E++ EE +Q+   +  +E   N         K         KK++++ ++ 
Sbjct: 31 KKRVDKLLEEKDAEEAKQKSSGEALIEGASNANSANDTWNKLREDPLLAIKKREQEARQA 90

Query: 63 KEEQ 66
              
Sbjct: 91 LMNN 94


>gnl|CDD|116948 pfam08367, M16C_assoc, Peptidase M16C associated.  This domain
          appears in eukaryotes as well as bacteria and tends to
          be found near the C-terminus of the metalloprotease
          M16C (pfam05193).
          Length = 248

 Score = 28.3 bits (64), Expect = 5.0
 Identities = 11/41 (26%), Positives = 23/41 (56%)

Query: 46 KKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTED 86
          +K  +++++K +KKK    E+  ++I E   E +E   T +
Sbjct: 9  EKLNEEEREKLEKKKSSLSEEDKEKIIERGLELEELQSTPE 49



 Score = 28.3 bits (64), Expect = 6.0
 Identities = 13/44 (29%), Positives = 26/44 (59%)

Query: 34 KEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKE 77
           E +EE+ N+++++K +KKK    ++ K+K  +   E+EE    
Sbjct: 4  DETLEEKLNEEEREKLEKKKSSLSEEDKEKIIERGLELEELQST 47


>gnl|CDD|220897 pfam10883, DUF2681, Protein of unknown function (DUF2681).  This
          family of proteins is found in bacteria. Proteins in
          this family are typically between 81 and 117 amino
          acids in length.
          Length = 87

 Score = 27.0 bits (60), Expect = 5.0
 Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)

Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEE-KNKKKKKK 48
          K K+ ++ N  + + ENEQ   EK   E E KN K ++K
Sbjct: 24 KLKKAQREN-RKLQAENEQLATEKAVAETEVKNAKVRQK 61


>gnl|CDD|221550 pfam12366, Casc1, Cancer susceptibility candidate 1.  This domain
           family is found in eukaryotes, and is typically between
           216 and 263 amino acids in length. Casc1 has many SNPs
           associated with cancer susceptibility.
          Length = 227

 Score = 28.5 bits (64), Expect = 5.0
 Identities = 10/51 (19%), Positives = 19/51 (37%), Gaps = 7/51 (13%)

Query: 38  EEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE--KTLTED 86
             E+ +  +       K+ K K  K       EI + +K+  E  + +  D
Sbjct: 71  IAEEAEGARGYVTDIPKEYKAKPVKY-----LEIPKPIKDPPELPEGMFPD 116



 Score = 28.5 bits (64), Expect = 5.1
 Identities = 9/43 (20%), Positives = 15/43 (34%)

Query: 29  QEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
           +  +E E          K+ K K  K  +  K  K+   + E 
Sbjct: 70  KIAEEAEGARGYVTDIPKEYKAKPVKYLEIPKPIKDPPELPEG 112


>gnl|CDD|239286 cd02988, Phd_like_VIAF, Phosducin (Phd)-like family, Viral
          inhibitor of apoptosis (IAP)-associated factor (VIAF)
          subfamily; VIAF is a Phd-like protein that functions in
          caspase activation during apoptosis. It was identified
          as an IAP binding protein through a screen of a human
          B-cell library using a prototype IAP. VIAF lacks a
          consensus IAP binding motif and while it does not
          function as an IAP antagonist, it still plays a
          regulatory role in the complete activation of caspases.
          VIAF itself is a substrate for IAP-mediated
          ubiquitination, suggesting that it may be a target of
          IAPs in the prevention of cell death. The similarity of
          VIAF to Phd points to a potential role distinct from
          apoptosis regulation. Phd functions as a cytosolic
          regulator of G protein by specifically binding to G
          protein betagamma (Gbg)-subunits. The C-terminal domain
          of Phd adopts a thioredoxin fold, but it does not
          contain a CXXC motif. Phd interacts with G protein beta
          mostly through the N-terminal helical domain.
          Length = 192

 Score = 28.4 bits (64), Expect = 5.0
 Identities = 15/62 (24%), Positives = 30/62 (48%), Gaps = 3/62 (4%)

Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
          E   +    N  E+K  +E +E+  E+ ++   ++ ++K+  + K   +K K  E   V 
Sbjct: 29 ELAIQEAHENALEKKLLDELDEELDEEEDDRFLEEYRRKRLAEMKALAEKSKFGE---VY 85

Query: 70 EI 71
          EI
Sbjct: 86 EI 87


>gnl|CDD|235185 PRK03980, PRK03980, flap endonuclease-1; Provisional.
          Length = 292

 Score = 28.6 bits (65), Expect = 5.0
 Identities = 13/30 (43%), Positives = 20/30 (66%), Gaps = 1/30 (3%)

Query: 11 KKKRIEKRNKEEEKEENEQEE-KEKEDVEE 39
          K + IE+R +  E+ E + EE KE+ D+EE
Sbjct: 40 KAEEIEERREVREEAEEKYEEAKEEGDLEE 69


>gnl|CDD|191022 pfam04538, BEX, Brain expressed X-linked like family.  This is a
          family of transcription elongation factors which
          includes those referred to as Bex proteins as well as
          those named TCEAL7. Bex1 was shown to be a novel link
          between neurotrophin signalling, the cell cycle, and
          neuronal differentiation, suggesting it might function
          by coordinating internal cellular states with the
          ability of cells to respond to external signals. TCEAL7
          has been shown negatively to regulate the NF-kappaB
          pathway, hence being important in ovarian cancer as it
          one of the genes frequently downregulated in this
          cancer. A closely related protein, TFIIS/TCEA, found in
          pfam07500 is involved in transcription elongation and
          transcript fidelity. TFIIS/TCEA promotes 3'
          endoribonuclease activity of RNA polymerase II (pol II)
          and allows pol II to bypass transcript pause or
          'arrest' during elongation process. It is thus possible
          that BEX is also acting in this way.
          Length = 97

 Score = 27.3 bits (61), Expect = 5.0
 Identities = 16/40 (40%), Positives = 25/40 (62%), Gaps = 2/40 (5%)

Query: 15 IEKRNKE-EEKEENEQ-EEKEKEDVEEEKNKKKKKKKKKK 52
          +EK  KE E K E+E  EE+EK  +EE + KK +   +++
Sbjct: 1  MEKPCKENEGKPESEPKEEEEKRPLEEGEGKKPEGNFRRR 40


>gnl|CDD|153355 cd07671, F-BAR_PSTPIP1, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Proline-Serine-Threonine
           Phosphatase-Interacting Protein 1.  F-BAR domains are
           dimerization modules that bind and bend membranes and
           are found in proteins involved in membrane dynamics and
           actin reorganization. Proline-Serine-Threonine
           Phosphatase-Interacting Protein 1 (PSTPIP1), also known
           as CD2 Binding Protein 1 (CD2BP1), is mainly expressed
           in hematopoietic cells. It is a binding partner of the
           cell surface receptor CD2 and PTP-PEST, a tyrosine
           phosphatase which functions in cell motility and Rac1
           regulation. It also plays a role in the activation of
           the Wiskott-Aldrich syndrome protein (WASP), which
           couples actin rearrangement and T cell activation.
           Mutations in the gene encoding PSTPIP1 cause the
           autoinflammatory disorder known as PAPA (pyogenic
           sterile arthritis, pyoderma gangrenosum, and acne)
           syndrome. PSTPIP1 contains an N-terminal F-BAR domain,
           PEST motifs, and a C-terminal SH3 domain. F-BAR domains
           form banana-shaped dimers with a positively-charged
           concave surface that binds to negatively-charged lipid
           membranes. They can induce membrane deformation in the
           form of long tubules.
          Length = 242

 Score = 28.4 bits (63), Expect = 5.0
 Identities = 19/80 (23%), Positives = 33/80 (41%)

Query: 11  KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
           KK    K+  E+   E ++ E+  E      N K+ +K + K K+ +    + E    + 
Sbjct: 120 KKTMESKKTYEQRCREADEAEQTFERSSSTGNPKQSEKSQNKAKQCRDAATEAERVYKQN 179

Query: 71  IEESLKERKEKTLTEDSQCD 90
           IE+  K R E        C+
Sbjct: 180 IEQLDKARTEWETEHILTCE 199


>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein.  This family consists of
           AF4 (Proto-oncogene AF4) and FMR2 (Fragile X E mental
           retardation syndrome) nuclear proteins. These proteins
           have been linked to human diseases such as acute
           lymphoblastic leukaemia and mental retardation. The
           family also contains a Drosophila AF4 protein homologue
           Lilliputian which contains an AT-hook domain.
           Lilliputian represents a novel pair-rule gene that acts
           in cytoskeleton regulation, segmentation and
           morphogenesis in Drosophila.
          Length = 1154

 Score = 29.1 bits (65), Expect = 5.0
 Identities = 18/73 (24%), Positives = 31/73 (42%), Gaps = 3/73 (4%)

Query: 18  RNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKK---KKKKKKKKKKKKEEQSVKEIEES 74
             K  EK+     +K+      EK+  K K+K K   +  K + KK++ EE+S      S
Sbjct: 720 PPKPAEKDSLSAPKKQTSKTASEKSSSKGKRKHKNDEEADKIESKKQRLEEKSSSCSPSS 779

Query: 75  LKERKEKTLTEDS 87
                  +  ++S
Sbjct: 780 SSSHHHSSSNKES 792


>gnl|CDD|153289 cd07605, I-BAR_IMD, Inverse (I)-BAR, also known as the IRSp53/MIM
           homology Domain (IMD), a dimerization module that binds
           and bends membranes.  Inverse (I)-BAR (or IMD) is a
           member of the Bin/Amphiphysin/Rvs (BAR) domain family.
           It is a dimerization and lipid-binding module that bends
           membranes and induces membrane protrusions in the
           opposite direction compared to classical BAR and F-BAR
           domains, which produce membrane invaginations. IMD
           domains are found in Insulin Receptor tyrosine kinase
           Substrate p53 (IRSp53), Missing in Metastasis (MIM), and
           Brain-specific Angiogenesis Inhibitor 1-Associated
           Protein 2-like (BAIAP2L) proteins. These are
           multi-domain proteins that act as scaffolding proteins
           and transducers of a variety of signaling pathways that
           link membrane dynamics and the underlying actin
           cytoskeleton. Most members contain an N-terminal IMD, an
           SH3 domain, and a WASP homology 2 (WH2) actin-binding
           motif at the C-terminus, exccept for MIM which does not
           carry an SH3 domain. Some members contain additional
           domains and motifs. The IMD domain binds and bundles
           actin filaments, binds membranes and produces membrane
           protrusions, and interacts with the small GTPase Rac.
          Length = 223

 Score = 28.5 bits (64), Expect = 5.0
 Identities = 17/68 (25%), Positives = 42/68 (61%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           EKK  ++++   + +++ ++E K+K +  ++   + KK +KK +K    K ++K +Q+++
Sbjct: 100 EKKLELDQKVINKFEKDYKKEYKQKREDLDKARSELKKLQKKSQKSGTGKYQEKLDQALE 159

Query: 70  EIEESLKE 77
           E+ +  KE
Sbjct: 160 ELNDKQKE 167


>gnl|CDD|187774 cd09643, Csn1, CRISPR/Cas system-associated protein Cas9.  CRISPR
           (Clustered Regularly Interspaced Short Palindromic
           Repeats) and associated Cas proteins comprise a system
           for heritable host defense by prokaryotic cells against
           phage and other foreign DNA; Very large protein
           containing McrA/HNH-nuclease related domain and a
           RuvC-like nuclease domain; signature gene for type II.
          Length = 799

 Score = 28.9 bits (65), Expect = 5.1
 Identities = 11/35 (31%), Positives = 18/35 (51%)

Query: 37  VEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
               K  K +KK++KK +   K+     EQ +KE+
Sbjct: 524 NGTNKGTKNRKKRQKKNEDNIKEAASALEQKLKEL 558



 Score = 28.9 bits (65), Expect = 5.6
 Identities = 16/68 (23%), Positives = 30/68 (44%), Gaps = 4/68 (5%)

Query: 18  RNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKE 77
           R  EE+ E     ++EK+++ +       KK K   K+K+K    KEE+  K +     +
Sbjct: 299 RIIEEQGETKILSKEEKQELLDLL----FKKNKLTYKQKRKLLGLKEEEIFKGLRYEGLK 354

Query: 78  RKEKTLTE 85
            ++     
Sbjct: 355 AEKNFNIS 362


>gnl|CDD|217360 pfam03087, DUF241, Arabidopsis protein of unknown function.  This
           family represents a number of Arabidopsis proteins.
           Their functions are unknown.
          Length = 230

 Score = 28.5 bits (64), Expect = 5.1
 Identities = 14/64 (21%), Positives = 31/64 (48%), Gaps = 6/64 (9%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
               EK++K+   + +     E   ++EEK +  ++ K   K+ ++       E S++E+
Sbjct: 156 SLLSEKKSKKGGSQSSLSIRSELSRLDEEKRRDNEEVKDILKRLEEL------ENSIEEL 209

Query: 72  EESL 75
           E+ L
Sbjct: 210 EDEL 213


>gnl|CDD|219978 pfam08701, GN3L_Grn1, GNL3L/Grn1 putative GTPase.  Grn1 (yeast)
          and GNL3L (human) are putative GTPases which are
          required for growth and play a role in processing of
          nucleolar pre-rRNA. This family contains a potential
          nuclear localisation signal.
          Length = 80

 Score = 26.8 bits (60), Expect = 5.1
 Identities = 9/35 (25%), Positives = 27/35 (77%)

Query: 31 EKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
          E+  E++EE+K K++++K+++K+ +K ++ + ++ 
Sbjct: 45 EEILEEIEEKKRKQEEEKERRKEARKAERAEARKR 79


>gnl|CDD|184349 PRK13824, PRK13824, replication initiation protein RepC;
           Provisional.
          Length = 404

 Score = 28.7 bits (65), Expect = 5.1
 Identities = 11/20 (55%), Positives = 13/20 (65%)

Query: 315 INSAGMYTKYLTDRGQRRAF 334
           INSAG Y + LT R +R  F
Sbjct: 364 INSAGGYLRDLTRRAERGEF 383


>gnl|CDD|224023 COG1098, VacB, Predicted RNA binding protein (contains ribosomal
           protein S1 domain) [Translation, ribosomal structure and
           biogenesis].
          Length = 129

 Score = 27.6 bits (62), Expect = 5.2
 Identities = 14/44 (31%), Positives = 25/44 (56%)

Query: 39  EEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKT 82
           EE+ +K+ +K +  K + K+  +    + +K IEESL   K +T
Sbjct: 77  EEEPEKQHRKPRFSKSRPKEGFETLLSRLLKWIEESLSTLKRQT 120



 Score = 26.9 bits (60), Expect = 9.2
 Identities = 10/66 (15%), Positives = 29/66 (43%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
               G      +++E+  +++ ++    + + KE  E   ++  K  ++     K++ KK
Sbjct: 63  IDENGKISLSIRKLEEEPEKQHRKPRFSKSRPKEGFETLLSRLLKWIEESLSTLKRQTKK 122

Query: 62  KKEEQS 67
           K+  + 
Sbjct: 123 KRGGRG 128


>gnl|CDD|232985 TIGR00463, gltX_arch, glutamyl-tRNA synthetase, archaeal and
          eukaryotic family.  The glutamyl-tRNA synthetases of
          the eukaryotic cytosol and of the Archaea are more
          similar to glutaminyl-tRNA synthetases than to
          bacterial glutamyl-tRNA synthetases. This model models
          just the eukaryotic cytosolic and archaeal forms of the
          enzyme. In some eukaryotes, the glutamyl-tRNA
          synthetase is part of a longer, multifunctional
          aminoacyl-tRNA ligase. In many species, the charging of
          tRNA(gln) proceeds first through misacylation with Glu
          and then transamidation. For this reason, glutamyl-tRNA
          synthetases, including all known archaeal enzymes (as
          of 2010) may act on both tRNA(gln) and tRNA(glu)
          [Protein synthesis, tRNA aminoacylation].
          Length = 556

 Score = 28.6 bits (64), Expect = 5.2
 Identities = 22/71 (30%), Positives = 30/71 (42%), Gaps = 8/71 (11%)

Query: 3  KRRGGAGEK-------KKRIEKRNKEEE-KEENEQEEKEKEDVEEEKNKKKKKKKKKKKK 54
          K RG A  K           E R K +E  E  E   +E   +  E+ K+  K+     K
Sbjct: 15 KYRGKANPKAVMGAVMSNNPELRKKAKEVLEAVEAAVEEVNSLSPEEQKELMKRLGLDIK 74

Query: 55 KKKKKKKKKEE 65
          KK+KK+K   E
Sbjct: 75 KKEKKRKGLRE 85


>gnl|CDD|217611 pfam03552, Cellulose_synt, Cellulose synthase.  Cellulose, an
           aggregate of unbranched polymers of beta-1,4-linked
           glucose residues, is the major component of wood and
           thus paper, and is synthesised by plants, most algae,
           some bacteria and fungi, and even some animals. The
           genes that synthesise cellulose in higher plants differ
           greatly from the well-characterized genes found in
           Acetobacter and Agrobacterium sp. More correctly
           designated as 'cellulose synthase catalytic subunits',
           plant cellulose synthase (CesA) proteins are integral
           membrane proteins, approximately 1,000 amino acids in
           length. There are a number of highly conserved residues,
           including several motifs shown to be necessary for
           processive glycosyltransferase activity.
          Length = 716

 Score = 28.9 bits (65), Expect = 5.2
 Identities = 13/47 (27%), Positives = 27/47 (57%)

Query: 41  KNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDS 87
           + KK K   K KKK  K+K+ +    ++++I+E  ++ K   +++ S
Sbjct: 312 RRKKPKSASKAKKKDSKRKESEAPIFNLEDIDEGDEDEKSSLMSQLS 358


>gnl|CDD|116500 pfam07888, CALCOCO1, Calcium binding and coiled-coil domain
           (CALCOCO1) like.  Proteins found in this family are
           similar to the coiled-coil transcriptional coactivator
           protein coexpressed by Mus musculus (CoCoA/CALCOCO1).
           This protein binds to a highly conserved N-terminal
           domain of p160 coactivators, such as GRIP1, and thus
           enhances transcriptional activation by a number of
           nuclear receptors. CALCOCO1 has a central coiled-coil
           region with three leucine zipper motifs, which is
           required for its interaction with GRIP1 and may regulate
           the autonomous transcriptional activation activity of
           the C-terminal region.
          Length = 546

 Score = 28.8 bits (64), Expect = 5.2
 Identities = 29/131 (22%), Positives = 56/131 (42%), Gaps = 17/131 (12%)

Query: 11  KKKRIEKRNKEEEK------EENEQEEKEKEDVEEEKNKKKKKKKKKKKK---------- 54
            K RI+K + E  K      EE  Q EK + ++  EK+  + +  + +++          
Sbjct: 369 DKDRIQKLSAELLKLEEWLQEERSQREKLEVELGIEKDCNRVQLSENRRELSELRSALRV 428

Query: 55  -KKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDTMIPVFKKGVLYKGIYCPSLTNS 113
            +K+K++ ++E+Q + +    L+ R +K   E    D       K  L++  Y   L + 
Sbjct: 429 LQKEKEQLQEEKQELLDYIRVLELRLDKEADEKWAEDAATCEDAKSYLHRAAYIDLLDSI 488

Query: 114 FREPSLEMSYQ 124
              P  E S +
Sbjct: 489 DESPLDESSSE 499


>gnl|CDD|181442 PRK08476, PRK08476, F0F1 ATP synthase subunit B'; Validated.
          Length = 141

 Score = 27.7 bits (62), Expect = 5.3
 Identities = 21/83 (25%), Positives = 39/83 (46%), Gaps = 4/83 (4%)

Query: 17  KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLK 76
           K N  +  E   + E   ++  EE NK ++K   K K++ +KK + K+ +   + E   K
Sbjct: 51  KTNSSDVSEIEHEIETILKNAREEANKIRQKAIAKAKEEAEKKIEAKKAELESKYEAFAK 110

Query: 77  --ERKEKTLTEDSQCDTMIPVFK 97
               +++ L E     + +P FK
Sbjct: 111 QLANQKQELKEQLL--SQMPEFK 131


>gnl|CDD|235516 PRK05582, PRK05582, DNA topoisomerase I; Validated.
          Length = 650

 Score = 28.6 bits (65), Expect = 5.3
 Identities = 15/40 (37%), Positives = 23/40 (57%)

Query: 24  KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
            +  + E K +EDV+E   + KKK  K  K KKK++K+  
Sbjct: 211 YKGKKIELKNEEDVKEILAELKKKDFKVSKVKKKERKRNP 250



 Score = 28.2 bits (64), Expect = 7.7
 Identities = 14/57 (24%), Positives = 27/57 (47%), Gaps = 5/57 (8%)

Query: 31  EKEKEDVEEE---KNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLT 84
           +K K+  E        KK + K ++  K+   + KK++  V ++++  KERK     
Sbjct: 198 KKGKKKFEASFYGYKGKKIELKNEEDVKEILAELKKKDFKVSKVKK--KERKRNPPP 252


>gnl|CDD|185616 PTZ00436, PTZ00436, 60S ribosomal protein L19-like protein;
           Provisional.
          Length = 357

 Score = 28.8 bits (63), Expect = 5.3
 Identities = 15/53 (28%), Positives = 24/53 (45%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
           +++ R + R +E  K E ++E   +ED       K+K   KK      KK  K
Sbjct: 166 DEQHRHKARKQELRKREKDRERARREDAAAAAAAKQKAAAKKAAAPSGKKSAK 218



 Score = 28.4 bits (62), Expect = 6.7
 Identities = 13/49 (26%), Positives = 30/49 (61%)

Query: 17  KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
           KRN  E   + + E+K++  + E+   K+ K ++ + K +K++ +K+E+
Sbjct: 135 KRNLMEHIHKVKNEKKKERQLAEQLAAKRLKDEQHRHKARKQELRKREK 183


>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2.  PPP4R2 (protein phosphatase 4 core
           regulatory subunit R2) is the regulatory subunit of the
           histone H2A phosphatase complex. It has been shown to
           confer resistance to the anticancer drug cisplatin in
           yeast, and may confer resistance in higher eukaryotes.
          Length = 285

 Score = 28.6 bits (64), Expect = 5.3
 Identities = 13/32 (40%), Positives = 20/32 (62%)

Query: 8   AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEE 39
             +     EK  KE+E+EE  +EE+E+ED +E
Sbjct: 254 DQDGDYVEEKELKEDEEEEETEEEEEEEDEDE 285


>gnl|CDD|143630 cd07892, PolyPPase_VTC2-3_like, Polyphosphate(polyP) polymerase
           domain of yeast vacuolar transport chaperone (VTC)
           proteins VTC-2, and -3 , and similar proteins.
           Saccharomyces cerevisiae VTC-1, -2, -3, and -4 comprise
           the membrane-integral VTC complex. VTC-2, -3, and -4
           contain polyP polymerase domains. S. cerevisiae
           VTC-2,and -3 belong to this subgroup. For VTC4 it has
           been shown that this domain generates polyP from ATP by
           a phosphotransfer reaction releasing ADP. This activity
           is metal ion-dependent. The ATP gamma phosphate may be
           cleaved and then transferred to an acceptor phosphate to
           form polyP. PolyP is ubiquitous. In prokaryotes, it is a
           store of phosphate and energy. In eukaryotes, polyPs
           have roles in  bone calcification, and osmoregulation,
           and in phosphate transport in the symbiosis of
           mycorrhizal fungi and plants. This subgroup belongs to
           the CYTH/triphosphate tunnel metalloenzyme (TTM)-like
           superfamily, whose enzymes have a unique active site
           located within an eight-stranded beta barrel.
          Length = 303

 Score = 28.5 bits (64), Expect = 5.3
 Identities = 17/68 (25%), Positives = 36/68 (52%), Gaps = 4/68 (5%)

Query: 15  IEKRNKEEEKEENEQEE---KEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
           +EK+  +E     E+++   KEK  +      K K +KK +K +K+    +  ++ V+ I
Sbjct: 98  VEKKTFDENTSSFEEDKLQLKEKY-INGFIFGKYKFEKKLQKMEKRGADLENLKKDVENI 156

Query: 72  EESLKERK 79
           ++ ++E K
Sbjct: 157 QDFIRENK 164


>gnl|CDD|203471 pfam06518, DUF1104, Protein of unknown function (DUF1104).  This
          family consists of several hypothetical proteins of
          unknown function which appear to be found largely in
          Helicobacter pylori.
          Length = 93

 Score = 26.9 bits (60), Expect = 5.4
 Identities = 15/65 (23%), Positives = 29/65 (44%), Gaps = 2/65 (3%)

Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
          E  KR+ K  ++E K      +        + + + ++K +K+ +++  KK      SVK
Sbjct: 29 EIDKRLGKMKEKEAKNFKAGFKAAMRKNLSKMSVEDRRKFRKEVREEMNKKVDA--MSVK 86

Query: 70 EIEES 74
          E  E 
Sbjct: 87 EAREL 91


>gnl|CDD|177779 PLN00185, PLN00185, 60S ribosomal protein L4-1; Provisional.
          Length = 405

 Score = 28.6 bits (64), Expect = 5.5
 Identities = 15/56 (26%), Positives = 26/56 (46%), Gaps = 4/56 (7%)

Query: 32  KEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDS 87
           K    +       + K KK+K  KK+K+  K+E  ++K   ++      KT+  DS
Sbjct: 338 KTARRMALLAEAARVKAKKEKLAKKRKQVSKEEAAAIKAAGKAW----YKTMISDS 389


>gnl|CDD|234045 TIGR02876, spore_yqfD, sporulation protein YqfD.  YqfD is part of
           the sigma-E regulon in the sporulation program of
           endospore-forming Gram-positive bacteria. Mutation
           results in a sporulation defect in Bacillus subtilis.
           Members are found in all currently known
           endospore-forming bacteria, including the genera
           Bacillus, Symbiobacterium, Carboxydothermus,
           Clostridium, and Thermoanaerobacter [Cellular processes,
           Sporulation and germination].
          Length = 382

 Score = 28.5 bits (64), Expect = 5.5
 Identities = 16/73 (21%), Positives = 33/73 (45%), Gaps = 3/73 (4%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV- 68
           EK K+          E  ++   E ++  E+  K++  +K K+K +K+ KK+      + 
Sbjct: 295 EKTKKFGLWRWSLPIEIVKETYYEVKEKVEKVTKEEAIEKAKEKAEKELKKELDPNAKIV 354

Query: 69  --KEIEESLKERK 79
             K + E ++  K
Sbjct: 355 SDKILSERVEGGK 367



 Score = 28.5 bits (64), Expect = 5.9
 Identities = 22/78 (28%), Positives = 36/78 (46%), Gaps = 3/78 (3%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
            K    EK  K      +   E  KE   E    K+K +K  K++  +K K+K E++  K
Sbjct: 289 AKYDVEEKTKKFGLWRWSLPIEIVKETYYE---VKEKVEKVTKEEAIEKAKEKAEKELKK 345

Query: 70  EIEESLKERKEKTLTEDS 87
           E++ + K   +K L+E  
Sbjct: 346 ELDPNAKIVSDKILSERV 363


>gnl|CDD|227003 COG4656, RnfC, Predicted NADH:ubiquinone oxidoreductase, subunit
           RnfC [Energy production and conversion].
          Length = 529

 Score = 28.9 bits (65), Expect = 5.5
 Identities = 13/70 (18%), Positives = 31/70 (44%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           EK + + +R + ++ EE +   + +    E +  ++  + KK   K+          ++ 
Sbjct: 429 EKAEILAQRQELKKAEEAKTRFEARTARLEREKAERIPRMKKAAVKRLSFVTAALIDAIA 488

Query: 70  EIEESLKERK 79
              E +KE+K
Sbjct: 489 AALERVKEKK 498


>gnl|CDD|216546 pfam01516, Orbi_VP6, Orbivirus helicase VP6.  The VP6 protein a
          minor protein in the core of the virion is probably the
          viral helicase.
          Length = 322

 Score = 28.5 bits (63), Expect = 5.5
 Identities = 13/65 (20%), Positives = 35/65 (53%)

Query: 4  RRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
          +R     K+++I+    +  +    +E++ K++  +E+     + ++K+K  K++  K+ 
Sbjct: 9  KRSSEELKQRQIQINLVDWFESGESKEKEPKDEDGQEQRISDGEGEQKQKGGKEESDKET 68

Query: 64 EEQSV 68
          E+ SV
Sbjct: 69 EDASV 73


>gnl|CDD|220129 pfam09159, Ydc2-catalyt, Mitochondrial resolvase Ydc2 / RNA
           splicing MRS1.  Members of this family adopt a secondary
           structure consisting of two beta sheets and one alpha
           helix, arranged as a beta-alpha-beta motif. Each beta
           sheet has five strands, arranged in a 32145 order, with
           the second strand being antiparallel to the rest.
           Mitochondrial resolvase Ydc2 is capable of resolving
           Holliday junctions and cleaves DNA after 5'-CT-3' and
           5'-TT-3' sequences. This family also contains the
           mitochondrial RNA-splicing protein MRS1 which is
           involved in the excision of group I introns.
          Length = 254

 Score = 28.5 bits (64), Expect = 5.5
 Identities = 10/28 (35%), Positives = 15/28 (53%)

Query: 33  EKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
           E+ ++  EK K + KKK  K  KK +  
Sbjct: 147 ERTEILAEKKKPRSKKKSSKNSKKLRID 174


>gnl|CDD|222635 pfam14264, Glucos_trans_II, Glucosyl transferase GtrII.  This
           family includes glucosyl transferase II from the
           Shigella phage SfII, which mediates seroconversion of S.
           flexneri when the phage is integrated into the host
           chromosome.
          Length = 317

 Score = 28.4 bits (64), Expect = 5.5
 Identities = 13/73 (17%), Positives = 24/73 (32%), Gaps = 10/73 (13%)

Query: 218 NNYLQWAAVGTWILLNTQGFISQGIGLNNKEYLIWY--ILFIVFVTYAMLPLPLKWCFIG 275
             Y  W  +G  +L      + + +       L     IL I+F+  +     L    I 
Sbjct: 20  YGYGGWVGLGRPLL----DLLKRILFGTGLTDLPPLPLILSILFLALSA----LLLARIF 71

Query: 276 GCTTSVLHVIITA 288
                ++ V+I  
Sbjct: 72  DLRNKLIAVLIAL 84


>gnl|CDD|233034 TIGR00584, mug, mismatch-specific thymine-DNA glycosylate (mug).
           All proteins in this family for whcih functions are
           known are G-T or G-U mismatch glycosylases that function
           in base excision repair. This family is based on the
           phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
           Stanford University). Used 2pf model [DNA metabolism,
           DNA replication, recombination, and repair].
          Length = 328

 Score = 28.5 bits (63), Expect = 5.5
 Identities = 27/116 (23%), Positives = 46/116 (39%), Gaps = 20/116 (17%)

Query: 2   RKRRGGAGEKKKRIE-KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
           RKR+  + E  + +E K+  + +K     + KEK++       K   K K K+K  +   
Sbjct: 60  RKRKPRSNEPYRPVEPKKPSDSKKSGKSTKSKEKQE-------KITDKFKVKRKVDRFNG 112

Query: 61  KKKEEQSVKEIEESLKERKEKTLTEDSQCDTMIPVFKKGVL--YKGIYCPSLTNSF 114
             + E   K + + L              D +I     G++  YKG + P   N F
Sbjct: 113 VSEAELLTKTLPDILTF----------NLDIVIIGINPGLMAAYKGHHYPGPGNHF 158


>gnl|CDD|217479 pfam03297, Ribosomal_S25, S25 ribosomal protein. 
          Length = 105

 Score = 27.3 bits (61), Expect = 5.6
 Identities = 11/30 (36%), Positives = 14/30 (46%)

Query: 41 KNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
          K +KK  K        K KKKK  +  VK+
Sbjct: 5  KQQKKAAKAAAASAGGKAKKKKWSKGKVKD 34


>gnl|CDD|113902 pfam05149, Flagellar_rod, Paraflagellar rod protein.  This family
          consists of several eukaryotic paraflagellar rod
          component proteins. The eukaryotic flagellum represents
          one of the most complex macromolecular structures found
          in any organism and contains more than 250 proteins. In
          addition to its locomotive role, the flagellum is
          probably involved in nutrient uptake since receptors
          for host low-density lipoproteins are localised on the
          flagellar membrane as well as on the flagellar pocket
          membrane.
          Length = 289

 Score = 28.5 bits (64), Expect = 5.6
 Identities = 19/76 (25%), Positives = 42/76 (55%), Gaps = 4/76 (5%)

Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
          E K R +     EE  E  +E++E+ED  + K +K+   ++++ +K  ++  ++++   +
Sbjct: 9  EAKHRQKSAC--EEDLERCKEKREEEDAADAKQRKRYSAQRRESEKFLQQNVEQQQACWR 66

Query: 70 EIEESLKERKEKTLTE 85
           I+E   ER+ + L E
Sbjct: 67 AIQE--LERELRDLAE 80


>gnl|CDD|239570 cd03488, Topoisomer_IB_N_htopoI_like, Topoisomer_IB_N_htopoI_like :
           N-terminal DNA binding fragment found in eukaryotic DNA
           topoisomerase (topo) IB proteins similar to the
           monomeric yeast and human topo I.  Topo I enzymes are
           divided into:  topo type IA (bacterial) and type IB
           (eukaryotic). Topo I relaxes superhelical tension in
           duplex DNA by creating a single-strand nick, the broken
           strand can then rotate around the unbroken strand to
           remove DNA supercoils and, the nick is religated,
           liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit religation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  This family may represent more than
           one structural domain.
          Length = 215

 Score = 28.1 bits (63), Expect = 5.6
 Identities = 16/53 (30%), Positives = 27/53 (50%), Gaps = 6/53 (11%)

Query: 20  KEEEKEENEQ-EEKEKEDVEE-----EKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
           K   KEE    ++  K D  +     +  K++KK   K++KK  K +K+K E+
Sbjct: 69  KVMTKEEKVIIKDFSKCDFTQMFAYFKAQKEEKKAMSKEEKKAIKAEKEKLEE 121


>gnl|CDD|221389 pfam12037, DUF3523, Domain of unknown function (DUF3523).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is typically
           between 257 to 277 amino acids in length. This domain is
           found associated with pfam00004. This domain has a
           conserved LER sequence motif.
          Length = 276

 Score = 28.6 bits (64), Expect = 5.7
 Identities = 13/72 (18%), Positives = 36/72 (50%), Gaps = 1/72 (1%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
            ++   R   EE+    + E  +E+ E E+ +  + K + + + + K++++ E+ + + +
Sbjct: 143 LRQEAMRRATEEEILEMRRETIEEEAELER-ENIRAKIEAEARGRAKEERENEDINREML 201

Query: 72  EESLKERKEKTL 83
           +    E +E  L
Sbjct: 202 KLKANEERETVL 213


>gnl|CDD|239572 cd03490, Topoisomer_IB_N_1, Topoisomer_IB_N_1: A subgroup of the
           N-terminal DNA binding fragment found in eukaryotic DNA
           topoisomerase (topo) IB. Topo IB proteins include the
           monomeric yeast and human topo I and heterodimeric topo
           I from Leishmania donvanni. Topo I enzymes are divided
           into:  topo type IA (bacterial) and type IB
           (eukaryotic). Topo I relaxes superhelical tension in
           duplex DNA by creating a single-strand nick, the broken
           strand can then rotate around the unbroken strand to
           remove DNA supercoils and, the nick is religated,
           liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit religation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  In addition to differences in
           structure and some biochemical properties,
           Trypanosomatid parasite topos I differ from human topo I
           in their sensitivity to CPTs and other classical topo I
           inhibitors. Trypanosomatid topos I have putative roles
           in organizing the kinetoplast DNA network unique to
           these parasites.  This family may represent more than
           one structural domain.
          Length = 217

 Score = 28.3 bits (63), Expect = 5.7
 Identities = 14/30 (46%), Positives = 22/30 (73%)

Query: 34  KEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
           K  +EEEK KKK   K++K+ KKK++ K++
Sbjct: 90  KNHLEEEKEKKKNLNKEEKEAKKKERAKRE 119


>gnl|CDD|234173 TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB.  Members
           of this protein family are the bacterial ATP-dependent
           chaperone ClpB. This protein belongs to the AAA family,
           ATPases associated with various cellular activities
           (pfam00004). This molecular chaperone does not act as a
           protease, but rather serves to disaggregate misfolded
           and aggregated proteins [Protein fate, Protein folding
           and stabilization].
          Length = 852

 Score = 28.8 bits (65), Expect = 5.8
 Identities = 22/72 (30%), Positives = 41/72 (56%), Gaps = 8/72 (11%)

Query: 14  RIEKRNKEEEKEENEQE-----EKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE-EQS 67
            IE+   ++EK+E  +E     EKE  ++EEE    +++ K +K   +  ++ K+E EQ 
Sbjct: 418 EIEREALKKEKDEASKERLEDLEKELAELEEEYADLEEQWKAEKAAIQGIQQIKEEIEQV 477

Query: 68  VKEIEESLKERK 79
             E+E++  ER+
Sbjct: 478 RLELEQA--ERE 487


>gnl|CDD|216016 pfam00602, Flu_PB1, Influenza RNA-dependent RNA polymerase subunit
           PB1.  Two GTP binding sites exist in this protein.
          Length = 740

 Score = 28.7 bits (64), Expect = 5.9
 Identities = 13/48 (27%), Positives = 26/48 (54%)

Query: 41  KNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQ 88
           KN K++   K+  K  K+K+  KE+  +   E  ++     T+T+D++
Sbjct: 186 KNNKRRYDDKETGKLLKRKQPSKEKDRITREEYLIRALTLNTMTKDAE 233


>gnl|CDD|212058 cd11489, SLC5sbd_SGLT5, Na(+)/glucose cotransporter SGLT5 and
           related proteins; solute-binding domain.  Human SGLT5 is
           a glucose transporter, which also transports galactose.
           It is encoded by the SLC5A10 gene, and is exclusively
           expressed in the renal cortex. This subgroup belongs to
           the solute carrier 5 (SLC5) transporter family.
          Length = 604

 Score = 28.8 bits (64), Expect = 5.9
 Identities = 12/49 (24%), Positives = 24/49 (48%), Gaps = 3/49 (6%)

Query: 109 SLTNSFREPSLEMS---YQRYSHRQRQKSLIIVNLVDMVLKAVLTVAWI 154
           SLT+ F   S   +   ++R      ++ L++V  +  V+   ++V WI
Sbjct: 364 SLTSIFNSSSTLFTMDIWRRLRPGASERELLLVGRLVTVILVGVSVVWI 412


>gnl|CDD|224520 COG1604, COG1604, CRISPR system related protein, RAMP superfamily
          [Defense    mechanisms].
          Length = 257

 Score = 28.2 bits (63), Expect = 5.9
 Identities = 16/58 (27%), Positives = 24/58 (41%)

Query: 25 EENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKT 82
          +E      +K   EEE  K    + +    K  K+  +  EQ VK  + +L   K KT
Sbjct: 25 KEQIINLDKKNAFEEEIIKDFVNEIQSYLSKVLKRLGRSYEQYVKNTKITLNSFKLKT 82


>gnl|CDD|189762 pfam00901, Orbi_VP5, Orbivirus outer capsid protein VP5.
           cryoelectron microscopy indicates that VP5 is a trimer
           implying that there are 360 copies of VP5 per virion.
          Length = 507

 Score = 28.5 bits (64), Expect = 6.0
 Identities = 28/90 (31%), Positives = 47/90 (52%), Gaps = 15/90 (16%)

Query: 11  KKKRIEKRNKEEE---KEENEQEEKEKEDVEE-----------EKNKKKKKKKKKKKKKK 56
           K K +E+  KEEE   K   +  EK  ED+EE           E+ ++K+ +  +K  K 
Sbjct: 92  KLKELEREQKEEEVREKHNKKIIEKFGEDLEEVYKFMKGEAKVEEEEEKQMEILEKALKS 151

Query: 57  KKKKKKKEEQSVKEIEESL-KERKEKTLTE 85
             K  K+E +S++ + ++L KE +E+T  E
Sbjct: 152 YLKIVKEENKSLQRLAKALQKESEERTQDE 181


>gnl|CDD|227480 COG5151, SSL1, RNA polymerase II transcription
          initiation/nucleotide excision repair factor TFIIH,
          subunit SSL1 [Transcription / DNA replication,
          recombination, and repair].
          Length = 421

 Score = 28.4 bits (63), Expect = 6.0
 Identities = 13/55 (23%), Positives = 24/55 (43%)

Query: 29 QEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTL 83
          Q+  + +  E E  +K  + K + +K     K    EQ  K   + + + KE +L
Sbjct: 5  QKSFDSDKSESEDEQKNGRVKVRSRKTDDMNKGYSWEQEYKRSWDDVNDDKEGSL 59


>gnl|CDD|227621 COG5305, COG5305, Predicted membrane protein [Function unknown].
          Length = 552

 Score = 28.5 bits (64), Expect = 6.1
 Identities = 19/123 (15%), Positives = 37/123 (30%), Gaps = 18/123 (14%)

Query: 171 WTVYTMMSNLAMCLLGWWRC-FANNYLQWAAVGTWILLNTQGWWRCFANNYLQWAAVGTW 229
               T++S     LL   R   +   L    +   +L+        +   +     +   
Sbjct: 173 AVATTLISAT--LLLRAIRLPTSRKLLPGWLIYALLLIL--SLLTHY--FFALTV-IAHG 225

Query: 230 ILLNTQGFISQGIGLNNKEYLIWYILFIVFVTYAMLPLPLKWCFIGGCTTSVLHVIITAQ 289
           + +     +   +      +LIW++L       A LP    W  +     +   V I   
Sbjct: 226 VYV----LLWASLKNRQSLFLIWWLLATAAGLLAFLP----WLLVYF--ANSFRVGIPDL 275

Query: 290 IKL 292
           I L
Sbjct: 276 ITL 278


>gnl|CDD|221473 pfam12230, PRP21_like_P, Pre-mRNA splicing factor PRP21 like
           protein.  This domain family is found in eukaryotes, and
           is typically between 212 and 238 amino acids in length.
           The family is found in association with pfam01805. There
           are two completely conserved residues (W and H) that may
           be functionally important. PRP21 is required for
           assembly of the prespliceosome and it interacts with U2
           snRNP and/or pre-mRNA in the prespliceosome. This family
           also contains proteins similar to PRP21, such as the
           mammalian SF3a. SF3a also interacts with U2 snRNP from
           the prespliceosome, converting it to its active form.
          Length = 230

 Score = 28.2 bits (63), Expect = 6.2
 Identities = 20/80 (25%), Positives = 32/80 (40%), Gaps = 11/80 (13%)

Query: 23  EKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKK-------KKKKEEQSVKEIEESL 75
             EE E+ E+E+ED++EEK ++  KK     K+ K         K +K+           
Sbjct: 104 APEEEEEMEEEEEDIDEEKKEESAKKSTPPIKEMKIPAAGESRLKIRKDYVPS---GNKP 160

Query: 76  KERKEKTLTEDSQ-CDTMIP 94
           +  K K           +IP
Sbjct: 161 RAEKPKKKMIKCPITGELIP 180



 Score = 27.4 bits (61), Expect = 9.5
 Identities = 16/57 (28%), Positives = 23/57 (40%), Gaps = 8/57 (14%)

Query: 19  NKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKK--------KKKKKKKKKEEQS 67
             EEE+E  E+EE   E+ +EE  KK     K+ K         K +K       + 
Sbjct: 104 APEEEEEMEEEEEDIDEEKKEESAKKSTPPIKEMKIPAAGESRLKIRKDYVPSGNKP 160


>gnl|CDD|236539 PRK09490, metH, B12-dependent methionine synthase; Provisional.
          Length = 1229

 Score = 28.6 bits (65), Expect = 6.2
 Identities = 27/97 (27%), Positives = 43/97 (44%), Gaps = 22/97 (22%)

Query: 267 LPLKWCFIGGCTTSVLHVIITAQIKLH---RGPLSPCSVHEMGALCLVYIGINSAGMYTK 323
           +PL    IGG TTS  H   TA +K+     GP     V+   A         + G+ + 
Sbjct: 833 IPL---LIGGATTSKAH---TA-VKIAPNYSGP----VVYVTDA-------SRAVGVVSS 874

Query: 324 YLTDRGQRRAFLETHRSTETRMKTQKENDQQEKLLLS 360
            L+D  QR A++   R+   +++ Q    +  K LL+
Sbjct: 875 LLSDE-QRDAYVAETRAEYEKVREQHARKKPRKPLLT 910


>gnl|CDD|220231 pfam09420, Nop16, Ribosome biogenesis protein Nop16.  Nop16 is a
          protein involved in ribosome biogenesis.
          Length = 173

 Score = 27.8 bits (62), Expect = 6.3
 Identities = 14/45 (31%), Positives = 23/45 (51%), Gaps = 2/45 (4%)

Query: 41 KNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTE 85
          KN+    K  +K+ K+K +KKK        I ++    K+KTL +
Sbjct: 6  KNRSSNYKVNRKRLKRKDRKKKINIIGNPIIAKNWD--KKKTLAQ 48



 Score = 27.4 bits (61), Expect = 8.1
 Identities = 8/24 (33%), Positives = 14/24 (58%)

Query: 41 KNKKKKKKKKKKKKKKKKKKKKKE 64
          K K +    K  +K+ K+K +KK+
Sbjct: 4  KKKNRSSNYKVNRKRLKRKDRKKK 27


>gnl|CDD|192987 pfam12325, TMF_TATA_bd, TATA element modulatory factor 1 TATA
          binding.  This is the C-terminal conserved coiled coil
          region of a family of TATA element modulatory factor 1
          proteins conserved in eukaryotes. The proteins bind to
          the TATA element of some RNA polymerase II promoters
          and repress their activity. by competing with the
          binding of TATA binding protein. TMF1_TATA_bd is the
          most conserved part of the TMFs. TMFs are
          evolutionarily conserved golgins that bind Rab6, a
          ubiquitous ras-like GTP-binding Golgi protein, and
          contribute to Golgi organisation in animal and plant
          cells. The Rab6-binding domain appears to be the same
          region as this C-terminal family.
          Length = 121

 Score = 27.2 bits (61), Expect = 6.3
 Identities = 17/73 (23%), Positives = 40/73 (54%), Gaps = 8/73 (10%)

Query: 18 RNKEEE----KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
          R  E E    K+E  + E E+++  +E  K  ++ ++ ++ KK+ ++ +KE + +++  E
Sbjct: 27 RRLEGELASLKDELARLEAERDEARQEIVKLTEENEELRELKKEIEELEKELEDLEQRYE 86

Query: 74 S----LKERKEKT 82
          +    L E+ E+ 
Sbjct: 87 TTLELLGEKSERV 99


>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein.  This family consists
           of several eukaryotic mitotic checkpoint (Mitotic arrest
           deficient or MAD) proteins. The mitotic spindle
           checkpoint monitors proper attachment of the bipolar
           spindle to the kinetochores of aligned sister chromatids
           and causes a cell cycle arrest in prometaphase when
           failures occur. Multiple components of the mitotic
           spindle checkpoint have been identified in yeast and
           higher eukaryotes. In S.cerevisiae, the existence of a
           Mad1-dependent complex containing Mad2, Mad3, Bub3 and
           Cdc20 has been demonstrated.
          Length = 722

 Score = 28.7 bits (64), Expect = 6.4
 Identities = 21/80 (26%), Positives = 39/80 (48%), Gaps = 1/80 (1%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ-SV 68
           ++   +E R K  E+ E + E +  E  EE K  K K   +  K + +K+ + KE + S+
Sbjct: 107 DRNLELEVRLKALEELEKKAENEAAEAEEEAKLLKDKLDAESLKLQNEKEDQLKEAKESI 166

Query: 69  KEIEESLKERKEKTLTEDSQ 88
             I+  L E + +    D++
Sbjct: 167 SRIKNDLSEMQCRAQNADTE 186



 Score = 28.4 bits (63), Expect = 7.3
 Identities = 19/86 (22%), Positives = 33/86 (38%), Gaps = 5/86 (5%)

Query: 9   GEKKKRIEKRNKEEEKEENEQEEKEKEDV---EEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
            EK   I    K+ E    + + + ++ V    E K K +  K   ++ +++     KE 
Sbjct: 356 KEKNGSISSSAKQLETTLQQLQLERQKAVSEILELKKKLEALKALVRRLQRRLTLVTKER 415

Query: 66  QSVKEIEESLKERKEKTLTEDSQCDT 91
             ++ I  S    KE T T  S    
Sbjct: 416 DGLRAILNSY--DKELTETSVSGQLM 439


>gnl|CDD|223436 COG0359, RplI, Ribosomal protein L9 [Translation, ribosomal
           structure and biogenesis].
          Length = 148

 Score = 27.6 bits (62), Expect = 6.4
 Identities = 15/72 (20%), Positives = 30/72 (41%), Gaps = 5/72 (6%)

Query: 11  KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ---- 66
           K   +       +  E  + + EK+  EE    +  K+K + K  +   K  ++ +    
Sbjct: 33  KGLAVPATKGNLKLLEARRAKLEKKAAEELAEAEALKEKLEGKTVEIAVKAGEDGKLFGS 92

Query: 67  -SVKEIEESLKE 77
            + K+I E+LK 
Sbjct: 93  VTSKDIAEALKA 104


>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein.  This domain is found
           in cell division proteins which are required for
           kinetochore-spindle association.
          Length = 312

 Score = 28.4 bits (64), Expect = 6.5
 Identities = 18/69 (26%), Positives = 32/69 (46%), Gaps = 3/69 (4%)

Query: 17  KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLK 76
            R KE+ K+   QE   K    EE  ++ ++ + K +    KK +   E  + E E+ L+
Sbjct: 207 DRAKEKLKKL-LQEIMIKVKKLEELEEELQELESKIEDLTNKKSELNTE--IAEAEKKLE 263

Query: 77  ERKEKTLTE 85
           + +  T  E
Sbjct: 264 QCRGFTFKE 272


>gnl|CDD|190741 pfam03763, Remorin_C, Remorin, C-terminal region.  Remorins are
          plant-specific plasma membrane-associated proteins. In
          tobacco remorin co-purifies with lipid rafts. Most
          remorins have a variable, proline-rich C-half and a
          more conserved N-half that is predicted to form coiled
          coils. Consistent with this, circular dichroism studies
          have demonstrated that much of the protein is
          alpha-helical. Remorins exist in plasma membrane
          preparations as oligomeric structures and form
          filaments in vitro. The proteins can bind polyanions
          including the extracellular matrix component
          oligogalacturonic acid (OGA). In vitro, remorin in
          plasma membrane preparations is phosphorylated
          (principally on threonine residues) in the presence of
          OGA and thus co-purifies with a protein kinases(s). The
          biological functions of remorins are unknown but roles
          as components of the membrane/cytoskeleton are
          possible.
          Length = 112

 Score = 26.9 bits (60), Expect = 6.7
 Identities = 17/55 (30%), Positives = 28/55 (50%)

Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
           +K +I  + + EE +    E K+K   E E  K ++K +KKK +  +K K K  
Sbjct: 16 AEKAKINNKYQREEAKIQAWENKKKAKAEAELKKIERKLEKKKAEAAEKLKNKLA 70


>gnl|CDD|212164 cd11650, AT4G37440_like, Uncharacterized protein domain conserved
           in plants.  This domain contains an extensive protein
           sequence fragment that appears conserved in a number of
           plant proteins, including the gene product of
           Arabidopsis thaliana locus AT4G37440, which has been
           identified in transcriptional profiling as expressed at
           different levels in white cabbage cultivars.
          Length = 253

 Score = 28.2 bits (63), Expect = 6.7
 Identities = 13/50 (26%), Positives = 27/50 (54%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
           ++     E+E+  Q E  K + E  K+     ++ ++K  K++K+K+ EE
Sbjct: 114 DRELALIEQEKELQLEGTKAEGENSKSAPFSGERHERKVMKRRKRKRVEE 163


>gnl|CDD|218871 pfam06046, Sec6, Exocyst complex component Sec6.  Sec6 is a
          component of the multiprotein exocyst complex. Sec6
          interacts with Sec8, Sec10 and Exo70.These exocyst
          proteins localise to regions of active exocytosis-at
          the growing ends of interphase cells and in the medial
          region of cells undergoing cytokinesis-in an
          F-actin-dependent and exocytosis- independent manner.
          Length = 557

 Score = 28.5 bits (64), Expect = 6.8
 Identities = 16/59 (27%), Positives = 30/59 (50%), Gaps = 3/59 (5%)

Query: 28 EQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK--KKKEEQSVKE-IEESLKERKEKTL 83
          E+EEKE E  E  +   K +      + + + +  K K   +++E +EE  +E +E+ L
Sbjct: 41 EREEKEDEKAEALQEATKDENNDDAGRFRGRPRGYKDKFLDALEESVEERFEECRERFL 99


>gnl|CDD|221756 pfam12757, DUF3812, Protein of unknown function (DUF3812).  This is
           a family of fungal proteins whose function is not known.
          Length = 126

 Score = 27.2 bits (61), Expect = 6.9
 Identities = 13/34 (38%), Positives = 22/34 (64%), Gaps = 2/34 (5%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKK 49
            +R ++EEK+    EE+ K   EE K ++++KKK
Sbjct: 95  AQRARDEEKK--LDEEEAKRQHEEAKEREREKKK 126



 Score = 26.9 bits (60), Expect = 9.7
 Identities = 10/37 (27%), Positives = 27/37 (72%)

Query: 27  NEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
           +E+ E ++   EE+K  +++ K++ ++ K+++++KKK
Sbjct: 90  DERAEAQRARDEEKKLDEEEAKRQHEEAKEREREKKK 126


>gnl|CDD|240256 PTZ00071, PTZ00071, 40S ribosomal protein S24; Provisional.
          Length = 132

 Score = 27.3 bits (61), Expect = 7.0
 Identities = 11/24 (45%), Positives = 16/24 (66%)

Query: 37  VEEEKNKKKKKKKKKKKKKKKKKK 60
            +E KN++KK + K+K K   KKK
Sbjct: 109 RKELKNRRKKVRGKEKTKVSGKKK 132



 Score = 26.9 bits (60), Expect = 8.7
 Identities = 10/23 (43%), Positives = 14/23 (60%)

Query: 41  KNKKKKKKKKKKKKKKKKKKKKK 63
           K  K ++KK + K+K K   KKK
Sbjct: 110 KELKNRRKKVRGKEKTKVSGKKK 132


>gnl|CDD|153340 cd07656, F-BAR_srGAP, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Slit-Robo GTPase
           Activating Proteins.  F-BAR domains are dimerization
           modules that bind and bend membranes and are found in
           proteins involved in membrane dynamics and actin
           reorganization. Slit-Robo GTPase Activating Proteins
           (srGAPs) are Rho GAPs that interact with Robo1, the
           transmembrane receptor of Slit proteins. Slit proteins
           are secreted proteins that control axon guidance and the
           migration of neurons and leukocytes. Vertebrates contain
           three isoforms of srGAPs, all of which are expressed
           during embryonic and early development in the nervous
           system but with different localization and timing.
           srGAPs contain an N-terminal F-BAR domain, a Rho GAP
           domain, and a C-terminal SH3 domain. F-BAR domains form
           banana-shaped dimers with a positively-charged concave
           surface that binds to negatively-charged lipid
           membranes. They can induce membrane deformation in the
           form of long tubules.
          Length = 241

 Score = 28.1 bits (63), Expect = 7.0
 Identities = 12/37 (32%), Positives = 25/37 (67%)

Query: 24  KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
           KE  +QEEK+++  E++  + +  KK +K+ +K++ K
Sbjct: 155 KEAEKQEEKQEQSPEKKLERSRSSKKIEKEVEKRQAK 191



 Score = 28.1 bits (63), Expect = 7.1
 Identities = 12/40 (30%), Positives = 29/40 (72%)

Query: 42  NKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
            K K+ +K+++K+++  +KK +  +S K+IE+ +++R+ K
Sbjct: 152 RKLKEAEKQEEKQEQSPEKKLERSRSSKKIEKEVEKRQAK 191


>gnl|CDD|237147 PRK12586, PRK12586, putative monovalent cation/H+ antiporter
           subunit G; Reviewed.
          Length = 145

 Score = 27.4 bits (61), Expect = 7.0
 Identities = 7/37 (18%), Positives = 16/37 (43%)

Query: 17  KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKK 53
           ++N             E+   E  + + KK+++ +KK
Sbjct: 104 RKNDAHTHASILLSSNEQNSTEALQLRAKKREEHRKK 140


>gnl|CDD|204935 pfam12474, PKK, Polo kinase kinase.  This domain family is found in
           eukaryotes, and is approximately 140 amino acids in
           length. The family is found in association with
           pfam00069. Polo-like kinase 1 (Plx1) is essential during
           mitosis for the activation of Cdc25C, for spindle
           assembly, and for cyclin B degradation. This family is
           Polo kinase kinase (PKK) which phosphorylates Polo
           kinase and Polo-like kinase to activate them. PKK is a
           serine/threonine kinase.
          Length = 142

 Score = 27.3 bits (61), Expect = 7.0
 Identities = 18/81 (22%), Positives = 44/81 (54%), Gaps = 1/81 (1%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
           E+++  E R    ++   EQ+ + K   E  K +KK+ K++ +K  + ++++KK  ++ K
Sbjct: 43  EQRQTQELRR-LPKRIRAEQKTRLKMFKESLKIEKKELKQEVEKLPRFQEQEKKRMKAEK 101

Query: 70  EIEESLKERKEKTLTEDSQCD 90
           E +E   +++E+      + +
Sbjct: 102 EEQEQKHQKQEREFLAKQEEN 122


>gnl|CDD|237799 PRK14715, PRK14715, DNA polymerase II large subunit; Provisional.
          Length = 1627

 Score = 28.7 bits (64), Expect = 7.1
 Identities = 16/48 (33%), Positives = 29/48 (60%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
           E + K+EEK+E + EE + E+V+EE  +++K    +  +K   +  KK
Sbjct: 278 ELKEKKEEKDEEKSEEVKTEEVDEEFEEEEKGFYYELYEKVNIEANKK 325


>gnl|CDD|113815 pfam05058, ActA, ActA Protein.  The ActA family is found in
           Listeria and is associated with motility. ActA protein
           acts as a scaffold to assemble and activate host cell
           actin cytoskeletal factors at the bacterial surface,
           resulting in directional actin polymerisation and
           propulsion of the bacterium through the cytoplasm of the
           host cell.
          Length = 601

 Score = 28.3 bits (62), Expect = 7.1
 Identities = 17/57 (29%), Positives = 31/57 (54%), Gaps = 3/57 (5%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKK---KKKKKKKKKEE 65
           +K   +R+KEE K + E++  E+ +   + N KK+    ++ K   K  + +K KEE
Sbjct: 513 QKEATERDKEEMKPQTEEKMVEESEPANDANGKKRSAGIEEGKLIAKSAEDEKAKEE 569


>gnl|CDD|185582 PTZ00373, PTZ00373, 60S Acidic ribosomal protein P2; Provisional.
          Length = 112

 Score = 27.2 bits (60), Expect = 7.1
 Identities = 9/17 (52%), Positives = 14/17 (82%)

Query: 20  KEEEKEENEQEEKEKED 36
            E +KEE ++EE+E+ED
Sbjct: 88  AEAKKEEKKEEEEEEED 104


>gnl|CDD|148820 pfam07432, Hc1, Histone H1-like protein Hc1.  This family
          consists of several bacterial histone H1-like Hc1
          proteins. In Chlamydia, Hc1 is expressed in the late
          stages of the life cycle, concomitant with the
          reorganisation of chlamydial reticulate bodies into
          elementary bodies. This suggests that Hc1 protein plays
          a role in the condensation of chromatin during
          intracellular differentiation.
          Length = 123

 Score = 27.0 bits (59), Expect = 7.2
 Identities = 14/37 (37%), Positives = 19/37 (51%)

Query: 34 KEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
          KE ++ EK+   KKK  KK  K  K  KK  +   K+
Sbjct: 50 KESIKAEKSGLMKKKPAKKAPKAAKAPKKAAKAPAKK 86


>gnl|CDD|235229 PRK04156, gltX, glutamyl-tRNA synthetase; Provisional.
          Length = 567

 Score = 28.3 bits (64), Expect = 7.2
 Identities = 11/37 (29%), Positives = 23/37 (62%), Gaps = 5/37 (13%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKE-----DVEEEK 41
           E+++R+E+   E  +EE E++E++K      + E+ K
Sbjct: 65  EQRERLEELAPELLEEEEEKKEEKKGLPPLPNAEKGK 101


>gnl|CDD|222559 pfam14126, DUF4293, Domain of unknown function (DUF4293).  This
           family of integral membrane proteins is functionally
           uncharacterized. This family of proteins is found in
           bacteria. Proteins in this family are typically between
           136 and 154 amino acids in length.
          Length = 111

 Score = 27.1 bits (61), Expect = 7.3
 Identities = 5/31 (16%), Positives = 13/31 (41%)

Query: 126 YSHRQRQKSLIIVNLVDMVLKAVLTVAWISY 156
           + +R+ Q  L  +N++  +    L   +   
Sbjct: 36  FKNRKLQIRLCRLNILLNLGLLGLFAYFFLN 66


>gnl|CDD|217930 pfam04152, Mre11_DNA_bind, Mre11 DNA-binding presumed domain.  The
           Mre11 complex is a multi-subunit nuclease that is
           composed of Mre11, Rad50 and Nbs1/Xrs2, and is involved
           in checkpoint signalling and DNA replication. Mre11 has
           an intrinsic DNA-binding activity that is stimulated by
           Rad50 on its own or in combination with Nbs1.
          Length = 166

 Score = 27.5 bits (62), Expect = 7.3
 Identities = 12/40 (30%), Positives = 20/40 (50%)

Query: 47  KKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTED 86
            KKKK+K K KK ++   ++      E L   K + L ++
Sbjct: 105 YKKKKRKTKTKKTEEDDPDEEELLPPEELDTLKVEDLVKE 144


>gnl|CDD|224811 COG1899, DYS1, Deoxyhypusine synthase [Posttranslational
           modification, protein turnover, chaperones].
          Length = 318

 Score = 28.0 bits (63), Expect = 7.4
 Identities = 12/41 (29%), Positives = 24/41 (58%), Gaps = 7/41 (17%)

Query: 105 IYCPSLTNSFREPSLEMSYQRYSHRQRQKSLIIVNLV-DMV 144
           I+CP++T+S    S+      + HR+ Q S + +++V D+ 
Sbjct: 187 IFCPAITDS----SIGD--MLWFHREEQGSDLKIDIVEDVH 221


>gnl|CDD|223624 COG0550, TopA, Topoisomerase IA [DNA replication, recombination,
           and repair].
          Length = 570

 Score = 28.3 bits (64), Expect = 7.4
 Identities = 17/52 (32%), Positives = 24/52 (46%), Gaps = 1/52 (1%)

Query: 11  KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
                  R  E E ++ E   K+K++ EE  NK K K  K    +KK KK+ 
Sbjct: 205 GGGEFSARLTEIEGKK-EGRLKDKDEAEEIVNKLKGKPAKVVSVEKKPKKRS 255



 Score = 28.0 bits (63), Expect = 9.0
 Identities = 11/42 (26%), Positives = 21/42 (50%)

Query: 38  EEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERK 79
            E + KK+ + K K + ++   K K +   V  +E+  K+R 
Sbjct: 214 TEIEGKKEGRLKDKDEAEEIVNKLKGKPAKVVSVEKKPKKRS 255


>gnl|CDD|232988 TIGR00468, pheS, phenylalanyl-tRNA synthetase, alpha subunit.
          Most phenylalanyl-tRNA synthetases are heterodimeric,
          with 2 alpha (pheS) and 2 beta (pheT) subunits. This
          model describes the alpha subunit, which shows some
          similarity to class II aminoacyl-tRNA ligases.
          Mitochondrial phenylalanyl-tRNA synthetase is a single
          polypeptide chain, active as a monomer, and similar to
          this chain rather than to the beta chain, but excluded
          from this model. An interesting feature of the
          alignment of all sequences captured by this model is a
          deep split between non-spirochete bacterial examples
          and all other examples; supporting this split is a
          relative deletion of about 50 residues in the former
          set between two motifs well conserved throughout the
          alignment [Protein synthesis, tRNA aminoacylation].
          Length = 293

 Score = 28.0 bits (63), Expect = 7.5
 Identities = 13/53 (24%), Positives = 23/53 (43%), Gaps = 1/53 (1%)

Query: 10 EKKKRIEKRNKEEEKEENEQEEKE-KEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
          +  K++ K  KEE K        E K ++++E  K K + +      K K + 
Sbjct: 4  DLLKQLGKLTKEETKPALGALINEVKIELQDELTKLKPELESAGLWSKLKFET 56


>gnl|CDD|187810 cd09679, Cas10_III, CRISPR/Cas system-associated protein Cas10.
           CRISPR (Clustered Regularly Interspaced Short
           Palindromic Repeats) and associated Cas proteins
           comprise a system for heritable host defense by
           prokaryotic cells against phage and other foreign DNA;
           Multidomain protein with permuted HD nuclease domain,
           palm domain and Zn-ribbon; MTH326-like has inactivated
           polymerase catalytic domain; alr1562 and slr7011 -
           predicted only on the basis of size, presence of HD
           domain, and location with RAMPs in one operon; signature
           gene for type III; also known as Crm2 family.
          Length = 475

 Score = 28.2 bits (63), Expect = 7.5
 Identities = 12/57 (21%), Positives = 25/57 (43%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKK 58
            K +    ++  +  KRN+EE + + E   +E  ++ E++ +      K KK     
Sbjct: 270 LKEKLVLIDEVIKRSKRNREEPRLKYENFREELRELLEKEPEIVYLFIKAKKNPGPY 326


>gnl|CDD|153342 cd07658, F-BAR_NOSTRIN, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Nitric Oxide Synthase
           TRaffic INducer (NOSTRIN).  F-BAR domains are
           dimerization modules that bind and bend membranes and
           are found in proteins involved in membrane dynamics and
           actin reorganization. Nitric Oxide Synthase TRaffic
           INducer (NOSTRIN) is expressed in endothelial and
           epithelial cells and is involved in the regulation,
           trafficking and targeting of endothelial NOS (eNOS).
           NOSTRIN facilitates the endocytosis of eNOS by
           coordinating the functions of dynamin and the
           Wiskott-Aldrich syndrome protein (WASP). Increased
           expression of NOSTRIN may be correlated to preeclampsia.
           NOSTRIN contains an N-terminal F-BAR domain and a
           C-terminal SH3 domain. F-BAR domains form banana-shaped
           dimers with a positively-charged concave surface that
           binds to negatively-charged lipid membranes. They can
           induce membrane deformation in the form of long tubules.
           The F-BAR domain of NOSTRIN is necessary and sufficient
           for its membrane association and is responsible for its
           subcellular localization.
          Length = 239

 Score = 27.7 bits (62), Expect = 7.5
 Identities = 17/51 (33%), Positives = 28/51 (54%)

Query: 26  ENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLK 76
           ENE+ + + ED ++   K+K   K KK  + + K+ +K E   K+ EES  
Sbjct: 138 ENEKLQDQVEDNKQSCTKQKMLNKLKKSAEVQDKEDEKLEAKRKKGEESRL 188


>gnl|CDD|100810 PRK01194, PRK01194, V-type ATP synthase subunit E; Provisional.
          Length = 185

 Score = 27.6 bits (61), Expect = 7.5
 Identities = 22/77 (28%), Positives = 39/77 (50%), Gaps = 3/77 (3%)

Query: 13 KRIEKRNKEEEKEENEQEEKEKEDVEEEKNKK---KKKKKKKKKKKKKKKKKKKEEQSVK 69
          K IEK  +E++KE N++  K  E +E+E + K    K+  +KK + +  + KK       
Sbjct: 8  KDIEKSREEKKKEINDEYSKRIEKLEKECDSKIQSIKEYYEKKMRAEISRLKKSIIDKAN 67

Query: 70 EIEESLKERKEKTLTED 86
              S+K  K + + +D
Sbjct: 68 IEARSIKREKRREILKD 84


>gnl|CDD|217636 pfam03606, DcuC, C4-dicarboxylate anaerobic carrier. 
          Length = 456

 Score = 28.0 bits (63), Expect = 7.6
 Identities = 7/34 (20%), Positives = 17/34 (50%)

Query: 13  KRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKK 46
           ++I+      EK+E+++   +K+    E   + K
Sbjct: 216 QKIKDDPVYSEKDESDEHFMDKQSEVTEIFSRPK 249


>gnl|CDD|216599 pfam01603, B56, Protein phosphatase 2A regulatory B subunit (B56
           family).  Protein phosphatase 2A (PP2A) is a major
           intracellular protein phosphatase that regulates
           multiple aspects of cell growth and metabolism. The
           ability of this widely distributed heterotrimeric enzyme
           to act on a diverse array of substrates is largely
           controlled by the nature of its regulatory B subunit.
           There are multiple families of B subunits (See also
           pfam01240), this family is called the B56 family.
          Length = 405

 Score = 28.0 bits (63), Expect = 7.7
 Identities = 13/28 (46%), Positives = 20/28 (71%)

Query: 35  EDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
            D+ +E   K K+K+ K+KKK+KK+KK 
Sbjct: 369 PDLFDECGNKYKEKEAKEKKKEKKRKKT 396


>gnl|CDD|149105 pfam07851, TMPIT, TMPIT-like protein.  A number of members of
          this family are annotated as being transmembrane
          proteins induced by tumour necrosis factor alpha, but
          no literature was found to support this.
          Length = 330

 Score = 28.2 bits (63), Expect = 7.7
 Identities = 18/58 (31%), Positives = 34/58 (58%), Gaps = 2/58 (3%)

Query: 24 KEENEQEEKEKEDVEEEKNKKKKKKKKKKK--KKKKKKKKKKEEQSVKEIEESLKERK 79
          K++ E+ EK +E       +++K+ K+     KK KK    ++ + V+++EE +KERK
Sbjct: 24 KQKLEEVEKLQEQCTSAIARQRKRLKQLIVSLKKLKKSLTPEDSELVEQLEEQIKERK 81


>gnl|CDD|119001 pfam10481, Cenp-F_N, Cenp-F N-terminal domain.  Mitosin or
          centromere-associated protein-F (Cenp-F) is found bound
          across the centromere as one of the proteins of the
          outer layer of the kinetochore. Most of the
          kinetochore/centromere functions appear to depend upon
          binding of the C-terminal par to f the molecule,
          whereas the N-terminal part, here, may be a cytoplasmic
          player in controlling the function of microtubules and
          dynein.
          Length = 288

 Score = 27.8 bits (61), Expect = 7.7
 Identities = 17/61 (27%), Positives = 36/61 (59%), Gaps = 3/61 (4%)

Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
          ++ K+E +Q + + + +E      +K+K+K + +K +    K+E Q + EI E L++ K+
Sbjct: 28 DKLKKEKQQRQFQLDSLEA---ALQKQKQKVENEKTEGANLKRENQRLMEICEHLEKTKQ 84

Query: 81 K 81
          K
Sbjct: 85 K 85


>gnl|CDD|235751 PRK06241, PRK06241, phosphoenolpyruvate synthase; Validated.
          Length = 871

 Score = 28.3 bits (64), Expect = 7.8
 Identities = 9/37 (24%), Positives = 18/37 (48%)

Query: 23  EKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKK 59
           E+   E  EKE+E +   +     ++K K+ K+   +
Sbjct: 625 EQGRQEALEKEEELLSRLQQLPDGEQKAKETKRMISR 661


>gnl|CDD|225087 COG2176, PolC, DNA polymerase III, alpha subunit (gram-positive type)
            [DNA replication, recombination, and repair].
          Length = 1444

 Score = 28.4 bits (64), Expect = 7.8
 Identities = 11/35 (31%), Positives = 17/35 (48%)

Query: 44   KKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKER 78
            K K ++  K+K  K   K+K   +V EI   +  R
Sbjct: 1321 KAKMEEINKRKGNKASPKEKNLLTVLEIVLEMLAR 1355



 Score = 28.1 bits (63), Expect = 9.0
 Identities = 14/64 (21%), Positives = 28/64 (43%), Gaps = 2/64 (3%)

Query: 12  KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKK--KKKKKKKKKEEQSVK 69
            +  E    EE   E  ++  ++    E+K K +  K +K K     +K +K K  + +K
Sbjct: 171 SEEQEFEKFEEAINEEVEKAAQEALEAEKKLKAESPKVEKPKPLFDGQKGRKIKSTEEIK 230

Query: 70  EIEE 73
            + +
Sbjct: 231 PLIK 234


>gnl|CDD|234252 TIGR03545, TIGR03545, TIGR03545 family protein.  This model
           represents a relatively rare but broadly distributed
           uncharacterized protein family, distributed in 1-2
           percent of bacterial genomes, all of which have outer
           membranes. In many of these genomes, it is part of a
           two-gene pair.
          Length = 555

 Score = 28.1 bits (63), Expect = 7.9
 Identities = 17/54 (31%), Positives = 27/54 (50%), Gaps = 1/54 (1%)

Query: 32  KEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTE 85
           K  E  EE +   K  ++K KK+KK    K+  E+  K +E  +K++  K   E
Sbjct: 161 KTVETAEEIEKSLKAMQQKWKKRKKDLPNKQDLEEYKKRLEA-IKKKDIKNPLE 213


>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and
          autophagy-related subunit 14.  The Atg14 or Apg14
          proteins are hydrophilic proteins with a predicted
          molecular mass of 40.5 kDa, and have a coiled-coil
          motif at the N terminus region. Yeast cells with mutant
          Atg14 are defective not only in autophagy but also in
          sorting of carboxypeptidase Y (CPY), a vacuolar-soluble
          hydrolase, to the vacuole. Subcellular fractionation
          indicate that Apg14p and Apg6p are peripherally
          associated with a membrane structure(s). Apg14p was
          co-immunoprecipitated with Apg6p, suggesting that they
          form a stable protein complex. These results imply that
          Apg6/Vps30p has two distinct functions: in the
          autophagic process and in the vacuolar protein sorting
          pathway. Apg14p may be a component specifically
          required for the function of Apg6/Vps30p through the
          autophagic pathway. There are 17 auto-phagosomal
          component proteins which are categorized into six
          functional units, one of which is the AS-PI3K complex
          (Vps30/Atg6 and Atg14). The AS-PI3K complex and the
          Atg2-Atg18 complex are essential for nucleation, and
          the specific function of the AS-PI3K apparently is to
          produce phosphatidylinositol 3-phosphate (PtdIns(3)P)
          at the pre-autophagosomal structure (PAS). The
          localisation of this complex at the PAS is controlled
          by Atg14. Autophagy mediates the cellular response to
          nutrient deprivation, protein aggregation, and pathogen
          invasion in humans, and malfunction of autophagy has
          been implicated in multiple human diseases including
          cancer. This effect seems to be mediated through direct
          interaction of the human Atg14 with Beclin 1 in the
          human phosphatidylinositol 3-kinase class III complex.
          Length = 307

 Score = 28.1 bits (63), Expect = 8.0
 Identities = 15/74 (20%), Positives = 34/74 (45%), Gaps = 4/74 (5%)

Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK----KKKEEQS 67
          + R++      E EE +Q+ +E  +    ++ K      K +  +KK++    + +  Q 
Sbjct: 24 ELRLDLARLLLENEELKQKVEEALEGATNEDGKLAADLLKLEVARKKERLNQIRARISQL 83

Query: 68 VKEIEESLKERKEK 81
           +EIE+  +  +E 
Sbjct: 84 KEEIEQKRERIEEL 97


>gnl|CDD|204614 pfam11221, Med21, Subunit 21 of Mediator complex.  Med21 has been
           known as Srb7 in yeasts, hSrb7 in humans and Trap 19 in
           Drosophila. The heterodimer of the two subunits Med7 and
           Med21 appears to act as a hinge between the middle and
           the tail regions of Mediator.
          Length = 132

 Score = 27.2 bits (61), Expect = 8.0
 Identities = 10/23 (43%), Positives = 14/23 (60%)

Query: 25  EENEQEEKEKEDVEEEKNKKKKK 47
           EE  + E E+E+  +EK K  KK
Sbjct: 99  EELREVEAEREEAVKEKEKLLKK 121


>gnl|CDD|177283 PHA00451, PHA00451, protein kinase.
          Length = 362

 Score = 28.2 bits (63), Expect = 8.1
 Identities = 11/49 (22%), Positives = 22/49 (44%)

Query: 32  KEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
           K K D        +K   K++K+ +K + +  +E +   +E +   R E
Sbjct: 253 KAKRDPNGTFQVARKAAMKRRKRNRKLRARNAKELAAMRMEANQIRRNE 301


>gnl|CDD|221324 pfam11933, DUF3451, Domain of unknown function (DUF3451).  This
          presumed domain is functionally uncharacterized. This
          domain is found in eukaryotes. This domain is typically
          between 199 to 238 amino acids in length. This domain
          is found associated with pfam06512, pfam00520. This
          domain has a conserved ADD sequence motif.
          Length = 222

 Score = 27.8 bits (62), Expect = 8.2
 Identities = 6/33 (18%), Positives = 15/33 (45%)

Query: 46 KKKKKKKKKKKKKKKKKKEEQSVKEIEESLKER 78
          +  K+++ +  K KK++    S +   E   + 
Sbjct: 9  ESAKERRNRNDKNKKEEHSIGSEEGDSEKEPKS 41


>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812).  This
           family consists of several eukaryotic proteins of
           unknown function.
          Length = 536

 Score = 28.2 bits (63), Expect = 8.2
 Identities = 14/72 (19%), Positives = 32/72 (44%), Gaps = 4/72 (5%)

Query: 21  EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK----KKEEQSVKEIEESLK 76
           EE   E +    + + V EE    +++ ++ +++ K KK+        E +V +++  + 
Sbjct: 287 EEVLSEIKALASKIKQVNEELTTVRQENEELEEEYKIKKRTVELLPDAENNVAKLQALVV 346

Query: 77  ERKEKTLTEDSQ 88
              E+ L    Q
Sbjct: 347 ASSERLLELAQQ 358


>gnl|CDD|220614 pfam10174, Cast, RIM-binding protein of the cytomatrix active zone.
            This is a family of proteins that form part of the CAZ
           (cytomatrix at the active zone) complex which is
           involved in determining the site of synaptic vesicle
           fusion. The C-terminus is a PDZ-binding motif that binds
           directly to RIM (a small G protein Rab-3A effector). The
           family also contains four coiled-coil domains.
          Length = 774

 Score = 28.1 bits (62), Expect = 8.3
 Identities = 22/84 (26%), Positives = 42/84 (50%), Gaps = 2/84 (2%)

Query: 2   RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
           R R   AG + + +++     EK E E+++ E E     K  +K ++   K+++K +  +
Sbjct: 559 RSRADSAGARNE-VDRLLDRLEKAEQERDDTEMEAGRLAKELEKAQRHLTKQQEKTEATR 617

Query: 62  KKEEQSVKEIEESLKERKEKTLTE 85
            + E+   E+ E   ER EK+  E
Sbjct: 618 IEFERKSAELLEEA-ERLEKSEAE 640


>gnl|CDD|227708 COG5421, COG5421, Transposase [DNA replication, recombination, and
           repair].
          Length = 480

 Score = 28.2 bits (63), Expect = 8.3
 Identities = 18/73 (24%), Positives = 31/73 (42%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
            K  K+EE+    + EKE E  E+   K K ++   +K  +   +K  ++ S  E  E  
Sbjct: 276 YKLQKKEEQTLRTRIEKELEKAEKSLEKLKGREFNCEKDARIAAEKILKDYSSVEFLEVD 335

Query: 76  KERKEKTLTEDSQ 88
            + K+K       
Sbjct: 336 FKSKKKREEAKRG 348


>gnl|CDD|227481 COG5152, COG5152, Uncharacterized conserved protein, contains RING
           and CCCH-type Zn-fingers [General function prediction
           only].
          Length = 259

 Score = 27.7 bits (61), Expect = 8.4
 Identities = 22/95 (23%), Positives = 44/95 (46%), Gaps = 8/95 (8%)

Query: 2   RKRRGGAGEK-KKRIEKRNKEEEKEENEQEEKEKED---VEEEKNKKKKKKKKKKKKKKK 57
              +  + EK +K+ +K N  EEK     EEK   D   + +  N +  +   + + K +
Sbjct: 6   LVNKSSSDEKNQKKRQKINFSEEKLVASDEEKGSSDLMSLAKSGNSRTLQLSHENEGKLQ 65

Query: 58  KKKKKKEEQSVKEIEESLKE----RKEKTLTEDSQ 88
           KK +  ++ ++   ++S KE     + K L E ++
Sbjct: 66  KKGEDLDKYTLTVNDDSTKEDLLNFERKELAEKAK 100


>gnl|CDD|215581 PLN03109, PLN03109, ETHYLENE-INSENSITIVE3-like3 protein;
          Provisional.
          Length = 599

 Score = 28.3 bits (63), Expect = 8.5
 Identities = 16/50 (32%), Positives = 29/50 (58%)

Query: 30 EEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERK 79
          EE E ED+E    K + K K+ K+++KK ++   E+   K+I +  + +K
Sbjct: 35 EEIEAEDLERRMWKDRIKLKRIKERQKKLQQAALEKSKPKKISDQARRKK 84


>gnl|CDD|240416 PTZ00432, PTZ00432, falcilysin; Provisional.
          Length = 1119

 Score = 28.2 bits (63), Expect = 8.5
 Identities = 16/69 (23%), Positives = 33/69 (47%), Gaps = 5/69 (7%)

Query: 21  EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES-----L 75
           E  K E E  +  K++++E  +   K++  + +K  +K KK++E     E  +S     L
Sbjct: 575 ESSKYEKEFNKLVKDELKERLSHLTKEQVDEMEKAYEKFKKEREADDDPEHLDSFPILSL 634

Query: 76  KERKEKTLT 84
            +  ++T  
Sbjct: 635 SDLNKETEE 643


>gnl|CDD|218585 pfam05424, Duffy_binding, Duffy binding domain.  This domain is
           found in Plasmodium Duffy binding proteins. Plasmodium
           vivax and Plasmodium knowlesi merozoites invade human
           erythrocytes that express Duffy blood group surface
           determinants. The Duffy receptor family is localised in
           micronemes, an organelle found in all organisms of the
           phylum Apicomplexa.
          Length = 276

 Score = 27.7 bits (62), Expect = 8.6
 Identities = 22/71 (30%), Positives = 40/71 (56%), Gaps = 7/71 (9%)

Query: 10  EKKKRIEK-----RNKEEEKEENEQEEKEKEDVEEEKNKKKKKKK--KKKKKKKKKKKKK 62
           E+KK++EK     + K+  K  +++  K K   ++ K     KKK   K+KKK KK KK+
Sbjct: 200 ERKKKLEKLKKKCKEKKCNKYCSKKCTKCKNACKKYKKWIDNKKKEWDKQKKKYKKYKKR 259

Query: 63  KEEQSVKEIEE 73
             +++  +I++
Sbjct: 260 NNKKNYSDIKD 270


>gnl|CDD|131526 TIGR02473, flagell_FliJ, flagellar export protein FliJ.  Members of
           this family are the FliJ protein found, in nearly every
           case, in the midst of other flagellar biosynthesis genes
           in bacgterial genomes. Typically the fliJ gene is found
           adjacent to the gene for the flagellum-specific ATPase
           FliI. Sequence scoring in the gray zone between trusted
           and noise cutoffs include both probable FliJ proteins
           and components of bacterial type III secretion systems.
          Length = 141

 Score = 27.3 bits (61), Expect = 8.7
 Identities = 17/66 (25%), Positives = 41/66 (62%), Gaps = 5/66 (7%)

Query: 15  IEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES 74
           I + ++  ++++ E    ++E VE ++ +  + +++ K  +K K+KK+KE ++    EE+
Sbjct: 70  IRQLDQRIQQQQQELALLQQE-VEAKRERLLEARRELKALEKLKEKKQKEYRA----EEA 124

Query: 75  LKERKE 80
            +E+KE
Sbjct: 125 KREQKE 130


>gnl|CDD|218140 pfam04547, Anoctamin, Calcium-activated chloride channel.  The
           family carries eight putative transmembrane domains,
           and, although it has no similarity to other known
           channel proteins, it is clearly a calcium-activated
           ionic channel. It is expressed in various secretory
           epithelia, the retina and sensory neurons, and mediates
           receptor-activated chloride currents in diverse
           physiological processes.
          Length = 449

 Score = 27.9 bits (63), Expect = 8.7
 Identities = 16/30 (53%), Positives = 23/30 (76%)

Query: 44  KKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
           KKK +KKK KKKK+K+KK +E + ++ IE 
Sbjct: 270 KKKIRKKKIKKKKEKEKKPEENEILERIEN 299


>gnl|CDD|129130 TIGR00019, prfA, peptide chain release factor 1.  This model
          describes peptide chain release factor 1 (PrfA, RF-1),
          and excludes the related peptide chain release factor 2
          (PrfB, RF-2). RF-1 helps recognize and terminate
          translation at UAA and UAG stop codons. The
          mitochondrial release factors are prfA-like, although
          not included above the trusted cutoff for this model.
          RF-1 does not have a translational frameshift [Protein
          synthesis, Translation factors].
          Length = 360

 Score = 28.1 bits (63), Expect = 8.8
 Identities = 14/80 (17%), Positives = 39/80 (48%), Gaps = 2/80 (2%)

Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
          K   +EK     E+ E  +      +V  +++K +K  K+  + ++     ++ +Q+ ++
Sbjct: 2  KPSLLEKLESLLERYEELEALLSDPEVISDQDKLRKLSKEYSQLEEIVDCYREYQQAQED 61

Query: 71 IEES--LKERKEKTLTEDSQ 88
          I+E+  + E  +  + E ++
Sbjct: 62 IKEAKEILEESDPEMREMAK 81


>gnl|CDD|219405 pfam07418, PCEMA1, Acidic phosphoprotein precursor PCEMA1.  This
           family consists of several acidic phosphoprotein
           precursor PCEMA1 sequences which appear to be found
           exclusively in Plasmodium chabaudi. PCEMA1 is an antigen
           that is associated with the membrane of the infected
           erythrocyte throughout the entire intraerythrocytic
           cycle. The exact function of this family is unclear.
          Length = 286

 Score = 27.9 bits (62), Expect = 8.9
 Identities = 15/71 (21%), Positives = 32/71 (45%)

Query: 16  EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
           +  N+E + E   +     +D E +   K+K +++ ++ K  K    + E   +  +E  
Sbjct: 212 DGENEENDDELEAEVISYLKDGENDNEVKEKIRREYREWKGDKANTNETEIEDESEDEYE 271

Query: 76  KERKEKTLTED 86
           +E  E+   ED
Sbjct: 272 EEAGEEQENED 282


>gnl|CDD|110537 pfam01540, Lipoprotein_7, Adhesin lipoprotein.  This family
           consists of the p50 and variable adherence-associated
           antigen (Vaa) adhesins from Mycoplasma hominis. M.
           hominis is a mycoplasma associated with human urogenital
           diseases, pneumonia, and septic arthritis. An adhesin is
           a cell surface molecule that mediates adhesion to other
           cells or to the surrounding surface or substrate. The
           Vaa antigen is a 50-kDa surface lipoprotein that has
           four tandem repetitive DNA sequences encoding a periodic
           peptide structure, and is highly immunogenic in the
           human host. p50 is also a 50-kDa lipoprotein, having
           three repeats A,B and C, that may be a tetramer of
           191-kDa in its native environment.
          Length = 353

 Score = 27.7 bits (61), Expect = 9.1
 Identities = 22/91 (24%), Positives = 41/91 (45%)

Query: 8   AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
           A  +KK I+ +N +  KE  E+  K + ++EE K    +  +  K   K   + +K+ + 
Sbjct: 102 AKNEKKAIDDKNAQIAKELAERNAKIQSNIEELKKINNEAFELSKTVNKTIAEVEKKFKI 161

Query: 68  VKEIEESLKERKEKTLTEDSQCDTMIPVFKK 98
            K+ +E L+   +  L +  Q D    V   
Sbjct: 162 PKDFKEQLENFADDLLDKSRQIDEFTTVTST 192


>gnl|CDD|233294 TIGR01159, DRP1, density-regulated protein DRP1.  This protein
          family shows weak but suggestive similarity to
          translation initiation factor SUI1 and its prokaryotic
          homologs.
          Length = 173

 Score = 27.5 bits (61), Expect = 9.1
 Identities = 12/57 (21%), Positives = 26/57 (45%), Gaps = 5/57 (8%)

Query: 20 KEEEKEENEQE-----EKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
          K    +    E       E ++ + EK+    +K ++ ++  KKKKK  ++ ++K  
Sbjct: 40 KLYGTDSPTAEAEAVTIGEAQEEKGEKDLLGIQKAQEAREGGKKKKKLPQKVTIKRE 96


>gnl|CDD|224143 COG1222, RPT1, ATP-dependent 26S proteasome regulatory subunit
          [Posttranslational modification, protein turnover,
          chaperones].
          Length = 406

 Score = 28.0 bits (63), Expect = 9.2
 Identities = 10/64 (15%), Positives = 30/64 (46%)

Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
          +    I    +  E +E   + ++ +    EK K+    ++++ + +  + K++ ++  +
Sbjct: 4  DALDEILGDLESYEPQEYLNKLEDTKLKLLEKEKRLLLLEEQRLEAEGLRLKREVDRLRE 63

Query: 70 EIEE 73
          EIE 
Sbjct: 64 EIER 67


>gnl|CDD|240247 PTZ00055, PTZ00055, glutathione synthetase; Provisional.
          Length = 619

 Score = 28.2 bits (63), Expect = 9.2
 Identities = 8/40 (20%), Positives = 16/40 (40%), Gaps = 1/40 (2%)

Query: 246 NKEYLIWYILFIVFVTYAMLPLPL-KWCFIGGCTTSVLHV 284
             + L      +  V++ + PLP  +      C  ++L V
Sbjct: 64  KAKQLEANDGILKMVSFVLFPLPFPRKLLEDCCLCTLLLV 103


>gnl|CDD|153366 cd07682, F-BAR_srGAP2, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Slit-Robo GTPase
           Activating Protein 2.  F-BAR domains are dimerization
           modules that bind and bend membranes and are found in
           proteins involved in membrane dynamics and actin
           reorganization. Slit-Robo GTPase Activating Proteins
           (srGAPs) are Rho GAPs that interact with Robo1, the
           transmembrane receptor of Slit proteins. Slit proteins
           are secreted proteins that control axon guidance and the
           migration of neurons and leukocytes. Vertebrates contain
           three isoforms of srGAPs. srGAP2 is expressed in zones
           of neuronal differentiation. It plays a role in the
           regeneration of neurons and axons. srGAP2 contains an
           N-terminal F-BAR domain, a Rho GAP domain, and a
           C-terminal SH3 domain. F-BAR domains form banana-shaped
           dimers with a positively-charged concave surface that
           binds to negatively-charged lipid membranes. They can
           induce membrane deformation in the form of long tubules.
          Length = 263

 Score = 27.7 bits (61), Expect = 9.4
 Identities = 18/74 (24%), Positives = 35/74 (47%)

Query: 8   AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
           A  K K  EK+ +++      QE+++     +     + ++K  ++   KK +K KE++ 
Sbjct: 152 AQSKLKEAEKQEEKQMSRSVRQEDRQTPRSPDSTTNIRIEEKHVRRSSVKKIEKMKEKRQ 211

Query: 68  VKEIEESLKERKEK 81
            K  E  LK  K +
Sbjct: 212 AKYTENKLKAIKAR 225


>gnl|CDD|235860 PRK06778, PRK06778, hypothetical protein; Validated.
          Length = 289

 Score = 27.6 bits (61), Expect = 9.5
 Identities = 10/48 (20%), Positives = 19/48 (39%)

Query: 46  KKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDTMI 93
           K       K       ++ E+  +++ E     K+K+ TE  +  T I
Sbjct: 74  KISPSHPPKPATVAVPEETEKKARDVNEKTALLKKKSATELGELATSI 121


>gnl|CDD|176058 cd08676, C2A_Munc13-like, C2 domain first repeat in Munc13
           (mammalian uncoordinated)-like proteins.  C2-like
           domains are thought to be involved in phospholipid
           binding in a Ca2+ independent manner in both Unc13 and
           Munc13. Caenorabditis elegans Unc13 has a central domain
           with sequence similarity to PKC, which includes C1 and
           C2-related domains. Unc13 binds phorbol esters and DAG
           with high affinity in a phospholipid manner.  Mutations
           in Unc13 results in abnormal neuronal connections and
           impairment in cholinergic neurotransmission in the
           nematode.  Munc13 is the mammalian homolog which are
           expressed in the brain.  There are 3 isoforms (Munc13-1,
           -2, -3) and are thought to play a role in
           neurotransmitter release and are hypothesized to be
           high-affinity receptors for phorbol esters.  Unc13 and
           Munc13 contain both C1 and C2 domains.  There are two C2
           related domains present, one central and one at the
           carboxyl end.  Munc13-1 contains a third C2-like domain.
            Munc13 interacts with syntaxin, synaptobrevin, and
           synaptotagmin suggesting a role for these as scaffolding
           proteins. C2 domains fold into an 8-standed
           beta-sandwich that can adopt 2 structural arrangements:
           Type I and Type II, distinguished by a circular
           permutation involving their N- and C-terminal beta
           strands. Many C2 domains are Ca2+-dependent
           membrane-targeting modules that bind a wide variety of
           substances including bind phospholipids, inositol
           polyphosphates, and intracellular proteins.  Most C2
           domain proteins are either signal transduction enzymes
           that contain a single C2 domain, such as protein kinase
           C, or membrane trafficking proteins which contain at
           least two C2 domains, such as synaptotagmin 1.  However,
           there are a few exceptions to this including RIM
           isoforms and some splice variants of piccolo/aczonin and
           intersectin which only have a single C2 domain.  C2
           domains with a calcium binding region have negatively
           charged residues, primarily aspartates, that serve as
           ligands for calcium ions. This cd contains the second C2
           repeat, C2B, and has a type-II topology.
          Length = 153

 Score = 27.0 bits (60), Expect = 9.6
 Identities = 13/46 (28%), Positives = 24/46 (52%), Gaps = 2/46 (4%)

Query: 39  EEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLT 84
              ++++  +K KK+K  +KK   K+    K I+ +  E K +TL 
Sbjct: 58  VPASRERNSEKSKKRKSHRKKAVLKDTVPAKSIKVT--EVKPQTLN 101


>gnl|CDD|221525 pfam12314, IMCp, Inner membrane complex protein.  This domain is
          found in bacteria and eukaryotes. This domain is about
          120 amino acids in length. This family is the inner
          membrane complex of parasitic organisms. This is a
          cytoskeletal structure associated with the pellicle of
          these parasites.
          Length = 98

 Score = 26.5 bits (59), Expect = 9.8
 Identities = 17/76 (22%), Positives = 28/76 (36%), Gaps = 13/76 (17%)

Query: 11 KKKRIEKRNKEEEKEENEQEEKEKE-----------DVEE--EKNKKKKKKKKKKKKKKK 57
           K  I+++ KE  K   +  EK  E           +V E   K K     K + K++  
Sbjct: 1  PKPVIQEKIKEVPKYIVKYVEKIVEVPEVKYVDKIVEVPEYVYKEKIIYVPKVEIKERII 60

Query: 58 KKKKKKEEQSVKEIEE 73
             K   ++  K +E 
Sbjct: 61 PVPKIVTKEIEKYVEV 76


>gnl|CDD|150240 pfam09495, DUF2462, Protein of unknown function (DUF2462).  This
          protein is highly conserved, but its function is
          unknown. It can be isolated from HeLa cell nucleoli and
          is found to be homologous with Leydig cell tumour
          protein whose function is unknown.
          Length = 82

 Score = 26.2 bits (58), Expect = 9.8
 Identities = 13/48 (27%), Positives = 21/48 (43%), Gaps = 3/48 (6%)

Query: 41 KNKKKKKKKKKKKK---KKKKKKKKKEEQSVKEIEESLKERKEKTLTE 85
            KKK+K  KK      K KK +  + ++  K +   L +  EK +  
Sbjct: 18 AGKKKQKGPKKAGPRVIKPKKAQAVQAQKLKKVLTSGLNKSTEKLIAS 65


>gnl|CDD|219555 pfam07753, DUF1609, Protein of unknown function (DUF1609).  This
          region is found in a number of hypothetical proteins
          thought to be expressed by the eukaryote
          Encephalitozoon cuniculi, an obligate intracellular
          microsporidial parasite. It is approximately 200
          residues long.
          Length = 230

 Score = 27.4 bits (61), Expect = 9.8
 Identities = 19/71 (26%), Positives = 36/71 (50%)

Query: 6  GGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
          GGA  KKK  +K++K         +   +     EK K++  K  ++K K +  ++ KE+
Sbjct: 18 GGARAKKKGGKKKSKGGRHCYKIHKRVLRWRKSPEKIKEELDKGSEEKWKGRSIEEIKEQ 77

Query: 66 QSVKEIEESLK 76
          + + +I E L+
Sbjct: 78 KVLHDITEVLE 88


>gnl|CDD|216689 pfam01765, RRF, Ribosome recycling factor.  The ribosome recycling
           factor (RRF / ribosome release factor) dissociates the
           ribosome from the mRNA after termination of translation,
           and is essential bacterial growth. Thus ribosomes are
           "recycled" and ready for another round of protein
           synthesis.
          Length = 165

 Score = 27.1 bits (61), Expect = 10.0
 Identities = 16/48 (33%), Positives = 29/48 (60%), Gaps = 7/48 (14%)

Query: 42  NKKKKKKKKKKKKKKKKKKKKKEE------QSVKEIEESLKERKEKTL 83
           N K KK +K K+  + + K+ ++E      + +K+I+E LK+ KEK +
Sbjct: 118 NDKLKKLEKDKEISEDEVKRAEKEIQKLTDKYIKKIDELLKK-KEKEI 164


>gnl|CDD|218612 pfam05501, DUF755, Domain of unknown function (DUF755).  This
           family is predominated by ORFs from Circoviridae. The
           function of this family remains to be determined.
          Length = 122

 Score = 26.6 bits (59), Expect = 10.0
 Identities = 9/27 (33%), Positives = 13/27 (48%)

Query: 41  KNKKKKKKKKKKKKKKKKKKKKKEEQS 67
           K K+KK +    +  KK   KK+   S
Sbjct: 79  KKKRKKSRHVSSRSAKKISAKKRRRSS 105


>gnl|CDD|240380 PTZ00362, PTZ00362, hypothetical protein; Provisional.
          Length = 479

 Score = 27.8 bits (62), Expect = 10.0
 Identities = 14/42 (33%), Positives = 20/42 (47%)

Query: 46 KKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDS 87
          KKKKKKKKK  K+     +         +L  RK++   E +
Sbjct: 9  KKKKKKKKKASKESSNGSQNSKKTIFMYNLSRRKKRHNEEGT 50


>gnl|CDD|148511 pfam06936, Selenoprotein_S, Selenoprotein S (SelS).  This family
           consists of several mammalian selenoprotein S (SelS)
           sequences. SelS is a plasma membrane protein and is
           present in a variety of tissues and cell types. The
           function of this family is unknown.
          Length = 190

 Score = 27.2 bits (60), Expect = 10.0
 Identities = 13/56 (23%), Positives = 31/56 (55%)

Query: 10  EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
             +K I   + ++     E  E  +  ++E+ + K  K K+K+K+ +++K+++K E
Sbjct: 65  SSEKEITVFDPDKVVRRQEALEAARLRMQEDLDAKAAKFKEKQKQLEEEKRRQKIE 120


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.318    0.132    0.400 

Gapped
Lambda     K      H
   0.267   0.0812    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 18,104,481
Number of extensions: 1786278
Number of successful extensions: 29912
Number of sequences better than 10.0: 1
Number of HSP's gapped: 17711
Number of HSP's successfully gapped: 2924
Length of query: 362
Length of database: 10,937,602
Length adjustment: 98
Effective length of query: 264
Effective length of database: 6,590,910
Effective search space: 1740000240
Effective search space used: 1740000240
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 60 (26.7 bits)