RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy6892
         (190 letters)



>gnl|CDD|215646 pfam00001, 7tm_1, 7 transmembrane receptor (rhodopsin family).
          This family contains, amongst other G-protein-coupled
          receptors (GCPRs), members of the opsin family, which
          have been considered to be typical members of the
          rhodopsin superfamily. They share several motifs,
          mainly the seven transmembrane helices, GCPRs of the
          rhodopsin superfamily. All opsins bind a chromophore,
          such as 11-cis-retinal. The function of most opsins
          other than the photoisomerases is split into two steps:
          light absorption and G-protein activation.
          Photoisomerases, on the other hand, are not coupled to
          G-proteins - they are thought to generate and supply
          the chromophore that is used by visual opsins.
          Length = 251

 Score = 63.1 bits (154), Expect = 2e-12
 Identities = 15/40 (37%), Positives = 24/40 (60%)

Query: 37 LAISWEKRLQNVTNYFLMSLAITDLMVAVLVMPLGILTLV 76
          L I   K+L+  TN FL++LA+ DL+  + + P  +  LV
Sbjct: 1  LVILRTKKLRTPTNIFLLNLAVADLLFLLTLPPWALYYLV 40


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 47.7 bits (113), Expect = 1e-06
 Identities = 24/37 (64%), Positives = 29/37 (78%)

Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEK 139
           G  D    EEEEEEEEEEEEEEE+E++E  EE+ENE+
Sbjct: 856 GGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892



 Score = 43.8 bits (103), Expect = 2e-05
 Identities = 22/36 (61%), Positives = 27/36 (75%)

Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEE 146
           + EEEEEEEEEEEEE+E++E  EE+E E E  L  E
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEEPLSLE 897



 Score = 42.7 bits (100), Expect = 5e-05
 Identities = 23/69 (33%), Positives = 40/69 (57%)

Query: 104 ARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ 163
           A D   ++E  E+E   E + E +QDE   +     +    EE+E  E+EE++EEEE+++
Sbjct: 823 ADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEE 882

Query: 164 GEEKRVEDK 172
            EE+  E++
Sbjct: 883 EEEEEEENE 891



 Score = 41.9 bits (98), Expect = 9e-05
 Identities = 20/37 (54%), Positives = 26/37 (70%)

Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERR 142
           +E   EEEEEEEEEEEEEEE+E +E L  +  E  ++
Sbjct: 868 EEEEEEEEEEEEEEEEEEEEEENEEPLSLEWPETRQK 904



 Score = 41.9 bits (98), Expect = 1e-04
 Identities = 22/44 (50%), Positives = 30/44 (68%)

Query: 113 EEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKK 156
           + EEEEEEEEEEE+E++E  EE+E E+E   P   E  E  +K+
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEEPLSLEWPETRQKQ 905



 Score = 40.4 bits (94), Expect = 3e-04
 Identities = 23/72 (31%), Positives = 39/72 (54%)

Query: 96  ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK 155
           E Q      +DET  +E   E + E +++E+  D     D  + E    EE+E  E+EE+
Sbjct: 820 ETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEE 879

Query: 156 KEEEEDKQGEEK 167
           +EEEE+++ E +
Sbjct: 880 EEEEEEEEEENE 891



 Score = 40.4 bits (94), Expect = 3e-04
 Identities = 20/36 (55%), Positives = 25/36 (69%)

Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKER 141
           +E   EEEEEEEEEEEEEEE E+   LE  E  +++
Sbjct: 870 EEEEEEEEEEEEEEEEEEEENEEPLSLEWPETRQKQ 905



 Score = 39.6 bits (92), Expect = 6e-04
 Identities = 20/52 (38%), Positives = 29/52 (55%)

Query: 89  LKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKE 140
           L      E +  + G       +  + EEEEEEEEEE+E++E  EE+E E+E
Sbjct: 837 LNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEE 888



 Score = 38.4 bits (89), Expect = 0.001
 Identities = 25/84 (29%), Positives = 45/84 (53%), Gaps = 4/84 (4%)

Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGED----EEKKE 157
           AG +DE   + E + ++ E ++E  EQ+   E     K+     +   G D    EE++E
Sbjct: 809 AGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEE 868

Query: 158 EEEDKQGEEKRVEDKIGDEKRLEE 181
           EEE+++ EE+  E++  +E+  EE
Sbjct: 869 EEEEEEEEEEEEEEEEEEEEENEE 892



 Score = 38.1 bits (88), Expect = 0.002
 Identities = 24/94 (25%), Positives = 47/94 (50%), Gaps = 12/94 (12%)

Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDE--------- 153
           G  +     E  E++E E + E Q  D  ++++  E+E     + E  +DE         
Sbjct: 799 GKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGS 858

Query: 154 ---EKKEEEEDKQGEEKRVEDKIGDEKRLEENKK 184
              + +EEEE+++ EE+  E++  +E+  EEN++
Sbjct: 859 DGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892



 Score = 35.7 bits (82), Expect = 0.013
 Identities = 25/81 (30%), Positives = 38/81 (46%), Gaps = 4/81 (4%)

Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKE----EEEDK 162
           E   +E + + E E EE E E +   E  E+E E    EE E  EDE + E     E + 
Sbjct: 698 EIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVET 757

Query: 163 QGEEKRVEDKIGDEKRLEENK 183
           +G+ K  E +   E   +E++
Sbjct: 758 EGDRKETEHEGETEAEGKEDE 778



 Score = 35.4 bits (81), Expect = 0.014
 Identities = 22/69 (31%), Positives = 31/69 (44%)

Query: 104 ARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ 163
           A +     E E E  E+E E E  ++    EDE E E     E E   D ++ E E + +
Sbjct: 712 AEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETE 771

Query: 164 GEEKRVEDK 172
            E K  ED+
Sbjct: 772 AEGKEDEDE 780



 Score = 35.4 bits (81), Expect = 0.014
 Identities = 25/73 (34%), Positives = 33/73 (45%)

Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKR 168
             EE EE E+E E E E + +   E D  E E     E E  EDE++ E +  + GE K 
Sbjct: 734 TGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDGEMKG 793

Query: 169 VEDKIGDEKRLEE 181
            E   G  +   E
Sbjct: 794 DEGAEGKVEHEGE 806



 Score = 35.4 bits (81), Expect = 0.015
 Identities = 27/87 (31%), Positives = 38/87 (43%), Gaps = 4/87 (4%)

Query: 101 KAGARDETAREEEEEEEEEEEEEEEQE-QDEYLEEDENEKERRLPEEDERGEDEE---KK 156
           + G       E+E E E + E E E E   E   E E E E    E D +GE E    + 
Sbjct: 658 ENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEH 717

Query: 157 EEEEDKQGEEKRVEDKIGDEKRLEENK 183
           E E + +G E   E + G+E    E++
Sbjct: 718 EGETEAEGTEDEGEIETGEEGEEVEDE 744



 Score = 35.0 bits (80), Expect = 0.020
 Identities = 22/61 (36%), Positives = 29/61 (47%), Gaps = 2/61 (3%)

Query: 112 EEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVED 171
           E + E+E E E E +E D   E +  E E     E E  EDE + E  E+  GEE   E 
Sbjct: 688 ERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEE--GEEVEDEG 745

Query: 172 K 172
           +
Sbjct: 746 E 746



 Score = 35.0 bits (80), Expect = 0.021
 Identities = 23/96 (23%), Positives = 43/96 (44%), Gaps = 7/96 (7%)

Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
           +AG   E   +E  E + E E E E  + +  E     +      +DE GE E   E + 
Sbjct: 784 QAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQG 843

Query: 161 DKQGEEKRVEDKIG-------DEKRLEENKKKKTQK 189
           + + +EK V+   G       +E+  EE ++++ ++
Sbjct: 844 EAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEE 879



 Score = 34.6 bits (79), Expect = 0.028
 Identities = 18/68 (26%), Positives = 30/68 (44%)

Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDK 162
           G   E +  E E+E E E + E + + E   E + E+E     E +  + + + E EE +
Sbjct: 657 GENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVE 716

Query: 163 QGEEKRVE 170
              E   E
Sbjct: 717 HEGETEAE 724



 Score = 34.2 bits (78), Expect = 0.036
 Identities = 18/83 (21%), Positives = 34/83 (40%), Gaps = 3/83 (3%)

Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
           + T  E E   E E E  EE   +   E +   K     E +   E + ++E E + + +
Sbjct: 643 ERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAK 702

Query: 166 EKRVEDKIGDEKRLEENKKKKTQ 188
           E   +   G+ +  E   + +T+
Sbjct: 703 EADHK---GETEAEEVEHEGETE 722



 Score = 33.8 bits (77), Expect = 0.054
 Identities = 30/88 (34%), Positives = 40/88 (45%), Gaps = 5/88 (5%)

Query: 96  ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDE-- 153
           E+   + G   E   E E E  EE   E EQE  E   + ENE E  +P E  +GE E  
Sbjct: 639 EHTGERTGEEGERPTEAEGENGEESGGEAEQE-GETETKGENESEGEIPAE-RKGEQEGE 696

Query: 154 -EKKEEEEDKQGEEKRVEDKIGDEKRLE 180
            E + +E D +GE +  E +   E   E
Sbjct: 697 GEIEAKEADHKGETEAEEVEHEGETEAE 724



 Score = 33.0 bits (75), Expect = 0.081
 Identities = 18/73 (24%), Positives = 32/73 (43%)

Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKR 168
               EE   E E+E E + + E   E E   ER+  +E E   + ++ + + + + EE  
Sbjct: 657 GENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVE 716

Query: 169 VEDKIGDEKRLEE 181
            E +   E   +E
Sbjct: 717 HEGETEAEGTEDE 729



 Score = 32.7 bits (74), Expect = 0.11
 Identities = 22/84 (26%), Positives = 46/84 (54%), Gaps = 3/84 (3%)

Query: 103 GARDETAREEEEEEEEEEEEEE---EQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEE 159
           G R ET  E E E E +E+E+E   +  +D  ++ DE  + +   E +    ++++ E +
Sbjct: 759 GDRKETEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQ 818

Query: 160 EDKQGEEKRVEDKIGDEKRLEENK 183
            + Q ++  V+D+ G+++   EN+
Sbjct: 819 SETQADDTEVKDETGEQELNAENQ 842



 Score = 32.7 bits (74), Expect = 0.11
 Identities = 22/72 (30%), Positives = 33/72 (45%)

Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEED 161
           A   +     E E  E+E E E  +E +E  +E E E E +   E E    E + E E +
Sbjct: 712 AEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETE 771

Query: 162 KQGEEKRVEDKI 173
            +G+E   E +I
Sbjct: 772 AEGKEDEDEGEI 783



 Score = 32.7 bits (74), Expect = 0.13
 Identities = 20/84 (23%), Positives = 39/84 (46%), Gaps = 3/84 (3%)

Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKK---E 157
           +     E   +E+E E +  E+ E +  +    + E+E E    E+DE     E +    
Sbjct: 767 EGETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDT 826

Query: 158 EEEDKQGEEKRVEDKIGDEKRLEE 181
           E +D+ GE++   +  G+ K+ E+
Sbjct: 827 EVKDETGEQELNAENQGEAKQDEK 850



 Score = 32.3 bits (73), Expect = 0.17
 Identities = 20/87 (22%), Positives = 43/87 (49%), Gaps = 1/87 (1%)

Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDE-RGEDEEKKEEEED 161
           G  +   + E E E + +E E E E +   +EDE+E E +  E+ E +G++  + + E +
Sbjct: 745 GEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHE 804

Query: 162 KQGEEKRVEDKIGDEKRLEENKKKKTQ 188
            + E    ++  G  +   ++ + K +
Sbjct: 805 GETEAGEKDEHEGQSETQADDTEVKDE 831



 Score = 31.5 bits (71), Expect = 0.29
 Identities = 23/88 (26%), Positives = 38/88 (43%), Gaps = 4/88 (4%)

Query: 103 GARDETAREEEEEE---EEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEE 159
           G  +     E E E   E + E+E E E +    + + E E    E +   E E  ++E 
Sbjct: 671 GETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEG 730

Query: 160 EDKQGEE-KRVEDKIGDEKRLEENKKKK 186
           E + GEE + VED+   E   +   + +
Sbjct: 731 EIETGEEGEEVEDEGEGEAEGKHEVETE 758



 Score = 30.0 bits (67), Expect = 0.78
 Identities = 26/86 (30%), Positives = 38/86 (44%), Gaps = 4/86 (4%)

Query: 101 KAGARDETAREEEEEEEE-EEEEEEEQEQDEYLEEDENEKERRLPE---EDERGEDEEKK 156
           + G   E   +E E E E + E E E ++ E   E E E E +  E   E + GED E K
Sbjct: 733 ETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDGEMK 792

Query: 157 EEEEDKQGEEKRVEDKIGDEKRLEEN 182
            +E  +   E   E + G++   E  
Sbjct: 793 GDEGAEGKVEHEGETEAGEKDEHEGQ 818



 Score = 29.6 bits (66), Expect = 1.3
 Identities = 26/89 (29%), Positives = 37/89 (41%), Gaps = 10/89 (11%)

Query: 103 GARDET-AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDE-RGEDEEKKEEEE 160
           G   E     E   EE E   E E E  E     E+  E     E E +GE+E + E   
Sbjct: 633 GDVAEAEHTGERTGEEGERPTEAEGENGE-----ESGGEAEQEGETETKGENESEGEIPA 687

Query: 161 DKQGE---EKRVEDKIGDEKRLEENKKKK 186
           +++GE   E  +E K  D K   E ++ +
Sbjct: 688 ERKGEQEGEGEIEAKEADHKGETEAEEVE 716


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 43.8 bits (104), Expect = 2e-05
 Identities = 24/72 (33%), Positives = 47/72 (65%)

Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
               E+EEEEE+  ++E+E++ D+ LEE+E + +    EEDE  ED + +++EE+++ E+
Sbjct: 317 GQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEK 376

Query: 167 KRVEDKIGDEKR 178
           ++ + K  +  R
Sbjct: 377 EKKKKKSAESTR 388



 Score = 38.4 bits (90), Expect = 0.001
 Identities = 21/76 (27%), Positives = 42/76 (55%)

Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVE 170
             + EE+EEEEE+   ++DE  ++D+ E+E    +  +  EDEE ++ +++   EE+  E
Sbjct: 316 LGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEE 375

Query: 171 DKIGDEKRLEENKKKK 186
            +   +K  E  + + 
Sbjct: 376 KEKKKKKSAESTRSEL 391



 Score = 38.4 bits (90), Expect = 0.001
 Identities = 18/64 (28%), Positives = 42/64 (65%)

Query: 104 ARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ 163
             ++   EE+  ++E+EE++++  ++E  + D +++E    +ED   ED+E++EEEE ++
Sbjct: 319 GEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEK 378

Query: 164 GEEK 167
            ++K
Sbjct: 379 KKKK 382



 Score = 36.5 bits (85), Expect = 0.005
 Identities = 17/68 (25%), Positives = 44/68 (64%)

Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
           +E    ++E+EE+++++ EE+E+D  L ++E ++E    ++++  E+EE+++E++ K+  
Sbjct: 325 EEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSA 384

Query: 166 EKRVEDKI 173
           E    +  
Sbjct: 385 ESTRSELP 392



 Score = 35.7 bits (83), Expect = 0.009
 Identities = 18/67 (26%), Positives = 45/67 (67%), Gaps = 4/67 (5%)

Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
           +    +E   +  ++E+EE+++++ +E++E ++  + E++    EEDE  +DE+ +EEEE
Sbjct: 318 QGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEED----EEDEDSDDEDDEEEEE 373

Query: 161 DKQGEEK 167
           +++ ++K
Sbjct: 374 EEKEKKK 380



 Score = 35.7 bits (83), Expect = 0.010
 Identities = 25/86 (29%), Positives = 44/86 (51%), Gaps = 10/86 (11%)

Query: 106 DETAREEEE-----EEE-----EEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK 155
           +E A+EE E     E E       EEE++E+E+D     D+ + E    ++D  G  + +
Sbjct: 261 EELAKEEAERLKKLEAERLRRMRGEEEDDEEEEDSKESADDLDDEFEPDDDDNFGLGQGE 320

Query: 156 KEEEEDKQGEEKRVEDKIGDEKRLEE 181
           ++EEE++ G +   E+   D+   EE
Sbjct: 321 EDEEEEEDGVDDEDEEDDDDDLEEEE 346



 Score = 34.6 bits (80), Expect = 0.027
 Identities = 31/93 (33%), Positives = 53/93 (56%), Gaps = 14/93 (15%)

Query: 111 EEEEEEEEEEEEEEEQEQDEYL------------EEDENEKERRLPEEDER--GEDEEKK 156
           EE++EEEE+ +E  +   DE+             EEDE E+E  + +EDE    +D E++
Sbjct: 286 EEDDEEEEDSKESADDLDDEFEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEE 345

Query: 157 EEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQK 189
           EE+ D   EE+  ED+  D++  EE ++++ +K
Sbjct: 346 EEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEK 378



 Score = 34.2 bits (79), Expect = 0.033
 Identities = 19/52 (36%), Positives = 30/52 (57%)

Query: 121 EEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDK 172
           EEE  +E+ E L++ E E+ RR+  E+E  E+EE  +E  D   +E   +D 
Sbjct: 260 EEELAKEEAERLKKLEAERLRRMRGEEEDDEEEEDSKESADDLDDEFEPDDD 311



 Score = 33.4 bits (77), Expect = 0.066
 Identities = 18/69 (26%), Positives = 41/69 (59%)

Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQG 164
            +E     ++E+EE+++++ E+E+++    DE E E     +DE  E+EE++E+E+ K+ 
Sbjct: 323 EEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKK 382

Query: 165 EEKRVEDKI 173
             +    ++
Sbjct: 383 SAESTRSEL 391



 Score = 33.0 bits (76), Expect = 0.091
 Identities = 21/80 (26%), Positives = 42/80 (52%), Gaps = 8/80 (10%)

Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVE 170
            + EE+EEEEE+  + E +E  ++D         EE+E   D   +EE+E+ +  +   +
Sbjct: 317 GQGEEDEEEEEDGVDDEDEEDDDDDL--------EEEEEDVDLSDEEEDEEDEDSDDEDD 368

Query: 171 DKIGDEKRLEENKKKKTQKR 190
           ++  +E++ ++ KK     R
Sbjct: 369 EEEEEEEKEKKKKKSAESTR 388



 Score = 32.3 bits (74), Expect = 0.15
 Identities = 20/85 (23%), Positives = 49/85 (57%), Gaps = 3/85 (3%)

Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
           D+    ++++     + EE++E++E   +DE+E++    ++D   E+E+    +E++  E
Sbjct: 303 DDEFEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDD---DDDLEEEEEDVDLSDEEEDEE 359

Query: 166 EKRVEDKIGDEKRLEENKKKKTQKR 190
           ++  +D+  +E+  EE +KKK +  
Sbjct: 360 DEDSDDEDDEEEEEEEKEKKKKKSA 384



 Score = 26.9 bits (60), Expect = 8.1
 Identities = 12/40 (30%), Positives = 27/40 (67%)

Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKE 140
           +    D    +EEE+EE+E+ ++E+ E++E  E+++ +K+
Sbjct: 343 EEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKK 382


>gnl|CDD|217829 pfam03985, Paf1, Paf1.  Members of this family are components of
           the RNA polymerase II associated Paf1 complex. The Paf1
           complex functions during the elongation phase of
           transcription in conjunction with Spt4-Spt5 and
           Spt16-Pob3i.
          Length = 431

 Score = 43.6 bits (103), Expect = 2e-05
 Identities = 19/80 (23%), Positives = 37/80 (46%)

Query: 96  ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK 155
                K  AR +    EE +E+E+EEEE+  ++ E  E +++E+E     ED   E    
Sbjct: 352 SKMRDKRRARLDPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSD 411

Query: 156 KEEEEDKQGEEKRVEDKIGD 175
              + + + +++   D   +
Sbjct: 412 VGSDSESKADKESASDSDSE 431



 Score = 35.5 bits (82), Expect = 0.013
 Identities = 20/74 (27%), Positives = 33/74 (44%), Gaps = 14/74 (18%)

Query: 112 EEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVED 171
           + EE +E+E+EEEEQ  DE               E+E GED E++  +  + G  +   D
Sbjct: 366 DFEEVDEDEDEEEEQRSDE--------------HEEEEGEDSEEEGSQSREDGSSESSSD 411

Query: 172 KIGDEKRLEENKKK 185
              D +   + +  
Sbjct: 412 VGSDSESKADKESA 425



 Score = 31.2 bits (71), Expect = 0.29
 Identities = 13/72 (18%), Positives = 27/72 (37%), Gaps = 4/72 (5%)

Query: 123 EEEQEQDEY----LEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKR 178
            +E +  +     L+  + E+     +E+E    +E +EEE +   EE     + G  + 
Sbjct: 349 TKESKMRDKRRARLDPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSES 408

Query: 179 LEENKKKKTQKR 190
             +       K 
Sbjct: 409 SSDVGSDSESKA 420



 Score = 30.1 bits (68), Expect = 0.68
 Identities = 20/79 (25%), Positives = 30/79 (37%), Gaps = 19/79 (24%)

Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVE 170
           + EE +E+E+EEEE++  +   EE E+                    EEE  Q  E    
Sbjct: 366 DFEEVDEDEDEEEEQRSDEHEEEEGEDS-------------------EEEGSQSREDGSS 406

Query: 171 DKIGDEKRLEENKKKKTQK 189
           +   D     E+K  K   
Sbjct: 407 ESSSDVGSDSESKADKESA 425



 Score = 28.9 bits (65), Expect = 1.8
 Identities = 14/75 (18%), Positives = 24/75 (32%)

Query: 77  RVRTNLSGAVLRLKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDE 136
           + R  L             E +  ++   +E   E+ EEE  +  E+   E    +  D 
Sbjct: 357 KRRARLDPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDS 416

Query: 137 NEKERRLPEEDERGE 151
             K  +    D   E
Sbjct: 417 ESKADKESASDSDSE 431


>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
           Provisional.
          Length = 2849

 Score = 43.5 bits (102), Expect = 3e-05
 Identities = 31/110 (28%), Positives = 54/110 (49%), Gaps = 23/110 (20%)

Query: 72  ILTLVRVRTNLSGAVLRLKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEY 131
           I+   R   +L+   +  + N + ++        DE   E++++EE++EEEEEE+E+ + 
Sbjct: 130 IIIKRRRARHLAEEDMSPRDNFVIDDD-------DEDEDEDDDDEEDDEEEEEEEEEIKG 182

Query: 132 LEEDENEKERRLPEEDERGEDEEKKEEEEDKQ--------GEEKRVEDKI 173
            ++++        EEDE GED   ++ E DK         G    V DKI
Sbjct: 183 FDDED--------EEDEGGEDFTYEKSEVDKTDCFKFIEAGAGDDVFDKI 224



 Score = 33.5 bits (76), Expect = 0.057
 Identities = 16/48 (33%), Positives = 31/48 (64%)

Query: 133 EEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLE 180
           EED + ++  + ++D+  EDE+  +EE+D++ EE+  E K  D++  E
Sbjct: 142 EEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEE 189



 Score = 33.5 bits (76), Expect = 0.069
 Identities = 21/67 (31%), Positives = 43/67 (64%), Gaps = 1/67 (1%)

Query: 121 EEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLE 180
            EE+   +D ++ +D++E E    ++DE  ++EE++EEEE K  +++  ED+ G++   E
Sbjct: 141 AEEDMSPRDNFVIDDDDEDED-EDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGEDFTYE 199

Query: 181 ENKKKKT 187
           +++  KT
Sbjct: 200 KSEVDKT 206



 Score = 30.8 bits (69), Expect = 0.50
 Identities = 22/77 (28%), Positives = 38/77 (49%), Gaps = 11/77 (14%)

Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKK------EEEEDK 162
           AR   EE+    +     + DE  +ED++++     E+DE  E+EE++      E+EED+
Sbjct: 137 ARHLAEEDMSPRDNFVIDDDDEDEDEDDDDE-----EDDEEEEEEEEEIKGFDDEDEEDE 191

Query: 163 QGEEKRVEDKIGDEKRL 179
            GE+   E    D+   
Sbjct: 192 GGEDFTYEKSEVDKTDC 208


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 42.6 bits (100), Expect = 5e-05
 Identities = 21/90 (23%), Positives = 41/90 (45%), Gaps = 1/90 (1%)

Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
           K G  D+    + ++ ++E  EE+          D  E+E +L  E     + E+ E+ E
Sbjct: 262 KRGGDDDADEYDSDDGDDEGREEDYISDSSASGNDPEEREDKLSPEIPAKPEIEQDEDSE 321

Query: 161 DKQGEEKRVEDKIGDE-KRLEENKKKKTQK 189
           + + E+   E  +  + K+L++ K KK   
Sbjct: 322 ESEEEKNEEEGGLSKKGKKLKKLKGKKNGL 351



 Score = 35.7 bits (82), Expect = 0.010
 Identities = 17/81 (20%), Positives = 36/81 (44%)

Query: 110 REEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRV 169
           R  +++ +E + ++ + E  E     ++      PEE E     E   + E +Q E+   
Sbjct: 263 RGGDDDADEYDSDDGDDEGREEDYISDSSASGNDPEEREDKLSPEIPAKPEIEQDEDSEE 322

Query: 170 EDKIGDEKRLEENKKKKTQKR 190
            ++  +E+    +KK K  K+
Sbjct: 323 SEEEKNEEEGGLSKKGKKLKK 343



 Score = 34.2 bits (78), Expect = 0.036
 Identities = 19/85 (22%), Positives = 42/85 (49%), Gaps = 4/85 (4%)

Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
            E   + E E++E+ EE EE++ +E  E   ++K ++L +   +    +K + +     +
Sbjct: 306 PEIPAKPEIEQDEDSEESEEEKNEE--EGGLSKKGKKLKKLKGKKNGLDKDDSDSGDDSD 363

Query: 166 EKRVEDKIGDEKRLEENKKKKTQKR 190
           +  ++ +  D   L   KK+K  K+
Sbjct: 364 DSDIDGE--DSVSLVTAKKQKEPKK 386



 Score = 33.0 bits (75), Expect = 0.085
 Identities = 13/71 (18%), Positives = 27/71 (38%)

Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
              +  + EE E++   E   + +   +ED  E E    EE+     + KK ++   +  
Sbjct: 290 SSASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKN 349

Query: 166 EKRVEDKIGDE 176
               +D    +
Sbjct: 350 GLDKDDSDSGD 360



 Score = 31.5 bits (71), Expect = 0.23
 Identities = 19/88 (21%), Positives = 45/88 (51%), Gaps = 3/88 (3%)

Query: 105 RDETAREEEE---EEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEED 161
           +D    +E++    ++  E+ +EE+ + +  +  +N+K+    ++ +RG D++  E + D
Sbjct: 216 KDLEGDDEDDGDESDKGGEDGDEEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDADEYDSD 275

Query: 162 KQGEEKRVEDKIGDEKRLEENKKKKTQK 189
              +E R ED I D      + +++  K
Sbjct: 276 DGDDEGREEDYISDSSASGNDPEEREDK 303



 Score = 26.5 bits (58), Expect = 9.6
 Identities = 16/67 (23%), Positives = 33/67 (49%)

Query: 120 EEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRL 179
           +E+EE + +       ++ K + L  +DE   DE  K  E+  + + K+ + K+   K+ 
Sbjct: 195 DEDEETEGEKGGGGRGKDLKIKDLEGDDEDDGDESDKGGEDGDEEKSKKKKKKLAKNKKK 254

Query: 180 EENKKKK 186
            ++ KK 
Sbjct: 255 LDDDKKG 261


>gnl|CDD|237063 PRK12329, nusA, transcription elongation factor NusA; Provisional.
          Length = 449

 Score = 42.8 bits (101), Expect = 5e-05
 Identities = 22/60 (36%), Positives = 34/60 (56%)

Query: 108 TAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEK 167
           + REEEE  + E EE  E EQ E  EED   +E     EDE  +++E +E + +++ E +
Sbjct: 390 SQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDELEEAQPEEEEEAR 449



 Score = 34.0 bits (78), Expect = 0.040
 Identities = 20/73 (27%), Positives = 33/73 (45%)

Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
           D+ A + +  E   + EEEE  Q E  E  E E+  R  E+    E     E+E + + E
Sbjct: 377 DQEAEDAKVAELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDE 436

Query: 166 EKRVEDKIGDEKR 178
            +  + +  +E R
Sbjct: 437 LEEAQPEEEEEAR 449



 Score = 33.6 bits (77), Expect = 0.043
 Identities = 21/76 (27%), Positives = 37/76 (48%), Gaps = 1/76 (1%)

Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLP-EEDERGEDEEKKEEEEDKQGEEKRV 169
            E ++E E+ +  E   Q E  E  + E E RL  E+ ER E++ +  E      +E   
Sbjct: 374 AEYDQEAEDAKVAELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFED 433

Query: 170 EDKIGDEKRLEENKKK 185
           ED++ + +  EE + +
Sbjct: 434 EDELEEAQPEEEEEAR 449


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
           Fas-binding protein) is thought to play a role in
           apoptosis, but precise role played by Daxx remains to be
           determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 42.6 bits (100), Expect = 5e-05
 Identities = 33/81 (40%), Positives = 44/81 (54%), Gaps = 2/81 (2%)

Query: 96  ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK 155
           E+    +   +E    EEEEEEEEEEEEEEQE +E  EE E+E+E    E D   E+E +
Sbjct: 430 ESPSMASQESEEEESVEEEEEEEEEEEEEEQESEE--EEGEDEEEEEEVEADNGSEEEME 487

Query: 156 KEEEEDKQGEEKRVEDKIGDE 176
              E D  GEE   + +  + 
Sbjct: 488 GSSEGDGDGEEPEEDAERRNS 508



 Score = 40.7 bits (95), Expect = 3e-04
 Identities = 29/90 (32%), Positives = 45/90 (50%), Gaps = 5/90 (5%)

Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLP-----EEDERGEDEEK 155
            A    E     EEEEEEEEEEEEE+++ E  E ++ E+E  +      EE+  G  E  
Sbjct: 434 MASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGD 493

Query: 156 KEEEEDKQGEEKRVEDKIGDEKRLEENKKK 185
            + EE ++  E+R  +  G  +  E  + +
Sbjct: 494 GDGEEPEEDAERRNSEMAGISRMSEGQQPR 523



 Score = 39.5 bits (92), Expect = 5e-04
 Identities = 24/95 (25%), Positives = 45/95 (47%), Gaps = 2/95 (2%)

Query: 95  CENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEE 154
            +     + + +  +   +E EEEE  EEEE+E++E  EE++  +E    +E+E  E E 
Sbjct: 419 SDPSKASSTSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEA 478

Query: 155 KKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQK 189
               EE+ +G  +   D  G+E   +  ++     
Sbjct: 479 DNGSEEEMEGSSEGDGD--GEEPEEDAERRNSEMA 511



 Score = 39.1 bits (91), Expect = 7e-04
 Identities = 23/78 (29%), Positives = 40/78 (51%)

Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
           +E   EEEEE+E EEEE E++E++E +E D   +E      +  G+ EE +E+ E +  E
Sbjct: 450 EEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSE 509

Query: 166 EKRVEDKIGDEKRLEENK 183
              +      ++    + 
Sbjct: 510 MAGISRMSEGQQPRGSSV 527



 Score = 28.7 bits (64), Expect = 2.1
 Identities = 23/97 (23%), Positives = 39/97 (40%), Gaps = 11/97 (11%)

Query: 96  ENQLFKAGARDETAREEEEEE--EEEEEEEEEQEQDEYLEEDENEKER---------RLP 144
           E Q  +    ++   EEE E     EEE E   E D   EE E + ER         R+ 
Sbjct: 458 EEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEMAGISRMS 517

Query: 145 EEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEE 181
           E  +      + E  +++  + + ++ +   E+  EE
Sbjct: 518 EGQQPRGSSVQPESPQEEPLQPESMDAESVGEESDEE 554


>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein.  CDC45 is an essential gene
           required for initiation of DNA replication in S.
           cerevisiae, forming a complex with MCM5/CDC46.
           Homologues of CDC45 have been identified in human, mouse
           and smut fungus among others.
          Length = 583

 Score = 41.9 bits (99), Expect = 1e-04
 Identities = 16/80 (20%), Positives = 36/80 (45%), Gaps = 4/80 (5%)

Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVE 170
           + EEE ++E   ++     E  ++D+ E +    EEDE     E  E+++D   ++    
Sbjct: 105 DIEEELQDEPRYDDAYRDLEEDDDDDEESD----EEDEESSKSEDDEDDDDDDDDDDIAT 160

Query: 171 DKIGDEKRLEENKKKKTQKR 190
            +   E+R    + ++ +  
Sbjct: 161 RERSLERRRRRREWEEKRAE 180



 Score = 40.7 bits (96), Expect = 2e-04
 Identities = 22/90 (24%), Positives = 45/90 (50%), Gaps = 5/90 (5%)

Query: 92  NILCENQ--LFKAGARDETAREEEEEEEEEEEEEEEQEQDEYL-EEDENEKERRLPEEDE 148
           N+   +Q  +F  G  +E  ++E   ++   + EE+ + DE   EEDE   +    E+D+
Sbjct: 91  NVFGSDQVVIFDDGDIEEELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDD 150

Query: 149 RGEDEEKKEEEEDKQGEEKRVEDKIGDEKR 178
             +D++  +    ++  E+R   +  +EKR
Sbjct: 151 DDDDDD--DIATRERSLERRRRRREWEEKR 178



 Score = 39.2 bits (92), Expect = 7e-04
 Identities = 19/78 (24%), Positives = 40/78 (51%), Gaps = 6/78 (7%)

Query: 113 EEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDK 172
            + EEE ++E    +    LEED+++      +E+   EDEE  + E+D+  ++   +D 
Sbjct: 104 GDIEEELQDEPRYDDAYRDLEEDDDD------DEESDEEDEESSKSEDDEDDDDDDDDDD 157

Query: 173 IGDEKRLEENKKKKTQKR 190
           I   +R  E ++++ +  
Sbjct: 158 IATRERSLERRRRRREWE 175



 Score = 34.6 bits (80), Expect = 0.022
 Identities = 11/67 (16%), Positives = 34/67 (50%)

Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQG 164
            D     EE+++++EE +EE++E  +  ++++++ +    +   R    E++    + + 
Sbjct: 117 DDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATRERSLERRRRRREWEE 176

Query: 165 EEKRVED 171
           +   +E 
Sbjct: 177 KRAELEF 183



 Score = 32.3 bits (74), Expect = 0.15
 Identities = 20/59 (33%), Positives = 33/59 (55%), Gaps = 4/59 (6%)

Query: 104 ARDETAREEEEEEEEEEE----EEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEE 158
             +E   ++EE +EE+EE    E++E + D+  ++D   +ER L     R E EEK+ E
Sbjct: 122 DLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATRERSLERRRRRREWEEKRAE 180


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 41.7 bits (97), Expect = 1e-04
 Identities = 28/90 (31%), Positives = 52/90 (57%), Gaps = 6/90 (6%)

Query: 105  RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDE----RGEDEEKKEEEE 160
            + E A+ + EE ++ EEE+++ EQ +  E +E +K   L + +E    +  +E KK EE+
Sbjct: 1614 KAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEED 1673

Query: 161  DKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
             K+ EE +  ++  DEK+  E  KK+ ++ 
Sbjct: 1674 KKKAEEAKKAEE--DEKKAAEALKKEAEEA 1701



 Score = 41.3 bits (96), Expect = 2e-04
 Identities = 28/101 (27%), Positives = 56/101 (55%)

Query: 90   KKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDER 149
            ++N +   +  K    D+   EE ++ EE+E++  E  + E  E  + E+ ++   E+++
Sbjct: 1657 EENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKK 1716

Query: 150  GEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
              +E KK EEE+K   E+  ++   D+K+ EE KK + +K+
Sbjct: 1717 KAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKK 1757



 Score = 37.4 bits (86), Expect = 0.003
 Identities = 27/93 (29%), Positives = 50/93 (53%), Gaps = 1/93 (1%)

Query: 98   QLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKE 157
            QL K  A ++   EE ++ EEE + +  +E  +  E+ +  +E +  EEDE+   E  K+
Sbjct: 1637 QLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKK 1696

Query: 158  EEEDKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
            E E+ +  E+ ++ K  +EK+  E  KK  ++ 
Sbjct: 1697 EAEEAKKAEE-LKKKEAEEKKKAEELKKAEEEN 1728



 Score = 37.4 bits (86), Expect = 0.003
 Identities = 26/93 (27%), Positives = 56/93 (60%), Gaps = 1/93 (1%)

Query: 98   QLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKE 157
            +L KA    +   + +++E EE+++ EE ++ E   + +  +E +  EED++  +E KK 
Sbjct: 1624 ELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKA 1683

Query: 158  EEEDKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
            EE++K+  E  ++ +  + K+ EE KKK+ +++
Sbjct: 1684 EEDEKKAAEA-LKKEAEEAKKAEELKKKEAEEK 1715



 Score = 37.0 bits (85), Expect = 0.004
 Identities = 26/80 (32%), Positives = 49/80 (61%)

Query: 111  EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVE 170
            EEE++ + EE ++ E+ + +  E  + E+E++  E+ ++ E EEKK+ EE K+ EE+   
Sbjct: 1602 EEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKI 1661

Query: 171  DKIGDEKRLEENKKKKTQKR 190
                + K+ EE+KKK  + +
Sbjct: 1662 KAAEEAKKAEEDKKKAEEAK 1681



 Score = 37.0 bits (85), Expect = 0.004
 Identities = 22/88 (25%), Positives = 51/88 (57%), Gaps = 6/88 (6%)

Query: 106  DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDE----RGEDEEKKEEEED 161
            +E  ++  E  ++E EE ++ E+ +  E +E +K   L + +E    + E+ +K+ EE+ 
Sbjct: 1684 EEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDK 1743

Query: 162  KQGEEKRVEDKIGDEKRLEENKKKKTQK 189
            K+ EE + ++   ++K++   KK++ +K
Sbjct: 1744 KKAEEAKKDE--EEKKKIAHLKKEEEKK 1769



 Score = 36.7 bits (84), Expect = 0.006
 Identities = 30/103 (29%), Positives = 49/103 (47%), Gaps = 8/103 (7%)

Query: 96   ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK 155
            E     A  + E A++ EE   EE  +  E+E+    EE +  +E ++  E+ +  +EEK
Sbjct: 1573 EEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEK 1632

Query: 156  K--------EEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
            K        E EE K+ EE +  ++    K  EE KK +  K+
Sbjct: 1633 KKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKK 1675



 Score = 36.3 bits (83), Expect = 0.008
 Identities = 22/93 (23%), Positives = 50/93 (53%)

Query: 98   QLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKE 157
            ++ K    ++  + EE ++ EE + + E+ +    E+ + E+ ++   E+++  +E KK 
Sbjct: 1596 EVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKA 1655

Query: 158  EEEDKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
            EEE+K    +  +    D+K+ EE KK +  ++
Sbjct: 1656 EEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEK 1688



 Score = 35.5 bits (81), Expect = 0.014
 Identities = 24/85 (28%), Positives = 49/85 (57%), Gaps = 2/85 (2%)

Query: 106  DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
            DE  ++ EE+++ +E +++ +E  +  E  +  +E +  EE ++  +E KK +E  K+ E
Sbjct: 1421 DEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAE 1480

Query: 166  EKRVEDKIGDEKRLEENKKKKTQKR 190
            E +  D+   +K+ EE KKK  + +
Sbjct: 1481 EAKKADEA--KKKAEEAKKKADEAK 1503



 Score = 35.1 bits (80), Expect = 0.018
 Identities = 21/87 (24%), Positives = 46/87 (52%), Gaps = 3/87 (3%)

Query: 106  DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
            DE  +    +++ +E +++ +E+ +  E  +  +E +  +E ++  +E KK EE  K+ E
Sbjct: 1408 DELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAE 1467

Query: 166  EKRVED---KIGDEKRLEENKKKKTQK 189
            E +  D   K  +E +  +  KKK ++
Sbjct: 1468 EAKKADEAKKKAEEAKKADEAKKKAEE 1494



 Score = 35.1 bits (80), Expect = 0.021
 Identities = 26/92 (28%), Positives = 50/92 (54%), Gaps = 7/92 (7%)

Query: 104  ARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKER-----RLPEEDERGEDEEKKEE 158
            A DE    EE+ E  E+++EE +++ +  ++   EK++     +  EED++  DE KK  
Sbjct: 1355 AADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAA 1414

Query: 159  EEDKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
               K+ +E   + K  ++K+ +E KKK  + +
Sbjct: 1415 AAKKKADE--AKKKAEEKKKADEAKKKAEEAK 1444



 Score = 34.7 bits (79), Expect = 0.024
 Identities = 29/102 (28%), Positives = 54/102 (52%), Gaps = 7/102 (6%)

Query: 96   ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK 155
               L K     + A E +++E EE+++ EE ++ E   + + E+ ++  EED++  +E K
Sbjct: 1691 AEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAK 1750

Query: 156  KEEEE-------DKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
            K+EEE        K+ E+K  E +   E  +EE   ++ +KR
Sbjct: 1751 KDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKR 1792



 Score = 34.7 bits (79), Expect = 0.027
 Identities = 25/94 (26%), Positives = 52/94 (55%), Gaps = 6/94 (6%)

Query: 101  KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKE----RRLPEEDERGEDEEKK 156
            +A    E A+ E E   +E E  EE+ +    +++E +K+    ++  EE ++ ++ +KK
Sbjct: 1340 EAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKK 1399

Query: 157  EEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
             EE+ K+ +E  ++     +K+ +E KKK  +K+
Sbjct: 1400 AEEDKKKADE--LKKAAAAKKKADEAKKKAEEKK 1431



 Score = 34.3 bits (78), Expect = 0.031
 Identities = 26/94 (27%), Positives = 55/94 (58%), Gaps = 6/94 (6%)

Query: 101  KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
            KA A  + A E+++ +E +++ EE++++ + L++    K++   +E ++  +E+KK +E 
Sbjct: 1379 KADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKK--ADEAKKKAEEKKKADEA 1436

Query: 161  DKQGEEKRVED----KIGDEKRLEENKKKKTQKR 190
             K+ EE +  D    K  + K+ EE KKK  + +
Sbjct: 1437 KKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAK 1470



 Score = 34.0 bits (77), Expect = 0.041
 Identities = 20/89 (22%), Positives = 49/89 (55%), Gaps = 4/89 (4%)

Query: 106  DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
             E  +EE +++ +  +++ E+++     + + E++++  +E ++    +KK +E  K+ E
Sbjct: 1369 AEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAE 1428

Query: 166  EKRVED----KIGDEKRLEENKKKKTQKR 190
            EK+  D    K  + K+ +E KKK  + +
Sbjct: 1429 EKKKADEAKKKAEEAKKADEAKKKAEEAK 1457



 Score = 34.0 bits (77), Expect = 0.044
 Identities = 20/85 (23%), Positives = 47/85 (55%)

Query: 106  DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
            DE  ++ EE +++ +E ++  E  +  +E +  +E +  +E ++ E+ +K +E +  + +
Sbjct: 1486 DEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEK 1545

Query: 166  EKRVEDKIGDEKRLEENKKKKTQKR 190
            +K  E K  +E +  E KKK  + +
Sbjct: 1546 KKADELKKAEELKKAEEKKKAEEAK 1570



 Score = 33.6 bits (76), Expect = 0.063
 Identities = 21/90 (23%), Positives = 49/90 (54%)

Query: 101  KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
            +A  + E A+++ +E ++  E +++ ++ +  EE +   E +  EE ++ ++ +K EE++
Sbjct: 1487 EAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKK 1546

Query: 161  DKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
                 +K  E K  +EK+  E  KK  + +
Sbjct: 1547 KADELKKAEELKKAEEKKKAEEAKKAEEDK 1576



 Score = 33.2 bits (75), Expect = 0.071
 Identities = 26/92 (28%), Positives = 47/92 (51%), Gaps = 2/92 (2%)

Query: 101  KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
            KA    + A E ++  E +++ +E ++ +E  + DE +K     + DE  + EEKK+ +E
Sbjct: 1491 KAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADE 1550

Query: 161  DKQGEE--KRVEDKIGDEKRLEENKKKKTQKR 190
             K+ EE  K  E K  +E +  E  K    ++
Sbjct: 1551 LKKAEELKKAEEKKKAEEAKKAEEDKNMALRK 1582



 Score = 33.2 bits (75), Expect = 0.082
 Identities = 25/88 (28%), Positives = 50/88 (56%), Gaps = 1/88 (1%)

Query: 104  ARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ 163
            A +    +E ++ EE ++ +E ++ +E  + DE +K   L + +E+ + EE K+ EEDK 
Sbjct: 1518 AEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKN 1577

Query: 164  GEEKRVED-KIGDEKRLEENKKKKTQKR 190
               ++ E+ K  +E R+EE  K   +++
Sbjct: 1578 MALRKAEEAKKAEEARIEEVMKLYEEEK 1605



 Score = 33.2 bits (75), Expect = 0.086
 Identities = 23/87 (26%), Positives = 46/87 (52%), Gaps = 2/87 (2%)

Query: 106  DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
            +E  + EE +++ EE ++ ++ + +  E  + ++ ++  EE ++  DE KK  E  K+ +
Sbjct: 1454 EEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKAD 1513

Query: 166  E--KRVEDKIGDEKRLEENKKKKTQKR 190
            E  K  E K  DE +  E  KK  + +
Sbjct: 1514 EAKKAEEAKKADEAKKAEEAKKADEAK 1540



 Score = 32.4 bits (73), Expect = 0.13
 Identities = 21/94 (22%), Positives = 50/94 (53%), Gaps = 6/94 (6%)

Query: 101  KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
            KA    + A E++++ +E ++    +++ +  E  +  +E++  +E ++  +E KK +E 
Sbjct: 1392 KADEAKKKAEEDKKKADELKKAAAAKKKAD--EAKKKAEEKKKADEAKKKAEEAKKADEA 1449

Query: 161  DKQGEEKR----VEDKIGDEKRLEENKKKKTQKR 190
             K+ EE +     + K  + K+ +E KKK  + +
Sbjct: 1450 KKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAK 1483



 Score = 32.4 bits (73), Expect = 0.14
 Identities = 25/96 (26%), Positives = 45/96 (46%), Gaps = 6/96 (6%)

Query: 101  KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
            KA    + A   +++ EE ++  E  + +     DE E      E  E+ ++E KK+ + 
Sbjct: 1323 KAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADA 1382

Query: 161  -DKQGEEKRVEDKIG-----DEKRLEENKKKKTQKR 190
              K+ EEK+  D+       D+K+ +E KK    K+
Sbjct: 1383 AKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKK 1418



 Score = 32.4 bits (73), Expect = 0.15
 Identities = 25/90 (27%), Positives = 46/90 (51%)

Query: 101  KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
            KA      A  + EE ++ EE   E+    Y EE + + E     E+ + + EE K+ EE
Sbjct: 1571 KAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEE 1630

Query: 161  DKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
            +K+  E+  + +  ++K+ EE KK + + +
Sbjct: 1631 EKKKVEQLKKKEAEEKKKAEELKKAEEENK 1660



 Score = 32.0 bits (72), Expect = 0.19
 Identities = 25/102 (24%), Positives = 53/102 (51%)

Query: 89   LKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDE 148
            LKK    + +  +A  + E  ++ +E +++ EE ++  E  +  EE +  +E +   E+ 
Sbjct: 1410 LKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEA 1469

Query: 149  RGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
            +  DE KK+ EE K+ +E + + +   +K  E  K  + +K+
Sbjct: 1470 KKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKK 1511



 Score = 32.0 bits (72), Expect = 0.19
 Identities = 18/86 (20%), Positives = 42/86 (48%), Gaps = 3/86 (3%)

Query: 104  ARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ 163
            A+ +    +++ +  +++ EE ++  E  + +           +E+ E  EKK+EE  K+
Sbjct: 1320 AKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKK 1379

Query: 164  GEEKRVEDKIGDEKRLEENKKKKTQK 189
             +  +   K  +EK+  +  KKK ++
Sbjct: 1380 ADAAK---KKAEEKKKADEAKKKAEE 1402



 Score = 32.0 bits (72), Expect = 0.21
 Identities = 24/94 (25%), Positives = 46/94 (48%), Gaps = 4/94 (4%)

Query: 101  KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
            +A  + E A++ +E +++ EE ++ +E  +  EE +   E +   E+ +  DE KK+ EE
Sbjct: 1435 EAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEE 1494

Query: 161  DKQGEE----KRVEDKIGDEKRLEENKKKKTQKR 190
             K+  +         K  DE +  E  KK  + +
Sbjct: 1495 AKKKADEAKKAAEAKKKADEAKKAEEAKKADEAK 1528



 Score = 32.0 bits (72), Expect = 0.21
 Identities = 26/87 (29%), Positives = 41/87 (47%), Gaps = 3/87 (3%)

Query: 106  DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
            +E  + EEE   EE  + EE     +       K     + DE  + EEKK+ +E K+ E
Sbjct: 1240 EEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAE 1299

Query: 166  EKRVED---KIGDEKRLEENKKKKTQK 189
            EK+  D   K  +E +  +  KKK ++
Sbjct: 1300 EKKKADEAKKKAEEAKKADEAKKKAEE 1326



 Score = 31.6 bits (71), Expect = 0.22
 Identities = 30/95 (31%), Positives = 49/95 (51%), Gaps = 5/95 (5%)

Query: 101  KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEK----ERRLPEEDERGEDEEKK 156
            KA    + A E ++ +E +++ EE ++ DE  ++ E  K    E +   E ++  DE KK
Sbjct: 1458 KAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKK 1517

Query: 157  EEEEDKQGEEKRVED-KIGDEKRLEENKKKKTQKR 190
             EE  K  E K+ E+ K  DE +  E KKK  + +
Sbjct: 1518 AEEAKKADEAKKAEEAKKADEAKKAEEKKKADELK 1552



 Score = 31.3 bits (70), Expect = 0.33
 Identities = 17/80 (21%), Positives = 41/80 (51%)

Query: 111  EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVE 170
            EE+++ +E +++ EE ++ +  ++   E +++     ++ E+ +K  E    + E    E
Sbjct: 1299 EEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADE 1358

Query: 171  DKIGDEKRLEENKKKKTQKR 190
             +  +EK     KKK+  K+
Sbjct: 1359 AEAAEEKAEAAEKKKEEAKK 1378



 Score = 30.9 bits (69), Expect = 0.44
 Identities = 16/80 (20%), Positives = 40/80 (50%)

Query: 107  ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
            E  ++ +E ++ EE+++ ++ + +  E  + ++ ++  EE ++  D  KK+ EE K+  E
Sbjct: 1287 EEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAE 1346

Query: 167  KRVEDKIGDEKRLEENKKKK 186
                +        E  ++K 
Sbjct: 1347 AAKAEAEAAADEAEAAEEKA 1366



 Score = 30.9 bits (69), Expect = 0.47
 Identities = 22/90 (24%), Positives = 46/90 (51%), Gaps = 4/90 (4%)

Query: 101  KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
            +A  + E A++ +E +++ EE +++ +  +   E+  +       E E   DE +  EE+
Sbjct: 1306 EAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEK 1365

Query: 161  DKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
             +  E+K+ E K    K+ +  KKK  +K+
Sbjct: 1366 AEAAEKKKEEAK----KKADAAKKKAEEKK 1391



 Score = 30.5 bits (68), Expect = 0.60
 Identities = 27/82 (32%), Positives = 47/82 (57%), Gaps = 2/82 (2%)

Query: 109  AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKR 168
            A E+++ EE ++ EE++       EE +  +E R+ E  +  E+E+K + EE K+ EE +
Sbjct: 1560 AEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAK 1619

Query: 169  VEDKIGDEKRLEENKKKKTQKR 190
            +  K  + K+ EE KKK  Q +
Sbjct: 1620 I--KAEELKKAEEEKKKVEQLK 1639



 Score = 30.5 bits (68), Expect = 0.63
 Identities = 20/81 (24%), Positives = 39/81 (48%), Gaps = 1/81 (1%)

Query: 106  DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
            +E  + +E +++ EE ++ ++ + +  E  +     +   E+ +   E  K E E    E
Sbjct: 1299 EEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADE 1358

Query: 166  EKRVEDKI-GDEKRLEENKKK 185
             +  E+K    EK+ EE KKK
Sbjct: 1359 AEAAEEKAEAAEKKKEEAKKK 1379



 Score = 30.5 bits (68), Expect = 0.67
 Identities = 26/90 (28%), Positives = 43/90 (47%), Gaps = 4/90 (4%)

Query: 105  RDETAREEEEEEEEEE----EEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
            + E AR+ EEE + EE    E+ ++ E  +  EE + + E     E+ER  +E +K EE 
Sbjct: 1201 KAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEA 1260

Query: 161  DKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
                  +R      +E R  +  KK  +K+
Sbjct: 1261 RMAHFARRQAAIKAEEARKADELKKAEEKK 1290



 Score = 30.5 bits (68), Expect = 0.70
 Identities = 22/83 (26%), Positives = 44/83 (53%), Gaps = 1/83 (1%)

Query: 109  AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPE-EDERGEDEEKKEEEEDKQGEEK 167
            A E +++ +E ++ EE ++ DE  + +E +K     + E+++  DE KK EE  K  E+K
Sbjct: 1505 AAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKK 1564

Query: 168  RVEDKIGDEKRLEENKKKKTQKR 190
            + E+    E+      +K  + +
Sbjct: 1565 KAEEAKKAEEDKNMALRKAEEAK 1587



 Score = 30.5 bits (68), Expect = 0.70
 Identities = 25/93 (26%), Positives = 54/93 (58%), Gaps = 1/93 (1%)

Query: 96   ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK 155
            E    KA    + A E++++ EE +++EEE+++  +L+++E +K   + +E E   +EE 
Sbjct: 1726 EENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEEL 1785

Query: 156  KEEEEDKQGE-EKRVEDKIGDEKRLEENKKKKT 187
             EE+E ++ E +K+++D   +   + E  K+  
Sbjct: 1786 DEEDEKRRMEVDKKIKDIFDNFANIIEGGKEGN 1818



 Score = 30.1 bits (67), Expect = 0.71
 Identities = 26/99 (26%), Positives = 46/99 (46%), Gaps = 8/99 (8%)

Query: 100  FKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEE 159
            F   A+++   +E  EE   + EE ++ +    EE    +E +   ED R  +E +K E+
Sbjct: 1079 FDFDAKEDNRADEATEEAFGKAEEAKKTETGKAEEARKAEEAKKKAEDARKAEEARKAED 1138

Query: 160  EDKQGEEKRVED--------KIGDEKRLEENKKKKTQKR 190
              K  E ++ ED        K  D ++ EE +K +  K+
Sbjct: 1139 ARKAEEARKAEDAKRVEIARKAEDARKAEEARKAEDAKK 1177



 Score = 29.0 bits (64), Expect = 1.7
 Identities = 18/76 (23%), Positives = 39/76 (51%)

Query: 115  EEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIG 174
            EE  + +E ++ E+ +  +E +  +E++  +E ++  +E KK +E  K+ EE + +    
Sbjct: 1275 EEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAA 1334

Query: 175  DEKRLEENKKKKTQKR 190
             +K  E  K  +  K 
Sbjct: 1335 KKKAEEAKKAAEAAKA 1350



 Score = 28.6 bits (63), Expect = 2.6
 Identities = 20/95 (21%), Positives = 46/95 (48%), Gaps = 8/95 (8%)

Query: 101  KAGARDETAREEEEEEEEEE----EEEEEQEQDEYLEE--DENEKERRLPEEDERGEDEE 154
            +A  + E AR+ +E ++ EE    +E ++ E+ +  +E   + E+ ++  E  ++ E+ +
Sbjct: 1269 QAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAK 1328

Query: 155  KKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQK 189
            KK +   K+ EE +   +    K   E    + + 
Sbjct: 1329 KKADAAKKKAEEAKKAAEA--AKAEAEAAADEAEA 1361



 Score = 28.6 bits (63), Expect = 2.6
 Identities = 20/84 (23%), Positives = 47/84 (55%), Gaps = 1/84 (1%)

Query: 107  ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
            + A+++ EE ++ +E +++ E+ +  EE + + E     ++ + + EE K+ +E K+  E
Sbjct: 1434 DEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAE 1493

Query: 167  KRVEDKIGDEKRLEENKKKKTQKR 190
            +  + K  + K+  E KKK  + +
Sbjct: 1494 E-AKKKADEAKKAAEAKKKADEAK 1516



 Score = 28.2 bits (62), Expect = 3.4
 Identities = 29/94 (30%), Positives = 47/94 (50%), Gaps = 6/94 (6%)

Query: 98   QLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKE 157
            +L K  A ++   EE ++ EEE + + E+ + E  E+ +  +E +  EE+++     KKE
Sbjct: 1706 ELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKE 1765

Query: 158  EEEDKQGEEKRVEDKIG------DEKRLEENKKK 185
            EE+  +   K  E  I       DEKR  E  KK
Sbjct: 1766 EEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKK 1799



 Score = 28.2 bits (62), Expect = 3.7
 Identities = 22/89 (24%), Positives = 42/89 (47%), Gaps = 3/89 (3%)

Query: 105  RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKE--RRLPEEDERGEDEEKKEEEEDK 162
            + E  ++ +E +++ EE ++  E  +  EE + + +  ++  EE ++  +  K E E   
Sbjct: 1297 KAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAA 1356

Query: 163  QGEEKRVEDKIGDEKRLEENKKK-KTQKR 190
               E   E     EK+ EE KKK    K+
Sbjct: 1357 DEAEAAEEKAEAAEKKKEEAKKKADAAKK 1385



 Score = 28.2 bits (62), Expect = 3.8
 Identities = 26/89 (29%), Positives = 47/89 (52%), Gaps = 10/89 (11%)

Query: 106  DETAREEEEEEEEEE----EEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEE--- 158
            DE  +  E +++ +E    EE ++ ++ +  EE +   E +  EE ++ ++ +K EE   
Sbjct: 1500 DEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKK 1559

Query: 159  -EEDKQGEEKRV--EDKIGDEKRLEENKK 184
             EE K+ EE +   EDK    ++ EE KK
Sbjct: 1560 AEEKKKAEEAKKAEEDKNMALRKAEEAKK 1588



 Score = 27.8 bits (61), Expect = 4.1
 Identities = 25/86 (29%), Positives = 46/86 (53%), Gaps = 5/86 (5%)

Query: 109  AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEE-KKEEEEDKQGEEK 167
            A E ++ EE++     + E+ +  EE   E+  +L EE+++ + EE KK EE   + EE 
Sbjct: 1566 AEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEEL 1625

Query: 168  RVED----KIGDEKRLEENKKKKTQK 189
            +  +    K+   K+ E  +KKK ++
Sbjct: 1626 KKAEEEKKKVEQLKKKEAEEKKKAEE 1651



 Score = 27.8 bits (61), Expect = 4.5
 Identities = 20/86 (23%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 105  RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQG 164
              + + ++ + + +E+   +E  ++ + + +E +K      E+ R  +E KK+ E+ ++ 
Sbjct: 1071 GLKPSYKDFDFDAKEDNRADEATEEAFGKAEEAKKTETGKAEEARKAEEAKKKAEDARKA 1130

Query: 165  EEKRVEDKIGDEKRLEENKKKKTQKR 190
            EE R   K  D ++ EE +K +  KR
Sbjct: 1131 EEAR---KAEDARKAEEARKAEDAKR 1153



 Score = 27.8 bits (61), Expect = 4.9
 Identities = 27/96 (28%), Positives = 47/96 (48%), Gaps = 9/96 (9%)

Query: 104  ARDETAREEEEEEEEEEEEEEE---QEQDEYLEEDENEKERRLPEEDERGED---EEKKE 157
            A D    E  ++ EE +++ EE    E++   EE    +E R+     R      EE ++
Sbjct: 1220 AEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARK 1279

Query: 158  EEEDKQGEEKRVED---KIGDEKRLEENKKKKTQKR 190
             +E K+ EEK+  D   K  ++K+ +E KKK  + +
Sbjct: 1280 ADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAK 1315



 Score = 27.4 bits (60), Expect = 6.9
 Identities = 27/98 (27%), Positives = 48/98 (48%), Gaps = 11/98 (11%)

Query: 101  KAGARDETAREEEEEEEEEE----EEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKK 156
            +A  + E AR+ EE  + E+    EE  + E  + +E     ++ R  EE  + ED +K 
Sbjct: 1119 EAKKKAEDARKAEEARKAEDARKAEEARKAEDAKRVEIARKAEDARKAEEARKAEDAKKA 1178

Query: 157  EE----EEDKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
            E     EE ++ EE R   K  D ++ E  +K + +++
Sbjct: 1179 EAARKAEEVRKAEELR---KAEDARKAEAARKAEEERK 1213


>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein.  This is a family of
           fungal and plant proteins and contains many hypothetical
           proteins. VID27 is a cytoplasmic protein that plays a
           potential role in vacuolar protein degradation.
          Length = 794

 Score = 40.9 bits (96), Expect = 2e-04
 Identities = 15/42 (35%), Positives = 27/42 (64%)

Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDE 148
               EEE+EEEEEEE+E+E    E+ +++E E++    + ++
Sbjct: 388 RDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVESKYED 429



 Score = 34.7 bits (80), Expect = 0.021
 Identities = 17/57 (29%), Positives = 33/57 (57%)

Query: 100 FKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKK 156
           F A   ++   E ++EEEE+EEEEEE+++DE   ++ ++ E    ++ E   ++   
Sbjct: 376 FSALEIEDANTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVESKYEDSDG 432



 Score = 34.3 bits (79), Expect = 0.034
 Identities = 19/79 (24%), Positives = 37/79 (46%)

Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVE 170
           E   +++  + +E EQ+           ++     +DE  EDEE++EEE++ +G  K   
Sbjct: 354 ETLNKQKWTKAKETEQDYILDAFSALEIEDANTERDDEEEEDEEEEEEEDEDEGPSKEHS 413

Query: 171 DKIGDEKRLEENKKKKTQK 189
           D    E+   E+K + +  
Sbjct: 414 DDEEFEEDDVESKYEDSDG 432



 Score = 34.0 bits (78), Expect = 0.047
 Identities = 19/59 (32%), Positives = 35/59 (59%)

Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQG 164
           D  +  E E+   E ++EEE++++E  EEDE+E   +   +DE  E+++ + + ED  G
Sbjct: 374 DAFSALEIEDANTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVESKYEDSDG 432



 Score = 32.0 bits (73), Expect = 0.17
 Identities = 20/81 (24%), Positives = 39/81 (48%), Gaps = 1/81 (1%)

Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLE-EDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
           ET  +++  + +E E++   +    LE ED N +     EEDE  E+EE ++E   K+  
Sbjct: 354 ETLNKQKWTKAKETEQDYILDAFSALEIEDANTERDDEEEEDEEEEEEEDEDEGPSKEHS 413

Query: 166 EKRVEDKIGDEKRLEENKKKK 186
           +    ++   E + E++    
Sbjct: 414 DDEEFEEDDVESKYEDSDGNS 434



 Score = 32.0 bits (73), Expect = 0.18
 Identities = 17/45 (37%), Positives = 25/45 (55%)

Query: 96  ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKE 140
           E+   +    +E   EEEEEE+E+E   +E   DE  EED+ E +
Sbjct: 382 EDANTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVESK 426


>gnl|CDD|162039 TIGR00790, fnt, formate/nitrite transporter.  The Formate-Nitrite
           Transporter (FNT) Family (TC 2.A.44)The prokaryotic
           proteins of the FNT family probably function in the
           transport of the structurally related compounds, formate
           and nitrite. The homologous yeast protein may function
           as a short chain aliphatic carboxylate H+
           symporter,transporting formate, acetate and propionate,
           and functioning primarily as an acetate uptake permease.
           The putative formate efflux transporters (FocA) of
           bacteria associated with pyruvate-formate lyase (pfl)
           comprise cluster I; the putative formate uptake
           permeases (FdhC) of bacteria and archaea associated with
           formate dehydrogenase comprise cluster II; the putative
           nitrite uptake permeases (NirC) of bacteria comprise
           cluster III, and the single yeast protein, the putative
           acetate:H+ symporter alone comprises cluster IV. The
           energy coupling mechanisms for proteins of the FNT
           family have not been extensively characterized. HCO2 -,
           CH3CO2 - and NO2 - uptakes are probably coupled to
           H+symport. HCO2 - efflux may be driven by the membrane
           potential by a uniport mechanism or by H+ antiport
           [Transport and binding proteins, Anions].
          Length = 239

 Score = 40.0 bits (94), Expect = 3e-04
 Identities = 17/57 (29%), Positives = 20/57 (35%), Gaps = 14/57 (24%)

Query: 21  TLILVFCTATG-NILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVMPLGILTLV 76
           T +  F      N LVCLA+ W            MSL+  DL      M   I   V
Sbjct: 136 TFVEAFFLGILCNWLVCLAV-W------------MSLSGKDLAGKFFAMWFPIFAFV 179


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 40.4 bits (94), Expect = 3e-04
 Identities = 24/86 (27%), Positives = 47/86 (54%), Gaps = 3/86 (3%)

Query: 106  DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEE---EEDK 162
            D  A +E +EE + E++E  Q++D   E +  +++ +  +  +  ED+EK  E   EE+ 
Sbjct: 4012 DMEAADENKEEADAEKDEPMQDEDPLEENNTLDEDIQQDDFSDLAEDDEKMNEDGFEENV 4071

Query: 163  QGEEKRVEDKIGDEKRLEENKKKKTQ 188
            Q  E+  ED +  ++ LE+ +  + Q
Sbjct: 4072 QENEESTEDGVKSDEELEQGEVPEDQ 4097



 Score = 38.8 bits (90), Expect = 0.001
 Identities = 21/87 (24%), Positives = 40/87 (45%), Gaps = 5/87 (5%)

Query: 96   ENQLFKAGARDETAREEEEEEEEEE-EEEEEQEQDEYLEEDENEKERRLPEEDERGEDEE 154
            EN+      +DE  ++E+  EE    +E+ +Q+    L ED+ +      EE+ +  +E 
Sbjct: 4018 ENKEEADAEKDEPMQDEDPLEENNTLDEDIQQDDFSDLAEDDEKMNEDGFEENVQENEES 4077

Query: 155  KKE----EEEDKQGEEKRVEDKIGDEK 177
             ++    +EE +QGE    +      K
Sbjct: 4078 TEDGVKSDEELEQGEVPEDQAIDNHPK 4104



 Score = 38.1 bits (88), Expect = 0.002
 Identities = 23/84 (27%), Positives = 40/84 (47%), Gaps = 4/84 (4%)

Query: 105  RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRL---PEEDERGEDEEKKEEEED 161
            +D    E E  + EEE ++ ++E ++  E+  N    +L   P E++  E E+K  E+  
Sbjct: 3869 KDSVVSENENSDSEEENQDLDEEVNDIPEDLSNSLNEKLWDEPNEEDLLETEQKSNEQSA 3928

Query: 162  KQGEEKRVEDKIGDEKRLEENKKK 185
               E   V  K  D K LE+  ++
Sbjct: 3929 ANNESDLV-SKEDDNKALEDKDRQ 3951



 Score = 36.1 bits (83), Expect = 0.010
 Identities = 17/84 (20%), Positives = 42/84 (50%), Gaps = 4/84 (4%)

Query: 109  AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEED----ERGEDEEKKEEEEDKQG 164
             +E +  ++ + E+ + +  DE  EE + EK+  + +ED        DE+ ++++     
Sbjct: 3997 EKEGDVSKDSDLEDMDMEAADENKEEADAEKDEPMQDEDPLEENNTLDEDIQQDDFSDLA 4056

Query: 165  EEKRVEDKIGDEKRLEENKKKKTQ 188
            E+    ++ G E+ ++EN++    
Sbjct: 4057 EDDEKMNEDGFEENVQENEESTED 4080



 Score = 35.7 bits (82), Expect = 0.012
 Identities = 16/63 (25%), Positives = 31/63 (49%), Gaps = 3/63 (4%)

Query: 116  EEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEE---EDKQGEEKRVEDK 172
             E+++ EE   E+D   + D +E E R  E D  G  ++    E    D + E + ++++
Sbjct: 3832 NEDDDLEELANEEDTANQSDLDESEARELESDMNGVTKDSVVSENENSDSEEENQDLDEE 3891

Query: 173  IGD 175
            + D
Sbjct: 3892 VND 3894



 Score = 35.4 bits (81), Expect = 0.015
 Identities = 18/76 (23%), Positives = 36/76 (47%), Gaps = 4/76 (5%)

Query: 104  ARDETAREEEEEEEEEEEEEEEQEQDEYLEE---DENEKERRLPEEDERGEDEEKKEEEE 160
            + +E   +E  EE+  E E++  EQ     E      E + +  E+ +R +++E +EE  
Sbjct: 3902 SLNEKLWDEPNEEDLLETEQKSNEQSAANNESDLVSKEDDNKALEDKDR-QEKEDEEEMS 3960

Query: 161  DKQGEEKRVEDKIGDE 176
            D  G +  ++  I + 
Sbjct: 3961 DDVGIDDEIQPDIQEN 3976



 Score = 34.2 bits (78), Expect = 0.042
 Identities = 17/80 (21%), Positives = 40/80 (50%), Gaps = 4/80 (5%)

Query: 110  REEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDE----EKKEEEEDKQGE 165
            ++ +E+E+EEE  ++    DE   + +    +  PE ++    E    ++KE +  K  +
Sbjct: 3948 KDRQEKEDEEEMSDDVGIDDEIQPDIQENNSQPPPENEDLDLPEDLKLDEKEGDVSKDSD 4007

Query: 166  EKRVEDKIGDEKRLEENKKK 185
             + ++ +  DE + E + +K
Sbjct: 4008 LEDMDMEAADENKEEADAEK 4027



 Score = 31.1 bits (70), Expect = 0.34
 Identities = 18/85 (21%), Positives = 36/85 (42%), Gaps = 5/85 (5%)

Query: 104  ARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ 163
            A D+    E+  EE  +E EE  E     +E   E E+    ED+  ++  K + +    
Sbjct: 4056 AEDDEKMNEDGFEENVQENEESTEDGVKSDE---ELEQGEVPEDQAIDNHPKMDAKSTFA 4112

Query: 164  GEEKRVEDKIGDEKRLEENKKKKTQ 188
                  +++  D+  + EN++   +
Sbjct: 4113 --SAEADEENTDKGIVGENEELGEE 4135



 Score = 31.1 bits (70), Expect = 0.37
 Identities = 20/89 (22%), Positives = 42/89 (47%), Gaps = 11/89 (12%)

Query: 106  DETAREEEEEEEEEEEEEEEQE--QDEYLEEDENEKERR---------LPEEDERGEDEE 154
            D  A E+++ +E+E+EEE   +   D+ ++ D  E   +         LPE+ +  E E 
Sbjct: 3941 DNKALEDKDRQEKEDEEEMSDDVGIDDEIQPDIQENNSQPPPENEDLDLPEDLKLDEKEG 4000

Query: 155  KKEEEEDKQGEEKRVEDKIGDEKRLEENK 183
               ++ D +  +    D+  +E   E+++
Sbjct: 4001 DVSKDSDLEDMDMEAADENKEEADAEKDE 4029



 Score = 28.8 bits (64), Expect = 2.4
 Identities = 16/67 (23%), Positives = 30/67 (44%), Gaps = 2/67 (2%)

Query: 107  ETAREEEEEEEEEEEEEEEQEQDEYLEE-DENEKERRLPEEDERG-EDEEKKEEEEDKQG 164
            E    E+     E +   +++ ++ LE+ D  EKE      D+ G +DE + + +E+   
Sbjct: 3920 EQKSNEQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSDDVGIDDEIQPDIQENNSQ 3979

Query: 165  EEKRVED 171
                 ED
Sbjct: 3980 PPPENED 3986



 Score = 27.7 bits (61), Expect = 4.6
 Identities = 18/93 (19%), Positives = 37/93 (39%), Gaps = 5/93 (5%)

Query: 96   ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK 155
            E+ L +    DE  ++++  +  E++E+  ++     EE+  E E    +  +  E+ E+
Sbjct: 4034 EDPLEENNTLDEDIQQDDFSDLAEDDEKMNEDG---FEENVQENEESTEDGVKSDEELEQ 4090

Query: 156  KEEEEDK--QGEEKRVEDKIGDEKRLEENKKKK 186
             E  ED+      K            +E    K
Sbjct: 4091 GEVPEDQAIDNHPKMDAKSTFASAEADEENTDK 4123


>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
           [Transcription / Chromatin structure and dynamics].
          Length = 279

 Score = 39.6 bits (92), Expect = 4e-04
 Identities = 26/75 (34%), Positives = 39/75 (52%), Gaps = 1/75 (1%)

Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
                EEEE  EE +  EE+E +E   +   E  R L EE+E  E E   + E+    E 
Sbjct: 172 PDVDNEEEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEE-EAEGSDDGEDVVDYEG 230

Query: 167 KRVEDKIGDEKRLEE 181
           +R++ K G+E+ +EE
Sbjct: 231 ERIDKKQGEEEEMEE 245



 Score = 38.1 bits (88), Expect = 0.001
 Identities = 27/90 (30%), Positives = 47/90 (52%), Gaps = 4/90 (4%)

Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLEEDE--NEKERRLPEEDERGED--EEKKEE 158
              +E     EE +  EEEE+EE   D Y E +   NE+E    E  + GED  + + E 
Sbjct: 173 DVDNEEEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEEEAEGSDDGEDVVDYEGER 232

Query: 159 EEDKQGEEKRVEDKIGDEKRLEENKKKKTQ 188
            + KQGEE+ +E+++ +   +E  ++  ++
Sbjct: 233 IDKKQGEEEEMEEEVINLFEIEWEEESPSE 262



 Score = 28.8 bits (64), Expect = 1.5
 Identities = 20/66 (30%), Positives = 30/66 (45%), Gaps = 5/66 (7%)

Query: 106 DETAREEEEEEEEEEEEEEEQEQD-----EYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
            ++  E   E  EEEEEE E   D     +Y  E  ++K+    E +E   +  + E EE
Sbjct: 198 SDSYGEGNRELNEEEEEEAEGSDDGEDVVDYEGERIDKKQGEEEEMEEEVINLFEIEWEE 257

Query: 161 DKQGEE 166
           +   EE
Sbjct: 258 ESPSEE 263


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 39.9 bits (94), Expect = 4e-04
 Identities = 18/60 (30%), Positives = 39/60 (65%)

Query: 104 ARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ 163
                 + EEE++E++++    ++++E  EE++ +KE    EE+E  E+E+++EEE+ K+
Sbjct: 416 VEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475



 Score = 38.7 bits (91), Expect = 0.001
 Identities = 19/60 (31%), Positives = 42/60 (70%), Gaps = 1/60 (1%)

Query: 108 TAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEK 167
              + E++ EEE++E++++      +E+E E+E+   +E+E+ E+EE+ EEE++++ E+K
Sbjct: 415 IVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKE-KKEEEKEEEEEEAEEEKEEEEEKK 473



 Score = 38.4 bits (90), Expect = 0.001
 Identities = 22/73 (30%), Positives = 44/73 (60%)

Query: 100 FKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEE 159
           F  G++  T + ++  E+ E++ EEE+++ +       +KE    EE E+ E+E+++EEE
Sbjct: 401 FLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEE 460

Query: 160 EDKQGEEKRVEDK 172
           E ++ +E+  E K
Sbjct: 461 EAEEEKEEEEEKK 473



 Score = 38.0 bits (89), Expect = 0.002
 Identities = 23/82 (28%), Positives = 47/82 (57%), Gaps = 7/82 (8%)

Query: 108 TAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEK 167
           T  ++  ++ ++  E+ E++++E  +E + +      +E+E  E++EKKEEE++++ EE 
Sbjct: 403 TGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEA 462

Query: 168 RVEDKIGDEKRLEENKKKKTQK 189
             E +       EE +KKK Q 
Sbjct: 463 EEEKE-------EEEEKKKKQA 477



 Score = 34.9 bits (81), Expect = 0.016
 Identities = 23/81 (28%), Positives = 44/81 (54%), Gaps = 9/81 (11%)

Query: 83  SGAVLRLKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERR 142
           S    +  K I+ + +  +   + E  ++    +++EEEEEEE+E+ E         E +
Sbjct: 405 SKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKE---------EEK 455

Query: 143 LPEEDERGEDEEKKEEEEDKQ 163
             EE+E  E++E++EE++ KQ
Sbjct: 456 EEEEEEAEEEKEEEEEKKKKQ 476



 Score = 34.9 bits (81), Expect = 0.020
 Identities = 15/77 (19%), Positives = 44/77 (57%)

Query: 113 EEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDK 172
            EEE E     ++  +      ++ EK+R   +++++ +    K++EE+++ E+++ E++
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEE 454

Query: 173 IGDEKRLEENKKKKTQK 189
             +E+   E +K++ ++
Sbjct: 455 KEEEEEEAEEEKEEEEE 471



 Score = 33.0 bits (76), Expect = 0.087
 Identities = 13/80 (16%), Positives = 46/80 (57%)

Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVE 170
           EEE E     ++  ++ +    + ++  +E +  ++ +    ++K+EEEE+++ +++  +
Sbjct: 396 EEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEK 455

Query: 171 DKIGDEKRLEENKKKKTQKR 190
           ++  +E   E+ ++++ +K+
Sbjct: 456 EEEEEEAEEEKEEEEEKKKK 475


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 39.4 bits (92), Expect = 5e-04
 Identities = 20/93 (21%), Positives = 44/93 (47%), Gaps = 9/93 (9%)

Query: 101 KAGARDETAREEEEEEEEEEEEEEEQ-EQDEYLEEDENEKERRLPEEDERGEDEEKKEEE 159
           +  A+    + +++E+++ EE +++Q  + E L++ E E+     E+ ++ E+  K+   
Sbjct: 71  QKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQ-EQKKQAEEAAKQAAL 129

Query: 160 EDKQGEEKRVEDKI-------GDEKRLEENKKK 185
           + KQ EE   +           + KR     KK
Sbjct: 130 KQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKK 162



 Score = 32.9 bits (75), Expect = 0.081
 Identities = 15/97 (15%), Positives = 38/97 (39%), Gaps = 7/97 (7%)

Query: 96  ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK 155
           E +  K   + E  ++++  E+E  ++ E++      ++ + E+  +     ++  +E  
Sbjct: 79  EQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAA 138

Query: 156 KEEEEDKQ----GEEKRVEDKIGDEKRLEENKKKKTQ 188
            +     +     E KR        K+     KKK +
Sbjct: 139 AKAAAAAKAKAEAEAKRAAAA---AKKAAAEAKKKAE 172



 Score = 27.8 bits (62), Expect = 4.2
 Identities = 14/77 (18%), Positives = 30/77 (38%), Gaps = 2/77 (2%)

Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE--KR 168
               +E++++ EE  ++     ++ E    +       + E E K+     K+     K+
Sbjct: 110 RLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKK 169

Query: 169 VEDKIGDEKRLEENKKK 185
             +    +K   E KKK
Sbjct: 170 KAEAEAAKKAAAEAKKK 186



 Score = 27.1 bits (60), Expect = 7.0
 Identities = 19/103 (18%), Positives = 41/103 (39%), Gaps = 5/103 (4%)

Query: 89  LKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKE---RRLPE 145
           L++    E +  K   ++  A +E++++ EE  ++   +Q +  E           +   
Sbjct: 92  LQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEA 151

Query: 146 EDERGEDEEKK-EEEEDKQGEEKRVEDKIGD-EKRLEENKKKK 186
           E +R     KK   E  K+ E +  +    + +K+ E     K
Sbjct: 152 EAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAK 194


>gnl|CDD|130712 TIGR01651, CobT, cobaltochelatase, CobT subunit.  This model
           describes Pseudomonas denitrificans CobT gene product,
           which is a cobalt chelatase subunit that functions in
           cobalamin biosynthesis. Cobalamin (vitamin B12) can be
           synthesized via several pathways, including an aerobic
           pathway (found in Pseudomonas denitrificans) and an
           anaerobic pathway (found in P. shermanii and Salmonella
           typhimurium). These pathways differ in the point of
           cobalt insertion during corrin ring formation. There are
           apparently a number of variations on these two pathways,
           where the major differences seem to be concerned with
           the process of ring contraction. Confusion regarding the
           functions of enzymes found in the aerobic vs. anaerobic
           pathways has arisen because nonhomologous genes in these
           different pathways were given the same gene symbols.
           Thus, cobT in the aerobic pathway (P. denitrificans) is
           not a homolog of cobT in the anaerobic pathway (S.
           typhimurium). It should be noted that E. coli
           synthesizes cobalamin only when it is supplied with the
           precursor cobinamide, which is a complex intermediate.
           Additionally, all E. coli cobalamin synthesis genes
           (cobU, cobS and cobT) were named after their Salmonella
           typhimurium homologs which function in the anaerobic
           cobalamin synthesis pathway. This model describes the
           aerobic cobalamin pathway Pseudomonas denitrificans CobT
           gene product, which is a cobalt chelatase subunit, with
           a MW ~70 kDa. The aerobic pathway cobalt chelatase is a
           heterotrimeric, ATP-dependent enzyme that catalyzes
           cobalt insertion during cobalamin biosynthesis. The
           other two subunits are the P. denitrificans CobS
           (TIGR01650) and CobN (pfam02514 CobN/Magnesium
           Chelatase) proteins. To avoid potential confusion with
           the nonhomologous Salmonella typhimurium/E.coli cobT
           gene product, the P. denitrificans gene symbol is not
           used in the name of this model [Biosynthesis of
           cofactors, prosthetic groups, and carriers, Heme,
           porphyrin, and cobalamin].
          Length = 600

 Score = 39.5 bits (92), Expect = 5e-04
 Identities = 20/68 (29%), Positives = 37/68 (54%)

Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKR 168
             E E+EE+ ++++  E EQ+E  E +   +E   P+E E  + E +  EEE  Q ++  
Sbjct: 206 DTESEDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRESESGEEEMVQSDQDD 265

Query: 169 VEDKIGDE 176
           + D+  D+
Sbjct: 266 LPDESDDD 273



 Score = 32.6 bits (74), Expect = 0.11
 Identities = 18/61 (29%), Positives = 28/61 (45%)

Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
           D+    E E+EE+ E E E QE     E +  ++E    EE+    D++   +E D   E
Sbjct: 216 DDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRESESGEEEMVQSDQDDLPDESDDDSE 275

Query: 166 E 166
            
Sbjct: 276 T 276


>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
            [Transcription / DNA replication, recombination, and
            repair / Chromatin structure and dynamics].
          Length = 1001

 Score = 39.6 bits (92), Expect = 6e-04
 Identities = 23/77 (29%), Positives = 37/77 (48%), Gaps = 10/77 (12%)

Query: 102  AGARDETAREEEEEEE----------EEEEEEEEQEQDEYLEEDENEKERRLPEEDERGE 151
             G+ DE+   EEE  E          E +E+EE  E  E L EDE+E +    E+ E  +
Sbjct: 925  VGSDDESDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEEDGEDWD 984

Query: 152  DEEKKEEEEDKQGEEKR 168
            + E K   + + G+ ++
Sbjct: 985  ELESKAAYDSRPGKRRK 1001



 Score = 33.1 bits (75), Expect = 0.081
 Identities = 21/77 (27%), Positives = 33/77 (42%), Gaps = 6/77 (7%)

Query: 98  QLFKAGARDETAREEEEEEEEEEEEEEEQE------QDEYLEEDENEKERRLPEEDERGE 151
             F+ G         ++E +E EEE  E E       DE  E++E+++      EDE   
Sbjct: 913 SFFEDGGWSFLMVGSDDESDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESEN 972

Query: 152 DEEKKEEEEDKQGEEKR 168
           D   +E+ ED    E +
Sbjct: 973 DSSDEEDGEDWDELESK 989



 Score = 32.7 bits (74), Expect = 0.10
 Identities = 21/65 (32%), Positives = 31/65 (47%), Gaps = 9/65 (13%)

Query: 117 EEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDE 176
            ++E +E E+E  EY    ++E        DE  EDEE  E  ED   +E   E+   DE
Sbjct: 927 SDDESDESEEEVSEYEASSDDES-------DETDEDEESDESSEDLSEDES--ENDSSDE 977

Query: 177 KRLEE 181
           +  E+
Sbjct: 978 EDGED 982


>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
           chain; Provisional.
          Length = 1033

 Score = 39.4 bits (92), Expect = 7e-04
 Identities = 21/75 (28%), Positives = 40/75 (53%)

Query: 115 EEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIG 174
           EEE+   +  E+E +E LE           +++   EDE++ EE++++     + E    
Sbjct: 1   EEEQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKAEISKR 60

Query: 175 DEKRLEENKKKKTQK 189
           ++ RL+E KK+K Q+
Sbjct: 61  EKARLKELKKQKKQE 75



 Score = 28.2 bits (63), Expect = 3.0
 Identities = 18/89 (20%), Positives = 37/89 (41%), Gaps = 3/89 (3%)

Query: 96  ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK 155
           E +     A +E   EE E        + + ++    +EDE+E++    EE E     E 
Sbjct: 1   EEEQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEED---DEEAESPAKAEI 57

Query: 156 KEEEEDKQGEEKRVEDKIGDEKRLEENKK 184
            + E+ +  E K+ + +   +   ++N  
Sbjct: 58  SKREKARLKELKKQKKQEIQKILEQQNAA 86



 Score = 27.1 bits (60), Expect = 7.6
 Identities = 10/50 (20%), Positives = 25/50 (50%)

Query: 97  NQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEE 146
            +L++   R      ++ + ++  +  E +E  + L  +E E++ +L EE
Sbjct: 773 TELYEKEVRYLMQAHQKGQLKDTIDVAEPEEPGDPLTAEEQEEKEQLLEE 822


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
           family consists of several hypothetical bacterial
           proteins of around 200 residues in length. The function
           of this family is unknown.
          Length = 214

 Score = 38.6 bits (90), Expect = 8e-04
 Identities = 22/81 (27%), Positives = 41/81 (50%)

Query: 92  NILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGE 151
            +   +      A DE   ++ +++E  E EE ++E+ E    ++ E +    +EDE  E
Sbjct: 33  QLFFPSSPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESE 92

Query: 152 DEEKKEEEEDKQGEEKRVEDK 172
           +E ++E+EE     EK  E+K
Sbjct: 93  EENEEEDEESSDENEKETEEK 113



 Score = 36.6 bits (85), Expect = 0.004
 Identities = 23/74 (31%), Positives = 42/74 (56%), Gaps = 2/74 (2%)

Query: 101 KAGARDETAREEEEEEEEEE--EEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEE 158
           K     ETA  EE +EEE+E    E+++++ +  +EDE  +E    E++E  ++ EK+ E
Sbjct: 52  KKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETE 111

Query: 159 EEDKQGEEKRVEDK 172
           E+ +   EK + + 
Sbjct: 112 EKTESNVEKEITNP 125



 Score = 32.4 bits (74), Expect = 0.087
 Identities = 26/84 (30%), Positives = 45/84 (53%), Gaps = 4/84 (4%)

Query: 98  QLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKE 157
           QLF   +  + A  +E+E ++ +++E  + +    E  E EKE    E+ E   D EK++
Sbjct: 33  QLFFPSSPSDQAAADEQEAKKSDDQETAEIE----EVKEEEKEAANSEDKEDKGDAEKED 88

Query: 158 EEEDKQGEEKRVEDKIGDEKRLEE 181
           EE +++ EE+  E    +EK  EE
Sbjct: 89  EESEEENEEEDEESSDENEKETEE 112



 Score = 29.3 bits (66), Expect = 0.93
 Identities = 20/73 (27%), Positives = 40/73 (54%), Gaps = 1/73 (1%)

Query: 118 EEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEE-EDKQGEEKRVEDKIGDE 176
            ++   +EQE  +  +++  E E    EE E    E+K+++   +K+ EE   E++  DE
Sbjct: 41  SDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDE 100

Query: 177 KRLEENKKKKTQK 189
           +  +EN+K+  +K
Sbjct: 101 ESSDENEKETEEK 113



 Score = 29.3 bits (66), Expect = 0.95
 Identities = 17/57 (29%), Positives = 31/57 (54%)

Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDK 162
            E A  E++E++ + E+E+E+ ++E  EEDE   +    E +E+ E   +KE     
Sbjct: 70  KEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTESNVEKEITNPS 126


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
           Rpc31.  RNA polymerase III contains seventeen subunits
           in yeasts and in human cells. Twelve of these are akin
           to RNA polymerase I or II and the other five are RNA pol
           III-specific, and form the functionally distinct groups
           (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
           Rpc34 and Rpc82 form a cluster of enzyme-specific
           subunits that contribute to transcription initiation in
           S.cerevisiae and H.sapiens. There is evidence that these
           subunits are anchored at or near the N-terminal Zn-fold
           of Rpc1, itself prolonged by a highly conserved but RNA
           polymerase III-specific domain.
          Length = 221

 Score = 38.6 bits (90), Expect = 9e-04
 Identities = 19/77 (24%), Positives = 41/77 (53%)

Query: 85  AVLRLKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLP 144
           +V+ + K    +  L K   +     EEEE+ +E+    E++ ++   E+ + E E+   
Sbjct: 118 SVMGINKKAGKKLALSKFKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEE 177

Query: 145 EEDERGEDEEKKEEEED 161
           EE+E  E++E  ++++D
Sbjct: 178 EEEEEEEEDEDFDDDDD 194



 Score = 33.6 bits (77), Expect = 0.040
 Identities = 14/30 (46%), Positives = 26/30 (86%)

Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKE 140
            +EE+E++EEEEEEE+E+DE  ++D+++ +
Sbjct: 168 VDEEDEKDEEEEEEEEEEDEDFDDDDDDDD 197



 Score = 33.2 bits (76), Expect = 0.045
 Identities = 18/49 (36%), Positives = 33/49 (67%)

Query: 89  LKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDEN 137
            +K  + E +L +  A D    +E++EEEEEEEEEE+++ D+  ++D++
Sbjct: 150 DEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDDDD 198



 Score = 32.8 bits (75), Expect = 0.065
 Identities = 19/68 (27%), Positives = 37/68 (54%), Gaps = 6/68 (8%)

Query: 99  LFKAGARDETAR-EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKE 157
           LF     D   +    E++ +E E E+  E+DE  EE+E E+E    E+++  +D++  +
Sbjct: 141 LFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEE---EEDEDFDDDDD--D 195

Query: 158 EEEDKQGE 165
           +++D   E
Sbjct: 196 DDDDYNAE 203



 Score = 30.9 bits (70), Expect = 0.34
 Identities = 14/57 (24%), Positives = 28/57 (49%), Gaps = 3/57 (5%)

Query: 129 DEYLEEDENEKERRLPEEDERGEDEEKKEEEEDK---QGEEKRVEDKIGDEKRLEEN 182
               EED +EK   L ++ +  E E+  EE+E     + EE+  ++   D+   +++
Sbjct: 142 FTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDDDD 198


>gnl|CDD|222976 PHA03087, PHA03087, G protein-coupled chemokine receptor-like
          protein; Provisional.
          Length = 335

 Score = 38.6 bits (90), Expect = 9e-04
 Identities = 19/69 (27%), Positives = 38/69 (55%), Gaps = 3/69 (4%)

Query: 4  EVSFGLDAASWNIWAVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMV 63
          +++ G D  S  +  VV   + F    GNI+V   ++  K ++   + +L++LA++DL +
Sbjct: 31 DLNIGYDTNSTIL-IVVYSTIFFFGLVGNIIVIYVLTKTK-IKTPMDIYLLNLAVSDL-L 87

Query: 64 AVLVMPLGI 72
           V+ +P  I
Sbjct: 88 FVMTLPFQI 96


>gnl|CDD|237867 PRK14953, PRK14953, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 486

 Score = 38.7 bits (90), Expect = 0.001
 Identities = 30/114 (26%), Positives = 60/114 (52%), Gaps = 9/114 (7%)

Query: 85  AVLRLK--KNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERR 142
           A+L+L   K+I+  ++L K G++ ET  + E++EEE+EE + ++   + + ++E +    
Sbjct: 347 AILKLLYVKDIVPISELLKEGSKQETKEQPEKKEEEKEELDIDKIILQ-IIKNEGKIISA 405

Query: 143 LPEEDERGEDEEK------KEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
           + +  E  E+E K      K EE+    E K ++      +  E  K+K+ +K 
Sbjct: 406 ILKNAEIKEEEGKITIKVEKSEEDTLDLEIKSIKKYFPFIEFEEVKKEKEKEKE 459


>gnl|CDD|218538 pfam05285, SDA1, SDA1.  This family consists of several SDA1
           protein homologues. SDA1 is a Saccharomyces cerevisiae
           protein which is involved in the control of the actin
           cytoskeleton. The protein is essential for cell
           viability and is localised in the nucleus.
          Length = 317

 Score = 38.5 bits (90), Expect = 0.001
 Identities = 25/97 (25%), Positives = 51/97 (52%), Gaps = 3/97 (3%)

Query: 96  ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQD---EYLEEDENEKERRLPEEDERGED 152
           E +  K  A      +++++EEEE E EE+++ D   E+++ + +++      EDE  +D
Sbjct: 80  EEERKKKEAEQGLESDDDDDEEEEWEVEEDEDSDDEGEWIDVESDKEIESSDSEDEEEKD 139

Query: 153 EEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQK 189
           E  K+ +ED   E    +++   E+   E +K+K  +
Sbjct: 140 EAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEKASE 176



 Score = 29.2 bits (66), Expect = 1.3
 Identities = 21/75 (28%), Positives = 38/75 (50%), Gaps = 5/75 (6%)

Query: 112 EEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVED 171
           E+ +EEE +++E EQ  +   ++DE E+      E E  ED + + E  D + +++    
Sbjct: 76  EKWKEEERKKKEAEQGLESDDDDDEEEEW-----EVEEDEDSDDEGEWIDVESDKEIESS 130

Query: 172 KIGDEKRLEENKKKK 186
              DE+  +E  KK 
Sbjct: 131 DSEDEEEKDEAAKKA 145


>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
           [Intracellular trafficking and secretion].
          Length = 776

 Score = 38.4 bits (89), Expect = 0.001
 Identities = 30/109 (27%), Positives = 42/109 (38%), Gaps = 13/109 (11%)

Query: 87  LRLKKNILCENQLFK-AGARDETAREEEEEEEEEEE---------EEEEQEQDEYLEEDE 136
           LR   N+     L K  G        E+   EE E             E++ D+ L  ++
Sbjct: 313 LRFLDNLDFLQFLSKYIGCLWRNLNNEKWGNEEAERKDYILDSSSVPLEKQFDDILYFEK 372

Query: 137 NEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKK 185
            E E R PEE E    EE+ E+ ED+    KR+ D    E       +K
Sbjct: 373 MEIENRNPEESE---HEEEVEDYEDENDHSKRICDDDELENHFRAADEK 418


>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572).  Family of
           eukaryotic proteins with undetermined function.
          Length = 321

 Score = 38.2 bits (89), Expect = 0.001
 Identities = 20/87 (22%), Positives = 36/87 (41%), Gaps = 6/87 (6%)

Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
           E     E++EEEEEEEE+E          E E++RR  ++++  +DEE  +     +   
Sbjct: 174 EALFRREKKEEEEEEEEDEALIKSLSFGPETEEDRRRADDEDSEDDEEDNDNTPSPKSGS 233

Query: 167 KR------VEDKIGDEKRLEENKKKKT 187
                   +  K   ++    +  K  
Sbjct: 234 SSPAKPTSILKKSAAKRSEAPSSSKAK 260



 Score = 27.4 bits (61), Expect = 4.3
 Identities = 28/108 (25%), Positives = 40/108 (37%), Gaps = 32/108 (29%)

Query: 103 GARDETAREEEEEEEEEEEEEEEQEQ---------------------DEYLEEDENEKER 141
             R+  A + +EE+EE  E+E E+E                       E LEE +  + R
Sbjct: 104 ATRNYEADKLDEEQEERVEKEREEELAGDAMKKLENRTADSKREMEVLERLEELKELQSR 163

Query: 142 R--------LPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEE 181
           R        L     R + EE++EEEED   E        G E   + 
Sbjct: 164 RADVDVNSMLEALFRREKKEEEEEEEED---EALIKSLSFGPETEEDR 208


>gnl|CDD|237171 PRK12678, PRK12678, transcription termination factor Rho;
           Provisional.
          Length = 672

 Score = 38.0 bits (89), Expect = 0.002
 Identities = 19/90 (21%), Positives = 38/90 (42%), Gaps = 4/90 (4%)

Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
           +A  R    +  E  E+   E   +  +    EE+E ++ RR  + ++R  + E+ E   
Sbjct: 135 EAARRGAARKAGEGGEQPATEARADAAERT--EEEERDERRRRGDREDRQAEAERGERGR 192

Query: 161 DKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
             + E  R  D      R E+  +++ + R
Sbjct: 193 --REERGRDGDDRDRRDRREQGDRREERGR 220


>gnl|CDD|221581 pfam12446, DUF3682, Protein of unknown function (DUF3682).  This
           domain family is found in eukaryotes, and is typically
           between 125 and 136 amino acids in length.
          Length = 133

 Score = 36.7 bits (85), Expect = 0.002
 Identities = 15/29 (51%), Positives = 19/29 (65%)

Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQDE 130
           AG    T    +EEEEEEEE E++Q+ DE
Sbjct: 85  AGTTSGTGHTRQEEEEEEEENEKQQQSDE 113



 Score = 26.3 bits (58), Expect = 7.6
 Identities = 10/26 (38%), Positives = 16/26 (61%)

Query: 103 GARDETAREEEEEEEEEEEEEEEQEQ 128
           G       EEEEEEE E++++ ++ Q
Sbjct: 90  GTGHTRQEEEEEEEENEKQQQSDEAQ 115


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 37.4 bits (87), Expect = 0.003
 Identities = 17/85 (20%), Positives = 35/85 (41%)

Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
            E   +E + E++ +E E  +E+       E  +   + E      D++ +EE +D+   
Sbjct: 200 QEEYEKEVKAEKKRQELERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESAW 259

Query: 166 EKRVEDKIGDEKRLEENKKKKTQKR 190
           E    +     K +   +K K Q+ 
Sbjct: 260 EGFESEYEPINKPVRPKRKTKAQRN 284



 Score = 32.0 bits (73), Expect = 0.15
 Identities = 17/85 (20%), Positives = 37/85 (43%), Gaps = 8/85 (9%)

Query: 112 EEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDER------GEDEEKKEEEEDKQGE 165
           E++ +E E  EE++ E+        +E    L EE +        ++   +  E + +  
Sbjct: 210 EKKRQELERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESAWEGFESEYEPI 269

Query: 166 EKRVEDKIGDEKRLEENKKKKTQKR 190
            K V  K   + + + NK+K+ ++ 
Sbjct: 270 NKPVRPKR--KTKAQRNKEKRRKEL 292



 Score = 28.1 bits (63), Expect = 3.4
 Identities = 21/85 (24%), Positives = 39/85 (45%), Gaps = 9/85 (10%)

Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEY--------LEEDENEKERRLPEEDERGED 152
           KA    E  R+E E E +EE++ +++             + + E  + R+  +  ERGE 
Sbjct: 280 KAQRNKEKRRKELEREAKEEKQLKKKLAQLARLKEIAKEVAQKEKARARKKEQRKERGEK 339

Query: 153 EEKKEEEEDK-QGEEKRVEDKIGDE 176
           ++ K  +  K +  E  +E  + DE
Sbjct: 340 KKLKRRKLGKHKYPEPPLEVVLPDE 364


>gnl|CDD|222193 pfam13520, AA_permease_2, Amino acid permease. 
          Length = 425

 Score = 37.3 bits (87), Expect = 0.003
 Identities = 9/62 (14%), Positives = 26/62 (41%), Gaps = 10/62 (16%)

Query: 14  WNIWAVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVMPLGIL 73
           W  + +   IL+       ++    I    ++QN+    ++ L +  +++ +    LG++
Sbjct: 116 WLTYGIAIAILIIFA----LINIRGIKESAKIQNIL--GIVKLLLPLILIIL----LGLV 165

Query: 74  TL 75
             
Sbjct: 166 LA 167


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 37.2 bits (86), Expect = 0.003
 Identities = 26/106 (24%), Positives = 55/106 (51%), Gaps = 1/106 (0%)

Query: 85  AVLRLKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLP 144
           AV R++K            A++      +EEE+E+E+ +EE+++ +  +  E  K+R+  
Sbjct: 76  AVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKK-EKPKEEPKDRKPK 134

Query: 145 EEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
           EE +     ++KE+E++K+ EE R  ++    +R+    + K   +
Sbjct: 135 EEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPK 180



 Score = 35.6 bits (82), Expect = 0.010
 Identities = 21/91 (23%), Positives = 49/91 (53%), Gaps = 3/91 (3%)

Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEED---ERGEDEEKKEEE 159
           G  +E  +E+ +EE+++++E+ ++E  +   ++E +++R   E++   E+  +E +  EE
Sbjct: 103 GKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREE 162

Query: 160 EDKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
           E K+   +         K+   NKKK+  + 
Sbjct: 163 EKKRERVRAKSRPKKPPKKKPPNKKKEPPEE 193



 Score = 32.6 bits (74), Expect = 0.10
 Identities = 23/90 (25%), Positives = 46/90 (51%)

Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
           K   ++E  +++E+ +EE ++ + ++E  E     E EKE+    E+ R  +EEKK E  
Sbjct: 110 KEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERV 169

Query: 161 DKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
             +   K+   K    K+ E  +++K ++ 
Sbjct: 170 RAKSRPKKPPKKKPPNKKKEPPEEEKQRQA 199



 Score = 29.9 bits (67), Expect = 0.76
 Identities = 21/86 (24%), Positives = 44/86 (51%)

Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
           K  A+++   +E+E+E+E++ EE    ++E   E    K R      ++  +++K+  EE
Sbjct: 134 KEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEE 193

Query: 161 DKQGEEKRVEDKIGDEKRLEENKKKK 186
           +KQ +  R   K   E+     +++K
Sbjct: 194 EKQRQAAREAVKGKPEEPDVNEEREK 219



 Score = 29.1 bits (65), Expect = 1.4
 Identities = 19/86 (22%), Positives = 44/86 (51%), Gaps = 1/86 (1%)

Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK-KEEEEDKQG 164
           DE  +  E+   +    + +  ++   E  + E++ +   ++E+ + +EK KEE +D++ 
Sbjct: 74  DEAVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKP 133

Query: 165 EEKRVEDKIGDEKRLEENKKKKTQKR 190
           +E+  E +   EK  E+ KK +  + 
Sbjct: 134 KEEAKEKRPPKEKEKEKEKKVEEPRD 159


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 37.4 bits (86), Expect = 0.003
 Identities = 17/48 (35%), Positives = 27/48 (56%)

Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDER 149
           A ++    REE  E+ + E E++ +E+ E  +E E E+ER    E ER
Sbjct: 573 ASSKLAKKREEAVEKAKREAEQKAREEREREKEKEKEREREREREAER 620



 Score = 35.0 bits (80), Expect = 0.019
 Identities = 20/67 (29%), Positives = 34/67 (50%), Gaps = 9/67 (13%)

Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKR 168
           A++ EE  E+ + E E++ +    EE E EKE+    E ER E E ++E E   +     
Sbjct: 578 AKKREEAVEKAKREAEQKAR----EEREREKEK----EKER-EREREREAERAAKASSSS 628

Query: 169 VEDKIGD 175
            E ++ +
Sbjct: 629 HESRMSE 635



 Score = 33.9 bits (77), Expect = 0.050
 Identities = 19/66 (28%), Positives = 32/66 (48%)

Query: 80  TNLSGAVLRLKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEK 139
           T L+ + L  K+    E    +A  +    RE E+E+E+E E E E+E +   +   +  
Sbjct: 570 TPLASSKLAKKREEAVEKAKREAEQKAREEREREKEKEKEREREREREAERAAKASSSSH 629

Query: 140 ERRLPE 145
           E R+ E
Sbjct: 630 ESRMSE 635



 Score = 32.0 bits (72), Expect = 0.21
 Identities = 14/48 (29%), Positives = 24/48 (50%)

Query: 104 ARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGE 151
           A+ E  ++  EE E E+E+E+E+E++   E +   K      E    E
Sbjct: 588 AKREAEQKAREEREREKEKEKEREREREREAERAAKASSSSHESRMSE 635


>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO.  Bacillus
           spores are protected by a protein shell consisting of
           over 50 different polypeptides, known as the coat. This
           family of proteins has an important morphogenetic role
           in coat assembly, it is involved in the assembly of at
           least 5 different coat proteins including CotB, CotG,
           CotS, CotSA and CotW. It is likely to act at a late
           stage of coat assembly.
          Length = 185

 Score = 36.3 bits (84), Expect = 0.003
 Identities = 29/82 (35%), Positives = 45/82 (54%), Gaps = 3/82 (3%)

Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQG 164
           +    +EEE+E  +E  + +E+EQ    EE E EKE   PE   R +  E++E+EE  Q 
Sbjct: 34  KKADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPE---REDIAEQQEKEEIAQE 90

Query: 165 EEKRVEDKIGDEKRLEENKKKK 186
           EEK  E +   ++ +   K+KK
Sbjct: 91  EEKEEEAEDVKQQEVFSFKRKK 112



 Score = 28.3 bits (63), Expect = 2.0
 Identities = 24/80 (30%), Positives = 41/80 (51%), Gaps = 2/80 (2%)

Query: 96  ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQD--EYLEEDENEKERRLPEEDERGEDE 153
           EN      +++E  + E EE E+E+E  E + +D  E  E++E  +E    EE E  + +
Sbjct: 44  ENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEEEKEEEAEDVKQQ 103

Query: 154 EKKEEEEDKQGEEKRVEDKI 173
           E    +  K  +E  +E+KI
Sbjct: 104 EVFSFKRKKPFKEMNLEEKI 123


>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97 family.
            This model family represents the major capsid protein
           component of the heads (capsids) of bacteriophage HK97,
           phi-105, P27, and related phage. This model represents
           one of several analogous families lacking detectable
           sequence similarity. The gene encoding this component is
           typically located in an operon encoding the small and
           large terminase subunits, the portal protein and the
           prohead or maturation protease [Mobile and
           extrachromosomal element functions, Prophage functions].
          Length = 384

 Score = 36.9 bits (86), Expect = 0.004
 Identities = 14/72 (19%), Positives = 33/72 (45%)

Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVE 170
            +  E+E+  E EEE+ + + L+E+ ++ +  +   +E  ++ E K     + G  +  E
Sbjct: 16  RKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAASGEGGGGEEEE 75

Query: 171 DKIGDEKRLEEN 182
           ++   E      
Sbjct: 76  EEAKAEAAEFRA 87



 Score = 29.2 bits (66), Expect = 1.3
 Identities = 23/85 (27%), Positives = 39/85 (45%), Gaps = 1/85 (1%)

Query: 99  LFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEE 158
           L +     E   E+ E +  +EE ++   + + LEE  +E E +     E G  EE++EE
Sbjct: 18  LTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAASGEGGGGEEEEEE 77

Query: 159 EEDKQGEEKRVEDKIGDEKRLEENK 183
            + +  E +      GD+   EE K
Sbjct: 78  AKAEAAEFRAYLRG-GDDALAEERK 101


>gnl|CDD|227504 COG5177, COG5177, Uncharacterized conserved protein [Function
           unknown].
          Length = 769

 Score = 37.0 bits (85), Expect = 0.004
 Identities = 16/71 (22%), Positives = 32/71 (45%), Gaps = 5/71 (7%)

Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
           + T    ++++ +E + EE    +E   +D    E   PEE+ER     +  E  D + E
Sbjct: 407 ESTMSAIDDDDPKENDNEEVAGDEESAIDDNEGFEELSPEEEER-----QLREFRDMEKE 461

Query: 166 EKRVEDKIGDE 176
           ++   D+   +
Sbjct: 462 DREFPDEAELQ 472



 Score = 29.3 bits (65), Expect = 1.3
 Identities = 16/72 (22%), Positives = 29/72 (40%), Gaps = 1/72 (1%)

Query: 120 EEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKR- 178
             E+EE+E  +  +E+         +  E   +E   +EE      E   E    +E+R 
Sbjct: 392 WAEDEEEEDGQCNDEESTMSAIDDDDPKENDNEEVAGDEESAIDDNEGFEELSPEEEERQ 451

Query: 179 LEENKKKKTQKR 190
           L E +  + + R
Sbjct: 452 LREFRDMEKEDR 463


>gnl|CDD|218734 pfam05758, Ycf1, Ycf1.  The chloroplast genomes of most higher
           plants contain two giant open reading frames designated
           ycf1 and ycf2. Although the function of Ycf1 is unknown,
           it is known to be an essential gene.
          Length = 832

 Score = 36.9 bits (86), Expect = 0.004
 Identities = 24/73 (32%), Positives = 34/73 (46%), Gaps = 11/73 (15%)

Query: 106 DETAREEEEEEEEEEEEE-------EEQEQDEYLEED----ENEKERRLPEEDERGEDEE 154
            ET+  EE EEE + E E        +QEQ+   EED      EKE     ED    +  
Sbjct: 227 KETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLEIL 286

Query: 155 KKEEEEDKQGEEK 167
           K++++E+    EK
Sbjct: 287 KEKKDEELFWFEK 299



 Score = 34.6 bits (80), Expect = 0.025
 Identities = 15/55 (27%), Positives = 26/55 (47%)

Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEE 159
            D       E +  ++E+E   E+D  L  +E E   +  + D+    +EKK+EE
Sbjct: 239 TDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLEILKEKKDEE 293



 Score = 33.4 bits (77), Expect = 0.058
 Identities = 24/73 (32%), Positives = 33/73 (45%), Gaps = 11/73 (15%)

Query: 116 EEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ--GEEKRVEDKI 173
           +E  E EE E+E D  +E            E +  + E++   EED     EEK   DK 
Sbjct: 227 KETSETEEREEETDVEIETT---------SETKGTKQEQEGSTEEDPSLFSEEKEDPDKT 277

Query: 174 GDEKRLEENKKKK 186
            D  +LE  K+KK
Sbjct: 278 EDLDKLEILKEKK 290


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 36.7 bits (85), Expect = 0.004
 Identities = 21/81 (25%), Positives = 48/81 (59%), Gaps = 5/81 (6%)

Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
           +     ++ +E  +E  ++ Q+  + L E ++E +    E + + ++E+ K+EE +K+ +
Sbjct: 359 NTDLSGDKRQELLKEYNKKLQDYTKKLGEVKDETDAS-EEAEAKAKEEKLKQEENEKKQK 417

Query: 166 EKRVEDKIGDEKRLEENKKKK 186
           E+  EDK   EKR +++++KK
Sbjct: 418 EQADEDK---EKR-QKDERKK 434



 Score = 30.9 bits (70), Expect = 0.42
 Identities = 13/52 (25%), Positives = 31/52 (59%)

Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKK 156
           +     ++E +  EE E + ++E+ +  E ++ +KE+   ++++R +DE KK
Sbjct: 383 KKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434


>gnl|CDD|221250 pfam11831, Myb_Cef, pre-mRNA splicing factor component.  This
           family is a region of the Myb-Related Cdc5p/Cef1
           proteins, in fungi, and is part of the pre-mRNA splicing
           factor complex.
          Length = 363

 Score = 36.2 bits (84), Expect = 0.006
 Identities = 19/43 (44%), Positives = 26/43 (60%)

Query: 114 EEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKK 156
           E E  EEEEEE +E +E LEED  +++ R    +E  E EE +
Sbjct: 146 ELELPEEEEEEPEEMEEELEEDAADRDARKRAAEEAKEQEELR 188


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 36.2 bits (84), Expect = 0.007
 Identities = 23/80 (28%), Positives = 43/80 (53%), Gaps = 2/80 (2%)

Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
           +  R   E+E  E+E +E QEQ   L+E     E+ +  E+  G+ EE +EE E+ +   
Sbjct: 820 KLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEI--ENLNGKKEELEEELEELEAAL 877

Query: 167 KRVEDKIGDEKRLEENKKKK 186
           + +E ++GD K+  +  + +
Sbjct: 878 RDLESRLGDLKKERDELEAQ 897


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 35.7 bits (83), Expect = 0.009
 Identities = 9/90 (10%), Positives = 45/90 (50%)

Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
                 +     + +++++++++++ + D+  ++D++E +    ++D+  +++E+K+E +
Sbjct: 117 DLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAK 176

Query: 161 DKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
           + +      +    ++      + +K  K 
Sbjct: 177 ELEKLSDDDDFVWDEDDSEALRQARKDAKL 206



 Score = 31.1 bits (71), Expect = 0.36
 Identities = 11/77 (14%), Positives = 36/77 (46%), Gaps = 1/77 (1%)

Query: 96  ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK 155
            NQ       D+    ++++ ++++++E++ E D+  ++ ++E E +   ++     ++ 
Sbjct: 127 LNQADDDDDDDDDDDLDDDDIDDDDDDEDDDE-DDDDDDVDDEDEEKKEAKELEKLSDDD 185

Query: 156 KEEEEDKQGEEKRVEDK 172
               ++   E  R   K
Sbjct: 186 DFVWDEDDSEALRQARK 202



 Score = 30.3 bits (69), Expect = 0.60
 Identities = 11/77 (14%), Positives = 39/77 (50%)

Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
           K   +DE    ++ E++   +++++    + ++      +    ++D+  +D++  ++++
Sbjct: 93  KKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDD 152

Query: 161 DKQGEEKRVEDKIGDEK 177
           D+  +E   +D + DE 
Sbjct: 153 DEDDDEDDDDDDVDDED 169



 Score = 28.0 bits (63), Expect = 3.1
 Identities = 12/81 (14%), Positives = 43/81 (53%)

Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
            A  +D+     ++ +   + ++++ + D+   +D++  +    E+D+  +D++  ++E+
Sbjct: 110 NALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDED 169

Query: 161 DKQGEEKRVEDKIGDEKRLEE 181
           +++ E K +E    D+  + +
Sbjct: 170 EEKKEAKELEKLSDDDDFVWD 190


>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein.  The YqfQ-like protein family
           includes the B. subtilis YqfQ protein, also known as
           VrrA, which is functionally uncharacterized. This family
           of proteins is found in bacteria. Proteins in this
           family are typically between 146 and 237 amino acids in
           length. There are two conserved sequence motifs: QYGP
           and PKLY.
          Length = 155

 Score = 34.7 bits (80), Expect = 0.009
 Identities = 18/57 (31%), Positives = 32/57 (56%), Gaps = 3/57 (5%)

Query: 108 TAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQG 164
           ++ +EEEE EEE  +E EQE     + +  EK++R   + +    E++K + E K+ 
Sbjct: 93  SSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKT---EKEKPKTEPKKP 146



 Score = 31.6 bits (72), Expect = 0.13
 Identities = 23/85 (27%), Positives = 36/85 (42%), Gaps = 23/85 (27%)

Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQG 164
           R+ ++ ++EEEE EEE  +E +++D                        E K E ++K+ 
Sbjct: 89  RELSSSDDEEEETEEESTDETEQEDP----------------------PETKTESKEKKK 126

Query: 165 EEKRVEDKIGDEKRLEENKKKKTQK 189
            E   + K   EK   E KK K  K
Sbjct: 127 REVP-KPKTEKEKPKTEPKKPKPSK 150



 Score = 26.3 bits (58), Expect = 7.7
 Identities = 10/47 (21%), Positives = 21/47 (44%)

Query: 144 PEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
             +DE  E EE+  +E +++   +   +    +KR     K + +K 
Sbjct: 93  SSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKP 139


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 35.9 bits (83), Expect = 0.010
 Identities = 26/54 (48%), Positives = 35/54 (64%), Gaps = 5/54 (9%)

Query: 136 ENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQK 189
           E E E+ +P+E+E  E EEKKEEEE    +E+ V     DE+  +E KKKKT+K
Sbjct: 28  EKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEV-----DEEEEKEEKKKKTKK 76



 Score = 34.7 bits (80), Expect = 0.022
 Identities = 19/44 (43%), Positives = 28/44 (63%)

Query: 113 EEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKK 156
           E+E E+E  +EEE+E+ E  +E+E +   +  E DE  E EEKK
Sbjct: 28  EKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKK 71



 Score = 34.4 bits (79), Expect = 0.029
 Identities = 18/50 (36%), Positives = 31/50 (62%)

Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
           +EEEEEE+EE++EEE++  +  EE + E+E+   ++  +   E   E E 
Sbjct: 37  DEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWEL 86



 Score = 29.7 bits (67), Expect = 1.0
 Identities = 16/51 (31%), Positives = 31/51 (60%)

Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKK 156
           +E   +EE++EEEE+  ++E+E DE  E++E +K+ +  +E     +   K
Sbjct: 39  EEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELLNK 89


>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein.  This
           family includes the radial spoke head proteins RSP4 and
           RSP6 from Chlamydomonas reinhardtii, and several
           eukaryotic homologues, including mammalian RSHL1, the
           protein product of a familial ciliary dyskinesia
           candidate gene.
          Length = 481

 Score = 35.4 bits (82), Expect = 0.011
 Identities = 17/39 (43%), Positives = 26/39 (66%)

Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRL 143
           R      E+++EEEE+E+EEE+E++E  EE E E+   L
Sbjct: 338 RCTWVNPEQKDEEEEQEDEEEEEEEEEPEEPEPEEGPPL 376



 Score = 34.7 bits (80), Expect = 0.024
 Identities = 27/86 (31%), Positives = 37/86 (43%), Gaps = 32/86 (37%)

Query: 113 EEEEEEEEEEEEEQEQDEYLEEDE--------------------------------NEKE 140
            EEEEEEEEEEE  E+DE  E  +                                N ++
Sbjct: 287 LEEEEEEEEEEEPAERDELEENPDFEGLEVRELADSLDNWVHHVQHILPQGRCTWVNPEQ 346

Query: 141 RRLPEEDERGEDEEKKEEEEDKQGEE 166
           +   EE E  E+EE++EE E+ + EE
Sbjct: 347 KDEEEEQEDEEEEEEEEEPEEPEPEE 372



 Score = 34.7 bits (80), Expect = 0.024
 Identities = 22/44 (50%), Positives = 25/44 (56%)

Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERG 150
           E    E EEEEEEEE  E  E+DE  EE+E E E    EE+  G
Sbjct: 164 EVELREGEEEEEEEEVGEADEEDEGEEEEEEEPEEVPKEEEGTG 207



 Score = 31.2 bits (71), Expect = 0.29
 Identities = 26/90 (28%), Positives = 39/90 (43%), Gaps = 26/90 (28%)

Query: 112 EEEEEEEEEEEEEEQEQDEYLEEDENE-------------------------KERRLPEE 146
            EEEEEEEEEEE   E+DE  E  + E                         +   +  E
Sbjct: 287 LEEEEEEEEEEEPA-ERDELEENPDFEGLEVRELADSLDNWVHHVQHILPQGRCTWVNPE 345

Query: 147 DERGEDEEKKEEEEDKQGEEKRVEDKIGDE 176
            +  E+E++ EEEE+++ E +  E + G  
Sbjct: 346 QKDEEEEQEDEEEEEEEEEPEEPEPEEGPP 375



 Score = 28.5 bits (64), Expect = 2.5
 Identities = 15/35 (42%), Positives = 18/35 (51%)

Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKE 140
           +E   EEEEEEEEE EE E +E    L     +  
Sbjct: 351 EEQEDEEEEEEEEEPEEPEPEEGPPLLTPISEDAP 385



 Score = 26.9 bits (60), Expect = 6.6
 Identities = 13/25 (52%), Positives = 15/25 (60%)

Query: 101 KAGARDETAREEEEEEEEEEEEEEE 125
           K    ++   EEEEEEEE EE E E
Sbjct: 347 KDEEEEQEDEEEEEEEEEPEEPEPE 371



 Score = 26.9 bits (60), Expect = 7.4
 Identities = 17/56 (30%), Positives = 29/56 (51%)

Query: 84  GAVLRLKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEK 139
           G +L  +K+        + G  +E   E  E +EE+E EEEE+E+ E + ++E   
Sbjct: 151 GKILGTQKDYYVAEVELREGEEEEEEEEVGEADEEDEGEEEEEEEPEEVPKEEEGT 206


>gnl|CDD|225880 COG3343, RpoE, DNA-directed RNA polymerase, delta subunit
           [Transcription].
          Length = 175

 Score = 34.7 bits (80), Expect = 0.012
 Identities = 25/66 (37%), Positives = 42/66 (63%)

Query: 96  ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK 155
           E   F+ G  DE   +E++EEEE++E +   ++++  +ED++E    L E+DE  EDE+ 
Sbjct: 106 EVDAFEEGDEDELDYDEDKEEEEDDEVDSLDDENDDEDEDDDEIVEILIEDDEVDEDEDD 165

Query: 156 KEEEED 161
            E+EED
Sbjct: 166 DEDEED 171



 Score = 28.2 bits (63), Expect = 2.0
 Identities = 21/57 (36%), Positives = 34/57 (59%), Gaps = 2/57 (3%)

Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDK 162
           DE   EEE++E +  ++E + E ++  E  E   E    +EDE  +D+E +E+EEDK
Sbjct: 121 DEDKEEEEDDEVDSLDDENDDEDEDDDEIVEILIEDDEVDEDE--DDDEDEEDEEDK 175


>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
          Length = 619

 Score = 35.5 bits (83), Expect = 0.012
 Identities = 19/87 (21%), Positives = 44/87 (50%)

Query: 99  LFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEE 158
            F     +E       E EE +++E+E+E+++  ++     E  LPE+         K+ 
Sbjct: 172 GFVDPNAEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADESELPEKVLEKFKALAKQY 231

Query: 159 EEDKQGEEKRVEDKIGDEKRLEENKKK 185
           ++ ++ +EK+VE ++   K+  + ++K
Sbjct: 232 KKLRKAQEKKVEGRLAQHKKYAKLREK 258



 Score = 30.9 bits (71), Expect = 0.36
 Identities = 17/78 (21%), Positives = 37/78 (47%), Gaps = 5/78 (6%)

Query: 113 EEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDK 172
             +   EE+      + E L++DE+E+E     E++  +D    +E E  +   ++ +  
Sbjct: 173 FVDPNAEEDPAHVGSELEELDDDEDEEE-----EEDENDDSLAADESELPEKVLEKFKAL 227

Query: 173 IGDEKRLEENKKKKTQKR 190
               K+L + ++KK + R
Sbjct: 228 AKQYKKLRKAQEKKVEGR 245


>gnl|CDD|237284 PRK13108, PRK13108, prolipoprotein diacylglyceryl transferase;
           Reviewed.
          Length = 460

 Score = 35.3 bits (81), Expect = 0.013
 Identities = 12/69 (17%), Positives = 21/69 (30%), Gaps = 3/69 (4%)

Query: 102 AGARDETAR---EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEE 158
             A +ET+    E E+  +   +     + D        E+   L  E     + E  E+
Sbjct: 360 TPAVEETSEADIEREQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPEVPEK 419

Query: 159 EEDKQGEEK 167
                   K
Sbjct: 420 AAPIPDPAK 428



 Score = 31.5 bits (71), Expect = 0.22
 Identities = 14/78 (17%), Positives = 25/78 (32%), Gaps = 3/78 (3%)

Query: 104 ARDETAREEEEEEEEEEEEE-EEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEED- 161
            +   A + + E      EE      + + E +    E+  P  D    DE       D 
Sbjct: 381 GQAPAAHQVDAEAASAAPEEPAALASEAHDETEPEVPEKAAPIPDPAKPDELAVAGPGDD 440

Query: 162 -KQGEEKRVEDKIGDEKR 178
             + +  R +D     +R
Sbjct: 441 PAEPDGIRRQDDFSSRRR 458


>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
          Length = 746

 Score = 35.2 bits (82), Expect = 0.013
 Identities = 19/85 (22%), Positives = 33/85 (38%)

Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
           +E  +EE + E EEE + E +E      E+E + E       E   +     E   +  E
Sbjct: 41  EEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAAAAPAEEAAEAAAAAEAAARPAE 100

Query: 166 EKRVEDKIGDEKRLEENKKKKTQKR 190
           ++         +R +  K  K +K 
Sbjct: 101 DEAARPAEAAARRPKAKKAAKKKKG 125


>gnl|CDD|150923 pfam10327, 7TM_GPCR_Sri, Serpentine type 7TM GPCR chemoreceptor
           Sri.  Chemoreception is mediated in Caenorhabditis
           elegans by members of the seven-transmembrane
           G-protein-coupled receptor class (7TM GPCRs) of proteins
           which are of the serpentine type. Sri is part of the Str
           superfamily of chemoreceptors. Chemoperception is one of
           the central senses of soil nematodes like C. elegans
           which are otherwise 'blind' and 'deaf'.
          Length = 303

 Score = 34.8 bits (81), Expect = 0.016
 Identities = 15/78 (19%), Positives = 30/78 (38%), Gaps = 13/78 (16%)

Query: 14  WNIWAVVTLILVFCTATGNILVCLAISWE-----KRLQN---VTNYFLMSLAITDLMV-- 63
           +N W +V  IL       + L+ + ++ +     KRL+       Y     A+  L+   
Sbjct: 187 YNPWLIVFGILAIFGGFLSFLILVYLTVDMFRMLKRLRKKISAQTYQRHKAALRSLVAQF 246

Query: 64  ---AVLVMPLGILTLVRV 78
               V ++P     +V +
Sbjct: 247 ATSPVCILPPFAFVVVVL 264


>gnl|CDD|192535 pfam10320, 7TM_GPCR_Srsx, Serpentine type 7TM GPCR chemoreceptor
          Srsx.  Chemoreception is mediated in Caenorhabditis
          elegans by members of the seven-transmembrane
          G-protein-coupled receptor class (7TM GPCRs) of
          proteins which are of the serpentine type. Srsx is a
          solo family amongst the superfamilies of
          chemoreceptors. Chemoperception is one of the central
          senses of soil nematodes like C. elegans which are
          otherwise 'blind' and 'deaf'.
          Length = 257

 Score = 34.9 bits (81), Expect = 0.016
 Identities = 8/46 (17%), Positives = 22/46 (47%)

Query: 30 TGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVMPLGILTL 75
           GN+++ +    +K+L++  +Y +    + DL+     +   +L  
Sbjct: 6  FGNVIMIILTFKKKKLRSKCSYLICVQCLADLLCLSGEIIFVVLLF 51


>gnl|CDD|218598 pfam05470, eIF-3c_N, Eukaryotic translation initiation factor 3
           subunit 8 N-terminus.  The largest of the mammalian
           translation initiation factors, eIF3, consists of at
           least eight subunits ranging in mass from 35 to 170 kDa.
           eIF3 binds to the 40 S ribosome in an early step of
           translation initiation and promotes the binding of
           methionyl-tRNAi and mRNA.
          Length = 593

 Score = 35.2 bits (81), Expect = 0.018
 Identities = 24/77 (31%), Positives = 40/77 (51%), Gaps = 1/77 (1%)

Query: 110 REEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEE-DERGEDEEKKEEEEDKQGEEKR 168
           RE+ E E+EEEEE+E+ + D   +EDE+E      EE     E    + +E+D++ E+  
Sbjct: 132 REDPESEDEEEEEDEDDDDDGSDDEDEDEDGVGATEEVAASSESGVDRVKEDDEEDEDAD 191

Query: 169 VEDKIGDEKRLEENKKK 185
           +  K   E+     K +
Sbjct: 192 LSKKDVLEEPKMFKKPE 208


>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
           (nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
           phosphoribosyltransferase) [Coenzyme metabolism].
          Length = 620

 Score = 34.8 bits (80), Expect = 0.018
 Identities = 18/66 (27%), Positives = 29/66 (43%), Gaps = 1/66 (1%)

Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDK 162
              D     E   EE E  +E E+++ E  + +  E E    E  E  E +E  +E+ + 
Sbjct: 239 DQPDNNEDSEAGREESEGSDESEEDEAEATDGEGEEGEMDAAEASEDSESDE-SDEDTET 297

Query: 163 QGEEKR 168
            GE+ R
Sbjct: 298 PGEDAR 303



 Score = 34.0 bits (78), Expect = 0.034
 Identities = 13/66 (19%), Positives = 30/66 (45%)

Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
           D+   E+ +EE+ ++++ +  E  E   E+    +    +E E  + E ++ E +  +  
Sbjct: 224 DDGIEEDADEEDGDDDQPDNNEDSEAGREESEGSDESEEDEAEATDGEGEEGEMDAAEAS 283

Query: 166 EKRVED 171
           E    D
Sbjct: 284 EDSESD 289



 Score = 31.4 bits (71), Expect = 0.31
 Identities = 18/65 (27%), Positives = 31/65 (47%), Gaps = 3/65 (4%)

Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
           +ET  +  EE+ +EE+ +++Q  +    ED           DE  EDE +  + E ++GE
Sbjct: 220 EETGDDGIEEDADEEDGDDDQPDN---NEDSEAGREESEGSDESEEDEAEATDGEGEEGE 276

Query: 166 EKRVE 170
               E
Sbjct: 277 MDAAE 281



 Score = 29.8 bits (67), Expect = 0.80
 Identities = 20/74 (27%), Positives = 35/74 (47%), Gaps = 1/74 (1%)

Query: 98  QLFKAGARDETAREEEEEE-EEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKK 156
           Q F    RD     +  EE  ++  EE+  E+D   ++ +N ++     E+  G DE ++
Sbjct: 203 QAFARVVRDMLGSMDMAEETGDDGIEEDADEEDGDDDQPDNNEDSEAGREESEGSDESEE 262

Query: 157 EEEEDKQGEEKRVE 170
           +E E   GE +  E
Sbjct: 263 DEAEATDGEGEEGE 276



 Score = 28.3 bits (63), Expect = 3.3
 Identities = 21/73 (28%), Positives = 26/73 (35%), Gaps = 18/73 (24%)

Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLP------------------EED 147
           + T  E EE E +  E  E+ E DE  E+ E   E   P                  E D
Sbjct: 266 EATDGEGEEGEMDAAEASEDSESDESDEDTETPGEDARPATPFTELMEEVDYKVFTREFD 325

Query: 148 ERGEDEEKKEEEE 160
           E    EE  +E E
Sbjct: 326 EIVLAEELCDEAE 338


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
           the C subunit of DNA polymerase delta. It carries the
           essential residues for binding to the Pol1 subunit of
           polymerase alpha, from residues 293-332, which are
           characterized by the motif D--G--VT, referred to as the
           DPIM motif. The first 160 residues of the protein form
           the minimal domain for binding to the B subunit, Cdc1,
           of polymerase delta, the final 10 C-terminal residues,
           362-372, being the DNA sliding clamp, PCNA, binding
           motif.
          Length = 427

 Score = 34.8 bits (80), Expect = 0.018
 Identities = 21/88 (23%), Positives = 42/88 (47%), Gaps = 7/88 (7%)

Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
            +G R ++  E EE+E+E+ +  ++  +D    EDE+E+   +PE      +EE+ EE E
Sbjct: 276 PSGERSDSEEETEEKEKEKRKRLKKMMED----EDEDEEMEIVPESP---VEEEESEEPE 328

Query: 161 DKQGEEKRVEDKIGDEKRLEENKKKKTQ 188
                +K  E +          ++ + +
Sbjct: 329 PPPLPKKEEEKEEVTVSPDGGRRRGRRR 356



 Score = 33.7 bits (77), Expect = 0.045
 Identities = 26/88 (29%), Positives = 43/88 (48%), Gaps = 2/88 (2%)

Query: 95  CENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEE 154
            + +  K  A + T +EE EEE  + +   E E  E    DE+E E       ER + EE
Sbjct: 226 TKEKKEKKEASESTVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEE 285

Query: 155 KKEEEEDKQGEEKR--VEDKIGDEKRLE 180
           + EE+E ++ +  +  +ED+  DE+   
Sbjct: 286 ETEEKEKEKRKRLKKMMEDEDEDEEMEI 313



 Score = 32.1 bits (73), Expect = 0.16
 Identities = 26/92 (28%), Positives = 48/92 (52%), Gaps = 5/92 (5%)

Query: 90  KKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDER 149
           K NI+  +  FK   +++  ++E  E   +EE EEE  + + + EDE+ +   L E++  
Sbjct: 214 KSNIM--SSFFKKKTKEKKEKKEASESTVKEESEEESGKRDVILEDESAEPTGLDEDE-- 269

Query: 150 GEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEE 181
            +++E K   E    EE+  E +    KRL++
Sbjct: 270 -DEDEPKPSGERSDSEEETEEKEKEKRKRLKK 300


>gnl|CDD|219405 pfam07418, PCEMA1, Acidic phosphoprotein precursor PCEMA1.  This
           family consists of several acidic phosphoprotein
           precursor PCEMA1 sequences which appear to be found
           exclusively in Plasmodium chabaudi. PCEMA1 is an antigen
           that is associated with the membrane of the infected
           erythrocyte throughout the entire intraerythrocytic
           cycle. The exact function of this family is unclear.
          Length = 286

 Score = 34.5 bits (79), Expect = 0.019
 Identities = 24/83 (28%), Positives = 37/83 (44%), Gaps = 3/83 (3%)

Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYL---EEDENEKERRLPEEDERGEDEEKKEEEEDK 162
           DE      + E EE ++E E E   YL   E D   KE+   E  E   D+    E E +
Sbjct: 204 DEVTSYFNDGENEENDDELEAEVISYLKDGENDNEVKEKIRREYREWKGDKANTNETEIE 263

Query: 163 QGEEKRVEDKIGDEKRLEENKKK 185
              E   E++ G+E+  E+  ++
Sbjct: 264 DESEDEYEEEAGEEQENEDKGEE 286



 Score = 27.2 bits (60), Expect = 5.6
 Identities = 14/52 (26%), Positives = 23/52 (44%)

Query: 96  ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEED 147
            +   K   R E    + ++    E E E++ +DEY EE   E+E     E+
Sbjct: 235 NDNEVKEKIRREYREWKGDKANTNETEIEDESEDEYEEEAGEEQENEDKGEE 286


>gnl|CDD|217049 pfam02459, Adeno_terminal, Adenoviral DNA terminal protein.  This
           protein is covalently attached to the terminii of
           replicating DNA in vivo.
          Length = 548

 Score = 35.0 bits (81), Expect = 0.020
 Identities = 19/37 (51%), Positives = 22/37 (59%)

Query: 110 REEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEE 146
           R      E EEEEEEE+E  E  EE+E E+ER   EE
Sbjct: 297 RPPPSPPEPEEEEEEEEEVPEEEEEEEEEEERTFEEE 333



 Score = 34.2 bits (79), Expect = 0.033
 Identities = 17/28 (60%), Positives = 20/28 (71%)

Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEE 134
           E   EEEEE  EEEEEEEE+E+  + EE
Sbjct: 306 EEEEEEEEEVPEEEEEEEEEEERTFEEE 333



 Score = 33.1 bits (76), Expect = 0.078
 Identities = 18/30 (60%), Positives = 22/30 (73%)

Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDE 136
           E   EEEEEEE  EEEEEE+E++E   E+E
Sbjct: 304 EPEEEEEEEEEVPEEEEEEEEEEERTFEEE 333



 Score = 28.5 bits (64), Expect = 2.7
 Identities = 14/25 (56%), Positives = 17/25 (68%)

Query: 106 DETAREEEEEEEEEEEEEEEQEQDE 130
           +E   E  EEEEEEEEEEE   ++E
Sbjct: 309 EEEEEEVPEEEEEEEEEEERTFEEE 333



 Score = 28.1 bits (63), Expect = 3.4
 Identities = 15/39 (38%), Positives = 21/39 (53%)

Query: 110 REEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDE 148
           R        E EEEEE+E++   EE+E E+E     E+E
Sbjct: 295 RRRPPPSPPEPEEEEEEEEEVPEEEEEEEEEEERTFEEE 333



 Score = 27.7 bits (62), Expect = 3.8
 Identities = 14/22 (63%), Positives = 16/22 (72%)

Query: 106 DETAREEEEEEEEEEEEEEEQE 127
           +E   EE  EEEEEEEEEEE+ 
Sbjct: 308 EEEEEEEVPEEEEEEEEEEERT 329


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 34.8 bits (80), Expect = 0.020
 Identities = 21/88 (23%), Positives = 45/88 (51%), Gaps = 3/88 (3%)

Query: 106 DETAREEEEEEEEEEEEEEEQEQD-EYLEEDENEKERRLPEEDERGEDEE--KKEEEEDK 162
           +   ++++   ++E+E +++ EQ  E  E+    ++ R  E ++R   E+  K+ E+  K
Sbjct: 53  NRIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAK 112

Query: 163 QGEEKRVEDKIGDEKRLEENKKKKTQKR 190
           Q EEK+ + +    K+  E K K   + 
Sbjct: 113 QAEEKQKQAEEAKAKQAAEAKAKAEAEA 140



 Score = 34.4 bits (79), Expect = 0.026
 Identities = 22/85 (25%), Positives = 41/85 (48%), Gaps = 3/85 (3%)

Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ---GE 165
             E++ EE E++   E+  Q E  +    EK  +  E+  +  +E++K+ EE K     E
Sbjct: 72  KLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAE 131

Query: 166 EKRVEDKIGDEKRLEENKKKKTQKR 190
            K   +   ++K  EE KK+  ++ 
Sbjct: 132 AKAKAEAEAEKKAKEEAKKQAEEEA 156



 Score = 34.0 bits (78), Expect = 0.028
 Identities = 21/87 (24%), Positives = 44/87 (50%), Gaps = 1/87 (1%)

Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDER-GEDEEKKEEE 159
           KA  + E A ++ EE++++ EE + ++  E   + E E E++  EE ++  E+E K +  
Sbjct: 102 KAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAA 161

Query: 160 EDKQGEEKRVEDKIGDEKRLEENKKKK 186
            + + +    + K   E + +   K K
Sbjct: 162 AEAKKKAAEAKKKAEAEAKAKAEAKAK 188



 Score = 31.0 bits (70), Expect = 0.36
 Identities = 23/91 (25%), Positives = 38/91 (41%), Gaps = 3/91 (3%)

Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK-KEEE 159
           K   +     E++   E+  ++E EQ         + E+  +  EE ++  +E K K+  
Sbjct: 71  KKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAA 130

Query: 160 EDKQGEEKRVEDKIGDE--KRLEENKKKKTQ 188
           E K   E   E K  +E  K+ EE  K K  
Sbjct: 131 EAKAKAEAEAEKKAKEEAKKQAEEEAKAKAA 161



 Score = 27.1 bits (60), Expect = 5.8
 Identities = 19/93 (20%), Positives = 38/93 (40%), Gaps = 3/93 (3%)

Query: 101 KAGARDETAREEEEEEEEEEEEEEEQ---EQDEYLEEDENEKERRLPEEDERGEDEEKKE 157
           KA    E   + E E E++ +EE ++   E+ +     E +K+    ++    E + K E
Sbjct: 125 KAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAE 184

Query: 158 EEEDKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
            +   + EE + + +    K   E   K   + 
Sbjct: 185 AKAKAKAEEAKAKAEAAKAKAAAEAAAKAEAEA 217


>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457).  This is
           a family of uncharacterized proteins.
          Length = 449

 Score = 34.6 bits (79), Expect = 0.024
 Identities = 17/64 (26%), Positives = 38/64 (59%)

Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVE 170
           +E EEE  EEE+++E++ D+  +EDE++ +    E+DE  +D++    ++    +    +
Sbjct: 43  KEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLHDDSSADDGNETD 102

Query: 171 DKIG 174
           ++ G
Sbjct: 103 NEAG 106



 Score = 33.4 bits (76), Expect = 0.054
 Identities = 19/64 (29%), Positives = 34/64 (53%)

Query: 112 EEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVED 171
           EEE  EEE+++EE+ + D+  +ED+++ +    +EDE  +D    ++     G E   E 
Sbjct: 46  EEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLHDDSSADDGNETDNEA 105

Query: 172 KIGD 175
              D
Sbjct: 106 GFAD 109


>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi.  This family includes
           the fifth essential DNA polymerase in yeast EC:2.7.7.7.
           Pol5p is localised exclusively to the nucleolus and
           binds near or at the enhancer region of rRNA-encoding
           DNA repeating units.
          Length = 784

 Score = 34.5 bits (79), Expect = 0.027
 Identities = 19/72 (26%), Positives = 34/72 (47%), Gaps = 5/72 (6%)

Query: 100 FKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDER-----GEDEE 154
            +A    E+  E + E+ EE+E+E++ E +E +   +    R LP+         G D E
Sbjct: 665 CEAIEDSESESESDGEDGEEDEQEDDAEANEGVVPIDKAVRRALPKVLNLPDALDGGDSE 724

Query: 155 KKEEEEDKQGEE 166
            +E  +D+Q   
Sbjct: 725 DEEGMDDEQMMR 736



 Score = 33.7 bits (77), Expect = 0.047
 Identities = 20/85 (23%), Positives = 34/85 (40%), Gaps = 8/85 (9%)

Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDE-------NEKERRLPEEDERGEDEEKKEEE 159
           ET  +E+E E  E+ E E +   E  EEDE       NE    + +   R    +     
Sbjct: 657 ETDDDEDECEAIEDSESESESDGEDGEEDEQEDDAEANEGVVPIDKAVRR-ALPKVLNLP 715

Query: 160 EDKQGEEKRVEDKIGDEKRLEENKK 184
           +   G +   E+ + DE+ +  +  
Sbjct: 716 DALDGGDSEDEEGMDDEQMMRLDTY 740



 Score = 33.3 bits (76), Expect = 0.060
 Identities = 21/71 (29%), Positives = 39/71 (54%), Gaps = 7/71 (9%)

Query: 112 EEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVED 171
           EEE+E++ EE ++++++ E +E+ E+E E         GED E+ E+E+D +  E  V  
Sbjct: 648 EEEDEDDLEETDDDEDECEAIEDSESESE-------SDGEDGEEDEQEDDAEANEGVVPI 700

Query: 172 KIGDEKRLEEN 182
                + L + 
Sbjct: 701 DKAVRRALPKV 711



 Score = 29.5 bits (66), Expect = 1.1
 Identities = 18/68 (26%), Positives = 28/68 (41%), Gaps = 5/68 (7%)

Query: 96  ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK 155
              L K     +     + E+EE  ++E+    D YL +   EK+ R+    E      K
Sbjct: 705 RRALPKVLNLPDALDGGDSEDEEGMDDEQMMRLDTYLAQIFKEKKERIQAGGET-----K 759

Query: 156 KEEEEDKQ 163
           KE +  KQ
Sbjct: 760 KEAQSQKQ 767


>gnl|CDD|217783 pfam03896, TRAP_alpha, Translocon-associated protein (TRAP), alpha
           subunit.  The alpha-subunit of the TRAP complex (TRAP
           alpha) is a single-spanning membrane protein of the
           endoplasmic reticulum (ER) which is found in proximity
           of nascent polypeptide chains translocating across the
           membrane.
          Length = 281

 Score = 34.0 bits (78), Expect = 0.027
 Identities = 15/53 (28%), Positives = 29/53 (54%)

Query: 99  LFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGE 151
            F  G     + ++  E+EE E++  ++++++    +E+E E    EEDE GE
Sbjct: 20  TFGNGKGPFASAQDLTEDEEAEDDVVDEDEEDEAVVEEDENELTEEEEDEEGE 72



 Score = 32.8 bits (75), Expect = 0.084
 Identities = 30/101 (29%), Positives = 44/101 (43%), Gaps = 27/101 (26%)

Query: 61  LMVAVLVMPLGILTLVRVRTNLSGAVLRLKKNILCENQLFKAGARDETAREEEEEEEEEE 120
           L++ +L  P  +LT    +                      A A+D T  EE E++  +E
Sbjct: 7   LLLTLLAFPAVLLTFGNGKG-------------------PFASAQDLTEDEEAEDDVVDE 47

Query: 121 EEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEED 161
           +EE+E      +EEDENE    L EE+E  E E K   + D
Sbjct: 48  DEEDEAV----VEEDENE----LTEEEEDEEGEVKASPDAD 80


>gnl|CDD|165177 PHA02834, PHA02834, chemokine receptor-like protein; Provisional.
          Length = 323

 Score = 34.1 bits (78), Expect = 0.030
 Identities = 24/72 (33%), Positives = 39/72 (54%), Gaps = 3/72 (4%)

Query: 2  ETEVSFGLDAASWNIWAVVTLILVFCTA-TGNILVCLAISWEKRLQNVTNYFLMSLAITD 60
          E E  F +  +  N + +V  IL+F     GN+LV +A+   KR   V + +L ++A++D
Sbjct: 15 EEECDFEMVNSDVNYFVIVFYILLFIFGLIGNVLV-IAVLIVKRFMFVVDVYLFNIAMSD 73

Query: 61 LMVAVLVMPLGI 72
          LM+ V   P  I
Sbjct: 74 LML-VFSFPFII 84


>gnl|CDD|216269 pfam01056, Myc_N, Myc amino-terminal region.  The myc family
           belongs to the basic helix-loop-helix leucine zipper
           class of transcription factors, see pfam00010. Myc forms
           a heterodimer with Max, and this complex regulates cell
           growth through direct activation of genes involved in
           cell replication. Mutations in the C-terminal 20
           residues of this domain cause unique changes in the
           induction of apoptosis, transformation, and G2 arrest.
          Length = 329

 Score = 34.1 bits (78), Expect = 0.031
 Identities = 14/20 (70%), Positives = 17/20 (85%)

Query: 106 DETAREEEEEEEEEEEEEEE 125
            ++  EE+EEEEEEEEEEEE
Sbjct: 224 SDSESEEDEEEEEEEEEEEE 243



 Score = 33.4 bits (76), Expect = 0.057
 Identities = 15/19 (78%), Positives = 17/19 (89%)

Query: 111 EEEEEEEEEEEEEEEQEQD 129
           E EE+EEEEEEEEEE+E D
Sbjct: 227 ESEEDEEEEEEEEEEEEID 245



 Score = 32.2 bits (73), Expect = 0.15
 Identities = 14/39 (35%), Positives = 21/39 (53%)

Query: 110 REEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDE 148
              + E EE+EEEEEE+E++E ++    EK R       
Sbjct: 222 SGSDSESEEDEEEEEEEEEEEEIDVVTVEKRRSSSNRKA 260



 Score = 31.1 bits (70), Expect = 0.31
 Identities = 13/22 (59%), Positives = 16/22 (72%)

Query: 105 RDETAREEEEEEEEEEEEEEEQ 126
               +  EE+EEEEEEEEEEE+
Sbjct: 222 SGSDSESEEDEEEEEEEEEEEE 243



 Score = 30.3 bits (68), Expect = 0.60
 Identities = 15/27 (55%), Positives = 19/27 (70%)

Query: 97  NQLFKAGARDETAREEEEEEEEEEEEE 123
           N    +G+  E+  +EEEEEEEEEEEE
Sbjct: 217 NSSSSSGSDSESEEDEEEEEEEEEEEE 243


>gnl|CDD|177154 MTH00091, ND2, NADH dehydrogenase subunit 2; Provisional.
          Length = 273

 Score = 33.7 bits (78), Expect = 0.034
 Identities = 13/62 (20%), Positives = 26/62 (41%), Gaps = 13/62 (20%)

Query: 14 WNIWAVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLM--SLAITDLMVAVLVMPLG 71
          W+++ ++TL+ VF                    ++ NYF++  SL +  L+ +       
Sbjct: 17 WSVFLLMTLLFVFL-----------NKGMGSYSSLFNYFVLQESLGLLFLLFSFGYFQFL 65

Query: 72 IL 73
          IL
Sbjct: 66 IL 67


>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
           RPA34.5.  This is a family of proteins conserved from
           yeasts to human. Subunit A34.5 of RNA polymerase I is a
           non-essential subunit which is thought to help Pol I
           overcome topological constraints imposed on ribosomal
           DNA during the process of transcription.
          Length = 193

 Score = 33.5 bits (77), Expect = 0.038
 Identities = 22/84 (26%), Positives = 42/84 (50%), Gaps = 9/84 (10%)

Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDK 162
           GA D    E   E E  E+E   + + E   E+E +KE++  +E ++ + E+K ++E+  
Sbjct: 119 GAPDGPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMV 178

Query: 163 QGEEKRVEDKIGDEKRLEENKKKK 186
           + +           K+ ++ KKKK
Sbjct: 179 EPKG---------SKKKKKKKKKK 193



 Score = 27.0 bits (60), Expect = 5.3
 Identities = 17/60 (28%), Positives = 27/60 (45%)

Query: 131 YLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
           Y   D    E     E    E   K E+E + + EEK+ + K  + K+ ++ KK K +K 
Sbjct: 118 YGAPDGPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKM 177



 Score = 26.2 bits (58), Expect = 9.3
 Identities = 20/62 (32%), Positives = 34/62 (54%), Gaps = 3/62 (4%)

Query: 132 LEEDENEKERRLPEEDERGEDEEKKEEEEDK---QGEEKRVEDKIGDEKRLEENKKKKTQ 188
            E + +EKE     E E   +EE+K+E++ K   + E+K  +DK       + +KKKK +
Sbjct: 130 SESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKK 189

Query: 189 KR 190
           K+
Sbjct: 190 KK 191


>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II).  Bone
           sialoprotein (BSP) is a major structural protein of the
           bone matrix that is specifically expressed by
           fully-differentiated osteoblasts. The expression of bone
           sialoprotein (BSP) is normally restricted to mineralised
           connective tissues of bones and teeth where it has been
           associated with mineral crystal formation. However, it
           has been found that ectopic expression of BSP occurs in
           various lesions, including oral and extraoral
           carcinomas, in which it has been associated with the
           formation of microcrystalline deposits and the
           metastasis of cancer cells to bone.
          Length = 291

 Score = 33.5 bits (76), Expect = 0.041
 Identities = 26/73 (35%), Positives = 40/73 (54%), Gaps = 8/73 (10%)

Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDE--------NEKERRLPEEDERGED 152
           KAG   + A +E+E +E+EEEEEEE+E++  +EE+E        N  E         G++
Sbjct: 121 KAGNAGKKATKEDESDEDEEEEEEEEEEEAEVEENEQGTNGTSTNSTEVDHGNGSSGGDN 180

Query: 153 EEKKEEEEDKQGE 165
            E+ EEE   + E
Sbjct: 181 GEEGEEESVTEAE 193



 Score = 26.6 bits (58), Expect = 8.2
 Identities = 18/42 (42%), Positives = 27/42 (64%), Gaps = 1/42 (2%)

Query: 98  QLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEK 139
           QL K          +E+E +E+EEEEEE+E++E  E +ENE+
Sbjct: 117 QLPKKAGNAGKKATKEDESDEDEEEEEEEEEEEA-EVEENEQ 157


>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
          Length = 1066

 Score = 33.7 bits (77), Expect = 0.045
 Identities = 17/65 (26%), Positives = 32/65 (49%)

Query: 126 QEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKK 185
           + + E     E E ER+  +E++  E E KK +   K+ + K    +  D   + +  +K
Sbjct: 5   ESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEK 64

Query: 186 KTQKR 190
           K++KR
Sbjct: 65  KSRKR 69



 Score = 30.6 bits (69), Expect = 0.52
 Identities = 22/93 (23%), Positives = 45/93 (48%), Gaps = 8/93 (8%)

Query: 83  SGAVLRLKKNILCENQLFKAGARDETAREEEE------EEEEEEEEEEEQEQDEYLEEDE 136
           S      +K IL E +L +   ++E A+E+E       ++E + + + +Q  D      +
Sbjct: 2   SRTESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKK 61

Query: 137 NEKERRLPEEDERGEDEEKKEEEEDKQGEEKRV 169
           +EK+ R  + D   E+ E   + +   G++KR+
Sbjct: 62  SEKKSR--KRDVEDENPEDFIDPDTPFGQKKRL 92


>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
          Length = 1068

 Score = 33.9 bits (78), Expect = 0.046
 Identities = 15/77 (19%), Positives = 34/77 (44%), Gaps = 9/77 (11%)

Query: 96  ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK 155
           + Q   A  R+    E  E+   ++E+++   +         E++RR  +E  + + E K
Sbjct: 645 QAQQQTAETRESQQAEVTEKARTQDEQQQAPRR---------ERQRRRNDEKRQAQQEAK 695

Query: 156 KEEEEDKQGEEKRVEDK 172
               E++  +E   E++
Sbjct: 696 ALNVEEQSVQETEQEER 712



 Score = 32.3 bits (74), Expect = 0.14
 Identities = 17/90 (18%), Positives = 41/90 (45%), Gaps = 2/90 (2%)

Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDE--EKKEEEE 160
              +ET  +E+   + E + E +Q++ +  + +  ++  R    D R   E  E +EE  
Sbjct: 581 SGGEETKPQEQPAPKAEAKPERQQDRRKPRQNNRRDRNERRDTRDNRTRREGRENREENR 640

Query: 161 DKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
             + + ++   +  + ++ E  +K +TQ  
Sbjct: 641 RNRRQAQQQTAETRESQQAEVTEKARTQDE 670



 Score = 29.6 bits (67), Expect = 1.2
 Identities = 14/85 (16%), Positives = 37/85 (43%), Gaps = 4/85 (4%)

Query: 104 ARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ 163
            R+E  R   + +++  E  E Q+     E  E  + +   ++  R E + ++ +E+ + 
Sbjct: 635 NREENRRNRRQAQQQTAETRESQQA----EVTEKARTQDEQQQAPRRERQRRRNDEKRQA 690

Query: 164 GEEKRVEDKIGDEKRLEENKKKKTQ 188
            +E +  +      +  E +++  Q
Sbjct: 691 QQEAKALNVEEQSVQETEQEERVQQ 715



 Score = 27.3 bits (61), Expect = 5.5
 Identities = 14/63 (22%), Positives = 31/63 (49%), Gaps = 2/63 (3%)

Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
            E AR ++E+++    E + +  DE  +  +  K   L  E++  ++ E++E  +  Q  
Sbjct: 662 TEKARTQDEQQQAPRRERQRRRNDEKRQAQQEAKA--LNVEEQSVQETEQEERVQQVQPR 719

Query: 166 EKR 168
            K+
Sbjct: 720 RKQ 722


>gnl|CDD|222060 pfam13347, MFS_2, MFS/sugar transport protein.  This family is part
           of the major facilitator superfamily of membrane
           transport proteins.
          Length = 425

 Score = 33.7 bits (78), Expect = 0.046
 Identities = 21/74 (28%), Positives = 31/74 (41%), Gaps = 11/74 (14%)

Query: 5   VSFGLDAASWNIWAVVTLILVFCTATG-NI-----LVCLAISWEKR--LQNVTNYFLMSL 56
             FGL   +  I+A +T IL+    T  NI     +  L    ++R  L +         
Sbjct: 90  TPFGLSMTAKLIYAFITYILLGLAYTFVNIPYWSLIPVLTRDPKERTSLASYRRIGAT-- 147

Query: 57  AITDLMVAVLVMPL 70
            I  L+VAV+  PL
Sbjct: 148 -IGGLLVAVIAPPL 160


>gnl|CDD|113290 pfam04514, BTV_NS2, Bluetongue virus non-structural protein NS2.
           This family includes NS2 proteins from other members of
           the Orbivirus genus. NS2 is a non-specific
           single-stranded RNA-binding protein that forms large
           homomultimers and accumulates in viral inclusion bodies
           of infected cells. Three RNA binding regions have been
           identified in Bluetongue virus serotype 17 at residues
           2-11, 153-166 and 274-286. NS2 multimers also possess
           nucleotidyl phosphatase activity. The precise function
           of NS2 is not known, but it may be involved in the
           transport and condensation of viral mRNAs.
          Length = 363

 Score = 33.7 bits (77), Expect = 0.048
 Identities = 18/65 (27%), Positives = 35/65 (53%)

Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
           ET++++++EE     E    E  E +  +  E+ R L +E E  E+ E+  ++ED+ GE+
Sbjct: 206 ETSKQDQKEERRAAVERRLAELVEMINWNLEERRRDLRKEQELEENVERDSDDEDEHGED 265

Query: 167 KRVED 171
               +
Sbjct: 266 SEDGE 270



 Score = 26.4 bits (58), Expect = 9.6
 Identities = 25/82 (30%), Positives = 35/82 (42%), Gaps = 3/82 (3%)

Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDE--RGEDEEKKEEEE 160
           G  D    E   +EE  +   E  +QD+  EE     ERRL E  E      EE++ +  
Sbjct: 185 GGPDSDEDENPLDEEAPDMTPETSKQDQ-KEERRAAVERRLAELVEMINWNLEERRRDLR 243

Query: 161 DKQGEEKRVEDKIGDEKRLEEN 182
            +Q  E+ VE    DE    E+
Sbjct: 244 KEQELEENVERDSDDEDEHGED 265


>gnl|CDD|217749 pfam03824, NicO, High-affinity nickel-transport protein.  High
           affinity nickel transporters involved in the
           incorporation of nickel into H2-uptake hydrogenase and
           urease enzymes. Essential for the expression of
           catalytically active hydrogenase and urease. Ion uptake
           is dependent on proton motive force. HoxN in Alcaligenes
           eutrophus is thought to be an integral membrane protein
           with seven transmembrane helices. The family also
           includes a cobalt transporter.
          Length = 278

 Score = 33.1 bits (76), Expect = 0.053
 Identities = 17/80 (21%), Positives = 28/80 (35%), Gaps = 2/80 (2%)

Query: 8   GLDAASWNIW-AVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVL 66
           GL      +  A+    +  C     +L+  A  W            MS+  T + VA+L
Sbjct: 170 GLSFDGILVLPALFAAGMRPCDGADGVLMLAAYLWAFLGG-RKLVLNMSITGTSITVALL 228

Query: 67  VMPLGILTLVRVRTNLSGAV 86
              +  L L+  +   SG  
Sbjct: 229 AGLIETLGLLAKKLAGSGTF 248


>gnl|CDD|216374 pfam01226, Form_Nir_trans, Formate/nitrite transporter. 
          Length = 249

 Score = 33.2 bits (77), Expect = 0.054
 Identities = 16/67 (23%), Positives = 25/67 (37%), Gaps = 22/67 (32%)

Query: 11  AASWNIWAVVTLIL-VFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVMP 69
           + S+        +  + C    N LVCLA+ W            ++LA  D    +L + 
Sbjct: 146 SLSF----WEAFLRGILC----NWLVCLAV-W------------LALAAKDTAGKILAIW 184

Query: 70  LGILTLV 76
           L I   V
Sbjct: 185 LPIFAFV 191


>gnl|CDD|237047 PRK12298, obgE, GTPase CgtA; Reviewed.
          Length = 390

 Score = 33.3 bits (77), Expect = 0.055
 Identities = 16/53 (30%), Positives = 26/53 (49%), Gaps = 3/53 (5%)

Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEED 161
               EE EE E  E+ E   D+Y  E   E E    E+D+  +D+  ++++E 
Sbjct: 334 ENPREEAEEAEAPEKVEFMWDDYHREQLEEVE---EEDDDDWDDDWDEDDDEG 383



 Score = 33.3 bits (77), Expect = 0.056
 Identities = 16/53 (30%), Positives = 26/53 (49%)

Query: 114 EEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
           EE   EE EE E  +      D+  +E+    E+E  +D +   +E+D +G E
Sbjct: 333 EENPREEAEEAEAPEKVEFMWDDYHREQLEEVEEEDDDDWDDDWDEDDDEGVE 385


>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT.  This family
           consists of several bacterial cobalamin biosynthesis
           (CobT) proteins. CobT is involved in the transformation
           of precorrin-3 into cobyrinic acid.
          Length = 282

 Score = 33.2 bits (76), Expect = 0.056
 Identities = 16/65 (24%), Positives = 31/65 (47%), Gaps = 5/65 (7%)

Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEE-EDKQGE 165
           + A E  +E E  + E+ E E D   +ED+++ E    EE+    D   ++ +   ++ E
Sbjct: 207 DMAEELGDEPESADSEDNEDEDDPKEDEDDDQGE----EEESGSSDSLSEDSDASSEEME 262

Query: 166 EKRVE 170
              +E
Sbjct: 263 SGEME 267



 Score = 32.9 bits (75), Expect = 0.075
 Identities = 14/54 (25%), Positives = 28/54 (51%)

Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEE 159
           DE    + E+ E+E++ +E+++ D+  EE+    +    + D   E+ E  E E
Sbjct: 214 DEPESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEME 267



 Score = 30.2 bits (68), Expect = 0.57
 Identities = 15/59 (25%), Positives = 27/59 (45%), Gaps = 5/59 (8%)

Query: 113 EEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVED 171
           +  EE  +E E    +D   E+D  E E     +D++GE+EE    +   +  +   E+
Sbjct: 207 DMAEELGDEPESADSEDNEDEDDPKEDE-----DDDQGEEEESGSSDSLSEDSDASSEE 260



 Score = 29.8 bits (67), Expect = 0.77
 Identities = 12/57 (21%), Positives = 25/57 (43%)

Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEED 161
            DE   +E+E++++ EEEE         + D + +E    E +      +   + +D
Sbjct: 225 EDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEMEAAEASADDTPDSDD 281



 Score = 28.6 bits (64), Expect = 1.9
 Identities = 17/76 (22%), Positives = 39/76 (51%), Gaps = 2/76 (2%)

Query: 85  AVLRLKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLP 144
           A  R+ +++L    + +    +  + + E+ E+E++ +E+  E D+  EE+E+     L 
Sbjct: 194 AFARVVRDMLSSMDMAEELGDEPESADSEDNEDEDDPKED--EDDDQGEEEESGSSDSLS 251

Query: 145 EEDERGEDEEKKEEEE 160
           E+ +   +E +  E E
Sbjct: 252 EDSDASSEEMESGEME 267



 Score = 28.6 bits (64), Expect = 2.1
 Identities = 14/61 (22%), Positives = 31/61 (50%)

Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
           D    E+E++ +E+E++++ +E++    +  +E      EE E GE E  +   +D    
Sbjct: 220 DSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEMEAAEASADDTPDS 279

Query: 166 E 166
           +
Sbjct: 280 D 280



 Score = 26.3 bits (58), Expect = 9.7
 Identities = 12/55 (21%), Positives = 22/55 (40%)

Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
           D+   +E++++ EEEE        E  +    E E    E  E   D+    ++ 
Sbjct: 228 DDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEMEAAEASADDTPDSDDA 282


>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
           This family represents Cwf15/Cwc15 (from
           Schizosaccharomyces pombe and Saccharomyces cerevisiae
           respectively) and their homologues. The function of
           these proteins is unknown, but they form part of the
           spliceosome and are thus thought to be involved in mRNA
           splicing.
          Length = 241

 Score = 33.2 bits (76), Expect = 0.059
 Identities = 17/56 (30%), Positives = 31/56 (55%), Gaps = 2/56 (3%)

Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
           + +  +++ ++++ E E    L E E  K+ R   E++  E+EEK  EEE  + EE
Sbjct: 127 DSDSSDDDSDDDDSEDETAALLRELEKIKKER--AEEKEREEEEKAAEEEKAREEE 180


>gnl|CDD|223003 PHA03169, PHA03169, hypothetical protein; Provisional.
          Length = 413

 Score = 33.4 bits (76), Expect = 0.060
 Identities = 9/81 (11%), Positives = 25/81 (30%)

Query: 104 ARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ 163
                 +   E+  EE E    + + +     ++E     P      ++  + +    +Q
Sbjct: 162 QPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPPDEPGEPQSPTPQQ 221

Query: 164 GEEKRVEDKIGDEKRLEENKK 184
                 +  +  E    E ++
Sbjct: 222 APSPNTQQAVEHEDEPTEPER 242



 Score = 33.4 bits (76), Expect = 0.062
 Identities = 8/60 (13%), Positives = 13/60 (21%)

Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
             +                    +    E  N    + P    +   E+  EE E    E
Sbjct: 125 GSSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSE 184



 Score = 33.0 bits (75), Expect = 0.067
 Identities = 9/71 (12%), Positives = 23/71 (32%), Gaps = 2/71 (2%)

Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
           + +  ++     +   E+  +E +    E E +     P + E        +   D+ GE
Sbjct: 156 NPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPG--PPQSETPTSSPPPQSPPDEPGE 213

Query: 166 EKRVEDKIGDE 176
            +    +    
Sbjct: 214 PQSPTPQQAPS 224



 Score = 33.0 bits (75), Expect = 0.073
 Identities = 13/77 (16%), Positives = 22/77 (28%), Gaps = 1/77 (1%)

Query: 101 KAGARDETAREEEEEEEEEEEEE-EEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEE 159
           +A A     + E + E  EE    E++E+ +        +    P     G  EE     
Sbjct: 59  RAVAEQGHRQTESDTETAEESRHGEKEERGQGGPSGSGSESVGSPTPSPSGSAEELASGL 118

Query: 160 EDKQGEEKRVEDKIGDE 176
             +       E      
Sbjct: 119 SPENTSGSSPESPASHS 135



 Score = 32.6 bits (74), Expect = 0.11
 Identities = 8/74 (10%), Positives = 12/74 (16%), Gaps = 2/74 (2%)

Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDK 162
                T       EE       E       E   +      P         E    E   
Sbjct: 99  SVGSPTPSPSGSAEELASGLSPENTSGSSPESPASHSPP--PSPPSHPGPHEPAPPESHN 156

Query: 163 QGEEKRVEDKIGDE 176
               ++    +   
Sbjct: 157 PSPNQQPSSFLQPS 170



 Score = 32.2 bits (73), Expect = 0.14
 Identities = 8/70 (11%), Positives = 12/70 (17%), Gaps = 2/70 (2%)

Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
            +A E       E       E                P   E    E        +    
Sbjct: 109 GSAEELASGLSPENTSGSSPESPA--SHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSF 166

Query: 167 KRVEDKIGDE 176
            +   +   E
Sbjct: 167 LQPSHEDSPE 176



 Score = 31.9 bits (72), Expect = 0.18
 Identities = 11/56 (19%), Positives = 17/56 (30%), Gaps = 4/56 (7%)

Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
             E    E       Q+   +L+    +     PEE E    E + +     Q E 
Sbjct: 146 PHEPAPPESHNPSPNQQPSSFLQPSHEDS----PEEPEPPTSEPEPDSPGPPQSET 197



 Score = 31.5 bits (71), Expect = 0.27
 Identities = 12/73 (16%), Positives = 16/73 (21%)

Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKR 168
           A +   + E + E  EE    E  E  +        E             EE   G    
Sbjct: 62  AEQGHRQTESDTETAEESRHGEKEERGQGGPSGSGSESVGSPTPSPSGSAEELASGLSPE 121

Query: 169 VEDKIGDEKRLEE 181
                  E     
Sbjct: 122 NTSGSSPESPASH 134



 Score = 31.1 bits (70), Expect = 0.34
 Identities = 6/69 (8%), Positives = 15/69 (21%)

Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDK 162
            + +  A               E    E      N++     +       EE +    + 
Sbjct: 126 SSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEP 185

Query: 163 QGEEKRVED 171
           + +      
Sbjct: 186 EPDSPGPPQ 194



 Score = 30.7 bits (69), Expect = 0.40
 Identities = 7/69 (10%), Positives = 13/69 (18%)

Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDK 162
           G   E       E                  E    +        +     +   E+  +
Sbjct: 117 GLSPENTSGSSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPE 176

Query: 163 QGEEKRVED 171
           + E    E 
Sbjct: 177 EPEPPTSEP 185



 Score = 29.2 bits (65), Expect = 1.3
 Identities = 14/83 (16%), Positives = 27/83 (32%), Gaps = 4/83 (4%)

Query: 100 FKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEE 159
           F   + +++  E E    E E +     Q E        +       DE GE +    ++
Sbjct: 166 FLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQS----PPDEPGEPQSPTPQQ 221

Query: 160 EDKQGEEKRVEDKIGDEKRLEEN 182
                 ++ VE +    +   E 
Sbjct: 222 APSPNTQQAVEHEDEPTEPEREG 244



 Score = 29.2 bits (65), Expect = 1.3
 Identities = 6/69 (8%), Positives = 11/69 (15%)

Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDK 162
           G    +  E             E+       E+ +      P                + 
Sbjct: 90  GGPSGSGSESVGSPTPSPSGSAEELASGLSPENTSGSSPESPASHSPPPSPPSHPGPHEP 149

Query: 163 QGEEKRVED 171
              E     
Sbjct: 150 APPESHNPS 158



 Score = 26.9 bits (59), Expect = 8.8
 Identities = 10/76 (13%), Positives = 22/76 (28%)

Query: 112 EEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVED 171
           EE E    E E +     +      +   +  P+E    +    ++       +    ED
Sbjct: 176 EEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPPDEPGEPQSPTPQQAPSPNTQQAVEHED 235

Query: 172 KIGDEKRLEENKKKKT 187
           +  + +R         
Sbjct: 236 EPTEPEREGPPFPGHR 251


>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain.  This
           family represents the C-terminus (approximately 300
           residues) of proteins that are involved as binding
           partners for Prp19 as part of the nuclear pore complex.
           The family in Drosophila is necessary for pre-mRNA
           splicing, and the human protein has been found in
           purifications of the spliceosome. In the past this
           family was thought, erroneously, to be associated with
           microfibrillin.
          Length = 277

 Score = 33.0 bits (75), Expect = 0.061
 Identities = 30/83 (36%), Positives = 47/83 (56%), Gaps = 3/83 (3%)

Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
           ET   E EEE+E  EEEEE+ ++E   + E++ E RL     R +D    +E E +  +E
Sbjct: 1   ETEVLELEEEDESGEEEEEESEEEEETDSEDDMEPRLKPVFTRKKDRITIQEREREAAKE 60

Query: 167 KRVEDKIGDEKRLEENKKKKTQK 189
           K +E+   + KR  E +K++T K
Sbjct: 61  KALEE---EAKRKAEERKRETLK 80


>gnl|CDD|225027 COG2116, FocA, Formate/nitrite family of transporters [Inorganic
           ion transport and metabolism].
          Length = 265

 Score = 33.0 bits (76), Expect = 0.070
 Identities = 13/52 (25%), Positives = 20/52 (38%), Gaps = 17/52 (32%)

Query: 25  VFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVMPLGILTLV 76
           + C    N LVCLA+ W            MS +  D    +  + L ++  V
Sbjct: 166 ILC----NWLVCLAV-W------------MSYSAKDGAAKIFAIILPVMLFV 200


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 728

 Score = 33.1 bits (76), Expect = 0.075
 Identities = 24/85 (28%), Positives = 39/85 (45%)

Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQG 164
           ++E   E EE   E E EEE  E++      +N   R+   E+   E E KK ++E+K  
Sbjct: 387 KEENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNE 446

Query: 165 EEKRVEDKIGDEKRLEENKKKKTQK 189
            +++ E    +E   EE  K +   
Sbjct: 447 FKEKKESDEEEELEDEEEAKVEKVA 471



 Score = 32.3 bits (74), Expect = 0.13
 Identities = 20/91 (21%), Positives = 43/91 (47%), Gaps = 8/91 (8%)

Query: 108 TAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGED--------EEKKEEE 159
             R E  ++EE + E EE  ++   EE+ +E+E   P +   G          +E + ++
Sbjct: 379 MQRAEARKKEENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKK 438

Query: 160 EDKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
             K+ + +  E K  DE+   E++++   ++
Sbjct: 439 LKKENKNEFKEKKESDEEEELEDEEEAKVEK 469



 Score = 30.8 bits (70), Expect = 0.40
 Identities = 22/95 (23%), Positives = 42/95 (44%), Gaps = 1/95 (1%)

Query: 96  ENQLFKAGARDETAREEEE-EEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEE 154
           EN       R E   EEE  EEE EE  ++   + ++  E+  ++      + E   + +
Sbjct: 389 ENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFK 448

Query: 155 KKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQK 189
           +K+E ++++  E   E K+         + +K QK
Sbjct: 449 EKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQK 483



 Score = 30.0 bits (68), Expect = 0.68
 Identities = 25/96 (26%), Positives = 46/96 (47%), Gaps = 16/96 (16%)

Query: 110 REEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGED----------------E 153
            EE+E+E+ + EEE+E + ++  + +     R+   + + GED                E
Sbjct: 324 SEEDEDEDSDSEEEDEDDDEDDDDGENPWMLRKKLGKLKEGEDDEENSGLLSMKFMQRAE 383

Query: 154 EKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQK 189
            +K+EE D + EE R E +  +E   EEN++   + 
Sbjct: 384 ARKKEENDAEIEELRRELEGEEESDEEENEEPSKKN 419



 Score = 29.3 bits (66), Expect = 1.5
 Identities = 22/89 (24%), Positives = 47/89 (52%)

Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
           +  A  +  ++E + E +E++E +E+E+ E  EE + EK      +      +E++EEE 
Sbjct: 431 EKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEEL 490

Query: 161 DKQGEEKRVEDKIGDEKRLEENKKKKTQK 189
           D++    +    +G   + +++KKK + K
Sbjct: 491 DEENPWLKTTSSVGKSAKKQDSKKKSSSK 519



 Score = 28.5 bits (64), Expect = 2.6
 Identities = 18/89 (20%), Positives = 48/89 (53%)

Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
           + G ++  +++ ++E + E +E++E +++E LE++E  K  ++  +  +  ++ +KEEEE
Sbjct: 428 ENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEE 487

Query: 161 DKQGEEKRVEDKIGDEKRLEENKKKKTQK 189
           ++  EE           +  + +  K + 
Sbjct: 488 EELDEENPWLKTTSSVGKSAKKQDSKKKS 516



 Score = 27.3 bits (61), Expect = 6.1
 Identities = 19/84 (22%), Positives = 35/84 (41%), Gaps = 3/84 (3%)

Query: 104 ARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ 163
            + E  R + E +   EE+E+E    E  +ED++E +          +   K +E ED +
Sbjct: 309 RQGEELRRKIEGKSVSEEDEDEDSDSEEEDEDDDEDDDDGENPWMLRKKLGKLKEGEDDE 368

Query: 164 GEEKRVEDKI---GDEKRLEENKK 184
                +  K     + ++ EEN  
Sbjct: 369 ENSGLLSMKFMQRAEARKKEENDA 392



 Score = 26.9 bits (60), Expect = 7.6
 Identities = 23/102 (22%), Positives = 40/102 (39%), Gaps = 8/102 (7%)

Query: 96  ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK 155
           E+            +  + +E E++EE       ++++  E  K+     E E    E +
Sbjct: 343 EDDDDGENPWMLRKKLGKLKEGEDDEENSGLLSMKFMQRAEARKKEENDAEIEELRRELE 402

Query: 156 KEEEEDKQGEEKRVEDKIG--------DEKRLEENKKKKTQK 189
            EEE D++  E+  +  +G         EK  E  K KK  K
Sbjct: 403 GEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENK 444


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 32.4 bits (74), Expect = 0.076
 Identities = 13/81 (16%), Positives = 42/81 (51%)

Query: 108 TAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEK 167
               E +++++E  EE E+ + EY E+ + + +++  ++ +  + ++K ++++DK  ++ 
Sbjct: 55  AEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKD 114

Query: 168 RVEDKIGDEKRLEENKKKKTQ 188
             E +   E   +   +  + 
Sbjct: 115 EKEAEDKLEDLTKSYSETLST 135



 Score = 31.6 bits (72), Expect = 0.16
 Identities = 13/80 (16%), Positives = 43/80 (53%), Gaps = 3/80 (3%)

Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKR 168
            +++E  EE E+ ++E +E+ ++  + +  K+    ++ ++ + ++KK+++ +K+ E++ 
Sbjct: 62  KKKKELAEEIEKVKKEYEEKQKWKWKKKKSKK---KKDKDKDKKDDKKDDKSEKKDEKEA 118

Query: 169 VEDKIGDEKRLEENKKKKTQ 188
            +      K   E     ++
Sbjct: 119 EDKLEDLTKSYSETLSTLSE 138


>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM).  This
           family consists of several Plasmodium falciparum SPAM
           (secreted polymorphic antigen associated with
           merozoites) proteins. Variation among SPAM alleles is
           the result of deletions and amino acid substitutions in
           non-repetitive sequences within and flanking the alanine
           heptad-repeat domain. Heptad repeats in which the a and
           d position contain hydrophobic residues generate
           amphipathic alpha-helices which give rise to helical
           bundles or coiled-coil structures in proteins. SPAM is
           an example of a P. falciparum antigen in which a
           repetitive sequence has features characteristic of a
           well-defined structural element.
          Length = 164

 Score = 32.1 bits (73), Expect = 0.077
 Identities = 25/84 (29%), Positives = 43/84 (51%)

Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
           ++   E++E++EEEEEE+EE+ ++    EDE E      EE+E  ED    ++ E K   
Sbjct: 41  EDVKDEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDLKDIEKKNIN 100

Query: 166 EKRVEDKIGDEKRLEENKKKKTQK 189
           +     +  + + L     KK +K
Sbjct: 101 DIFNSTQDDNAQNLISKNYKKNEK 124



 Score = 26.4 bits (58), Expect = 8.8
 Identities = 21/58 (36%), Positives = 37/58 (63%)

Query: 133 EEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
           E ++ + E++  +E+E  EDEE+ EE ED + EE+ VED+  +E+  E+N   K  ++
Sbjct: 39  ENEDVKDEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDLKDIEK 96


>gnl|CDD|233471 TIGR01569, A_tha_TIGR01569, plant integral membrane protein
          TIGR01569.  This model describes a region of ~160
          residues found exclusively in plant proteins, generally
          as the near complete length of the protein. At least 24
          different members are found in Arabidopsis thaliana.
          Members have four predicted transmembrane regions, the
          last of which is preceded by an invariant CXXXXX[FY]C
          motif. The family is not functionally characterized.
          Length = 154

 Score = 32.4 bits (74), Expect = 0.077
 Identities = 18/70 (25%), Positives = 33/70 (47%), Gaps = 4/70 (5%)

Query: 2  ETEVSFGLDA---ASWN-IWAVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLA 57
          ET+V F       A ++ + A V  ++    A G  L+ L +S    L+    + L++L 
Sbjct: 23 ETKVVFVQLITFKAKFSDLPAFVYFVVANAIACGYSLLSLVVSIFGLLKRRVFFKLIALF 82

Query: 58 ITDLMVAVLV 67
            DL++  L+
Sbjct: 83 FLDLVMLALL 92


>gnl|CDD|227492 COG5163, NOP7, Protein required for biogenesis of the 60S ribosomal
           subunit [Translation, ribosomal structure and
           biogenesis].
          Length = 591

 Score = 33.1 bits (75), Expect = 0.077
 Identities = 20/95 (21%), Positives = 43/95 (45%)

Query: 96  ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK 155
                 AG  D+   EE + ++E E E +  +  E  E D++  + +  +     E+EEK
Sbjct: 483 VRYEHVAGEEDDDDDEELQAQKELELEAQGIKYSETSEADKDVNKSKNKKRKVDEEEEEK 542

Query: 156 KEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
           K +      ++K++  K+      +E + +  +K+
Sbjct: 543 KLKMIMMSNKQKKLYKKMKYSNAKKEEQAENLKKK 577



 Score = 28.1 bits (62), Expect = 3.6
 Identities = 17/81 (20%), Positives = 33/81 (40%), Gaps = 2/81 (2%)

Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
           D           EE +   E++     E+   E      ++D+  E + +KE E + QG 
Sbjct: 454 DSYDPRASLMTMEETQRHSEEDLVNRFEDVRYEHVAGEEDDDDDEELQAQKELELEAQG- 512

Query: 166 EKRVEDKIGDEKRLEENKKKK 186
             +  +    +K + ++K KK
Sbjct: 513 -IKYSETSEADKDVNKSKNKK 532


>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 156 to 174 amino acids in length. This domain is
           found associated with pfam07780, pfam01728.
          Length = 154

 Score = 32.3 bits (74), Expect = 0.078
 Identities = 19/49 (38%), Positives = 32/49 (65%)

Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEE 159
           ++E+EEEEEEE E E+  +E   ++  EKE    + ++R E+E K++E 
Sbjct: 98  KKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKEI 146


>gnl|CDD|184287 PRK13735, PRK13735, conjugal transfer mating pair stabilization
           protein TraG; Provisional.
          Length = 942

 Score = 33.2 bits (76), Expect = 0.081
 Identities = 13/58 (22%), Positives = 24/58 (41%), Gaps = 1/58 (1%)

Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQG 164
           E   + +  E +   + E   Q+    E E   + R+P  D   E  ++ +E +DK  
Sbjct: 885 ENTVKGQYSELQNHHKTEALSQNNKYNE-EKSAQERMPGADSPEELMKRAKEYQDKHK 941



 Score = 28.6 bits (64), Expect = 2.7
 Identities = 13/70 (18%), Positives = 24/70 (34%), Gaps = 2/70 (2%)

Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDK 162
           G    + R EE   + +  E +   + E L ++    E +     ER    +  EE   +
Sbjct: 875 GDTQNSIRGEENTVKGQYSELQNHHKTEALSQNNKYNEEK--SAQERMPGADSPEELMKR 932

Query: 163 QGEEKRVEDK 172
             E +     
Sbjct: 933 AKEYQDKHKG 942


>gnl|CDD|218010 pfam04293, SpoVR, SpoVR like protein.  Bacillus subtilis stage V
           sporulation protein R is involved in spore cortex
           formation. Little is known about cortex biosynthesis,
           except that it depends on several sigma E controlled
           genes, including spoVR.
          Length = 427

 Score = 32.9 bits (76), Expect = 0.082
 Identities = 13/39 (33%), Positives = 25/39 (64%)

Query: 119 EEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKE 157
           EEE+  ++E++EYL+   N+  R +P + E+ E+E  + 
Sbjct: 178 EEEKARQEEREEYLQSQVNDLWRTIPPQREKLEEERARR 216



 Score = 31.0 bits (71), Expect = 0.29
 Identities = 14/50 (28%), Positives = 23/50 (46%), Gaps = 8/50 (16%)

Query: 108 TAREEEEEEEEEEEEEEEQEQDEY--------LEEDENEKERRLPEEDER 149
           +  EE+  +EE EE  + Q  D +          E+E  + +R PEE + 
Sbjct: 176 SLEEEKARQEEREEYLQSQVNDLWRTIPPQREKLEEERARRKRFPEEPQE 225


>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain.  This is a family of proteins of
           approximately 300 residues, found in plants and
           vertebrates. They contain a highly conserved DDRGK
           motif.
          Length = 189

 Score = 32.4 bits (74), Expect = 0.085
 Identities = 21/54 (38%), Positives = 35/54 (64%)

Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDK 162
            RE EEEE EE ++ EE+ + E  EE+E E+ER   +E+E  ++ E++  +E +
Sbjct: 20  QREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQE 73



 Score = 31.2 bits (71), Expect = 0.23
 Identities = 23/66 (34%), Positives = 41/66 (62%)

Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
           K  A+ E  +   ++ E EEEE EE+++ E   E E ++E  L EE E+ ++EE+++E E
Sbjct: 6   KKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKERE 65

Query: 161 DKQGEE 166
           ++  +E
Sbjct: 66  EQARKE 71



 Score = 30.1 bits (68), Expect = 0.57
 Identities = 17/74 (22%), Positives = 43/74 (58%)

Query: 112 EEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVED 171
             ++  + EE++  ++Q E  EE+  E+++   + +   ++EE+ EEE +K+ EE+  ++
Sbjct: 4   GAKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKE 63

Query: 172 KIGDEKRLEENKKK 185
           +    ++ +E  +K
Sbjct: 64  REEQARKEQEEYEK 77



 Score = 26.2 bits (58), Expect = 8.4
 Identities = 20/56 (35%), Positives = 34/56 (60%)

Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
           ++E   EEE E+++EEEE +E+E+    E++E EK +     +E G D+   +EE 
Sbjct: 43  KEEEELEEEREKKKEEEERKEREEQARKEQEEYEKLKSSFVVEEEGTDKLSADEES 98


>gnl|CDD|184899 PRK14906, PRK14906, DNA-directed RNA polymerase subunit beta'/alpha
           domain fusion protein; Provisional.
          Length = 1460

 Score = 32.9 bits (75), Expect = 0.089
 Identities = 21/85 (24%), Positives = 41/85 (48%), Gaps = 3/85 (3%)

Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
           D+ AREE+ ++  +E   + +E D    +   E  RRL  +    +DE   +  +D++  
Sbjct: 142 DKEAREEDADDLRDELAADLEELDAE-RDRLIEATRRLSVDYVPEDDEFVDDIGDDERLT 200

Query: 166 EKRVEDKIGDEKRLEENKKKKTQKR 190
            + V  ++ D    EE  ++K  +R
Sbjct: 201 AEEVRAEVADIY--EEYNERKALRR 223


>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
           subunit G; Reviewed.
          Length = 197

 Score = 32.3 bits (73), Expect = 0.090
 Identities = 30/105 (28%), Positives = 50/105 (47%), Gaps = 4/105 (3%)

Query: 68  MPLGILTLVRVRTNLSGAVLRLKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQE 127
           +PL I    ++R+     + + K  I+ + Q+ KA       REE EE  E E  EE+ +
Sbjct: 97  VPLAIRIRDQLRSVKKDDIKKKKSLIIRQEQIEKA----RQEREELEERMEWERREEKID 152

Query: 128 QDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDK 172
           + E  EE E E+E +  EE     + E  E++E +   +    +K
Sbjct: 153 EREDQEEQEREREEQTIEEQSDDSEHEIIEQDESETESDDDKTEK 197



 Score = 30.8 bits (69), Expect = 0.27
 Identities = 22/73 (30%), Positives = 38/73 (52%)

Query: 112 EEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVED 171
           +E+ E+  +E EE +E+ E+   +E   ER   EE ER  +E+  EE+ D    E   +D
Sbjct: 125 QEQIEKARQEREELEERMEWERREEKIDEREDQEEQEREREEQTIEEQSDDSEHEIIEQD 184

Query: 172 KIGDEKRLEENKK 184
           +   E   ++ +K
Sbjct: 185 ESETESDDDKTEK 197



 Score = 26.6 bits (58), Expect = 8.4
 Identities = 22/86 (25%), Positives = 46/86 (53%), Gaps = 1/86 (1%)

Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQG 164
           +D+  +++     +E+ E+  QE++E  E  E E+     +E E  E++E++ EE+  + 
Sbjct: 112 KDDIKKKKSLIIRQEQIEKARQEREELEERMEWERREEKIDEREDQEEQEREREEQTIEE 171

Query: 165 EEKRVEDKIGDEKRLE-ENKKKKTQK 189
           +    E +I ++   E E+   KT+K
Sbjct: 172 QSDDSEHEIIEQDESETESDDDKTEK 197


>gnl|CDD|191249 pfam05279, Asp-B-Hydro_N, Aspartyl beta-hydroxylase N-terminal
           region.  This family includes the N-terminal regions of
           the junctin, junctate and aspartyl beta-hydroxylase
           proteins. Junctate is an integral ER/SR membrane calcium
           binding protein, which comes from an alternatively
           spliced form of the same gene that generates aspartyl
           beta-hydroxylase and junctin. Aspartyl beta-hydroxylase
           catalyzes the post-translational hydroxylation of
           aspartic acid or asparagine residues contained within
           epidermal growth factor (EGF) domains of proteins.
          Length = 240

 Score = 32.6 bits (74), Expect = 0.090
 Identities = 26/89 (29%), Positives = 40/89 (44%), Gaps = 1/89 (1%)

Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEED 161
           AG   E   E+    EE   E+ +Q+  E   E ENE  +   E+ ER + E     EED
Sbjct: 152 AGKVHE-ETEDSYHVEETASEQYKQDMKEKASEQENEDSKEPVEKAERTKAETDDVTEED 210

Query: 162 KQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
              E+  VED    ++ L +   ++ Q+ 
Sbjct: 211 YDEEDNPVEDSKAIKEELAKEPVEEQQEV 239



 Score = 28.4 bits (63), Expect = 2.3
 Identities = 16/71 (22%), Positives = 30/71 (42%)

Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
           +  +E  E  +  EE E+    +E   E   +  +    E E  + +E  E+ E  + E 
Sbjct: 144 DDRQETLEAGKVHEETEDSYHVEETASEQYKQDMKEKASEQENEDSKEPVEKAERTKAET 203

Query: 167 KRVEDKIGDEK 177
             V ++  DE+
Sbjct: 204 DDVTEEDYDEE 214



 Score = 28.0 bits (62), Expect = 3.0
 Identities = 15/81 (18%), Positives = 34/81 (41%)

Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQG 164
           R    + EEE +E+ +   E+    +  E+   ++ +   ++    ED + ++E  +   
Sbjct: 95  RKTKQKVEEEVKEQLQSLLEKIVVSKQEEDGPGKEPQLDEDKFLLAEDSDDRQETLEAGK 154

Query: 165 EEKRVEDKIGDEKRLEENKKK 185
             +  ED    E+   E  K+
Sbjct: 155 VHEETEDSYHVEETASEQYKQ 175



 Score = 27.2 bits (60), Expect = 5.4
 Identities = 14/91 (15%), Positives = 37/91 (40%), Gaps = 3/91 (3%)

Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEE---KKE 157
           K    +E   + +   E+    ++E++      + + +K     + D+R E  E     E
Sbjct: 98  KQKVEEEVKEQLQSLLEKIVVSKQEEDGPGKEPQLDEDKFLLAEDSDDRQETLEAGKVHE 157

Query: 158 EEEDKQGEEKRVEDKIGDEKRLEENKKKKTQ 188
           E ED    E+   ++   + + + ++++   
Sbjct: 158 ETEDSYHVEETASEQYKQDMKEKASEQENED 188


>gnl|CDD|218954 pfam06246, Isy1, Isy1-like splicing family.  Isy1 protein is
           important in the optimisation of splicing.
          Length = 253

 Score = 32.3 bits (74), Expect = 0.094
 Identities = 25/88 (28%), Positives = 39/88 (44%), Gaps = 3/88 (3%)

Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDK 162
           G RDE   E+    E E E E+E  +    E  E + E    EEDER ED     +++ +
Sbjct: 168 GYRDE---EDGVLLEYEAEREKELIEKLVEEWKEIQDEDLQAEEDEREEDIYIPLDDDAE 224

Query: 163 QGEEKRVEDKIGDEKRLEENKKKKTQKR 190
                 V  +   E+ L E +K++   +
Sbjct: 225 FVAHVPVPTQEEVEEWLLEKRKRELLAK 252


>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin.  Nucleoplasmins are also
           known as chromatin decondensation proteins. They bind to
           core histones and transfer DNA to them in a reaction
           that requires ATP. This is thought to play a role in the
           assembly of regular nucleosomal arrays.
          Length = 146

 Score = 31.9 bits (73), Expect = 0.096
 Identities = 13/32 (40%), Positives = 25/32 (78%)

Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERR 142
           +++EE+EEEE++EE+ ++DE  EE+   K+ +
Sbjct: 115 DDDEEDEEEEDDEEDDDEDESEEEESPVKKVK 146



 Score = 30.4 bits (69), Expect = 0.29
 Identities = 12/29 (41%), Positives = 22/29 (75%)

Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEK 139
           +E +++EE+EEEE+++E D+  E +E E 
Sbjct: 112 DESDDDEEDEEEEDDEEDDDEDESEEEES 140



 Score = 29.2 bits (66), Expect = 0.73
 Identities = 12/31 (38%), Positives = 23/31 (74%)

Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDEN 137
           E    +++EE+EEEE++EE + ++  EE+E+
Sbjct: 110 EEDESDDDEEDEEEEDDEEDDDEDESEEEES 140



 Score = 28.8 bits (65), Expect = 1.1
 Identities = 14/30 (46%), Positives = 25/30 (83%)

Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKE 140
            EE+E +++EE+EEE++ +E  +EDE+E+E
Sbjct: 109 SEEDESDDDEEDEEEEDDEEDDDEDESEEE 138



 Score = 28.4 bits (64), Expect = 1.4
 Identities = 11/31 (35%), Positives = 22/31 (70%)

Query: 100 FKAGARDETAREEEEEEEEEEEEEEEQEQDE 130
            +    D+   +EEEE++EE+++E+E E++E
Sbjct: 109 SEEDESDDDEEDEEEEDDEEDDDEDESEEEE 139



 Score = 28.1 bits (63), Expect = 1.6
 Identities = 12/33 (36%), Positives = 23/33 (69%)

Query: 112 EEEEEEEEEEEEEEQEQDEYLEEDENEKERRLP 144
            EE+E +++EE+EE+E DE  ++++  +E   P
Sbjct: 109 SEEDESDDDEEDEEEEDDEEDDDEDESEEEESP 141


>gnl|CDD|162338 TIGR01398, FlhA, flagellar biosynthesis protein FlhA.  This model
           describes flagellar biosynthesis protein FlhA, one of a
           large number of genes associated with the biosynthesis
           of functional bacterial flagella. Homologs of many such
           proteins, including FlhA, function in type III protein
           secretion systems. A separate model describes InvA
           (Salmonella enterica), LcrD (Yersinia enterocolitica),
           HrcV (Xanthomonas), etc., all of which score below the
           noise cutoff for this model [Cellular processes,
           Chemotaxis and motility].
          Length = 678

 Score = 32.6 bits (75), Expect = 0.098
 Identities = 28/117 (23%), Positives = 49/117 (41%), Gaps = 6/117 (5%)

Query: 24  LVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVMPLGILTLVRVRTNLS 83
           L+  TATG ++V  A S     + +      +     L+VA ++  L ++  +     L 
Sbjct: 240 LIISTATG-LIVTRASSEGSFGKAIVTQLGAN-PRALLIVAAVLGLLALVPGLPTFPFLF 297

Query: 84  GAVLRLKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKE 140
            A             L +   ++E A  E  + +EE  EEEE+  ++ L  D+ E E
Sbjct: 298 LAGALAFLAW----YLRRRSKQEEEAAAEAAKAQEEAAEEEEESINDILALDDLELE 350


>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987).  A
           family of uncharacterized proteins found by clustering
           human gut metagenomic sequences.
          Length = 379

 Score = 32.6 bits (75), Expect = 0.10
 Identities = 19/59 (32%), Positives = 32/59 (54%)

Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEK 167
             EEE  EE EEE +E + + E  E ++   E++  +  ++G+DEE   EE  +   E+
Sbjct: 65  EIEEELREEYEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEELLELEAEE 123


>gnl|CDD|114603 pfam05887, Trypan_PARP, Procyclic acidic repetitive protein (PARP).
            This family consists of several Trypanosoma brucei
           procyclic acidic repetitive protein (PARP) like
           sequences. The procyclic acidic repetitive protein
           (parp) genes of Trypanosoma brucei encode a small family
           of abundant surface proteins whose expression is
           restricted to the procyclic form of the parasite. They
           are found at two unlinked loci, parpA and parpB;
           transcription of both loci is developmentally regulated.
          Length = 145

 Score = 31.9 bits (71), Expect = 0.10
 Identities = 25/65 (38%), Positives = 35/65 (53%)

Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
           K GA D    + ++E EEEEE E E+E +E  E +E  +E   PEE    E E + E E 
Sbjct: 49  KVGADDTNGTDPDDEPEEEEEPEPEEEGEEEPEPEEEGEEEPEPEETGEEEPEPEPEPEP 108

Query: 161 DKQGE 165
           + + E
Sbjct: 109 EPEPE 113



 Score = 29.9 bits (66), Expect = 0.41
 Identities = 24/63 (38%), Positives = 32/63 (50%)

Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDK 162
           G   +   EEEEE E EEE EEE E +E  EE+   +E    E +   E E + E E + 
Sbjct: 57  GTDPDDEPEEEEEPEPEEEGEEEPEPEEEGEEEPEPEETGEEEPEPEPEPEPEPEPEPEP 116

Query: 163 QGE 165
           + E
Sbjct: 117 EPE 119



 Score = 27.6 bits (60), Expect = 2.7
 Identities = 24/62 (38%), Positives = 32/62 (51%), Gaps = 2/62 (3%)

Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQG 164
            D+   EEEE E EEE EEE + ++E   E+E E E    EE E   + E + E E +  
Sbjct: 60  PDDEPEEEEEPEPEEEGEEEPEPEEE--GEEEPEPEETGEEEPEPEPEPEPEPEPEPEPE 117

Query: 165 EE 166
            E
Sbjct: 118 PE 119



 Score = 26.8 bits (58), Expect = 5.5
 Identities = 22/57 (38%), Positives = 30/57 (52%)

Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ 163
           E   E E EEE EEE E E+E +E  E +E  +E   PE +   E E + E E + +
Sbjct: 65  EEEEEPEPEEEGEEEPEPEEEGEEEPEPEETGEEEPEPEPEPEPEPEPEPEPEPEPE 121



 Score = 26.1 bits (56), Expect = 8.8
 Identities = 22/59 (37%), Positives = 29/59 (49%)

Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEED 161
              +E   E EEE EEE E EEE E++   EE   E+    PE +   E E + E E +
Sbjct: 63  EPEEEEEPEPEEEGEEEPEPEEEGEEEPEPEETGEEEPEPEPEPEPEPEPEPEPEPEPE 121


>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein.  This family includes proteins
           related to Mpp10 (M phase phosphoprotein 10). The U3
           small nucleolar ribonucleoprotein (snoRNP) is required
           for three cleavage events that generate the mature 18S
           rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
           depletion of Mpp10, a U3 snoRNP-specific protein, halts
           18S rRNA production and impairs cleavage at the three U3
           snoRNP-dependent sites.
          Length = 613

 Score = 32.7 bits (74), Expect = 0.11
 Identities = 19/107 (17%), Positives = 48/107 (44%), Gaps = 6/107 (5%)

Query: 90  KKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRL---PEE 146
            + ++ +  +       ++   E+  +  + + E+  + +E  EEDE+ ++  +    E 
Sbjct: 79  VEFLINKEHIRVLAKLQDSESHEDGSDGSDMDSEDSADDEEEEEEDESLEDEMIDDEDEA 138

Query: 147 DERGEDEEKKEEEEDKQ---GEEKRVEDKIGDEKRLEENKKKKTQKR 190
           D   E E   E+  D +    EEK++E++   E++    +  + +K 
Sbjct: 139 DLFNESESSLEDLSDDETEDDEEKKMEEEEAGEEKESVEQATREKKF 185



 Score = 32.7 bits (74), Expect = 0.11
 Identities = 20/79 (25%), Positives = 39/79 (49%)

Query: 90  KKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDER 149
           KK+   + +L       E  ++E + + EEE+EE+++++D+  EE+  E      + DE 
Sbjct: 266 KKDAGDDAELEDDEPDKEAVKKEADSKPEEEDEEDDEQEDDQDEEEPPEAAMDKVKLDEP 325

Query: 150 GEDEEKKEEEEDKQGEEKR 168
             +    E  ++    EKR
Sbjct: 326 VLEGVDLESPKELSSFEKR 344



 Score = 31.1 bits (70), Expect = 0.30
 Identities = 27/78 (34%), Positives = 44/78 (56%), Gaps = 1/78 (1%)

Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDK 162
            + D    EEEEEE+E  E+E   ++DE    +E+E        D+  ED+E+K+ EE++
Sbjct: 110 DSEDSADDEEEEEEDESLEDEMIDDEDEADLFNESESSLE-DLSDDETEDDEEKKMEEEE 168

Query: 163 QGEEKRVEDKIGDEKRLE 180
            GEEK   ++   EK+ +
Sbjct: 169 AGEEKESVEQATREKKFD 186



 Score = 28.0 bits (62), Expect = 3.8
 Identities = 20/81 (24%), Positives = 37/81 (45%), Gaps = 10/81 (12%)

Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK----------KEEEE 160
           E E   E+  ++E E ++++ +EE+E  +E+   E+  R +  +K           E  E
Sbjct: 143 ESESSLEDLSDDETEDDEEKKMEEEEAGEEKESVEQATREKKFDKSGVDDKFFKLDEMNE 202

Query: 161 DKQGEEKRVEDKIGDEKRLEE 181
             +  E   E  +GDE   E+
Sbjct: 203 FLEATEAEEEAALGDEDDFED 223


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 32.2 bits (74), Expect = 0.11
 Identities = 23/84 (27%), Positives = 38/84 (45%), Gaps = 27/84 (32%)

Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKR 168
            REEEEE+  +  EEE QE                         E ++++EE K+ E + 
Sbjct: 264 TREEEEEKILKAAEEERQE-------------------------EAQEKKEEKKKEEREA 298

Query: 169 VEDKIGDE--KRLEENKKKKTQKR 190
              K+  E  ++LEE ++KK  ++
Sbjct: 299 KLAKLSPEEQRKLEEKERKKQARK 322



 Score = 31.5 bits (72), Expect = 0.19
 Identities = 16/62 (25%), Positives = 33/62 (53%), Gaps = 2/62 (3%)

Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEE--KKEEEEDKQGEE 166
             +  EEEEE+  +  E+E+ E  +E + EK++   E        E  +K EE++++ + 
Sbjct: 261 VDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQA 320

Query: 167 KR 168
           ++
Sbjct: 321 RK 322



 Score = 30.7 bits (70), Expect = 0.36
 Identities = 14/52 (26%), Positives = 25/52 (48%)

Query: 91  KNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERR 142
           + IL   +  +     E   E+++EE E +  +   E+   LEE E +K+ R
Sbjct: 270 EKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQAR 321


>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon. 
          Length = 431

 Score = 32.7 bits (74), Expect = 0.11
 Identities = 23/61 (37%), Positives = 33/61 (54%)

Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
           E    EEEE+EE  EE EE E+ E + + E + + R  EE ++ E E + EEEE  +   
Sbjct: 107 ENETVEEEEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKEPEPEEEEKPKRGS 166

Query: 167 K 167
            
Sbjct: 167 L 167



 Score = 26.9 bits (59), Expect = 6.9
 Identities = 17/56 (30%), Positives = 29/56 (51%)

Query: 110 REEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
           REE  +  EEEE+  +QE+ +    +E EK R   E + R  +  +K ++  + G 
Sbjct: 223 REERRKVLEEEEQRRKQEEADRKSREEEEKRRLKEEIERRRAEAAEKRQKVPEDGL 278



 Score = 26.9 bits (59), Expect = 7.7
 Identities = 21/75 (28%), Positives = 36/75 (48%)

Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKR 168
           AR EE  +E   E  E Q++ +    D++  E     +++ G + E  EEEE ++  E+R
Sbjct: 64  ARREERRDERFSEALERQKEFKPTSTDQSLSEPSRRMQEDSGAENETVEEEEKEESREER 123

Query: 169 VEDKIGDEKRLEENK 183
            E +  +     E K
Sbjct: 124 EEVEETEGVTKSEQK 138



 Score = 26.9 bits (59), Expect = 8.6
 Identities = 22/80 (27%), Positives = 44/80 (55%), Gaps = 3/80 (3%)

Query: 96  ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEE- 154
           EN   + GA        +E E+ +++++E   + E L++   E+ + L EE++R + EE 
Sbjct: 183 ENTFSRGGAEGAQVEAGKEFEKLKQKQQEAALELEELKKKREERRKVLEEEEQRRKQEEA 242

Query: 155 --KKEEEEDKQGEEKRVEDK 172
             K  EEE+K+  ++ +E +
Sbjct: 243 DRKSREEEEKRRLKEEIERR 262


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 32.6 bits (75), Expect = 0.11
 Identities = 27/102 (26%), Positives = 52/102 (50%), Gaps = 10/102 (9%)

Query: 89  LKKNILCENQLFKAGARDETAREEEEEEEE-EEEEEEEQEQDEYLEEDENEKERRLPEED 147
           LKK  L E       A++E  +   E E E +E   E Q  +  L + E   +R++   D
Sbjct: 48  LKKEALLE-------AKEEVHKLRAELERELKERRNELQRLERRLLQREETLDRKMESLD 100

Query: 148 ERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQK 189
           ++ E+ EKKE+E   +  EK +++K  + + L   ++++ ++
Sbjct: 101 KKEENLEKKEKELSNK--EKNLDEKEEELEELIAEQREELER 140



 Score = 28.4 bits (64), Expect = 2.5
 Identities = 29/90 (32%), Positives = 47/90 (52%), Gaps = 12/90 (13%)

Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENE---KERRLPEE----DERGEDEEKKEEE 159
           ET ++E   E +EE  +   E +  L+E  NE    ERRL +     D + E  +KKEE 
Sbjct: 46  ETLKKEALLEAKEEVHKLRAELERELKERRNELQRLERRLLQREETLDRKMESLDKKEEN 105

Query: 160 EDKQGEEKRVEDKIGDEKRLEENKKKKTQK 189
            +K+  EK + +K   EK L+E +++  + 
Sbjct: 106 LEKK--EKELSNK---EKNLDEKEEELEEL 130


>gnl|CDD|222948 PHA02941, PHA02941, hypothetical protein; Provisional.
          Length = 356

 Score = 32.2 bits (73), Expect = 0.12
 Identities = 18/34 (52%), Positives = 23/34 (67%)

Query: 96  ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQD 129
           + QLF   A DE   EE +E++EEEE EEE+E D
Sbjct: 318 QEQLFNPFAIDEEMLEETQEQQEEEENEEEEEND 351


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 32.1 bits (74), Expect = 0.15
 Identities = 32/141 (22%), Positives = 69/141 (48%), Gaps = 9/141 (6%)

Query: 58  ITDLMVAVLVMPLGILTLVRVRTNLSGAVLRLKKNILCENQLFKAGARDETAREEEEEEE 117
           +  +++A++ + +G +    VR  ++ A ++  +    +  L +A    E  ++E   E 
Sbjct: 4   LIIILIALVALVVGAVIGYFVRKKIAEAKIKEAEE-EAKRILEEAKKEAEAIKKEALLEA 62

Query: 118 EEEEEEEEQEQDEYLEEDENE---KERRLPEE----DERGEDEEKKEEEEDKQGEEKRVE 170
           +EE  +   E ++ L E  NE    E+RL ++    D + E  EK+EEE +K+ +E   +
Sbjct: 63  KEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQK 122

Query: 171 DKIGDEKRLE-ENKKKKTQKR 190
            +  ++K  E E   ++  + 
Sbjct: 123 QQELEKKEEELEELIEEQLQE 143



 Score = 27.4 bits (62), Expect = 5.9
 Identities = 16/57 (28%), Positives = 35/57 (61%)

Query: 110 REEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
           R  ++EE  + + E  ++++E LE+ E E E++  E +++ E+ E+  EE+ ++ E 
Sbjct: 90  RLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQELER 146


>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
          Length = 211

 Score = 31.6 bits (72), Expect = 0.15
 Identities = 19/80 (23%), Positives = 39/80 (48%)

Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVE 170
           E ++ + E  EE+  +E +   E+   E++    E ++    E+ +E  E K  E K   
Sbjct: 4   ECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDEN 63

Query: 171 DKIGDEKRLEENKKKKTQKR 190
           +K+ +E +  EN+ +  + R
Sbjct: 64  NKLKEENKKLENELEALKDR 83



 Score = 30.1 bits (68), Expect = 0.46
 Identities = 18/67 (26%), Positives = 36/67 (53%)

Query: 120 EEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRL 179
            E+E +  + E +EED  ++     E+  + ED E +E E+++  E+    +++  E+  
Sbjct: 1   MEKECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELK 60

Query: 180 EENKKKK 186
           +EN K K
Sbjct: 61  DENNKLK 67



 Score = 30.1 bits (68), Expect = 0.51
 Identities = 24/83 (28%), Positives = 48/83 (57%), Gaps = 2/83 (2%)

Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
           E   +EE++ +EE+ E EE E++E +E+ E   E +   E+ + E+ + KEE +  + E 
Sbjct: 20  ENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVK--IEELKDENNKLKEENKKLENEL 77

Query: 167 KRVEDKIGDEKRLEENKKKKTQK 189
           + ++D++       +N +K+T K
Sbjct: 78  EALKDRLLRTVAEYDNYRKRTAK 100



 Score = 29.3 bits (66), Expect = 0.87
 Identities = 24/76 (31%), Positives = 45/76 (59%), Gaps = 2/76 (2%)

Query: 115 EEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIG 174
            E+E ++ + E  +++  +E+EN++E +  EED   E+ EK+E  ED +   +   +++ 
Sbjct: 1   MEKECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELK 60

Query: 175 DE--KRLEENKKKKTQ 188
           DE  K  EENKK + +
Sbjct: 61  DENNKLKEENKKLENE 76


>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
           subunit 1; Provisional.
          Length = 319

 Score = 31.9 bits (73), Expect = 0.16
 Identities = 26/82 (31%), Positives = 45/82 (54%), Gaps = 10/82 (12%)

Query: 85  AVLRLKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLP 144
           A+  +K+ I  +   FK     E    +EE+ EE  E+ EE+E+++   E E+E E    
Sbjct: 248 ALEAIKEVIKKKGGDFKVKGEPEVVGGDEEDLEELLEKAEEEEEEDDYSESEDEDE---- 303

Query: 145 EEDERGEDEEKKEEEEDKQGEE 166
                 EDE+++EEE+D +G++
Sbjct: 304 ------EDEDEEEEEDDDEGDK 319



 Score = 26.5 bits (59), Expect = 9.3
 Identities = 15/44 (34%), Positives = 26/44 (59%)

Query: 133 EEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDE 176
           EED  E   +  EE+E  +  E ++E+E+ + EE+  +D  GD+
Sbjct: 276 EEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGDK 319


>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
          Length = 330

 Score = 31.8 bits (73), Expect = 0.16
 Identities = 17/33 (51%), Positives = 21/33 (63%)

Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLE 133
              A+ + A  EEEEEEEEEEEEEE  ++E   
Sbjct: 291 VLSAQAQAAAAEEEEEEEEEEEEEEPSEEEAAA 323



 Score = 28.7 bits (65), Expect = 1.7
 Identities = 14/29 (48%), Positives = 21/29 (72%)

Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEED 135
            +A+ +    EEEEEEEEE+E++E  EE+
Sbjct: 292 LSAQAQAAAAEEEEEEEEEEEEEEPSEEE 320


>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein.  Function of MutS2 is
           unknown. It should not be considered a DNA mismatch
           repair protein. It is likely a DNA mismatch binding
           protein of unknown cellular function [DNA metabolism,
           Other].
          Length = 771

 Score = 32.1 bits (73), Expect = 0.17
 Identities = 23/83 (27%), Positives = 42/83 (50%), Gaps = 6/83 (7%)

Query: 112 EEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE----- 166
           E+    E+E E++ +  ++ L+E E  K+    E +E  E E  K+ E +K+ +E     
Sbjct: 518 EKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEAQEALKAL 577

Query: 167 -KRVEDKIGDEKRLEENKKKKTQ 188
            K VE  I + K  + +K K+ +
Sbjct: 578 KKEVESIIRELKEKKIHKAKEIK 600



 Score = 31.7 bits (72), Expect = 0.22
 Identities = 28/87 (32%), Positives = 45/87 (51%), Gaps = 6/87 (6%)

Query: 104 ARDETAREEEE-EEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDK 162
              E  + ++E E+E EE +E E+ +   LE++  E  + L +E E     E KE++  K
Sbjct: 537 LLKEQEKLKKELEQEMEELKERERNKKLELEKEAQEALKALKKEVES-IIRELKEKKIHK 595

Query: 163 QGEEKRVEDKIGDEKRLEENKKKKTQK 189
             E K +ED      +L+E K+K  QK
Sbjct: 596 AKEIKSIEDL----VKLKETKQKIPQK 618


>gnl|CDD|227880 COG5593, COG5593, Nucleic-acid-binding protein possibly involved in
           ribosomal biogenesis [Translation, ribosomal structure
           and biogenesis].
          Length = 821

 Score = 31.9 bits (72), Expect = 0.17
 Identities = 24/84 (28%), Positives = 41/84 (48%), Gaps = 7/84 (8%)

Query: 96  ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDE------YLEEDENEKERRLPEEDER 149
           E+    + + DE   +  ++E+ + E  +E +Q+E      Y  + E +      E  E 
Sbjct: 714 EDDFSDSTSDDEPKLDAIDDEDAKSEGSQESDQEEGLDEIFYSFDGEQDNSDSFAESSEE 773

Query: 150 GE-DEEKKEEEEDKQGEEKRVEDK 172
            E  EE+KEEEE+K+   KR + K
Sbjct: 774 DESSEEEKEEEENKEVSAKRAKKK 797



 Score = 28.9 bits (64), Expect = 1.7
 Identities = 16/89 (17%), Positives = 36/89 (40%), Gaps = 3/89 (3%)

Query: 96  ENQLFKAGARDETAREEEEEEEE---EEEEEEEQEQDEYLEEDENEKERRLPEEDERGED 152
           EN+++ A  +     E++ ++ E    E++  +   D+  + D  + E    E  +  + 
Sbjct: 687 ENEIWSALVKSRPDVEDDSDDSELDFAEDDFSDSTSDDEPKLDAIDDEDAKSEGSQESDQ 746

Query: 153 EEKKEEEEDKQGEEKRVEDKIGDEKRLEE 181
           EE  +E       E+   D   +    +E
Sbjct: 747 EEGLDEIFYSFDGEQDNSDSFAESSEEDE 775



 Score = 28.1 bits (62), Expect = 3.1
 Identities = 16/74 (21%), Positives = 34/74 (45%), Gaps = 9/74 (12%)

Query: 106 DETAREEEEEEEEEEE--------EEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKE 157
           DE A+ E  +E ++EE         + EQ+  +    + +E++    EE E  E++E   
Sbjct: 733 DEDAKSEGSQESDQEEGLDEIFYSFDGEQDNSDSFA-ESSEEDESSEEEKEEEENKEVSA 791

Query: 158 EEEDKQGEEKRVED 171
           +   K+  +  ++ 
Sbjct: 792 KRAKKKQRKNMLKS 805



 Score = 28.1 bits (62), Expect = 4.0
 Identities = 16/83 (19%), Positives = 37/83 (44%), Gaps = 2/83 (2%)

Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEED--ERGEDEEKKEE 158
           K    ++ + + ++E +E E      +    +E+D ++ E    E+D  +   D+E K +
Sbjct: 670 KGKKSNKASFDSDDEMDENEIWSALVKSRPDVEDDSDDSELDFAEDDFSDSTSDDEPKLD 729

Query: 159 EEDKQGEEKRVEDKIGDEKRLEE 181
             D +  +     +   E+ L+E
Sbjct: 730 AIDDEDAKSEGSQESDQEEGLDE 752


>gnl|CDD|217861 pfam04050, Upf2, Up-frameshift suppressor 2.  Transcripts
           harbouring premature signals for translation termination
           are recognised and rapidly degraded by eukaryotic cells
           through a pathway known as nonsense-mediated mRNA decay.
           In Saccharomyces cerevisiae, three trans-acting factors
           (Upf1 to Upf3) are required for nonsense-mediated mRNA
           decay.
          Length = 171

 Score = 31.2 bits (71), Expect = 0.18
 Identities = 18/59 (30%), Positives = 30/59 (50%), Gaps = 2/59 (3%)

Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRV 169
            E E ++ EE+EE  +E ++    DE E +    E+DE  + EE  E+    + EE+  
Sbjct: 3   SESESDDGEEDEELPEEDEDDESSDEEEVDLPDDEQDEESDSEE--EQIFVTRQEEEVD 59



 Score = 30.0 bits (68), Expect = 0.50
 Identities = 15/60 (25%), Positives = 29/60 (48%), Gaps = 1/60 (1%)

Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDK 162
               E   ++E  +EEE +  ++++ +E   E+E     R  EE+   E E + + E +K
Sbjct: 14  EELPEEDEDDESSDEEEVDLPDDEQDEESDSEEEQIFVTR-QEEEVDPEAEAEFDREFEK 72



 Score = 29.3 bits (66), Expect = 0.92
 Identities = 19/82 (23%), Positives = 44/82 (53%), Gaps = 2/82 (2%)

Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDE--RGEDEEKKEEEEDK 162
            D+   +EE  EE+E++E  ++E+ +  +++++E+     E+    R E+E   E E + 
Sbjct: 7   SDDGEEDEELPEEDEDDESSDEEEVDLPDDEQDEESDSEEEQIFVTRQEEEVDPEAEAEF 66

Query: 163 QGEEKRVEDKIGDEKRLEENKK 184
             E +++  +  + ++ E  KK
Sbjct: 67  DREFEKMMAESLESRKFESRKK 88


>gnl|CDD|218601 pfam05477, SURF2, Surfeit locus protein 2 (SURF2).  Surfeit locus
           protein 2 is part of a group of at least six sequence
           unrelated genes (Surf-1 to Surf-6). The six Surfeit
           genes have been classified as housekeeping genes, being
           expressed in all tissue types tested and not containing
           a TATA box in their promoter region. The exact function
           of SURF2 is unknown.
          Length = 244

 Score = 31.4 bits (71), Expect = 0.19
 Identities = 16/81 (19%), Positives = 33/81 (40%), Gaps = 7/81 (8%)

Query: 116 EEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGED------EEKKEEEEDKQGEEKRV 169
            +E++ + E+   D Y  E    K     +  +  +D      +E + E  + Q +  + 
Sbjct: 160 SDEDDSDSEDSMSDLYPPELFTLKNPGKEQNGDEDDDFETDDEDEMEVESPELQQKRSKK 219

Query: 170 EDKIGDEKRLEENKKKKTQKR 190
           +      K+ ++N KKK   R
Sbjct: 220 QSG-SLTKKFKKNHKKKGPFR 239


>gnl|CDD|221733 pfam12720, DUF3807, Protein of unknown function (DUF3807).  This is
           a family of conserved fungal proteins of unknown
           function.
          Length = 169

 Score = 31.3 bits (71), Expect = 0.21
 Identities = 17/73 (23%), Positives = 27/73 (36%), Gaps = 2/73 (2%)

Query: 110 REEEEEEEEEEEEEEE-QEQDEYLEEDENEKERRLPEEDERGEDEEK-KEEEEDKQGEEK 167
           RE E +EE E EEE E     +           R   E +    + K   ++E K  ++ 
Sbjct: 67  RERELKEEAEAEEEGEVDASPDAGAVAGESSADRKEAEQQGAAQKRKSCRDKERKSAKDP 126

Query: 168 RVEDKIGDEKRLE 180
           R   +   +K   
Sbjct: 127 RGGTQDVVDKSQA 139



 Score = 28.6 bits (64), Expect = 1.6
 Identities = 15/70 (21%), Positives = 26/70 (37%), Gaps = 7/70 (10%)

Query: 117 EEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDE 176
           E E +EE E E     EE E +              + K+ E++    + K   DK  + 
Sbjct: 68  ERELKEEAEAE-----EEGEVDASPDAGAVAGESSADRKEAEQQGAAQKRKSCRDK--ER 120

Query: 177 KRLEENKKKK 186
           K  ++ +   
Sbjct: 121 KSAKDPRGGT 130


>gnl|CDD|218241 pfam04750, Far-17a_AIG1, FAR-17a/AIG1-like protein.  This family
          includes the hamster androgen-induced FAR-17a protein,
          and its human homologue, the AIG1 protein. The function
          of these proteins is unknown. This family also includes
          homologous regions from a number of other metazoan
          proteins.
          Length = 203

 Score = 31.1 bits (71), Expect = 0.21
 Identities = 15/78 (19%), Positives = 32/78 (41%), Gaps = 12/78 (15%)

Query: 1  METEVSFGLDAASWNIWAVVTLILVFCTATGNILVCLA--ISWEKRLQNVTNYFLMSLAI 58
               +  +++A    +  +T I +   A    L  LA  +   ++L+           +
Sbjct: 20 DHQLPTLEIESAYGGKFKYLTFIGLVLQALYFTLALLADVVRLSRKLRK----------L 69

Query: 59 TDLMVAVLVMPLGILTLV 76
           D ++AVL  P+G++  V
Sbjct: 70 RDFLLAVLAFPVGVIVSV 87


>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein.  The proteins in this family
           are designated YL1. These proteins have been shown to be
           DNA-binding and may be a transcription factor.
          Length = 238

 Score = 31.2 bits (71), Expect = 0.21
 Identities = 25/74 (33%), Positives = 39/74 (52%)

Query: 117 EEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDE 176
           EEE EE+E        EE+E+++E  + EE+E  E +   ++ ED + E    E+   + 
Sbjct: 19  EEELEEDEFFWTYLLFEEEEDDEEFEIEEEEEEEEVDSDFDDSEDDEPESDDEEEGEKEL 78

Query: 177 KRLEENKKKKTQKR 190
           +R E  KKKK  K 
Sbjct: 79  QREERLKKKKRVKT 92



 Score = 30.0 bits (68), Expect = 0.56
 Identities = 26/80 (32%), Positives = 43/80 (53%), Gaps = 7/80 (8%)

Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVE 170
           EE++EE E EEEEEE+E D   ++ E+       +E E  ++EE ++E + ++  +K+  
Sbjct: 37  EEDDEEFEIEEEEEEEEVDSDFDDSED-------DEPESDDEEEGEKELQREERLKKKKR 89

Query: 171 DKIGDEKRLEENKKKKTQKR 190
            K    K   + KKKK    
Sbjct: 90  VKTKAYKEPTKKKKKKDPTA 109


>gnl|CDD|227666 COG5374, COG5374, Uncharacterized conserved protein [Function
           unknown].
          Length = 192

 Score = 31.3 bits (71), Expect = 0.21
 Identities = 9/44 (20%), Positives = 19/44 (43%)

Query: 122 EEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
           +   ++    LE  +  +E      D+  E  E+ ++E  K+ E
Sbjct: 149 KARLRKAQILLEGLQKNQEELFKLLDKYNELREQVQKESSKKKE 192


>gnl|CDD|220112 pfam09110, HAND, HAND.  The HAND domain adopts a secondary
           structure consisting of four alpha helices, three of
           which (H2, H3, H4) form an L-like configuration. Helix
           H2 runs antiparallel to helices H3 and H4, packing
           closely against helix H4, whilst helix H1 reposes in the
           concave surface formed by these three helices and runs
           perpendicular to them. The domain confers DNA and
           nucleosome binding properties to the protein.
          Length = 109

 Score = 30.3 bits (69), Expect = 0.21
 Identities = 19/50 (38%), Positives = 28/50 (56%), Gaps = 4/50 (8%)

Query: 100 FKAGARDETAREEEEEEEEEEEEEEEQEQ----DEYLEEDENEKERRLPE 145
           +K    D    +EE+EEE E + + EQE+    +   EE+E EK+R L E
Sbjct: 59  YKVPLDDVGDGDEEDEEEREAKRKLEQEKIDNAEPLTEEEEEEKQRLLEE 108


>gnl|CDD|235302 PRK04456, PRK04456, acetyl-CoA decarbonylase/synthase complex
           subunit beta; Reviewed.
          Length = 463

 Score = 31.6 bits (72), Expect = 0.22
 Identities = 16/24 (66%), Positives = 17/24 (70%)

Query: 107 ETAREEEEEEEEEEEEEEEQEQDE 130
           E    EEEEEEEEEEEEEE+   E
Sbjct: 400 ERWAAEEEEEEEEEEEEEEEPVAE 423



 Score = 30.4 bits (69), Expect = 0.61
 Identities = 17/37 (45%), Positives = 20/37 (54%)

Query: 108 TAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLP 144
             R   EEEEEEEEEEEE+E+           E +LP
Sbjct: 399 VERWAAEEEEEEEEEEEEEEEPVAEVMMMPAPEMQLP 435


>gnl|CDD|216235 pfam00999, Na_H_Exchanger, Sodium/hydrogen exchanger family.  Na/H
           antiporters are key transporters in maintaining the pH
           of actively metabolising cells. The molecular mechanisms
           of antiport are unclear. These antiporters contain 10-12
           transmembrane regions (M) at the amino-terminus and a
           large cytoplasmic region at the carboxyl terminus. The
           transmembrane regions M3-M12 share identity with other
           members of the family. The M6 and M7 regions are highly
           conserved. Thus, this is thought to be the region that
           is involved in the transport of sodium and hydrogen
           ions. The cytoplasmic region has little similarity
           throughout the family.
          Length = 370

 Score = 31.4 bits (72), Expect = 0.22
 Identities = 12/77 (15%), Positives = 25/77 (32%), Gaps = 10/77 (12%)

Query: 5   VSFGL----DAASWNIWAVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITD 60
           VS GL     +   ++  +V L+LV     G +L    ++    L       +       
Sbjct: 271 VSVGLSLDLSSLLLSLLLLVLLLLVAILL-GKLLGVFLLARLLGLSLREALIVGF----- 324

Query: 61  LMVAVLVMPLGILTLVR 77
             +    + L +  +  
Sbjct: 325 GGLQRGAVSLALAAIGL 341


>gnl|CDD|185618 PTZ00438, PTZ00438, gamete antigen 27/25-like protein; Provisional.
          Length = 374

 Score = 31.6 bits (71), Expect = 0.23
 Identities = 29/89 (32%), Positives = 46/89 (51%), Gaps = 2/89 (2%)

Query: 96  ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK 155
           EN +   G       EEE   ++EEEE+E+ E+ E +EE E  +E    +ED   +DE++
Sbjct: 85  ENDVELEGLNIIVKNEEERGTQKEEEEDEDVEEIEEVEEVEVVEEEYDDDEDSEKDDEKE 144

Query: 156 KEEE--EDKQGEEKRVEDKIGDEKRLEEN 182
            + E  E++   E  +E+   D   L EN
Sbjct: 145 SDAEGDENELAGEYIIEEVDDDVAILNEN 173


>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
           biogenesis [Translation, ribosomal structure and
           biogenesis].
          Length = 1077

 Score = 31.6 bits (71), Expect = 0.23
 Identities = 26/98 (26%), Positives = 41/98 (41%), Gaps = 13/98 (13%)

Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKER-------------RLPEED 147
           K    D    EEE  +++E+   E+ E +E   ++E E+ R              +  E 
Sbjct: 582 KDATLDSIEGEEELIQDDEKGNFEDLEDEENSSDNEMEESRGSSVTAENEESADEVDYET 641

Query: 148 ERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKK 185
           ER E+  KKEE       E+R + +  D     E K+K
Sbjct: 642 EREENARKKEELRGNFELEERGDPEKKDVDWYTEEKRK 679


>gnl|CDD|224530 COG1614, CdhC, CO dehydrogenase/acetyl-CoA synthase beta subunit
           [Energy production and conversion].
          Length = 470

 Score = 31.3 bits (71), Expect = 0.24
 Identities = 15/30 (50%), Positives = 17/30 (56%)

Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDE 136
           E   EEEEEEEEEEEEE  + +    E   
Sbjct: 399 ERWAEEEEEEEEEEEEEAAEAEAPMEEPVP 428



 Score = 30.2 bits (68), Expect = 0.72
 Identities = 16/30 (53%), Positives = 18/30 (60%)

Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENE 138
            R  EEEEEEEEEEEEE  + E   E+   
Sbjct: 399 ERWAEEEEEEEEEEEEEAAEAEAPMEEPVP 428



 Score = 29.8 bits (67), Expect = 0.91
 Identities = 13/30 (43%), Positives = 15/30 (50%)

Query: 104 ARDETAREEEEEEEEEEEEEEEQEQDEYLE 133
           A +E   EEEEEEE  E E   +E     E
Sbjct: 402 AEEEEEEEEEEEEEAAEAEAPMEEPVPGFE 431



 Score = 27.9 bits (62), Expect = 3.3
 Identities = 17/52 (32%), Positives = 27/52 (51%)

Query: 119 EEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVE 170
           EE+     E  E+L+E  +    R  EE+E  E+EE++E  E +   E+ V 
Sbjct: 377 EEDATTIDELREFLKEKGHPVVERWAEEEEEEEEEEEEEAAEAEAPMEEPVP 428



 Score = 27.9 bits (62), Expect = 4.0
 Identities = 15/33 (45%), Positives = 19/33 (57%)

Query: 113 EEEEEEEEEEEEEQEQDEYLEEDENEKERRLPE 145
            EEEEEEEEEEEE+  +     +E      +PE
Sbjct: 402 AEEEEEEEEEEEEEAAEAEAPMEEPVPGFEVPE 434


>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
          Length = 106

 Score = 29.9 bits (68), Expect = 0.24
 Identities = 15/32 (46%), Positives = 21/32 (65%)

Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQDEYLE 133
           A A    A  EE++EEEEEEEE+E+ ++E   
Sbjct: 68  AAAAAAAAAAEEKKEEEEEEEEKEESEEEAAA 99



 Score = 29.2 bits (66), Expect = 0.46
 Identities = 15/29 (51%), Positives = 20/29 (68%)

Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQDE 130
           A A    A   EE++EEEEEEEE++E +E
Sbjct: 67  AAAAAAAAAAAEEKKEEEEEEEEKEESEE 95



 Score = 28.8 bits (65), Expect = 0.80
 Identities = 15/29 (51%), Positives = 19/29 (65%)

Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQDE 130
           A A    A    EE++EEEEEEEE+E+ E
Sbjct: 66  AAAAAAAAAAAAEEKKEEEEEEEEKEESE 94



 Score = 26.8 bits (60), Expect = 3.6
 Identities = 15/34 (44%), Positives = 20/34 (58%)

Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQDEYLEED 135
           A A    A      EE++EEEEEE+E++E  EE 
Sbjct: 64  AAAAAAAAAAAAAAEEKKEEEEEEEEKEESEEEA 97



 Score = 26.5 bits (59), Expect = 4.6
 Identities = 14/34 (41%), Positives = 19/34 (55%)

Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQDEYLEED 135
           A A    A       EE++EEEEE+E+ E  EE+
Sbjct: 63  AAAAAAAAAAAAAAAEEKKEEEEEEEEKEESEEE 96


>gnl|CDD|129888 TIGR00806, rfc, RFC reduced folate carrier.  The Reduced Folate
           Carrier (RFC) Family (TC 2.A.48) Members of the RFC
           family mediate the uptake of folate, reduce folate,
           derivatives of reduced folate and the drug,
           methotrexate. Proteins of the RFC family are so-far
           restricted to animals. RFC proteins possess 12 putative
           transmembrane a-helical spanners (TMSs) and evidence for
           a 12 TMS topology has been published for the human RFC.
           The RFC transporters appear to transport reduced folate
           by an energy-dependent, pH-dependent, Na+-independent
           mechanism. Folate:H+ symport, folate:OH- antiport and
           folate:anion antiport mechanisms have been proposed, but
           the energetic mechanism is not well defined [Transport
           and binding proteins, Carbohydrates, organic alcohols,
           and acids].
          Length = 511

 Score = 31.4 bits (71), Expect = 0.25
 Identities = 37/172 (21%), Positives = 60/172 (34%), Gaps = 40/172 (23%)

Query: 14  WNIW-----AVVTLI---LVFCTA-TGNILVC---------------------LAISWEK 43
           W  W     AVV+ I   LVF  + + +I V                      +A S  K
Sbjct: 325 WARWSKLLIAVVSAIQAGLVFWMSQSHDIWVLYVTYVLFRGIYQFLVPIATFQIASSLSK 384

Query: 44  RLQNVT---NYFLMSLAITDLMVAVLVMPLGILTLVRVR------TNLSGAVLRLKKNIL 94
            L  +    N F+ + A+  ++ AV+    G+   VR +        L  +++      L
Sbjct: 385 ELCALVFGINTFV-ATALKTIITAVVSDKRGLGLQVRKQFRIYSVYFLVLSIICFFGAGL 443

Query: 95  CENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEE 146
              +  K G     A  +E     EE  +    QD  L   +    + LPE+
Sbjct: 444 DGLRYCKRGTHQPLAPAQELRSPLEESVQAISLQDRDLRGPQPSAPQLLPED 495


>gnl|CDD|237378 PRK13406, bchD, magnesium chelatase subunit D; Provisional.
          Length = 584

 Score = 31.5 bits (72), Expect = 0.26
 Identities = 10/40 (25%), Positives = 19/40 (47%), Gaps = 2/40 (5%)

Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEE 146
               EEE        E+++   ++  EE++++ E R  EE
Sbjct: 262 PQPPEEEPPPPPPPPEDDDDPPED--EEEQDDAEDRALEE 299



 Score = 30.8 bits (70), Expect = 0.46
 Identities = 9/47 (19%), Positives = 16/47 (34%), Gaps = 7/47 (14%)

Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKE 157
              +  EEE        +D+    ++        EE +  ED   +E
Sbjct: 260 APPQPPEEEPPPPPPPPEDDDDPPED-------EEEQDDAEDRALEE 299



 Score = 28.8 bits (65), Expect = 1.7
 Identities = 8/41 (19%), Positives = 17/41 (41%)

Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDER 149
           A  +  EEE        ++ D+  E++E + +      +E 
Sbjct: 260 APPQPPEEEPPPPPPPPEDDDDPPEDEEEQDDAEDRALEEI 300



 Score = 26.9 bits (60), Expect = 7.3
 Identities = 8/35 (22%), Positives = 17/35 (48%)

Query: 104 ARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENE 138
             +E        E++++  E+E+EQD+  +    E
Sbjct: 265 PEEEPPPPPPPPEDDDDPPEDEEEQDDAEDRALEE 299


>gnl|CDD|184885 PRK14891, PRK14891, 50S ribosomal protein L24e/unknown domain
           fusion protein; Provisional.
          Length = 131

 Score = 30.3 bits (68), Expect = 0.26
 Identities = 20/67 (29%), Positives = 29/67 (43%)

Query: 104 ARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ 163
           A    A E  EE E  + +E+  E  E    DE ++E    E  +   DE   E EE  +
Sbjct: 64  AAATAAAEAAEEAEAADADEDADEAAEADAADEADEEEETDEAVDETADEADAEAEEADE 123

Query: 164 GEEKRVE 170
            E++  E
Sbjct: 124 EEDEEAE 130



 Score = 30.3 bits (68), Expect = 0.31
 Identities = 19/75 (25%), Positives = 33/75 (44%)

Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
           E  R E+          E  E+ E  + DE+  E    +  +  ++EE+ +E  D+  +E
Sbjct: 54  EAGRAEKGPAAAATAAAEAAEEAEAADADEDADEAAEADAADEADEEEETDEAVDETADE 113

Query: 167 KRVEDKIGDEKRLEE 181
              E +  DE+  EE
Sbjct: 114 ADAEAEEADEEEDEE 128



 Score = 30.3 bits (68), Expect = 0.34
 Identities = 16/65 (24%), Positives = 30/65 (46%)

Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEED 161
             A +     E  + +E+ +E  E +  +  +E+E   E      DE   + E+ +EEED
Sbjct: 67  TAAAEAAEEAEAADADEDADEAAEADAADEADEEEETDEAVDETADEADAEAEEADEEED 126

Query: 162 KQGEE 166
           ++ E 
Sbjct: 127 EEAEA 131


>gnl|CDD|115196 pfam06524, NOA36, NOA36 protein.  This family consists of several
           NOA36 proteins which contain 29 highly conserved
           cysteine residues. The function of this protein is
           unknown.
          Length = 314

 Score = 31.1 bits (70), Expect = 0.28
 Identities = 8/41 (19%), Positives = 19/41 (46%)

Query: 112 EEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGED 152
           +++E     + +EE++ D+   ED ++      +ED     
Sbjct: 274 DDDEGSSSNDYDEEEDGDDDDNEDNDDTNTNHGDEDCDNGA 314


>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740).  This
           family consists of several uncharacterized plant
           proteins of unknown function.
          Length = 565

 Score = 31.2 bits (70), Expect = 0.31
 Identities = 27/80 (33%), Positives = 35/80 (43%), Gaps = 24/80 (30%)

Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVE 170
            EEEEE E EE+EE  E                 +E  +  DE+ +EEEE    E K ++
Sbjct: 140 LEEEEEVEMEEDEEYYE-----------------KEPGKVVDEKSEEEEE---EELKTMK 179

Query: 171 DKIGDEKRLEENKKKKTQKR 190
           D I     LE   KK + K 
Sbjct: 180 DFI----DLESQTKKPSVKD 195



 Score = 31.2 bits (70), Expect = 0.32
 Identities = 18/69 (26%), Positives = 34/69 (49%), Gaps = 1/69 (1%)

Query: 119 EEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDK-QGEEKRVEDKIGDEK 177
           +++EE         E D   ++  +P+     E+E + EE+E+  + E  +V D+  +E+
Sbjct: 111 DDDEENLPSSIAPPEIDPEPRKPIVPDLVLEEEEEVEMEEDEEYYEKEPGKVVDEKSEEE 170

Query: 178 RLEENKKKK 186
             EE K  K
Sbjct: 171 EEEELKTMK 179



 Score = 30.4 bits (68), Expect = 0.53
 Identities = 27/93 (29%), Positives = 40/93 (43%), Gaps = 7/93 (7%)

Query: 102 AGARDETAREEEEEEEEEEEEEE-EQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
               D   R+    +   EEEEE E E+DE  E  E E  + + E+ E  E+EE K  ++
Sbjct: 123 PPEIDPEPRKPIVPDLVLEEEEEVEMEEDE--EYYEKEPGKVVDEKSEEEEEEELKTMKD 180

Query: 161 ----DKQGEEKRVEDKIGDEKRLEENKKKKTQK 189
               + Q ++  V+D             KK QK
Sbjct: 181 FIDLESQTKKPSVKDNGKSFWSAASVFSKKLQK 213


>gnl|CDD|227701 COG5414, COG5414, TATA-binding protein-associated factor
           [Transcription].
          Length = 392

 Score = 31.2 bits (70), Expect = 0.31
 Identities = 17/74 (22%), Positives = 41/74 (55%)

Query: 117 EEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDE 176
           E E +E  E ++++  EE EN +  +   + +R ++  +++EE+D+  E +R  + + DE
Sbjct: 283 EIENKEVSEGDKEQQQEEVENAEAHKEEVQSDRPDEIGEEKEEDDENEENERHTELLADE 342

Query: 177 KRLEENKKKKTQKR 190
               E   ++ +++
Sbjct: 343 LNELEKGIEEKRRQ 356



 Score = 29.3 bits (65), Expect = 1.3
 Identities = 14/65 (21%), Positives = 29/65 (44%)

Query: 96  ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK 155
           E Q  +    +    E + +  +E  EE+E++ +    E   E       E E+G +E++
Sbjct: 295 EQQQEEVENAEAHKEEVQSDRPDEIGEEKEEDDENEENERHTELLADELNELEKGIEEKR 354

Query: 156 KEEEE 160
           ++ E 
Sbjct: 355 RQMES 359



 Score = 27.0 bits (59), Expect = 7.2
 Identities = 21/73 (28%), Positives = 34/73 (46%), Gaps = 3/73 (4%)

Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
            + A EE EE   EE+ +    + E  E  E +KE+   +++E    E  KEE +  + +
Sbjct: 261 KQGAEEEGEEGMSEEDLDVGAAEIENKEVSEGDKEQ---QQEEVENAEAHKEEVQSDRPD 317

Query: 166 EKRVEDKIGDEKR 178
           E   E +  DE  
Sbjct: 318 EIGEEKEEDDENE 330


>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain.  Axonemal
           dynein light chain proteins play a dynamic role in
           flagellar and cilia motility. Eukaryotic cilia and
           flagella are complex organelles consisting of a core
           structure, the axoneme, which is composed of nine
           microtubule doublets forming a cylinder that surrounds a
           pair of central singlet microtubules. This
           ultra-structural arrangement seems to be one of the most
           stable micro-tubular assemblies known and is responsible
           for the flagellar and ciliary movement of a large number
           of organisms ranging from protozoan to mammals. This
           light chain interacts directly with the N-terminal half
           of the heavy chains.
          Length = 189

 Score = 30.6 bits (70), Expect = 0.33
 Identities = 27/72 (37%), Positives = 48/72 (66%), Gaps = 3/72 (4%)

Query: 117 EEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDE 176
           + E+ + E +++ + LEE++ E E+R+ E + + E  EK+EEEE +Q EEKR  D+I   
Sbjct: 117 QAEQGKSELEQEIKKLEEEKEELEKRVAELEAKLEAIEKREEEE-RQIEEKRHADEIAFL 175

Query: 177 KRLEENKKKKTQ 188
           K+  +N++ K+Q
Sbjct: 176 KK--QNQQLKSQ 185


>gnl|CDD|218896 pfam06098, Radial_spoke_3, Radial spoke protein 3.  This family
           consists of several radial spoke protein 3 (RSP3)
           sequences. Eukaryotic cilia and flagella present in
           diverse types of cells perform motile, sensory, and
           developmental functions in organisms from protists to
           humans. They are centred by precisely organised,
           microtubule-based structures, the axonemes. The axoneme
           consists of two central singlet microtubules, called the
           central pair, and nine outer doublet microtubules. These
           structures are well-conserved during evolution. The
           outer doublet microtubules, each composed of A and B
           sub-fibres, are connected to each other by nexin links,
           while the central pair is held at the centre of the
           axoneme by radial spokes. The radial spokes are T-shaped
           structures extending from the A-tubule of each outer
           doublet microtubule to the centre of the axoneme. Radial
           spoke protein 3 (RSP3), is present at the proximal end
           of the spoke stalk and helps in anchoring the radial
           spoke to the outer doublet. It is thought that radial
           spokes regulate the activity of inner arm dynein through
           protein phosphorylation and dephosphorylation.
          Length = 288

 Score = 30.8 bits (70), Expect = 0.33
 Identities = 20/64 (31%), Positives = 36/64 (56%)

Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
           E A  E  EEEE  E  ++Q Q E     E  + +RL E + R  +E+++ +++DK+ ++
Sbjct: 142 EQALLEVLEEEELAELRQQQRQFEQRRNAELAETQRLEEAERRRREEKERRKKQDKERKQ 201

Query: 167 KRVE 170
           +  E
Sbjct: 202 REKE 205


>gnl|CDD|215369 PLN02685, PLN02685, iron superoxide dismutase.
          Length = 299

 Score = 30.7 bits (69), Expect = 0.36
 Identities = 12/31 (38%), Positives = 19/31 (61%)

Query: 110 REEEEEEEEEEEEEEEQEQDEYLEEDENEKE 140
           RE+EE   E+EEE + +  + YL+ D +  E
Sbjct: 267 REQEETRTEDEEEPDSEAVEMYLDSDIDVSE 297


>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein.  This
           family includes archaebacterial L12, eukaryotic P0, P1
           and P2.
          Length = 88

 Score = 29.1 bits (66), Expect = 0.36
 Identities = 14/28 (50%), Positives = 18/28 (64%)

Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQD 129
           A A    A    EEE++EEEEEEE++ D
Sbjct: 55  AAAAAAAAAAAAEEEKKEEEEEEEEDDD 82



 Score = 27.2 bits (61), Expect = 2.1
 Identities = 10/20 (50%), Positives = 15/20 (75%)

Query: 111 EEEEEEEEEEEEEEEQEQDE 130
                EEE++EEEEE+E+D+
Sbjct: 62  AAAAAEEEKKEEEEEEEEDD 81



 Score = 26.8 bits (60), Expect = 2.3
 Identities = 14/28 (50%), Positives = 17/28 (60%)

Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQD 129
           A A    A     EEE++EEEEEE+E D
Sbjct: 54  AAAAAAAAAAAAAEEEKKEEEEEEEEDD 81


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 31.1 bits (71), Expect = 0.37
 Identities = 14/62 (22%), Positives = 30/62 (48%)

Query: 125 EQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKK 184
           E E++E    + +  ER+   + +R  +++ ++EE +K   +K+ E      K  +   K
Sbjct: 395 EGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETK 454

Query: 185 KK 186
           K 
Sbjct: 455 KV 456



 Score = 26.8 bits (60), Expect = 8.9
 Identities = 11/63 (17%), Positives = 27/63 (42%), Gaps = 2/63 (3%)

Query: 96  ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK 155
           E +  +   R    + E +EE E+   +++ E      +  + + ++  + D  GE   +
Sbjct: 410 ERKKLRKKQRKAEKKAE-KEEAEKAAAKKKAEAAAKKAKGPDGETKK-VDPDPLGEKLAR 467

Query: 156 KEE 158
            E+
Sbjct: 468 TED 470


>gnl|CDD|240285 PTZ00135, PTZ00135, 60S acidic ribosomal protein P0; Provisional.
          Length = 310

 Score = 30.8 bits (70), Expect = 0.39
 Identities = 10/26 (38%), Positives = 11/26 (42%)

Query: 102 AGARDETAREEEEEEEEEEEEEEEQE 127
           A A    A         EEEEEEE +
Sbjct: 278 AAAAAAAAAAAAAAAPAEEEEEEEDD 303



 Score = 29.6 bits (67), Expect = 0.78
 Identities = 11/39 (28%), Positives = 13/39 (33%)

Query: 91  KNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQD 129
           K  L       A A    A          EEEEEE++  
Sbjct: 266 KEALANPSAAAAAAAAAAAAAAAAAAAPAEEEEEEEDDM 304



 Score = 27.3 bits (61), Expect = 4.6
 Identities = 7/22 (31%), Positives = 10/22 (45%)

Query: 109 AREEEEEEEEEEEEEEEQEQDE 130
           A            EEEE+E+D+
Sbjct: 282 AAAAAAAAAAAPAEEEEEEEDD 303



 Score = 26.9 bits (60), Expect = 6.9
 Identities = 7/29 (24%), Positives = 11/29 (37%)

Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQDE 130
           A A    A             EEE+E+++
Sbjct: 274 AAAAAAAAAAAAAAAAAAAPAEEEEEEED 302


>gnl|CDD|215656 pfam00012, HSP70, Hsp70 protein.  Hsp70 chaperones help to fold
           many proteins. Hsp70 assisted folding involves repeated
           cycles of substrate binding and release. Hsp70 activity
           is ATP dependent. Hsp70 proteins are made up of two
           regions: the amino terminus is the ATPase domain and the
           carboxyl terminus is the substrate binding region.
          Length = 598

 Score = 31.0 bits (71), Expect = 0.39
 Identities = 26/106 (24%), Positives = 49/106 (46%), Gaps = 23/106 (21%)

Query: 97  NQLFKAGARDETAREEEEEEE-------EEEEEEEEQEQDEYLEEDENEKERRLPEEDER 149
           N +    A+D+   +E++          ++E E   ++ +EY  ED+  KER     + +
Sbjct: 475 NGILTVSAKDKGTGKEQKITITASSGLSDDEIERMVKDAEEYAAEDKKRKER----IEAK 530

Query: 150 GEDEE-------KKEEEEDK--QGEEKRVEDKIGDEKRLEENKKKK 186
            E EE         +EE DK  + ++K+VE+ I     L+E  + +
Sbjct: 531 NEAEEYVYSLEKSLKEEGDKLPEADKKKVEEAIEW---LKEELEGE 573



 Score = 27.6 bits (62), Expect = 4.5
 Identities = 22/90 (24%), Positives = 42/90 (46%), Gaps = 6/90 (6%)

Query: 106 DETAREEEEEE--EEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ 163
                ++E E   ++ EE   E ++ +   E +NE E  +   ++  ++E  K  E DK+
Sbjct: 498 SSGLSDDEIERMVKDAEEYAAEDKKRKERIEAKNEAEEYVYSLEKSLKEEGDKLPEADKK 557

Query: 164 GEEKRVED----KIGDEKRLEENKKKKTQK 189
             E+ +E       G++K   E K ++ QK
Sbjct: 558 KVEEAIEWLKEELEGEDKEEIEAKTEELQK 587


>gnl|CDD|215212 PLN02372, PLN02372, violaxanthin de-epoxidase.
          Length = 455

 Score = 31.0 bits (70), Expect = 0.40
 Identities = 23/92 (25%), Positives = 42/92 (45%), Gaps = 5/92 (5%)

Query: 104 ARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEED-ERGEDEEKKEEEEDK 162
            R E   EE E+   +E  + E+E ++ +E+   E+E        E G  E +++EE   
Sbjct: 364 ERLEKDVEEGEKTIVKEARQIEEELEKEVEKLGKEEESLFKRVALEEGLKELEQDEENFL 423

Query: 163 QG---EEKRVEDKIGDE-KRLEENKKKKTQKR 190
           +    EEK + +K+  E   +E+   +    R
Sbjct: 424 KELSKEEKELLEKLKMEASEVEKLFGRALPVR 455


>gnl|CDD|221333 pfam11942, Spt5_N, Spt5 transcription elongation factor, acidic
           N-terminal.  This is the very acidic N-terminal region
           of the early transcription elongation factor Spt5. The
           Spt5-Spt4 complex regulates early transcription
           elongation by RNA polymerase II and has an imputed role
           in pre-mRNA processing via its physical association with
           mRNA capping enzymes. The actual function of this
           N-terminal domain is not known although it is
           dispensable for binding to Spt4.
          Length = 92

 Score = 29.3 bits (66), Expect = 0.41
 Identities = 24/50 (48%), Positives = 36/50 (72%)

Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
           EEEEEEEEE++ E+  ++DE+++E E E +RR    D R E EE+++ EE
Sbjct: 11  EEEEEEEEEDDLEDLSDEDEFIDEAEAEDDRRHRRLDRRREKEEEEDAEE 60



 Score = 25.9 bits (57), Expect = 6.1
 Identities = 19/60 (31%), Positives = 36/60 (60%)

Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVE 170
            E E ++EEEEEEEE++  E L +++   +    E+D R    +++ E+E+++  E+  E
Sbjct: 4   TEAEVDDEEEEEEEEEDDLEDLSDEDEFIDEAEAEDDRRHRRLDRRREKEEEEDAEELAE 63


>gnl|CDD|224712 COG1799, COG1799, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 167

 Score = 30.1 bits (68), Expect = 0.43
 Identities = 12/47 (25%), Positives = 17/47 (36%), Gaps = 3/47 (6%)

Query: 112 EEEEEEEEEEEEEEQEQDE---YLEEDENEKERRLPEEDERGEDEEK 155
             ++EE+  EE+  QE  +     EE    K      E  R E    
Sbjct: 15  PLDDEEDYYEEDPRQEPRDPAVEPEEYAPPKYNFRKREPARMEMRGN 61


>gnl|CDD|221821 pfam12871, PRP38_assoc, Pre-mRNA-splicing factor 38-associated
           hydrophilic C-term.  This domain is a hydrophilic region
           found at the C-terminus of plant and metazoan
           pre-mRNA-splicing factor 38 proteins. The function is
           not known.
          Length = 97

 Score = 29.4 bits (66), Expect = 0.44
 Identities = 12/50 (24%), Positives = 21/50 (42%)

Query: 112 EEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEED 161
           +EEEE EEEE++EE  +    + D   +  R          +  ++    
Sbjct: 11  DEEEESEEEEDDEEIRRKAERDVDRGRRSPRRRTRRRSRRRKRSRKRRRR 60



 Score = 28.2 bits (63), Expect = 1.1
 Identities = 19/65 (29%), Positives = 31/65 (47%)

Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
            +A EE+ +EEEE EEEE+ E+     E + ++ RR P    R     +K   + ++   
Sbjct: 3   VSALEEDLDEEEESEEEEDDEEIRRKAERDVDRGRRSPRRRTRRRSRRRKRSRKRRRRRR 62

Query: 167 KRVED 171
            R   
Sbjct: 63  DRDRA 67


>gnl|CDD|234352 TIGR03779, Bac_Flav_CT_M, Bacteroides conjugative transposon TraM
           protein.  Members of this protein family are designated
           TraM and are found in a proposed transfer region of a
           class of conjugative transposon found in the Bacteroides
           lineage [Cellular processes, DNA transformation].
          Length = 410

 Score = 30.9 bits (70), Expect = 0.44
 Identities = 8/34 (23%), Positives = 14/34 (41%)

Query: 133 EEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
            ++E E  R + E + R   E     E ++Q   
Sbjct: 142 TDEEKELLREVEELESRLATEPSPAPELEEQLAL 175



 Score = 27.8 bits (62), Expect = 3.8
 Identities = 14/50 (28%), Positives = 21/50 (42%)

Query: 114 EEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ 163
            E  + +EE+E   + E LE     +    PE +E+    EK  E   K 
Sbjct: 137 YEYPKTDEEKELLREVEELESRLATEPSPAPELEEQLALMEKSYELAAKY 186


>gnl|CDD|223649 COG0576, GrpE, Molecular chaperone GrpE (heat shock protein)
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 193

 Score = 30.4 bits (69), Expect = 0.44
 Identities = 24/83 (28%), Positives = 42/83 (50%), Gaps = 13/83 (15%)

Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
           +   E + EE EE E+ EE+E +E   E+ENE E       E+ E  E + + E+ + + 
Sbjct: 6   QKTEEPDAEETEEAEKSEEEEAEEEEPEEENELEE------EQQEIAELEAQLEELKDKY 59

Query: 167 KRVEDKIGDEKRLEENKKKKTQK 189
            R + +        EN +K+T++
Sbjct: 60  LRAQAEF-------ENLRKRTER 75


>gnl|CDD|212055 cd11486, SLC5sbd_SGLT1, Na(+)/glucose cotransporter SGLT1;solute
           binding domain.  Human SGLT1 (hSGLT1) is a
           high-affinity/low-capacity glucose transporter, which
           can also transport galactose. In the transport
           mechanism, two Na+ ions first bind to the extracellular
           side of the transporter and induce a conformational
           change in the glucose binding site. This results in an
           increased affinity for glucose. A second conformational
           change in the transporter follows, bringing the Na+ and
           glucose binding sites to the inner surface of the
           membrane. Glucose is then released, followed by the Na+
           ions. In the process, hSGLT1 is also able to transport
           water and urea and may be a major pathway for transport
           of these across the intestinal brush-border membrane.
           hSGLT1 is encoded by the SLC5A1 gene and expressed
           mostly in the intestine, but also in the trachea,
           kidney, heart, brain, testis, and prostate. The
           WHO/UNICEF oral rehydration solution (ORS) for the
           treatment of secretory diarrhea contains salt and
           glucose. The glucose, along with sodium ions, is
           transported by hSGLT1 and water is either co-transported
           along with these or follows by osmosis. Mutations in
           SGLT1 are associated with intestinal glucose galactose
           malabsorption (GGM). Up-regulation of intestinal SGLT1
           may protect against enteric infections. SGLT1 is
           expressed in colorectal, head and neck, and prostate
           tumors. Epidermal growth factor receptor (EGFR)
           functions in cell survival by stabilizing SGLT1, and
           thereby maintaining intracellular glucose levels. SGLT1
           is predicted to have 14 membrane-spanning regions. This
           subgroup belongs to the solute carrier 5
           (SLC5)transporter family.
          Length = 635

 Score = 30.6 bits (69), Expect = 0.45
 Identities = 19/80 (23%), Positives = 39/80 (48%), Gaps = 10/80 (12%)

Query: 110 REEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGED-----EEKKEEEEDKQG 164
           R   EE  + + ++  +++DE   E E ++ER+ P    +  +     ++ K  +  ++ 
Sbjct: 538 RNSTEERIDLDADDWTEDEDE--NEMETDEERKKPGCCRKAYNWFCGFDQGKAPKLTEE- 594

Query: 165 EEKRVEDKIGD--EKRLEEN 182
           EE  ++ K+ D  EK L  N
Sbjct: 595 EEAALKMKMTDTSEKPLWRN 614



 Score = 29.5 bits (66), Expect = 1.1
 Identities = 13/61 (21%), Positives = 26/61 (42%), Gaps = 15/61 (24%)

Query: 104 ARDETAREEEEEEEEEEEEE---------------EEQEQDEYLEEDENEKERRLPEEDE 148
           A D T  E+E E E +EE +               ++ +  +  EE+E   + ++ +  E
Sbjct: 549 ADDWTEDEDENEMETDEERKKPGCCRKAYNWFCGFDQGKAPKLTEEEEAALKMKMTDTSE 608

Query: 149 R 149
           +
Sbjct: 609 K 609


>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
           envelope biogenesis, outer membrane].
          Length = 387

 Score = 30.7 bits (69), Expect = 0.46
 Identities = 28/108 (25%), Positives = 54/108 (50%), Gaps = 4/108 (3%)

Query: 83  SGAVLRLKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEED--ENEKE 140
            GAV++    I  +    K G +    +EE+  EE + ++  EQE+ + LE++  + +++
Sbjct: 57  PGAVVQQYGRIQSQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQ 116

Query: 141 RRLPEEDERGEDEEKK--EEEEDKQGEEKRVEDKIGDEKRLEENKKKK 186
           ++  EE E+    E+K  EE+  K   E++ + +    K   E  K K
Sbjct: 117 QKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKLK 164



 Score = 27.6 bits (61), Expect = 4.7
 Identities = 20/82 (24%), Positives = 37/82 (45%), Gaps = 5/82 (6%)

Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERG-----EDEEKKEEEEDKQ 163
           A E+E  ++ E+E  + QEQ +  EE E + +    +++E+      E ++K E  + K 
Sbjct: 97  AAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKA 156

Query: 164 GEEKRVEDKIGDEKRLEENKKK 185
             E        + K+  E   K
Sbjct: 157 AAEAAKLKAAAEAKKKAEEAAK 178



 Score = 27.2 bits (60), Expect = 5.9
 Identities = 19/89 (21%), Positives = 36/89 (40%)

Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
           K  A  E  ++ EE  +  EE + + E     ++ E E +    +     E + K E++ 
Sbjct: 162 KLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKA 221

Query: 161 DKQGEEKRVEDKIGDEKRLEENKKKKTQK 189
           +   EEK   +K     + + +K     K
Sbjct: 222 EAAAEEKAAAEKKKAAAKAKADKAAAAAK 250


>gnl|CDD|226458 COG3949, COG3949, Uncharacterized membrane protein [Function
           unknown].
          Length = 349

 Score = 30.4 bits (69), Expect = 0.46
 Identities = 15/70 (21%), Positives = 30/70 (42%), Gaps = 12/70 (17%)

Query: 12  ASWNIWAVVTLIL---VFCTATGNILVCLAISWEKRLQNVTN--YFLMSLAITDLMVAVL 66
            S  I  V+++I+   ++ T  G I          RL +     Y++  +A   L++A  
Sbjct: 254 FSPLIGLVMSVIIWLEIYTTTVGLIY-----GLASRLTSFFPRRYWI--IAAIILVIAYP 306

Query: 67  VMPLGILTLV 76
           +   G + L+
Sbjct: 307 LSFFGFINLI 316


>gnl|CDD|217834 pfam03998, Utp11, Utp11 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 239

 Score = 30.4 bits (69), Expect = 0.49
 Identities = 16/71 (22%), Positives = 37/71 (52%)

Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVE 170
           ++EEE++  +  E      E L+  EN       E+    ++++KK+  + K+   K ++
Sbjct: 129 DDEEEQKSFDPAEYFDTTPELLDRRENRPRISQLEKTSLVDEKQKKKSAKKKRKLYKELK 188

Query: 171 DKIGDEKRLEE 181
           ++   EK+L++
Sbjct: 189 ERKEREKKLKK 199


>gnl|CDD|233870 TIGR02442, Cob-chelat-sub, cobaltochelatase subunit.
           Cobaltochelatase is responsible for the insertion of
           cobalt into the corrin ring of coenzyme B12 during its
           biosynthesis. Two versions have been well described.
           CbiK/CbiX is a monomeric, anaerobic version which acts
           early in the biosynthesis (pfam06180). CobNST is a
           trimeric, ATP-dependent, aerobic version which acts late
           in the biosynthesis (TIGR02257/TIGR01650/TIGR01651). A
           number of genomes (actinobacteria, cyanobacteria,
           betaproteobacteria and pseudomonads) which apparently
           biosynthesize B12, encode a cobN gene but are
           demonstrably lacking cobS and cobT. These genomes do,
           however contain a homolog (modelled here) of the
           magnesium chelatase subunits BchI/BchD family. Aside
           from the cyanobacteria (which have a separate magnesium
           chelatase trimer), these species do not make chlorins,
           so do not have any use for a magnesium chelatase.
           Furthermore, in nearly all cases the members of this
           family are proximal to either CobN itself or other genes
           involved in cobalt transport or B12 biosynthesis.
          Length = 633

 Score = 30.4 bits (69), Expect = 0.50
 Identities = 19/96 (19%), Positives = 37/96 (38%), Gaps = 9/96 (9%)

Query: 104 ARDETAREEEEEEEEEEEEE------EEQEQDEYLEEDENEKERRLPEEDERGEDEEKKE 157
            R +   + + ++E++ EE+      + ++ DE  ++ E    R  P      E E   +
Sbjct: 309 RRRKPFEQPQGKDEKDLEEKPEEPGPDPEKPDEGEDDAEQSGPRGHPTPGNDDEKEPDPQ 368

Query: 158 EEEDKQGEEKRVEDKIGDEKRLE---ENKKKKTQKR 190
           EE D QG        I   + L       +  + +R
Sbjct: 369 EEADGQGSSTDPAGDIFRIRVLAPPQARARGASGRR 404


>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
          Length = 1036

 Score = 30.6 bits (69), Expect = 0.51
 Identities = 17/50 (34%), Positives = 30/50 (60%), Gaps = 2/50 (4%)

Query: 123 EEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDK 172
           EE++ + E L ++E E+ER+   E++R  +EEK   E D+   +  VE +
Sbjct: 252 EEKRRELEKLAKEEAERERQ--AEEQRRREEEKAAMEADRAQAKAEVEKR 299



 Score = 29.8 bits (67), Expect = 1.0
 Identities = 18/60 (30%), Positives = 31/60 (51%), Gaps = 4/60 (6%)

Query: 128 QDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKT 187
           +D  LEE   E E+   EE ER    E++ EE+ ++ EEK   +    + + E  K+++ 
Sbjct: 247 EDFLLEEKRRELEKLAKEEAER----ERQAEEQRRREEEKAAMEADRAQAKAEVEKRREK 302



 Score = 28.7 bits (64), Expect = 2.4
 Identities = 16/62 (25%), Positives = 31/62 (50%), Gaps = 2/62 (3%)

Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDE--EKKEEEEDKQGEEKR 168
              EE E E+E E EQ Q + L+     KE  +  + +  +D+  ++K + E +   ++ 
Sbjct: 69  SAVEEVEAEDEIEVEQNQSDVLKSSSIVKEESISTDMDGIDDDSLDRKLKLERENLRKRE 128

Query: 169 VE 170
           +E
Sbjct: 129 IE 130



 Score = 27.1 bits (60), Expect = 6.2
 Identities = 15/52 (28%), Positives = 27/52 (51%), Gaps = 2/52 (3%)

Query: 112 EEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ 163
           EE+  E E+  +EE E++   EE    +E +   E +R   + K E E+ ++
Sbjct: 252 EEKRRELEKLAKEEAERERQAEEQRRREEEKAAMEADR--AQAKAEVEKRRE 301


>gnl|CDD|203554 pfam06965, Na_H_antiport_1, Na+/H+ antiporter 1.  This family
           contains a number of bacterial Na+/H+ antiporter 1
           proteins. These are integral membrane proteins that
           catalyze the exchange of H+ for Na+ in a manner that is
           highly dependent on the pH.
          Length = 379

 Score = 30.3 bits (69), Expect = 0.51
 Identities = 16/55 (29%), Positives = 25/55 (45%), Gaps = 6/55 (10%)

Query: 17  WAVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVMPLG 71
           WA+        T     L  LA+    R+      FL++LAI D + A+L++ L 
Sbjct: 122 WAIPM-----ATDIAFALGVLAL-LGSRVPASLKIFLLALAIVDDLGAILIIALF 170


>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62. 
          Length = 217

 Score = 30.1 bits (68), Expect = 0.52
 Identities = 9/57 (15%), Positives = 21/57 (36%)

Query: 134 EDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
           E E  K  +     E       +++  +K     + +     +K   + KK++ +K 
Sbjct: 17  ESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKP 73



 Score = 29.8 bits (67), Expect = 0.73
 Identities = 16/62 (25%), Positives = 30/62 (48%), Gaps = 4/62 (6%)

Query: 131 YLEEDENEKERRLPEEDERGEDEEK---KEEEEDKQGEEKRVEDKIGDEKRLEENKKKKT 187
             E+ +  K++  PE   +   ++K   K +   K    +RV  K+  +++ EE KK K 
Sbjct: 17  ESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERV-KKLHSQEKKEEKKKPKK 75

Query: 188 QK 189
           +K
Sbjct: 76  KK 77


>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
           partitioning / Cytoskeleton].
          Length = 373

 Score = 30.4 bits (69), Expect = 0.52
 Identities = 28/88 (31%), Positives = 41/88 (46%), Gaps = 8/88 (9%)

Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDER--GEDEEKKEE 158
           K      +     +E  E    EEE+E  +   E   EKE+RL E ++    E +E   +
Sbjct: 291 KLSGLKNSGEPSLKEIHEARLNEEERELKKKFTEKIREKEKRLEELEQNLIEERKELNSK 350

Query: 159 EEDKQGEEKRVEDKIGDEKRLEENKKKK 186
            E+ Q   K++ED    EKRLE+ K  K
Sbjct: 351 LEEIQ---KKLEDL---EKRLEKLKSNK 372


>gnl|CDD|224143 COG1222, RPT1, ATP-dependent 26S proteasome regulatory subunit
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 406

 Score = 30.3 bits (69), Expect = 0.53
 Identities = 18/84 (21%), Positives = 30/84 (35%), Gaps = 11/84 (13%)

Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQG 164
            D       + E  E +E   + +D  L+  E EK   L EE     +  + + E D+  
Sbjct: 3   LDALDEILGDLESYEPQEYLNKLEDTKLKLLEKEKRLLLLEEQRLEAEGLRLKREVDRLR 62

Query: 165 EEKR-----------VEDKIGDEK 177
           EE             V + + D +
Sbjct: 63  EEIERLKEPPLIVGTVLEVLDDGR 86


>gnl|CDD|234333 TIGR03727, urea_t_UrtC_arc, urea ABC transporter, permease protein
           UrtC, archaeal type.  Members of this protein family are
           ABC transporter permease subunits restricted to the
           Archaea. Several lines of evidence suggest this protein
           is functionally analogous, as well as homologous, to the
           UrtC subunit of the Corynebacterium glutamicum urea
           transporter. All members of the operon show sequence
           similarity to urea transport subunits, the gene is
           located near the urease structural subunits in two of
           three species, and partial phylogenetic profiling
           identifies this permease subunit as closely matching the
           profile of urea utilization.
          Length = 369

 Score = 30.2 bits (68), Expect = 0.54
 Identities = 16/50 (32%), Positives = 23/50 (46%), Gaps = 1/50 (2%)

Query: 31  GNILVCLAISW-EKRLQNVTNYFLMSLAITDLMVAVLVMPLGILTLVRVR 79
           G I+  +AI W  K        F +      LMV +LV+P GI+  +R  
Sbjct: 295 GAIVATVAIEWLRKWFSINAGEFAIVFVGLLLMVTILVIPRGIVPGLREG 344


>gnl|CDD|233044 TIGR00600, rad2, DNA excision repair protein (rad2).  All proteins
           in this family for which functions are known are flap
           endonucleases that generate the 3' incision next to DNA
           damage as part of nucleotide excision repair. This
           family is related to many other flap endonuclease
           families including the fen1 family. This family is based
           on the phylogenomic analysis of JA Eisen (1999, Ph.D.
           Thesis, Stanford University) [DNA metabolism, DNA
           replication, recombination, and repair].
          Length = 1034

 Score = 30.6 bits (69), Expect = 0.55
 Identities = 18/79 (22%), Positives = 38/79 (48%), Gaps = 6/79 (7%)

Query: 101 KAGARDETAREEEEEEEEEEEEEEEQ----EQDEYLEEDENEKERRLPEEDERGEDEEKK 156
           K  + +   R+E E+   +E EE+      E+++  ++ +NE +    EE E  E     
Sbjct: 691 KVASIEGEHRKEIEDLLFDESEEDNIVGMIEEEKDADDFKNEWQDISLEELEALEANLLA 750

Query: 157 EEEEDK--QGEEKRVEDKI 173
           E+   K  + ++KR+  ++
Sbjct: 751 EQNSLKAQKQQQKRIAAEV 769



 Score = 29.1 bits (65), Expect = 1.8
 Identities = 20/80 (25%), Positives = 38/80 (47%)

Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVE 170
           EE  E +    E E +++ E L  DE+E++  +   +E  + ++ K E +D   EE    
Sbjct: 685 EENAENKVASIEGEHRKEIEDLLFDESEEDNIVGMIEEEKDADDFKNEWQDISLEELEAL 744

Query: 171 DKIGDEKRLEENKKKKTQKR 190
           +     ++     +K+ QKR
Sbjct: 745 EANLLAEQNSLKAQKQQQKR 764


>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P.  This model
           represents the L12P protein of the large (50S) subunit
           of the archaeal ribosome.
          Length = 105

 Score = 28.9 bits (65), Expect = 0.57
 Identities = 19/32 (59%), Positives = 23/32 (71%)

Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQDEYLE 133
           A A    A EEEEEEEEEEEEEEE+ ++E + 
Sbjct: 67  AAAAAAAAAEEEEEEEEEEEEEEEESEEEAMA 98



 Score = 28.9 bits (65), Expect = 0.62
 Identities = 18/29 (62%), Positives = 20/29 (68%)

Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQDE 130
           A A    A   EEEEEEEEEEEEE+E+ E
Sbjct: 65  AAAAAAAAAAAEEEEEEEEEEEEEEEESE 93



 Score = 28.5 bits (64), Expect = 0.91
 Identities = 19/29 (65%), Positives = 21/29 (72%)

Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQDE 130
           A A    A  EEEEEEEEEEEEEE+E +E
Sbjct: 66  AAAAAAAAAAEEEEEEEEEEEEEEEESEE 94



 Score = 26.9 bits (60), Expect = 3.2
 Identities = 16/27 (59%), Positives = 19/27 (70%)

Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEED 135
           A     EEEEEEEEEEE+E++E  EE 
Sbjct: 70  AAAAAAEEEEEEEEEEEEEEEESEEEA 96



 Score = 26.2 bits (58), Expect = 5.8
 Identities = 15/27 (55%), Positives = 19/27 (70%)

Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEED 135
           A      EEEEEEEEEE+E++E  EE+
Sbjct: 69  AAAAAAAEEEEEEEEEEEEEEEESEEE 95


>gnl|CDD|114629 pfam05917, DUF874, Helicobacter pylori protein of unknown function
           (DUF874).  This family consists of several hypothetical
           proteins specific to Helicobacter pylori. The function
           of this family is unknown.
          Length = 417

 Score = 30.3 bits (67), Expect = 0.59
 Identities = 25/102 (24%), Positives = 52/102 (50%), Gaps = 2/102 (1%)

Query: 90  KKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDER 149
           KK  L + +     ARD   +   E E+EE++ E+E+++ E    +    + +  +E ++
Sbjct: 126 KKIELAQAKKEAENARDRANKSGIELEQEEQKTEQEKQKTEKEGIELANSQIKAEQEKQK 185

Query: 150 GEDEEKKEEEEDKQGE--EKRVEDKIGDEKRLEENKKKKTQK 189
            E E++K E+E ++      +   ++  EK+  EN+K+   K
Sbjct: 186 TEQEKQKTEQEKQKTSNIANKNAIELEQEKQKTENEKQDLIK 227


>gnl|CDD|220102 pfam09073, BUD22, BUD22.  BUD22 has been shown in yeast to be a
           nuclear protein involved in bud-site selection. It plays
           a role in positioning the proximal bud pole signal. More
           recently it has been shown to be involved in ribosome
           biogenesis.
          Length = 424

 Score = 30.2 bits (68), Expect = 0.59
 Identities = 23/102 (22%), Positives = 46/102 (45%), Gaps = 8/102 (7%)

Query: 96  ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEED--------ENEKERRLPEED 147
           ++   ++ + DE+  EE  E++ ++EEEE+ + ++Y + D        E E E       
Sbjct: 169 KDDEEESESEDESKSEESAEDDSDDEEEEDSDSEDYSQYDGMLVDSSDEEEGEEAPSINY 228

Query: 148 ERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQK 189
                E + +E + +  E + V D        ++ K+KKT  
Sbjct: 229 NEDTSESESDESDSEISESRSVSDSEESSPPSKKPKEKKTSS 270



 Score = 29.4 bits (66), Expect = 1.2
 Identities = 17/76 (22%), Positives = 38/76 (50%)

Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
             A+++ ++ +++E +E   + DE   E E+E +     ED+  ++EE+  + ED    +
Sbjct: 149 GKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSEDYSQYD 208

Query: 167 KRVEDKIGDEKRLEEN 182
             + D   +E+  E  
Sbjct: 209 GMLVDSSDEEEGEEAP 224


>gnl|CDD|233692 TIGR02031, BchD-ChlD, magnesium chelatase ATPase subunit D.  This
           model represents one of two ATPase subunits of the
           trimeric magnesium chelatase responsible for insertion
           of magnesium ion into protoporphyrin IX. This is an
           essential step in the biosynthesis of both chlorophyll
           and bacteriochlorophyll. This subunit is found in green
           plants, photosynthetic algae, cyanobacteria and other
           photosynthetic bacteria. Unlike subunit I (TIGR02030),
           this subunit is not found in archaea [Biosynthesis of
           cofactors, prosthetic groups, and carriers, Chlorophyll
           and bacteriochlorphyll].
          Length = 589

 Score = 30.5 bits (69), Expect = 0.59
 Identities = 12/42 (28%), Positives = 19/42 (45%), Gaps = 1/42 (2%)

Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEE 146
            +          E  E EEE  E D+  + D+ E+  ++PEE
Sbjct: 269 PEPQPPPPPPPPEPPEPEEEPDEPDQ-TDPDDGEETDQIPEE 309


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
           consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 30.4 bits (68), Expect = 0.59
 Identities = 16/84 (19%), Positives = 41/84 (48%), Gaps = 1/84 (1%)

Query: 104 ARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ 163
           A   + +E+++  E ++ E E+ + +    ++E  K +     D + E +  ++E EDK+
Sbjct: 264 ADTSSPKEDKQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKE 323

Query: 164 GEEKR-VEDKIGDEKRLEENKKKK 186
            E ++  E    D ++ +   + +
Sbjct: 324 LEAQKKREPVAEDLQKTKPQVEAQ 347



 Score = 29.6 bits (66), Expect = 1.0
 Identities = 18/84 (21%), Positives = 42/84 (50%)

Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
           + A     +E+++  E +++E ++   E +   E  L  +D +  D +++ +  +K+ E+
Sbjct: 262 KPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAED 321

Query: 167 KRVEDKIGDEKRLEENKKKKTQKR 190
           K +E +   E   E+ +K K Q  
Sbjct: 322 KELEAQKKREPVAEDLQKTKPQVE 345



 Score = 29.2 bits (65), Expect = 1.2
 Identities = 19/91 (20%), Positives = 49/91 (53%), Gaps = 7/91 (7%)

Query: 104 ARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ 163
            RD T  +E E +E+ +  ++ +E+ +  + D ++ +++  +  +   D+++ E  + KQ
Sbjct: 197 RRDMTDLKERESQEDAKRAQQLKEELDKKQIDADKAQQKA-DFAQDNADKQRDEVRQ-KQ 254

Query: 164 GEEKRVEDKIG-----DEKRLEENKKKKTQK 189
            E K +          ++K++ EN+K++ +K
Sbjct: 255 QEAKNLPKPADTSSPKEDKQVAENQKREIEK 285



 Score = 29.2 bits (65), Expect = 1.3
 Identities = 17/102 (16%), Positives = 45/102 (44%), Gaps = 12/102 (11%)

Query: 100 FKAGARDETAREEEEEEEEEEEEEE-----EQEQDEYLEEDENEKERRLPEEDERGEDEE 154
           F     D+   E  ++++E +   +       ++D+ + E++  +  +   E ++ ++E 
Sbjct: 238 FAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEEA 297

Query: 155 KKEEE-------EDKQGEEKRVEDKIGDEKRLEENKKKKTQK 189
            K ++       ++ +  EK  EDK  + ++  E   +  QK
Sbjct: 298 LKAKDHKAFDLKQESKASEKEAEDKELEAQKKREPVAEDLQK 339



 Score = 28.0 bits (62), Expect = 3.2
 Identities = 18/90 (20%), Positives = 40/90 (44%), Gaps = 1/90 (1%)

Query: 98  QLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKE 157
           +   A    +  R+E  ++++E +   +       +ED+   E +   E E+ + E KK 
Sbjct: 235 KADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQ-KREIEKAQIEIKKN 293

Query: 158 EEEDKQGEEKRVEDKIGDEKRLEENKKKKT 187
           +EE  + ++ +  D   + K  E+  + K 
Sbjct: 294 DEEALKAKDHKAFDLKQESKASEKEAEDKE 323


>gnl|CDD|234300 TIGR03662, Chlor_Arch_YYY, Chlor_Arch_YYY domain.  Members of this
           highly hydrophobic probable integral membrane family
           belong to two classes. In one, a single copy of the
           region modeled by This model represents essentially the
           full length of a strongly hydrophobic protein of about
           700 to 900 residues (variable because of long inserts in
           some). The domain architecture of the other class
           consists of an additional N-terminal region, two copies
           of the region represented by this model, and three to
           four repeats of TPR, or tetratricopeptide repeat. The
           unusual species range includes several Archaea, several
           Chloroflexi, and Clostridium phytofermentans. An unusual
           motif YYYxG is present, and we suggest the name
           Chlor_Arch_YYY protein. The function is unknown.
          Length = 723

 Score = 30.4 bits (69), Expect = 0.61
 Identities = 21/99 (21%), Positives = 35/99 (35%), Gaps = 24/99 (24%)

Query: 5   VSFGLDAASWNIWAVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVA 64
               L   S  I  +  L+ +  T           +W  R ++    F   LA+  L + 
Sbjct: 441 FIALLLGLSLGIPLLALLLPLLVT-----------TWLLRRRDAPVGFETLLALGGLGLL 489

Query: 65  VLVMPLGILTLVRVRTNLSGAVLRLKKNILCENQLFKAG 103
           +LV       ++ VR N+ G   R+       N +FK  
Sbjct: 490 LLV------EIIYVRDNIGGDYYRM-------NTVFKFY 515


>gnl|CDD|220972 pfam11081, DUF2890, Protein of unknown function (DUF2890).  This
           family is conserved in dsDNA adenoviruses of
           vertebrates. The function is not known.
          Length = 172

 Score = 29.9 bits (67), Expect = 0.62
 Identities = 18/43 (41%), Positives = 26/43 (60%), Gaps = 3/43 (6%)

Query: 119 EEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEED 161
           ++EEE+   Q E +EEDE E E     ED   E++E+ EE E+
Sbjct: 18  KDEEEDWDSQAEEVEEDEEEME---DWEDSLDEEDEEAEEVEE 57



 Score = 29.1 bits (65), Expect = 1.1
 Identities = 15/29 (51%), Positives = 18/29 (62%)

Query: 112 EEEEEEEEEEEEEEQEQDEYLEEDENEKE 140
           EE EE+EEE E+ E   DE  EE E  +E
Sbjct: 29  EEVEEDEEEMEDWEDSLDEEDEEAEEVEE 57



 Score = 28.3 bits (63), Expect = 2.0
 Identities = 17/48 (35%), Positives = 26/48 (54%), Gaps = 2/48 (4%)

Query: 116 EEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ 163
           ++EEE+ + + E+ E  EE+  + E  L EEDE  E+ E  EE     
Sbjct: 18  KDEEEDWDSQAEEVEEDEEEMEDWEDSLDEEDEEAEEVE--EETAASS 63



 Score = 27.6 bits (61), Expect = 2.9
 Identities = 12/35 (34%), Positives = 21/35 (60%)

Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKE 140
           D  A E EE+EEE E+ E+  ++++   E+  E+ 
Sbjct: 25  DSQAEEVEEDEEEMEDWEDSLDEEDEEAEEVEEET 59



 Score = 26.4 bits (58), Expect = 7.8
 Identities = 11/39 (28%), Positives = 18/39 (46%)

Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEED 147
             EE+EEE E+ E+  ++E +E  E +E           
Sbjct: 30  EVEEDEEEMEDWEDSLDEEDEEAEEVEEETAASSKAPSS 68


>gnl|CDD|222792 PHA00435, PHA00435, capsid assembly protein.
          Length = 306

 Score = 30.2 bits (68), Expect = 0.63
 Identities = 20/63 (31%), Positives = 33/63 (52%), Gaps = 1/63 (1%)

Query: 114 EEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKI 173
           E  E E  ++     D + E+DE   E R   ED   E+ E+ EE+E+++GEE+  E + 
Sbjct: 46  ELAEPETSDDPYGNPDPFGEDDEGRIEVR-ISEDGEEEEVEEGEEDEEEEGEEESEEFEP 104

Query: 174 GDE 176
             +
Sbjct: 105 LGD 107



 Score = 29.8 bits (67), Expect = 0.84
 Identities = 23/68 (33%), Positives = 33/68 (48%)

Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDK 162
           G  DE   E    E+ EEEE EE E+DE  E +E  +E     +      E  ++ EE +
Sbjct: 64  GEDDEGRIEVRISEDGEEEEVEEGEEDEEEEGEEESEEFEPLGDTPEELTEASEQLEEHE 123

Query: 163 QGEEKRVE 170
           +G +  VE
Sbjct: 124 EGFQAMVE 131



 Score = 29.4 bits (66), Expect = 1.0
 Identities = 16/56 (28%), Positives = 26/56 (46%)

Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
           E E  ++     +   E DE   E    ++    E +E  EDEE++ EEE ++ E 
Sbjct: 49  EPETSDDPYGNPDPFGEDDEGRIEVRISEDGEEEEVEEGEEDEEEEGEEESEEFEP 104



 Score = 27.1 bits (60), Expect = 5.8
 Identities = 19/57 (33%), Positives = 29/57 (50%), Gaps = 1/57 (1%)

Query: 104 ARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
           A  ET+ +     +   E++E + +   + ED  E+E    EEDE  E EE+ EE E
Sbjct: 48  AEPETSDDPYGNPDPFGEDDEGRIEVR-ISEDGEEEEVEEGEEDEEEEGEEESEEFE 103


>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family.  This family of proteins includes
           MND1 from S. cerevisiae. The mnd1 protein forms a
           complex with hop2 to promote homologous chromosome
           pairing and meiotic double-strand break repair.
          Length = 188

 Score = 29.9 bits (68), Expect = 0.63
 Identities = 20/77 (25%), Positives = 39/77 (50%), Gaps = 1/77 (1%)

Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKE-EEEDKQGEEK 167
             +   E+ ++E EE +Q   E   + E  K+ R   E+     EE K+ E+E K+ + +
Sbjct: 66  KLKTRLEKLKKELEELKQRIAELQAQIEKLKKGREETEERTELLEELKQLEKELKKLKAE 125

Query: 168 RVEDKIGDEKRLEENKK 184
             + +  D +R+E+ K+
Sbjct: 126 LEKYEKNDPERIEKLKE 142


>gnl|CDD|225962 COG3428, COG3428, Predicted membrane protein [Function unknown].
          Length = 494

 Score = 30.1 bits (68), Expect = 0.63
 Identities = 36/171 (21%), Positives = 52/171 (30%), Gaps = 24/171 (14%)

Query: 20  VTLILVFCTATGNILVCLAISWE-KRLQNVTNYFLMSLAITDLMVAVLVMPLGILTLVRV 78
             LI+    A    +V L  S+          Y     A   L+V V+ + L I+  +  
Sbjct: 10  RALIVGIIHAILRAIVVLIGSFSFVLGVATNGYSFWGGAA--LVVLVIFLILQIIKWITF 67

Query: 79  RTNLSGAVLRLKKNILCE--------------------NQLFK-AGARDETAREEEEEEE 117
              +    +RL+  I                        +LF     R ETA    E E 
Sbjct: 68  TYRIEDEEVRLQTGIFSRKKRYIPIDRIQSVDTTAGLVARLFGLVVLRIETAGGGGEAEA 127

Query: 118 EEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKR 168
           E  E E   E    + E   +K   L   +    +EE  E     Q    R
Sbjct: 128 ELVEREIAFELAALVREARVKKLDALELAEADTPEEEVAEVLARSQSSVLR 178



 Score = 27.4 bits (61), Expect = 4.7
 Identities = 12/45 (26%), Positives = 15/45 (33%), Gaps = 2/45 (4%)

Query: 96  ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKE 140
                K     E A  E +  EEE  E   + Q   L    N+ E
Sbjct: 143 REARVKKLDALELA--EADTPEEEVAEVLARSQSSVLRYPMNKGE 185


>gnl|CDD|173534 PTZ00341, PTZ00341, Ring-infected erythrocyte surface antigen;
           Provisional.
          Length = 1136

 Score = 30.5 bits (68), Expect = 0.64
 Identities = 18/76 (23%), Positives = 38/76 (50%)

Query: 112 EEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVED 171
           E+E  E+ EEE      +E L  DE+ +E    +  E+  D+E  E +   +  E++  +
Sbjct: 432 EDESVEDNEEEHSGDANEEELSVDEHVEEHNADDSGEQQSDDESGEHQSVNEIVEEQSVN 491

Query: 172 KIGDEKRLEENKKKKT 187
           +  +E  + +  +++T
Sbjct: 492 EHVEEPTVADIVEQET 507



 Score = 29.0 bits (64), Expect = 1.7
 Identities = 20/85 (23%), Positives = 43/85 (50%), Gaps = 3/85 (3%)

Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEE---DENEKERRLPEEDERGEDEEKKEEEEDK 162
           ++ + E+ EEE   +  EEE   DE++EE   D++ +++   E  E     E  EE+   
Sbjct: 432 EDESVEDNEEEHSGDANEEELSVDEHVEEHNADDSGEQQSDDESGEHQSVNEIVEEQSVN 491

Query: 163 QGEEKRVEDKIGDEKRLEENKKKKT 187
           +  E+     I +++ ++E+ ++  
Sbjct: 492 EHVEEPTVADIVEQETVDEHVEEPA 516



 Score = 27.4 bits (60), Expect = 5.4
 Identities = 23/64 (35%), Positives = 29/64 (45%)

Query: 107  ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
            E   E  EE  EE  EE  +E  E +EE+  E      EE+     EE  EE +D+  EE
Sbjct: 1070 EEIEENIEENIEENVEENVEENVEEIEENVEENVEENAEENAEENAEENAEEYDDENPEE 1129

Query: 167  KRVE 170
               E
Sbjct: 1130 HNEE 1133



 Score = 27.4 bits (60), Expect = 6.2
 Identities = 25/79 (31%), Positives = 40/79 (50%)

Query: 111  EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVE 170
            EE  EE  EE +EE  E+ E   E+ +E+     EE+     EE  EE  ++  EE   E
Sbjct: 1012 EENIEENVEEYDEENVEEVEENVEEYDEENVEEIEENAEENVEENIEENIEEYDEENVEE 1071

Query: 171  DKIGDEKRLEENKKKKTQK 189
             +   E+ +EEN ++  ++
Sbjct: 1072 IEENIEENIEENVEENVEE 1090



 Score = 27.4 bits (60), Expect = 6.6
 Identities = 32/113 (28%), Positives = 52/113 (46%), Gaps = 3/113 (2%)

Query: 78   VRTNLSGAVLRLKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDEN 137
            V  N+   V    +  + E +       +E   E EE  EE  EE  E+  +EY EE+  
Sbjct: 1011 VEENIEENVEEYDEENVEEVEENVEEYDEENVEEIEENAEENVEENIEENIEEYDEENVE 1070

Query: 138  EKERRLPEEDERGEDEEKKEE-EEDKQGEEKRVEDKIGDEKRLEENKKKKTQK 189
            E E  + E  E   +E  +E  EE ++  E+ VE+    E+  EEN ++  ++
Sbjct: 1071 EIEENIEENIEENVEENVEENVEEIEENVEENVEENA--EENAEENAEENAEE 1121



 Score = 27.1 bits (59), Expect = 8.0
 Identities = 20/61 (32%), Positives = 29/61 (47%)

Query: 106  DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
            +E   E  EE  EE  EE  +E +E +EE+  E      EE+     EE  +E  ++  E
Sbjct: 1073 EENIEENIEENVEENVEENVEEIEENVEENVEENAEENAEENAEENAEEYDDENPEEHNE 1132

Query: 166  E 166
            E
Sbjct: 1133 E 1133



 Score = 26.7 bits (58), Expect = 9.0
 Identities = 20/60 (33%), Positives = 29/60 (48%)

Query: 106  DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
            +E   E  EE  EE  EE E+  +E +EE+  E      EE+    D+E  EE  ++  E
Sbjct: 1077 EENIEENVEENVEENVEEIEENVEENVEENAEENAEENAEENAEEYDDENPEEHNEEYDE 1136



 Score = 26.7 bits (58), Expect = 9.4
 Identities = 29/80 (36%), Positives = 39/80 (48%), Gaps = 3/80 (3%)

Query: 106  DETAREEEEEEEEEEEEEE-EQEQDEYLEEDENEKERRLPEEDERG--EDEEKKEEEEDK 162
            +E   E  EE  EE  EE  E+  +EY EE+  E E  + E DE    E EE  EE  ++
Sbjct: 996  EENVEENIEENVEENVEENIEENVEEYDEENVEEVEENVEEYDEENVEEIEENAEENVEE 1055

Query: 163  QGEEKRVEDKIGDEKRLEEN 182
              EE   E    + + +EEN
Sbjct: 1056 NIEENIEEYDEENVEEIEEN 1075


>gnl|CDD|100109 cd05831, Ribosomal_P1, Ribosomal protein P1. This subfamily
           represents the eukaryotic large ribosomal protein P1.
           Eukaryotic P1 and P2 are functionally equivalent to the
           bacterial protein L7/L12, but are not homologous to
           L7/L12. P1 is located in the L12 stalk, with proteins
           P2, P0, L11, and 28S rRNA. P1 and P2 are the only
           proteins in the ribosome to occur as multimers, always
           appearing as sets of heterodimers. Recent data indicate
           that eukaryotes have four copies (two heterodimers),
           while most archaeal species contain six copies of L12p
           (three homodimers) and bacteria may have four or six
           copies (two or three homodimers), depending on the
           species. Experiments using S. cerevisiae P1 and P2
           indicate that P1 proteins are positioned more internally
           with limited reactivity in the C-terminal domains, while
           P2 proteins seem to be more externally located and are
           more likely to interact with other cellular components.
           In lower eukaryotes, P1 and P2 are further subdivided
           into P1A, P1B, P2A, and P2B, which form P1A/P2B and
           P1B/P2A heterodimers. Some plant species have a third
           P-protein, called P3, which is not homologous to P1 and
           P2. In humans, P1 and P2 are strongly autoimmunogenic.
           They play a significant role in the etiology and
           pathogenesis of systemic lupus erythema (SLE). In
           addition, the ribosome-inactivating protein
           trichosanthin (TCS) interacts with human P0, P1, and P2,
           with its primary binding site located in the C-terminal
           region of P2. TCS inactivates the ribosome by
           depurinating a specific adenine in the sarcin-ricin loop
           of 28S rRNA.
          Length = 103

 Score = 28.8 bits (65), Expect = 0.67
 Identities = 13/28 (46%), Positives = 18/28 (64%)

Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQD 129
           A A    A  E ++EE++EEEEEE + D
Sbjct: 70  AAAAAAAAAAEAKKEEKKEEEEEESDDD 97



 Score = 26.9 bits (60), Expect = 3.0
 Identities = 11/24 (45%), Positives = 17/24 (70%)

Query: 101 KAGARDETAREEEEEEEEEEEEEE 124
            A A  E  +EE++EEEEEE +++
Sbjct: 74  AAAAAAEAKKEEKKEEEEEESDDD 97



 Score = 26.5 bits (59), Expect = 4.1
 Identities = 12/29 (41%), Positives = 18/29 (62%)

Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQDE 130
           A A    A   E ++EE++EEEEE+  D+
Sbjct: 69  AAAAAAAAAAAEAKKEEKKEEEEEESDDD 97


>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32.  This family
           consists of several mammalian specific proacrosin
           binding protein sp32 sequences. sp32 is a sperm specific
           protein which is known to bind with with 55- and 53-kDa
           proacrosins and the 49-kDa acrosin intermediate. The
           exact function of sp32 is unclear, it is thought however
           that the binding of sp32 to proacrosin may be involved
           in packaging the acrosin zymogen into the acrosomal
           matrix.
          Length = 243

 Score = 30.0 bits (67), Expect = 0.69
 Identities = 13/29 (44%), Positives = 20/29 (68%)

Query: 108 TAREEEEEEEEEEEEEEEQEQDEYLEEDE 136
           T  ++ +EE+EEEE EEE +Q+E    D+
Sbjct: 215 TEEKQPQEEQEEEEVEEEAKQEEGQGTDD 243



 Score = 28.9 bits (64), Expect = 1.4
 Identities = 12/28 (42%), Positives = 20/28 (71%)

Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEE 134
           E  + +EE+EEEE EEE +QE+ +  ++
Sbjct: 216 EEKQPQEEQEEEEVEEEAKQEEGQGTDD 243



 Score = 27.7 bits (61), Expect = 3.1
 Identities = 15/56 (26%), Positives = 32/56 (57%), Gaps = 2/56 (3%)

Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
             + +  + ++E+   +  EYL+E + E+++   E++E   +EE K+EE   QG +
Sbjct: 189 SVQVKAPKPKQEQLLSKLQEYLQEHKTEEKQPQEEQEEEEVEEEAKQEE--GQGTD 242



 Score = 27.3 bits (60), Expect = 5.0
 Identities = 11/36 (30%), Positives = 23/36 (63%)

Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKE 140
             E  +E + EE++ +EE+EE+E +E  +++E +  
Sbjct: 206 LQEYLQEHKTEEKQPQEEQEEEEVEEEAKQEEGQGT 241



 Score = 26.9 bits (59), Expect = 7.3
 Identities = 11/32 (34%), Positives = 20/32 (62%)

Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKE 140
             + EE++ +EE+EEEE E++   EE +   +
Sbjct: 212 EHKTEEKQPQEEQEEEEVEEEAKQEEGQGTDD 243


>gnl|CDD|220377 pfam09747, DUF2052, Coiled-coil domain containing protein
           (DUF2052).  This entry is of sequences of two conserved
           domains separated by a region of low complexity,
           spanning some 200 residues. The function is unknown.
          Length = 178

 Score = 29.7 bits (67), Expect = 0.73
 Identities = 25/63 (39%), Positives = 33/63 (52%)

Query: 96  ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK 155
             +L +    +E+A EEEE+ EEE EEEEE+ Q     E+E E      E     E EE 
Sbjct: 70  RLELLEQQEEEESAEEEEEDPEEENEEEEEEYQRGPFGEEEEEDGDSYDELPTPEEREEL 129

Query: 156 KEE 158
           +EE
Sbjct: 130 REE 132



 Score = 28.5 bits (64), Expect = 1.7
 Identities = 22/62 (35%), Positives = 33/62 (53%)

Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVE 170
           + EE   E  E++EE+E  E  EED  E+     EE +RG   E++EE+ D   E    E
Sbjct: 65  DREEARLELLEQQEEEESAEEEEEDPEEENEEEEEEYQRGPFGEEEEEDGDSYDELPTPE 124

Query: 171 DK 172
           ++
Sbjct: 125 ER 126



 Score = 27.4 bits (61), Expect = 3.9
 Identities = 24/71 (33%), Positives = 38/71 (53%)

Query: 96  ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK 155
           E+ L +  AR E   ++EEEE  EEEEE+ +E++E  EE+         EE++    +E 
Sbjct: 61  ESSLDREEARLELLEQQEEEESAEEEEEDPEEENEEEEEEYQRGPFGEEEEEDGDSYDEL 120

Query: 156 KEEEEDKQGEE 166
              EE ++  E
Sbjct: 121 PTPEEREELRE 131


>gnl|CDD|236973 PRK11767, PRK11767, SpoVR family protein; Provisional.
          Length = 498

 Score = 29.8 bits (68), Expect = 0.74
 Identities = 13/38 (34%), Positives = 24/38 (63%)

Query: 119 EEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKK 156
           +EE+  ++E++EYL+   N+  R LP+ +E    EE +
Sbjct: 188 QEEKARQKEREEYLQSQVNDLWRTLPKREEEKAVEEAR 225



 Score = 28.3 bits (64), Expect = 2.5
 Identities = 14/47 (29%), Positives = 23/47 (48%), Gaps = 6/47 (12%)

Query: 108 TAREEEEEEEEEEEEEEEQEQD------EYLEEDENEKERRLPEEDE 148
           + +EE+  ++E EE  + Q  D      +  EE   E+ RR P E +
Sbjct: 186 SLQEEKARQKEREEYLQSQVNDLWRTLPKREEEKAVEEARRFPSEPQ 232


>gnl|CDD|240412 PTZ00420, PTZ00420, coronin; Provisional.
          Length = 568

 Score = 29.9 bits (67), Expect = 0.74
 Identities = 14/91 (15%), Positives = 27/91 (29%), Gaps = 3/91 (3%)

Query: 90  KKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDEN--EKERRLPEED 147
                      + GA ++        E  EEE  +  E + +  E+E+  +   R   E 
Sbjct: 471 GGAQFSSANSLERGADEDYLIVNGTNEPYEEEVIKTNENENFPLENESPCDGISRGTSEL 530

Query: 148 E-RGEDEEKKEEEEDKQGEEKRVEDKIGDEK 177
               +D          + E     D +   +
Sbjct: 531 PVNDDDNNNLNRGNTLRSEGNNCFDNLRCAR 561



 Score = 27.6 bits (61), Expect = 5.2
 Identities = 15/69 (21%), Positives = 27/69 (39%), Gaps = 3/69 (4%)

Query: 114 EEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKI 173
           +E+        E  ++E ++ +ENE     P E+E   D   +   E    ++       
Sbjct: 486 DEDYLIVNGTNEPYEEEVIKTNENEN---FPLENESPCDGISRGTSELPVNDDDNNNLNR 542

Query: 174 GDEKRLEEN 182
           G+  R E N
Sbjct: 543 GNTLRSEGN 551


>gnl|CDD|240433 PTZ00482, PTZ00482, membrane-attack complex/perforin (MACPF)
           Superfamily; Provisional.
          Length = 844

 Score = 30.2 bits (68), Expect = 0.75
 Identities = 22/138 (15%), Positives = 46/138 (33%), Gaps = 29/138 (21%)

Query: 76  VRVRTNLSGAVLRLKKNILCENQLFKAGARDE---TAREEEEEEEEEEEEEEEQEQDEYL 132
           V  RT+          NIL  + L     +       + +  ++++++E +   E DE  
Sbjct: 47  VDERTSGVLRDEGKHANILYNSILCNQ--KKHASFLNQRKSLDDDDDDEFDFLYEDDEDD 104

Query: 133 --------EEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEK----------------R 168
                      +++    LP+ DE  + +   ++  D   ++                  
Sbjct: 105 AGNATSGESSTDDDSLLELPDRDEDADTQANNDQTNDFDQDDSSNSQTDQGLKQSVNLSS 164

Query: 169 VEDKIGDEKRLEENKKKK 186
            E  I ++K   EN  K 
Sbjct: 165 AEKLIEEKKGQTENTFKF 182


>gnl|CDD|219939 pfam08619, Nha1_C, Alkali metal cation/H+ antiporter Nha1 C
           terminus.  The C terminus of the plasma membrane Nha1
           antiporter plays an important role in the immediate cell
           response to hypo-osmotic shock which prevents an
           execessive loss of ions and water. This domain is found
           with pfam00999.
          Length = 430

 Score = 29.8 bits (67), Expect = 0.77
 Identities = 13/52 (25%), Positives = 21/52 (40%), Gaps = 5/52 (9%)

Query: 108 TAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLP-----EEDERGEDEE 154
            +R     + E+E+ E +    +  EE   E+ RRL            +DEE
Sbjct: 379 PSRPRSRRDSEDEDTERDDSDSDDEEETPAERRRRLAALGELPSARDDDDEE 430



 Score = 29.0 bits (65), Expect = 1.5
 Identities = 16/77 (20%), Positives = 28/77 (36%), Gaps = 1/77 (1%)

Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
           KA ARDE    +     + + EE+ ++   +        ERR   E        +     
Sbjct: 327 KAPARDEVDTNDSSPLTDSKSEEKLEDDLAHSLLGSEASERRKNREPSSSSPPSRPRSRR 386

Query: 161 D-KQGEEKRVEDKIGDE 176
           D +  + +R +    DE
Sbjct: 387 DSEDEDTERDDSDSDDE 403


>gnl|CDD|173502 PTZ00266, PTZ00266, NIMA-related protein kinase; Provisional.
          Length = 1021

 Score = 30.1 bits (67), Expect = 0.79
 Identities = 19/64 (29%), Positives = 29/64 (45%)

Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQG 164
           R E    E  E E  E  E E+ + E LE +  E++R   +  +R E E     E D+  
Sbjct: 462 RLEREERERLERERMERIERERLERERLERERLERDRLERDRLDRLERERVDRLERDRLE 521

Query: 165 EEKR 168
           + +R
Sbjct: 522 KARR 525



 Score = 27.8 bits (61), Expect = 4.1
 Identities = 22/71 (30%), Positives = 38/71 (53%), Gaps = 6/71 (8%)

Query: 114 EEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKI 173
           E  E EE E  E+E+ E +E +  E+ER    E ER E +  + +  D+   E+   D++
Sbjct: 461 ERLEREERERLERERMERIERERLERERL---ERERLERDRLERDRLDRL--ERERVDRL 515

Query: 174 GDEKRLEENKK 184
            +  RLE+ ++
Sbjct: 516 -ERDRLEKARR 525


>gnl|CDD|165021 PHA02638, PHA02638, CC chemokine receptor-like protein;
           Provisional.
          Length = 417

 Score = 30.0 bits (67), Expect = 0.79
 Identities = 11/47 (23%), Positives = 28/47 (59%), Gaps = 1/47 (2%)

Query: 19  VVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAV 65
           +  +I+      GN  + + I + K+++ +T+ ++ +LAI+DL+  +
Sbjct: 103 IFYIIIFILGLFGNAAI-IMILFCKKIKTITDIYIFNLAISDLIFVI 148


>gnl|CDD|239286 cd02988, Phd_like_VIAF, Phosducin (Phd)-like family, Viral
           inhibitor of apoptosis (IAP)-associated factor (VIAF)
           subfamily; VIAF is a Phd-like protein that functions in
           caspase activation during apoptosis. It was identified
           as an IAP binding protein through a screen of a human
           B-cell library using a prototype IAP. VIAF lacks a
           consensus IAP binding motif and while it does not
           function as an IAP antagonist, it still plays a
           regulatory role in the complete activation of caspases.
           VIAF itself is a substrate for IAP-mediated
           ubiquitination, suggesting that it may be a target of
           IAPs in the prevention of cell death. The similarity of
           VIAF to Phd points to a potential role distinct from
           apoptosis regulation. Phd functions as a cytosolic
           regulator of G protein by specifically binding to G
           protein betagamma (Gbg)-subunits. The C-terminal domain
           of Phd adopts a thioredoxin fold, but it does not
           contain a CXXC motif. Phd interacts with G protein beta
           mostly through the N-terminal helical domain.
          Length = 192

 Score = 29.5 bits (67), Expect = 0.81
 Identities = 26/80 (32%), Positives = 33/80 (41%), Gaps = 14/80 (17%)

Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVE 170
            +    +EEEEE  E    E  E    +K       DE   DEE  EEE+D+  EE R  
Sbjct: 15  PKPPSPKEEEEEALELAIQEAHENALEKK-----LLDEL--DEELDEEEDDRFLEEYR-- 65

Query: 171 DKIGDEKRLEENKKKKTQKR 190
                 KRL E K    + +
Sbjct: 66  -----RKRLAEMKALAEKSK 80



 Score = 27.2 bits (61), Expect = 4.9
 Identities = 20/60 (33%), Positives = 26/60 (43%), Gaps = 5/60 (8%)

Query: 111 EEEEEEEEEEEEEEEQEQDEYL-----EEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
           EEE  E   +E  E   + + L     E DE E +R L E   +   E K   E+ K GE
Sbjct: 24  EEEALELAIQEAHENALEKKLLDELDEELDEEEDDRFLEEYRRKRLAEMKALAEKSKFGE 83


>gnl|CDD|237622 PRK14140, PRK14140, heat shock protein GrpE; Provisional.
          Length = 191

 Score = 29.6 bits (67), Expect = 0.81
 Identities = 26/86 (30%), Positives = 38/86 (44%), Gaps = 13/86 (15%)

Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
              + E+ EEE EE E EE  +DE  EE   E+           E E   EE+      E
Sbjct: 1   MEEKNEQVEEEVEETEVEEAVEDEVEEETVEEES----------EAELLDEEQAKIAELE 50

Query: 167 KRVEDKIGDEKRLE---ENKKKKTQK 189
            ++++      RL+   EN K++ QK
Sbjct: 51  AKLDELEERYLRLQADFENYKRRIQK 76



 Score = 27.7 bits (62), Expect = 3.1
 Identities = 14/33 (42%), Positives = 22/33 (66%)

Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENE 138
           +ET  EE  E+E EEE  EE+ + E L+E++ +
Sbjct: 13  EETEVEEAVEDEVEEETVEEESEAELLDEEQAK 45


>gnl|CDD|117592 pfam09026, Cenp-B_dimeris, Centromere protein B dimerisation
           domain.  The centromere protein B (CENP-B) dimerisation
           domain is composed of two alpha-helices, which are
           folded into an antiparallel configuration. Dimerisation
           of CENP-B is mediated by this domain, in which monomers
           dimerise to form a symmetrical, antiparallel, four-helix
           bundle structure with a large hydrophobic patch in which
           23 residues of one monomer form van der Waals contacts
           with the other monomer. This CENP-B dimer configuration
           may be suitable for capturing two distant CENP-B boxes
           during centromeric heterochromatin formation.
          Length = 101

 Score = 28.6 bits (63), Expect = 0.82
 Identities = 11/31 (35%), Positives = 21/31 (67%)

Query: 100 FKAGARDETAREEEEEEEEEEEEEEEQEQDE 130
           F  G  D  +  +EEE++++E+EE++ E D+
Sbjct: 5   FLEGEEDSDSDSDEEEDDDDEDEEDDDEDDD 35



 Score = 28.2 bits (62), Expect = 1.3
 Identities = 12/42 (28%), Positives = 27/42 (64%)

Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDE 148
           E   + + + +EEE++++E E+D+  ++DE++ E  +P   E
Sbjct: 7   EGEEDSDSDSDEEEDDDDEDEEDDDEDDDEDDDEVPVPSFGE 48



 Score = 27.0 bits (59), Expect = 3.2
 Identities = 9/37 (24%), Positives = 24/37 (64%)

Query: 98  QLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEE 134
              +     ++  +EEE++++E+EE+++++ DE  +E
Sbjct: 4   HFLEGEEDSDSDSDEEEDDDDEDEEDDDEDDDEDDDE 40



 Score = 25.9 bits (56), Expect = 6.9
 Identities = 13/36 (36%), Positives = 22/36 (61%)

Query: 131 YLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
           +LE +E+       EED+  EDEE  +E++D+  +E
Sbjct: 5   FLEGEEDSDSDSDEEEDDDDEDEEDDDEDDDEDDDE 40


>gnl|CDD|237158 PRK12650, PRK12650, putative monovalent cation/H+ antiporter
           subunit A; Reviewed.
          Length = 962

 Score = 30.0 bits (68), Expect = 0.82
 Identities = 12/30 (40%), Positives = 19/30 (63%), Gaps = 1/30 (3%)

Query: 58  ITDLMVAVLVMPLGILTLVRVRTNLSGAVL 87
             DL++  ++  LG++ LVR RT L+  VL
Sbjct: 599 WIDLVLLAIIA-LGVIALVRTRTRLAAVVL 627


>gnl|CDD|235396 PRK05299, rpsB, 30S ribosomal protein S2; Provisional.
          Length = 258

 Score = 29.7 bits (68), Expect = 0.85
 Identities = 19/32 (59%), Positives = 21/32 (65%)

Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLEE 134
             R   A EEEEEE EEEEEEEE+E+ E  E 
Sbjct: 226 QGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEA 257



 Score = 28.6 bits (65), Expect = 1.9
 Identities = 18/34 (52%), Positives = 21/34 (61%)

Query: 97  NQLFKAGARDETAREEEEEEEEEEEEEEEQEQDE 130
            Q   A A +E   E EEEEEEEEEEE E+ + E
Sbjct: 225 RQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258



 Score = 27.8 bits (63), Expect = 2.9
 Identities = 20/34 (58%), Positives = 23/34 (67%)

Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEE 134
           + G   E A EEEEE EEEEEEEEE+E +E   E
Sbjct: 225 RQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258



 Score = 27.8 bits (63), Expect = 3.0
 Identities = 16/33 (48%), Positives = 19/33 (57%)

Query: 104 ARDETAREEEEEEEEEEEEEEEQEQDEYLEEDE 136
                   EEEEEE EEEEEEE+E++    E E
Sbjct: 226 QGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258



 Score = 27.8 bits (63), Expect = 3.3
 Identities = 17/32 (53%), Positives = 21/32 (65%)

Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKE 140
            R  E  EEEEEE EEE+E++E  E +E E E
Sbjct: 227 GRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258



 Score = 27.1 bits (61), Expect = 5.9
 Identities = 17/35 (48%), Positives = 22/35 (62%)

Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLEEDEN 137
           G +   A   EEEEEE EEEEEE+E++E  E +  
Sbjct: 224 GRQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258



 Score = 27.1 bits (61), Expect = 5.9
 Identities = 19/35 (54%), Positives = 22/35 (62%)

Query: 104 ARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENE 138
            R     E  EEEEEE EEEEE+E++E  EE E E
Sbjct: 224 GRQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258


>gnl|CDD|220172 pfam09309, FCP1_C, FCP1, C-terminal.  The C-terminal domain of
           FCP-1 is required for interaction with the carboxy
           terminal domain of RAP74. Interaction relies extensively
           on van der Waals contacts between hydrophobic residues
           situated within alpha-helices in both domains.
          Length = 263

 Score = 29.6 bits (66), Expect = 0.86
 Identities = 20/92 (21%), Positives = 36/92 (39%), Gaps = 13/92 (14%)

Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGE----------- 151
           G+ D  + +++    EEE+E+  Q + +       E +   P   ER             
Sbjct: 159 GSDDSDSEKKKPPGNEEEQEQAPQPRKQQPPGPRREPDEGTPSSSERSNIGGRGPRGHKR 218

Query: 152 --DEEKKEEEEDKQGEEKRVEDKIGDEKRLEE 181
             +EE +E+ E +   E   ED+ G     +E
Sbjct: 219 KLNEEDEEDAESESSFESSNEDEEGSSSEADE 250


>gnl|CDD|221179 pfam11711, Tim54, Inner membrane protein import complex subunit
           Tim54.  Mitochondrial function depends on the import of
           hundreds of different proteins synthesised in the
           cytosol. Protein import is a multi-step pathway which
           includes the binding of precursor proteins to surface
           receptors, translocation of the precursor across one or
           both mitochondrial membranes, and folding and assembly
           of the imported protein inside the mitochondrion. Most
           precursor proteins carry amino-terminal targeting
           signals, called pre-sequences, and are imported into
           mitochondria via import complexes located in both the
           outer and the inner membrane (IM). The IM complex, TIM,
           is made up of at least two proteins which mediate
           translocation of proteins into the matrix by removing
           their signal peptide and another pair of proteins, Tim54
           and Tim22, that insert the polytopic proteins, that
           carry internal targetting information, into the inner
           membrane.
          Length = 377

 Score = 29.7 bits (67), Expect = 0.86
 Identities = 14/57 (24%), Positives = 18/57 (31%)

Query: 108 TAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQG 164
              E   +E   E E E     E   E   E     PEE E   +EE  +  +    
Sbjct: 199 EPPEPTVDEAAPETEVEATPAAESPAEPAEETAETTPEETEDAPEEENNKPVKPPVP 255


>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
           unknown].
          Length = 652

 Score = 29.7 bits (67), Expect = 0.91
 Identities = 20/85 (23%), Positives = 41/85 (48%), Gaps = 1/85 (1%)

Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
              +EEE   E+E  EEEE+ +    E+   + E  +   +E   + +++ EE  ++ E+
Sbjct: 395 SKVKEEERPREKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELEELKREIEK 454

Query: 167 KRVE-DKIGDEKRLEENKKKKTQKR 190
              E ++   E R +  K ++ + R
Sbjct: 455 LESELERFRREVRDKVRKDREIRAR 479



 Score = 29.3 bits (66), Expect = 1.4
 Identities = 28/97 (28%), Positives = 45/97 (46%), Gaps = 2/97 (2%)

Query: 86  VLRLKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPE 145
           V R+K  ++    L +A ++ +      E+E  EEEE  E    E   +   E   RL E
Sbjct: 377 VERIKALVIRGYPLAEALSKVKEEERPREKEGTEEEERREITVYEKRIKKLEETVERLEE 436

Query: 146 EDE--RGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLE 180
           E+   + E EE K E E  + E +R   ++ D+ R +
Sbjct: 437 ENSELKRELEELKREIEKLESELERFRREVRDKVRKD 473



 Score = 27.4 bits (61), Expect = 6.4
 Identities = 26/75 (34%), Positives = 38/75 (50%), Gaps = 8/75 (10%)

Query: 110 REEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRV 169
           RE EE + E E+ E E E+      D+  K+R +   D R E  EK+ EE+ K+ EE   
Sbjct: 443 RELEELKREIEKLESELERFRREVRDKVRKDREIRARDRRIERLEKELEEKKKRVEEL-- 500

Query: 170 EDKIGDEKRLEENKK 184
                 E++L E +K
Sbjct: 501 ------ERKLAELRK 509


>gnl|CDD|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region.  The domain
           is found in the primary vegetative sigma factor. The
           function of this domain is unclear and can be removed
           without loss of function.
          Length = 211

 Score = 29.5 bits (67), Expect = 0.91
 Identities = 10/44 (22%), Positives = 19/44 (43%)

Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEE 154
                     E E ++E  E  ++D+ +++    EE + G D E
Sbjct: 35  AAAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEADLGPDPE 78



 Score = 28.7 bits (65), Expect = 1.4
 Identities = 17/74 (22%), Positives = 40/74 (54%), Gaps = 5/74 (6%)

Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENE-KER--RLPEEDERGEDEEKKEEEEDK 162
           +    + E++++++E+E+E+ E++  L  D  E +ER   L E+ E+     +K     K
Sbjct: 47  ELDEEDLEDDDDDDEDEDEDDEEEADLGPDPEEARERFAELREQYEKTRKAIEKHGRGSK 106

Query: 163 QGEEKRVEDKIGDE 176
           + ++ R  + + +E
Sbjct: 107 KAQKAR--EALAEE 118



 Score = 27.9 bits (63), Expect = 2.7
 Identities = 16/46 (34%), Positives = 26/46 (56%), Gaps = 1/46 (2%)

Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKER-RLPEE 146
           A A   TA   E E +EE+ E+++ + ++  E+DE E +    PEE
Sbjct: 34  AAAAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEADLGPDPEE 79


>gnl|CDD|227458 COG5129, MAK16, Nuclear protein with HMG-like acidic region
           [General function prediction only].
          Length = 303

 Score = 29.6 bits (66), Expect = 0.93
 Identities = 29/116 (25%), Positives = 54/116 (46%), Gaps = 12/116 (10%)

Query: 86  VLRLKKNILCENQL---FKAGARDETAREEEEEEEEEEEEEEEQEQD--EYLEEDENEK- 139
           +LRLK  I  +  L        +  T RE+ ++E+E   EEEE+     E + +D  ++ 
Sbjct: 166 MLRLKSGIYGDTPLNVREHLWNKAATEREKRQDEKERYVEEEEESDTELEAVTDDSEKEK 225

Query: 140 ------ERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQK 189
                 E+ L  +      E ++EE  + + +E   ED  G  ++ + +  KK++K
Sbjct: 226 TKKKDLEKWLGSDQSMETSESEEEESSESESDEDEDEDNKGKIRKRKTDDAKKSRK 281


>gnl|CDD|219069 pfam06512, Na_trans_assoc, Sodium ion transport-associated.
           Members of this family contain a region found
           exclusively in eukaryotic sodium channels or their
           subunits, many of which are voltage-gated. Members very
           often also contain between one and four copies of
           pfam00520 and, less often, one copy of pfam00612.
          Length = 230

 Score = 29.3 bits (66), Expect = 0.93
 Identities = 13/64 (20%), Positives = 25/64 (39%), Gaps = 2/64 (3%)

Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEE--KKEEEEDKQGEE 166
           A  E + E   +E+   +    + EE +      L  E++     E    + E  ++ EE
Sbjct: 108 ASGESDLENLNDEDTSSESSYGFKEESKKGSAETLKLEEKDSSSSEGSTVDLEPPEEEEE 167

Query: 167 KRVE 170
           +  E
Sbjct: 168 EIAE 171


>gnl|CDD|100111 cd05833, Ribosomal_P2, Ribosomal protein P2. This subfamily
           represents the eukaryotic large ribosomal protein P2.
           Eukaryotic P1 and P2 are functionally equivalent to the
           bacterial protein L7/L12, but are not homologous to
           L7/L12. P2 is located in the L12 stalk, with proteins
           P1, P0, L11, and 28S rRNA. P1 and P2 are the only
           proteins in the ribosome to occur as multimers, always
           appearing as sets of heterodimers. Recent data indicate
           that eukaryotes have four copies (two heterodimers),
           while most archaeal species contain six copies of L12p
           (three homodimers). Bacteria may have four or six copies
           of L7/L12 (two or three homodimers) depending on the
           species. Experiments using S. cerevisiae P1 and P2
           indicate that P1 proteins are positioned more internally
           with limited reactivity in the C-terminal domains, while
           P2 proteins seem to be more externally located and are
           more likely to interact with other cellular components.
           In lower eukaryotes, P1 and P2 are further subdivided
           into P1A, P1B, P2A, and P2B, which form P1A/P2B and
           P1B/P2A heterodimers. Some plants have a third
           P-protein, called P3, which is not homologous to P1 and
           P2. In humans, P1 and P2 are strongly autoimmunogenic.
           They play a significant role in the etiology and
           pathogenesis of systemic lupus erythema (SLE). In
           addition, the ribosome-inactivating protein
           trichosanthin (TCS) interacts with human P0, P1, and P2,
           with its primary binding site in the C-terminal region
           of P2. TCS inactivates the ribosome by depurinating a
           specific adenine in the sarcin-ricin loop of 28S rRNA.
          Length = 109

 Score = 28.4 bits (64), Expect = 0.95
 Identities = 11/28 (39%), Positives = 16/28 (57%)

Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQD 129
           A A    A    ++EE++EE EEE + D
Sbjct: 75  AAAAAAAAAAAAKKEEKKEESEEESDDD 102


>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
           Genome duplication is precisely regulated by
           cyclin-dependent kinases CDKs, which bring about the
           onset of S phase by activating replication origins and
           then prevent relicensing of origins until mitosis is
           completed. The optimum sequence motif for CDK
           phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
           to have at least 11 potential phosphorylation sites.
           Drc1 is required for DNA synthesis and S-M replication
           checkpoint control. Drc1 associates with Cdc2 and is
           phosphorylated at the onset of S phase when Cdc2 is
           activated. Thus Cdc2 promotes DNA replication by
           phosphorylating Drc1 and regulating its association with
           Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
           substrates required for DNA replication.
          Length = 397

 Score = 29.8 bits (67), Expect = 0.98
 Identities = 14/64 (21%), Positives = 31/64 (48%)

Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
            +     E +  EE  + +E+   E+L       E    E+DE  ++E +K+++  K+  
Sbjct: 306 SDEPSLPESDIHEEIPKLDEKSLSEFLGYMGGIDEDDEDEDDEESKEEVEKKQKVKKKPR 365

Query: 166 EKRV 169
           +++V
Sbjct: 366 KRKV 369



 Score = 29.4 bits (66), Expect = 1.0
 Identities = 18/113 (15%), Positives = 37/113 (32%), Gaps = 27/113 (23%)

Query: 104 ARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENE------------------------K 139
            ++E + ++E +   E E EE        E  +++                        +
Sbjct: 240 VQEEESIDDELDVLREIEAEEAGIGPIEEEVVDSQAANDEPRRVFKKKGQKRTTRRVKMR 299

Query: 140 ERRLPEEDERGEDEEKKEEEEDKQGEEKRVE---DKIGDEKRLEENKKKKTQK 189
             R    DE    E    EE  K  E+   E      G ++  E+   +++++
Sbjct: 300 PVRAKPSDEPSLPESDIHEEIPKLDEKSLSEFLGYMGGIDEDDEDEDDEESKE 352



 Score = 29.0 bits (65), Expect = 1.5
 Identities = 25/109 (22%), Positives = 34/109 (31%), Gaps = 28/109 (25%)

Query: 110 REEEEEEEEEEEEEEEQEQDEYLEEDENE----------------KERRLPEEDERGEDE 153
           RE E EE      EEE    +   ++                   +  R    DE    E
Sbjct: 254 REIEAEEAGIGPIEEEVVDSQAANDEPRRVFKKKGQKRTTRRVKMRPVRAKPSDEPSLPE 313

Query: 154 EKKEEEEDKQGEEKRVE---DKIG---------DEKRLEENKKKKTQKR 190
               EE  K  E+   E      G         DE+  EE +KK+  K+
Sbjct: 314 SDIHEEIPKLDEKSLSEFLGYMGGIDEDDEDEDDEESKEEVEKKQKVKK 362


>gnl|CDD|234350 TIGR03766, TIGR03766, conserved hypothetical integral membrane
           protein.  Models TIGR03110, TIGR03111, and TIGR03112
           describe a three-gene system found in several
           Gram-positive bacteria, where TIGR03110 (XrtG) is
           distantly related to a putative transpeptidase,
           exosortase (TIGR02602). This model describes a small
           clade that correlates by both gene clustering and
           phyletic pattern, although imperfectly, to the three
           gene system. Both this narrow clade, and the larger set
           of full-length homologous integral membrane proteins,
           have an especially well-conserved region near the
           C-terminus with an invariant tyrosine. The function is
           unknown.
          Length = 483

 Score = 29.6 bits (67), Expect = 1.0
 Identities = 17/69 (24%), Positives = 31/69 (44%), Gaps = 6/69 (8%)

Query: 13  SWNIWAVVTLILVFCTATGNILVCLAISWEKRLQNVTNY-FLMSLAITDLMVA----VLV 67
           SW  + VV ++LV  +A         + + K+   V  Y F++ LA++  ++       V
Sbjct: 137 SWLFFDVVNIVLVDLSALILYKAVKKV-FNKKKAFVALYLFVLLLALSPYILIPYTDTWV 195

Query: 68  MPLGILTLV 76
           +P   L L 
Sbjct: 196 LPFVSLFLF 204


>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex.  This
           entry is characterized by proteins with alternating
           conserved and low-complexity regions. Bud13 together
           with Snu17p and a newly identified factor,
           Pml1p/Ylr016c, form a novel trimeric complex. called The
           RES complex, pre-mRNA retention and splicing complex.
           Subunits of this complex are not essential for viability
           of yeasts but they are required for efficient splicing
           in vitro and in vivo. Furthermore, inactivation of this
           complex causes pre-mRNA leakage from the nucleus. Bud13
           contains a unique, phylogenetically conserved C-terminal
           region of unknown function.
          Length = 141

 Score = 28.8 bits (65), Expect = 1.0
 Identities = 24/69 (34%), Positives = 38/69 (55%), Gaps = 14/69 (20%)

Query: 107 ETAREEEEEEEEEEEEEEE------------QEQDEYLEEDENEKERRLP--EEDERGED 152
           E  REE+E E+EE+E +EE            +E+++ LEE E  K + L    +DE  ++
Sbjct: 15  EEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPLARYADDEDYDE 74

Query: 153 EEKKEEEED 161
           E K++E  D
Sbjct: 75  ELKEQERWD 83



 Score = 26.9 bits (60), Expect = 4.8
 Identities = 21/53 (39%), Positives = 31/53 (58%), Gaps = 4/53 (7%)

Query: 141 RRLPEEDERGEDEEKKEEEEDKQGEEKR----VEDKIGDEKRLEENKKKKTQK 189
           R +  E++R E E +KEE+E K+ +EK     +  K   EKRLEE +K K + 
Sbjct: 10  RIIDIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKP 62


>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34.  This family represents
           herpes virus protein U79 and cytomegalovirus early
           phosphoprotein P34 (UL112).
          Length = 238

 Score = 29.5 bits (66), Expect = 1.0
 Identities = 14/63 (22%), Positives = 27/63 (42%), Gaps = 8/63 (12%)

Query: 119 EEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK---KEEEEDKQGEEKRVEDKIGD 175
           +++E+   E  +  +ED  +K     +E++R  DE+K                 +D+   
Sbjct: 165 KQKEKRRVEDSQKHKEDRRKK-----QEEKRRNDEDKRPGGGGGSSGGQSGLSTKDEPPK 219

Query: 176 EKR 178
           EKR
Sbjct: 220 EKR 222



 Score = 28.7 bits (64), Expect = 1.5
 Identities = 22/85 (25%), Positives = 37/85 (43%), Gaps = 7/85 (8%)

Query: 112 EEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPE------EDERGEDEEKKEEEEDKQGE 165
            E+ E+E      ++  DE+ +    +KE+R  E      ED R + EEK+  +EDK+  
Sbjct: 140 AEKFEKECRALSRKKSDDEHRKRSGKQKEKRRVEDSQKHKEDRRKKQEEKRRNDEDKRPG 199

Query: 166 EKRVEDKIGDEKRLEEN-KKKKTQK 189
                         ++   K+K QK
Sbjct: 200 GGGGSSGGQSGLSTKDEPPKEKRQK 224


>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6.  In yeast, 15 Apg proteins
           coordinate the formation of autophagosomes. Autophagy is
           a bulk degradation process induced by starvation in
           eukaryotic cells. Apg6/Vps30p has two distinct functions
           in the autophagic process, either associated with the
           membrane or in a retrieval step of the carboxypeptidase
           Y sorting pathway.
          Length = 356

 Score = 29.5 bits (66), Expect = 1.0
 Identities = 18/44 (40%), Positives = 26/44 (59%), Gaps = 1/44 (2%)

Query: 106 DETAREEEEEEEE-EEEEEEEQEQDEYLEEDENEKERRLPEEDE 148
           DE  +EEE   +E EE E+E+ + D  L E + EKE+   EE +
Sbjct: 73  DELKKEEERLLDELEELEKEDDDLDGELVELQEEKEQLENEELQ 116


>gnl|CDD|234055 TIGR02907, spore_VI_D, stage VI sporulation protein D.  SpoVID, the
           stage VI sporulation protein D, is restricted to
           endospore-forming members of the bacteria, all of which
           are found among the Firmicutes. It is widely distributed
           but not quite universal in this group. Between
           well-conserved N-terminal and C-terminal domains is a
           poorly conserved, low-complexity region of variable
           length, rich enough in glutamic acid to cause spurious
           BLAST search results unless a filter is used. The seed
           alignment for this model was trimmed, in effect, by
           choosing member sequences in which these regions are
           relatively short. SpoVID is involved in spore coat
           assembly by the mother cell compartment late in the
           process of sporulation [Cellular processes, Sporulation
           and germination].
          Length = 338

 Score = 29.5 bits (66), Expect = 1.1
 Identities = 24/75 (32%), Positives = 35/75 (46%), Gaps = 5/75 (6%)

Query: 96  ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKE-----RRLPEEDERG 150
           +   +K  A +    E E E   EEE  EE E +  LE ++ EK          +ED   
Sbjct: 196 DPDEWKISASEPFQLESEVEASPEEENYEEYEDETELEVEDEEKALDEQTEDPQQEDALA 255

Query: 151 EDEEKKEEEEDKQGE 165
            D +K  EEE+++GE
Sbjct: 256 GDAKKALEEEEEKGE 270



 Score = 27.9 bits (62), Expect = 3.6
 Identities = 21/82 (25%), Positives = 33/82 (40%), Gaps = 7/82 (8%)

Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEED 161
                E ARE+EEEEE    E E   Q+E   E+E   E ++       E E  ++ E+ 
Sbjct: 140 NNLDAEPAREDEEEEESFSAEFEHPAQEETAGEEERTDEPKV-------EHEAHEQHEQP 192

Query: 162 KQGEEKRVEDKIGDEKRLEENK 183
              +    +    +  +LE   
Sbjct: 193 ADDDPDEWKISASEPFQLESEV 214


>gnl|CDD|145361 pfam02161, Prog_receptor, Progesterone receptor. 
          Length = 456

 Score = 29.6 bits (66), Expect = 1.1
 Identities = 10/35 (28%), Positives = 16/35 (45%)

Query: 118 EEEEEEEEQEQDEYLEEDENEKERRLPEEDERGED 152
           + +EE +E+ QD+    D    E  +     RGE 
Sbjct: 54  QGQEEPDEKTQDQQSLSDVERAEPEVEASRGRGEG 88


>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein. 
          Length = 198

 Score = 29.0 bits (65), Expect = 1.2
 Identities = 12/67 (17%), Positives = 33/67 (49%)

Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQG 164
           +++   EE++ E ++++ EE +   +  EE  + +E          ++E+  ++ + K  
Sbjct: 73  KEKLTEEEKKPESDDDKTEENENDPDNNEESGDSQESASANSLSDIDNEDDMDDSDLKLA 132

Query: 165 EEKRVED 171
            E  ++D
Sbjct: 133 TENIIKD 139


>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
          Length = 651

 Score = 29.6 bits (67), Expect = 1.2
 Identities = 23/106 (21%), Positives = 47/106 (44%), Gaps = 13/106 (12%)

Query: 89  LKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQ-EQDEYLEEDENEK----ERRL 143
           L++    +  L +     +  ++EE++++E E+ E+ +    E+ +  E++     E  L
Sbjct: 553 LQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAKIPPAEFFKRQEDKYSAFDETGL 612

Query: 144 PEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQK 189
           P  D  GE+  KK        E K++  +   + +L E    K  K
Sbjct: 613 PTHDADGEEISKK--------ERKKLSKEYDKQAKLHEEYLAKGGK 650


>gnl|CDD|227447 COG5117, NOC3, Protein involved in the nuclear export of
           pre-ribosomes [Translation, ribosomal structure and
           biogenesis / Intracellular trafficking and secretion].
          Length = 657

 Score = 29.6 bits (66), Expect = 1.2
 Identities = 18/49 (36%), Positives = 26/49 (53%), Gaps = 3/49 (6%)

Query: 129 DEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEK 177
           +E+L E E+E      E D   +DE++K  EE K   E  V+ +I  EK
Sbjct: 92  EEFLSESESEASL---EIDSDIKDEKQKSLEEQKIAPEIPVKQQIDSEK 137


>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota).  DNA
           Topoisomerase I (eukaryota), DNA topoisomerase V,
           Vaccina virus topoisomerase, Variola virus
           topoisomerase, Shope fibroma virus topoisomeras.
          Length = 391

 Score = 29.2 bits (66), Expect = 1.2
 Identities = 24/102 (23%), Positives = 54/102 (52%), Gaps = 7/102 (6%)

Query: 93  ILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGED 152
           ILC +Q   +   +++  + +E+ +  + + +  ++   L E  ++ +R+L  + ER  D
Sbjct: 263 ILCNHQRTVSKTHEKSMEKLQEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFER--D 320

Query: 153 EEKKEEEEDKQGEEKRVEDKIGD-----EKRLEENKKKKTQK 189
            EK + E  ++ +EK+ E+K        E+R+E+ + + T K
Sbjct: 321 NEKLDAEVKEKKKEKKKEEKKKKQIERLEERIEKLEVQATDK 362


>gnl|CDD|223097 COG0018, ArgS, Arginyl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 577

 Score = 29.5 bits (67), Expect = 1.2
 Identities = 18/70 (25%), Positives = 32/70 (45%)

Query: 115 EEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIG 174
           E+   E      E D YL E   +  + L E+    E+E ++E E+ + G+E+    +  
Sbjct: 178 EKRGREALGLTPEPDGYLGEYYVKIAKDLEEDPGNDEEEAREEVEKLESGDEEAELWRKF 237

Query: 175 DEKRLEENKK 184
            +  LE  K+
Sbjct: 238 VDLSLEGIKE 247


>gnl|CDD|222508 pfam14033, DUF4246, Protein of unknown function (DUF4246).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria and fungi.
           Proteins in this family are typically between 392 and
           644 amino acids in length.
          Length = 496

 Score = 29.1 bits (66), Expect = 1.3
 Identities = 12/47 (25%), Positives = 19/47 (40%)

Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGE 151
             +    EE + E  E+E+ E   +   E  +  +ERR     E G 
Sbjct: 264 HPKEKEPEEPKPEPGEDEDYEDYWERLEEYLDWREERRKAIHPEPGT 310


>gnl|CDD|237875 PRK14974, PRK14974, cell division protein FtsY; Provisional.
          Length = 336

 Score = 29.2 bits (66), Expect = 1.3
 Identities = 16/50 (32%), Positives = 27/50 (54%)

Query: 132 LEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEE 181
           +EE   E+E     E E  E+EE +EE+++K G   + +     EK +E+
Sbjct: 16  VEEKIEEEEEEEAPEAEEEEEEEDEEEKKEKPGFFDKAKITEIKEKDIED 65


>gnl|CDD|237799 PRK14715, PRK14715, DNA polymerase II large subunit; Provisional.
          Length = 1627

 Score = 29.4 bits (66), Expect = 1.3
 Identities = 13/40 (32%), Positives = 21/40 (52%)

Query: 98  QLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDEN 137
           +L +     +  + EE + EE +EE EE+E+  Y E  E 
Sbjct: 278 ELKEKKEEKDEEKSEEVKTEEVDEEFEEEEKGFYYELYEK 317



 Score = 27.9 bits (62), Expect = 4.7
 Identities = 13/46 (28%), Positives = 30/46 (65%)

Query: 139 KERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKK 184
           KE++  +++E+ E+ + +E +E+ + EEK    ++ ++  +E NKK
Sbjct: 280 KEKKEEKDEEKSEEVKTEEVDEEFEEEEKGFYYELYEKVNIEANKK 325


>gnl|CDD|220785 pfam10498, IFT57, Intra-flagellar transport protein 57.  Eukaryotic
           cilia and flagella are specialised organelles found at
           the periphery of cells of diverse organisms.
           Intra-flagellar transport (IFT) is required for the
           assembly and maintenance of eukaryotic cilia and
           flagella, and consists of the bidirectional movement of
           large protein particles between the base and the distal
           tip of the organelle. IFT particles contain multiple
           copies of two distinct protein complexes, A and B, which
           contain at least 6 and 11 protein subunits. IFT57 is
           part of complex B but is not, however, required for the
           core subunits to stay associated. This protein is known
           as Huntington-interacting protein-1 in humans.
          Length = 355

 Score = 29.3 bits (66), Expect = 1.3
 Identities = 11/44 (25%), Positives = 19/44 (43%)

Query: 122 EEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
             EE E++   E+D       + EE E  E ++ +  +E K   
Sbjct: 127 PNEEDEEENVDEDDAEIILEEVEEEVEIEEVDDDEGTQETKYKR 170


>gnl|CDD|215094 PLN00188, PLN00188, enhanced disease resistance protein (EDR2);
           Provisional.
          Length = 719

 Score = 29.4 bits (66), Expect = 1.3
 Identities = 16/83 (19%), Positives = 35/83 (42%)

Query: 71  GILTLVRVRTNLSGAVLRLKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDE 130
           G    + V  N++ A +  KKN   +         +  +R     +E+ +++EE Q  + 
Sbjct: 397 GAPPRIPVMVNMASASVSSKKNQKPQESSPSLDQTNAASRNSVMMDEDSDDDEEFQIPES 456

Query: 131 YLEEDENEKERRLPEEDERGEDE 153
             E +  + E +    +E  +D+
Sbjct: 457 EQEPETTKNETKDTAMEEEPQDK 479


>gnl|CDD|185582 PTZ00373, PTZ00373, 60S Acidic ribosomal protein P2; Provisional.
          Length = 112

 Score = 28.0 bits (62), Expect = 1.3
 Identities = 13/26 (50%), Positives = 20/26 (76%)

Query: 102 AGARDETAREEEEEEEEEEEEEEEQE 127
           AGA    A+ E ++EE++EEEEEE++
Sbjct: 79  AGAATAGAKAEAKKEEKKEEEEEEED 104



 Score = 26.8 bits (59), Expect = 3.5
 Identities = 13/22 (59%), Positives = 19/22 (86%)

Query: 102 AGARDETAREEEEEEEEEEEEE 123
           AGA+ E  +EE++EEEEEEE++
Sbjct: 84  AGAKAEAKKEEKKEEEEEEEDD 105



 Score = 26.8 bits (59), Expect = 3.8
 Identities = 12/29 (41%), Positives = 20/29 (68%)

Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQDE 130
             A   TA  + E ++EE++EEEE+E+D+
Sbjct: 77  PAAGAATAGAKAEAKKEEKKEEEEEEEDD 105


>gnl|CDD|217940 pfam04177, TAP42, TAP42-like family.  The TOR signalling pathway
           activates a cell-growth program in response to
           nutrients. TIP41 (pfam04176) interacts with TAP42 and
           negatively regulates the TOR signaling pathway.
          Length = 335

 Score = 29.2 bits (66), Expect = 1.4
 Identities = 11/32 (34%), Positives = 16/32 (50%)

Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERR 142
           +   E     EEEE+ E DE  +++E  K R 
Sbjct: 295 QGGGEAAASAEEEEDDEDDEEDDDEETLKARA 326


>gnl|CDD|237258 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
          Length = 925

 Score = 29.3 bits (66), Expect = 1.4
 Identities = 16/87 (18%), Positives = 37/87 (42%), Gaps = 4/87 (4%)

Query: 91  KNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDE-- 148
           + I  E +  K   + E +  +E EEE +   E + E  E+  + +   +  + ++    
Sbjct: 837 ELINEEQKNLKQEIKLELSEIQEAEEEIQNINENKNEFVEFKNDPKKLNKLIIAKDVLIK 896

Query: 149 --RGEDEEKKEEEEDKQGEEKRVEDKI 173
                DE K++E+  K+ ++   +   
Sbjct: 897 LVISSDEIKQDEKTTKKKKKDLEKTDE 923



 Score = 28.9 bits (65), Expect = 1.9
 Identities = 20/98 (20%), Positives = 40/98 (40%), Gaps = 4/98 (4%)

Query: 96  ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK 155
           E +L     + E      EE E   EE++  +Q+  LE  E ++     +     ++E  
Sbjct: 816 EERLKALRIQREEMLMRPEELELINEEQKNLKQEIKLELSEIQEAEEEIQNINENKNEFV 875

Query: 156 KEEEEDKQGEE----KRVEDKIGDEKRLEENKKKKTQK 189
           + + + K+  +    K V  K+       +  +K T+K
Sbjct: 876 EFKNDPKKLNKLIIAKDVLIKLVISSDEIKQDEKTTKK 913



 Score = 28.1 bits (63), Expect = 3.2
 Identities = 12/73 (16%), Positives = 31/73 (42%), Gaps = 6/73 (8%)

Query: 114 EEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKI 173
           E  ++  E EEE+ +   ++ +E           E  E   ++++   ++ + +  E + 
Sbjct: 806 ELIDQIIESEEERLKALRIQREEMLMRP------EELELINEEQKNLKQEIKLELSEIQE 859

Query: 174 GDEKRLEENKKKK 186
            +E+    N+ K 
Sbjct: 860 AEEEIQNINENKN 872



 Score = 27.7 bits (62), Expect = 4.3
 Identities = 23/87 (26%), Positives = 37/87 (42%), Gaps = 6/87 (6%)

Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE--DKQGEE 166
            R EE E   EE++  +QE    L E +  +E      + + E  E K + +  +K    
Sbjct: 831 MRPEELELINEEQKNLKQEIKLELSEIQEAEEEIQNINENKNEFVEFKNDPKKLNKLIIA 890

Query: 167 KRVEDK----IGDEKRLEENKKKKTQK 189
           K V  K      + K+ E+  KKK + 
Sbjct: 891 KDVLIKLVISSDEIKQDEKTTKKKKKD 917


>gnl|CDD|219882 pfam08524, rRNA_processing, rRNA processing.  This is a family of
           proteins that are involved in rRNA processing. In a
           localisation study they were found to localise to the
           nucleus and nucleolus. The family also includes other
           metazoa members from plants to mammals where the protein
           has been named BR22 and is associated with TTF-1,
           thyroid transcription factor 1. In the lungs, the family
           binds TTF-1 to form a complex which influences the
           expression of the key lung surfactant protein-B (SP-B)
           and -C (SP-C), the small hydrophobic surfactant proteins
           that maintain surface tension in alveoli.
          Length = 150

 Score = 28.3 bits (63), Expect = 1.4
 Identities = 22/81 (27%), Positives = 44/81 (54%), Gaps = 3/81 (3%)

Query: 112 EEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGED---EEKKEEEEDKQGEEKR 168
           E+E     E+E  E++     E+ + EK+++L E+ E  +    E++++E   +Q E ++
Sbjct: 43  EKEGYAVPEKESAEKQVKSSKEDRKFEKKKKLDEKKEIAKQRKREQREKELAKRQKELEK 102

Query: 169 VEDKIGDEKRLEENKKKKTQK 189
           +E     +K  E  +KK T+K
Sbjct: 103 IELSKKKQKERERRRKKLTKK 123


>gnl|CDD|218055 pfam04386, SspB, Stringent starvation protein B.  Escherichia coli
           stringent starvation protein B (SspB), is thought to
           enhance the specificity of degradation of tmRNA-tagged
           proteins by the ClpXP protease. The tmRNA tag, also
           known as ssrA, is an 11-aa peptide added to the C
           terminus of proteins stalled during translation, targets
           proteins for degradation by ClpXP and ClpAP. SspB a
           cytoplasmic protein that specifically binds to residues
           1-4 and 7 of the tag. Binding of SspB enhances
           degradation of tagged proteins by ClpX, and masks
           sequence elements important for ClpA interactions,
           inhibiting degradation by ClpA. However, more recent
           work has cast doubt on the importance of SspB in
           wild-type cells. SspB is encoded in an operon whose
           synthesis is stimulated by carbon, amino acid, and
           phosphate starvation. SspB may play a special role
           during nutrient stress, for example by ensuring rapid
           degradation of the products of stalled translation,
           without causing a global increase in degradation of all
           ClpXP substrates.
          Length = 153

 Score = 28.3 bits (64), Expect = 1.4
 Identities = 13/43 (30%), Positives = 22/43 (51%), Gaps = 3/43 (6%)

Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDE 153
           E EE +EEEEEE+++ +     E+ E +     P+   R   +
Sbjct: 111 EPEEADEEEEEEDDDADDSPEDEDPEPKDP---PKTKGRPSLK 150


>gnl|CDD|215283 PLN02518, PLN02518, pheophorbide a oxygenase.
          Length = 539

 Score = 29.1 bits (65), Expect = 1.4
 Identities = 14/84 (16%), Positives = 29/84 (34%), Gaps = 6/84 (7%)

Query: 62  MVAVLVMPLGILTLVRVRTNLSGAVLRLKKNILCENQLFKAGARDETAREEE------EE 115
           M  +L +    LTL       +   +  +      +   + G      R         E 
Sbjct: 1   MAVLLGIACNSLTLTSSTPKSTPFFIPARTIPFVSSSRPRRGKIFTPLRVAAPPSVPSEA 60

Query: 116 EEEEEEEEEEQEQDEYLEEDENEK 139
             +++E EE++ + E  +E  + K
Sbjct: 61  ALQQDEGEEQRVEQELGQESSDSK 84


>gnl|CDD|216194 pfam00922, Phosphoprotein, Vesiculovirus phosphoprotein. 
          Length = 283

 Score = 29.1 bits (65), Expect = 1.5
 Identities = 15/70 (21%), Positives = 31/70 (44%)

Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
           E  R ++    +E+  EE      YL E+ ++ E     ED++G   +    E+ +   +
Sbjct: 24  EEQRADKSSTFQEDSVEEHTSPSYYLAEELSDSETEPSIEDDQGLYTQLPPAEQVEGFIQ 83

Query: 167 KRVEDKIGDE 176
             ++D   D+
Sbjct: 84  GPLDDIADDD 93



 Score = 28.3 bits (63), Expect = 2.4
 Identities = 14/67 (20%), Positives = 27/67 (40%), Gaps = 4/67 (5%)

Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEED---KQGE 165
           A E  + E E   E+++    +    ++ E   + P  D+  +D+     EED   K   
Sbjct: 50  AEELSDSETEPSIEDDQGLYTQLPPAEQVEGFIQGPL-DDIADDDIDVVFEEDRPWKPPV 108

Query: 166 EKRVEDK 172
           +K    +
Sbjct: 109 QKSDNGR 115


>gnl|CDD|220379 pfam09750, DRY_EERY, Alternative splicing regulator.  This entry
           represents the conserved N-terminal region of SWAP
           (suppressor-of-white-apricot protein) proteins. This
           region contains two highly conserved motifs, viz: DRY
           and EERY, which appear to be the sites for alternative
           splicing of exons 2 and 3 of the SWAP mRNA. These
           proteins are thus thought to be involved in
           auto-regulation of pre-mRNA splicing. Most family
           members are associated with two Surp domains pfam01805
           and an Arginine- serine-rich binding region towards the
           C-terminus.
          Length = 121

 Score = 28.2 bits (63), Expect = 1.5
 Identities = 16/59 (27%), Positives = 29/59 (49%)

Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ 163
           R    R   + +EE+E EE    ++ Y +  +N+ E    EE+E+ E  +KK+ +    
Sbjct: 53  RGSRNRRPYKTDEEDEAEERFLNEERYRDLIKNDFEEAEKEEEEKRELAQKKKSKGGYN 111


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 28.9 bits (65), Expect = 1.5
 Identities = 15/76 (19%), Positives = 30/76 (39%), Gaps = 9/76 (11%)

Query: 114 EEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKI 173
           E      EEE +       ++ + +K     ++ E+ + ++K+++E   +GE K      
Sbjct: 37  EILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKT- 95

Query: 174 GDEKRLEENKKKKTQK 189
                    KK K  K
Sbjct: 96  --------PKKSKKTK 103


>gnl|CDD|215595 PLN03130, PLN03130, ABC transporter C family member; Provisional.
          Length = 1622

 Score = 29.3 bits (66), Expect = 1.5
 Identities = 16/58 (27%), Positives = 27/58 (46%), Gaps = 4/58 (6%)

Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDE-NEKERRLPEEDERGEDEE---KKEEEE 160
           E A + EE  EE  EEE++Q   + +     N  ++    + +  E +    K+EE E
Sbjct: 838 ENAGKMEEYVEENGEEEDDQTSSKPVANGNANNLKKDSSSKKKSKEGKSVLIKQEERE 895


>gnl|CDD|235551 PRK05667, dnaG, DNA primase; Validated.
          Length = 580

 Score = 29.0 bits (66), Expect = 1.6
 Identities = 21/81 (25%), Positives = 34/81 (41%), Gaps = 1/81 (1%)

Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVE 170
            +   EE     E     +E  EED    E+ L +  E+  D+  +E  E+   +E+ +E
Sbjct: 499 RDAGLEELAALLESLAVWEEISEEDIAALEKELKDALEKLRDQLLEERLEELIAKERLLE 558

Query: 171 DKIGD-EKRLEENKKKKTQKR 190
                 E+RLE  +     KR
Sbjct: 559 GHGLSSEERLELLQLLIELKR 579


>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
           [Translation, ribosomal structure and biogenesis].
          Length = 109

 Score = 27.8 bits (62), Expect = 1.6
 Identities = 16/29 (55%), Positives = 20/29 (68%)

Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQDE 130
           A A  E A E +E EEEE+EEE E+E D+
Sbjct: 73  AAAGAEAAAEADEAEEEEKEEEAEEESDD 101



 Score = 27.8 bits (62), Expect = 1.8
 Identities = 15/29 (51%), Positives = 18/29 (62%)

Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQD 129
            AGA      +E EEEE+EEE EEE + D
Sbjct: 74  AAGAEAAAEADEAEEEEKEEEAEEESDDD 102



 Score = 27.4 bits (61), Expect = 2.4
 Identities = 14/35 (40%), Positives = 19/35 (54%)

Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDE 136
           AGA    A   E   E +E EEEE+E++   E D+
Sbjct: 67  AGAAAAAAAGAEAAAEADEAEEEEKEEEAEEESDD 101



 Score = 26.2 bits (58), Expect = 6.4
 Identities = 13/30 (43%), Positives = 18/30 (60%)

Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDE 130
            A A    A  E +E EEEE+EEE +E+ +
Sbjct: 71  AAAAAGAEAAAEADEAEEEEKEEEAEEESD 100



 Score = 25.8 bits (57), Expect = 8.5
 Identities = 14/32 (43%), Positives = 19/32 (59%)

Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYL 132
            A   +  A  +E EEEE+EEE EE+  D+ L
Sbjct: 73  AAAGAEAAAEADEAEEEEKEEEAEEESDDDML 104


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
           subunit.  This is a family of proteins which are
           subunits of the eukaryotic translation initiation factor
           3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
           cerevisiae protein eIF3j (HCR1) has been shown to be
           required for processing of 20S pre-rRNA and binds to 18S
           rRNA and eIF3 subunits Rpg1p and Prt1p.
          Length = 242

 Score = 28.9 bits (65), Expect = 1.6
 Identities = 18/60 (30%), Positives = 36/60 (60%)

Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVE 170
           E+EE+EEE+ +   + +  + L+    EKE+   E++E+G  E +++  ED+  E+ R+ 
Sbjct: 41  EDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKLRLR 100



 Score = 27.7 bits (62), Expect = 3.6
 Identities = 15/77 (19%), Positives = 43/77 (55%), Gaps = 1/77 (1%)

Query: 115 EEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK-KEEEEDKQGEEKRVEDKI 173
           ++E+E+++ ++   +E  EE E EK +   +   +   + K +E+E+ K+ +E++   ++
Sbjct: 25  DDEDEDDDVKDSWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLREL 84

Query: 174 GDEKRLEENKKKKTQKR 190
            ++   +E  +K   ++
Sbjct: 85  EEDTPEDELAEKLRLRK 101



 Score = 27.7 bits (62), Expect = 3.7
 Identities = 9/36 (25%), Positives = 20/36 (55%)

Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERR 142
               E+E+ + E+EE+   + +++  E++  EK R 
Sbjct: 64  AKIEEKEKAKREKEEKGLRELEEDTPEDELAEKLRL 99


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 28.9 bits (65), Expect = 1.7
 Identities = 24/86 (27%), Positives = 43/86 (50%)

Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQG 164
           R+  A EEE E+ +   EE E+E +E  EE E  +ER    E+E    EE   + +++  
Sbjct: 723 RELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIE 782

Query: 165 EEKRVEDKIGDEKRLEENKKKKTQKR 190
           E +     + +E    E + ++ ++R
Sbjct: 783 ELEEKRQALQEELEELEEELEEAERR 808



 Score = 28.9 bits (65), Expect = 1.8
 Identities = 22/84 (26%), Positives = 37/84 (44%), Gaps = 2/84 (2%)

Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
           E +   E  EE E E EE +E+ E L+E     +  L E +   E+ E+   E ++  EE
Sbjct: 303 EISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEE 362

Query: 167 KRVEDKIGDEKRLEENKKKKTQKR 190
              E+K+       E   +  ++ 
Sbjct: 363 L--EEKLSALLEELEELFEALREE 384



 Score = 28.5 bits (64), Expect = 2.3
 Identities = 26/90 (28%), Positives = 39/90 (43%)

Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
               R+E A  E E  E   E EE + + E LEE       RL +  E  ++ E + EE 
Sbjct: 378 FEALREELAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELEEL 437

Query: 161 DKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
             + EE   E +  +E+  E   + K  +R
Sbjct: 438 QTELEELNEELEELEEQLEELRDRLKELER 467



 Score = 28.5 bits (64), Expect = 2.3
 Identities = 21/95 (22%), Positives = 42/95 (44%)

Query: 96  ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK 155
           E+ L +   + E    + EE + E    EE+ +      +E E+E    EE+     E  
Sbjct: 701 EDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERL 760

Query: 156 KEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
           +E EE+ +  E+ +     + + LEE ++   ++ 
Sbjct: 761 EELEEELESLEEALAKLKEEIEELEEKRQALQEEL 795



 Score = 28.5 bits (64), Expect = 2.4
 Identities = 27/95 (28%), Positives = 45/95 (47%), Gaps = 1/95 (1%)

Query: 96  ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKE-RRLPEEDERGEDEE 154
           + +L +A    E  + E EE  EE EE +E+  +   E +E E E   L E  E  E+E 
Sbjct: 259 QEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLRERLEELENEL 318

Query: 155 KKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQK 189
           ++ EE  ++ +EK    K   E+R    ++ +   
Sbjct: 319 EELEERLEELKEKIEALKEELEERETLLEELEQLL 353



 Score = 28.5 bits (64), Expect = 2.9
 Identities = 23/109 (21%), Positives = 42/109 (38%), Gaps = 2/109 (1%)

Query: 82  LSGAVLRLKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKER 141
           L   +  LK+ I    +  +         E+   E EE +EE E++    LEE E   E 
Sbjct: 321 LEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEA 380

Query: 142 RLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
               E+    + E  E   + +  ++ +E      +RL E  +   ++ 
Sbjct: 381 L--REELAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEEL 427



 Score = 27.8 bits (62), Expect = 4.1
 Identities = 22/75 (29%), Positives = 34/75 (45%)

Query: 104 ARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ 163
             +E    EEE  E E E  E +E+ E L E   E E +L   +    + E++ EEE + 
Sbjct: 882 LEEEKEELEEELRELESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEYED 941

Query: 164 GEEKRVEDKIGDEKR 178
             E  +E +I   + 
Sbjct: 942 TLETELEREIERLEE 956



 Score = 27.8 bits (62), Expect = 4.9
 Identities = 29/110 (26%), Positives = 51/110 (46%)

Query: 81  NLSGAVLRLKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKE 140
            L   +  L++ +    +  +       A +EE EE EEE EE E+  D    E E+ ++
Sbjct: 762 ELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQ 821

Query: 141 RRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
           RR   E E  E EE+ EE E+K  E +   +++  E    + + ++ +  
Sbjct: 822 RRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAE 871


>gnl|CDD|140234 PTZ00207, PTZ00207, hypothetical protein; Provisional.
          Length = 591

 Score = 29.0 bits (65), Expect = 1.7
 Identities = 20/67 (29%), Positives = 33/67 (49%), Gaps = 14/67 (20%)

Query: 18  AVVTLILVFCTATGNILVCL--------AISWEKRL----QNVTNYF--LMSLAITDLMV 63
           AVVT++ VF +  G ++  +        AI    +L     N + YF  LMS A+   ++
Sbjct: 141 AVVTVLSVFPSNRGAVVAIMKTFTGLGSAILGSIQLAFFSDNTSAYFFFLMSFALVVGIL 200

Query: 64  AVLVMPL 70
           A++ M L
Sbjct: 201 AIVFMRL 207


>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
          Length = 1832

 Score = 28.9 bits (65), Expect = 1.8
 Identities = 24/84 (28%), Positives = 36/84 (42%), Gaps = 16/84 (19%)

Query: 110 REEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ----GE 165
           +E  E EE  E EEE   + E   E +  K+            E++   EED       E
Sbjct: 243 KETSETEERVESEEETDVEIETASETKGTKQ------------EQEGSTEEDPSPSLFSE 290

Query: 166 EKRVEDKIGDEKRLEENKKKKTQK 189
           E+   DKI + + +  N K+K +K
Sbjct: 291 EREDPDKIDETEEIRVNGKEKIKK 314



 Score = 28.1 bits (63), Expect = 3.1
 Identities = 9/36 (25%), Positives = 21/36 (58%)

Query: 145 EEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLE 180
            E +  +  E+K +++ K+ ++K  E K  ++ R+E
Sbjct: 733 AEFKISDSVEEKTKKKKKKEKKKEEEYKREEKARIE 768



 Score = 28.1 bits (63), Expect = 3.9
 Identities = 17/63 (26%), Positives = 30/63 (47%), Gaps = 2/63 (3%)

Query: 125  EQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDK-QGEEKRVE-DKIGDEKRLEEN 182
              E+     E+ N++E+    + E   D+EKK   E     +EK +E D    + +  +N
Sbjct: 1512 SNEKKSIEIENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAESDIKKRKN 1571

Query: 183  KKK 185
            KK+
Sbjct: 1572 KKQ 1574



 Score = 27.4 bits (61), Expect = 6.8
 Identities = 14/50 (28%), Positives = 27/50 (54%), Gaps = 11/50 (22%)

Query: 121 EEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVE 170
            ++ E +  + +EE   +K+++           EKK+EEE K+ E+ R+E
Sbjct: 730 GKDAEFKISDSVEEKTKKKKKK-----------EKKKEEEYKREEKARIE 768


>gnl|CDD|218421 pfam05086, Dicty_REP, Dictyostelium (Slime Mold) REP protein.  This
           family consists of REP proteins from Dictyostelium
           (Slime molds). REP protein is likely involved in
           transcription regulation and control of DNA replication,
           specifically amplification of plasmid at low copy
           numbers. The formation of homomultimers may be required
           for their regulatory activity.
          Length = 910

 Score = 29.0 bits (65), Expect = 1.9
 Identities = 10/25 (40%), Positives = 19/25 (76%)

Query: 103 GARDETAREEEEEEEEEEEEEEEQE 127
              D+ A E+E+++E+E+E+E+E E
Sbjct: 886 ENEDDDADEDEDQDEDEDEDEDEDE 910



 Score = 26.7 bits (59), Expect = 9.0
 Identities = 10/34 (29%), Positives = 20/34 (58%)

Query: 96  ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQD 129
           E  L      +E    +E+E+++E+E+E+E E +
Sbjct: 877 EEYLISQFQENEDDDADEDEDQDEDEDEDEDEDE 910


>gnl|CDD|235622 PRK05846, PRK05846, NADH:ubiquinone oxidoreductase subunit M;
           Reviewed.
          Length = 497

 Score = 28.6 bits (65), Expect = 1.9
 Identities = 14/63 (22%), Positives = 24/63 (38%), Gaps = 11/63 (17%)

Query: 6   SFGLDAASWNIWAVV-TLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVA 64
             G+D  S  +  VV T +L          + +  SWE+  + V  Y    L +   ++ 
Sbjct: 77  HLGVDGIS--LLLVVLTALLT--------PLAVLASWEEIKKRVKFYMAALLWLEGGVIG 126

Query: 65  VLV 67
           V  
Sbjct: 127 VFA 129


>gnl|CDD|221775 pfam12794, MscS_TM, Mechanosensitive ion channel inner membrane
           domain 1.  The small mechanosensitive channel, MscS, is
           a part of the turgor-driven solute efflux system that
           protects bacteria from lysis in the event of osmotic
           shock. The MscS protein alone is sufficient to form a
           functional mechanosensitive channel gated directly by
           tension in the lipid bilayer. The MscS proteins are
           heptamers of three transmembrane subunits with seven
           converging M3 domains, and this domain is one of the
           inner membrane domains.
          Length = 339

 Score = 28.7 bits (65), Expect = 1.9
 Identities = 16/42 (38%), Positives = 19/42 (45%), Gaps = 3/42 (7%)

Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRL 143
           A    + A EEEE  E   E  EE E D  LE   + +  RL
Sbjct: 268 AEILAQRAEEEEESSEGAAETIEEPELD--LET-ISAQSLRL 306


>gnl|CDD|235887 PRK06925, PRK06925, flagellar motor protein MotS; Reviewed.
          Length = 230

 Score = 28.6 bits (64), Expect = 1.9
 Identities = 17/45 (37%), Positives = 25/45 (55%), Gaps = 6/45 (13%)

Query: 112 EEEEEEEEEEEEEEQEQDEYLEE-----DENE-KERRLPEEDERG 150
           E   E  E+EE++E + DE L++     +ENE K     + DERG
Sbjct: 67  ENPAESSEDEEKQEDQLDELLKKVNEYIEENELKSVITAKRDERG 111


>gnl|CDD|130366 TIGR01299, synapt_SV2, synaptic vesicle protein SV2.  This model
           describes a tightly conserved subfamily of the larger
           family of sugar (and other) transporters described by
           PFAM model pfam00083. Members of this subfamily include
           closely related forms SV2A and SV2B of synaptic vesicle
           protein from vertebrates and a more distantly related
           homolog (below trusted cutoff) from Drosophila
           melanogaster. Members are predicted to have two sets of
           six transmembrane helices.
          Length = 742

 Score = 28.8 bits (64), Expect = 2.0
 Identities = 16/79 (20%), Positives = 34/79 (43%), Gaps = 11/79 (13%)

Query: 104 ARDETAREEEEEEEEEEEEEE-----------EQEQDEYLEEDENEKERRLPEEDERGED 152
           A+DE ++      E+E+++++           E   DE   E  ++      E+DE  E 
Sbjct: 37  AQDEYSQRSYSRFEDEDDDDDFPAPADGYSRGEAADDEEEGEASSDATEGHDEDDEIYEG 96

Query: 153 EEKKEEEEDKQGEEKRVED 171
           E +     +  G+++ + D
Sbjct: 97  EYQGIPRAENGGKDEIMAD 115



 Score = 27.2 bits (60), Expect = 5.9
 Identities = 21/85 (24%), Positives = 30/85 (35%), Gaps = 22/85 (25%)

Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEY--LEEDENE-------------------KER 141
            A DE   E   +  E  +E++E  + EY  +   EN                    K+ 
Sbjct: 70  AADDEEEGEASSDATEGHDEDDEIYEGEYQGIPRAENGGKDEIMADGAPKAGVRDELKDG 129

Query: 142 RLPEEDERGEDEEKKEEEEDKQGEE 166
             P     GE E + +EEE  Q  E
Sbjct: 130 EGP-PGGEGEAERRADEEELAQQYE 153


>gnl|CDD|218003 pfam04281, Tom22, Mitochondrial import receptor subunit Tom22.  The
           mitochondrial protein translocase family, which is
           responsible for movement of nuclear encoded pre-proteins
           into mitochondria, is very complex with at least 19
           components. These proteins include several chaperone
           proteins, four proteins of the outer membrane
           translocase (Tom) import receptor, five proteins of the
           Tom channel complex, five proteins of the inner membrane
           translocase (Tim) and three "motor" proteins. This
           family represents the Tom22 proteins. The N terminal
           region of Tom22 has been shown to have chaperone-like
           activity, and the C terminal region faces the
           intermembrane face.
          Length = 136

 Score = 28.0 bits (63), Expect = 2.0
 Identities = 11/46 (23%), Positives = 23/46 (50%)

Query: 96  ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKER 141
           E    K  A    A+EE ++++E++ + +    D+   E+E   +R
Sbjct: 10  ETFQEKPAAPKNLAQEESDDDDEDDTDTDSDISDDSDFENETLYDR 55


>gnl|CDD|217956 pfam04194, PDCD2_C, Programmed cell death protein 2, C-terminal
           putative domain. 
          Length = 165

 Score = 28.1 bits (63), Expect = 2.0
 Identities = 19/55 (34%), Positives = 27/55 (49%), Gaps = 1/55 (1%)

Query: 119 EEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKI 173
            EEEE E      L E+  E+E    EE E+      KE+EE  + E+ + +DK 
Sbjct: 8   PEEEEYETLHAPSLPEEPEEREIDAFEELEKEGGG-GKEDEETDESEKAKSDDKT 61


>gnl|CDD|206316 pfam14147, Spore_YhaL, Sporulation protein YhaL.  This family of
           proteins is involved in sporulation. In B. subtilis its
           expression is regulated by the early
           mother-cell-specific transcription factor sigma-E.
          Length = 52

 Score = 26.6 bits (59), Expect = 2.0
 Identities = 14/28 (50%), Positives = 20/28 (71%)

Query: 115 EEEEEEEEEEEQEQDEYLEEDENEKERR 142
           EE E+E+E  E+E + Y+E  E E+ERR
Sbjct: 25  EEREKEQEFIEKEGEVYMERLEEERERR 52


>gnl|CDD|240413 PTZ00423, PTZ00423, glideosome-associated protein 45; Provisional.
          Length = 193

 Score = 28.2 bits (62), Expect = 2.0
 Identities = 25/85 (29%), Positives = 45/85 (52%), Gaps = 1/85 (1%)

Query: 97  NQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKK 156
           N   K  A++   R+ +E  E E+ ++E +E  E   ED  E+    PEE    ++EE +
Sbjct: 3   NYCRKNKAKEPKRRDIDELAEREKLKKEVEEIPEQKPEDIVEELEDQPEEPPE-QEEENE 61

Query: 157 EEEEDKQGEEKRVEDKIGDEKRLEE 181
           E++  ++ +    E+K  DEK L++
Sbjct: 62  EQKPKEEIDYPIQENKSFDEKNLDD 86



 Score = 28.2 bits (62), Expect = 2.3
 Identities = 22/82 (26%), Positives = 43/82 (52%), Gaps = 5/82 (6%)

Query: 95  CENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEE 154
           C     K   R +     E E+ ++E EE  +++ E + E+  ++    PE++E  E+++
Sbjct: 5   CRKNKAKEPKRRDIDELAEREKLKKEVEEIPEQKPEDIVEELEDQPEEPPEQEEENEEQK 64

Query: 155 KKEE-----EEDKQGEEKRVED 171
            KEE     +E+K  +EK ++D
Sbjct: 65  PKEEIDYPIQENKSFDEKNLDD 86



 Score = 26.2 bits (57), Expect = 8.7
 Identities = 22/82 (26%), Positives = 37/82 (45%)

Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
           +   EE  E++ E+  EE ++Q E   E E E E + P+E+     +E K  +E    + 
Sbjct: 28  KKEVEEIPEQKPEDIVEELEDQPEEPPEQEEENEEQKPKEEIDYPIQENKSFDEKNLDDL 87

Query: 167 KRVEDKIGDEKRLEENKKKKTQ 188
           +R    I  E    +N   K +
Sbjct: 88  ERSNSDIYSESHKYDNASDKLE 109


>gnl|CDD|240331 PTZ00254, PTZ00254, 40S ribosomal protein SA; Provisional.
          Length = 249

 Score = 28.5 bits (64), Expect = 2.0
 Identities = 12/43 (27%), Positives = 15/43 (34%), Gaps = 4/43 (9%)

Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDE 153
           EE EE+EE   E    E       +  E E    E      +E
Sbjct: 208 EEAEEKEEAAAETAGVEDAAAAAAEAEEGE----EWVAAANNE 246



 Score = 27.7 bits (62), Expect = 3.2
 Identities = 12/57 (21%), Positives = 18/57 (31%), Gaps = 15/57 (26%)

Query: 110 REEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
           R+ EE EE+EE   E    ++                     + E+ EE       E
Sbjct: 205 RDPEEAEEKEEAAAETAGVEDAAAAA---------------AEAEEGEEWVAAANNE 246



 Score = 26.9 bits (60), Expect = 6.5
 Identities = 11/37 (29%), Positives = 13/37 (35%), Gaps = 1/37 (2%)

Query: 105 RDETAREEEEEEEEEEEEEEEQEQ-DEYLEEDENEKE 140
           RD    EE+EE   E    E+        EE E    
Sbjct: 205 RDPEEAEEKEEAAAETAGVEDAAAAAAEAEEGEEWVA 241


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
           found at the N terminus of SMC proteins. The SMC
           (structural maintenance of chromosomes) superfamily
           proteins have ATP-binding domains at the N- and
           C-termini, and two extended coiled-coil domains
           separated by a hinge in the middle. The eukaryotic SMC
           proteins form two kind of heterodimers: the SMC1/SMC3
           and the SMC2/SMC4 types. These heterodimers constitute
           an essential part of higher order complexes, which are
           involved in chromatin and DNA dynamics. This family also
           includes the RecF and RecN proteins that are involved in
           DNA metabolism and recombination.
          Length = 1162

 Score = 28.8 bits (64), Expect = 2.0
 Identities = 22/83 (26%), Positives = 43/83 (51%)

Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
           +    + E EEEEEE+ E+ ++     E+E   +++L  E      + K+EE E K  EE
Sbjct: 346 KELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELELKNEEE 405

Query: 167 KRVEDKIGDEKRLEENKKKKTQK 189
           K  +  +   ++ E+  K++ ++
Sbjct: 406 KEAKLLLELSEQEEDLLKEEKKE 428



 Score = 28.8 bits (64), Expect = 2.2
 Identities = 28/106 (26%), Positives = 50/106 (47%)

Query: 81  NLSGAVLRLKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKE 140
            L   + + K+ I    +  K       A EEEEE+ E+ +E+ EQ ++E L + + E E
Sbjct: 326 KLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLESE 385

Query: 141 RRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKK 186
           R       + E+ E K EEE +      + ++  D  + E+ ++ K
Sbjct: 386 RLSSAAKLKEEELELKNEEEKEAKLLLELSEQEEDLLKEEKKEELK 431



 Score = 27.6 bits (61), Expect = 4.5
 Identities = 28/101 (27%), Positives = 47/101 (46%)

Query: 89  LKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDE 148
           L    L E ++          +EE E  ++E E+EEE       E  E EKE++L EE+ 
Sbjct: 230 LDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEEL 289

Query: 149 RGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQK 189
           +   +E++E + +    E+R  D     K  E+  KK  ++
Sbjct: 290 KLLAKEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKE 330



 Score = 27.6 bits (61), Expect = 5.3
 Identities = 27/106 (25%), Positives = 49/106 (46%), Gaps = 4/106 (3%)

Query: 85  AVLRLKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLP 144
           A+    K      +  +     E   +EEE EE   E +EEQ+ ++  EE+    E  + 
Sbjct: 806 ALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEEELERLEEEIT 865

Query: 145 EEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
           +E+   E   K+EE E    E+K  ++    E++ +E KK+  ++ 
Sbjct: 866 KEELLQELLLKEEELE----EQKLKDELESKEEKEKEEKKELEEES 907


>gnl|CDD|226096 COG3566, COG3566, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 379

 Score = 28.8 bits (64), Expect = 2.0
 Identities = 9/83 (10%), Positives = 26/83 (31%), Gaps = 2/83 (2%)

Query: 101 KAGARDETAREEEEEEEEEE--EEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEE 158
            + +      + +  E++    + + +  Q    EE + +K                 E+
Sbjct: 210 LSASLKTATEKVDALEKDLHAAQAKLDSGQALTKEELDAKKAELSKALAALEAANAADED 269

Query: 159 EEDKQGEEKRVEDKIGDEKRLEE 181
            +D+    +     +G  + L  
Sbjct: 270 PQDRDAAVEAAARLMGAARALVP 292


>gnl|CDD|220098 pfam09057, Smac_DIABLO, Second Mitochondria-derived Activator of
           Caspases.  Second Mitochondria-derived Activator of
           Caspases promotes apoptosis by activating caspases in
           the cytochrome c/Apaf-1/caspase-9 pathway, and by
           opposing the inhibitory activity of inhibitor of
           apoptosis proteins (XIAP-BIR3). The protein assumes an
           elongated three-helix bundle structure, and forms a
           dimer in solution.
          Length = 234

 Score = 28.3 bits (63), Expect = 2.1
 Identities = 14/62 (22%), Positives = 26/62 (41%), Gaps = 2/62 (3%)

Query: 76  VRVRTNLSGAVLRLK--KNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLE 133
           V  R +L  A  +++  + +  E +   A ++ E  +   E     +  E E   + YL 
Sbjct: 173 VAARNHLQVAQSQVEEVRQLSKEAEKKLAESKAEEIQRMAEYASSIDLSELEDIPEAYLR 232

Query: 134 ED 135
           ED
Sbjct: 233 ED 234


>gnl|CDD|234456 TIGR04074, bacter_Hen1, 3' terminal RNA ribose
           2'-O-methyltransferase Hen1.  Members of this protein
           family are bacterial Hen1, a 3' terminal RNA ribose
           2'-O-methyltransferase that acts in bacterial RNA
           repair. All members of the seed alignment belong to a
           cassette with the RNA repair enzyme polynucleotide
           kinase-phosphatase (Pnkp). Chemically similar Hen1 in
           eukaryotes acts instead on small regulatory RNAs
           [Transcription, RNA processing, Protein synthesis, tRNA
           and rRNA base modification].
          Length = 462

 Score = 28.4 bits (64), Expect = 2.1
 Identities = 17/40 (42%), Positives = 23/40 (57%)

Query: 104 ARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRL 143
           ARD  AR  E +E E EE E E+ Q+E  E+  +   +RL
Sbjct: 232 ARDALARLAEADEAEPEEAETEEAQEEAAEKPPSLNRQRL 271


>gnl|CDD|177543 PHA03146, PHA03146, helicase-primase primase subunit; Provisional.
          Length = 1075

 Score = 28.8 bits (64), Expect = 2.1
 Identities = 16/58 (27%), Positives = 25/58 (43%), Gaps = 4/58 (6%)

Query: 95  CENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEY--LEEDE--NEKERRLPEEDE 148
           C     +A A  E +   +  +  EE   E    D+Y  +EEDE  ++ E  +  E E
Sbjct: 693 CPPAAARALAAGEASSLSDYVDGVEECYMESDFLDDYAAMEEDEGWDDYEMAVAREGE 750


>gnl|CDD|219293 pfam07093, SGT1, SGT1 protein.  This family consists of several
           eukaryotic SGT1 proteins. Human SGT1 or hSGT1 is known
           to suppress GCR2 and is highly expressed in the muscle
           and heart. The function of this family is unknown
           although it has been speculated that SGT1 may be
           functionally analogous to the Gcr2p protein of
           Saccharomyces cerevisiae which is known to be a
           regulatory factor of glycolytic gene expression.
          Length = 557

 Score = 28.5 bits (64), Expect = 2.1
 Identities = 12/71 (16%), Positives = 30/71 (42%)

Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
                 + A E+++E+++E+E+       E LEE  ++ +  L + D     +       
Sbjct: 480 DLPDDSDDADEDDDEDDDEDEDSSSDSTLEELEEYMDQMDAELKQTDSSNNADISNSGSS 539

Query: 161 DKQGEEKRVED 171
             + ++  +E 
Sbjct: 540 GAEDDDDDIEG 550


>gnl|CDD|237192 PRK12758, PRK12758, DNA topoisomerase IV subunit A; Provisional.
          Length = 869

 Score = 28.8 bits (65), Expect = 2.2
 Identities = 13/34 (38%), Positives = 19/34 (55%)

Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKE 140
           E    EE EEE  EE EE   +D   E+D ++++
Sbjct: 829 EPLPYEEPEEETAEEPEEVDPEDVKSEDDIDDED 862


>gnl|CDD|235033 PRK02363, PRK02363, DNA-directed RNA polymerase subunit delta;
           Reviewed.
          Length = 129

 Score = 27.7 bits (62), Expect = 2.2
 Identities = 13/56 (23%), Positives = 37/56 (66%)

Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
           +E +EE    EE+ ++++ ++++ D++  +  +  +D+  E++  +E++ED++ EE
Sbjct: 74  DEIDEEIIPLEEKFDKKKKKFMDGDDDIIDDDILPDDDFDEEDLDEEDDEDEEDEE 129



 Score = 26.5 bits (59), Expect = 5.5
 Identities = 7/53 (13%), Positives = 29/53 (54%)

Query: 96  ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDE 148
           + ++     + +  +++  + +++  +++    D++ EED +E++    E++E
Sbjct: 77  DEEIIPLEEKFDKKKKKFMDGDDDIIDDDILPDDDFDEEDLDEEDDEDEEDEE 129


>gnl|CDD|222800 PHA00458, PHA00458, single-stranded DNA-binding protein.
          Length = 233

 Score = 28.2 bits (63), Expect = 2.3
 Identities = 23/75 (30%), Positives = 35/75 (46%), Gaps = 15/75 (20%)

Query: 87  LRLKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEE 146
           L+L+  +L E   F  G  D+ A E+EE+     E +  + QD               EE
Sbjct: 171 LQLESVMLVELAEFGGGGEDDWADEDEEDGYVASEAKARKPQD---------------EE 215

Query: 147 DERGEDEEKKEEEED 161
           +   +DEE+ EE+ED
Sbjct: 216 EWDEDDEEESEEDED 230


>gnl|CDD|172358 PRK13831, PRK13831, conjugal transfer protein TrbI; Provisional.
          Length = 432

 Score = 28.5 bits (64), Expect = 2.4
 Identities = 11/42 (26%), Positives = 21/42 (50%)

Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDE 148
           ET +EE+       +  + +E+   LE +E  + R   E++E
Sbjct: 97  ETQQEEKAVNPFTPQPGQREERRPTLESEEEWRARLKREQEE 138


>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
          Length = 1465

 Score = 28.7 bits (64), Expect = 2.4
 Identities = 32/133 (24%), Positives = 53/133 (39%), Gaps = 10/133 (7%)

Query: 65   VLVMPLGILTLVRV------RTNLSGAVLRLKKNILCENQLFKAGARDETAREEEEEEEE 118
            +L M +G LTL +V      R  L+  V  LKK       L    A ++   + ++E+ +
Sbjct: 1115 LLSMAIGTLTLEKVQELCADRDKLNIEVEDLKKTTPKSLWLKDLDALEKELDKLDKEDAK 1174

Query: 119  EEEEEEEQEQDEYLEEDENEKE--RRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDE 176
             EE  E+ ++     E    K+  R+ P++    +  +K  E E    EE      +  E
Sbjct: 1175 AEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETT--EETYGSSAMETE 1232

Query: 177  KRLEENKKKKTQK 189
               E  K K    
Sbjct: 1233 NVAEVVKPKGRAG 1245


>gnl|CDD|218555 pfam05320, Pox_RNA_Pol_19, Poxvirus DNA-directed RNA polymerase 19
           kDa subunit.  This family contains several DNA-directed
           RNA polymerase 19 kDa polypeptides. The Poxvirus
           DNA-directed RNA polymerase (EC: 2.7.7.6) catalyzes
           DNA-template-directed extension of the 3'-end of an RNA
           strand by one nucleotide at a time.
          Length = 167

 Score = 27.8 bits (62), Expect = 2.4
 Identities = 21/66 (31%), Positives = 33/66 (50%), Gaps = 2/66 (3%)

Query: 113 EEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDK 172
           E ++++ EE EEE+E +E  E  E+     L + + + E     E  ED     K+V +K
Sbjct: 11  ESDDDDSEEYEEEEEDEEDAESLESSDVSSLKQSNYKIES--ASENIEDATSNPKQVSEK 68

Query: 173 IGDEKR 178
           I   KR
Sbjct: 69  ISAIKR 74


>gnl|CDD|236568 PRK09560, nhaA, pH-dependent sodium/proton antiporter; Reviewed.
          Length = 389

 Score = 28.3 bits (64), Expect = 2.4
 Identities = 8/19 (42%), Positives = 15/19 (78%)

Query: 52  FLMSLAITDLMVAVLVMPL 70
           FL++LAI D + A++++ L
Sbjct: 155 FLLALAIIDDLGAIVIIAL 173


>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
          Length = 470

 Score = 28.5 bits (64), Expect = 2.4
 Identities = 12/69 (17%), Positives = 30/69 (43%), Gaps = 5/69 (7%)

Query: 121 EEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLE 180
              E  E +   ++ +N+++       ++   + KK+ ++D     + + +    E +  
Sbjct: 1   AMNENGENEVKQQKQQNKQK-----GTKKKNKKSKKDVDDDDAFLAELISENQEAENKQN 55

Query: 181 ENKKKKTQK 189
             KKKK +K
Sbjct: 56  NKKKKKKKK 64


>gnl|CDD|183731 PRK12766, PRK12766, 50S ribosomal protein L32e; Provisional.
          Length = 232

 Score = 28.3 bits (63), Expect = 2.5
 Identities = 18/52 (34%), Positives = 27/52 (51%)

Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKE 157
           +ET  E E+E  EEEEEE+   + E       EK   L +E+ R   + ++E
Sbjct: 63  EETEAEVEDEGGEEEEEEDADVETELRPRGLTEKTPELSDEEARLLTQRRRE 114


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 28.5 bits (64), Expect = 2.5
 Identities = 28/121 (23%), Positives = 53/121 (43%), Gaps = 35/121 (28%)

Query: 104  ARDETAREEEEEEEEEEEEEEEQEQDEY------------------LEEDENEKERRL-- 143
            A +E   EE++E ++E++EEE      Y                  L  +  +KE+ L  
Sbjct: 1061 AEEEEGAEEDDEADDEDDEEELGAAVSYDYLLSMPIWSLTKEKVEKLNAELEKKEKELEK 1120

Query: 144  -----PEE---------DERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQK 189
                 P++         +E  E++E+ EE+E  + +  + + K G   +L + K KK +K
Sbjct: 1121 LKNTTPKDMWLEDLDKFEEALEEQEEVEEKEIAKEQRLKSKTK-GKASKLRKPKLKKKEK 1179

Query: 190  R 190
            +
Sbjct: 1180 K 1180


>gnl|CDD|221342 pfam11960, DUF3474, Domain of unknown function (DUF3474).  This
           presumed domain is functionally uncharacterized. This
           domain is found in bacteria and eukaryotes. This domain
           is typically between 126 to 140 amino acids in length.
           This domain is found associated with pfam00487.
          Length = 135

 Score = 27.7 bits (62), Expect = 2.5
 Identities = 12/18 (66%), Positives = 14/18 (77%)

Query: 112 EEEEEEEEEEEEEEQEQD 129
           EEEEEE   +EEEE+E D
Sbjct: 74  EEEEEENGNDEEEEEEFD 91


>gnl|CDD|240370 PTZ00342, PTZ00342, acyl-CoA synthetase; Provisional.
          Length = 746

 Score = 28.5 bits (64), Expect = 2.6
 Identities = 12/58 (20%), Positives = 26/58 (44%), Gaps = 5/58 (8%)

Query: 133 EEDENEKER-----RLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKK 185
           +E++N             ++E+  ++   E E+   G  +  ++K+   K L+E  KK
Sbjct: 220 KEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEYDKEKLEKIKDLKEKAKK 277



 Score = 27.0 bits (60), Expect = 7.0
 Identities = 15/88 (17%), Positives = 34/88 (38%)

Query: 90  KKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDER 149
           + NI  E +   +   +   +  +EE++  +   E ++      E + EK  ++ +  E+
Sbjct: 215 EININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEYDKEKLEKIKDLKEK 274

Query: 150 GEDEEKKEEEEDKQGEEKRVEDKIGDEK 177
            +         D   + K    KI +E 
Sbjct: 275 AKKLGISIILFDDMTKNKTTNYKIQNED 302


>gnl|CDD|165564 PHA03309, PHA03309, transcriptional regulator ICP4; Provisional.
          Length = 2033

 Score = 28.7 bits (63), Expect = 2.6
 Identities = 16/53 (30%), Positives = 25/53 (47%), Gaps = 3/53 (5%)

Query: 105 RDETAREEEEEEEEEEEEEEEQ---EQDEYLEEDENEKERRLPEEDERGEDEE 154
           R E AR   + E  ++EEE E     Q+ +  ++  E   R   E +  EDE+
Sbjct: 315 RKEAARRSPDAERNDDEEEYESLPCAQEFFTSQELAEINARAIAESDEAEDED 367


>gnl|CDD|215107 PLN00220, PLN00220, tubulin beta chain; Provisional.
          Length = 447

 Score = 28.3 bits (63), Expect = 2.7
 Identities = 14/22 (63%), Positives = 17/22 (77%)

Query: 105 RDETAREEEEEEEEEEEEEEEQ 126
           +D TA EE E E+EEEEEE E+
Sbjct: 426 QDATADEEGEYEDEEEEEEYEE 447


>gnl|CDD|217526 pfam03385, DUF288, Protein of unknown function, DUF288. 
          Length = 390

 Score = 28.2 bits (63), Expect = 2.7
 Identities = 10/20 (50%), Positives = 12/20 (60%)

Query: 36  CLAISWEKRLQNVTNYFLMS 55
           C+ +  E  L NV  YFLMS
Sbjct: 278 CVTLVKELGLNNVEGYFLMS 297


>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
           Transcription of the anti-viral guanylate-binding
           protein (GBP) is induced by interferon-gamma during
           macrophage induction. This family contains GBP1 and
           GPB2, both GTPases capable of binding GTP, GDP and GMP.
          Length = 297

 Score = 28.0 bits (63), Expect = 2.8
 Identities = 22/86 (25%), Positives = 43/86 (50%), Gaps = 5/86 (5%)

Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKK-EEEEDKQG 164
            E A+ E  E E+E   E+++E+++ +E  E   +  + +  E+ E E +K   E+++  
Sbjct: 208 AERAKAEAAEAEQELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEAEREKLLAEQERML 267

Query: 165 EEKRVEDKIGDEKRLEENKKKKTQKR 190
           E K  E     E+ L+E  K + +  
Sbjct: 268 EHKLQE----QEELLKEGFKTEAESL 289


>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1.  All
           proteins in this family for which functions are known
           are cyclin dependent protein kinases that are components
           of TFIIH, a complex that is involved in nucleotide
           excision repair and transcription initiation. Also known
           as MAT1 (menage a trois 1). This family is based on the
           phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
           Stanford University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 309

 Score = 28.2 bits (63), Expect = 2.9
 Identities = 24/85 (28%), Positives = 40/85 (47%), Gaps = 7/85 (8%)

Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQG 164
           +D   + +E+   E+EE EE  E      E E E++RRL  + E  E+E++  + ++KQ 
Sbjct: 130 KDVIQKNKEKSTREQEELEEALEF-----EKEEEEQRRLLLQKE--EEEQQMNKRKNKQA 182

Query: 165 EEKRVEDKIGDEKRLEENKKKKTQK 189
               +E        L    KK + K
Sbjct: 183 LLDELETSTLPAAELIAQHKKNSVK 207


>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
           membranes [Cell envelope biogenesis, outer membrane].
          Length = 244

 Score = 27.8 bits (62), Expect = 3.0
 Identities = 18/84 (21%), Positives = 36/84 (42%)

Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
                E   EE + E E  +EQ +   E E   E   P+  E+ + E+K ++ + K   +
Sbjct: 45  AAKVLEAPTEEPQPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPK 104

Query: 167 KRVEDKIGDEKRLEENKKKKTQKR 190
            + + K+  + + ++   K   K 
Sbjct: 105 PKPKPKVKPQPKPKKPPSKTAAKA 128


>gnl|CDD|225340 COG2719, SpoVR, Uncharacterized conserved protein [Function
           unknown].
          Length = 495

 Score = 28.2 bits (63), Expect = 3.1
 Identities = 14/60 (23%), Positives = 24/60 (40%), Gaps = 7/60 (11%)

Query: 104 ARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ 163
            R +    EEE   +EE EE  Q Q        N+  R LP++      ++ +    + +
Sbjct: 177 KRPKKISSEEERARQEEREEYLQSQ-------VNDLWRTLPKKPGEAAVKDARRFPSEPE 229


>gnl|CDD|220284 pfam09538, FYDLN_acid, Protein of unknown function (FYDLN_acid).
           Members of this family are bacterial proteins with a
           conserved motif [KR]FYDLN, sometimes flanked by a pair
           of CXXC motifs, followed by a long region of low
           complexity sequence in which roughly half the residues
           are Asp and Glu, including multiple runs of five or more
           acidic residues. The function of members of this family
           is unknown.
          Length = 104

 Score = 26.9 bits (60), Expect = 3.1
 Identities = 10/61 (16%), Positives = 33/61 (54%)

Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
           +A A D     +++E+EE+E++    + D+  ++D+         + +  +D+  +++++
Sbjct: 44  RAPAADAEDAAKKDEDEEDEDDVVLDDDDDDDDDDDLPDLDDDDVDLDDDDDDFLEDDDD 103

Query: 161 D 161
           +
Sbjct: 104 E 104



 Score = 26.5 bits (59), Expect = 3.8
 Identities = 12/62 (19%), Positives = 35/62 (56%)

Query: 100 FKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEE 159
            K+ A    A +  +++E+EE+E++    D+  ++D+++      ++ +  +D++   E+
Sbjct: 41  AKSRAPAADAEDAAKKDEDEEDEDDVVLDDDDDDDDDDDLPDLDDDDVDLDDDDDDFLED 100

Query: 160 ED 161
           +D
Sbjct: 101 DD 102



 Score = 26.1 bits (58), Expect = 6.6
 Identities = 12/58 (20%), Positives = 32/58 (55%)

Query: 108 TAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
            A  E+  +++E+EE+E+    +  ++D+++ +    ++D+   D++  +  ED   E
Sbjct: 47  AADAEDAAKKDEDEEDEDDVVLDDDDDDDDDDDLPDLDDDDVDLDDDDDDFLEDDDDE 104


>gnl|CDD|227574 COG5249, RER1, Golgi protein involved in Golgi-to-ER retrieval
           [Intracellular trafficking and secretion].
          Length = 180

 Score = 27.5 bits (61), Expect = 3.2
 Identities = 12/29 (41%), Positives = 16/29 (55%), Gaps = 5/29 (17%)

Query: 124 EEQEQDEYLEEDENEKE-----RRLPEED 147
           E+ E D+ +EE E + E     RRLPE  
Sbjct: 85  EQIEDDDEIEEGEKDNEFRPFIRRLPEFK 113


>gnl|CDD|215153 PLN02271, PLN02271, serine hydroxymethyltransferase.
          Length = 586

 Score = 28.2 bits (63), Expect = 3.2
 Identities = 11/29 (37%), Positives = 16/29 (55%)

Query: 98  QLFKAGARDETAREEEEEEEEEEEEEEEQ 126
           QL +     E    EE +E+EEE+ E+E 
Sbjct: 50  QLLEQKEEKEEDAGEEGDEDEEEQGEDEH 78


>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
           includes archaeal L12p, the protein that is functionally
           equivalent to L7/L12 in bacteria and the P1 and P2
           proteins in eukaryotes. L12p is homologous to P1 and P2
           but is not homologous to bacterial L7/L12. It is located
           in the L12 stalk, with proteins L10, L11, and 23S rRNA.
           L12p is the only protein in the ribosome to occur as
           multimers, always appearing as sets of dimers. Recent
           data indicate that most archaeal species contain six
           copies of L12p (three homodimers), while eukaryotes have
           four copies (two heterodimers), and bacteria may have
           four or six copies (two or three homodimers), depending
           on the species. The organization of proteins within the
           stalk has been characterized primarily in bacteria,
           where L7/L12 forms either two or three homodimers and
           each homodimer binds to the extended C-terminal helix of
           L10. L7/L12 is attached to the ribosome through L10 and
           is the only ribosomal protein that does not directly
           interact with rRNA. Archaeal L12p is believed to
           function in a similar fashion. However, hybrid ribosomes
           containing the large subunit from E. coli with an
           archaeal stalk are able to bind archaeal and eukaryotic
           elongation factors but not bacterial elongation factors.
           In several mesophilic and thermophilic archaeal species,
           the binding of 23S rRNA to protein L11 and to the
           L10/L12p pentameric complex was found to be
           temperature-dependent and cooperative.
          Length = 106

 Score = 26.7 bits (59), Expect = 3.4
 Identities = 15/28 (53%), Positives = 23/28 (82%)

Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQD 129
           A A +E A E+EEE+++EEE+EEE+E+ 
Sbjct: 71  AAAAEEKAEEKEEEKKKEEEKEEEEEEA 98



 Score = 26.3 bits (58), Expect = 5.0
 Identities = 14/28 (50%), Positives = 22/28 (78%)

Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQD 129
           A A  E   EE+EEE+++EEE+EE+E++
Sbjct: 70  AAAAAEEKAEEKEEEKKKEEEKEEEEEE 97



 Score = 25.9 bits (57), Expect = 6.7
 Identities = 14/32 (43%), Positives = 23/32 (71%)

Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQDEYLE 133
           A A     + EE+EEE+++EEE+E+E++E L 
Sbjct: 69  AAAAAAEEKAEEKEEEKKKEEEKEEEEEEALA 100


>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
           component YidC; Validated.
          Length = 429

 Score = 27.9 bits (62), Expect = 3.4
 Identities = 12/68 (17%), Positives = 21/68 (30%), Gaps = 7/68 (10%)

Query: 123 EEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEEN 182
           E  + +     E    K R+        E  +K+   E +   E R E      +     
Sbjct: 328 ENAEIKKTRTAEKNEAKARK-------KEIAQKRRAAEREINREARQERAAAMARARARR 380

Query: 183 KKKKTQKR 190
              K +K+
Sbjct: 381 AAVKAKKK 388


>gnl|CDD|221323 pfam11931, DUF3449, Domain of unknown function (DUF3449).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is typically
           between 181 to 207 amino acids in length. This domain
           has two conserved sequence motifs: PIP and CEICG. The
           domain carries a zinc-finger domain of the C2H2-type.
          Length = 187

 Score = 27.2 bits (61), Expect = 3.8
 Identities = 9/32 (28%), Positives = 15/32 (46%)

Query: 108 TAREEEEEEEEEEEEEEEQEQDEYLEEDENEK 139
           T  E +   +E  E+  E   D+  ++DE E 
Sbjct: 32  TEEERQASADESSEDASEDGSDDDSDDDEEEP 63


>gnl|CDD|224559 COG1645, COG1645, Uncharacterized Zn-finger containing protein
           [General function prediction only].
          Length = 131

 Score = 27.0 bits (60), Expect = 3.8
 Identities = 25/78 (32%), Positives = 32/78 (41%), Gaps = 8/78 (10%)

Query: 113 EEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDK 172
              E   EEEEEE       E +  E+ RR   E     DE KKE +ED   E  ++   
Sbjct: 51  GYREVVVEEEEEEV------EAEVQEQLRRSRPELPDDSDELKKEIDEDPASE--KLLSV 102

Query: 173 IGDEKRLEENKKKKTQKR 190
           I   KR+E  +      R
Sbjct: 103 IETLKRIEARRDLSRAGR 120


>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2.  PPP4R2 (protein phosphatase 4 core
           regulatory subunit R2) is the regulatory subunit of the
           histone H2A phosphatase complex. It has been shown to
           confer resistance to the anticancer drug cisplatin in
           yeast, and may confer resistance in higher eukaryotes.
          Length = 285

 Score = 27.5 bits (61), Expect = 3.8
 Identities = 15/56 (26%), Positives = 38/56 (67%), Gaps = 4/56 (7%)

Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEED 161
           D+  +  + + ++ ++EE++ +  +Y+EE E +++    EE+E  E+EE++E+E++
Sbjct: 234 DDGPKGSDVKNKKSDDEEDDDQDGDYVEEKELKED----EEEEETEEEEEEEDEDE 285


>gnl|CDD|221389 pfam12037, DUF3523, Domain of unknown function (DUF3523).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is typically
           between 257 to 277 amino acids in length. This domain is
           found associated with pfam00004. This domain has a
           conserved LER sequence motif.
          Length = 276

 Score = 27.8 bits (62), Expect = 3.9
 Identities = 28/99 (28%), Positives = 53/99 (53%), Gaps = 5/99 (5%)

Query: 96  ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPE-EDERGEDEE 154
           E +  +A A+ E AR E EE  +  +E+ +QEQ     +DE  ++R   E E +R ++EE
Sbjct: 74  EYEAQQAQAKLERARVEAEERRKTLQEQTQQEQQRAQYQDELARKRYQKELEQQRRQNEE 133

Query: 155 --KKEEEEDKQGEE--KRVEDKIGDEKRLEENKKKKTQK 189
             K +EE   + E   +  E++I + +R    ++ + ++
Sbjct: 134 LLKMQEESVLRQEAMRRATEEEILEMRRETIEEEAELER 172


>gnl|CDD|224217 COG1298, FlhA, Flagellar biosynthesis pathway, component FlhA [Cell
           motility and secretion / Intracellular trafficking and
           secretion].
          Length = 696

 Score = 28.0 bits (63), Expect = 4.0
 Identities = 32/122 (26%), Positives = 52/122 (42%), Gaps = 15/122 (12%)

Query: 24  LVFCTATGNILVCLAISWEKRLQNVTNYFLMS---LAITDLMVAVL-VMP-LGILTLVRV 78
           L+  TA G I+V    + E   Q +      +   L I   ++ +L ++P +  L  + +
Sbjct: 254 LLISTAAG-IIVTRVSTDESLGQQLFTQLFANPKVLLIAAGVLFLLGLIPGMPHLPFLLL 312

Query: 79  RTNLSGAVLRLKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENE 138
              L+     L K    E Q  +         +  EE ++EEEEEEE+  D+ L  D  E
Sbjct: 313 AGLLAFLAYLLSKRK--EQQAEEKE-------KPAEEAKKEEEEEEEESVDDVLLIDPIE 363

Query: 139 KE 140
            E
Sbjct: 364 LE 365


>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
           complex component).  RNases H are enzymes that
           specifically hydrolyse RNA when annealed to a
           complementary DNA and are present in all living
           organisms. In yeast RNase H2 is composed of a complex of
           three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
           family represents the homologues of Ydr279p. It is not
           known whether non yeast proteins in this family fulfil
           the same function.
          Length = 287

 Score = 27.7 bits (62), Expect = 4.0
 Identities = 15/59 (25%), Positives = 28/59 (47%), Gaps = 2/59 (3%)

Query: 132 LEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
           L +     E +  ++  +   ++K+E EED +  E R E K   ++  E  KKK  + +
Sbjct: 211 LLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKE--EIKKKKPKESK 267



 Score = 26.5 bits (59), Expect = 7.8
 Identities = 14/75 (18%), Positives = 31/75 (41%), Gaps = 4/75 (5%)

Query: 112 EEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVED 171
            +  +E  +     + +       E++K++R  EED     E  +   E K+  ++ ++ 
Sbjct: 205 PDLYKELLKSLLIPEFKPLDKYLKESKKKKRETEED----VEAAESRAEKKRKSKEEIKK 260

Query: 172 KIGDEKRLEENKKKK 186
           K   E +  +  KK 
Sbjct: 261 KKPKESKGVKALKKV 275


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 27.8 bits (62), Expect = 4.1
 Identities = 22/80 (27%), Positives = 40/80 (50%)

Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
                E E  EE EE E+E +E +E L E    +E    E +E+ E  E   EE ++  E
Sbjct: 491 SREKEEAELREEIEELEKELRELEEELIELLELEEALKEELEEKLEKLENLLEELEELKE 550

Query: 166 EKRVEDKIGDEKRLEENKKK 185
           + +++    + ++LE+  ++
Sbjct: 551 KLQLQQLKEELRQLEDRLQE 570



 Score = 26.6 bits (59), Expect = 9.9
 Identities = 23/80 (28%), Positives = 37/80 (46%)

Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVE 170
           EE   E EEE+E  E+ +   LE +  E E     E+E  E E   EE E+K    + +E
Sbjct: 238 EERLAELEEEKERLEELKARLLEIESLELEALKIREEELRELERLLEELEEKIERLEELE 297

Query: 171 DKIGDEKRLEENKKKKTQKR 190
            +I + +   E  +   ++ 
Sbjct: 298 REIEELEEELEGLRALLEEL 317


>gnl|CDD|227563 COG5238, RNA1, Ran GTPase-activating protein (RanGAP) involved in
           mRNA processing and transport [Signal transduction
           mechanisms / RNA processing and modification].
          Length = 388

 Score = 27.6 bits (61), Expect = 4.3
 Identities = 15/45 (33%), Positives = 22/45 (48%)

Query: 113 EEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKE 157
             EE  +EE E+E E DE + E+  E E    + D+  E   + E
Sbjct: 343 VTEESTDEESEDEVEIDESVIEEVAEMELLEVQVDDLAERLAETE 387



 Score = 27.2 bits (60), Expect = 5.6
 Identities = 11/35 (31%), Positives = 17/35 (48%)

Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKE 140
           D    E  +EE E+E E +E   +E  E +  E +
Sbjct: 341 DVVTEESTDEESEDEVEIDESVIEEVAEMELLEVQ 375



 Score = 27.2 bits (60), Expect = 6.0
 Identities = 11/53 (20%), Positives = 24/53 (45%)

Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEE 158
           +     E++E +   EE  ++E ++ +E DE+  E     E    + ++  E 
Sbjct: 330 EVVEVVEKQEGDVVTEESTDEESEDEVEIDESVIEEVAEMELLEVQVDDLAER 382


>gnl|CDD|189762 pfam00901, Orbi_VP5, Orbivirus outer capsid protein VP5.
           cryoelectron microscopy indicates that VP5 is a trimer
           implying that there are 360 copies of VP5 per virion.
          Length = 507

 Score = 27.7 bits (62), Expect = 4.4
 Identities = 16/50 (32%), Positives = 30/50 (60%)

Query: 116 EEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
           +E E E++EE+ ++++ ++   +    L E  +  + E K EEEE+KQ E
Sbjct: 94  KELEREQKEEEVREKHNKKIIEKFGEDLEEVYKFMKGEAKVEEEEEKQME 143


>gnl|CDD|227509 COG5182, CUS1, Splicing factor 3b, subunit 2 [RNA processing and
           modification].
          Length = 429

 Score = 27.6 bits (61), Expect = 4.4
 Identities = 18/79 (22%), Positives = 30/79 (37%), Gaps = 3/79 (3%)

Query: 99  LFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEE 158
           LF+ G         + E EE   +EE+  +  Y  ED  E   R+ +    G  E     
Sbjct: 345 LFEVGPETAEGELLQFEAEEIYNDEEQYTRPVYENEDFEE---RVDDVSNGGLAEFAPAA 401

Query: 159 EEDKQGEEKRVEDKIGDEK 177
            E    E+ +V  +   ++
Sbjct: 402 VEVAPKEDAKVTKRGKAKR 420


>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
           Mitofilin controls mitochondrial cristae morphology.
           Mitofilin is enriched in the narrow space between the
           inner boundary and the outer membranes, where it forms a
           homotypic interaction and assembles into a large
           multimeric protein complex. The first 78 amino acids
           contain a typical amino-terminal-cleavable mitochondrial
           presequence rich in positive-charged and hydroxylated
           residues and a membrane anchor domain. In addition, it
           has three centrally located coiled coil domains.
          Length = 493

 Score = 27.7 bits (62), Expect = 4.5
 Identities = 24/75 (32%), Positives = 40/75 (53%), Gaps = 2/75 (2%)

Query: 115 EEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIG 174
           EEEEE E   +++++E L + E E   RL  ++   E + + E E +K+   K+ E+K+ 
Sbjct: 185 EEEEELERALKEKREELLSKLEEELLARLESKEAALEKQLRLEFEREKEELRKKYEEKLR 244

Query: 175 DEKRLEENKKKKTQK 189
            E  LE   +   QK
Sbjct: 245 QE--LERQAEAHEQK 257


>gnl|CDD|240341 PTZ00272, PTZ00272, heat shock protein 83 kDa (Hsp83); Provisional.
          Length = 701

 Score = 27.7 bits (61), Expect = 4.5
 Identities = 30/99 (30%), Positives = 50/99 (50%), Gaps = 14/99 (14%)

Query: 100 FKAGARDETAREEEEEEEEEEEEEEE--QEQDEYLEED-----ENEKERRLPEEDERGED 152
            K G R     +E++ E  E    +E  ++  E++  D     E   E+ + +EDE  ED
Sbjct: 165 MKRGTRITLHLKEDQMEYLEPRRLKELIKKHSEFIGYDIELMVEKTTEKEVTDEDE--ED 222

Query: 153 EEKKEEEEDKQGEEKRVED-KIGDEKRLEENKKKKTQKR 190
            +K +E+    GEE +VE+ K GDE + ++ KK K   +
Sbjct: 223 TKKADED----GEEPKVEEVKEGDEGKKKKTKKVKEVTK 257


>gnl|CDD|129416 TIGR00316, cdhC, CO dehydrogenase/CO-methylating acetyl-CoA
           synthase complex, beta subunit.  Nomenclature follows
           the description for Methanosarcina thermophila. The
           CO-methylating acetyl-CoA synthase is considered the
           defining enzyme of the Wood-Ljungdahl pathway, used for
           acetate catabolism by sulfate reducing bacteria but for
           acetate biosynthesis by acetogenic bacteria such as
           oorella thermoacetica (f. Clostridium thermoaceticum)
           [Energy metabolism, Chemoautotrophy].
          Length = 458

 Score = 27.5 bits (61), Expect = 4.5
 Identities = 14/39 (35%), Positives = 19/39 (48%), Gaps = 3/39 (7%)

Query: 89  LKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQE 127
           LK+     + + K   R+    E EEEEE  + EE E E
Sbjct: 388 LKEK---GHPVVKRVVREVDEEEIEEEEEAMQPEEMEME 423



 Score = 27.1 bits (60), Expect = 6.5
 Identities = 12/22 (54%), Positives = 15/22 (68%)

Query: 109 AREEEEEEEEEEEEEEEQEQDE 130
            RE +EEE EEEEE  + E+ E
Sbjct: 400 VREVDEEEIEEEEEAMQPEEME 421



 Score = 27.1 bits (60), Expect = 7.3
 Identities = 11/27 (40%), Positives = 16/27 (59%)

Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLE 133
           +    E +EEE EEEEE  Q ++  +E
Sbjct: 397 KRVVREVDEEEIEEEEEAMQPEEMEME 423


>gnl|CDD|227494 COG5165, POB3, Nucleosome-binding factor SPN, POB3 subunit
           [Transcription / DNA replication, recombination, and
           repair / Chromatin structure and dynamics].
          Length = 508

 Score = 27.7 bits (61), Expect = 4.7
 Identities = 18/74 (24%), Positives = 30/74 (40%), Gaps = 6/74 (8%)

Query: 104 ARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ 163
           AR+E  +E  + +     + E+       EEDE+E      +ED +   +    EE D Q
Sbjct: 439 ARNEEVQERLQTDLGSISDSEDINMGSAGEEDESE------DEDFQMVSDSDVAEEYDLQ 492

Query: 164 GEEKRVEDKIGDEK 177
                 E    +E+
Sbjct: 493 AALSDAEGGSDEER 506


>gnl|CDD|238818 cd01640, IPPase, IPPase; Inositol polyphosphate-1-phosphatase, a
           member of the Mg++ dependent family of inositol
           monophosphatase-like domains, hydrolyzes the 1' position
           phosphate from inositol 1,3,4-trisphosphate and inositol
           1,4-bisphosphate. Members in this group may also exhibit
           3'-phosphoadenosine 5'-phosphate phosphatase activity,
           and they all appear to be inhibited by lithium. IPPase
           is one of the proposed targets of Li+ therapy in
           manic-depressive illness.
          Length = 293

 Score = 27.3 bits (61), Expect = 4.7
 Identities = 16/38 (42%), Positives = 22/38 (57%), Gaps = 1/38 (2%)

Query: 111 EEEEEEEEEEEEEEEQEQDEY-LEEDENEKERRLPEED 147
           EE+ E E +E+E  + + DE  LEE      + LPEED
Sbjct: 69  EEDNEFENQEDESRDVDLDEEILEESCPSPSKDLPEED 106


>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein.  Rotavirus particles
           consist of three concentric proteinaceous capsid layers.
           The innermost capsid (core) is made of VP2. The genomic
           RNA and the two minor proteins VP1 and VP3 are
           encapsidated within this layer. The N-terminus of
           rotavirus VP2 is necessary for the encapsidation of VP1
           and VP3.
          Length = 887

 Score = 27.6 bits (61), Expect = 4.9
 Identities = 16/86 (18%), Positives = 44/86 (51%), Gaps = 1/86 (1%)

Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQG 164
           R+      +  +E+++E+++++ + E  E+  ++KE  +  ++     +E+  +E  K  
Sbjct: 7   REANINNNDRMQEKDDEKQDQKNRMELKEKVLDKKE-EVVTDNVDSPVKEQSSQENLKIA 65

Query: 165 EEKRVEDKIGDEKRLEENKKKKTQKR 190
           +E +   K   ++ LE  K K+  ++
Sbjct: 66  DEVKKSTKEESKQLLEVLKTKEEHQK 91



 Score = 27.6 bits (61), Expect = 5.1
 Identities = 17/80 (21%), Positives = 38/80 (47%), Gaps = 5/80 (6%)

Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQG 164
           R+         +  +E+++E+Q+Q   +E     KE+ L +++E   D      +E    
Sbjct: 4   RNRREANINNNDRMQEKDDEKQDQKNRME----LKEKVLDKKEEVVTDNVDSPVKEQSSQ 59

Query: 165 EEKRVEDKIGDEKRLEENKK 184
           E  ++ D++    + EE+K+
Sbjct: 60  ENLKIADEVKKSTK-EESKQ 78


>gnl|CDD|215521 PLN02967, PLN02967, kinase.
          Length = 581

 Score = 27.7 bits (61), Expect = 5.0
 Identities = 16/50 (32%), Positives = 25/50 (50%)

Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGED 152
             R     +E+ E++  E E  + E+ E++   ENE E  L  E + GED
Sbjct: 139 KRRKVKKMDEDVEDQGSESEVSDVEESEFVTSLENESEEELDLEKDDGED 188


>gnl|CDD|177447 PHA02664, PHA02664, hypothetical protein; Provisional.
          Length = 534

 Score = 27.7 bits (61), Expect = 5.0
 Identities = 16/72 (22%), Positives = 27/72 (37%), Gaps = 3/72 (4%)

Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLE---EDENEKERRLPEEDERGEDEEKKEEE 159
           GAR       EE              D+ +E    DE +++   P   +R + +E   +E
Sbjct: 400 GARGSPMAAPEEGRAAAAAAAANAPADQDVEAEAHDEFDQDPGAPAHADRADSDEDDMDE 459

Query: 160 EDKQGEEKRVED 171
           ++   E    ED
Sbjct: 460 QESGDERADGED 471


>gnl|CDD|185626 PTZ00447, PTZ00447, apical membrane antigen 1-like protein;
           Provisional.
          Length = 508

 Score = 27.6 bits (61), Expect = 5.1
 Identities = 17/60 (28%), Positives = 33/60 (55%)

Query: 113 EEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDK 172
           +++  E    E ++ +++ LEEDE   +   P+E E    +EK+ E ++ Q +E + E K
Sbjct: 412 QKDSLETTAGERDEFKNDVLEEDEISGDNIGPDEIENDHYQEKEIENDNIQYDENKNEGK 471


>gnl|CDD|240227 PTZ00009, PTZ00009, heat shock 70 kDa protein; Provisional.
          Length = 653

 Score = 27.5 bits (61), Expect = 5.1
 Identities = 15/40 (37%), Positives = 22/40 (55%), Gaps = 1/40 (2%)

Query: 137 NEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDE 176
           N+K R    + +R  +E +K + ED+   E RVE K G E
Sbjct: 507 NDKGRLSKADIDRMVNEAEKYKAEDEANRE-RVEAKNGLE 545


>gnl|CDD|236410 PRK09202, nusA, transcription elongation factor NusA; Validated.
          Length = 470

 Score = 27.5 bits (62), Expect = 5.3
 Identities = 13/65 (20%), Positives = 30/65 (46%)

Query: 112 EEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVED 171
           EE      EE  E +  DE   E+  E+ +   E +   ++E+  ++    +G ++ +  
Sbjct: 381 EELAYVPVEELLEIEGFDEETVEELRERAKEALETEALAQEEKLADDLLSLEGLDRELAF 440

Query: 172 KIGDE 176
           K+ ++
Sbjct: 441 KLAEK 445



 Score = 26.8 bits (60), Expect = 8.6
 Identities = 15/61 (24%), Positives = 28/61 (45%)

Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVE 170
            E E  +EE  EE  +   E LE +   +E +L ++    E  +++   +  +   K +E
Sbjct: 392 LEIEGFDEETVEELRERAKEALETEALAQEEKLADDLLSLEGLDRELAFKLAEKGIKTLE 451

Query: 171 D 171
           D
Sbjct: 452 D 452


>gnl|CDD|153125 cd02434, Nodulin-21_like_3, Nodulin-21 and CCC1-related protein
           family.  Nodulin-21_like_3: This is a family of proteins
           closely related to nodulin-21, a plant nodule-specific
           protein that may be involved in symbiotic nitrogen
           fixation. This family is also related to CCC1, a yeast
           vacuole transmembrane protein that functions as an iron
           and manganese transporter. .
          Length = 225

 Score = 26.9 bits (60), Expect = 5.4
 Identities = 9/39 (23%), Positives = 19/39 (48%)

Query: 100 FKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENE 138
               A +  + + E +    E++ EE E + Y E +++E
Sbjct: 47  ISMAAGEYVSTKAERDFLHSEKKREEWEIENYPEGEKSE 85


>gnl|CDD|218556 pfam05327, RRN3, RNA polymerase I specific transcription initiation
           factor RRN3.  This family consists of several eukaryotic
           proteins which are homologous to the yeast RRN3 protein.
           RRN3 is one of the RRN genes specifically required for
           the transcription of rDNA by RNA polymerase I (Pol I) in
           Saccharomyces cerevisiae.
          Length = 554

 Score = 27.3 bits (61), Expect = 5.4
 Identities = 10/37 (27%), Positives = 24/37 (64%)

Query: 112 EEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDE 148
           ++EEEE    +E++ ++D+  + D++++E   PE + 
Sbjct: 226 DDEEEERVLADEDDDDEDDMFDMDDDDEEESDPEVER 262



 Score = 26.9 bits (60), Expect = 7.5
 Identities = 14/54 (25%), Positives = 28/54 (51%), Gaps = 4/54 (7%)

Query: 114 EEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEK 167
           ++ ++EEEE    ++D+  E+D  + +    ++DE   D E +     K+  EK
Sbjct: 223 DDIDDEEEERVLADEDDDDEDDMFDMD----DDDEEESDPEVERTSTIKEVSEK 272



 Score = 26.9 bits (60), Expect = 7.8
 Identities = 7/36 (19%), Positives = 19/36 (52%)

Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKE 140
            +E  R   +E++++E++  + + D+  E D   + 
Sbjct: 227 DEEEERVLADEDDDDEDDMFDMDDDDEEESDPEVER 262


>gnl|CDD|178354 PLN02753, PLN02753, triacylglycerol lipase.
          Length = 531

 Score = 27.4 bits (60), Expect = 5.4
 Identities = 13/38 (34%), Positives = 19/38 (50%)

Query: 121 EEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEE 158
           E E  + Q   ++E E   E  L  ED R  D++ +EE
Sbjct: 63  ERERRERQGLLIDEAEGAGELWLTAEDIRRRDKKTEEE 100


>gnl|CDD|224400 COG1483, COG1483, Predicted ATPase (AAA+ superfamily) [General
           function prediction only].
          Length = 774

 Score = 27.4 bits (61), Expect = 5.5
 Identities = 25/100 (25%), Positives = 35/100 (35%), Gaps = 4/100 (4%)

Query: 88  RLKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEED 147
           RL + +L      +    +    E E EE     EEE        E  +  KE+      
Sbjct: 583 RLLELLLGFIASGRRTYSNTVLVEYEAEEILSILEEETARVLACDEVMKTIKEK----YK 638

Query: 148 ERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKT 187
           + GED E+  E E ++  EK VED     K          
Sbjct: 639 KYGEDVEEISENEVRKIGEKAVEDLEEQFKDSFGRVAYPV 678


>gnl|CDD|184536 PRK14145, PRK14145, heat shock protein GrpE; Provisional.
          Length = 196

 Score = 26.8 bits (59), Expect = 5.5
 Identities = 25/84 (29%), Positives = 43/84 (51%), Gaps = 1/84 (1%)

Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
           +E  +E  +EEE++       EQ E   EDE  ++ + P++    E EE K++ + K+ E
Sbjct: 2   EEVEKEINKEEEKDVNNLSSNEQMEGPPEDEQAQQNQ-PQQQTVDEIEELKQKLQQKEVE 60

Query: 166 EKRVEDKIGDEKRLEENKKKKTQK 189
            +   D     K   EN +K+T+K
Sbjct: 61  AQEYLDIAQRLKAEFENYRKRTEK 84


>gnl|CDD|203444 pfam06424, PRP1_N, PRP1 splicing factor, N-terminal.  This domain
           is specific to the N-terminal part of the prp1 splicing
           factor, which is involved in mRNA splicing (and possibly
           also poly(A)+ RNA nuclear export and cell cycle
           progression). This domain is specific to the N terminus
           of the RNA splicing factor encoded by prp1. It is
           involved in mRNA splicing and possibly also poly(A)and
           RNA nuclear export and cell cycle progression.
          Length = 131

 Score = 26.5 bits (59), Expect = 5.6
 Identities = 19/75 (25%), Positives = 39/75 (52%), Gaps = 12/75 (16%)

Query: 104 ARDETAREEEEEEEEEEEEEEEQEQ----DEYLEEDE------NEKERRLPEEDERGEDE 153
           ARD    ++EE+E+ +  ++ + E      +Y +EDE         + R+  ++ R +  
Sbjct: 25  ARDGVDIDDEEDEDPKRYQDGDNEGLFSDGKYDDEDEEADRIYESIDERM--DERRKKRR 82

Query: 154 EKKEEEEDKQGEEKR 168
           E+KE+EE ++  E+ 
Sbjct: 83  EQKEKEEIEKYREEN 97


>gnl|CDD|215079 PLN00151, PLN00151, potassium transporter; Provisional.
          Length = 852

 Score = 27.4 bits (61), Expect = 5.6
 Identities = 7/32 (21%), Positives = 17/32 (53%)

Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERR 142
           + E+    ++++++       LEE+E+   RR
Sbjct: 36  DSEDPASLDDDDDDRDSGGGMLEEEEDGNMRR 67



 Score = 26.6 bits (59), Expect = 9.5
 Identities = 11/70 (15%), Positives = 24/70 (34%), Gaps = 10/70 (14%)

Query: 110 REEEEEEEEEEEEEE-------EQEQDEYLEEDENEKERRLPEEDERGEDE---EKKEEE 159
            EEE                  +  +  +++  E++ E     +D+  + +      EEE
Sbjct: 1   MEEESSGGIGSSSNGSGGLVGTDSGESRWVDGSEDDSEDPASLDDDDDDRDSGGGMLEEE 60

Query: 160 EDKQGEEKRV 169
           ED     + +
Sbjct: 61  EDGNMRRRLI 70


>gnl|CDD|240321 PTZ00230, PTZ00230, variable surface protein Vir7; Provisional.
          Length = 364

 Score = 27.1 bits (60), Expect = 5.7
 Identities = 12/48 (25%), Positives = 20/48 (41%)

Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEE 154
           ET  + ++ EE +E E  +Q     +   E E+   +P  DE      
Sbjct: 211 ETDPQSKQLEEGDEGERVQQPTLTAIPGLEGEESLSVPPSDEXSPGST 258


>gnl|CDD|216095 pfam00748, Calpain_inhib, Calpain inhibitor.  This region is found
           multiple times in calpain inhibitor proteins.
          Length = 131

 Score = 26.7 bits (59), Expect = 5.8
 Identities = 21/46 (45%), Positives = 25/46 (54%), Gaps = 6/46 (13%)

Query: 144 PEEDERGED---EEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKK 186
           PEE +  ED   E+ KEE+ +K GE    ED I  E RL E K K 
Sbjct: 78  PEEKKPVEDKVKEKAKEEKLEKLGER---EDTIPPEYRLLEAKDKD 120


>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 142

 Score = 26.5 bits (59), Expect = 5.9
 Identities = 17/50 (34%), Positives = 30/50 (60%)

Query: 141 RRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
           RR   E     DE+ K+E ED++ ++KR E K  DE++  + + K+ +K+
Sbjct: 45  RRREYERLELMDEKWKKETEDEEFQQKREEKKRKDEEKTAKKRAKRQKKK 94


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 27.5 bits (62), Expect = 5.9
 Identities = 25/97 (25%), Positives = 43/97 (44%), Gaps = 12/97 (12%)

Query: 97  NQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKK 156
           N+L  +    E   E++ EE E   +E E+ ++E  E+ E  +E          ED+  +
Sbjct: 519 NELIASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEE---------EDKLLE 569

Query: 157 EEEEDKQ---GEEKRVEDKIGDEKRLEENKKKKTQKR 190
           E E++ Q    E K+  D+I  E R  +     + K 
Sbjct: 570 EAEKEAQQAIKEAKKEADEIIKELRQLQKGGYASVKA 606


>gnl|CDD|145910 pfam03012, PP_M1, Phosphoprotein.  This family includes the M1
           phosphoprotein non-structural RNA polymerase alpha
           subunit, which is thought to be a component of the
           active polymerase, and may be involved in template
           binding.
          Length = 298

 Score = 27.0 bits (60), Expect = 6.0
 Identities = 10/25 (40%), Positives = 16/25 (64%)

Query: 113 EEEEEEEEEEEEEQEQDEYLEEDEN 137
            +  EEE+E+E   E+D Y+ E E+
Sbjct: 64  SQFTEEEDEDEGSSEEDFYMTEGED 88


>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
          Length = 508

 Score = 27.4 bits (61), Expect = 6.1
 Identities = 20/92 (21%), Positives = 41/92 (44%), Gaps = 5/92 (5%)

Query: 98  QLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKE 157
           +  +     E   +E ++E EE+ E    E  E L  + N++ +       R E EE + 
Sbjct: 27  KRQRLAKEAERILQEAQKEAEEKLEAALLEAKELLLRERNQQRQEA-----RREREELQR 81

Query: 158 EEEDKQGEEKRVEDKIGDEKRLEENKKKKTQK 189
           EEE    +E++++ +      LE   +++ + 
Sbjct: 82  EEERLVQKEEQLDARAEKLDNLENQLEEREKA 113


>gnl|CDD|214923 smart00937, PCRF, This domain is found in peptide chain release
           factors. 
          Length = 116

 Score = 26.3 bits (59), Expect = 6.1
 Identities = 15/37 (40%), Positives = 21/37 (56%)

Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRL 143
           E   EEE++E  E  EEE +E +  LEE+  E  + L
Sbjct: 1   ELLEEEEDDEMRELAEEELEELEAELEEELEELLKLL 37


>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
          Length = 646

 Score = 27.2 bits (61), Expect = 6.1
 Identities = 29/106 (27%), Positives = 44/106 (41%), Gaps = 11/106 (10%)

Query: 89  LKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDE 148
           LKK      +L + G   E  RE +EE EE E+E E   ++  L+E   +    L    E
Sbjct: 197 LKKLGFERLELEEEGTPSELIREIKEELEEIEKERESLLEE--LKELAKKYLEELLALYE 254

Query: 149 RGEDEEKKEEEEDKQGEEKR-------VEDKIGDEKRLEENKKKKT 187
             E E ++ E   K  +  +       V +     K+L+E   K T
Sbjct: 255 YLEIELERAEALSKFLKTDKTFAIEGWVPED--RVKKLKELIDKAT 298


>gnl|CDD|234070 TIGR02956, TMAO_torS, TMAO reductase sytem sensor TorS.  This
           protein, TorS, is part of a regulatory system for the
           torCAD operon that encodes the pterin molybdenum
           cofactor-containing enzyme trimethylamine-N-oxide (TMAO)
           reductase (TorA), a cognate chaperone (TorD), and a
           penta-haem cytochrome (TorC). TorS works together with
           the inducer-binding protein TorT and the response
           regulator TorR. TorS contains histidine kinase ATPase
           (pfam02518), HAMP (pfam00672), phosphoacceptor
           (pfam00512), and phosphotransfer (pfam01627) domains and
           a response regulator receiver domain (pfam00072) [Signal
           transduction, Two-component systems].
          Length = 968

 Score = 27.4 bits (61), Expect = 6.1
 Identities = 18/63 (28%), Positives = 30/63 (47%)

Query: 104 ARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ 163
            RD  A   + + +E +  +E QE  E LE+   ++ + L E +ER   E K   +   +
Sbjct: 397 FRDTAAHNLKLQADERQVAQELQEHKESLEQLVAQRTQELAETNERLNAEVKNHAKARAE 456

Query: 164 GEE 166
            EE
Sbjct: 457 AEE 459


>gnl|CDD|240226 PTZ00007, PTZ00007, (NAP-L) nucleosome assembly protein -L;
           Provisional.
          Length = 337

 Score = 27.1 bits (60), Expect = 6.1
 Identities = 9/62 (14%), Positives = 24/62 (38%), Gaps = 3/62 (4%)

Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDK 162
           G   +   +   +E++++ +  +       + + +       EED+RGE E    +  + 
Sbjct: 277 GEAIDEDSDYSSDEDDDDYDSYDSSDSASSDSNSDVDTN---EEDDRGEKESNGAKSNEL 333

Query: 163 QG 164
             
Sbjct: 334 HL 335


>gnl|CDD|240419 PTZ00440, PTZ00440, reticulocyte binding protein 2-like protein;
            Provisional.
          Length = 2722

 Score = 27.5 bits (61), Expect = 6.3
 Identities = 19/74 (25%), Positives = 34/74 (45%), Gaps = 3/74 (4%)

Query: 87   LRLKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKER---RL 143
            L  +K  L E    +   ++    E E +E+  +EE+  ++Q E L+++E +K     RL
Sbjct: 2561 LENEKLKLLEKAKIEESRKERERIESETQEDNTDEEQINRQQQERLQKEEEQKAYSQERL 2620

Query: 144  PEEDERGEDEEKKE 157
              E    +D  K  
Sbjct: 2621 NREVSGTDDTNKNH 2634


>gnl|CDD|218967 pfam06273, eIF-4B, Plant specific eukaryotic initiation factor 4B. 
           This family consists of several plant specific
           eukaryotic initiation factor 4B proteins.
          Length = 496

 Score = 27.4 bits (60), Expect = 6.3
 Identities = 17/71 (23%), Positives = 34/71 (47%), Gaps = 3/71 (4%)

Query: 110 REEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRV 169
           R+ + E E    +  E E+++ L+E+ N     L +E+    + +   +E+  +G  + +
Sbjct: 350 RKIDLELEHRRVDRPETEEEKNLKEEINLLRVDLKKEEAIAPESKGSGQEQSHKGLSELI 409

Query: 170 EDKIGDEKRLE 180
             K   EK LE
Sbjct: 410 RQK---EKELE 417


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 27.3 bits (61), Expect = 6.5
 Identities = 21/78 (26%), Positives = 32/78 (41%)

Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKR 168
            R++    E E E+ EE+      E  E E E    EE     +EE  E E + +  E +
Sbjct: 731 LRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQ 790

Query: 169 VEDKIGDEKRLEENKKKK 186
           +E    + K L E   + 
Sbjct: 791 IEQLKEELKALREALDEL 808


>gnl|CDD|224188 COG1269, NtpI, Archaeal/vacuolar-type H+-ATPase subunit I [Energy
           production and conversion].
          Length = 660

 Score = 27.3 bits (61), Expect = 6.7
 Identities = 16/76 (21%), Positives = 32/76 (42%), Gaps = 3/76 (3%)

Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQG 164
            +E  + E E +  EE  +  ++    +EE   + E RL E DE     E  E+  ++  
Sbjct: 84  PEEVEKLEAELKSLEEVIKPAEKFSSEVEELTRKLEERLSELDEE---LEDLEDLLEELE 140

Query: 165 EEKRVEDKIGDEKRLE 180
               ++  +   + L+
Sbjct: 141 PLAYLDFDLSLLRGLK 156


>gnl|CDD|223403 COG0326, HtpG, Molecular chaperone, HSP90 family [Posttranslational
           modification, protein turnover, chaperones].
          Length = 623

 Score = 27.2 bits (61), Expect = 6.8
 Identities = 14/42 (33%), Positives = 23/42 (54%)

Query: 96  ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDEN 137
           E + FK+  + +   E  EEE+E + EEE++E    LE  + 
Sbjct: 472 EGKPFKSITKGDLDLELLEEEDEADSEEEKKEFKPLLERVKE 513


>gnl|CDD|215349 PLN02647, PLN02647, acyl-CoA thioesterase.
          Length = 437

 Score = 27.1 bits (60), Expect = 7.1
 Identities = 16/46 (34%), Positives = 22/46 (47%), Gaps = 6/46 (13%)

Query: 136 ENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEE 181
           E E+E+ L EE E      KK+  E K+  E       G+ +RLE 
Sbjct: 225 ETEEEKLLFEEAEARNKLRKKKRGEQKREFEN------GEAERLEA 264


>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19. 
           Med19 represents a family of conserved proteins which
           are members of the multi-protein co-activator Mediator
           complex. Mediator is required for activation of RNA
           polymerase II transcription by DNA binding
           transactivators.
          Length = 178

 Score = 26.7 bits (59), Expect = 7.2
 Identities = 11/44 (25%), Positives = 23/44 (52%)

Query: 147 DERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
            E    + +  +  +K+ ++K+ ED    +K+ +E KKKK +  
Sbjct: 127 PEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKKRHS 170


>gnl|CDD|223616 COG0542, clpA, ATP-binding subunits of Clp protease and DnaK/DnaJ
           chaperones [Posttranslational modification, protein
           turnover, chaperones].
          Length = 786

 Score = 27.2 bits (61), Expect = 7.3
 Identities = 22/93 (23%), Positives = 42/93 (45%), Gaps = 8/93 (8%)

Query: 102 AGARDETAREEEEEEEEEEEEEEEQE---QDEYLEEDENEKERRLPEEDERGEDEEKKEE 158
           AGAR     ++ EE +E E E  + E   +    E+DE EK+        +     + E+
Sbjct: 389 AGARVRLEIDKPEELDELERELAQLEIEKEALEREQDEKEKKLIDEIIKLKEGRIPELEK 448

Query: 159 EEDKQGEEKRVEDKIGD-----EKRLEENKKKK 186
           E + + +E  + + +         +L E++K+K
Sbjct: 449 ELEAEVDEDDIAEVVARWTGIPVAKLLEDEKEK 481


>gnl|CDD|216096 pfam00749, tRNA-synt_1c, tRNA synthetases class I (E and Q),
           catalytic domain.  Other tRNA synthetase sub-families
           are too dissimilar to be included. This family includes
           only glutamyl and glutaminyl tRNA synthetases. In some
           organisms, a single glutamyl-tRNA synthetase
           aminoacylates both tRNA(Glu) and tRNA(Gln).
          Length = 314

 Score = 26.9 bits (60), Expect = 7.3
 Identities = 20/53 (37%), Positives = 24/53 (45%), Gaps = 6/53 (11%)

Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEED-ERGEDEEKK 156
                 EE EEE EE+E     E+  Y EE       RL EE+  +GE E  K
Sbjct: 96  VCFCTPEELEEEREEQEALGSPERPRYDEEC-----LRLFEEEMRKGEAEGGK 143


>gnl|CDD|226246 COG3723, RecT, Recombinational DNA repair protein (RecE pathway)
           [DNA replication, recombination, and repair].
          Length = 276

 Score = 26.7 bits (59), Expect = 7.4
 Identities = 14/55 (25%), Positives = 25/55 (45%)

Query: 118 EEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDK 172
            E+E E  +  D   EED  + +     E E  E +E+ E+  D   E++ + + 
Sbjct: 222 TEDEAERREPLDVTPEEDSEQLDSEPSGEPEAQETDEENEKPIDVSEEKEPITES 276


>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
           Transcription initiation factor IIA (TFIIA) is a
           heterotrimer, the three subunits being known as alpha,
           beta, and gamma, in order of molecular weight. The N and
           C-terminal domains of the gamma subunit are represented
           in pfam02268 and pfam02751, respectively. This family
           represents the precursor that yields both the alpha and
           beta subunits. The TFIIA heterotrimer is an essential
           general transcription initiation factor for the
           expression of genes transcribed by RNA polymerase II.
           Together with TFIID, TFIIA binds to the promoter region;
           this is the first step in the formation of a
           pre-initiation complex (PIC). Binding of the rest of the
           transcription machinery follows this step. After
           initiation, the PIC does not completely dissociate from
           the promoter. Some components, including TFIIA, remain
           attached and re-initiate a subsequent round of
           transcription.
          Length = 332

 Score = 27.0 bits (60), Expect = 7.4
 Identities = 7/41 (17%), Positives = 17/41 (41%), Gaps = 7/41 (17%)

Query: 121 EEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEED 161
           ++E +   D+  E+           + +  +D+   E+ ED
Sbjct: 250 DDEGDGSDDDDDEDAIES-------DLDDSDDDVSDEDGED 283



 Score = 26.6 bits (59), Expect = 8.6
 Identities = 7/59 (11%), Positives = 25/59 (42%)

Query: 96  ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEE 154
           + Q   +  R     +  + ++E +  +++ ++D    + ++  +    E+ E   D +
Sbjct: 230 KKQAKSSKRRTIAQIDGIDSDDEGDGSDDDDDEDAIESDLDDSDDDVSDEDGEDLFDTD 288


>gnl|CDD|113857 pfam05102, Holin_BlyA, holin, BlyA family.  BlyA, a small holin
          found in Borrelia circular plasmids that is encoded by
          a prophage. BlyA contains two largely hydrophobic
          helices and a highly charged C-terminus and is membrane
          associated.
          Length = 61

 Score = 25.1 bits (55), Expect = 7.6
 Identities = 14/55 (25%), Positives = 28/55 (50%), Gaps = 3/55 (5%)

Query: 39 ISWEKRLQNVTNYFLMSLAITDLMVAVLVMPLGILTLVRVRTNLSGAVLRLKKNI 93
          IS    L  + N  L+ L I    +++ ++ LG++    ++  LS  + ++KKN 
Sbjct: 4  ISINDFLLQLINIKLIKLMI---FISIFILSLGLIFKPVIKDMLSILISKIKKND 55


>gnl|CDD|227061 COG4717, COG4717, Uncharacterized conserved protein [Function
           unknown].
          Length = 984

 Score = 27.1 bits (60), Expect = 7.7
 Identities = 16/62 (25%), Positives = 26/62 (41%)

Query: 100 FKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEE 159
           F+  AR+E    + E   E  E + E    E  E   +  +R L EE+    +E     +
Sbjct: 728 FREAAREEQQLTQRESRLESLEAQLEGVAAEAYELSASLDQRELKEEELALLEEAIDALD 787

Query: 160 ED 161
           E+
Sbjct: 788 EE 789


>gnl|CDD|184416 PRK13955, mscL, large-conductance mechanosensitive channel;
           Provisional.
          Length = 130

 Score = 25.9 bits (57), Expect = 7.7
 Identities = 10/22 (45%), Positives = 16/22 (72%)

Query: 113 EEEEEEEEEEEEEQEQDEYLEE 134
           ++EEE+EEE  E  +++E L E
Sbjct: 96  KKEEEKEEEIPEPTKEEELLGE 117


>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
           (TAF4) is one of several TAFs that bind TBP and is
           involved in forming Transcription Factor IID (TFIID)
           complex.  The TATA Binding Protein (TBP) Associated
           Factor 4 (TAF4) is one of several TAFs that bind TBP and
           are involved in forming the Transcription Factor IID
           (TFIID) complex. TFIID is one of seven General
           Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
           TFIIF, and TFIID) that are involved in accurate
           initiation of transcription by RNA polymerase II in
           eukaryote. TFIID plays an important role in the
           recognition of promoter DNA and assembly of the
           pre-initiation complex. TFIID complex is composed of the
           TBP and at least 13 TAFs. TAFs from various species were
           originally named by their predicted molecular weight or
           their electrophoretic mobility in polyacrylamide gels. A
           new, unified nomenclature for the pol II TAFs has been
           suggested to show the relationship between TAF orthologs
           and paralogs. Several hypotheses are proposed for TAFs
           functions such as serving as activator-binding sites,
           core-promoter recognition or a role in essential
           catalytic activity. Each TAF, with the help of a
           specific activator, is required only for the expression
           of subset of genes and is not universally involved for
           transcription as are GTFs. In yeast and human cells,
           TAFs have been found as components of other complexes
           besides TFIID.   Several TAFs interact via histone-fold
           (HFD) motifs; HFD is the interaction motif involved in
           heterodimerization of the core histones and their
           assembly into nucleosome octamers. The minimal HFD
           contains three alpha-helices linked by two loops and is
           found in core histones, TAFS and many other
           transcription factors. TFIID has a histone octamer-like
           substructure. TAF4 domain interacts with TAF12 and makes
           a novel histone-like heterodimer that binds DNA and has
           a core promoter function of a subset of genes.
          Length = 212

 Score = 26.5 bits (59), Expect = 7.8
 Identities = 15/51 (29%), Positives = 30/51 (58%), Gaps = 3/51 (5%)

Query: 113 EEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ 163
           E+ E EEEE+ +E+E++  L   ++  E+   ++      E +KEE+E+ +
Sbjct: 120 EQLEREEEEKRDEEERERLLRAAKSRSEQSRLKQ---KAKEMQKEEDEEMR 167


>gnl|CDD|223052 PHA03346, PHA03346, US22 family homolog; Provisional.
          Length = 520

 Score = 27.0 bits (60), Expect = 7.9
 Identities = 16/77 (20%), Positives = 30/77 (38%), Gaps = 3/77 (3%)

Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQG 164
            +   R       E E+E +         E         P++D+ G+DE+  + E+    
Sbjct: 387 GEFAERRRPRGPPENEDEADGGGGLAPPPEPPRPPPPPFPDDDDEGDDEDDDDWEDLGFD 446

Query: 165 EEKRVEDKIGDEKRLEE 181
            +   ED + D K ++E
Sbjct: 447 LD---EDDVYDLKDVDE 460


>gnl|CDD|236669 PRK10263, PRK10263, DNA translocase FtsK; Provisional.
          Length = 1355

 Score = 27.0 bits (59), Expect = 8.0
 Identities = 18/90 (20%), Positives = 38/90 (42%), Gaps = 5/90 (5%)

Query: 76  VRVRTNLSGAVLRLKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEED 135
           +RV T    A   +K   L   +  +  AR+    + +  ++  ++E +  +QDE   + 
Sbjct: 612 IRVPTRRELASYGIK---LPSQRAAEEKAREAQRNQYDSGDQYNDDEIDAMQQDELARQF 668

Query: 136 ENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
              +++R  E+ +   D     E+ D   E
Sbjct: 669 AQTQQQRYGEQYQ--HDVPVNAEDADAAAE 696


>gnl|CDD|235548 PRK05657, PRK05657, RNA polymerase sigma factor RpoS; Validated.
          Length = 325

 Score = 26.8 bits (60), Expect = 8.3
 Identities = 7/47 (14%), Positives = 20/47 (42%)

Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDE 153
               +  E+ + +E   E  ++   +EE+ ++ +    E   +G  +
Sbjct: 3   LKVHDLNEDADFDENGVEVFDEKALVEEEPSDNDLAEEELLSQGATQ 49


>gnl|CDD|235229 PRK04156, gltX, glutamyl-tRNA synthetase; Provisional.
          Length = 567

 Score = 26.7 bits (60), Expect = 8.6
 Identities = 12/32 (37%), Positives = 17/32 (53%), Gaps = 1/32 (3%)

Query: 114 EEEEEEEEEEEEQEQDEYLEEDENEKERRLPE 145
            EE+ E  EE   E  E  EE + E+++ LP 
Sbjct: 63  LEEQRERLEELAPELLEEEEEKK-EEKKGLPP 93


>gnl|CDD|227268 COG4932, COG4932, Predicted outer membrane protein [Cell envelope
            biogenesis, outer membrane].
          Length = 1531

 Score = 26.7 bits (59), Expect = 8.7
 Identities = 12/63 (19%), Positives = 20/63 (31%)

Query: 124  EEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENK 183
            E ++        E        E   + ++    +E +     E     K   EKRL+ N 
Sbjct: 1435 EFEDSIGNQLNAEEHTGNVGEEYVFKAKNPGHYKEGDQPITFEPTEPPKPDPEKRLDSNN 1494

Query: 184  KKK 186
            K  
Sbjct: 1495 KID 1497


>gnl|CDD|218328 pfam04921, XAP5, XAP5, circadian clock regulator.  This protein is
           found in a wide range of eukaryotes. It is a nuclear
           protein and is suggested to be DNA binding. In plants,
           this family is essential for correct circadian clock
           functioning by acting as a light-quality regulator
           coordinating the activities of blue and red light
           signalling pathways during plant growth - inhibiting
           growth in red light but promoting growth in blue light.
          Length = 233

 Score = 26.5 bits (59), Expect = 8.8
 Identities = 23/72 (31%), Positives = 33/72 (45%), Gaps = 15/72 (20%)

Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKER-----------RLP----EEDERGEDEEK 155
           EEE+E+E E+E++  +E  E  E + N  ++            LP    EE E    EE 
Sbjct: 18  EEEDEDEGEDEKKVPKESSEPDEANVNPNKKKIGKNPSVDTSFLPDKAREEKEAELREEL 77

Query: 156 KEEEEDKQGEEK 167
           +EE   KQ   K
Sbjct: 78  REEFLKKQEAVK 89


>gnl|CDD|221203 pfam11748, DUF3306, Protein of unknown function (DUF3306).  This
           family of proteobacterial species proteins has no known
           function.
          Length = 115

 Score = 25.7 bits (57), Expect = 8.9
 Identities = 12/45 (26%), Positives = 15/45 (33%)

Query: 110 REEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEE 154
                EE  E  E  E+E          E+E     EDE   +E 
Sbjct: 11  LAVRAEEPAEPAETAEEEAAAAAPAPAPEEEEEAELEDEELLEEL 55


>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR).  This
           family consists of several bovine specific leukaemia
           virus receptors which are thought to function as
           transmembrane proteins, although their exact function is
           unknown.
          Length = 561

 Score = 26.6 bits (58), Expect = 9.0
 Identities = 13/48 (27%), Positives = 24/48 (50%)

Query: 123 EEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVE 170
           EEE+   + LE+D+ EK++R  E+  R        E ++     + V+
Sbjct: 85  EEERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTESDEDIAPAQMVD 132


>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3.  This family
           of proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). Shr3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of Shr3, AAPs are retained in the ER.
          Length = 196

 Score = 26.2 bits (58), Expect = 9.1
 Identities = 11/39 (28%), Positives = 19/39 (48%)

Query: 148 ERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKK 186
           ER + ++K+E   +++ E      K     +  E KKKK
Sbjct: 158 ERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196


>gnl|CDD|225427 COG2872, COG2872, Predicted metal-dependent hydrolases related to
           alanyl-tRNA synthetase HxxxH domain [General function
           prediction only].
          Length = 241

 Score = 26.6 bits (59), Expect = 9.4
 Identities = 12/57 (21%), Positives = 25/57 (43%)

Query: 113 EEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRV 169
           + E+ E+E EE E   +E ++E+       +P E+        K + +     E ++
Sbjct: 132 DGEDTEDEIEEVEALANELVKENLPVIIYFIPREEAEKLPGLVKLKNKVPPDVEGKI 188


>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family.  This
           family consists of the 116kDa V-type ATPase (vacuolar
           (H+)-ATPases) subunits, as well as V-type ATP synthase
           subunit i. The V-type ATPases family are proton pumps
           that acidify intracellular compartments in eukaryotic
           cells for example yeast central vacuoles,
           clathrin-coated and synaptic vesicles. They have
           important roles in membrane trafficking processes. The
           116kDa subunit (subunit a) in the V-type ATPase is part
           of the V0 functional domain responsible for proton
           transport. The a subunit is a transmembrane glycoprotein
           with multiple putative transmembrane helices it has a
           hydrophilic amino terminal and a hydrophobic carboxy
           terminal. It has roles in proton transport and assembly
           of the V-type ATPase complex. This subunit is encoded by
           two homologous gene in yeast VPH1 and STV1.
          Length = 707

 Score = 26.6 bits (59), Expect = 9.4
 Identities = 18/48 (37%), Positives = 25/48 (52%)

Query: 112 EEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEE 159
           EEE  + E E +E +E  E LE++ NE E  L   DE     ++  EE
Sbjct: 73  EEEILDLEAEIKEVEENLESLEKEINELEEWLNVLDEEKSFLDENLEE 120


>gnl|CDD|176518 cd08576, GDPD_like_SMaseD_PLD, Glycerophosphodiester
           phosphodiesterase-like domain of spider venom
           sphingomyelinases D, bacterial phospholipase D, and
           similar proteins.  This subfamily corresponds to the
           glycerophosphodiester phosphodiesterase-like domain
           (GDPD-like) present in sphingomyelinases D (SMases D)
           (sphingomyelin phosphodiesterase D, EC 3.1.4.4) from
           spider venom, the Corynebacterium pseudotuberculosis
           Phospholipase D (PLD)-like protein from pathogenic
           bacteria, and the Ajellomyces capsulatus H143 PLD-like
           protein from ascomycetes. Spider SMases D and bacterial
           PLD proteins catalyze the Mg2+-dependent hydrolysis of
           sphingomyelin producing choline and ceramide 1-phosphate
           (C1P), which possess a number of biological functions,
           such as regulating cell proliferation and apoptosis,
           participating in inflammatory responses, and playing a
           key role in phagocytosis. In the presence of Mg2+,
           SMases D can function as lysophospholipase D and
           hydrolyze lysophosphatidylcholine (LPC) to choline and
           lysophosphatidic acid (LPA), which is a multifunctional
           phospholipid involved in platelet aggregation,
           endothelial hyperpermeability, and pro-inflammatory
           responses. Loxosceles spider venoms' SMases D are the
           principal toxins responsible for dermonecrosis and
           complement dependent haemolysis induced by spider venom.
           Due to amino acid substitutions at the entrance to the
           active-site pocket, some members lack activity. The
           typical GDPD domain consists of a TIM barrel and a small
           insertion domain named as the GDPD-insertion (GDPD-I)
           domain, which is specific for GDPD proteins.  Although
           proteins in this family contain a non-typical GDPD
           domain which lacks the GDPD-I, their catalytic
           mechanisms are based on Mg2+-dependent acid-base
           reactions similar to GDPD proteins. They might be
           divergent members of the GDPD family. Moreover, this
           family does not belong to phospholipase D (PLD)
           superfamily, since it lacks the conserved HKD sequence
           motif that characterizes the catalytic center of the PLD
           superfamily. It belongs to the superfamily of PLC-like
           phosphodiesterases.
          Length = 265

 Score = 26.5 bits (59), Expect = 9.6
 Identities = 11/31 (35%), Positives = 11/31 (35%)

Query: 73  LTLVRVRTNLSGAVLRLKKNILCENQLFKAG 103
            T    R NL    L LK   LC      AG
Sbjct: 67  GTTPGFRENLIFVWLDLKNPDLCGECSINAG 97


>gnl|CDD|182633 PRK10669, PRK10669, putative cation:proton antiport protein;
           Provisional.
          Length = 558

 Score = 26.6 bits (59), Expect = 9.7
 Identities = 20/62 (32%), Positives = 32/62 (51%)

Query: 62  MVAVLVMPLGILTLVRVRTNLSGAVLRLKKNILCENQLFKAGARDETAREEEEEEEEEEE 121
           ++A L M L +L        L+GA+L +  N +    L +  A+ ET  E+  EE  EEE
Sbjct: 350 ILAGLGMALNLLPQAGQNLVLAGAILSIMLNPVLFTLLERYLAKTETLEEQTLEEAIEEE 409

Query: 122 EE 123
           ++
Sbjct: 410 KQ 411


>gnl|CDD|224112 COG1191, FliA, DNA-directed RNA polymerase specialized sigma
           subunit [Transcription].
          Length = 247

 Score = 26.5 bits (59), Expect = 9.7
 Identities = 23/100 (23%), Positives = 42/100 (42%), Gaps = 6/100 (6%)

Query: 87  LRLKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEE------QEQDEYLEEDENEKE 140
           LR   ++     L + G R E A +E E+E   E  +EE       +++EY+E       
Sbjct: 93  LRKNDSVKVPRSLRELGRRIEEAIDELEQELGREPTDEEIAEELGIDKEEYIEALLAING 152

Query: 141 RRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLE 180
            +L   DE    ++  + ++  +  +  VE +   E   E
Sbjct: 153 SQLLSLDEDVLKDDDDDVDDQIENPDDGVEKEELLEILKE 192


>gnl|CDD|224124 COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense
           mechanisms].
          Length = 733

 Score = 26.7 bits (59), Expect = 9.9
 Identities = 14/59 (23%), Positives = 27/59 (45%)

Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRV 169
           +E+ ++   +  E  +      +EDE   +R+   + E G  EE  E   ++  E K+V
Sbjct: 385 KEKLKKALGKGREVVENAKFCPKEDEPGLKRKERVDVEDGPQEELIELISEEVKEGKKV 443


>gnl|CDD|220634 pfam10220, DUF2146, Uncharacterized conserved protein (DUF2146).
           This is a family of proteins conserved from plants to
           humans. In Dictyostelium it is annotated as Mss11p but
           this could not be confirmed. Mss11p is required for the
           activation of pseudo-hyphal and invasive growth by
           Ste12p in yeast.
          Length = 890

 Score = 26.7 bits (59), Expect = 9.9
 Identities = 19/87 (21%), Positives = 33/87 (37%), Gaps = 6/87 (6%)

Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERG----EDEEKKEEEED--K 162
           A++ E+ E+E  +EE +QEQ E LE           +  +         E   EE D   
Sbjct: 559 AKKMEQAEDELADEETDQEQPESLEPQLQGSSTSPSDASDLNFSTASSSEASSEESDNYA 618

Query: 163 QGEEKRVEDKIGDEKRLEENKKKKTQK 189
           +   +   D+    K   E + +   +
Sbjct: 619 RPTSRSGTDEEEASKTAREKRPQALAR 645


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.308    0.129    0.346 

Gapped
Lambda     K      H
   0.267   0.0812    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 10,075,149
Number of extensions: 1046012
Number of successful extensions: 19094
Number of sequences better than 10.0: 1
Number of HSP's gapped: 9893
Number of HSP's successfully gapped: 2518
Length of query: 190
Length of database: 10,937,602
Length adjustment: 91
Effective length of query: 99
Effective length of database: 6,901,388
Effective search space: 683237412
Effective search space used: 683237412
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 56 (25.2 bits)