RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy7729
         (167 letters)



>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 55.7 bits (135), Expect = 1e-09
 Identities = 32/66 (48%), Positives = 51/66 (77%)

Query: 86  EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
            +K  E+ E++ EEEK++ + +    +KKEEEEE++K+KKEEE EEEEEE + +K+EEEE
Sbjct: 412 IKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEE 471

Query: 146 EEEKEK 151
           +++K+ 
Sbjct: 472 KKKKQA 477



 Score = 51.8 bits (125), Expect = 3e-08
 Identities = 28/76 (36%), Positives = 57/76 (75%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
           E L  ++   ++ +K  E+ E++ E+EK E ++K    K++EEE++++K+++E E+EEEE
Sbjct: 400 EFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEE 459

Query: 136 QKNKKKEEEEEEEKEK 151
           ++ ++++EEEEE+K+K
Sbjct: 460 EEAEEEKEEEEEKKKK 475



 Score = 46.5 bits (111), Expect = 2e-06
 Identities = 26/70 (37%), Positives = 49/70 (70%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           EEE +     ++  ++ K+  E  EKKRE++++E++K+    +++ EEEEEE++ K++E+
Sbjct: 396 EEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEK 455

Query: 144 EEEEEKEKVE 153
           EEEEE+ + E
Sbjct: 456 EEEEEEAEEE 465



 Score = 46.5 bits (111), Expect = 2e-06
 Identities = 23/65 (35%), Positives = 50/65 (76%)

Query: 75  SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
            + +E  E + EE++KE++++    +K+++E+EE+K +K+EE+EE++++ +EE+ EEEE+
Sbjct: 413 KKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEK 472

Query: 135 EQKNK 139
           ++K  
Sbjct: 473 KKKQA 477



 Score = 44.9 bits (107), Expect = 5e-06
 Identities = 25/66 (37%), Positives = 48/66 (72%)

Query: 88  KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
            +K  ++ ++  EK + + EE+K+EKK++    +K ++EEE E+E++E++ +++EEE EE
Sbjct: 405 SKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEE 464

Query: 148 EKEKVE 153
           EKE+ E
Sbjct: 465 EKEEEE 470



 Score = 44.9 bits (107), Expect = 5e-06
 Identities = 28/79 (35%), Positives = 52/79 (65%)

Query: 75  SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
           +  LE  E E E     ++  ++ ++  +  +++++ EKKE++++    KK+EE EEEE+
Sbjct: 389 AAFLELTEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEK 448

Query: 135 EQKNKKKEEEEEEEKEKVE 153
           E+K ++KEEEEEE +E+ E
Sbjct: 449 EKKEEEKEEEEEEAEEEKE 467



 Score = 42.2 bits (100), Expect = 5e-05
 Identities = 19/66 (28%), Positives = 47/66 (71%)

Query: 88  KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
             K+  ++ ++  ++ ++  E+++++K+++    K K+EEE EE+E++++ K++EEEE E
Sbjct: 404 GSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAE 463

Query: 148 EKEKVE 153
           E+++ E
Sbjct: 464 EEKEEE 469



 Score = 40.3 bits (95), Expect = 2e-04
 Identities = 16/59 (27%), Positives = 42/59 (71%)

Query: 70  QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
           + I      +  E ++E+K+K    +++EEE+E++++++++ +++EEEE +++ ++EEE
Sbjct: 413 KKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEE 471



 Score = 39.5 bits (93), Expect = 3e-04
 Identities = 20/55 (36%), Positives = 42/55 (76%)

Query: 74  ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
           A +  E  + E+++K    +++EEEEE+EK++ EE+K E++EE EE++++++E++
Sbjct: 419 AEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKK 473



 Score = 35.3 bits (82), Expect = 0.011
 Identities = 22/64 (34%), Positives = 38/64 (59%), Gaps = 5/64 (7%)

Query: 91  EEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
           EEE E     K+  +  +K  EK E++ E++K +K+++    +     KK+EEEEEE+++
Sbjct: 396 EEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGK-----KKEEEEEEEKEK 450

Query: 151 KVEM 154
           K E 
Sbjct: 451 KEEE 454



 Score = 32.6 bits (75), Expect = 0.070
 Identities = 22/45 (48%), Positives = 32/45 (71%)

Query: 71  AISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKE 115
              A    +  E EEEEKEK+EEE+EEEEE+ ++E EE++ +KK+
Sbjct: 431 KKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 50.0 bits (119), Expect = 1e-07
 Identities = 27/77 (35%), Positives = 45/77 (58%), Gaps = 2/77 (2%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK--DKKEEEVEEEEEEQ 136
           E     + +  + ++E  E+E   +++ E K+ EK  +        D +EEE EEEEEE+
Sbjct: 816 EGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEE 875

Query: 137 KNKKKEEEEEEEKEKVE 153
           + +++EEEEEEE+E  E
Sbjct: 876 EEEEEEEEEEEEEENEE 892



 Score = 48.8 bits (116), Expect = 3e-07
 Identities = 28/88 (31%), Positives = 53/88 (60%), Gaps = 4/88 (4%)

Query: 70  QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEE----EKQKDKK 125
           +    +E  E +E E + + + ++ E ++E  E++ + E + E K++E+        D  
Sbjct: 802 EHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGG 861

Query: 126 EEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
           + E EEEEEE++ +++EEEEEEE+E+ E
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEE 889



 Score = 46.9 bits (111), Expect = 1e-06
 Identities = 31/80 (38%), Positives = 46/80 (57%), Gaps = 5/80 (6%)

Query: 74  ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
           A +T   +E  E+E   E + E +++EK  D          EEEEE++++++EEE EEEE
Sbjct: 823 ADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEE 882

Query: 134 EEQKNKKKEEEEEEEKEKVE 153
           EE+     EEEE EE   +E
Sbjct: 883 EEE-----EEEENEEPLSLE 897



 Score = 43.8 bits (103), Expect = 2e-05
 Identities = 25/84 (29%), Positives = 50/84 (59%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
           ET E     E + E +++E+  +     D  + ++ E++EEEEE++++++EEE EEEEE 
Sbjct: 831 ETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEEN 890

Query: 136 QKNKKKEEEEEEEKEKVEMYRNPV 159
           ++    E  E  +K+ + ++  P+
Sbjct: 891 EEPLSLEWPETRQKQAIYLFLLPI 914



 Score = 38.8 bits (90), Expect = 7e-04
 Identities = 23/73 (31%), Positives = 38/73 (52%)

Query: 82  EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
           E E E + K E E E E   E+  ++E + E + +E + + + + EEVE E E +    +
Sbjct: 668 EQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTE 727

Query: 142 EEEEEEEKEKVEM 154
           +E E E  E+ E 
Sbjct: 728 DEGEIETGEEGEE 740



 Score = 38.8 bits (90), Expect = 8e-04
 Identities = 25/79 (31%), Positives = 43/79 (54%)

Query: 75  SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
           +E     E E E + KE + + E E +E + + E + E  E+E E +  ++ EEVE+E E
Sbjct: 687 AERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGE 746

Query: 135 EQKNKKKEEEEEEEKEKVE 153
            +   K E E E ++++ E
Sbjct: 747 GEAEGKHEVETEGDRKETE 765



 Score = 38.8 bits (90), Expect = 8e-04
 Identities = 27/80 (33%), Positives = 43/80 (53%)

Query: 75  SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
           +E +E+    E E  ++E E E  EE E+ EDE +   + + E E + D+KE E E E E
Sbjct: 712 AEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETE 771

Query: 135 EQKNKKKEEEEEEEKEKVEM 154
            +  + ++E E +  E  EM
Sbjct: 772 AEGKEDEDEGEIQAGEDGEM 791



 Score = 38.4 bits (89), Expect = 9e-04
 Identities = 21/71 (29%), Positives = 43/71 (60%)

Query: 70  QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
           Q ++A    E  + E+        +  + EE+E++E+EE++ E++EEEEE+++++ EE +
Sbjct: 835 QELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEEPL 894

Query: 130 EEEEEEQKNKK 140
             E  E + K+
Sbjct: 895 SLEWPETRQKQ 905



 Score = 38.4 bits (89), Expect = 0.001
 Identities = 25/82 (30%), Positives = 38/82 (46%), Gaps = 6/82 (7%)

Query: 75  SETLENNEWEEEEKEKEEE------EEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
           S      E E E K + E       E + E+E E + + ++   K E E E+ + + E E
Sbjct: 663 SGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETE 722

Query: 129 VEEEEEEQKNKKKEEEEEEEKE 150
            E  E+E + +  EE EE E E
Sbjct: 723 AEGTEDEGEIETGEEGEEVEDE 744



 Score = 38.4 bits (89), Expect = 0.001
 Identities = 21/72 (29%), Positives = 38/72 (52%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
           EN E    E E+E E E + E + + E   +++ ++E E E +  + + + E E EE ++
Sbjct: 658 ENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEH 717

Query: 139 KKKEEEEEEEKE 150
           + + E E  E E
Sbjct: 718 EGETEAEGTEDE 729



 Score = 37.7 bits (87), Expect = 0.002
 Identities = 20/67 (29%), Positives = 34/67 (50%)

Query: 87  EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEE 146
           E+  EE E   E E E  E+   + E++ E E K +++ E E+  E + ++  + E E +
Sbjct: 643 ERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAK 702

Query: 147 EEKEKVE 153
           E   K E
Sbjct: 703 EADHKGE 709



 Score = 37.3 bits (86), Expect = 0.002
 Identities = 22/72 (30%), Positives = 37/72 (51%)

Query: 82  EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
           E E E KE + + E E EE E + + E +  + E E E  ++ +E E E E E +   + 
Sbjct: 696 EGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEV 755

Query: 142 EEEEEEEKEKVE 153
           E E + ++ + E
Sbjct: 756 ETEGDRKETEHE 767



 Score = 36.9 bits (85), Expect = 0.003
 Identities = 27/73 (36%), Positives = 36/73 (49%), Gaps = 3/73 (4%)

Query: 84  EEEEKEKEEEEEEEEE---EKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
           EE E+  E E E  EE   E E++ + E K E + E E   + K E+E E E E ++   
Sbjct: 647 EEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADH 706

Query: 141 KEEEEEEEKEKVE 153
           K E E EE E   
Sbjct: 707 KGETEAEEVEHEG 719



 Score = 35.7 bits (82), Expect = 0.007
 Identities = 28/80 (35%), Positives = 41/80 (51%), Gaps = 1/80 (1%)

Query: 72  ISASETLENNEWEEEEKEKEEEEEEEEEEKEKD-EDEEKKREKKEEEEEKQKDKKEEEVE 130
           I A E     E E EE E E E E E  E E + E  E+  E ++E E + + K E E E
Sbjct: 699 IEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETE 758

Query: 131 EEEEEQKNKKKEEEEEEEKE 150
            + +E +++ + E E +E E
Sbjct: 759 GDRKETEHEGETEAEGKEDE 778



 Score = 35.7 bits (82), Expect = 0.008
 Identities = 20/69 (28%), Positives = 32/69 (46%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
           E   E+  EE E   E E +  EE   E ++E E + K + E E E   E +  ++ E E
Sbjct: 639 EHTGERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGE 698

Query: 145 EEEEKEKVE 153
            E ++   +
Sbjct: 699 IEAKEADHK 707



 Score = 35.4 bits (81), Expect = 0.009
 Identities = 24/81 (29%), Positives = 38/81 (46%)

Query: 73  SASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
              E        + E E EE E E E E E  EDE +    +E EE + + + E E + E
Sbjct: 695 GEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHE 754

Query: 133 EEEQKNKKKEEEEEEEKEKVE 153
            E + ++K+ E E E + + +
Sbjct: 755 VETEGDRKETEHEGETEAEGK 775



 Score = 35.4 bits (81), Expect = 0.010
 Identities = 30/85 (35%), Positives = 45/85 (52%), Gaps = 3/85 (3%)

Query: 75  SETLENNEWEEE---EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEE 131
           +ET   NE E E   E++ E+E E E E KE D   E + E+ E E E + +  E+E E 
Sbjct: 673 TETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEI 732

Query: 132 EEEEQKNKKKEEEEEEEKEKVEMYR 156
           E  E+  + ++E E E + K E+  
Sbjct: 733 ETGEEGEEVEDEGEGEAEGKHEVET 757



 Score = 35.0 bits (80), Expect = 0.014
 Identities = 25/82 (30%), Positives = 37/82 (45%), Gaps = 1/82 (1%)

Query: 70  QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKE-EEEEKQKDKKEEE 128
           +A    E          E E E  EE   E +++ E E K   + E E   ++K ++E E
Sbjct: 637 EAEHTGERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGE 696

Query: 129 VEEEEEEQKNKKKEEEEEEEKE 150
            E E +E  +K + E EE E E
Sbjct: 697 GEIEAKEADHKGETEAEEVEHE 718



 Score = 34.2 bits (78), Expect = 0.030
 Identities = 22/70 (31%), Positives = 36/70 (51%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           E E +E E E E E E  E + + E   E +E E+E + + + +   E E ++K  + E 
Sbjct: 709 ETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEG 768

Query: 144 EEEEEKEKVE 153
           E E E ++ E
Sbjct: 769 ETEAEGKEDE 778



 Score = 33.8 bits (77), Expect = 0.035
 Identities = 22/66 (33%), Positives = 36/66 (54%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
           E  +E EE E+E E E E   + E + ++KE E E + + + +E E+E E Q  +  E +
Sbjct: 733 ETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDGEMK 792

Query: 145 EEEEKE 150
            +E  E
Sbjct: 793 GDEGAE 798



 Score = 33.0 bits (75), Expect = 0.067
 Identities = 27/82 (32%), Positives = 40/82 (48%), Gaps = 10/82 (12%)

Query: 82  EWEEEEKEKEEEEEEEEEEKEKDE---DEEKKREKKEEEEEKQKDKKEEEVEEEE----- 133
           E  EE +E E+E E E E K + E   D ++   + E E E ++D+ E E++  E     
Sbjct: 733 ETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDGEMK 792

Query: 134 --EEQKNKKKEEEEEEEKEKVE 153
             E  + K + E E E  EK E
Sbjct: 793 GDEGAEGKVEHEGETEAGEKDE 814



 Score = 31.9 bits (72), Expect = 0.14
 Identities = 27/109 (24%), Positives = 47/109 (43%), Gaps = 32/109 (29%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEE------------- 131
            E+ E + +E  E + + + E E  ++++ E + E Q D  E + E              
Sbjct: 786 GEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEA 845

Query: 132 -------------------EEEEQKNKKKEEEEEEEKEKVEMYRNPVPM 161
                              EEEE++ +++EEEEEEE+E+ E   N  P+
Sbjct: 846 KQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEEPL 894



 Score = 31.1 bits (70), Expect = 0.26
 Identities = 22/78 (28%), Positives = 37/78 (47%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
            T    E  EE   + E+E E E + E + + E   E+K E+E + + + +E   + E E
Sbjct: 652 PTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETE 711

Query: 136 QKNKKKEEEEEEEKEKVE 153
            +  + E E E E  + E
Sbjct: 712 AEEVEHEGETEAEGTEDE 729



 Score = 30.7 bits (69), Expect = 0.35
 Identities = 23/78 (29%), Positives = 36/78 (46%), Gaps = 3/78 (3%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEE---KEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
           E     E E  E+   E E+E E E   + + E E     K E+E E + + KE + + E
Sbjct: 650 ERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGE 709

Query: 133 EEEQKNKKKEEEEEEEKE 150
            E ++ + + E E E  E
Sbjct: 710 TEAEEVEHEGETEAEGTE 727



 Score = 29.6 bits (66), Expect = 0.85
 Identities = 19/88 (21%), Positives = 40/88 (45%), Gaps = 4/88 (4%)

Query: 70  QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
           QA    E   +   E + + + E E  E++E E   + +    + ++E  +Q+   E + 
Sbjct: 784 QAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQG 843

Query: 130 EEEEEEQKNKKKEE----EEEEEKEKVE 153
           E +++E+           + EEE+E+ E
Sbjct: 844 EAKQDEKGVDGGGGSDGGDSEEEEEEEE 871



 Score = 27.3 bits (60), Expect = 4.9
 Identities = 15/59 (25%), Positives = 26/59 (44%)

Query: 92  EEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
           + +  E E   +   EE +R  + E E  ++   E E E E E +   + E E   E++
Sbjct: 632 KGDVAEAEHTGERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERK 690



 Score = 26.5 bits (58), Expect = 8.1
 Identities = 17/64 (26%), Positives = 30/64 (46%), Gaps = 1/64 (1%)

Query: 90  KEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEK 149
           K +  E E   +   E+ E+  E + E  E+   + E+E  E E + +N+ + E   E K
Sbjct: 632 KGDVAEAEHTGERTGEEGERPTEAEGENGEESGGEAEQE-GETETKGENESEGEIPAERK 690

Query: 150 EKVE 153
            + E
Sbjct: 691 GEQE 694


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
           family consists of several hypothetical bacterial
           proteins of around 200 residues in length. The function
           of this family is unknown.
          Length = 214

 Score = 45.5 bits (108), Expect = 2e-06
 Identities = 23/79 (29%), Positives = 46/79 (58%)

Query: 70  QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
              +A E       ++E  E EE +EEE+E    ++ E+K   +KE+EE ++++++E+E 
Sbjct: 42  DQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEE 101

Query: 130 EEEEEEQKNKKKEEEEEEE 148
             +E E++ ++K E   E+
Sbjct: 102 SSDENEKETEEKTESNVEK 120



 Score = 42.4 bits (100), Expect = 2e-05
 Identities = 25/79 (31%), Positives = 42/79 (53%)

Query: 69  PQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
                A ++ +    E EE ++EE+E    E+KE   D EK+ E+ EEE E++ ++  +E
Sbjct: 46  ADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDE 105

Query: 129 VEEEEEEQKNKKKEEEEEE 147
            E+E EE+     E+E   
Sbjct: 106 NEKETEEKTESNVEKEITN 124



 Score = 41.6 bits (98), Expect = 5e-05
 Identities = 23/88 (26%), Positives = 51/88 (57%)

Query: 66  VILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKK 125
           V++   I A +    +   ++    E+E ++ ++++  + +E K+ EK+    E ++DK 
Sbjct: 23  VVVLIIIVAYQLFFPSSPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKG 82

Query: 126 EEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
           + E E+EE E++N++++EE  +E EK  
Sbjct: 83  DAEKEDEESEEENEEEDEESSDENEKET 110



 Score = 38.9 bits (91), Expect = 4e-04
 Identities = 25/80 (31%), Positives = 43/80 (53%)

Query: 70  QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
           Q    S+  E  E EE ++E++E    E++E + D ++E +  ++E EEE ++   E E 
Sbjct: 49  QEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEK 108

Query: 130 EEEEEEQKNKKKEEEEEEEK 149
           E EE+ + N +KE      K
Sbjct: 109 ETEEKTESNVEKEITNPSWK 128


>gnl|CDD|218734 pfam05758, Ycf1, Ycf1.  The chloroplast genomes of most higher
           plants contain two giant open reading frames designated
           ycf1 and ycf2. Although the function of Ycf1 is unknown,
           it is known to be an essential gene.
          Length = 832

 Score = 44.6 bits (106), Expect = 7e-06
 Identities = 19/68 (27%), Positives = 38/68 (55%), Gaps = 3/68 (4%)

Query: 87  EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKD---KKEEEVEEEEEEQKNKKKEE 143
           ++  E EE EEE + E +   E K  K+E+E   ++D     EE+ + ++ E  +K +  
Sbjct: 227 KETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLEIL 286

Query: 144 EEEEEKEK 151
           +E++++E 
Sbjct: 287 KEKKDEEL 294



 Score = 40.8 bits (96), Expect = 2e-04
 Identities = 22/80 (27%), Positives = 39/80 (48%), Gaps = 1/80 (1%)

Query: 88  KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
           K+ +E  E EE E+E D + E   E K  ++E Q+   EE+     EE+++  K E+ ++
Sbjct: 224 KKLKETSETEEREEETDVEIETTSETKGTKQE-QEGSTEEDPSLFSEEKEDPDKTEDLDK 282

Query: 148 EKEKVEMYRNPVPMYRNPVP 167
            +   E     +  +  P P
Sbjct: 283 LEILKEKKDEELFWFEKPWP 302



 Score = 40.4 bits (95), Expect = 2e-04
 Identities = 20/71 (28%), Positives = 35/71 (49%), Gaps = 4/71 (5%)

Query: 75  SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
            ET E  E EEE   + E   E +  K++ E   ++      EE++  DK     E+ ++
Sbjct: 227 KETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKT----EDLDK 282

Query: 135 EQKNKKKEEEE 145
            +  K+K++EE
Sbjct: 283 LEILKEKKDEE 293



 Score = 38.4 bits (90), Expect = 9e-04
 Identities = 14/66 (21%), Positives = 32/66 (48%)

Query: 74  ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
            SET E  E  + E E   E +  ++E+E   +E+     +E+E+  + +  ++    +E
Sbjct: 229 TSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLEILKE 288

Query: 134 EEQKNK 139
           ++ +  
Sbjct: 289 KKDEEL 294



 Score = 29.6 bits (67), Expect = 0.75
 Identities = 17/57 (29%), Positives = 32/57 (56%), Gaps = 4/57 (7%)

Query: 63  RLTVILPQAISASETLENN----EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKE 115
           R+   L  +   SE ++       W E++KE +  +  EE+ K++++ E+ KRE+KE
Sbjct: 622 RIEKTLFFSFDISELMKIKQIFRNWIEKKKEFKILDYTEEKTKKEEKKEKNKREEKE 678



 Score = 28.4 bits (64), Expect = 1.9
 Identities = 15/48 (31%), Positives = 27/48 (56%), Gaps = 1/48 (2%)

Query: 73  SASETLENNEWEEEEKEKEEE-EEEEEEEKEKDEDEEKKREKKEEEEE 119
           + SET    + +E   E++     EE+E+ +K ED +K    KE+++E
Sbjct: 245 TTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLEILKEKKDE 292



 Score = 27.3 bits (61), Expect = 5.4
 Identities = 13/33 (39%), Positives = 23/33 (69%), Gaps = 1/33 (3%)

Query: 122 KDKKEEEVEEEEEEQKNK-KKEEEEEEEKEKVE 153
           K K+ + ++  EE+ K + KKE+ + EEKE++E
Sbjct: 649 KKKEFKILDYTEEKTKKEEKKEKNKREEKERIE 681



 Score = 26.5 bits (59), Expect = 8.7
 Identities = 9/50 (18%), Positives = 26/50 (52%)

Query: 72  ISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQ 121
           I  +   +  + E+E   +E+     EE+++ D+ E+  + +  +E++ +
Sbjct: 243 IETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLEILKEKKDE 292



 Score = 26.5 bits (59), Expect = 9.5
 Identities = 12/31 (38%), Positives = 23/31 (74%)

Query: 87  EKEKEEEEEEEEEEKEKDEDEEKKREKKEEE 117
           EK+KE +  +  EEK K E++++K +++E+E
Sbjct: 648 EKKKEFKILDYTEEKTKKEEKKEKNKREEKE 678


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 44.4 bits (105), Expect = 9e-06
 Identities = 31/71 (43%), Positives = 48/71 (67%)

Query: 95  EEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEM 154
           E+E E++  DE+EE+++E+K+EEEEK  DK+EE  EEEE+E+K KK ++ +E   E   +
Sbjct: 28  EKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELL 87

Query: 155 YRNPVPMYRNP 165
            +      RNP
Sbjct: 88  NKTKPIWTRNP 98



 Score = 40.1 bits (94), Expect = 2e-04
 Identities = 25/58 (43%), Positives = 40/58 (68%)

Query: 82  EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
           E  +EE+E+E+EE++EEEEK  D++EE   E+++EE++K+  K +E   E E   K K
Sbjct: 34  EVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELLNKTK 91


>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
          Length = 746

 Score = 44.1 bits (105), Expect = 1e-05
 Identities = 20/70 (28%), Positives = 28/70 (40%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           EEEE  KEE + E EEE + + +E    E +EE + +       E   E           
Sbjct: 39  EEEEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAAAAPAEEAAEAAAAAEAAARP 98

Query: 144 EEEEEKEKVE 153
            E+E     E
Sbjct: 99  AEDEAARPAE 108



 Score = 41.8 bits (99), Expect = 6e-05
 Identities = 22/88 (25%), Positives = 35/88 (39%), Gaps = 3/88 (3%)

Query: 73  SASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
           S S T+E    EE  KE+ + E EEE + E +E    + E++ + E       EE  E  
Sbjct: 33  SHSSTVEE---EEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAAAAPAEEAAEAA 89

Query: 133 EEEQKNKKKEEEEEEEKEKVEMYRNPVP 160
              +   +  E+E     +    R    
Sbjct: 90  AAAEAAARPAEDEAARPAEAAARRPKAK 117



 Score = 36.0 bits (84), Expect = 0.006
 Identities = 14/63 (22%), Positives = 26/63 (41%)

Query: 105 EDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYRNPVPMYRN 164
           E+EE ++E+ + E E++   + EE    E E++ K +       +E  E         R 
Sbjct: 39  EEEEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAAAAPAEEAAEAAAAAEAAARP 98

Query: 165 PVP 167
              
Sbjct: 99  AED 101


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 44.2 bits (105), Expect = 1e-05
 Identities = 27/75 (36%), Positives = 49/75 (65%)

Query: 75  SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
            +  E+ E EE+  + E+EE+++++ +E++ED +   E+++EE+E   D+ +EE EEEE+
Sbjct: 317 GQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEK 376

Query: 135 EQKNKKKEEEEEEEK 149
           E+K KK  E    E 
Sbjct: 377 EKKKKKSAESTRSEL 391



 Score = 43.1 bits (102), Expect = 2e-05
 Identities = 25/72 (34%), Positives = 45/72 (62%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           EEEE   ++E+EE++++  ++E+E+     +EE+EE +    E++ EEEEEE++ KKK+ 
Sbjct: 324 EEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKS 383

Query: 144 EEEEEKEKVEMY 155
            E    E    +
Sbjct: 384 AESTRSELPFTF 395



 Score = 42.7 bits (101), Expect = 3e-05
 Identities = 23/80 (28%), Positives = 50/80 (62%), Gaps = 5/80 (6%)

Query: 75  SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
               ++N    + +E EEEEE+  +++++++D++   E++E+ +   +++ EE+ + ++E
Sbjct: 307 EPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDE 366

Query: 135 EQKNKKKEEEEEEEKEKVEM 154
           +      EEEEEEEKEK + 
Sbjct: 367 DD-----EEEEEEEKEKKKK 381



 Score = 34.2 bits (79), Expect = 0.026
 Identities = 19/70 (27%), Positives = 41/70 (58%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           EEE+ E+EE+ +E  ++ + + + +        + E+ ++++E+ V++E+EE  +   EE
Sbjct: 285 EEEDDEEEEDSKESADDLDDEFEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEE 344

Query: 144 EEEEEKEKVE 153
           EEE+     E
Sbjct: 345 EEEDVDLSDE 354



 Score = 33.8 bits (78), Expect = 0.028
 Identities = 19/70 (27%), Positives = 48/70 (68%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           +EEE++ +E  ++ ++E E D+D+     + EE+EE+++D  ++E EE++++   +++E+
Sbjct: 289 DEEEEDSKESADDLDDEFEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEED 348

Query: 144 EEEEEKEKVE 153
            +  ++E+ E
Sbjct: 349 VDLSDEEEDE 358



 Score = 33.4 bits (77), Expect = 0.040
 Identities = 20/74 (27%), Positives = 41/74 (55%), Gaps = 11/74 (14%)

Query: 88  KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE-----------Q 136
           K +EE  +EE E  +K E E  +R + EEE++++++  +E  ++ ++E            
Sbjct: 258 KTEEELAKEEAERLKKLEAERLRRMRGEEEDDEEEEDSKESADDLDDEFEPDDDDNFGLG 317

Query: 137 KNKKKEEEEEEEKE 150
           + ++ EEEEE+  +
Sbjct: 318 QGEEDEEEEEDGVD 331



 Score = 33.4 bits (77), Expect = 0.048
 Identities = 17/69 (24%), Positives = 43/69 (62%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
           ++E E ++++     + E+DE+EE+     E+EE+   D +EEE + +  +++  +++E+
Sbjct: 303 DDEFEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDED 362

Query: 145 EEEEKEKVE 153
            ++E ++ E
Sbjct: 363 SDDEDDEEE 371



 Score = 29.6 bits (67), Expect = 0.81
 Identities = 21/62 (33%), Positives = 38/62 (61%), Gaps = 2/62 (3%)

Query: 71  AISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVE 130
                E  +++  EEEE     +EEE+EE+++ D+++++  E++EEE+EK+K K  E   
Sbjct: 331 DDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDE--EEEEEEKEKKKKKSAESTR 388

Query: 131 EE 132
            E
Sbjct: 389 SE 390



 Score = 29.6 bits (67), Expect = 0.89
 Identities = 24/79 (30%), Positives = 42/79 (53%), Gaps = 9/79 (11%)

Query: 84  EEEEKEKEEEEE------EEEEEKEKDEDEEKKREKKEEEEEKQKDKK---EEEVEEEEE 134
           EE E+ K+ E E       EEE+ E++ED ++  +  ++E E   D      +  E+EEE
Sbjct: 266 EEAERLKKLEAERLRRMRGEEEDDEEEEDSKESADDLDDEFEPDDDDNFGLGQGEEDEEE 325

Query: 135 EQKNKKKEEEEEEEKEKVE 153
           E+     E+EE+++ +  E
Sbjct: 326 EEDGVDDEDEEDDDDDLEE 344



 Score = 28.8 bits (65), Expect = 1.7
 Identities = 17/83 (20%), Positives = 43/83 (51%), Gaps = 2/83 (2%)

Query: 70  QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE--E 127
           Q++S  +  ++   +++  + +  +   ++        E   E +EE+ E++K KKE  +
Sbjct: 121 QSLSEIDKDDDVRDDDDFDDDDLGDLASDDRAAHFGGGEDDEEDEEEQPERKKSKKEVMK 180

Query: 128 EVEEEEEEQKNKKKEEEEEEEKE 150
           EV  + +  K ++++ +EE+E  
Sbjct: 181 EVIAKSKFYKAERQKAKEEDEDL 203


>gnl|CDD|212055 cd11486, SLC5sbd_SGLT1, Na(+)/glucose cotransporter SGLT1;solute
           binding domain.  Human SGLT1 (hSGLT1) is a
           high-affinity/low-capacity glucose transporter, which
           can also transport galactose. In the transport
           mechanism, two Na+ ions first bind to the extracellular
           side of the transporter and induce a conformational
           change in the glucose binding site. This results in an
           increased affinity for glucose. A second conformational
           change in the transporter follows, bringing the Na+ and
           glucose binding sites to the inner surface of the
           membrane. Glucose is then released, followed by the Na+
           ions. In the process, hSGLT1 is also able to transport
           water and urea and may be a major pathway for transport
           of these across the intestinal brush-border membrane.
           hSGLT1 is encoded by the SLC5A1 gene and expressed
           mostly in the intestine, but also in the trachea,
           kidney, heart, brain, testis, and prostate. The
           WHO/UNICEF oral rehydration solution (ORS) for the
           treatment of secretory diarrhea contains salt and
           glucose. The glucose, along with sodium ions, is
           transported by hSGLT1 and water is either co-transported
           along with these or follows by osmosis. Mutations in
           SGLT1 are associated with intestinal glucose galactose
           malabsorption (GGM). Up-regulation of intestinal SGLT1
           may protect against enteric infections. SGLT1 is
           expressed in colorectal, head and neck, and prostate
           tumors. Epidermal growth factor receptor (EGFR)
           functions in cell survival by stabilizing SGLT1, and
           thereby maintaining intracellular glucose levels. SGLT1
           is predicted to have 14 membrane-spanning regions. This
           subgroup belongs to the solute carrier 5
           (SLC5)transporter family.
          Length = 635

 Score = 43.3 bits (102), Expect = 2e-05
 Identities = 21/77 (27%), Positives = 33/77 (42%), Gaps = 8/77 (10%)

Query: 63  RLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEK---EKDED-----EEKKREKK 114
           RL   L  +      L+ ++W E+E E E E +EE ++     K  +     ++ K  K 
Sbjct: 532 RLCWSLRNSTEERIDLDADDWTEDEDENEMETDEERKKPGCCRKAYNWFCGFDQGKAPKL 591

Query: 115 EEEEEKQKDKKEEEVEE 131
            EEEE     K  +  E
Sbjct: 592 TEEEEAALKMKMTDTSE 608



 Score = 40.6 bits (95), Expect = 1e-04
 Identities = 18/75 (24%), Positives = 34/75 (45%), Gaps = 16/75 (21%)

Query: 108 EKKREKKEEEEEKQKDKKEEEVEEEEE---------------EQKNKKKEEEEEEEKEKV 152
           E++ +   ++  + +D+ E E +EE +               +Q    K  EEEE   K+
Sbjct: 542 EERIDLDADDWTEDEDENEMETDEERKKPGCCRKAYNWFCGFDQGKAPKLTEEEEAALKM 601

Query: 153 EMY-RNPVPMYRNPV 166
           +M   +  P++RN V
Sbjct: 602 KMTDTSEKPLWRNVV 616



 Score = 29.8 bits (67), Expect = 0.74
 Identities = 8/39 (20%), Positives = 17/39 (43%), Gaps = 7/39 (17%)

Query: 111 REKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEK 149
           R   EE  +        + ++  E++   + E +EE +K
Sbjct: 538 RNSTEERIDL-------DADDWTEDEDENEMETDEERKK 569


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
           Fas-binding protein) is thought to play a role in
           apoptosis, but precise role played by Daxx remains to be
           determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 43.0 bits (101), Expect = 3e-05
 Identities = 36/73 (49%), Positives = 46/73 (63%)

Query: 82  EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
           E EEEE  +EEEEEEEEEE+E+ E EE++ E +EEEEE + D   EE  E   E     +
Sbjct: 438 ESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGE 497

Query: 142 EEEEEEEKEKVEM 154
           E EE+ E+   EM
Sbjct: 498 EPEEDAERRNSEM 510



 Score = 42.6 bits (100), Expect = 4e-05
 Identities = 29/86 (33%), Positives = 46/86 (53%)

Query: 82  EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
           +  EEE+  EEEEEEEEEE+E++++ E++  + EEEEE+ +     E E E   + +   
Sbjct: 437 QESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDG 496

Query: 142 EEEEEEEKEKVEMYRNPVPMYRNPVP 167
           EE EE+ + +         M     P
Sbjct: 497 EEPEEDAERRNSEMAGISRMSEGQQP 522



 Score = 39.1 bits (91), Expect = 5e-04
 Identities = 26/73 (35%), Positives = 45/73 (61%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
           E++E  E EEEE+E+EE+E EEEE ++++E+EE + +   EEE +   + + + EE EE+
Sbjct: 443 ESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEED 502

Query: 136 QKNKKKEEEEEEE 148
            + +  E      
Sbjct: 503 AERRNSEMAGISR 515



 Score = 38.4 bits (89), Expect = 0.001
 Identities = 23/67 (34%), Positives = 38/67 (56%)

Query: 81  NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
                    +      +E E+E+  +EE++ E++EEEEE++ +++E E EEEEEE +   
Sbjct: 421 PSKASSTSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADN 480

Query: 141 KEEEEEE 147
             EEE E
Sbjct: 481 GSEEEME 487



 Score = 37.2 bits (86), Expect = 0.002
 Identities = 33/92 (35%), Positives = 49/92 (53%), Gaps = 6/92 (6%)

Query: 69  PQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
           P   S     E +  EEEE+E+EEEEEE+E E+E+ EDEE      EEEE +  +  EEE
Sbjct: 432 PSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEE------EEEEVEADNGSEEE 485

Query: 129 VEEEEEEQKNKKKEEEEEEEKEKVEMYRNPVP 160
           +E   E   + ++ EE+ E +       + + 
Sbjct: 486 MEGSSEGDGDGEEPEEDAERRNSEMAGISRMS 517



 Score = 36.4 bits (84), Expect = 0.005
 Identities = 27/76 (35%), Positives = 41/76 (53%)

Query: 73  SASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
            + E  E  E EEEE+E+E EEEE E+E+E++E E     ++E E   + D   EE EE+
Sbjct: 443 ESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEED 502

Query: 133 EEEQKNKKKEEEEEEE 148
            E + ++        E
Sbjct: 503 AERRNSEMAGISRMSE 518



 Score = 36.0 bits (83), Expect = 0.007
 Identities = 26/79 (32%), Positives = 38/79 (48%)

Query: 75  SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
            E  E  E EEEE+E EEEE E+EEE+E+ E +    E+ E   E   D +E E + E  
Sbjct: 447 EEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERR 506

Query: 135 EQKNKKKEEEEEEEKEKVE 153
             +        E ++ +  
Sbjct: 507 NSEMAGISRMSEGQQPRGS 525



 Score = 35.7 bits (82), Expect = 0.008
 Identities = 28/77 (36%), Positives = 44/77 (57%), Gaps = 7/77 (9%)

Query: 84  EEEEKEKEEEEEEE-------EEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQ 136
           EEEE+ K +E E +       +  K      E      +E EE++  ++EEE EEEEEE+
Sbjct: 399 EEEERRKRQERERQGTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEEEEEEEE 458

Query: 137 KNKKKEEEEEEEKEKVE 153
           + + +EEE E+E+E+ E
Sbjct: 459 EQESEEEEGEDEEEEEE 475



 Score = 31.4 bits (71), Expect = 0.23
 Identities = 22/74 (29%), Positives = 36/74 (48%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
           E  E E+E +E+E E+EEEEEE E D   E++ E   E +   ++ +E+      E    
Sbjct: 454 EEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEMAGI 513

Query: 139 KKKEEEEEEEKEKV 152
            +  E ++     V
Sbjct: 514 SRMSEGQQPRGSSV 527



 Score = 29.5 bits (66), Expect = 0.86
 Identities = 19/79 (24%), Positives = 37/79 (46%)

Query: 75  SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
            E  E  E +E E+E+ E+EEEEEE +  +  EE+     E + + ++ +++ E    E 
Sbjct: 451 EEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEM 510

Query: 135 EQKNKKKEEEEEEEKEKVE 153
              ++  E ++        
Sbjct: 511 AGISRMSEGQQPRGSSVQP 529



 Score = 29.1 bits (65), Expect = 1.4
 Identities = 17/81 (20%), Positives = 36/81 (44%)

Query: 68  LPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEE 127
             +    SE  E  + EEEE+ + +   EEE E   + D + +  +++ E    +     
Sbjct: 455 EEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEMAGIS 514

Query: 128 EVEEEEEEQKNKKKEEEEEEE 148
            + E ++ + +  + E  +EE
Sbjct: 515 RMSEGQQPRGSSVQPESPQEE 535



 Score = 28.7 bits (64), Expect = 1.9
 Identities = 19/69 (27%), Positives = 37/69 (53%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
           +++ E+EE  + +E E++          K      +      +E EEEE  ++ +++EEE
Sbjct: 395 QDDTEEEERRKRQERERQGTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEEEE 454

Query: 145 EEEEKEKVE 153
           EEEE+++ E
Sbjct: 455 EEEEEQESE 463



 Score = 28.3 bits (63), Expect = 2.4
 Identities = 25/83 (30%), Positives = 47/83 (56%)

Query: 68  LPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEE 127
           L + IS     +++  EEE ++++E E +    +  D  +      +      Q+ ++EE
Sbjct: 384 LEEVISKYAMKQDDTEEEERRKRQERERQGTSSRSSDPSKASSTSGESPSMASQESEEEE 443

Query: 128 EVEEEEEEQKNKKKEEEEEEEKE 150
            VEEEEEE++ +++EE+E EE+E
Sbjct: 444 SVEEEEEEEEEEEEEEQESEEEE 466



 Score = 27.2 bits (60), Expect = 5.4
 Identities = 23/82 (28%), Positives = 37/82 (45%), Gaps = 4/82 (4%)

Query: 73  SASETLENNEWEEEEKEKEEEEEE----EEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
              E  + +E EE E E+EEEE E     EEE E   + +   E+ EE+ E++  +    
Sbjct: 454 EEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEMAGI 513

Query: 129 VEEEEEEQKNKKKEEEEEEEKE 150
               E +Q      + E  ++E
Sbjct: 514 SRMSEGQQPRGSSVQPESPQEE 535


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 42.2 bits (100), Expect = 4e-05
 Identities = 23/90 (25%), Positives = 49/90 (54%)

Query: 62  ARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQ 121
             L  +L   +   + L N +   E   K ++  EEEEEK     EE+++E+ +E++E++
Sbjct: 232 EDLLPLLNLVLYLVDKLANLKLSPEVLRKVDKTREEEEEKILKAAEEERQEEAQEKKEEK 291

Query: 122 KDKKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
           K ++ E    +   ++ +K EE+E +++ +
Sbjct: 292 KKEEREAKLAKLSPEEQRKLEEKERKKQAR 321



 Score = 36.1 bits (84), Expect = 0.005
 Identities = 16/56 (28%), Positives = 32/56 (57%)

Query: 70  QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKK 125
           +           E +EE +EK+EE+++EE E +  +   +++ K EE+E K++ +K
Sbjct: 267 EEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQARK 322



 Score = 34.2 bits (79), Expect = 0.020
 Identities = 16/48 (33%), Positives = 30/48 (62%)

Query: 107 EEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEM 154
           E  ++  K  EEE++K  K  E E +EE Q+ K+++++EE E +  ++
Sbjct: 256 EVLRKVDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKL 303



 Score = 31.5 bits (72), Expect = 0.17
 Identities = 16/45 (35%), Positives = 27/45 (60%), Gaps = 1/45 (2%)

Query: 110 KREKKEEEEEKQKDKKEEEVEEE-EEEQKNKKKEEEEEEEKEKVE 153
            + ++EEEE+  K  +EE  EE  E++++ KK+E E +  K   E
Sbjct: 262 DKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPE 306


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 42.1 bits (99), Expect = 4e-05
 Identities = 19/78 (24%), Positives = 49/78 (62%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
           E +++N     +K +E  +E  ++ ++  +   + +++ +  EE +   KEE++++EE E
Sbjct: 354 EEVKSNTDLSGDKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENE 413

Query: 136 QKNKKKEEEEEEEKEKVE 153
           +K K++ +E++E+++K E
Sbjct: 414 KKQKEQADEDKEKRQKDE 431



 Score = 41.3 bits (97), Expect = 8e-05
 Identities = 18/82 (21%), Positives = 51/82 (62%), Gaps = 1/82 (1%)

Query: 70  QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
           + + ++  L  ++ +E  KE  ++ ++  ++  + +DE     ++ E + K++  K+EE 
Sbjct: 354 EEVKSNTDLSGDKRQELLKEYNKKLQDYTKKLGEVKDE-TDASEEAEAKAKEEKLKQEEN 412

Query: 130 EEEEEEQKNKKKEEEEEEEKEK 151
           E++++EQ ++ KE+ +++E++K
Sbjct: 413 EKKQKEQADEDKEKRQKDERKK 434



 Score = 33.2 bits (76), Expect = 0.053
 Identities = 21/92 (22%), Positives = 49/92 (53%), Gaps = 8/92 (8%)

Query: 70  QAISASETLENNEW--------EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQ 121
           +AI+ +  L++N+          EE K   +   ++ +E  K+ +++ +   K+  E K 
Sbjct: 331 EAINIARNLDDNDLTLLALTKLYEEVKSNTDLSGDKRQELLKEYNKKLQDYTKKLGEVKD 390

Query: 122 KDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
           +    EE E + +E+K K++E E++++++  E
Sbjct: 391 ETDASEEAEAKAKEEKLKQEENEKKQKEQADE 422


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 41.6 bits (98), Expect = 4e-05
 Identities = 22/70 (31%), Positives = 51/70 (72%), Gaps = 2/70 (2%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           E  E +K+++E  EE EK K E EEK++ K ++++ K+K  K+++  ++++++ +K +++
Sbjct: 56  EYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKD--KKDDKKDDKSEKK 113

Query: 144 EEEEEKEKVE 153
           +E+E ++K+E
Sbjct: 114 DEKEAEDKLE 123



 Score = 37.7 bits (88), Expect = 8e-04
 Identities = 21/69 (30%), Positives = 44/69 (63%), Gaps = 1/69 (1%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           EE EK K+E EE+++ + +K + ++KK + K+++++K+ DK E++ +E+E E K +   +
Sbjct: 69  EEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKK-DEKEAEDKLEDLTK 127

Query: 144 EEEEEKEKV 152
              E    +
Sbjct: 128 SYSETLSTL 136



 Score = 36.2 bits (84), Expect = 0.003
 Identities = 15/58 (25%), Positives = 37/58 (63%)

Query: 83  WEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
           W++++ +K++++++++++ +KD+  EKK EK+ E++ +   K   E      E K +K
Sbjct: 86  WKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTLSELKPRK 143



 Score = 35.8 bits (83), Expect = 0.003
 Identities = 18/67 (26%), Positives = 43/67 (64%), Gaps = 6/67 (8%)

Query: 93  EEEEEEEEKEKDEDEEKKREKKEEEEEKQ------KDKKEEEVEEEEEEQKNKKKEEEEE 146
            E  E ++K+K+  EE ++ KKE EE+++      K KK+++ ++++++ K   K E+++
Sbjct: 55  AEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKD 114

Query: 147 EEKEKVE 153
           E++ + +
Sbjct: 115 EKEAEDK 121



 Score = 35.1 bits (81), Expect = 0.009
 Identities = 20/84 (23%), Positives = 52/84 (61%), Gaps = 1/84 (1%)

Query: 82  EWEEEEKEKEEEE-EEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
           E+EE++K K +++  +++++K+KD+ ++KK +K E+++EK+ + K E++ +   E  +  
Sbjct: 77  EYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTL 136

Query: 141 KEEEEEEEKEKVEMYRNPVPMYRN 164
            E +  +     ++Y++ +   R 
Sbjct: 137 SELKPRKYALHKDIYQSRLDRKRR 160



 Score = 27.7 bits (62), Expect = 2.2
 Identities = 9/56 (16%), Positives = 33/56 (58%)

Query: 78  LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
               +  +++K+K++++++++++ + ++ +EK+ E K E+  K   +    + E +
Sbjct: 85  KWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTLSELK 140


>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein.  The YqfQ-like protein family
           includes the B. subtilis YqfQ protein, also known as
           VrrA, which is functionally uncharacterized. This family
           of proteins is found in bacteria. Proteins in this
           family are typically between 146 and 237 amino acids in
           length. There are two conserved sequence motifs: QYGP
           and PKLY.
          Length = 155

 Score = 41.3 bits (97), Expect = 5e-05
 Identities = 21/63 (33%), Positives = 39/63 (61%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
           E +  ++EE+E EEE  +E E+++  E + + +EKK+ E  K K +KE+   E ++ + +
Sbjct: 90  ELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPS 149

Query: 139 KKK 141
           K K
Sbjct: 150 KPK 152



 Score = 38.2 bits (89), Expect = 5e-04
 Identities = 26/83 (31%), Positives = 40/83 (48%), Gaps = 5/83 (6%)

Query: 48  QVGP--AQYHKPIR---ANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKE 102
           QVGP   QY   +R   A  ++   L  +    E  E    +E E+E   E + E +EK+
Sbjct: 66  QVGPMVQQYGPLVRNLPAMWKIFRELSSSDDEEEETEEESTDETEQEDPPETKTESKEKK 125

Query: 103 KDEDEEKKREKKEEEEEKQKDKK 125
           K E  + K EK++ + E +K K 
Sbjct: 126 KREVPKPKTEKEKPKTEPKKPKP 148



 Score = 35.9 bits (83), Expect = 0.003
 Identities = 17/64 (26%), Positives = 33/64 (51%), Gaps = 9/64 (14%)

Query: 88  KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
              +EEEE EEE  +         E ++E+  + K + +E+ + E  + K +K++ + E 
Sbjct: 93  SSDDEEEETEEESTD---------ETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEP 143

Query: 148 EKEK 151
           +K K
Sbjct: 144 KKPK 147



 Score = 35.5 bits (82), Expect = 0.004
 Identities = 14/52 (26%), Positives = 29/52 (55%)

Query: 101 KEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKV 152
              DE+EE + E  +E E++   + + E +E+++ +  K K E+E+ + E  
Sbjct: 93  SSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPK 144



 Score = 29.7 bits (67), Expect = 0.42
 Identities = 11/40 (27%), Positives = 23/40 (57%)

Query: 114 KEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
            +EEEE +++  +E  +E+  E K + KE+++ E  +   
Sbjct: 95  DDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKT 134



 Score = 29.0 bits (65), Expect = 0.70
 Identities = 11/41 (26%), Positives = 25/41 (60%)

Query: 111 REKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
           RE    ++E+++ ++E   E E+E+    K E +E++++E 
Sbjct: 89  RELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREV 129



 Score = 25.9 bits (57), Expect = 9.3
 Identities = 14/46 (30%), Positives = 23/46 (50%)

Query: 108 EKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
           E      EEEE +++   E E E+  E +   K++++ E  K K E
Sbjct: 90  ELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTE 135


>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein.  CDC45 is an essential gene
           required for initiation of DNA replication in S.
           cerevisiae, forming a complex with MCM5/CDC46.
           Homologues of CDC45 have been identified in human, mouse
           and smut fungus among others.
          Length = 583

 Score = 41.9 bits (99), Expect = 5e-05
 Identities = 20/102 (19%), Positives = 50/102 (49%), Gaps = 7/102 (6%)

Query: 55  HKP-----IRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEK 109
           H+P     +  + ++ +     I   E L++    ++     EE+++++EE +++++E  
Sbjct: 84  HRPWNLDNVFGSDQVVIFDDGDIE--EELQDEPRYDDAYRDLEEDDDDDEESDEEDEESS 141

Query: 110 KREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
           K E  E++++   D      E   E ++ +++ EE+  E E 
Sbjct: 142 KSEDDEDDDDDDDDDDIATRERSLERRRRRREWEEKRAELEF 183



 Score = 35.0 bits (81), Expect = 0.012
 Identities = 10/58 (17%), Positives = 29/58 (50%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQ 136
           +    EE+E+  + E++E++++ + D+D   +    E    +++ +++    E    Q
Sbjct: 130 DEESDEEDEESSKSEDDEDDDDDDDDDDIATRERSLERRRRRREWEEKRAELEFYYYQ 187



 Score = 31.5 bits (72), Expect = 0.17
 Identities = 14/74 (18%), Positives = 32/74 (43%), Gaps = 12/74 (16%)

Query: 91  EEEEEEEEEEKEK-----------DEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
           EEE ++E    +            +E +E+  E  + E+++  D  +++ +    E+  +
Sbjct: 107 EEELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATRERSLE 166

Query: 140 KKEEEEE-EEKEKV 152
           ++    E EEK   
Sbjct: 167 RRRRRREWEEKRAE 180



 Score = 26.9 bits (60), Expect = 6.0
 Identities = 10/56 (17%), Positives = 31/56 (55%)

Query: 98  EEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
            + +E+ +DE +  +   + EE   D +E + E+EE  +    ++++++++ + + 
Sbjct: 104 GDIEEELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIA 159



 Score = 26.9 bits (60), Expect = 7.3
 Identities = 10/60 (16%), Positives = 28/60 (46%)

Query: 95  EEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEM 154
            + EEE + +   +      EE+++  ++  EE+ E  + E      +++++++    E 
Sbjct: 104 GDIEEELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATRER 163


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 41.9 bits (98), Expect = 8e-05
 Identities = 24/80 (30%), Positives = 44/80 (55%)

Query: 74   ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
            A E  E  + E++E  ++E+  EE    ++D  ++   +  E++E+  +D  EE V+E E
Sbjct: 4016 ADENKEEADAEKDEPMQDEDPLEENNTLDEDIQQDDFSDLAEDDEKMNEDGFEENVQENE 4075

Query: 134  EEQKNKKKEEEEEEEKEKVE 153
            E  ++  K +EE E+ E  E
Sbjct: 4076 ESTEDGVKSDEELEQGEVPE 4095



 Score = 36.9 bits (85), Expect = 0.004
 Identities = 15/72 (20%), Positives = 36/72 (50%), Gaps = 2/72 (2%)

Query: 79   ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEE--EKQKDKKEEEVEEEEEEQ 136
            E +E  ++E   EE    +E+ ++ D  +  + ++K  E+  E+   + EE  E+  +  
Sbjct: 4026 EKDEPMQDEDPLEENNTLDEDIQQDDFSDLAEDDEKMNEDGFEENVQENEESTEDGVKSD 4085

Query: 137  KNKKKEEEEEEE 148
            +  ++ E  E++
Sbjct: 4086 EELEQGEVPEDQ 4097



 Score = 36.1 bits (83), Expect = 0.006
 Identities = 22/77 (28%), Positives = 39/77 (50%), Gaps = 6/77 (7%)

Query: 76   ETLENNEWEEEEKEKEEEEEEEEEEKEKDED-----EEKKREKKEEEEEKQKD-KKEEEV 129
            E  E +  E E+K  E+     E +    ED     E+K R++KE+EEE   D   ++E+
Sbjct: 3910 EPNEEDLLETEQKSNEQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSDDVGIDDEI 3969

Query: 130  EEEEEEQKNKKKEEEEE 146
            + + +E  ++   E E+
Sbjct: 3970 QPDIQENNSQPPPENED 3986



 Score = 36.1 bits (83), Expect = 0.007
 Identities = 16/79 (20%), Positives = 36/79 (45%)

Query: 75   SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
            S +L    W+E  +E   E E++  E+    +E     K+++ +  +   ++E+ +EEE 
Sbjct: 3900 SNSLNEKLWDEPNEEDLLETEQKSNEQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEM 3959

Query: 135  EQKNKKKEEEEEEEKEKVE 153
                   +E + + +E   
Sbjct: 3960 SDDVGIDDEIQPDIQENNS 3978



 Score = 35.4 bits (81), Expect = 0.011
 Identities = 21/97 (21%), Positives = 43/97 (44%), Gaps = 22/97 (22%)

Query: 74   ASETLENNEWEEEEKEKEEEEEEEEEE-----------------KEKDEDEEKKREKKEE 116
               T ++   E E  + EEE ++ +EE                  E +E++  + E+K  
Sbjct: 3865 NGVTKDSVVSENENSDSEEENQDLDEEVNDIPEDLSNSLNEKLWDEPNEEDLLETEQKSN 3924

Query: 117  EE-----EKQKDKKEEEVEEEEEEQKNKKKEEEEEEE 148
            E+     E     KE++ +  E++ + +K++EEE  +
Sbjct: 3925 EQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSD 3961



 Score = 35.4 bits (81), Expect = 0.013
 Identities = 20/92 (21%), Positives = 40/92 (43%), Gaps = 15/92 (16%)

Query: 78   LENNEWEEEEKEKEEEEEEEEEEKEKDEDE-------EKKREKKEEEEEKQKDKKEEEVE 130
             +++   E E    EEE ++ +E+  D  E       EK  ++  EE+  + ++K  E  
Sbjct: 3868 TKDSVVSENENSDSEEENQDLDEEVNDIPEDLSNSLNEKLWDEPNEEDLLETEQKSNEQS 3927

Query: 131  EEEEEQKNKKKEE--------EEEEEKEKVEM 154
                E     KE+        + +E++++ EM
Sbjct: 3928 AANNESDLVSKEDDNKALEDKDRQEKEDEEEM 3959



 Score = 35.0 bits (80), Expect = 0.013
 Identities = 15/74 (20%), Positives = 37/74 (50%), Gaps = 2/74 (2%)

Query: 79   ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
            ENN  +E+ ++ +  +  E++EK  ++  E+    +E EE  +   K +E  E+ E  ++
Sbjct: 4039 ENNTLDEDIQQDDFSDLAEDDEKMNEDGFEE--NVQENEESTEDGVKSDEELEQGEVPED 4096

Query: 139  KKKEEEEEEEKEKV 152
            +  +   + + +  
Sbjct: 4097 QAIDNHPKMDAKST 4110



 Score = 34.6 bits (79), Expect = 0.019
 Identities = 20/86 (23%), Positives = 35/86 (40%), Gaps = 3/86 (3%)

Query: 70   QAISASETLENNEWEEEEKEKEEEEEEE---EEEKEKDEDEEKKREKKEEEEEKQKDKKE 126
                  E+ E+    +EE E+ E  E++      K   +      E  EE  +K    + 
Sbjct: 4070 NVQENEESTEDGVKSDEELEQGEVPEDQAIDNHPKMDAKSTFASAEADEENTDKGIVGEN 4129

Query: 127  EEVEEEEEEQKNKKKEEEEEEEKEKV 152
            EE+ EE+  + N   + E E+ +E  
Sbjct: 4130 EELGEEDGVRGNGTADGEFEQVQEDT 4155



 Score = 34.6 bits (79), Expect = 0.019
 Identities = 18/66 (27%), Positives = 31/66 (46%)

Query: 79   ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
            +      E+ + EE   EE+   + D DE + RE + +     KD    E E  + E++N
Sbjct: 3826 DITNTLNEDDDLEELANEEDTANQSDLDESEARELESDMNGVTKDSVVSENENSDSEEEN 3885

Query: 139  KKKEEE 144
            +  +EE
Sbjct: 3886 QDLDEE 3891



 Score = 33.0 bits (75), Expect = 0.068
 Identities = 14/61 (22%), Positives = 29/61 (47%)

Query: 72   ISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEE 131
            ++  + LE    EE+   + + +E E  E E D +   K     E E    +++ ++++E
Sbjct: 3831 LNEDDDLEELANEEDTANQSDLDESEARELESDMNGVTKDSVVSENENSDSEEENQDLDE 3890

Query: 132  E 132
            E
Sbjct: 3891 E 3891



 Score = 33.0 bits (75), Expect = 0.074
 Identities = 21/84 (25%), Positives = 43/84 (51%), Gaps = 19/84 (22%)

Query: 86   EEKEKEEEEEEEEEEKEKDEDEEKKREKKEE-------------EEEKQKDKKEEEVEEE 132
            E+K+++E+E+EEE   +   D+E + + +E               E+ + D+KE +V ++
Sbjct: 3946 EDKDRQEKEDEEEMSDDVGIDDEIQPDIQENNSQPPPENEDLDLPEDLKLDEKEGDVSKD 4005

Query: 133  EE------EQKNKKKEEEEEEEKE 150
             +      E  ++ KEE + E+ E
Sbjct: 4006 SDLEDMDMEAADENKEEADAEKDE 4029



 Score = 32.3 bits (73), Expect = 0.10
 Identities = 19/74 (25%), Positives = 37/74 (50%), Gaps = 3/74 (4%)

Query: 87   EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKE---E 143
            E   E +EE + E+ E  +DE+   E    +E+ Q+D   +  E++E+  ++  +E   E
Sbjct: 4014 EAADENKEEADAEKDEPMQDEDPLEENNTLDEDIQQDDFSDLAEDDEKMNEDGFEENVQE 4073

Query: 144  EEEEEKEKVEMYRN 157
             EE  ++ V+    
Sbjct: 4074 NEESTEDGVKSDEE 4087



 Score = 31.5 bits (71), Expect = 0.21
 Identities = 14/75 (18%), Positives = 43/75 (57%)

Query: 79   ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
            E+ + +E+E +  ++ + E+ + E  ++ +++ + +++E  + +D  EE    +E+ Q++
Sbjct: 3991 EDLKLDEKEGDVSKDSDLEDMDMEAADENKEEADAEKDEPMQDEDPLEENNTLDEDIQQD 4050

Query: 139  KKKEEEEEEEKEKVE 153
               +  E++EK   +
Sbjct: 4051 DFSDLAEDDEKMNED 4065



 Score = 31.1 bits (70), Expect = 0.32
 Identities = 20/84 (23%), Positives = 33/84 (39%), Gaps = 9/84 (10%)

Query: 78   LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE---------EE 128
            L  ++ +  E   EE  +E EE  E     +++ E+ E  E++  D              
Sbjct: 4055 LAEDDEKMNEDGFEENVQENEESTEDGVKSDEELEQGEVPEDQAIDNHPKMDAKSTFASA 4114

Query: 129  VEEEEEEQKNKKKEEEEEEEKEKV 152
              +EE   K    E EE  E++ V
Sbjct: 4115 EADEENTDKGIVGENEELGEEDGV 4138



 Score = 30.7 bits (69), Expect = 0.34
 Identities = 16/70 (22%), Positives = 29/70 (41%), Gaps = 1/70 (1%)

Query: 84   EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
            E+      E+++ EE   E+D   +     + E  E + D      +    E +N   EE
Sbjct: 3825 EDITNTLNEDDDLEELANEEDTANQSD-LDESEARELESDMNGVTKDSVVSENENSDSEE 3883

Query: 144  EEEEEKEKVE 153
            E ++  E+V 
Sbjct: 3884 ENQDLDEEVN 3893



 Score = 30.4 bits (68), Expect = 0.46
 Identities = 18/76 (23%), Positives = 41/76 (53%), Gaps = 3/76 (3%)

Query: 76   ETLENNEWEEEEKEKEEEE--EEEEEEKEKDED-EEKKREKKEEEEEKQKDKKEEEVEEE 132
            E       E E+ +  E+   +E+E +  KD D E+   E  +E +E+   +K+E +++E
Sbjct: 3975 ENNSQPPPENEDLDLPEDLKLDEKEGDVSKDSDLEDMDMEAADENKEEADAEKDEPMQDE 4034

Query: 133  EEEQKNKKKEEEEEEE 148
            +  ++N   +E+ +++
Sbjct: 4035 DPLEENNTLDEDIQQD 4050



 Score = 30.4 bits (68), Expect = 0.55
 Identities = 17/86 (19%), Positives = 43/86 (50%), Gaps = 7/86 (8%)

Query: 70   QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDED----EEKKREKKEEEEEKQKDKK 125
            +     E   +++   +++ + + +E   +   ++ED    E+ K ++KE +  K  D +
Sbjct: 3950 RQEKEDEEEMSDDVGIDDEIQPDIQENNSQPPPENEDLDLPEDLKLDEKEGDVSKDSDLE 4009

Query: 126  EEEVE---EEEEEQKNKKKEEEEEEE 148
            + ++E   E +EE   +K E  ++E+
Sbjct: 4010 DMDMEAADENKEEADAEKDEPMQDED 4035



 Score = 30.0 bits (67), Expect = 0.64
 Identities = 15/84 (17%), Positives = 39/84 (46%), Gaps = 4/84 (4%)

Query: 71   AISASETLENNEWEEEEKEKEEEEEEEE----EEKEKDEDEEKKREKKEEEEEKQKDKKE 126
             +  + TL+ +  +++  +  E++E+      EE  ++ +E  +   K +EE +Q +  E
Sbjct: 4036 PLEENNTLDEDIQQDDFSDLAEDDEKMNEDGFEENVQENEESTEDGVKSDEELEQGEVPE 4095

Query: 127  EEVEEEEEEQKNKKKEEEEEEEKE 150
            ++  +   +   K      E ++E
Sbjct: 4096 DQAIDNHPKMDAKSTFASAEADEE 4119



 Score = 28.8 bits (64), Expect = 1.4
 Identities = 26/73 (35%), Positives = 37/73 (50%), Gaps = 15/73 (20%)

Query: 95   EEEEEEKEKDEDEEKKREKKEEEEEKQKD----------KKEEEVEEEEEEQKNKKK--- 141
            E+ EEE EKD DE  ++EK  ++ +  KD            E E EEE+ E ++      
Sbjct: 2927 EQIEEEDEKD-DEYGEQEKVSDKNDYLKDFIDYDDDGSISTESEPEEEKRECESGFDGWN 2985

Query: 142  -EEEEEEEKEKVE 153
              EEEE++ EK E
Sbjct: 2986 DYEEEEDDDEKNE 2998



 Score = 28.0 bits (62), Expect = 2.9
 Identities = 15/85 (17%), Positives = 40/85 (47%)

Query: 65   TVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDK 124
             + LP+ +   E   +   + + ++ + E  +E +E+   E +E  +++   EE    D+
Sbjct: 3986 DLDLPEDLKLDEKEGDVSKDSDLEDMDMEAADENKEEADAEKDEPMQDEDPLEENNTLDE 4045

Query: 125  KEEEVEEEEEEQKNKKKEEEEEEEK 149
              ++ +  +  + ++K  E+  EE 
Sbjct: 4046 DIQQDDFSDLAEDDEKMNEDGFEEN 4070



 Score = 27.3 bits (60), Expect = 5.7
 Identities = 12/69 (17%), Positives = 34/69 (49%), Gaps = 1/69 (1%)

Query: 82   EWEEEEKEKEEEEEEEE-EEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
            EWE+  + + +  ++ E    ++DE+E+ +     E+++ +   ++E   +  +   +  
Sbjct: 4187 EWEDLTESQSQAFDDSEFMHVKEDEEEDLQALGNAEKDQIKSIDRDESANQNPDSMNSTN 4246

Query: 141  KEEEEEEEK 149
              E+E +E 
Sbjct: 4247 IAEDEADEV 4255



 Score = 26.9 bits (59), Expect = 7.8
 Identities = 27/100 (27%), Positives = 45/100 (45%), Gaps = 15/100 (15%)

Query: 76   ETLENNEWEEEEKEKEEE--------EEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEE 127
            +   + E E EE+++E E         EEEE++ EK+E E +  +  E   E +K   E 
Sbjct: 2961 DGSISTESEPEEEKRECESGFDGWNDYEEEEDDDEKNEKECELEQNAEILIEDEKIGSEG 3020

Query: 128  EVEEEEEE-------QKNKKKEEEEEEEKEKVEMYRNPVP 160
             +E   EE       + N+  E EE++ +   E Y +   
Sbjct: 3021 SLESGGEELVCSNLSEGNESSEIEEDDVESGGEFYADSRD 3060


>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
          Length = 1066

 Score = 41.4 bits (97), Expect = 8e-05
 Identities = 16/75 (21%), Positives = 44/75 (58%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
           E  +    E+E E ++++EE+ KEK+  + K  +K+ + + + +   +     ++ E+K+
Sbjct: 7   EAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKKS 66

Query: 139 KKKEEEEEEEKEKVE 153
           +K++ E+E  ++ ++
Sbjct: 67  RKRDVEDENPEDFID 81



 Score = 29.9 bits (67), Expect = 0.79
 Identities = 14/42 (33%), Positives = 26/42 (61%)

Query: 107 EEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEE 148
           E +  +K   EEE ++ KK+EE  +E+E +K K  ++E + +
Sbjct: 5   ESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAK 46



 Score = 29.5 bits (66), Expect = 1.0
 Identities = 15/42 (35%), Positives = 27/42 (64%)

Query: 105 EDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEE 146
           E E +K+   EEE E++K K+E+  E+E ++ K  +KE + +
Sbjct: 5   ESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAK 46


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 41.3 bits (98), Expect = 1e-04
 Identities = 21/71 (29%), Positives = 47/71 (66%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
           E  E   E ++ E+   ++EE  ++  +  +KRE++ E++EK+ ++K++E+E++EEE + 
Sbjct: 76  ELRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEE 135

Query: 139 KKKEEEEEEEK 149
             +E+ +E E+
Sbjct: 136 LIEEQLQELER 146



 Score = 39.0 bits (92), Expect = 5e-04
 Identities = 24/69 (34%), Positives = 46/69 (66%), Gaps = 3/69 (4%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK---DKKEEEVEEEEEEQKNKK 140
           EE  K + E E+E  E + + +  EK+  +KEE  +++    +K+EEE+E++E+E + K+
Sbjct: 64  EEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQ 123

Query: 141 KEEEEEEEK 149
           +E E++EE+
Sbjct: 124 QELEKKEEE 132



 Score = 36.3 bits (85), Expect = 0.004
 Identities = 24/81 (29%), Positives = 49/81 (60%), Gaps = 1/81 (1%)

Query: 74  ASETLENNEWEEEEKEKEEEEE-EEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
           A   LE  + E E  +KE   E +EE  K ++E E++ RE++ E ++ +K   ++E   +
Sbjct: 40  AKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLD 99

Query: 133 EEEQKNKKKEEEEEEEKEKVE 153
            + +  +K+EEE E++++++E
Sbjct: 100 RKLELLEKREEELEKKEKELE 120



 Score = 35.5 bits (83), Expect = 0.009
 Identities = 29/91 (31%), Positives = 47/91 (51%), Gaps = 15/91 (16%)

Query: 80  NNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE------------- 126
           + + E  EK +EE E++E+E ++K ++ EKK E+ EE  E+Q  + E             
Sbjct: 99  DRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQELERISGLTAEEAKEI 158

Query: 127 --EEVEEEEEEQKNKKKEEEEEEEKEKVEMY 155
             E+VEEE   +     +E EEE KE+ +  
Sbjct: 159 LLEKVEEEARHEAAVLIKEIEEEAKEEADKK 189



 Score = 35.1 bits (82), Expect = 0.009
 Identities = 22/70 (31%), Positives = 38/70 (54%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           EEE K   EE ++E E  +K+   E K E  +   E +K+ +E   E ++ E++  +KEE
Sbjct: 37  EEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEE 96

Query: 144 EEEEEKEKVE 153
             + + E +E
Sbjct: 97  NLDRKLELLE 106



 Score = 34.8 bits (81), Expect = 0.014
 Identities = 23/75 (30%), Positives = 49/75 (65%), Gaps = 6/75 (8%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE-----QKN 138
           E +E+  +   E E+E +E+  +E +K EK+  ++E+  D+K E +E+ EEE     ++ 
Sbjct: 61  EAKEEIHKLRNEFEKELRER-RNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKEL 119

Query: 139 KKKEEEEEEEKEKVE 153
           ++K++E E+++E++E
Sbjct: 120 EQKQQELEKKEEELE 134



 Score = 27.8 bits (63), Expect = 2.6
 Identities = 19/53 (35%), Positives = 31/53 (58%)

Query: 99  EEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
           E K K+ +EE KR  +E ++E +  KKE  +E +EE  K + + E+E  E+  
Sbjct: 30  EAKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRN 82


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 40.2 bits (94), Expect = 2e-04
 Identities = 16/67 (23%), Positives = 40/67 (59%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
           EE+++K+E+++ EE ++++  ++E+ ++ ++E    Q+ KK+ E   ++   K K+ EE 
Sbjct: 78  EEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEA 137

Query: 145 EEEEKEK 151
             +    
Sbjct: 138 AAKAAAA 144



 Score = 40.2 bits (94), Expect = 2e-04
 Identities = 14/68 (20%), Positives = 39/68 (57%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           E+ +K+++++ EE ++++  +++  K+ EK+    ++QK + EE  ++   +QK  ++  
Sbjct: 79  EQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAA 138

Query: 144 EEEEEKEK 151
            +     K
Sbjct: 139 AKAAAAAK 146



 Score = 37.1 bits (86), Expect = 0.002
 Identities = 21/93 (22%), Positives = 43/93 (46%), Gaps = 7/93 (7%)

Query: 69  PQAISASETLENNEWEEEEKEKEEEEE---EEEEEKEKDEDEEKKREKKEEEEEKQKDKK 125
            +    +E L+  +  E+E+ K+ E+E    +E++K+ +E  ++   K+++ EE      
Sbjct: 83  KKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAA 142

Query: 126 EEEVEEEEEEQKN----KKKEEEEEEEKEKVEM 154
                + E E K      KK   E ++K + E 
Sbjct: 143 AAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEA 175



 Score = 35.2 bits (81), Expect = 0.012
 Identities = 13/72 (18%), Positives = 42/72 (58%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
           +E+++ EE ++++  E+E+ +  EK+R   +E++++ ++  ++   ++++ ++   K   
Sbjct: 84  KEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAA 143

Query: 145 EEEEKEKVEMYR 156
             + K + E  R
Sbjct: 144 AAKAKAEAEAKR 155



 Score = 34.0 bits (78), Expect = 0.024
 Identities = 17/78 (21%), Positives = 40/78 (51%)

Query: 74  ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
           A E  +  E ++ E+ ++++  E+E  K+ +++    +E+K++ EE  K    ++ + EE
Sbjct: 77  AEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEE 136

Query: 134 EEQKNKKKEEEEEEEKEK 151
              K     + + E + K
Sbjct: 137 AAAKAAAAAKAKAEAEAK 154



 Score = 33.6 bits (77), Expect = 0.032
 Identities = 15/67 (22%), Positives = 39/67 (58%), Gaps = 3/67 (4%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
           +++++K  +  EE+ +K++ +  E+ ++K+  E+E+    K+ E E    +++ K+ EE 
Sbjct: 67  QQQQQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERL---KQLEKERLAAQEQKKQAEEA 123

Query: 145 EEEEKEK 151
            ++   K
Sbjct: 124 AKQAALK 130



 Score = 30.9 bits (70), Expect = 0.29
 Identities = 15/67 (22%), Positives = 28/67 (41%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
               +K   E +++ E E  +    + +KK E E   K   E + + E E +K    E +
Sbjct: 157 AAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAK 216

Query: 145 EEEEKEK 151
           ++   E 
Sbjct: 217 KKAAAEA 223



 Score = 30.2 bits (68), Expect = 0.47
 Identities = 17/70 (24%), Positives = 28/70 (40%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
             E K+K E E  ++   E  +  E +   K   E K+K + E + +   E +K    E 
Sbjct: 164 AAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEA 223

Query: 144 EEEEEKEKVE 153
           +    K   E
Sbjct: 224 KAAAAKAAAE 233



 Score = 29.8 bits (67), Expect = 0.62
 Identities = 21/86 (24%), Positives = 34/86 (39%), Gaps = 16/86 (18%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKRE-----KKEEEEEKQ-----------KDKKEE 127
            +E+K++ EE  ++   K+K  +E   +       K E E K+             KK E
Sbjct: 113 AQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAE 172

Query: 128 EVEEEEEEQKNKKKEEEEEEEKEKVE 153
               ++   + KKK E E   K   E
Sbjct: 173 AEAAKKAAAEAKKKAEAEAAAKAAAE 198



 Score = 29.4 bits (66), Expect = 0.96
 Identities = 14/70 (20%), Positives = 25/70 (35%)

Query: 82  EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
           + E E   K   E +++ E E  +    + +KK   E K    K     +   E+    K
Sbjct: 186 KAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAEAKAAAEKAAAAK 245

Query: 142 EEEEEEEKEK 151
             E+    + 
Sbjct: 246 AAEKAAAAKA 255



 Score = 29.0 bits (65), Expect = 1.3
 Identities = 12/61 (19%), Positives = 39/61 (63%)

Query: 93  EEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKV 152
           E+   +++++K     +++ KK+E+++ ++ ++++  E+E  +Q  K++   +E++K+  
Sbjct: 62  EQYNRQQQQQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAE 121

Query: 153 E 153
           E
Sbjct: 122 E 122



 Score = 29.0 bits (65), Expect = 1.3
 Identities = 15/78 (19%), Positives = 31/78 (39%), Gaps = 10/78 (12%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKR----------EKKEEEEEKQKDKKEEEVEEEE 133
             ++K+ EE   +     +   + E KR          E K++ E +   K   E +++ 
Sbjct: 128 ALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKA 187

Query: 134 EEQKNKKKEEEEEEEKEK 151
           E +   K   E +++ E 
Sbjct: 188 EAEAAAKAAAEAKKKAEA 205



 Score = 28.2 bits (63), Expect = 2.2
 Identities = 19/83 (22%), Positives = 37/83 (44%), Gaps = 10/83 (12%)

Query: 78  LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKR------EKKEEEEEKQ----KDKKEE 127
           LE      +E++K+ EE  ++   ++ + EE         + K E E K+      K   
Sbjct: 106 LEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAA 165

Query: 128 EVEEEEEEQKNKKKEEEEEEEKE 150
           E +++ E +  KK   E +++ E
Sbjct: 166 EAKKKAEAEAAKKAAAEAKKKAE 188



 Score = 27.8 bits (62), Expect = 2.6
 Identities = 18/66 (27%), Positives = 25/66 (37%), Gaps = 2/66 (3%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
              K K E E +      K    E K  KK E E  +K   E + + E E       E +
Sbjct: 143 AAAKAKAEAEAKRAAAAAKKAAAEAK--KKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAK 200

Query: 145 EEEEKE 150
           ++ E E
Sbjct: 201 KKAEAE 206


>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97 family.
            This model family represents the major capsid protein
           component of the heads (capsids) of bacteriophage HK97,
           phi-105, P27, and related phage. This model represents
           one of several analogous families lacking detectable
           sequence similarity. The gene encoding this component is
           typically located in an operon encoding the small and
           large terminase subunits, the portal protein and the
           prohead or maturation protease [Mobile and
           extrachromosomal element functions, Prophage functions].
          Length = 384

 Score = 40.0 bits (94), Expect = 2e-04
 Identities = 19/72 (26%), Positives = 35/72 (48%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           +  E EK  E EEE+ E +  ++E  K + + +  E+  D+ E +     E    +++EE
Sbjct: 17  KLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAASGEGGGGEEEEE 76

Query: 144 EEEEEKEKVEMY 155
           E + E  +   Y
Sbjct: 77  EAKAEAAEFRAY 88



 Score = 39.6 bits (93), Expect = 3e-04
 Identities = 19/70 (27%), Positives = 32/70 (45%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
            +  E E+  E EEE+ E D  +E+  +   E +  ++   E E +     +    +EEE
Sbjct: 16  RKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAASGEGGGGEEEE 75

Query: 145 EEEEKEKVEM 154
           EE + E  E 
Sbjct: 76  EEAKAEAAEF 85



 Score = 36.9 bits (86), Expect = 0.002
 Identities = 16/79 (20%), Positives = 33/79 (41%), Gaps = 8/79 (10%)

Query: 79  ENNEWEEEEKEKEEEEEEE--------EEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVE 130
           E     EEEK + +  +EE        +  +E  ++ E K     E    +++++E + E
Sbjct: 22  EKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAASGEGGGGEEEEEEAKAE 81

Query: 131 EEEEEQKNKKKEEEEEEEK 149
             E     +  ++   EE+
Sbjct: 82  AAEFRAYLRGGDDALAEER 100



 Score = 36.6 bits (85), Expect = 0.004
 Identities = 17/63 (26%), Positives = 31/63 (49%)

Query: 89  EKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEE 148
            KE  E   E  KE  +  E ++  + EEE+ + D  +EE+++ + E    ++  +E E 
Sbjct: 1   LKELREALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEA 60

Query: 149 KEK 151
           K  
Sbjct: 61  KPA 63



 Score = 35.8 bits (83), Expect = 0.006
 Identities = 21/75 (28%), Positives = 33/75 (44%), Gaps = 4/75 (5%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE----EEVEEEEEEQKNK 139
            E   E  +E  +  E+++  E EE+K E    +EE  K   E    EE+ +E E +   
Sbjct: 5   REALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAA 64

Query: 140 KKEEEEEEEKEKVEM 154
             E    EE+E+   
Sbjct: 65  SGEGGGGEEEEEEAK 79



 Score = 35.0 bits (81), Expect = 0.012
 Identities = 15/69 (21%), Positives = 35/69 (50%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           +E+  E EEE+ E +  KE+ +  + + ++ EE  ++ + K     E    E++ ++ + 
Sbjct: 21  DEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAASGEGGGGEEEEEEAKA 80

Query: 144 EEEEEKEKV 152
           E  E +  +
Sbjct: 81  EAAEFRAYL 89



 Score = 34.3 bits (79), Expect = 0.018
 Identities = 21/81 (25%), Positives = 35/81 (43%), Gaps = 1/81 (1%)

Query: 68  LPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEE 127
           L +       L  +E   E +E++ E +  +EE +K  D E  R ++  +E + K     
Sbjct: 8   LAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDK-LDAEIDRLEELLDELEAKPAASG 66

Query: 128 EVEEEEEEQKNKKKEEEEEEE 148
           E    EEE++  K E  E   
Sbjct: 67  EGGGGEEEEEEAKAEAAEFRA 87



 Score = 27.3 bits (61), Expect = 4.8
 Identities = 13/48 (27%), Positives = 24/48 (50%)

Query: 107 EEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEM 154
            ++ RE   E  ++ +   E+E   E EE+K +    +EE +K   E+
Sbjct: 1   LKELREALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEI 48


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 40.2 bits (95), Expect = 3e-04
 Identities = 25/79 (31%), Positives = 45/79 (56%), Gaps = 3/79 (3%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEE---EKQKDKKEEEVEEE 132
            +LE  E E E+K +E E   +E EK K+E EEKK + +EEE+   E+ + + ++ ++E 
Sbjct: 523 ASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEA 582

Query: 133 EEEQKNKKKEEEEEEEKEK 151
           ++E     KE  + ++   
Sbjct: 583 KKEADEIIKELRQLQKGGY 601



 Score = 37.1 bits (87), Expect = 0.003
 Identities = 23/73 (31%), Positives = 47/73 (64%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           EE E+E E++ EE E   ++ E  +++ E+K+E+ ++++DK  EE E+E ++   + K+E
Sbjct: 526 EELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKE 585

Query: 144 EEEEEKEKVEMYR 156
            +E  KE  ++ +
Sbjct: 586 ADEIIKELRQLQK 598



 Score = 36.0 bits (84), Expect = 0.006
 Identities = 18/82 (21%), Positives = 42/82 (51%), Gaps = 1/82 (1%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
           +  E E++  E+ E+E ++  ++ K E +E  +E ++ ++      K  E+ E  +   N
Sbjct: 559 KLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKGGYASVKAHELIEARKRL-N 617

Query: 139 KKKEEEEEEEKEKVEMYRNPVP 160
           K  E++E+++K++ E       
Sbjct: 618 KANEKKEKKKKKQKEKQEELKV 639



 Score = 33.3 bits (77), Expect = 0.047
 Identities = 24/76 (31%), Positives = 45/76 (59%), Gaps = 6/76 (7%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEE---EEKQKDKKEEEVEEEEEEQKN-- 138
           EE +K   E++E+  E     E+ E++ E+K EE     K+ +K +EE+EE++E+ +   
Sbjct: 505 EEAKKLIGEDKEKLNELIASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEE 564

Query: 139 -KKKEEEEEEEKEKVE 153
            K  EE E+E ++ ++
Sbjct: 565 DKLLEEAEKEAQQAIK 580



 Score = 31.7 bits (73), Expect = 0.14
 Identities = 21/90 (23%), Positives = 45/90 (50%), Gaps = 6/90 (6%)

Query: 70  QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEE-- 127
                 +  E  E   +E EK +EE EE++EK ++E+++   E ++E ++  K+ K+E  
Sbjct: 528 LERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEAD 587

Query: 128 ----EVEEEEEEQKNKKKEEEEEEEKEKVE 153
               E+ + ++      K  E  E ++++ 
Sbjct: 588 EIIKELRQLQKGGYASVKAHELIEARKRLN 617


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
           subunit.  This is a family of proteins which are
           subunits of the eukaryotic translation initiation factor
           3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
           cerevisiae protein eIF3j (HCR1) has been shown to be
           required for processing of 20S pre-rRNA and binds to 18S
           rRNA and eIF3 subunits Rpg1p and Prt1p.
          Length = 242

 Score = 39.6 bits (93), Expect = 3e-04
 Identities = 21/68 (30%), Positives = 41/68 (60%), Gaps = 1/68 (1%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKE-EEEEKQKDKKEEEVEEEEEEQKNKKKE 142
           +++ K+  +EEE+EE+E+EK +   K + KK  + + ++K+K + E EE+   +  +   
Sbjct: 30  DDDVKDSWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTP 89

Query: 143 EEEEEEKE 150
           E+E  EK 
Sbjct: 90  EDELAEKL 97



 Score = 38.5 bits (90), Expect = 6e-04
 Identities = 18/68 (26%), Positives = 45/68 (66%)

Query: 88  KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
           K+K ++E+E+++ K+  ++EE + +++E+ +   K K ++ ++ + EE++  K+E+EE+ 
Sbjct: 21  KDKWDDEDEDDDVKDSWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKG 80

Query: 148 EKEKVEMY 155
            +E  E  
Sbjct: 81  LRELEEDT 88



 Score = 36.9 bits (86), Expect = 0.002
 Identities = 17/73 (23%), Positives = 40/73 (54%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
           +  ++ + EE+E+++EE+ +   + K K   + K  EK++ + EK++    E  E+  E+
Sbjct: 32  DVKDSWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPED 91

Query: 136 QKNKKKEEEEEEE 148
           +  +K    + +E
Sbjct: 92  ELAEKLRLRKLQE 104



 Score = 35.8 bits (83), Expect = 0.005
 Identities = 19/69 (27%), Positives = 41/69 (59%), Gaps = 4/69 (5%)

Query: 83  WEEEEKEKEEEEEEEEE----EKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
           W+EEE E++EEE+ +       K+  + + +++EK + E+E++  ++ EE   E+E  + 
Sbjct: 37  WDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEK 96

Query: 139 KKKEEEEEE 147
            +  + +EE
Sbjct: 97  LRLRKLQEE 105


>gnl|CDD|130712 TIGR01651, CobT, cobaltochelatase, CobT subunit.  This model
           describes Pseudomonas denitrificans CobT gene product,
           which is a cobalt chelatase subunit that functions in
           cobalamin biosynthesis. Cobalamin (vitamin B12) can be
           synthesized via several pathways, including an aerobic
           pathway (found in Pseudomonas denitrificans) and an
           anaerobic pathway (found in P. shermanii and Salmonella
           typhimurium). These pathways differ in the point of
           cobalt insertion during corrin ring formation. There are
           apparently a number of variations on these two pathways,
           where the major differences seem to be concerned with
           the process of ring contraction. Confusion regarding the
           functions of enzymes found in the aerobic vs. anaerobic
           pathways has arisen because nonhomologous genes in these
           different pathways were given the same gene symbols.
           Thus, cobT in the aerobic pathway (P. denitrificans) is
           not a homolog of cobT in the anaerobic pathway (S.
           typhimurium). It should be noted that E. coli
           synthesizes cobalamin only when it is supplied with the
           precursor cobinamide, which is a complex intermediate.
           Additionally, all E. coli cobalamin synthesis genes
           (cobU, cobS and cobT) were named after their Salmonella
           typhimurium homologs which function in the anaerobic
           cobalamin synthesis pathway. This model describes the
           aerobic cobalamin pathway Pseudomonas denitrificans CobT
           gene product, which is a cobalt chelatase subunit, with
           a MW ~70 kDa. The aerobic pathway cobalt chelatase is a
           heterotrimeric, ATP-dependent enzyme that catalyzes
           cobalt insertion during cobalamin biosynthesis. The
           other two subunits are the P. denitrificans CobS
           (TIGR01650) and CobN (pfam02514 CobN/Magnesium
           Chelatase) proteins. To avoid potential confusion with
           the nonhomologous Salmonella typhimurium/E.coli cobT
           gene product, the P. denitrificans gene symbol is not
           used in the name of this model [Biosynthesis of
           cofactors, prosthetic groups, and carriers, Heme,
           porphyrin, and cobalamin].
          Length = 600

 Score = 39.9 bits (93), Expect = 3e-04
 Identities = 20/91 (21%), Positives = 50/91 (54%)

Query: 60  ANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEE 119
           A AR+   + +++  +E + ++   E+E++ ++++  E E++E+ E E + +E    +E 
Sbjct: 185 AFARVVREMLRSMELAEEMGDDTESEDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQES 244

Query: 120 KQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
           +  D++ E  EEE  +       +E +++ E
Sbjct: 245 EATDRESESGEEEMVQSDQDDLPDESDDDSE 275



 Score = 34.5 bits (79), Expect = 0.020
 Identities = 19/80 (23%), Positives = 40/80 (50%), Gaps = 3/80 (3%)

Query: 77  TLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE---EEVEEEE 133
           ++E  E   ++ E E+EE+ ++++  ++E EE+   + E +E     + E    E E  E
Sbjct: 196 SMELAEEMGDDTESEDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRESESGE 255

Query: 134 EEQKNKKKEEEEEEEKEKVE 153
           EE     +++  +E  +  E
Sbjct: 256 EEMVQSDQDDLPDESDDDSE 275



 Score = 29.5 bits (66), Expect = 0.87
 Identities = 11/76 (14%), Positives = 36/76 (47%)

Query: 74  ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
            +E+ +  + ++++  + E+EE+ E E E  E    +  +  + E +  +++  + ++++
Sbjct: 206 DTESEDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRESESGEEEMVQSDQDD 265

Query: 134 EEQKNKKKEEEEEEEK 149
              ++    E   E  
Sbjct: 266 LPDESDDDSETPGEGA 281



 Score = 28.8 bits (64), Expect = 1.5
 Identities = 18/64 (28%), Positives = 32/64 (50%), Gaps = 6/64 (9%)

Query: 91  EEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
           EE  ++ E E E+D D+++  E ++EE+       E E E +E     + +  + E E  
Sbjct: 201 EEMGDDTESEDEEDGDDDQPTENEQEEQ------GEGEGEGQEGSAPQESEATDRESESG 254

Query: 151 KVEM 154
           + EM
Sbjct: 255 EEEM 258



 Score = 28.0 bits (62), Expect = 2.8
 Identities = 17/69 (24%), Positives = 33/69 (47%), Gaps = 10/69 (14%)

Query: 86  EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
           E  E+  ++ E E+E++ D+D+  + E++E+ E           E E +E    ++ E  
Sbjct: 198 ELAEEMGDDTESEDEEDGDDDQPTENEQEEQGEG----------EGEGQEGSAPQESEAT 247

Query: 146 EEEKEKVEM 154
           + E E  E 
Sbjct: 248 DRESESGEE 256


>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
           subunit G; Reviewed.
          Length = 197

 Score = 38.9 bits (90), Expect = 4e-04
 Identities = 20/63 (31%), Positives = 47/63 (74%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
           +E+ EK  +E EE EE+ + E  E+K +++E++EE++++++E+ +EE+ ++ +++  E++
Sbjct: 125 QEQIEKARQEREELEERMEWERREEKIDEREDQEEQEREREEQTIEEQSDDSEHEIIEQD 184

Query: 145 EEE 147
           E E
Sbjct: 185 ESE 187



 Score = 36.2 bits (83), Expect = 0.004
 Identities = 19/61 (31%), Positives = 39/61 (63%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
           E  E  EWE  E++ +E E++EE+E+E++E   +++    E E  ++D+ E E ++++ E
Sbjct: 137 ELEERMEWERREEKIDEREDQEEQEREREEQTIEEQSDDSEHEIIEQDESETESDDDKTE 196

Query: 136 Q 136
           +
Sbjct: 197 K 197



 Score = 35.8 bits (82), Expect = 0.004
 Identities = 21/62 (33%), Positives = 40/62 (64%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
           EE +E+ E E  EE+  E+++ EE++RE++E+  E+Q D  E E+ E++E +     ++ 
Sbjct: 136 EELEERMEWERREEKIDEREDQEEQEREREEQTIEEQSDDSEHEIIEQDESETESDDDKT 195

Query: 145 EE 146
           E+
Sbjct: 196 EK 197



 Score = 35.8 bits (82), Expect = 0.004
 Identities = 27/96 (28%), Positives = 53/96 (55%)

Query: 58  IRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEE 117
           IR   +L  +    I   ++L   + + E+  +E EE EE  E E+ E++  +RE +EE+
Sbjct: 101 IRIRDQLRSVKKDDIKKKKSLIIRQEQIEKARQEREELEERMEWERREEKIDEREDQEEQ 160

Query: 118 EEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
           E +++++  EE  ++ E +  ++ E E E + +K E
Sbjct: 161 EREREEQTIEEQSDDSEHEIIEQDESETESDDDKTE 196



 Score = 33.5 bits (76), Expect = 0.026
 Identities = 23/86 (26%), Positives = 50/86 (58%)

Query: 68  LPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEE 127
           +P AI   + L + + ++ +K+K     +E+ EK + E EE +   + E  E++ D++E+
Sbjct: 97  VPLAIRIRDQLRSVKKDDIKKKKSLIIRQEQIEKARQEREELEERMEWERREEKIDERED 156

Query: 128 EVEEEEEEQKNKKKEEEEEEEKEKVE 153
           + E+E E ++   +E+ ++ E E +E
Sbjct: 157 QEEQEREREEQTIEEQSDDSEHEIIE 182


>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62. 
          Length = 217

 Score = 39.0 bits (91), Expect = 4e-04
 Identities = 19/69 (27%), Positives = 36/69 (52%), Gaps = 2/69 (2%)

Query: 93  EEEEEEEEKEKDEDEEKKREK-KEEEEEKQKD-KKEEEVEEEEEEQKNKKKEEEEEEEKE 150
           E E+ +  K+K   E   +   +++  EK K   K +  E  ++    +KKEE+++ +K+
Sbjct: 17  ESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKKK 76

Query: 151 KVEMYRNPV 159
           KV +  NP 
Sbjct: 77  KVPLQVNPA 85



 Score = 30.5 bits (69), Expect = 0.27
 Identities = 13/42 (30%), Positives = 24/42 (57%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEK 120
            N++ +  EK K   + +  E  +K   +EKK EKK+ +++K
Sbjct: 36  INSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKKKK 77



 Score = 30.1 bits (68), Expect = 0.35
 Identities = 12/72 (16%), Positives = 33/72 (45%)

Query: 89  EKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEE 148
           E   +   +++  EK +   K +  +  ++   ++KKEE+ + ++++   +    +   +
Sbjct: 31  EIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKKKKVPLQVNPAQLFVD 90

Query: 149 KEKVEMYRNPVP 160
            E      +PVP
Sbjct: 91  DEYYVWIYDPVP 102



 Score = 30.1 bits (68), Expect = 0.41
 Identities = 16/72 (22%), Positives = 32/72 (44%), Gaps = 3/72 (4%)

Query: 95  EEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEM 154
           E E+ +  KD+   +   K   +++  +  K     +  E  K   K   +E+++EK + 
Sbjct: 17  ESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVK---KLHSQEKKEEKKKP 73

Query: 155 YRNPVPMYRNPV 166
            +  VP+  NP 
Sbjct: 74  KKKKVPLQVNPA 85



 Score = 28.6 bits (64), Expect = 1.3
 Identities = 10/51 (19%), Positives = 26/51 (50%)

Query: 80  NNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVE 130
           N E   +   +++  E+ +   +    E  K+   +E++E++K  K+++V 
Sbjct: 29  NPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKKKKVP 79


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 39.2 bits (92), Expect = 4e-04
 Identities = 10/78 (12%), Positives = 42/78 (53%)

Query: 80  NNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
            N+ ++++ + ++++ ++++  + D+DE+   +  +++ + + ++K+E  E E+    + 
Sbjct: 127 LNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDD 186

Query: 140 KKEEEEEEEKEKVEMYRN 157
              +E++ E  +      
Sbjct: 187 FVWDEDDSEALRQARKDA 204



 Score = 35.0 bits (81), Expect = 0.013
 Identities = 12/94 (12%), Positives = 47/94 (50%), Gaps = 3/94 (3%)

Query: 60  ANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEE 119
               L          +   +++    ++ +   + ++++++ + D+ ++   +  +++E+
Sbjct: 96  LKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDED 155

Query: 120 KQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
             +D  +++V++E+EE   KK+ +E E+  +  +
Sbjct: 156 DDEDDDDDDVDDEDEE---KKEAKELEKLSDDDD 186



 Score = 29.6 bits (67), Expect = 0.77
 Identities = 8/83 (9%), Positives = 45/83 (54%)

Query: 69  PQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
           P      + L++++  E++   +++++    +     ++    +  +++++   D  +++
Sbjct: 91  PAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDD 150

Query: 129 VEEEEEEQKNKKKEEEEEEEKEK 151
            ++E++++ +   + ++E+E++K
Sbjct: 151 DDDEDDDEDDDDDDVDDEDEEKK 173


>gnl|CDD|215656 pfam00012, HSP70, Hsp70 protein.  Hsp70 chaperones help to fold
           many proteins. Hsp70 assisted folding involves repeated
           cycles of substrate binding and release. Hsp70 activity
           is ATP dependent. Hsp70 proteins are made up of two
           regions: the amino terminus is the ATPase domain and the
           carboxyl terminus is the substrate binding region.
          Length = 598

 Score = 39.1 bits (92), Expect = 5e-04
 Identities = 26/93 (27%), Positives = 47/93 (50%), Gaps = 13/93 (13%)

Query: 72  ISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKK----------REKKEEEEEKQ 121
           I+AS  L +   +E E+  ++ EE   E+K++ E  E K           +  +EE +K 
Sbjct: 495 ITASSGLSD---DEIERMVKDAEEYAAEDKKRKERIEAKNEAEEYVYSLEKSLKEEGDKL 551

Query: 122 KDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEM 154
            +  +++VEE  E  K + + E++EE + K E 
Sbjct: 552 PEADKKKVEEAIEWLKEELEGEDKEEIEAKTEE 584



 Score = 36.8 bits (86), Expect = 0.003
 Identities = 20/82 (24%), Positives = 38/82 (46%), Gaps = 1/82 (1%)

Query: 72  ISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEE 131
           +SA +     E +          ++E E   KD +E    +KK +E  + K++  EE   
Sbjct: 480 VSAKDKGTGKEQKITITASSGLSDDEIERMVKDAEEYAAEDKKRKERIEAKNEA-EEYVY 538

Query: 132 EEEEQKNKKKEEEEEEEKEKVE 153
             E+   ++ ++  E +K+KVE
Sbjct: 539 SLEKSLKEEGDKLPEADKKKVE 560



 Score = 28.7 bits (65), Expect = 1.7
 Identities = 24/83 (28%), Positives = 41/83 (49%), Gaps = 6/83 (7%)

Query: 79  ENNEWEEEEKEKEEEEEEEEE-----EKEKDEDEEKK-REKKEEEEEKQKDKKEEEVEEE 132
           E    +++ KE+ E + E EE     EK   E+ +K     K++ EE  +  KEE   E+
Sbjct: 515 EYAAEDKKRKERIEAKNEAEEYVYSLEKSLKEEGDKLPEADKKKVEEAIEWLKEELEGED 574

Query: 133 EEEQKNKKKEEEEEEEKEKVEMY 155
           +EE + K +E ++  +     MY
Sbjct: 575 KEEIEAKTEELQKVVQPIGERMY 597



 Score = 26.8 bits (60), Expect = 7.5
 Identities = 19/62 (30%), Positives = 31/62 (50%), Gaps = 2/62 (3%)

Query: 70  QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
           + I A    E   +  E+  KEE ++  E +K+K E+       KEE E + K++ E + 
Sbjct: 525 ERIEAKNEAEEYVYSLEKSLKEEGDKLPEADKKKVEE--AIEWLKEELEGEDKEEIEAKT 582

Query: 130 EE 131
           EE
Sbjct: 583 EE 584


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 39.1 bits (92), Expect = 5e-04
 Identities = 21/71 (29%), Positives = 38/71 (53%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
           E  E   E +  E    + EE  ++  +   K+E+  E++EK+   KE+ ++E+EEE + 
Sbjct: 70  ELKERRNELQRLERRLLQREETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEELEE 129

Query: 139 KKKEEEEEEEK 149
              E+ EE E+
Sbjct: 130 LIAEQREELER 140



 Score = 33.7 bits (78), Expect = 0.030
 Identities = 19/70 (27%), Positives = 36/70 (51%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           EE  K   EE ++E E  +K+   E K E  +   E +++ KE   E +  E++  ++EE
Sbjct: 31  EELAKRIIEEAKKEAETLKKEALLEAKEEVHKLRAELERELKERRNELQRLERRLLQREE 90

Query: 144 EEEEEKEKVE 153
             + + E ++
Sbjct: 91  TLDRKMESLD 100



 Score = 32.6 bits (75), Expect = 0.082
 Identities = 31/85 (36%), Positives = 49/85 (57%), Gaps = 7/85 (8%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEE-------EKQKDKKEEE 128
           E+L+  E   E+KEKE   +E+  +++++E EE   E++EE E       E+ K+   EE
Sbjct: 97  ESLDKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQREELERISGLTQEEAKEILLEE 156

Query: 129 VEEEEEEQKNKKKEEEEEEEKEKVE 153
           VEEE   +  K  +E EEE KE+ +
Sbjct: 157 VEEEARHEAAKLIKEIEEEAKEEAD 181



 Score = 31.8 bits (73), Expect = 0.15
 Identities = 24/81 (29%), Positives = 44/81 (54%), Gaps = 6/81 (7%)

Query: 74  ASETLENNEWEEEEKEKE---EEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVE 130
           A   +E  + E E  +KE   E +EE  + + + E E K+R  + +  E++  ++EE ++
Sbjct: 34  AKRIIEEAKKEAETLKKEALLEAKEEVHKLRAELERELKERRNELQRLERRLLQREETLD 93

Query: 131 EEEEEQKNKKKEEEEEEEKEK 151
            + E   +  K+EE  E+KEK
Sbjct: 94  RKME---SLDKKEENLEKKEK 111



 Score = 29.5 bits (67), Expect = 0.81
 Identities = 24/70 (34%), Positives = 37/70 (52%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           EE    K E  +++EE  EK E E   +EK  +E+E++ ++   E  EE E      +EE
Sbjct: 89  EETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQREELERISGLTQEE 148

Query: 144 EEEEEKEKVE 153
            +E   E+VE
Sbjct: 149 AKEILLEEVE 158



 Score = 28.4 bits (64), Expect = 1.8
 Identities = 22/74 (29%), Positives = 40/74 (54%), Gaps = 9/74 (12%)

Query: 86  EEKEKEEEEEEEEEEKE-KDEDEEKKRE-----KKEEEEEKQKDKK---EEEVEEEEEEQ 136
           EE +KE E  ++E   E K+E  + + E     K+   E ++ +++    EE  + + E 
Sbjct: 39  EEAKKEAETLKKEALLEAKEEVHKLRAELERELKERRNELQRLERRLLQREETLDRKMES 98

Query: 137 KNKKKEEEEEEEKE 150
            +KK+E  E++EKE
Sbjct: 99  LDKKEENLEKKEKE 112


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 39.4 bits (91), Expect = 6e-04
 Identities = 27/75 (36%), Positives = 51/75 (68%)

Query: 79   ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
            +  +  E  K++ EE ++ EE K+K+ +E+KK E+ ++ EE+ K K EE  +E EE++K 
Sbjct: 1686 DEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKK 1745

Query: 139  KKKEEEEEEEKEKVE 153
             ++ +++EEEK+K+ 
Sbjct: 1746 AEEAKKDEEEKKKIA 1760



 Score = 39.0 bits (90), Expect = 6e-04
 Identities = 28/87 (32%), Positives = 52/87 (59%)

Query: 70   QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
            + +   E  E  + EE +K +EE + +  EE +K E+++KK E+ ++ EE +K   E   
Sbjct: 1636 EQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALK 1695

Query: 130  EEEEEEQKNKKKEEEEEEEKEKVEMYR 156
            +E EE +K ++ +++E EEK+K E  +
Sbjct: 1696 KEAEEAKKAEELKKKEAEEKKKAEELK 1722



 Score = 37.8 bits (87), Expect = 0.002
 Identities = 25/69 (36%), Positives = 49/69 (71%)

Query: 85   EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
            EE K+ EEE+++ E+ K+K+ +E+KK E+ ++ EE+ K K  EE ++ EE++K  ++ ++
Sbjct: 1623 EELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKK 1682

Query: 145  EEEEKEKVE 153
             EE+++K  
Sbjct: 1683 AEEDEKKAA 1691



 Score = 37.4 bits (86), Expect = 0.002
 Identities = 25/72 (34%), Positives = 46/72 (63%)

Query: 79   ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
            EN     EE +K EE++++ EE +K E++EKK  +  ++E ++  K EE  ++E EE+K 
Sbjct: 1658 ENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKK 1717

Query: 139  KKKEEEEEEEKE 150
             ++ ++ EEE +
Sbjct: 1718 AEELKKAEEENK 1729



 Score = 36.7 bits (84), Expect = 0.004
 Identities = 24/78 (30%), Positives = 49/78 (62%)

Query: 79   ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
            E  + E+ +K++ EE+++ EE K+ +E+ + K  ++ ++ E+ K K EE  + EE+E+K 
Sbjct: 1631 EKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKA 1690

Query: 139  KKKEEEEEEEKEKVEMYR 156
             +  ++E EE +K E  +
Sbjct: 1691 AEALKKEAEEAKKAEELK 1708



 Score = 35.9 bits (82), Expect = 0.007
 Identities = 26/81 (32%), Positives = 48/81 (59%), Gaps = 8/81 (9%)

Query: 84   EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV--------EEEEEE 135
             EE+K+K EE ++ EE+++K  +  KK  ++ ++ E+ K K+ EE          EEE +
Sbjct: 1670 AEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENK 1729

Query: 136  QKNKKKEEEEEEEKEKVEMYR 156
             K ++ ++E EE+K+K E  +
Sbjct: 1730 IKAEEAKKEAEEDKKKAEEAK 1750



 Score = 35.9 bits (82), Expect = 0.007
 Identities = 25/73 (34%), Positives = 48/73 (65%)

Query: 79   ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
            E  +  + E+ K+ EE + + E+ K  +EEKK+ ++ +++E ++ KK EE+++ EEE K 
Sbjct: 1602 EEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKI 1661

Query: 139  KKKEEEEEEEKEK 151
            K  EE ++ E++K
Sbjct: 1662 KAAEEAKKAEEDK 1674



 Score = 34.0 bits (77), Expect = 0.031
 Identities = 19/70 (27%), Positives = 40/70 (57%)

Query: 84   EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
            EE +K+ +  +++ EE K+  E  + + E   +E E  ++K E   +++EE +K     +
Sbjct: 1325 EEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAK 1384

Query: 144  EEEEEKEKVE 153
            ++ EEK+K +
Sbjct: 1385 KKAEEKKKAD 1394



 Score = 32.8 bits (74), Expect = 0.090
 Identities = 28/84 (33%), Positives = 54/84 (64%)

Query: 76   ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
            E  E  + EE +K++ EE+++ EE K+ +E+ + K E+ ++E E+ K K EE  ++EEE+
Sbjct: 1697 EAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEK 1756

Query: 136  QKNKKKEEEEEEEKEKVEMYRNPV 159
            +K    ++EEE++ E++   +  V
Sbjct: 1757 KKIAHLKKEEEKKAEEIRKEKEAV 1780



 Score = 32.4 bits (73), Expect = 0.11
 Identities = 22/72 (30%), Positives = 44/72 (61%)

Query: 85   EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
            EE K+K +E ++  E K+K ++ +K  E K+ +E K+ ++ ++  E ++ E+K K  E +
Sbjct: 1493 EEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELK 1552

Query: 145  EEEEKEKVEMYR 156
            + EE +K E  +
Sbjct: 1553 KAEELKKAEEKK 1564



 Score = 32.0 bits (72), Expect = 0.16
 Identities = 21/70 (30%), Positives = 40/70 (57%)

Query: 84   EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
            EE +K  E +++ EE +K+ D  ++K  E K+  E  + + +    E E  E+K +  E+
Sbjct: 1312 EEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEK 1371

Query: 144  EEEEEKEKVE 153
            ++EE K+K +
Sbjct: 1372 KKEEAKKKAD 1381



 Score = 31.6 bits (71), Expect = 0.17
 Identities = 19/73 (26%), Positives = 41/73 (56%)

Query: 79   ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
            E  + +E +K+ EE +++ +  K+K E+ +K  E  + E E   D+ E   E+ E  +K 
Sbjct: 1313 EAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKK 1372

Query: 139  KKKEEEEEEEKEK 151
            K++ +++ +  +K
Sbjct: 1373 KEEAKKKADAAKK 1385



 Score = 31.6 bits (71), Expect = 0.21
 Identities = 22/84 (26%), Positives = 52/84 (61%)

Query: 70   QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
            +A +A++  E  E + E  EK++EE +++ +  K + EEKK+  + +++ ++  KK +E+
Sbjct: 1351 EAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADEL 1410

Query: 130  EEEEEEQKNKKKEEEEEEEKEKVE 153
            ++    +K   + +++ EEK+K +
Sbjct: 1411 KKAAAAKKKADEAKKKAEEKKKAD 1434



 Score = 31.3 bits (70), Expect = 0.23
 Identities = 27/84 (32%), Positives = 53/84 (63%)

Query: 70   QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
            +A  A E  + +E ++ E+ K+ EE+++ EE +K E+++    +K EE +K ++ + EEV
Sbjct: 1538 EAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEV 1597

Query: 130  EEEEEEQKNKKKEEEEEEEKEKVE 153
             +  EE+K  K EE ++ E+ K++
Sbjct: 1598 MKLYEEEKKMKAEEAKKAEEAKIK 1621



 Score = 31.3 bits (70), Expect = 0.25
 Identities = 20/68 (29%), Positives = 46/68 (67%)

Query: 84   EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
            EE++K+ +E ++    +K+ DE ++K  EKK+ +E K+K ++ ++ +E +++ +  KK E
Sbjct: 1401 EEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAE 1460

Query: 144  EEEEEKEK 151
            E +++ E+
Sbjct: 1461 EAKKKAEE 1468



 Score = 30.9 bits (69), Expect = 0.31
 Identities = 19/75 (25%), Positives = 46/75 (61%)

Query: 79   ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
            E+ +  +E K+    +++ +E K+K E+++K  E K++ EE +K  + ++  EE ++ + 
Sbjct: 1402 EDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEE 1461

Query: 139  KKKEEEEEEEKEKVE 153
             KK+ EE ++ ++ +
Sbjct: 1462 AKKKAEEAKKADEAK 1476



 Score = 30.9 bits (69), Expect = 0.32
 Identities = 28/91 (30%), Positives = 52/91 (57%), Gaps = 4/91 (4%)

Query: 70   QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK----DKK 125
            +A  A +  E  +  +E K+K EE ++ EE K+K E+ +K  E K++ EE +K     KK
Sbjct: 1432 KADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKK 1491

Query: 126  EEEVEEEEEEQKNKKKEEEEEEEKEKVEMYR 156
             EE +++ +E K   + +++ +E +K E  +
Sbjct: 1492 AEEAKKKADEAKKAAEAKKKADEAKKAEEAK 1522



 Score = 30.5 bits (68), Expect = 0.40
 Identities = 22/82 (26%), Positives = 48/82 (58%), Gaps = 4/82 (4%)

Query: 79   ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK----DKKEEEVEEEEE 134
               + +  +K+ EE+++ +E +K+ +ED++K  E K+    K+K     KK EE ++ +E
Sbjct: 1376 AKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADE 1435

Query: 135  EQKNKKKEEEEEEEKEKVEMYR 156
             +K  ++ ++ +E K+K E  +
Sbjct: 1436 AKKKAEEAKKADEAKKKAEEAK 1457



 Score = 30.5 bits (68), Expect = 0.44
 Identities = 26/75 (34%), Positives = 49/75 (65%)

Query: 79   ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
            EN    EE K++ EE++++ EE +KDE+E+KK    ++EEEK+ ++  +E E   EE+ +
Sbjct: 1727 ENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELD 1786

Query: 139  KKKEEEEEEEKEKVE 153
            ++ E+   E  +K++
Sbjct: 1787 EEDEKRRMEVDKKIK 1801



 Score = 30.1 bits (67), Expect = 0.55
 Identities = 19/68 (27%), Positives = 39/68 (57%)

Query: 86   EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
            +E +K EE+++ +E K+K E+ +K  E K++ EE +K     + + EE ++  +  + E 
Sbjct: 1293 DEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEA 1352

Query: 146  EEEKEKVE 153
            E   ++ E
Sbjct: 1353 EAAADEAE 1360



 Score = 30.1 bits (67), Expect = 0.58
 Identities = 21/74 (28%), Positives = 47/74 (63%)

Query: 84   EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
            +  E +K+ +E ++ EE +K ++ +K  E K+ +E K+ ++K++  E ++ E+  K +E+
Sbjct: 1504 KAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEK 1563

Query: 144  EEEEEKEKVEMYRN 157
            ++ EE +K E  +N
Sbjct: 1564 KKAEEAKKAEEDKN 1577



 Score = 30.1 bits (67), Expect = 0.59
 Identities = 26/82 (31%), Positives = 50/82 (60%)

Query: 75   SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
            +E  +N    + E+ K+ EE   EE  +  E+E+K + ++ ++ E+ K K EE  + EEE
Sbjct: 1572 AEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEE 1631

Query: 135  EQKNKKKEEEEEEEKEKVEMYR 156
            ++K ++ +++E EEK+K E  +
Sbjct: 1632 KKKVEQLKKKEAEEKKKAEELK 1653



 Score = 30.1 bits (67), Expect = 0.67
 Identities = 25/70 (35%), Positives = 45/70 (64%), Gaps = 1/70 (1%)

Query: 84   EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
             EE K K EE ++ EEEK+K E + KK+E +E+++ ++  K EEE + +  E+  K +E+
Sbjct: 1615 AEEAKIKAEELKKAEEEKKKVE-QLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEED 1673

Query: 144  EEEEEKEKVE 153
            +++ E+ K  
Sbjct: 1674 KKKAEEAKKA 1683



 Score = 30.1 bits (67), Expect = 0.67
 Identities = 25/75 (33%), Positives = 44/75 (58%), Gaps = 3/75 (4%)

Query: 85   EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK---DKKEEEVEEEEEEQKNKKK 141
            EE K+ EE +++ EE K+ DE ++K  E K+ +E K+K    KK+ +  ++  E K K  
Sbjct: 1454 EEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKAD 1513

Query: 142  EEEEEEEKEKVEMYR 156
            E ++ EE +K +  +
Sbjct: 1514 EAKKAEEAKKADEAK 1528



 Score = 30.1 bits (67), Expect = 0.70
 Identities = 23/73 (31%), Positives = 45/73 (61%), Gaps = 3/73 (4%)

Query: 84   EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK---DKKEEEVEEEEEEQKNKK 140
               +K+ +E +++ EE+K+ DE ++K  E K+ +E K+K    KK EE +++ EE K   
Sbjct: 1414 AAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKAD 1473

Query: 141  KEEEEEEEKEKVE 153
            + +++ EE +K +
Sbjct: 1474 EAKKKAEEAKKAD 1486



 Score = 30.1 bits (67), Expect = 0.71
 Identities = 21/69 (30%), Positives = 47/69 (68%)

Query: 85   EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
            EE+K+ +E +++ EE K+ DE ++K  E K+ EE K+K ++ ++ +E +++ +  KK +E
Sbjct: 1428 EEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADE 1487

Query: 145  EEEEKEKVE 153
             +++ E+ +
Sbjct: 1488 AKKKAEEAK 1496



 Score = 29.7 bits (66), Expect = 0.73
 Identities = 20/72 (27%), Positives = 46/72 (63%)

Query: 82   EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
            E ++  +  + E E   +E E  E++ +  EKK+EE +K+ D  +++ EE+++  + KKK
Sbjct: 1340 EAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKK 1399

Query: 142  EEEEEEEKEKVE 153
             EE++++ ++++
Sbjct: 1400 AEEDKKKADELK 1411



 Score = 29.7 bits (66), Expect = 0.75
 Identities = 21/73 (28%), Positives = 45/73 (61%)

Query: 79   ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
            E  + +E +K+ EE ++ +E +K+ +E ++K  E K+  E K+K  + ++ EE ++  + 
Sbjct: 1468 EAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEA 1527

Query: 139  KKKEEEEEEEKEK 151
            KK EE ++ ++ K
Sbjct: 1528 KKAEEAKKADEAK 1540



 Score = 29.7 bits (66), Expect = 0.80
 Identities = 22/82 (26%), Positives = 52/82 (63%)

Query: 68   LPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEE 127
            L +A +A +  +  + + EEK+K +E +++ EE +K ++ +KK E+ ++ EE +K  +E 
Sbjct: 1410 LKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEA 1469

Query: 128  EVEEEEEEQKNKKKEEEEEEEK 149
            +  +E +++  + K+ +E ++K
Sbjct: 1470 KKADEAKKKAEEAKKADEAKKK 1491



 Score = 29.7 bits (66), Expect = 0.83
 Identities = 25/80 (31%), Positives = 47/80 (58%)

Query: 70   QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
            +A  A +  E  +  +E K+K EE ++ +E K+K E+ +K  E K++ EE +K  + ++ 
Sbjct: 1419 KADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKK 1478

Query: 130  EEEEEEQKNKKKEEEEEEEK 149
             EE ++    KK+ EE ++K
Sbjct: 1479 AEEAKKADEAKKKAEEAKKK 1498



 Score = 29.7 bits (66), Expect = 0.88
 Identities = 24/70 (34%), Positives = 45/70 (64%)

Query: 84   EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
             EE+K+K EE +++EEEK+K    +K+ EKK EE  K+K+   EE  +EE+E++  + ++
Sbjct: 1739 AEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDK 1798

Query: 144  EEEEEKEKVE 153
            + ++  +   
Sbjct: 1799 KIKDIFDNFA 1808



 Score = 29.3 bits (65), Expect = 1.1
 Identities = 26/83 (31%), Positives = 53/83 (63%), Gaps = 2/83 (2%)

Query: 70   QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKRE--KKEEEEEKQKDKKEE 127
            +A +A +  E  +  +E K+K EE++++ +E +K    +KK +  KK+ EE+K+ D+ ++
Sbjct: 1379 KADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKK 1438

Query: 128  EVEEEEEEQKNKKKEEEEEEEKE 150
            + EE ++  + KKK EE ++ +E
Sbjct: 1439 KAEEAKKADEAKKKAEEAKKAEE 1461



 Score = 29.0 bits (64), Expect = 1.4
 Identities = 32/82 (39%), Positives = 50/82 (60%), Gaps = 9/82 (10%)

Query: 84   EEEEKEKEEEEEEEEEE---------KEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
            E EEK+K EE ++ EEE         KE +ED++K  E K++EEEK+K    ++ EE++ 
Sbjct: 1711 EAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKA 1770

Query: 135  EQKNKKKEEEEEEEKEKVEMYR 156
            E+  K+KE   EEE ++ +  R
Sbjct: 1771 EEIRKEKEAVIEEELDEEDEKR 1792



 Score = 29.0 bits (64), Expect = 1.5
 Identities = 23/78 (29%), Positives = 45/78 (57%)

Query: 79   ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
            E  + EE +K +E++     + +E  + EE + E+  +  E++K  K EE ++ EE +  
Sbjct: 1562 EKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIK 1621

Query: 139  KKKEEEEEEEKEKVEMYR 156
             ++ ++ EEEK+KVE  +
Sbjct: 1622 AEELKKAEEEKKKVEQLK 1639



 Score = 27.4 bits (60), Expect = 4.4
 Identities = 24/81 (29%), Positives = 50/81 (61%)

Query: 70   QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
            +A  A E  + +E ++ E++K+ +E ++ EE +K E+++K  E K+ EE+K    ++ E 
Sbjct: 1526 EAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEE 1585

Query: 130  EEEEEEQKNKKKEEEEEEEKE 150
             ++ EE + ++  +  EEEK+
Sbjct: 1586 AKKAEEARIEEVMKLYEEEKK 1606



 Score = 27.4 bits (60), Expect = 4.7
 Identities = 21/79 (26%), Positives = 50/79 (63%)

Query: 70   QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
            +A  A +  E  +  +E K+ EE ++ +E +K ++  +  + +K EE+++  + KK EE+
Sbjct: 1498 KADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEEL 1557

Query: 130  EEEEEEQKNKKKEEEEEEE 148
            ++ EE++K ++ ++ EE++
Sbjct: 1558 KKAEEKKKAEEAKKAEEDK 1576



 Score = 27.0 bits (59), Expect = 5.7
 Identities = 23/78 (29%), Positives = 47/78 (60%), Gaps = 1/78 (1%)

Query: 79   ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
            E  +  +E K+  E +++ +E K K E+ +K  E K+ EE K+ D+ ++  E+++ ++  
Sbjct: 1494 EAKKKADEAKKAAEAKKKADEAK-KAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELK 1552

Query: 139  KKKEEEEEEEKEKVEMYR 156
            K +E ++ EEK+K E  +
Sbjct: 1553 KAEELKKAEEKKKAEEAK 1570



 Score = 27.0 bits (59), Expect = 5.8
 Identities = 19/72 (26%), Positives = 36/72 (50%)

Query: 82   EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
            +++ + KE    +E  EE   K E+ +K    K EE  K ++ K++  +  + E+  K +
Sbjct: 1078 DFDFDAKEDNRADEATEEAFGKAEEAKKTETGKAEEARKAEEAKKKAEDARKAEEARKAE 1137

Query: 142  EEEEEEEKEKVE 153
            +  + EE  K E
Sbjct: 1138 DARKAEEARKAE 1149



 Score = 26.6 bits (58), Expect = 8.1
 Identities = 26/77 (33%), Positives = 43/77 (55%), Gaps = 9/77 (11%)

Query: 86   EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK----DKKEEEVEEEEEEQKN--- 138
            EEK+K +E ++ EE+K+ DE ++K  E K+ +E K+K     KK +  +++ EE K    
Sbjct: 1287 EEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAE 1346

Query: 139  --KKKEEEEEEEKEKVE 153
              K + E   +E E  E
Sbjct: 1347 AAKAEAEAAADEAEAAE 1363


>gnl|CDD|214653 smart00410, IG_like, Immunoglobulin like.  IG domains that cannot
          be classified into one of IGv1, IGc1, IGc2, IG.
          Length = 85

 Score = 36.7 bits (85), Expect = 7e-04
 Identities = 10/44 (22%), Positives = 13/44 (29%), Gaps = 2/44 (4%)

Query: 23 GDSSHGVYNLRIRNVSLTDDAEFQCQVGPAQYHKPIRANARLTV 66
             S     L I NV+  D   + C            +   LTV
Sbjct: 44 VSRSGSTSTLTISNVTPEDSGTYTCAATN--SSGSASSGTTLTV 85


>gnl|CDD|214652 smart00409, IG, Immunoglobulin. 
          Length = 85

 Score = 36.7 bits (85), Expect = 7e-04
 Identities = 10/44 (22%), Positives = 13/44 (29%), Gaps = 2/44 (4%)

Query: 23 GDSSHGVYNLRIRNVSLTDDAEFQCQVGPAQYHKPIRANARLTV 66
             S     L I NV+  D   + C            +   LTV
Sbjct: 44 VSRSGSTSTLTISNVTPEDSGTYTCAATN--SSGSASSGTTLTV 85


>gnl|CDD|227492 COG5163, NOP7, Protein required for biogenesis of the 60S ribosomal
           subunit [Translation, ribosomal structure and
           biogenesis].
          Length = 591

 Score = 38.5 bits (89), Expect = 8e-04
 Identities = 22/83 (26%), Positives = 44/83 (53%)

Query: 72  ISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEE 131
           +   E  + +  E+     E+   E    +E D+D+E+ + +KE E E Q  K  E  E 
Sbjct: 462 LMTMEETQRHSEEDLVNRFEDVRYEHVAGEEDDDDDEELQAQKELELEAQGIKYSETSEA 521

Query: 132 EEEEQKNKKKEEEEEEEKEKVEM 154
           +++  K+K K+ + +EE+E+ ++
Sbjct: 522 DKDVNKSKNKKRKVDEEEEEKKL 544



 Score = 31.2 bits (70), Expect = 0.22
 Identities = 25/84 (29%), Positives = 45/84 (53%), Gaps = 4/84 (4%)

Query: 70  QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
           Q I  SET E ++   + K K+ + +EEEEEK+        ++KK  ++ K  + K    
Sbjct: 511 QGIKYSETSEADKDVNKSKNKKRKVDEEEEEKKLKMIMMSNKQKKLYKKMKYSNAK---- 566

Query: 130 EEEEEEQKNKKKEEEEEEEKEKVE 153
           +EE+ E   KKK++  +++K   +
Sbjct: 567 KEEQAENLKKKKKQIAKQKKLDSK 590


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 38.4 bits (90), Expect = 9e-04
 Identities = 19/66 (28%), Positives = 33/66 (50%), Gaps = 4/66 (6%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
           E E+E+ E       E++K   +++K EKK E+EE +K   +++     E    K K  +
Sbjct: 395 EGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKA----EAAAKKAKGPD 450

Query: 145 EEEEKE 150
            E +K 
Sbjct: 451 GETKKV 456



 Score = 34.5 bits (80), Expect = 0.019
 Identities = 16/68 (23%), Positives = 31/68 (45%), Gaps = 1/68 (1%)

Query: 69  PQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
             A    E  EN      E++K  +++ + E+K + E E +K   K++ E   K  K  +
Sbjct: 392 LLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKE-EAEKAAAKKKAEAAAKKAKGPD 450

Query: 129 VEEEEEEQ 136
            E ++ + 
Sbjct: 451 GETKKVDP 458


>gnl|CDD|218538 pfam05285, SDA1, SDA1.  This family consists of several SDA1
           protein homologues. SDA1 is a Saccharomyces cerevisiae
           protein which is involved in the control of the actin
           cytoskeleton. The protein is essential for cell
           viability and is localised in the nucleus.
          Length = 317

 Score = 38.1 bits (89), Expect = 0.001
 Identities = 27/73 (36%), Positives = 40/73 (54%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
           E  EWE EE E  ++E E  + +   E E    E +EE++E  K  KE+  EE  EE + 
Sbjct: 100 EEEEWEVEEDEDSDDEGEWIDVESDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEE 159

Query: 139 KKKEEEEEEEKEK 151
           +  EEEE E +++
Sbjct: 160 EAAEEEEAEAEKE 172



 Score = 36.9 bits (86), Expect = 0.002
 Identities = 22/81 (27%), Positives = 49/81 (60%)

Query: 73  SASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
            A + LE+++ ++EE+E E EE+E+ +++ +  D E  +E +  + E +++K E   + +
Sbjct: 87  EAEQGLESDDDDDEEEEWEVEEDEDSDDEGEWIDVESDKEIESSDSEDEEEKDEAAKKAK 146

Query: 133 EEEQKNKKKEEEEEEEKEKVE 153
           E+  +   +E+EEE  +E+  
Sbjct: 147 EDSDEELSEEDEEEAAEEEEA 167



 Score = 36.9 bits (86), Expect = 0.002
 Identities = 24/79 (30%), Positives = 51/79 (64%)

Query: 75  SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
               ++++ EEEE E EE+E+ ++E +  D + +K+ E  + E+E++KD+  ++ +E+ +
Sbjct: 91  GLESDDDDDEEEEWEVEEDEDSDDEGEWIDVESDKEIESSDSEDEEEKDEAAKKAKEDSD 150

Query: 135 EQKNKKKEEEEEEEKEKVE 153
           E+ +++ EEE  EE+E   
Sbjct: 151 EELSEEDEEEAAEEEEAEA 169



 Score = 33.9 bits (78), Expect = 0.025
 Identities = 24/61 (39%), Positives = 38/61 (62%), Gaps = 1/61 (1%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
           +  EW + E +KE E  + E+E+EKDE   KK ++  +EE  ++D++E   EEE E +K 
Sbjct: 114 DEGEWIDVESDKEIESSDSEDEEEKDE-AAKKAKEDSDEELSEEDEEEAAEEEEAEAEKE 172

Query: 139 K 139
           K
Sbjct: 173 K 173



 Score = 33.5 bits (77), Expect = 0.036
 Identities = 27/78 (34%), Positives = 47/78 (60%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
           E  +  E E+  +  ++++EEEE E E+DED + + E  + E +K+ +  + E EEE++E
Sbjct: 81  EERKKKEAEQGLESDDDDDEEEEWEVEEDEDSDDEGEWIDVESDKEIESSDSEDEEEKDE 140

Query: 136 QKNKKKEEEEEEEKEKVE 153
              K KE+ +EE  E+ E
Sbjct: 141 AAKKAKEDSDEELSEEDE 158



 Score = 28.5 bits (64), Expect = 1.6
 Identities = 17/50 (34%), Positives = 31/50 (62%)

Query: 73  SASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK 122
                  ++E EEE+ E  ++ +E+ +E+  +EDEE+  E++E E EK+K
Sbjct: 124 DKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEK 173



 Score = 27.7 bits (62), Expect = 2.6
 Identities = 13/50 (26%), Positives = 33/50 (66%)

Query: 73  SASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK 122
           S  E   ++  +EEEK++  ++ +E+ ++E  E++E++  ++EE E +++
Sbjct: 123 SDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKE 172


>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
          Length = 211

 Score = 37.8 bits (88), Expect = 0.001
 Identities = 21/79 (26%), Positives = 49/79 (62%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
           E  + + E  E++  +E E +E++K ++E+ + E+ E+EE  +  ++  EV+ EE + +N
Sbjct: 4   ECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDEN 63

Query: 139 KKKEEEEEEEKEKVEMYRN 157
            K +EE ++ + ++E  ++
Sbjct: 64  NKLKEENKKLENELEALKD 82



 Score = 35.9 bits (83), Expect = 0.004
 Identities = 23/76 (30%), Positives = 44/76 (57%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
           E    + + +  EE+  +E E KE+D+ +E+  E +E E+E+  +  EE  E + EE K+
Sbjct: 2   EKECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKD 61

Query: 139 KKKEEEEEEEKEKVEM 154
           +  + +EE +K + E+
Sbjct: 62  ENNKLKEENKKLENEL 77



 Score = 32.8 bits (75), Expect = 0.058
 Identities = 26/74 (35%), Positives = 43/74 (58%), Gaps = 5/74 (6%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK-----N 138
           E  E++  +E E +EE+K K+ED E +  +KEE  E  ++  E ++EE ++E       N
Sbjct: 11  ENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDENNKLKEEN 70

Query: 139 KKKEEEEEEEKEKV 152
           KK E E E  K+++
Sbjct: 71  KKLENELEALKDRL 84



 Score = 32.4 bits (74), Expect = 0.063
 Identities = 22/69 (31%), Positives = 43/69 (62%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
            E++ K+ + E  EE+  K+ + +++ + KEE+ E ++ +KEE +E+ EE  + K +E +
Sbjct: 1   MEKECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELK 60

Query: 145 EEEEKEKVE 153
           +E  K K E
Sbjct: 61  DENNKLKEE 69



 Score = 29.3 bits (66), Expect = 0.70
 Identities = 18/53 (33%), Positives = 28/53 (52%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
           E     E  E E+ ++EE  E+ EE  + + EE K E  + +EE +K + E E
Sbjct: 26  EDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDENNKLKEENKKLENELE 78


>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
          Length = 1068

 Score = 38.1 bits (89), Expect = 0.001
 Identities = 15/77 (19%), Positives = 41/77 (53%)

Query: 80  NNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
             + +++  E  E ++ E  EK + +DE+++  ++E +  +  +K++ + E +    + +
Sbjct: 643 RRQAQQQTAETRESQQAEVTEKARTQDEQQQAPRRERQRRRNDEKRQAQQEAKALNVEEQ 702

Query: 140 KKEEEEEEEKEKVEMYR 156
             +E E+EE+ +    R
Sbjct: 703 SVQETEQEERVQQVQPR 719



 Score = 32.3 bits (74), Expect = 0.12
 Identities = 11/58 (18%), Positives = 29/58 (50%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQ 136
                + E  EK   ++E+++   ++    +  EK++ ++E +    EE+  +E E++
Sbjct: 653 TRESQQAEVTEKARTQDEQQQAPRRERQRRRNDEKRQAQQEAKALNVEEQSVQETEQE 710



 Score = 31.5 bits (72), Expect = 0.18
 Identities = 14/63 (22%), Positives = 36/63 (57%), Gaps = 1/63 (1%)

Query: 74  ASETLENNEWEEEEKEKEEEEEEEEEEKEKD-EDEEKKREKKEEEEEKQKDKKEEEVEEE 132
            +ET E+ + E  EK + ++E+++   +E+     ++KR+ ++E +    +++  +  E+
Sbjct: 650 TAETRESQQAEVTEKARTQDEQQQAPRRERQRRRNDEKRQAQQEAKALNVEEQSVQETEQ 709

Query: 133 EEE 135
           EE 
Sbjct: 710 EER 712



 Score = 31.2 bits (71), Expect = 0.27
 Identities = 22/99 (22%), Positives = 35/99 (35%)

Query: 69   PQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
            PQ I+ S+     E  E  +   E ++E  + +E  E  E    + E   +       E 
Sbjct: 928  PQVITESDVAVAQEVAEHAEPVVEPQDETADIEEAAETAEVVVAEPEVVAQPAAPVVAEV 987

Query: 129  VEEEEEEQKNKKKEEEEEEEKEKVEMYRNPVPMYRNPVP 167
              E E     + +    +  +  VE      PM R P P
Sbjct: 988  AAEVETVTAVEPEVAPAQVPEATVEHNHATAPMTRAPAP 1026



 Score = 30.4 bits (69), Expect = 0.45
 Identities = 16/78 (20%), Positives = 39/78 (50%), Gaps = 7/78 (8%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDE-EKKREKKEEEEEKQKDKKEEEVEEEEE 134
           +  +      E ++ E  E+   +++++     E++R + +E+ + Q++ K   VEE+  
Sbjct: 645 QAQQQTAETRESQQAEVTEKARTQDEQQQAPRRERQRRRNDEKRQAQQEAKALNVEEQSV 704

Query: 135 EQKNKKKEEEEEEEKEKV 152
           +      E E+EE  ++V
Sbjct: 705 Q------ETEQEERVQQV 716



 Score = 28.5 bits (64), Expect = 1.8
 Identities = 16/70 (22%), Positives = 36/70 (51%), Gaps = 5/70 (7%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEK-----KREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
            EE K +E+   + E + E+ +D  K     +R++ E  + +    + E  E  EE ++N
Sbjct: 583 GEETKPQEQPAPKAEAKPERQQDRRKPRQNNRRDRNERRDTRDNRTRREGRENREENRRN 642

Query: 139 KKKEEEEEEE 148
           +++ +++  E
Sbjct: 643 RRQAQQQTAE 652


>gnl|CDD|217829 pfam03985, Paf1, Paf1.  Members of this family are components of
           the RNA polymerase II associated Paf1 complex. The Paf1
           complex functions during the elongation phase of
           transcription in conjunction with Spt4-Spt5 and
           Spt16-Pob3i.
          Length = 431

 Score = 37.8 bits (88), Expect = 0.001
 Identities = 14/65 (21%), Positives = 32/65 (49%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           + EE +++E+EEEE+   E +E+E +  E++  +  +    +       + E K  K+  
Sbjct: 366 DFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESA 425

Query: 144 EEEEE 148
            + + 
Sbjct: 426 SDSDS 430



 Score = 37.8 bits (88), Expect = 0.001
 Identities = 15/70 (21%), Positives = 38/70 (54%)

Query: 78  LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
           L+  ++EE +++++EEEE+  +E E++E E+ + E  +  E+   +   +   + E +  
Sbjct: 362 LDPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKAD 421

Query: 138 NKKKEEEEEE 147
            +   + + E
Sbjct: 422 KESASDSDSE 431



 Score = 37.8 bits (88), Expect = 0.001
 Identities = 16/65 (24%), Positives = 33/65 (50%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
            EE  E E+EEEE+  ++ ++E+ E   E+  +  E    +   +V  + E + +K+   
Sbjct: 367 FEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESAS 426

Query: 144 EEEEE 148
           + + E
Sbjct: 427 DSDSE 431



 Score = 36.6 bits (85), Expect = 0.004
 Identities = 18/69 (26%), Positives = 35/69 (50%)

Query: 82  EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
             +  + E+ +E+E+EEEE+  DE EE++ E  EEE  + ++    E   +       K 
Sbjct: 361 RLDPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKA 420

Query: 142 EEEEEEEKE 150
           ++E   + +
Sbjct: 421 DKESASDSD 429



 Score = 35.5 bits (82), Expect = 0.008
 Identities = 15/66 (22%), Positives = 34/66 (51%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
           + E+  E+E+EEEE+  ++ E+EE +  ++E  + ++    E   +   + +    KE  
Sbjct: 366 DFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESA 425

Query: 145 EEEEKE 150
            + + E
Sbjct: 426 SDSDSE 431



 Score = 34.7 bits (80), Expect = 0.013
 Identities = 16/67 (23%), Positives = 28/67 (41%)

Query: 69  PQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
           P      +  E+ E E+   E EEEE E+ EE+     E+   E   +     + K ++E
Sbjct: 364 PIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKE 423

Query: 129 VEEEEEE 135
              + + 
Sbjct: 424 SASDSDS 430



 Score = 33.2 bits (76), Expect = 0.049
 Identities = 17/50 (34%), Positives = 29/50 (58%)

Query: 104 DEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
              E K R+K+    +    ++ +E E+EEEEQ++ + EEEE E+ E+  
Sbjct: 348 STKESKMRDKRRARLDPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEG 397



 Score = 33.2 bits (76), Expect = 0.051
 Identities = 15/65 (23%), Positives = 30/65 (46%)

Query: 89  EKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEE 148
               +  + EE  E +++EE++R  + EEEE +  ++E     E+   ++      + E 
Sbjct: 359 RARLDPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSES 418

Query: 149 KEKVE 153
           K   E
Sbjct: 419 KADKE 423



 Score = 31.6 bits (72), Expect = 0.15
 Identities = 15/60 (25%), Positives = 26/60 (43%), Gaps = 1/60 (1%)

Query: 76  ETLENNEWEEEEKEKEEEEE-EEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
           E  +  E +  ++ +EEE E  EEE  +  ED   +       + + K  KE   + + E
Sbjct: 372 EDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESASDSDSE 431


>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
           Mitofilin controls mitochondrial cristae morphology.
           Mitofilin is enriched in the narrow space between the
           inner boundary and the outer membranes, where it forms a
           homotypic interaction and assembles into a large
           multimeric protein complex. The first 78 amino acids
           contain a typical amino-terminal-cleavable mitochondrial
           presequence rich in positive-charged and hydroxylated
           residues and a membrane anchor domain. In addition, it
           has three centrally located coiled coil domains.
          Length = 493

 Score = 37.7 bits (88), Expect = 0.001
 Identities = 25/73 (34%), Positives = 42/73 (57%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
           E ++   E++   E E+EK+E  +K  EK  +E E+Q +  E++++ E   Q  + + E 
Sbjct: 214 ESKEAALEKQLRLEFEREKEELRKKYEEKLRQELERQAEAHEQKLKNELALQAIELQREF 273

Query: 145 EEEEKEKVEMYRN 157
            +E KEKVE  RN
Sbjct: 274 NKEIKEKVEEERN 286



 Score = 32.3 bits (74), Expect = 0.085
 Identities = 23/70 (32%), Positives = 37/70 (52%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           EEEE E+  +E+ EE   + +E+   + E KE   EKQ   + E  +EE  ++  +K  +
Sbjct: 186 EEEELERALKEKREELLSKLEEELLARLESKEAALEKQLRLEFEREKEELRKKYEEKLRQ 245

Query: 144 EEEEEKEKVE 153
           E E + E  E
Sbjct: 246 ELERQAEAHE 255



 Score = 30.8 bits (70), Expect = 0.36
 Identities = 18/64 (28%), Positives = 33/64 (51%), Gaps = 2/64 (3%)

Query: 87  EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEE 146
           E + EEEEE E   KEK E+   K E++       + K+    ++   E + +K+E  ++
Sbjct: 181 ELKAEEEEELERALKEKREELLSKLEEELLAR--LESKEAALEKQLRLEFEREKEELRKK 238

Query: 147 EEKE 150
            E++
Sbjct: 239 YEEK 242



 Score = 30.4 bits (69), Expect = 0.44
 Identities = 15/56 (26%), Positives = 30/56 (53%)

Query: 102 EKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYRN 157
           E++E E   +EK+EE   K +++    +E +E   + + + E E E++E  + Y  
Sbjct: 186 EEEELERALKEKREELLSKLEEELLARLESKEAALEKQLRLEFEREKEELRKKYEE 241


>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein.  This is a family of
           fungal and plant proteins and contains many hypothetical
           proteins. VID27 is a cytoplasmic protein that plays a
           potential role in vacuolar protein degradation.
          Length = 794

 Score = 37.8 bits (88), Expect = 0.002
 Identities = 22/55 (40%), Positives = 32/55 (58%)

Query: 74  ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
           A   LE  +   E  ++EEE+EEEEEE+++DE   K+    EE EE   + K E+
Sbjct: 375 AFSALEIEDANTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVESKYED 429



 Score = 37.4 bits (87), Expect = 0.002
 Identities = 27/78 (34%), Positives = 45/78 (57%), Gaps = 7/78 (8%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
           ETL   +W +  KE E++   +     + ED   +R+ +EEE+E      EEE EE+E+E
Sbjct: 354 ETLNKQKWTKA-KETEQDYILDAFSALEIEDANTERDDEEEEDE------EEEEEEDEDE 406

Query: 136 QKNKKKEEEEEEEKEKVE 153
             +K+  ++EE E++ VE
Sbjct: 407 GPSKEHSDDEEFEEDDVE 424



 Score = 33.2 bits (76), Expect = 0.051
 Identities = 18/47 (38%), Positives = 29/47 (61%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK 122
            T  ++E EE+E+E+EEE+E+E   KE  +DEE + +  E + E   
Sbjct: 385 NTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVESKYEDSD 431



 Score = 31.3 bits (71), Expect = 0.22
 Identities = 18/67 (26%), Positives = 39/67 (58%), Gaps = 4/67 (5%)

Query: 74  ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
             +     E E+   E+++EEEE+EEE+E+++++E   ++  ++EE +    E++VE + 
Sbjct: 372 ILDAFSALEIEDANTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFE----EDDVESKY 427

Query: 134 EEQKNKK 140
           E+     
Sbjct: 428 EDSDGNS 434



 Score = 28.6 bits (64), Expect = 1.7
 Identities = 14/58 (24%), Positives = 30/58 (51%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
              E E    E ++EEE++++E+EE+  ++   +E    ++ EE+  E + E  +   
Sbjct: 377 SALEIEDANTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVESKYEDSDGNS 434


>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 926

 Score = 38.0 bits (88), Expect = 0.002
 Identities = 14/55 (25%), Positives = 26/55 (47%), Gaps = 3/55 (5%)

Query: 97  EEEEKEK-DEDEEKKREKKEEEEEKQKDKKE--EEVEEEEEEQKNKKKEEEEEEE 148
           +++ KEK   D+E K E    E E ++   +  EE   +     N KK++  + +
Sbjct: 798 KQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAPAILNVKKKKPYKVD 852



 Score = 34.9 bits (80), Expect = 0.017
 Identities = 18/85 (21%), Positives = 36/85 (42%), Gaps = 2/85 (2%)

Query: 51  PAQYHKPIRANARL--TVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEE 108
           P  Y +    + R   T +L   I A+   +    E+   +KE + E    E E +    
Sbjct: 768 PRMYEEASGKSVRKVNTAVLSTTIKAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTI 827

Query: 109 KKREKKEEEEEKQKDKKEEEVEEEE 133
           K+RE+K  +     + K+++  + +
Sbjct: 828 KQREEKGIDAPAILNVKKKKPYKVD 852



 Score = 31.8 bits (72), Expect = 0.14
 Identities = 12/48 (25%), Positives = 23/48 (47%), Gaps = 2/48 (4%)

Query: 101 KEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEE 148
           K     ++K +EK   ++E + +     VE E E    K++EE+  + 
Sbjct: 792 KAAARAKQKPKEKGPNDKEIKIES--PSVETEGERCTIKQREEKGIDA 837



 Score = 28.0 bits (62), Expect = 2.5
 Identities = 10/45 (22%), Positives = 22/45 (48%)

Query: 106 DEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
             +     K++ +EK  + KE ++E    E + ++   ++ EEK 
Sbjct: 790 TIKAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKG 834


>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
           partitioning / Cytoskeleton].
          Length = 373

 Score = 37.3 bits (87), Expect = 0.002
 Identities = 21/74 (28%), Positives = 41/74 (55%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
            + E   +E  +    EEE E K+K  ++ +++EK+ EE E+   ++ +E+  + EE + 
Sbjct: 297 NSGEPSLKEIHEARLNEEERELKKKFTEKIREKEKRLEELEQNLIEERKELNSKLEEIQK 356

Query: 139 KKKEEEEEEEKEKV 152
           K ++ E+  EK K 
Sbjct: 357 KLEDLEKRLEKLKS 370



 Score = 31.9 bits (73), Expect = 0.13
 Identities = 15/66 (22%), Positives = 39/66 (59%)

Query: 86  EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
           E +  EEE E +++  EK  ++EK+ E+ E+   +++ +   ++EE +++ ++ +K  E+
Sbjct: 308 EARLNEEERELKKKFTEKIREKEKRLEELEQNLIEERKELNSKLEEIQKKLEDLEKRLEK 367

Query: 146 EEEKEK 151
            +  + 
Sbjct: 368 LKSNKS 373


>gnl|CDD|200340 TIGR03927, T7SS_EssA_Firm, type VII secretion protein EssA.
           Members of this family are associated with type VII
           secretion of WXG100 family targets in the Firmicutes,
           but not in the Actinobacteria. This highly divergent
           protein family consists largely of a central region of
           highly polar low-complexity sequence containing
           occasional LF motifs in weak repeats about 17 residues
           in length, flanked by hydrophobic N- and C-terminal
           regions [Protein fate, Protein and peptide secretion and
           trafficking].
          Length = 150

 Score = 36.6 bits (85), Expect = 0.002
 Identities = 14/62 (22%), Positives = 31/62 (50%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
            E   +  +EE E ++E    E+++K   +K +E+ +Q++ K +   E   E    K  +
Sbjct: 36  IEINTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEELKNQLFSENATENNTVKATK 95

Query: 144 EE 145
           ++
Sbjct: 96  KQ 97



 Score = 27.3 bits (61), Expect = 2.6
 Identities = 11/48 (22%), Positives = 25/48 (52%)

Query: 99  EEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEE 146
             ++KD +      ++E E +K+    EE+ +   ++ K K ++EE +
Sbjct: 30  RYEKKDIEINTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEELK 77



 Score = 27.0 bits (60), Expect = 3.8
 Identities = 17/60 (28%), Positives = 30/60 (50%), Gaps = 3/60 (5%)

Query: 95  EEEEEEKEKDEDEEKKREKKEEE---EEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
           E + + + K    EKK  +   +   EE + DK+    EE+++    K KE+ E+EE + 
Sbjct: 19  ENDGKLQVKPNRYEKKDIEINTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEELKN 78



 Score = 26.6 bits (59), Expect = 4.4
 Identities = 15/72 (20%), Positives = 30/72 (41%), Gaps = 8/72 (11%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEE--------KKREKKEEEEEKQKDKKEE 127
           + L   E +++   ++ +E+ E+EE +     E        K  +K+    E ++     
Sbjct: 51  KELFTPEEQKKITFQKHKEKPEQEELKNQLFSENATENNTVKATKKQLFSSEYEQTSSSS 110

Query: 128 EVEEEEEEQKNK 139
           E   EEE +K  
Sbjct: 111 ESTSEEETKKTS 122


>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi.  This family includes
           the fifth essential DNA polymerase in yeast EC:2.7.7.7.
           Pol5p is localised exclusively to the nucleolus and
           binds near or at the enhancer region of rRNA-encoding
           DNA repeating units.
          Length = 784

 Score = 37.5 bits (87), Expect = 0.002
 Identities = 16/71 (22%), Positives = 32/71 (45%), Gaps = 1/71 (1%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKK-EEEVEEEEEEQK 137
           E ++ E+E +  E+ E E E + E  E++E++ + +  E     DK     + +      
Sbjct: 657 ETDDDEDECEAIEDSESESESDGEDGEEDEQEDDAEANEGVVPIDKAVRRALPKVLNLPD 716

Query: 138 NKKKEEEEEEE 148
                + E+EE
Sbjct: 717 ALDGGDSEDEE 727



 Score = 36.4 bits (84), Expect = 0.005
 Identities = 20/74 (27%), Positives = 37/74 (50%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
           ++  E EEE+E++ EE + DEDE +  E  E E E   +  EE+ +E++ E        +
Sbjct: 642 QQLFEGEEEDEDDLEETDDDEDECEAIEDSESESESDGEDGEEDEQEDDAEANEGVVPID 701

Query: 145 EEEEKEKVEMYRNP 158
           +   +   ++   P
Sbjct: 702 KAVRRALPKVLNLP 715



 Score = 33.7 bits (77), Expect = 0.034
 Identities = 17/60 (28%), Positives = 37/60 (61%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
           + L+ +E +   ++  E EEE+E++ E+ +D+E + E  E+ E + +   E+  E+E+E+
Sbjct: 630 DVLKADENKSRHQQLFEGEEEDEDDLEETDDDEDECEAIEDSESESESDGEDGEEDEQED 689



 Score = 31.4 bits (71), Expect = 0.20
 Identities = 14/82 (17%), Positives = 37/82 (45%)

Query: 75  SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
            E +E++E E E   ++ EE+E+E++ E +E      +       K  +  +     + E
Sbjct: 665 CEAIEDSESESESDGEDGEEDEQEDDAEANEGVVPIDKAVRRALPKVLNLPDALDGGDSE 724

Query: 135 EQKNKKKEEEEEEEKEKVEMYR 156
           +++    E+    +    ++++
Sbjct: 725 DEEGMDDEQMMRLDTYLAQIFK 746



 Score = 26.8 bits (59), Expect = 6.3
 Identities = 17/68 (25%), Positives = 30/68 (44%), Gaps = 1/68 (1%)

Query: 70  QAISASETLENNEWE-EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
           +AI  SE+   ++ E  EE E+E++ E  E     D+   +   K     +       E+
Sbjct: 666 EAIEDSESESESDGEDGEEDEQEDDAEANEGVVPIDKAVRRALPKVLNLPDALDGGDSED 725

Query: 129 VEEEEEEQ 136
            E  ++EQ
Sbjct: 726 EEGMDDEQ 733


>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
           RPA34.5.  This is a family of proteins conserved from
           yeasts to human. Subunit A34.5 of RNA polymerase I is a
           non-essential subunit which is thought to help Pol I
           overcome topological constraints imposed on ribosomal
           DNA during the process of transcription.
          Length = 193

 Score = 36.6 bits (85), Expect = 0.002
 Identities = 21/75 (28%), Positives = 48/75 (64%)

Query: 73  SASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
             +     +E   E +  E+E   + E++ + E+EEKK +KK++E +K+K +K+++ E+ 
Sbjct: 118 YGAPDGPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKM 177

Query: 133 EEEQKNKKKEEEEEE 147
            E + +KKK++++++
Sbjct: 178 VEPKGSKKKKKKKKK 192



 Score = 35.5 bits (82), Expect = 0.005
 Identities = 22/66 (33%), Positives = 41/66 (62%)

Query: 86  EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
                  E   E E  EK+   + ++E + EEEEK++ KK++EV++E++E+K+KK++  E
Sbjct: 120 APDGPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVE 179

Query: 146 EEEKEK 151
            +  +K
Sbjct: 180 PKGSKK 185



 Score = 35.1 bits (81), Expect = 0.007
 Identities = 18/68 (26%), Positives = 43/68 (63%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
             E   + E  E+E   K + E E ++ EKKE++++K+  K+++E ++++E+    K  +
Sbjct: 125 PSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSK 184

Query: 144 EEEEEKEK 151
           +++++K+K
Sbjct: 185 KKKKKKKK 192


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 37.4 bits (86), Expect = 0.003
 Identities = 19/42 (45%), Positives = 27/42 (64%)

Query: 109 KKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
           K  +K+EE  EK K + E++  EE E +K K+KE E E E+E
Sbjct: 576 KLAKKREEAVEKAKREAEQKAREEREREKEKEKERERERERE 617



 Score = 35.4 bits (81), Expect = 0.010
 Identities = 20/48 (41%), Positives = 30/48 (62%)

Query: 90  KEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
           K  ++ EE  EK K E E+K RE++E E+EK+K+++ E   E E   K
Sbjct: 576 KLAKKREEAVEKAKREAEQKAREEREREKEKEKEREREREREAERAAK 623



 Score = 34.3 bits (78), Expect = 0.027
 Identities = 19/51 (37%), Positives = 32/51 (62%)

Query: 74  ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDK 124
           AS  L     E  EK K E E++  EE+E+++++EK+RE++ E E ++  K
Sbjct: 573 ASSKLAKKREEAVEKAKREAEQKAREEREREKEKEKEREREREREAERAAK 623



 Score = 33.1 bits (75), Expect = 0.056
 Identities = 16/38 (42%), Positives = 26/38 (68%)

Query: 114 KEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
           K+ EE  +K K+E E +  EE ++ K+KE+E E E+E+
Sbjct: 579 KKREEAVEKAKREAEQKAREEREREKEKEKERERERER 616



 Score = 32.4 bits (73), Expect = 0.12
 Identities = 15/41 (36%), Positives = 30/41 (73%)

Query: 88  KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
           K++EE  E+ + E E+   EE++REK++E+E +++ ++E E
Sbjct: 579 KKREEAVEKAKREAEQKAREEREREKEKEKEREREREREAE 619



 Score = 29.3 bits (65), Expect = 0.97
 Identities = 16/42 (38%), Positives = 30/42 (71%), Gaps = 1/42 (2%)

Query: 109 KKREKKEEEEEKQKDKK-EEEVEEEEEEQKNKKKEEEEEEEK 149
           KKRE+  E+ +++ ++K  EE E E+E++K +++E E E E+
Sbjct: 579 KKREEAVEKAKREAEQKAREEREREKEKEKEREREREREAER 620



 Score = 29.3 bits (65), Expect = 1.1
 Identities = 16/52 (30%), Positives = 35/52 (67%), Gaps = 1/52 (1%)

Query: 72  ISASETLENNEWEEEEKEKEEEEE-EEEEEKEKDEDEEKKREKKEEEEEKQK 122
           +++S+  +  E   E+ ++E E++  EE E+EK++++E++RE++ E E   K
Sbjct: 572 LASSKLAKKREEAVEKAKREAEQKAREEREREKEKEKEREREREREAERAAK 623



 Score = 27.0 bits (59), Expect = 6.1
 Identities = 13/46 (28%), Positives = 31/46 (67%)

Query: 78  LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKD 123
           L +++  ++ +E  E+ + E E+K ++E E +K ++KE E E++++
Sbjct: 572 LASSKLAKKREEAVEKAKREAEQKAREEREREKEKEKERERERERE 617



 Score = 27.0 bits (59), Expect = 6.5
 Identities = 15/39 (38%), Positives = 27/39 (69%), Gaps = 1/39 (2%)

Query: 108 EKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEE 146
           E+  EK + E E QK ++E E E+E+E+++ +++E E E
Sbjct: 582 EEAVEKAKREAE-QKAREEREREKEKEKEREREREREAE 619


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
           Rpc31.  RNA polymerase III contains seventeen subunits
           in yeasts and in human cells. Twelve of these are akin
           to RNA polymerase I or II and the other five are RNA pol
           III-specific, and form the functionally distinct groups
           (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
           Rpc34 and Rpc82 form a cluster of enzyme-specific
           subunits that contribute to transcription initiation in
           S.cerevisiae and H.sapiens. There is evidence that these
           subunits are anchored at or near the N-terminal Zn-fold
           of Rpc1, itself prolonged by a highly conserved but RNA
           polymerase III-specific domain.
          Length = 221

 Score = 36.7 bits (85), Expect = 0.003
 Identities = 21/56 (37%), Positives = 37/56 (66%)

Query: 93  EEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEE 148
             EEEE+  EK    EKK ++ E E+  ++D+K+EE EEEEEE+     +++++++
Sbjct: 142 FTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDDD 197



 Score = 33.2 bits (76), Expect = 0.033
 Identities = 14/55 (25%), Positives = 39/55 (70%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
           EEEE   E+    E++ KE + ++  + ++K+EEEE+++++++E+ ++++++  +
Sbjct: 144 EEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDDDD 198



 Score = 27.8 bits (62), Expect = 2.5
 Identities = 21/51 (41%), Positives = 32/51 (62%), Gaps = 2/51 (3%)

Query: 103 KDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
             E+EE   EK    E+K K+ + E+V+EE+E+ + +  EEEEEEE E  +
Sbjct: 142 FTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEE--EEEEEEEDEDFD 190


>gnl|CDD|237063 PRK12329, nusA, transcription elongation factor NusA; Provisional.
          Length = 449

 Score = 36.7 bits (85), Expect = 0.003
 Identities = 18/64 (28%), Positives = 32/64 (50%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
            E   + EEEE  + E E+  + E+    +E+   ++     E+  E+E+E +  + EEE
Sbjct: 386 AELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDELEEAQPEEE 445

Query: 145 EEEE 148
           EE  
Sbjct: 446 EEAR 449



 Score = 33.6 bits (77), Expect = 0.039
 Identities = 19/70 (27%), Positives = 36/70 (51%), Gaps = 2/70 (2%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           +E E  K  E   + EE+E  + E ++R + E+ E  ++D +  E+    E++     E+
Sbjct: 378 QEAEDAKVAELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFED--ED 435

Query: 144 EEEEEKEKVE 153
           E EE + + E
Sbjct: 436 ELEEAQPEEE 445



 Score = 32.8 bits (75), Expect = 0.073
 Identities = 17/69 (24%), Positives = 36/69 (52%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
            E  ++ E+ +  E   +++E+E  +RE +E  E +Q ++ EE+    E     + + E+
Sbjct: 374 AEYDQEAEDAKVAELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFED 433

Query: 145 EEEEKEKVE 153
           E+E +E   
Sbjct: 434 EDELEEAQP 442


>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
           (nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
           phosphoribosyltransferase) [Coenzyme metabolism].
          Length = 620

 Score = 36.7 bits (85), Expect = 0.003
 Identities = 20/73 (27%), Positives = 40/73 (54%), Gaps = 3/73 (4%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE---EEVEEEEEEQKNKK 140
           ++  +E  +EE+ ++++ + +ED E  RE+ E  +E ++D+ E    E EE E +     
Sbjct: 224 DDGIEEDADEEDGDDDQPDNNEDSEAGREESEGSDESEEDEAEATDGEGEEGEMDAAEAS 283

Query: 141 KEEEEEEEKEKVE 153
           ++ E +E  E  E
Sbjct: 284 EDSESDESDEDTE 296



 Score = 34.8 bits (80), Expect = 0.013
 Identities = 16/60 (26%), Positives = 29/60 (48%)

Query: 75  SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
            +  ++     E+ E   EE E  +E E+DE E    E +E E +  +  ++ E +E +E
Sbjct: 234 EDGDDDQPDNNEDSEAGREESEGSDESEEDEAEATDGEGEEGEMDAAEASEDSESDESDE 293



 Score = 33.7 bits (77), Expect = 0.035
 Identities = 22/77 (28%), Positives = 37/77 (48%)

Query: 74  ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
             +  E++E   EE E  +E EE+E E    E EE + +  E  E+ + D+ +E+ E   
Sbjct: 240 QPDNNEDSEAGREESEGSDESEEDEAEATDGEGEEGEMDAAEASEDSESDESDEDTETPG 299

Query: 134 EEQKNKKKEEEEEEEKE 150
           E+ +      E  EE +
Sbjct: 300 EDARPATPFTELMEEVD 316



 Score = 32.1 bits (73), Expect = 0.13
 Identities = 16/62 (25%), Positives = 35/62 (56%)

Query: 89  EKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEE 148
           E+  ++  EE+  E+D D+++    ++ E  +++ +  +E EE+E E  + + EE E + 
Sbjct: 220 EETGDDGIEEDADEEDGDDDQPDNNEDSEAGREESEGSDESEEDEAEATDGEGEEGEMDA 279

Query: 149 KE 150
            E
Sbjct: 280 AE 281



 Score = 31.0 bits (70), Expect = 0.29
 Identities = 19/81 (23%), Positives = 39/81 (48%), Gaps = 2/81 (2%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEE--EE 133
           + L + +  EE  +   EE+ +EE+ + D+ +  +  +   EE +  D+ EE+  E  + 
Sbjct: 211 DMLGSMDMAEETGDDGIEEDADEEDGDDDQPDNNEDSEAGREESEGSDESEEDEAEATDG 270

Query: 134 EEQKNKKKEEEEEEEKEKVEM 154
           E ++ +    E  E+ E  E 
Sbjct: 271 EGEEGEMDAAEASEDSESDES 291



 Score = 29.0 bits (65), Expect = 1.3
 Identities = 17/71 (23%), Positives = 34/71 (47%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
           E +  +++    E+ E   EE +  DE EE + E  + E E+ +    E  E+ E ++ +
Sbjct: 233 EEDGDDDQPDNNEDSEAGREESEGSDESEEDEAEATDGEGEEGEMDAAEASEDSESDESD 292

Query: 139 KKKEEEEEEEK 149
           +  E   E+ +
Sbjct: 293 EDTETPGEDAR 303


>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein.  Function of MutS2 is
           unknown. It should not be considered a DNA mismatch
           repair protein. It is likely a DNA mismatch binding
           protein of unknown cellular function [DNA metabolism,
           Other].
          Length = 771

 Score = 36.7 bits (85), Expect = 0.004
 Identities = 20/67 (29%), Positives = 39/67 (58%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
              EKE E++ E  E+  ++ E  +K+ E++ EE ++++  K+ E+E+E +E     K+E
Sbjct: 521 SALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEAQEALKALKKE 580

Query: 144 EEEEEKE 150
            E   +E
Sbjct: 581 VESIIRE 587



 Score = 34.0 bits (78), Expect = 0.030
 Identities = 27/97 (27%), Positives = 47/97 (48%), Gaps = 5/97 (5%)

Query: 61  NARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKE-----KDEDEEKKREKKE 115
           N  L  +L +     + LE    E +E+E+ ++ E E+E +E     K E E   RE KE
Sbjct: 531 NEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEAQEALKALKKEVESIIRELKE 590

Query: 116 EEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKV 152
           ++  K K+ K  E   + +E K K  ++    + +K+
Sbjct: 591 KKIHKAKEIKSIEDLVKLKETKQKIPQKPTNFQADKI 627



 Score = 33.6 bits (77), Expect = 0.036
 Identities = 23/88 (26%), Positives = 50/88 (56%), Gaps = 1/88 (1%)

Query: 64  LTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKD 123
           + V++ +  +  + LE      E+  KE+E+ ++E E+E +E +E++R KK E E++ ++
Sbjct: 513 INVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEAQE 572

Query: 124 KKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
             +   +E E   + + KE++  + KE 
Sbjct: 573 ALKALKKEVESIIR-ELKEKKIHKAKEI 599



 Score = 33.6 bits (77), Expect = 0.039
 Identities = 23/73 (31%), Positives = 40/73 (54%), Gaps = 5/73 (6%)

Query: 86  EEKEKEEEEEEEEEEKEKDEDEEK-KREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
           E KE+     E+    EK+ +++    EK  +E+EK K    +E+E+E EE K +++ ++
Sbjct: 508 EFKEEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLK----KELEQEMEELKERERNKK 563

Query: 145 EEEEKEKVEMYRN 157
            E EKE  E  + 
Sbjct: 564 LELEKEAQEALKA 576



 Score = 32.5 bits (74), Expect = 0.10
 Identities = 18/72 (25%), Positives = 41/72 (56%)

Query: 78  LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
            +N   E+  KE+E+ ++E E+E E+ ++ E+ ++ + E+E ++  K  ++  E    + 
Sbjct: 529 QKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEAQEALKALKKEVESIIREL 588

Query: 138 NKKKEEEEEEEK 149
            +KK  + +E K
Sbjct: 589 KEKKIHKAKEIK 600


>gnl|CDD|239285 cd02987, Phd_like_Phd, Phosducin (Phd)-like family, Phd subfamily;
           Phd is a cytosolic regulator of G protein functions. It
           specifically binds G protein betagamma (Gbg)-subunits
           with high affinity, resulting in the solubilization of
           Gbg from the plasma membrane. This impedes the formation
           of a functional G protein trimer (G protein
           alphabetagamma), thereby inhibiting G protein-mediated
           signal transduction. Phd also inhibits the GTPase
           activity of G protein alpha. Phd can be phosphorylated
           by protein kinase A and G protein-coupled receptor
           kinase 2, leading to its inactivation. Phd was
           originally isolated from the retina, where it is highly
           expressed and has been implicated to play an important
           role in light adaptation. It is also found in the pineal
           gland, liver, spleen, striated muscle and the brain. The
           C-terminal domain of Phd adopts a thioredoxin fold, but
           it does not contain a CXXC motif. Phd interacts with G
           protein beta mostly through the N-terminal helical
           domain.
          Length = 175

 Score = 35.7 bits (83), Expect = 0.004
 Identities = 11/45 (24%), Positives = 30/45 (66%)

Query: 81  NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKK 125
           N+W + ++ KE E+E+++++++K+E  ++ RE++ +E   +    
Sbjct: 14  NDWRKFKQLKESEQEDDDDDEDKEEFLQQYREQRMQEMHAKLPFG 58


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 36.4 bits (84), Expect = 0.004
 Identities = 22/71 (30%), Positives = 40/71 (56%), Gaps = 1/71 (1%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEE-EKQKDKKEEEVEEEEEEQKNKKKE 142
           E+  K+ EE++++ EE K K   E K + + E E+  K++ KK+ E E + +     KK+
Sbjct: 108 EQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKK 167

Query: 143 EEEEEEKEKVE 153
             E ++K + E
Sbjct: 168 AAEAKKKAEAE 178



 Score = 34.0 bits (78), Expect = 0.021
 Identities = 16/69 (23%), Positives = 39/69 (56%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
           E+    E+  ++ E+  ++ E+++K+ E+ + ++  +   K E   E++ +++ KK+ EE
Sbjct: 95  EQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEE 154

Query: 145 EEEEKEKVE 153
           E + K   E
Sbjct: 155 EAKAKAAAE 163



 Score = 33.3 bits (76), Expect = 0.044
 Identities = 17/70 (24%), Positives = 40/70 (57%), Gaps = 1/70 (1%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
            E++ E+ E++   E+ ++K+ ++    EK  ++ E+   K+ EE +++ EE K K+  E
Sbjct: 73  LEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAA-KQAEEKQKQAEEAKAKQAAE 131

Query: 144 EEEEEKEKVE 153
            + + + + E
Sbjct: 132 AKAKAEAEAE 141



 Score = 32.9 bits (75), Expect = 0.054
 Identities = 14/66 (21%), Positives = 40/66 (60%)

Query: 88  KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
           ++K   ++E+E +K+ ++  E+  +++  E+ +QK+ ++    E+  +Q  +  ++ EE+
Sbjct: 58  QKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEK 117

Query: 148 EKEKVE 153
           +K+  E
Sbjct: 118 QKQAEE 123



 Score = 32.1 bits (73), Expect = 0.12
 Identities = 13/67 (19%), Positives = 42/67 (62%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
           +++   ++E+E +++ +++ E+ EK+R  ++  +++ + +   E   ++ EQ  K+ EE+
Sbjct: 58  QKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEK 117

Query: 145 EEEEKEK 151
           +++ +E 
Sbjct: 118 QKQAEEA 124



 Score = 31.3 bits (71), Expect = 0.17
 Identities = 21/78 (26%), Positives = 41/78 (52%)

Query: 74  ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
           A +  E  +  EE K K+  E + + E E ++  +++ +K+ EEE K K   E + +  E
Sbjct: 111 AKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAE 170

Query: 134 EEQKNKKKEEEEEEEKEK 151
            ++K + + + + E K K
Sbjct: 171 AKKKAEAEAKAKAEAKAK 188



 Score = 30.6 bits (69), Expect = 0.35
 Identities = 18/72 (25%), Positives = 37/72 (51%)

Query: 82  EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
           E E E+K KEE +++ EEE +     E K++  E +++ + + K +   + + + +  K 
Sbjct: 137 EAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKA 196

Query: 142 EEEEEEEKEKVE 153
           + E  + K   E
Sbjct: 197 KAEAAKAKAAAE 208



 Score = 28.6 bits (64), Expect = 1.5
 Identities = 11/85 (12%), Positives = 44/85 (51%)

Query: 67  ILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE 126
           + P+     E ++    +     ++    +++++    +++E++++ +++ EE +K +  
Sbjct: 27  VKPEPGGGGEIIQAVLVDPGAVAQQANRIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAA 86

Query: 127 EEVEEEEEEQKNKKKEEEEEEEKEK 151
           E+  ++E EQ+   ++  ++ E+  
Sbjct: 87  EQARQKELEQRAAAEKAAKQAEQAA 111



 Score = 26.7 bits (59), Expect = 6.7
 Identities = 16/72 (22%), Positives = 32/72 (44%)

Query: 82  EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
           E E + K   E +++  E K+K E E K + + + + + ++ K + E  + +   +   K
Sbjct: 153 EEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKAKAEAAKAKAAAEAAAK 212

Query: 142 EEEEEEEKEKVE 153
            E E       E
Sbjct: 213 AEAEAAAAAAAE 224



 Score = 26.3 bits (58), Expect = 9.5
 Identities = 17/71 (23%), Positives = 33/71 (46%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           +EE K++ EEE + +   E  +   + ++K E E + + + K +   EE + +    K +
Sbjct: 145 KEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKAKAEAAKAK 204

Query: 144 EEEEEKEKVEM 154
              E   K E 
Sbjct: 205 AAAEAAAKAEA 215


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 36.6 bits (85), Expect = 0.004
 Identities = 26/73 (35%), Positives = 39/73 (53%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           E E +E EE  EE +E+ E  ++E ++RE   EE E+   + EE  EE EE+     +E 
Sbjct: 315 ENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEEL 374

Query: 144 EEEEEKEKVEMYR 156
           EE  E  + E+  
Sbjct: 375 EELFEALREELAE 387



 Score = 34.7 bits (80), Expect = 0.016
 Identities = 22/70 (31%), Positives = 37/70 (52%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           E E + +E EE  EE +++ +  +E+  E++   EE ++   E E  +EE E+K     E
Sbjct: 313 ELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLE 372

Query: 144 EEEEEKEKVE 153
           E EE  E + 
Sbjct: 373 ELEELFEALR 382



 Score = 34.3 bits (79), Expect = 0.021
 Identities = 29/99 (29%), Positives = 42/99 (42%)

Query: 59  RANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEE 118
                L   L +     E L+    E E   +E E+   E E+ K+E EEK     EE E
Sbjct: 316 NELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELE 375

Query: 119 EKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYRN 157
           E  +  +EE  E E E  + + + EE + E E +E    
Sbjct: 376 ELFEALREELAELEAELAEIRNELEELKREIESLEERLE 414



 Score = 33.1 bits (76), Expect = 0.060
 Identities = 28/81 (34%), Positives = 42/81 (51%)

Query: 73  SASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
             +E  E  E  EE KEK E  +EE EE+E   +E ++   + EE +++ ++K   + EE
Sbjct: 314 LENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEE 373

Query: 133 EEEQKNKKKEEEEEEEKEKVE 153
            EE     +EE  E E E  E
Sbjct: 374 LEELFEALREELAELEAELAE 394



 Score = 33.1 bits (76), Expect = 0.064
 Identities = 21/72 (29%), Positives = 40/72 (55%)

Query: 80  NNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
             E EE + E EE  EE EE +E+  + +++ E+ E E    +++ EE   E EE ++  
Sbjct: 266 EKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEERL 325

Query: 140 KKEEEEEEEKEK 151
           ++ +E+ E  ++
Sbjct: 326 EELKEKIEALKE 337



 Score = 32.4 bits (74), Expect = 0.096
 Identities = 26/86 (30%), Positives = 43/86 (50%), Gaps = 4/86 (4%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE----VEE 131
           E LEN   E EE+ +E +E+ E  ++E +E E    E ++   E ++ K+E E       
Sbjct: 312 EELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALL 371

Query: 132 EEEEQKNKKKEEEEEEEKEKVEMYRN 157
           EE E+  +   EE  E + ++   RN
Sbjct: 372 EELEELFEALREELAELEAELAEIRN 397



 Score = 31.6 bits (72), Expect = 0.19
 Identities = 22/84 (26%), Positives = 35/84 (41%)

Query: 70  QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
                 E L     E EE  +   EE  E E E  E   +  E K E E  ++  +    
Sbjct: 359 AKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIESLEERLERLSE 418

Query: 130 EEEEEEQKNKKKEEEEEEEKEKVE 153
             E+ +++ K+ E E EE + ++E
Sbjct: 419 RLEDLKEELKELEAELEELQTELE 442



 Score = 31.6 bits (72), Expect = 0.20
 Identities = 22/68 (32%), Positives = 37/68 (54%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           EE E E     E  EE + + E+ E++ E+ +E+ E  K++ EE     EE ++   + E
Sbjct: 298 EELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELE 357

Query: 144 EEEEEKEK 151
           E +EE E+
Sbjct: 358 EAKEELEE 365



 Score = 31.6 bits (72), Expect = 0.21
 Identities = 18/82 (21%), Positives = 35/82 (42%)

Query: 70  QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
           + +           EE E++     EE EE  E   +E  + E +  E   + ++ + E+
Sbjct: 347 EELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREI 406

Query: 130 EEEEEEQKNKKKEEEEEEEKEK 151
           E  EE  +   +  E+ +E+ K
Sbjct: 407 ESLEERLERLSERLEDLKEELK 428



 Score = 31.2 bits (71), Expect = 0.26
 Identities = 23/84 (27%), Positives = 40/84 (47%)

Query: 70  QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
           Q        E  E E E     E  EE E E E+ E+  ++ ++K E  +++ +++E  +
Sbjct: 287 QEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLL 346

Query: 130 EEEEEEQKNKKKEEEEEEEKEKVE 153
           EE E+     ++ +EE EEK    
Sbjct: 347 EELEQLLAELEEAKEELEEKLSAL 370



 Score = 31.2 bits (71), Expect = 0.27
 Identities = 30/80 (37%), Positives = 47/80 (58%)

Query: 74  ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
                E  E E+E +E +EE EE E EKE+ EDE K+ E+++EE E++  + E E+ E +
Sbjct: 845 DELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELK 904

Query: 134 EEQKNKKKEEEEEEEKEKVE 153
           EE +  ++  EE E K +  
Sbjct: 905 EEIEKLRERLEELEAKLERL 924



 Score = 30.5 bits (69), Expect = 0.39
 Identities = 21/93 (22%), Positives = 41/93 (44%)

Query: 61  NARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEK 120
              L           + L   E  +EE E++     EE E+  +   E+  E + E  E 
Sbjct: 336 KEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEI 395

Query: 121 QKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
           + + +E + E E  E++ ++  E  E+ KE+++
Sbjct: 396 RNELEELKREIESLEERLERLSERLEDLKEELK 428



 Score = 30.5 bits (69), Expect = 0.41
 Identities = 31/73 (42%), Positives = 44/73 (60%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
           E LE  + E E++ KE EEE+EE E+E  E E +  E KEE E+ ++  +E E + E  E
Sbjct: 866 EELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEELEAKLERLE 925

Query: 136 QKNKKKEEEEEEE 148
            +  + EEE EEE
Sbjct: 926 VELPELEEELEEE 938



 Score = 30.5 bits (69), Expect = 0.46
 Identities = 31/81 (38%), Positives = 45/81 (55%), Gaps = 1/81 (1%)

Query: 74  ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDE-EKKREKKEEEEEKQKDKKEEEVEEE 132
           A+   E  + +   +E EEE EE EEE E+ ++  E+  E+ E  EE     KEE  E E
Sbjct: 726 AALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELE 785

Query: 133 EEEQKNKKKEEEEEEEKEKVE 153
           E+ Q  +++ EE EEE E+ E
Sbjct: 786 EKRQALQEELEELEEELEEAE 806



 Score = 30.5 bits (69), Expect = 0.49
 Identities = 29/104 (27%), Positives = 46/104 (44%)

Query: 50  GPAQYHKPIRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEK 109
             AQ  +       L  +  Q     E L++ + E    E   EE   + E+ + + EE 
Sbjct: 662 SLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEEL 721

Query: 110 KREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
           KRE    EEE ++ +   E  EEE E+  ++ EE +E  +E  E
Sbjct: 722 KRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEE 765



 Score = 30.1 bits (68), Expect = 0.55
 Identities = 29/84 (34%), Positives = 41/84 (48%)

Query: 78  LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
           LE  + E EE E E+EE E+E ++ ++E EE + E +E E E  + K+E E   E  E+ 
Sbjct: 858 LEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEEL 917

Query: 138 NKKKEEEEEEEKEKVEMYRNPVPM 161
             K E  E E  E  E        
Sbjct: 918 EAKLERLEVELPELEEELEEEYED 941



 Score = 30.1 bits (68), Expect = 0.56
 Identities = 26/74 (35%), Positives = 49/74 (66%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
           E  E  E  +E +E+ EE E+E EE +++ +E +  +++ E+E K+ ++++EE+EEE  E
Sbjct: 836 EIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRE 895

Query: 136 QKNKKKEEEEEEEK 149
            +++  E +EE EK
Sbjct: 896 LESELAELKEEIEK 909



 Score = 30.1 bits (68), Expect = 0.65
 Identities = 25/77 (32%), Positives = 40/77 (51%)

Query: 74  ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
               LE  E + EE ++E    EEE E+ +   EE + E +E EEE ++ ++  E  EEE
Sbjct: 707 LRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEE 766

Query: 134 EEQKNKKKEEEEEEEKE 150
            E   +   + +EE +E
Sbjct: 767 LESLEEALAKLKEEIEE 783



 Score = 29.7 bits (67), Expect = 0.88
 Identities = 31/72 (43%), Positives = 47/72 (65%), Gaps = 1/72 (1%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVE-EEEEEQKNKKKE 142
           E EE+ +E EE+ +E E+E +E E++  E KEE EE + +K+E E E +E EE+K + +E
Sbjct: 832 ELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEE 891

Query: 143 EEEEEEKEKVEM 154
           E  E E E  E+
Sbjct: 892 ELRELESELAEL 903



 Score = 29.3 bits (66), Expect = 0.94
 Identities = 25/81 (30%), Positives = 38/81 (46%)

Query: 70  QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
           +       L     E EE ++E E  EE  E+  +  E+ K E KE E E ++ + E E 
Sbjct: 384 ELAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEE 443

Query: 130 EEEEEEQKNKKKEEEEEEEKE 150
             EE E+  ++ EE  +  KE
Sbjct: 444 LNEELEELEEQLEELRDRLKE 464



 Score = 29.3 bits (66), Expect = 0.98
 Identities = 29/81 (35%), Positives = 44/81 (54%), Gaps = 3/81 (3%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKD---KKEEEVEEEEEE 135
           E  E EEE +E +E  EE EEE E  E+   K +++ EE E+++    ++ EE+EEE EE
Sbjct: 745 ELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEE 804

Query: 136 QKNKKKEEEEEEEKEKVEMYR 156
            + +    E E E  +    R
Sbjct: 805 AERRLDALERELESLEQRRER 825



 Score = 29.3 bits (66), Expect = 1.0
 Identities = 22/71 (30%), Positives = 41/71 (57%), Gaps = 3/71 (4%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEE---EEKQKDKKEEEVEEEEEEQKNKK 140
           EE ++E EE E+E EE K + E+  ++ E+ +EE    +++ ++ E E+    E  +  +
Sbjct: 256 EELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLRERLEELE 315

Query: 141 KEEEEEEEKEK 151
            E EE EE+ +
Sbjct: 316 NELEELEERLE 326



 Score = 28.5 bits (64), Expect = 1.8
 Identities = 27/70 (38%), Positives = 42/70 (60%)

Query: 82  EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
           + +E  KE EE EEE    +E+ E+ +++ E+ E+E E+ K + EE  EE EE Q+   +
Sbjct: 233 KLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLE 292

Query: 142 EEEEEEEKEK 151
            +EE EE E 
Sbjct: 293 LKEEIEELEG 302



 Score = 28.5 bits (64), Expect = 1.9
 Identities = 27/87 (31%), Positives = 45/87 (51%), Gaps = 2/87 (2%)

Query: 70  QAISASETLENNEWEEEEKEKEEE--EEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEE 127
           Q       L   E E E+ +   E  EEE EE +E+ E+ +++ E+ EEE E  ++   +
Sbjct: 717 QLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAK 776

Query: 128 EVEEEEEEQKNKKKEEEEEEEKEKVEM 154
             EE EE ++ ++  +EE EE E+   
Sbjct: 777 LKEEIEELEEKRQALQEELEELEEELE 803



 Score = 28.1 bits (63), Expect = 2.6
 Identities = 22/76 (28%), Positives = 40/76 (52%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
           E  E +EE  E +EE EE E E     +  ++ E + EE E++ ++ +E++E  +EE + 
Sbjct: 282 ELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEE 341

Query: 139 KKKEEEEEEEKEKVEM 154
           ++   EE E+      
Sbjct: 342 RETLLEELEQLLAELE 357



 Score = 28.1 bits (63), Expect = 3.0
 Identities = 23/78 (29%), Positives = 46/78 (58%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
           E+LE    + +E+ +E EE+ +  ++E +E EE+  E +   +  +++ +  E   E  E
Sbjct: 768 ESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLE 827

Query: 136 QKNKKKEEEEEEEKEKVE 153
           Q+ ++ EEE EE +EK++
Sbjct: 828 QEIEELEEEIEELEEKLD 845



 Score = 27.8 bits (62), Expect = 3.3
 Identities = 28/71 (39%), Positives = 42/71 (59%), Gaps = 1/71 (1%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDE-EKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKE 142
           E      E E E  E+ +E+ E E E+  E+ EE EEK  + +EE  E E+E ++ K++ 
Sbjct: 806 ERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEEL 865

Query: 143 EEEEEEKEKVE 153
           EE E EKE++E
Sbjct: 866 EELEAEKEELE 876



 Score = 27.4 bits (61), Expect = 5.3
 Identities = 22/84 (26%), Positives = 46/84 (54%)

Query: 74  ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
             E  E      E + +  E+  E  E+E +E EE+  E +E+ +E +++ +E E E EE
Sbjct: 801 ELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEE 860

Query: 134 EEQKNKKKEEEEEEEKEKVEMYRN 157
            +++ ++ E E+EE +++++    
Sbjct: 861 LKEELEELEAEKEELEDELKELEE 884



 Score = 26.6 bits (59), Expect = 7.4
 Identities = 27/82 (32%), Positives = 48/82 (58%)

Query: 70  QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
           +   A    E  E EE+ +  +EE EE EEE E+ E      E++ E  E+++++ E+E+
Sbjct: 771 EEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEI 830

Query: 130 EEEEEEQKNKKKEEEEEEEKEK 151
           EE EEE +  +++ +E EE+ +
Sbjct: 831 EELEEEIEELEEKLDELEEELE 852


>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
          Length = 651

 Score = 36.5 bits (85), Expect = 0.004
 Identities = 14/39 (35%), Positives = 29/39 (74%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK 122
           ++EE ++E+EE+E  +E+++    +K+ EKK++E EK +
Sbjct: 549 DKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLE 587



 Score = 35.8 bits (83), Expect = 0.008
 Identities = 20/57 (35%), Positives = 35/57 (61%), Gaps = 8/57 (14%)

Query: 104 DEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKV---EMYRN 157
           D+ EE +REK+E+E  K++ +  +  ++EE     KKK+E E+ EK K+   E ++ 
Sbjct: 548 DDKEELQREKEEKEALKEQKRLRKLKKQEE-----KKKKELEKLEKAKIPPAEFFKR 599



 Score = 34.6 bits (80), Expect = 0.017
 Identities = 13/38 (34%), Positives = 26/38 (68%)

Query: 81  NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEE 118
            E + E++EKE  +E++   K K ++E+KK+E ++ E+
Sbjct: 551 EELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588



 Score = 32.7 bits (75), Expect = 0.073
 Identities = 15/41 (36%), Positives = 32/41 (78%)

Query: 94  EEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
           +++EE ++EK+E E  K +K+  + +KQ++KK++E+E+ E+
Sbjct: 548 DDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588



 Score = 31.5 bits (72), Expect = 0.19
 Identities = 17/47 (36%), Positives = 34/47 (72%), Gaps = 1/47 (2%)

Query: 113 KKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE-EEEEKEKVEMYRNP 158
           K +++EE Q++K+E+E  +E++  +  KK+EE +++E EK+E  + P
Sbjct: 546 KLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAKIP 592



 Score = 31.5 bits (72), Expect = 0.20
 Identities = 13/43 (30%), Positives = 30/43 (69%)

Query: 90  KEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
           K +++EE + EKE+ E  ++++  ++ +++++K KKE E  E+
Sbjct: 546 KLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588



 Score = 30.4 bits (69), Expect = 0.51
 Identities = 14/41 (34%), Positives = 29/41 (70%)

Query: 88  KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
            +KEE + E+EE++   E +  ++ KK+EE++K++ +K E+
Sbjct: 548 DDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588


>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3.  This family of
           proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). SH3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of SH3, AAPs are retained in the ER.
          Length = 196

 Score = 35.8 bits (83), Expect = 0.005
 Identities = 13/39 (33%), Positives = 22/39 (56%)

Query: 82  EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEK 120
           EW++ +  +E   EE E      E+E  + EKKE +++K
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196



 Score = 33.5 bits (77), Expect = 0.031
 Identities = 13/32 (40%), Positives = 15/32 (46%)

Query: 122 KDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
           KD K  E    EE +     +EEE  E EK E
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKE 191



 Score = 31.9 bits (73), Expect = 0.082
 Identities = 14/37 (37%), Positives = 19/37 (51%)

Query: 105 EDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
           +D +   E   EE E     KEEE  E E+++  KKK
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196



 Score = 31.1 bits (71), Expect = 0.15
 Identities = 13/39 (33%), Positives = 19/39 (48%)

Query: 102 EKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
           E  + +  +    EE E     K+EE  E E++E K KK
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196



 Score = 30.8 bits (70), Expect = 0.22
 Identities = 9/35 (25%), Positives = 18/35 (51%)

Query: 93  EEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEE 127
           E ++ +  +E   +E +     +EEE  + +KKE 
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKES 192



 Score = 30.4 bits (69), Expect = 0.33
 Identities = 13/33 (39%), Positives = 15/33 (45%)

Query: 118 EEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
           E K     EE   EE E     K+EE  E EK+
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKK 190



 Score = 29.6 bits (67), Expect = 0.59
 Identities = 11/37 (29%), Positives = 20/37 (54%)

Query: 106 DEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKE 142
            + K  E+   EE +     +EE   E E++++KKK+
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196



 Score = 29.6 bits (67), Expect = 0.61
 Identities = 8/39 (20%), Positives = 23/39 (58%)

Query: 112 EKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
           E K+ +  ++   +E E     +E+++ + E++E ++K+
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196



 Score = 29.2 bits (66), Expect = 0.76
 Identities = 9/36 (25%), Positives = 18/36 (50%)

Query: 94  EEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
           E ++ +  E+   EE +     +EEE  + +K+E  
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESK 193



 Score = 29.2 bits (66), Expect = 0.81
 Identities = 12/36 (33%), Positives = 17/36 (47%)

Query: 117 EEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKV 152
           E +  K  +E   EE E     K++E  E E+KE  
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESK 193



 Score = 28.1 bits (63), Expect = 1.8
 Identities = 10/38 (26%), Positives = 20/38 (52%), Gaps = 1/38 (2%)

Query: 115 EEEEEKQKDK-KEEEVEEEEEEQKNKKKEEEEEEEKEK 151
           E ++ K  ++   EE E     ++ +  E E++E K+K
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKK 195


>gnl|CDD|217783 pfam03896, TRAP_alpha, Translocon-associated protein (TRAP), alpha
           subunit.  The alpha-subunit of the TRAP complex (TRAP
           alpha) is a single-spanning membrane protein of the
           endoplasmic reticulum (ER) which is found in proximity
           of nascent polypeptide chains translocating across the
           membrane.
          Length = 281

 Score = 35.9 bits (83), Expect = 0.005
 Identities = 15/54 (27%), Positives = 28/54 (51%)

Query: 65  TVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEE 118
            ++             +  E+EE E +  +E+EE+E   +EDE +  E++E+EE
Sbjct: 17  VLLTFGNGKGPFASAQDLTEDEEAEDDVVDEDEEDEAVVEEDENELTEEEEDEE 70



 Score = 30.5 bits (69), Expect = 0.30
 Identities = 14/40 (35%), Positives = 26/40 (65%)

Query: 70  QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEK 109
           Q ++  E  E++  +E+E+++   EE+E E  E++EDEE 
Sbjct: 32  QDLTEDEEAEDDVVDEDEEDEAVVEEDENELTEEEEDEEG 71



 Score = 29.7 bits (67), Expect = 0.54
 Identities = 15/45 (33%), Positives = 27/45 (60%)

Query: 71  AISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKE 115
             SA +  E+ E E++  +++EE+E   EE E +  EE++ E+ E
Sbjct: 28  FASAQDLTEDEEAEDDVVDEDEEDEAVVEEDENELTEEEEDEEGE 72



 Score = 29.0 bits (65), Expect = 1.2
 Identities = 17/37 (45%), Positives = 25/37 (67%), Gaps = 2/37 (5%)

Query: 116 EEEEKQKDKKEEEVEEE--EEEQKNKKKEEEEEEEKE 150
           E+EE + D  +E+ E+E   EE +N+  EEEE+EE E
Sbjct: 36  EDEEAEDDVVDEDEEDEAVVEEDENELTEEEEDEEGE 72



 Score = 28.2 bits (63), Expect = 1.8
 Identities = 12/34 (35%), Positives = 23/34 (67%)

Query: 86  EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEE 119
           E++E E++  +E+EE E   +E++    +EEE+E
Sbjct: 36  EDEEAEDDVVDEDEEDEAVVEEDENELTEEEEDE 69



 Score = 27.4 bits (61), Expect = 3.6
 Identities = 16/49 (32%), Positives = 24/49 (48%), Gaps = 7/49 (14%)

Query: 86  EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
            +   E+EE E++   E +EDE          EE + +  EEE +EE E
Sbjct: 31  AQDLTEDEEAEDDVVDEDEEDE-------AVVEEDENELTEEEEDEEGE 72



 Score = 27.0 bits (60), Expect = 5.3
 Identities = 15/53 (28%), Positives = 25/53 (47%), Gaps = 1/53 (1%)

Query: 58  IRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKK 110
            +        L +   A + + + + EE+E   EE+E E  EE+E +E E K 
Sbjct: 24  GKGPFASAQDLTEDEEAEDDVVDED-EEDEAVVEEDENELTEEEEDEEGEVKA 75



 Score = 26.3 bits (58), Expect = 8.7
 Identities = 12/41 (29%), Positives = 24/41 (58%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDK 124
            ++  E EE E++  +E E+DE   ++ E +  EEE+ ++ 
Sbjct: 31  AQDLTEDEEAEDDVVDEDEEDEAVVEEDENELTEEEEDEEG 71


>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT.  This family
           consists of several bacterial cobalamin biosynthesis
           (CobT) proteins. CobT is involved in the transformation
           of precorrin-3 into cobyrinic acid.
          Length = 282

 Score = 35.9 bits (83), Expect = 0.005
 Identities = 19/70 (27%), Positives = 35/70 (50%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
           + L + +  EE  ++ E  + E+ E E D  E++  ++ EEEE    D   E+ +   EE
Sbjct: 201 DMLSSMDMAEELGDEPESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEE 260

Query: 136 QKNKKKEEEE 145
            ++ + E  E
Sbjct: 261 MESGEMEAAE 270



 Score = 30.2 bits (68), Expect = 0.40
 Identities = 13/62 (20%), Positives = 27/62 (43%)

Query: 93  EEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKV 152
           +  EE  ++ +  D E   ++ + +E++  D+ EEE     +          EE E  ++
Sbjct: 207 DMAEELGDEPESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEM 266

Query: 153 EM 154
           E 
Sbjct: 267 EA 268



 Score = 29.8 bits (67), Expect = 0.64
 Identities = 13/62 (20%), Positives = 31/62 (50%), Gaps = 1/62 (1%)

Query: 74  ASETLENNEWEEEEKEKEEEEEEE-EEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
             E+ ++ + E+E+  KE+E++++ EEE+    D   +      EE +  + +  E   +
Sbjct: 215 EPESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEMEAAEASAD 274

Query: 133 EE 134
           + 
Sbjct: 275 DT 276



 Score = 27.1 bits (60), Expect = 4.6
 Identities = 12/66 (18%), Positives = 29/66 (43%)

Query: 69  PQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
             A S     E++  E+E+ ++ EEEE    +   ++ +    E +  E E  +   ++ 
Sbjct: 217 ESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEMEAAEASADDT 276

Query: 129 VEEEEE 134
            + ++ 
Sbjct: 277 PDSDDA 282


>gnl|CDD|223616 COG0542, clpA, ATP-binding subunits of Clp protease and DnaK/DnaJ
           chaperones [Posttranslational modification, protein
           turnover, chaperones].
          Length = 786

 Score = 36.1 bits (84), Expect = 0.005
 Identities = 33/129 (25%), Positives = 54/129 (41%), Gaps = 12/129 (9%)

Query: 36  NVSLTDDAEFQCQVGPAQY-------HKPIR----ANARLTVILPQAISASETLENNEWE 84
            V +TD+A         +Y        K I     A AR+ + + +     E        
Sbjct: 354 GVRITDEALVAAVTLSDRYIPDRFLPDKAIDLLDEAGARVRLEIDKPEELDELERELAQL 413

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEE-EKQKDKKEEEVEEEEEEQKNKKKEE 143
           E EKE  E E++E+E+K  DE  + K  +  E E E + +  E+++ E           +
Sbjct: 414 EIEKEALEREQDEKEKKLIDEIIKLKEGRIPELEKELEAEVDEDDIAEVVARWTGIPVAK 473

Query: 144 EEEEEKEKV 152
             E+EKEK+
Sbjct: 474 LLEDEKEKL 482


>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
           envelope biogenesis, outer membrane].
          Length = 387

 Score = 35.7 bits (82), Expect = 0.006
 Identities = 19/68 (27%), Positives = 43/68 (63%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           E++ K+KEE+  EE + K+  E E  K+ +KE  + +++ K+ EE E++ + ++ +++E+
Sbjct: 78  EQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQ 137

Query: 144 EEEEEKEK 151
             +   E+
Sbjct: 138 ARKAAAEQ 145



 Score = 33.8 bits (77), Expect = 0.029
 Identities = 21/70 (30%), Positives = 41/70 (58%), Gaps = 1/70 (1%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK-NKKKEE 143
           EE K K+  E+E  ++ EK+  + ++++K+ EE EKQ   ++++ EE+  +    +KK+ 
Sbjct: 90  EELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKA 149

Query: 144 EEEEEKEKVE 153
           E  + K   E
Sbjct: 150 EAAKAKAAAE 159



 Score = 31.8 bits (72), Expect = 0.13
 Identities = 14/76 (18%), Positives = 45/76 (59%)

Query: 74  ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
            +E L+  +  E+E+ K+ E+E  + ++++ + EE +++ + E++++++  ++   E+++
Sbjct: 88  VAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKK 147

Query: 134 EEQKNKKKEEEEEEEK 149
           + +  K K   E  + 
Sbjct: 148 KAEAAKAKAAAEAAKL 163



 Score = 30.3 bits (68), Expect = 0.47
 Identities = 16/70 (22%), Positives = 35/70 (50%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           + E+K++EE+  +   E++K  +  K +   E  + K   + +++ EE  +  +  K + 
Sbjct: 128 QLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAKAAEEAKAKA 187

Query: 144 EEEEEKEKVE 153
           E    K+K E
Sbjct: 188 EAAAAKKKAE 197



 Score = 28.4 bits (63), Expect = 1.9
 Identities = 13/70 (18%), Positives = 43/70 (61%)

Query: 82  EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
           ++   + ++   ++ E++ K+K+E   ++ + K+  E+++  + E+E  + +E+QK  ++
Sbjct: 63  QYGRIQSQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEE 122

Query: 142 EEEEEEEKEK 151
            E++ + ++K
Sbjct: 123 AEKQAQLEQK 132



 Score = 28.4 bits (63), Expect = 2.2
 Identities = 22/81 (27%), Positives = 34/81 (41%), Gaps = 1/81 (1%)

Query: 70  QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEK-KREKKEEEEEKQKDKKEEE 128
           +A  A+E  +        K+K E E +   EK K E E K K EKK E   ++K   E++
Sbjct: 175 EAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKAEAAAEEKAAAEKK 234

Query: 129 VEEEEEEQKNKKKEEEEEEEK 149
               + +        +  E K
Sbjct: 235 KAAAKAKADKAAAAAKAAERK 255



 Score = 28.0 bits (62), Expect = 2.7
 Identities = 16/78 (20%), Positives = 32/78 (41%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
              E  +  E  K K   E  + +   + + + ++  K  EE + + +    + + E E 
Sbjct: 141 AAAEQKKKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEA 200

Query: 136 QKNKKKEEEEEEEKEKVE 153
           +   +K + E E K K E
Sbjct: 201 KAAAEKAKAEAEAKAKAE 218



 Score = 26.8 bits (59), Expect = 6.5
 Identities = 21/105 (20%), Positives = 40/105 (38%), Gaps = 3/105 (2%)

Query: 52  AQYHKPIRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKR 111
               K     AR      +  + +   +      + K   E +++ EE  +  E+ + K 
Sbjct: 128 QLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAKAAEEAKAKA 187

Query: 112 EKKEEEEEKQKD---KKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
           E    +++ + +     E+   E E + K +KK E   EEK   E
Sbjct: 188 EAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKAEAAAEEKAAAE 232



 Score = 26.5 bits (58), Expect = 9.3
 Identities = 17/69 (24%), Positives = 38/69 (55%)

Query: 82  EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
             E++++ +E E++ + E+K+++E   K   +++++ E  K K   E  + +   + KKK
Sbjct: 113 AQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKLKAAAEAKKK 172

Query: 142 EEEEEEEKE 150
            EE  +  E
Sbjct: 173 AEEAAKAAE 181


>gnl|CDD|215521 PLN02967, PLN02967, kinase.
          Length = 581

 Score = 35.8 bits (82), Expect = 0.006
 Identities = 20/60 (33%), Positives = 32/60 (53%)

Query: 94  EEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
            + EEE+ EK   + +K +K +E+ E Q  + E    EE E   + + E EEE + EK +
Sbjct: 126 SDVEEEKTEKKVRKRRKVKKMDEDVEDQGSESEVSDVEESEFVTSLENESEEELDLEKDD 185



 Score = 27.7 bits (61), Expect = 3.1
 Identities = 15/62 (24%), Positives = 32/62 (51%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           + EE++ E++  +  + K+ DED E +  + E  + ++ +       E EEE   +K + 
Sbjct: 127 DVEEEKTEKKVRKRRKVKKMDEDVEDQGSESEVSDVEESEFVTSLENESEEELDLEKDDG 186

Query: 144 EE 145
           E+
Sbjct: 187 ED 188


>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3.  This family
           of proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). Shr3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of Shr3, AAPs are retained in the ER.
          Length = 196

 Score = 35.4 bits (82), Expect = 0.007
 Identities = 18/63 (28%), Positives = 27/63 (42%), Gaps = 12/63 (19%)

Query: 72  ISASET-----------LENNEW-EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEE 119
           ++AS T           L+   W  E +  K++EE   EE KE      KK    ++ E 
Sbjct: 133 LAASNTILLFVLVGVLVLQAGLWYAERKDAKQKEEFAAEERKEALAAAAKKSATPQKVET 192

Query: 120 KQK 122
           K+K
Sbjct: 193 KKK 195



 Score = 30.4 bits (69), Expect = 0.27
 Identities = 10/37 (27%), Positives = 18/37 (48%)

Query: 115 EEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
           E ++ KQK++   E  +E      KK    ++ E +K
Sbjct: 158 ERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKK 194



 Score = 30.0 bits (68), Expect = 0.43
 Identities = 9/36 (25%), Positives = 15/36 (41%)

Query: 117 EEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKV 152
           E +  K K+E   EE +E      K+    ++ E  
Sbjct: 158 ERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETK 193



 Score = 29.3 bits (66), Expect = 0.84
 Identities = 8/37 (21%), Positives = 18/37 (48%)

Query: 103 KDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
           KD  ++++   +E +E      K+    ++ E +K K
Sbjct: 160 KDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196



 Score = 28.1 bits (63), Expect = 1.6
 Identities = 9/39 (23%), Positives = 19/39 (48%)

Query: 102 EKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
           E+ + ++K+    EE +E      ++    ++ E K KK
Sbjct: 158 ERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196



 Score = 27.3 bits (61), Expect = 3.3
 Identities = 8/39 (20%), Positives = 20/39 (51%)

Query: 94  EEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
           E ++ ++KE+   EE+K       ++    +K E  +++
Sbjct: 158 ERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196



 Score = 27.0 bits (60), Expect = 4.6
 Identities = 8/37 (21%), Positives = 18/37 (48%)

Query: 106 DEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKE 142
            + K++E+   EE K+      +     ++ + KKK+
Sbjct: 160 KDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196


>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein.  Rotavirus particles
           consist of three concentric proteinaceous capsid layers.
           The innermost capsid (core) is made of VP2. The genomic
           RNA and the two minor proteins VP1 and VP3 are
           encapsidated within this layer. The N-terminus of
           rotavirus VP2 is necessary for the encapsidation of VP1
           and VP3.
          Length = 887

 Score = 36.1 bits (83), Expect = 0.007
 Identities = 18/66 (27%), Positives = 38/66 (57%), Gaps = 1/66 (1%)

Query: 90  KEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK-KEEEEEEE 148
            +E+++E++++K + E +EK  +KKEE      D   +E   +E  +   + K+  +EE 
Sbjct: 17  MQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQENLKIADEVKKSTKEES 76

Query: 149 KEKVEM 154
           K+ +E+
Sbjct: 77  KQLLEV 82



 Score = 33.0 bits (75), Expect = 0.056
 Identities = 15/65 (23%), Positives = 34/65 (52%), Gaps = 1/65 (1%)

Query: 88  KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
             +E      +  +EKD++++ ++ + E +E K  DKKEE V +  +    ++  +E  +
Sbjct: 5   NRREANINNNDRMQEKDDEKQDQKNRMELKE-KVLDKKEEVVTDNVDSPVKEQSSQENLK 63

Query: 148 EKEKV 152
             ++V
Sbjct: 64  IADEV 68


>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
           membranes [Cell envelope biogenesis, outer membrane].
          Length = 244

 Score = 35.5 bits (82), Expect = 0.007
 Identities = 13/69 (18%), Positives = 31/69 (44%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           + E +  EE+ +   E +   E    K ++K + E+K K  K +   + + + K K + +
Sbjct: 57  QPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPK 116

Query: 144 EEEEEKEKV 152
            ++   +  
Sbjct: 117 PKKPPSKTA 125



 Score = 28.2 bits (63), Expect = 1.8
 Identities = 11/62 (17%), Positives = 29/62 (46%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           +   + +   E    + KEK + E+K ++ K + + K K K + + + + ++  +K   +
Sbjct: 68  KPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKKPPSKTAAK 127

Query: 144 EE 145
             
Sbjct: 128 AP 129


>gnl|CDD|239286 cd02988, Phd_like_VIAF, Phosducin (Phd)-like family, Viral
           inhibitor of apoptosis (IAP)-associated factor (VIAF)
           subfamily; VIAF is a Phd-like protein that functions in
           caspase activation during apoptosis. It was identified
           as an IAP binding protein through a screen of a human
           B-cell library using a prototype IAP. VIAF lacks a
           consensus IAP binding motif and while it does not
           function as an IAP antagonist, it still plays a
           regulatory role in the complete activation of caspases.
           VIAF itself is a substrate for IAP-mediated
           ubiquitination, suggesting that it may be a target of
           IAPs in the prevention of cell death. The similarity of
           VIAF to Phd points to a potential role distinct from
           apoptosis regulation. Phd functions as a cytosolic
           regulator of G protein by specifically binding to G
           protein betagamma (Gbg)-subunits. The C-terminal domain
           of Phd adopts a thioredoxin fold, but it does not
           contain a CXXC motif. Phd interacts with G protein beta
           mostly through the N-terminal helical domain.
          Length = 192

 Score = 35.3 bits (82), Expect = 0.007
 Identities = 16/66 (24%), Positives = 32/66 (48%), Gaps = 3/66 (4%)

Query: 89  EKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK---EEEE 145
            K    +EEEEE  +   +E      E++   + D++ +E E++   ++ ++K   E + 
Sbjct: 15  PKPPSPKEEEEEALELAIQEAHENALEKKLLDELDEELDEEEDDRFLEEYRRKRLAEMKA 74

Query: 146 EEEKEK 151
             EK K
Sbjct: 75  LAEKSK 80



 Score = 31.5 bits (72), Expect = 0.13
 Identities = 17/52 (32%), Positives = 27/52 (51%), Gaps = 2/52 (3%)

Query: 108 EKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEK--VEMYRN 157
            K    KEEEEE  +   +E  E   E++   + +EE +EE++   +E YR 
Sbjct: 15  PKPPSPKEEEEEALELAIQEAHENALEKKLLDELDEELDEEEDDRFLEEYRR 66



 Score = 26.8 bits (60), Expect = 4.8
 Identities = 16/69 (23%), Positives = 32/69 (46%), Gaps = 1/69 (1%)

Query: 66  VILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKK 125
           ++ P+  S  E  E    E   +E  E   E++   E DE+ +++ + +  EE ++K   
Sbjct: 12  ILPPKPPSPKEEEEEAL-ELAIQEAHENALEKKLLDELDEELDEEEDDRFLEEYRRKRLA 70

Query: 126 EEEVEEEEE 134
           E +   E+ 
Sbjct: 71  EMKALAEKS 79


>gnl|CDD|237258 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
          Length = 925

 Score = 35.8 bits (83), Expect = 0.007
 Identities = 16/78 (20%), Positives = 38/78 (48%)

Query: 90  KEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEK 149
           + EEE  +    +++E   +  E +   EE++  K+E ++E  E ++  ++ +   E + 
Sbjct: 813 ESEEERLKALRIQREEMLMRPEELELINEEQKNLKQEIKLELSEIQEAEEEIQNINENKN 872

Query: 150 EKVEMYRNPVPMYRNPVP 167
           E VE   +P  + +  + 
Sbjct: 873 EFVEFKNDPKKLNKLIIA 890



 Score = 35.8 bits (83), Expect = 0.008
 Identities = 18/75 (24%), Positives = 32/75 (42%), Gaps = 1/75 (1%)

Query: 74  ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
               L   +  EEE +   E + E  E + D  +  K    ++   K     +E  ++E+
Sbjct: 850 IKLELSEIQEAEEEIQNINENKNEFVEFKNDPKKLNKLIIAKDVLIKLVISSDEIKQDEK 909

Query: 134 EEQKNKKK-EEEEEE 147
             +K KK  E+ +EE
Sbjct: 910 TTKKKKKDLEKTDEE 924



 Score = 35.0 bits (81), Expect = 0.014
 Identities = 17/73 (23%), Positives = 31/73 (42%), Gaps = 7/73 (9%)

Query: 84  EEEEKEKEEEEEEEE-------EEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQ 136
           E  E ++ EEE +          E + D  +  K    ++   K     +E  ++E+  +
Sbjct: 853 ELSEIQEAEEEIQNINENKNEFVEFKNDPKKLNKLIIAKDVLIKLVISSDEIKQDEKTTK 912

Query: 137 KNKKKEEEEEEEK 149
           K KK  E+ +EE 
Sbjct: 913 KKKKDLEKTDEEA 925



 Score = 31.9 bits (73), Expect = 0.13
 Identities = 18/73 (24%), Positives = 29/73 (39%), Gaps = 1/73 (1%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKE-KDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
              + EE     EE E   EE+K  K E + +  E +E EEE Q   + +    E +   
Sbjct: 822 LRIQREEMLMRPEELELINEEQKNLKQEIKLELSEIQEAEEEIQNINENKNEFVEFKNDP 881

Query: 138 NKKKEEEEEEEKE 150
            K  +    ++  
Sbjct: 882 KKLNKLIIAKDVL 894



 Score = 29.6 bits (67), Expect = 0.94
 Identities = 18/79 (22%), Positives = 38/79 (48%)

Query: 73  SASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
           +A + L   E   +     +  +  +E  ++  + E++R K    + ++   + EE+E  
Sbjct: 780 NAEKILIITEILSDGINNSDINDRPQELIDQIIESEEERLKALRIQREEMLMRPEELELI 839

Query: 133 EEEQKNKKKEEEEEEEKEK 151
            EEQKN K+E + E  + +
Sbjct: 840 NEEQKNLKQEIKLELSEIQ 858


>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
          Length = 158

 Score = 34.7 bits (80), Expect = 0.008
 Identities = 21/70 (30%), Positives = 43/70 (61%)

Query: 81  NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
           N + EE +++++E  +E EEKE  + +++K+EKKE++  ++  KK+     +   +K  K
Sbjct: 89  NLFPEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATK 148

Query: 141 KEEEEEEEKE 150
           K   ++EE +
Sbjct: 149 KTTTKKEEGK 158



 Score = 29.3 bits (66), Expect = 0.57
 Identities = 16/54 (29%), Positives = 30/54 (55%)

Query: 99  EEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKV 152
            E+ + E +E  +E +E+E  K K +K+E+ E++  E+  KKK  +  +   K 
Sbjct: 92  PEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKK 145



 Score = 27.4 bits (61), Expect = 2.5
 Identities = 18/65 (27%), Positives = 40/65 (61%), Gaps = 2/65 (3%)

Query: 91  EEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV--EEEEEEQKNKKKEEEEEEE 148
            EE E+E++E  K+ +E++  + K+E++EK++ K  E++  ++  +  KN  K+  ++  
Sbjct: 92  PEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTT 151

Query: 149 KEKVE 153
            +K E
Sbjct: 152 TKKEE 156



 Score = 27.4 bits (61), Expect = 2.8
 Identities = 13/61 (21%), Positives = 34/61 (55%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
           E  E  +E +EKE  + ++E++++K++   +K  KK+  +  +   K+   +   ++++ 
Sbjct: 98  EQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTTKKEEG 157

Query: 139 K 139
           K
Sbjct: 158 K 158



 Score = 26.6 bits (59), Expect = 5.1
 Identities = 18/69 (26%), Positives = 37/69 (53%)

Query: 67  ILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE 126
           + P+     +   + E EE+E  K ++E++E++EK+  E   KK+  K  +   +K  K+
Sbjct: 90  LFPEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKK 149

Query: 127 EEVEEEEEE 135
              ++EE +
Sbjct: 150 TTTKKEEGK 158


>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
          Length = 508

 Score = 35.5 bits (82), Expect = 0.008
 Identities = 26/108 (24%), Positives = 51/108 (47%), Gaps = 3/108 (2%)

Query: 52  AQYHKPIRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKR 111
            +      A   L     +A    E       E   +E+ ++ +E   E+E+ + EE++ 
Sbjct: 27  KRQRLAKEAERILQEAQKEAEEKLEAALLEAKELLLRERNQQRQEARREREELQREEERL 86

Query: 112 EKKEEEEEKQKDK---KEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYR 156
            +KEE+ + + +K    E ++EE E+    ++ E EE E++   E+YR
Sbjct: 87  VQKEEQLDARAEKLDNLENQLEEREKALSARELELEELEKQLDNELYR 134



 Score = 29.3 bits (66), Expect = 1.0
 Identities = 23/75 (30%), Positives = 35/75 (46%), Gaps = 2/75 (2%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK--K 141
           EE+   + E+ +  E + E+ E     RE + EE EKQ D +   V     EQ  K   K
Sbjct: 90  EEQLDARAEKLDNLENQLEEREKALSARELELEELEKQLDNELYRVAGLTPEQARKLLLK 149

Query: 142 EEEEEEEKEKVEMYR 156
             + E E+EK +  +
Sbjct: 150 LLDAELEEEKAQRVK 164



 Score = 28.1 bits (63), Expect = 2.7
 Identities = 27/100 (27%), Positives = 41/100 (41%)

Query: 52  AQYHKPIRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKR 111
            +  +      +    L       + LEN   E E+     E E EE EK+ D +  +  
Sbjct: 77  EELQREEERLVQKEEQLDARAEKLDNLENQLEEREKALSARELELEELEKQLDNELYRVA 136

Query: 112 EKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
               E+  K   K  +   EEE+ Q+ KK EEE + E E+
Sbjct: 137 GLTPEQARKLLLKLLDAELEEEKAQRVKKIEEEADLEAER 176


>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
           [Intracellular trafficking and secretion].
          Length = 776

 Score = 35.4 bits (81), Expect = 0.009
 Identities = 19/78 (24%), Positives = 38/78 (48%), Gaps = 4/78 (5%)

Query: 76  ETLENNEW---EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
             L N +W   E E K+   +      EK+ D+    ++ + E    ++ +  EEEVE+ 
Sbjct: 334 RNLNNEKWGNEEAERKDYILDSSSVPLEKQFDDILYFEKMEIENRNPEESEH-EEEVEDY 392

Query: 133 EEEQKNKKKEEEEEEEKE 150
           E+E  + K+  +++E + 
Sbjct: 393 EDENDHSKRICDDDELEN 410



 Score = 31.1 bits (70), Expect = 0.28
 Identities = 12/65 (18%), Positives = 30/65 (46%)

Query: 78  LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
           L+++    E++  +    E+ E + ++ +E +  E+ E+ E++    K    ++E E   
Sbjct: 353 LDSSSVPLEKQFDDILYFEKMEIENRNPEESEHEEEVEDYEDENDHSKRICDDDELENHF 412

Query: 138 NKKKE 142
               E
Sbjct: 413 RAADE 417



 Score = 30.7 bits (69), Expect = 0.37
 Identities = 17/51 (33%), Positives = 25/51 (49%)

Query: 69  PQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEE 119
           P      + L   + E E +  EE E EEE E  +DE++  KR   ++E E
Sbjct: 359 PLEKQFDDILYFEKMEIENRNPEESEHEEEVEDYEDENDHSKRICDDDELE 409



 Score = 28.4 bits (63), Expect = 2.4
 Identities = 14/55 (25%), Positives = 27/55 (49%)

Query: 101 KEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMY 155
            EK  +EE +R+    +      +K+ +     E+ + + +  EE E +E+VE Y
Sbjct: 338 NEKWGNEEAERKDYILDSSSVPLEKQFDDILYFEKMEIENRNPEESEHEEEVEDY 392



 Score = 27.6 bits (61), Expect = 4.0
 Identities = 16/47 (34%), Positives = 20/47 (42%), Gaps = 7/47 (14%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEK-------DEDEEKKREKKE 115
           E +E      EE E EEE E+ E+E +        DE E   R   E
Sbjct: 371 EKMEIENRNPEESEHEEEVEDYEDENDHSKRICDDDELENHFRAADE 417


>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
           Provisional.
          Length = 2849

 Score = 35.4 bits (81), Expect = 0.010
 Identities = 15/52 (28%), Positives = 40/52 (76%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
           ++E++++++++EE++EE+E++E+E K  + ++EE+E  +D   E+ E ++ +
Sbjct: 156 DDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGEDFTYEKSEVDKTD 207



 Score = 33.9 bits (77), Expect = 0.032
 Identities = 18/65 (27%), Positives = 40/65 (61%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           E+         ++++E++++D+D+E+  E++EEEEE+ K   +E+ E+E  E    +K E
Sbjct: 143 EDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGEDFTYEKSE 202

Query: 144 EEEEE 148
            ++ +
Sbjct: 203 VDKTD 207



 Score = 32.3 bits (73), Expect = 0.13
 Identities = 23/69 (33%), Positives = 36/69 (52%), Gaps = 6/69 (8%)

Query: 91  EEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
            EE+    +    D+D+E + E  ++EE+      +EE EEEEEE K    E+EE+E  E
Sbjct: 141 AEEDMSPRDNFVIDDDDEDEDEDDDDEED------DEEEEEEEEEIKGFDDEDEEDEGGE 194

Query: 151 KVEMYRNPV 159
                ++ V
Sbjct: 195 DFTYEKSEV 203



 Score = 31.9 bits (72), Expect = 0.15
 Identities = 16/71 (22%), Positives = 42/71 (59%), Gaps = 3/71 (4%)

Query: 91  EEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV---EEEEEEQKNKKKEEEEEE 147
           EE+    +     D+DE++  +  +EE++++++++EEE+   ++E+EE +  +    E+ 
Sbjct: 142 EEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGEDFTYEKS 201

Query: 148 EKEKVEMYRNP 158
           E +K + ++  
Sbjct: 202 EVDKTDCFKFI 212



 Score = 27.3 bits (60), Expect = 4.9
 Identities = 13/57 (22%), Positives = 30/57 (52%)

Query: 97  EEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
           + E  + D    K+R  +   EE    +    +++++E++     +EE++EE+E+ E
Sbjct: 121 KAEIGDLDMIIIKRRRARHLAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEE 177


>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
           unknown].
          Length = 652

 Score = 35.5 bits (82), Expect = 0.010
 Identities = 24/85 (28%), Positives = 42/85 (49%)

Query: 73  SASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
             +E L   + EE  +EKE  EEEE  E    E   KK E+  E  E++  + + E+EE 
Sbjct: 389 PLAEALSKVKEEERPREKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELEEL 448

Query: 133 EEEQKNKKKEEEEEEEKEKVEMYRN 157
           + E +  + E E    + + ++ ++
Sbjct: 449 KREIEKLESELERFRREVRDKVRKD 473



 Score = 28.9 bits (65), Expect = 1.5
 Identities = 23/75 (30%), Positives = 42/75 (56%), Gaps = 1/75 (1%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDE-DEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
           ET+E  E E  E ++E EE + E EK + E +  ++  + +  ++++   ++  +E  E+
Sbjct: 429 ETVERLEEENSELKRELEELKREIEKLESELERFRREVRDKVRKDREIRARDRRIERLEK 488

Query: 135 EQKNKKKEEEEEEEK 149
           E + KKK  EE E K
Sbjct: 489 ELEEKKKRVEELERK 503



 Score = 26.6 bits (59), Expect = 8.7
 Identities = 14/70 (20%), Positives = 42/70 (60%), Gaps = 1/70 (1%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           EE ++E E+ E E E  + +   ++ +++++    +++ ++ E+E+EE+++  +  +++ 
Sbjct: 446 EELKREIEKLESELERFRRE-VRDKVRKDREIRARDRRIERLEKELEEKKKRVEELERKL 504

Query: 144 EEEEEKEKVE 153
            E  +  K+E
Sbjct: 505 AELRKMRKLE 514


>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain.  This is a family of proteins of
           approximately 300 residues, found in plants and
           vertebrates. They contain a highly conserved DDRGK
           motif.
          Length = 189

 Score = 34.7 bits (80), Expect = 0.010
 Identities = 27/76 (35%), Positives = 50/76 (65%)

Query: 78  LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
               E ++  +++ E EEEE EE++K E++ +   K+EEE E++++KK+EE E +E E++
Sbjct: 8   RAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQ 67

Query: 138 NKKKEEEEEEEKEKVE 153
            +K++EE E+ K    
Sbjct: 68  ARKEQEEYEKLKSSFV 83



 Score = 34.3 bits (79), Expect = 0.013
 Identities = 31/79 (39%), Positives = 51/79 (64%)

Query: 72  ISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEE 131
           I A +  +  E +   +++E EEEE EE K+ +E  E +R+++EE EE+++ KKEEE  +
Sbjct: 3   IGAKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERK 62

Query: 132 EEEEQKNKKKEEEEEEEKE 150
           E EEQ  K++EE E+ +  
Sbjct: 63  EREEQARKEQEEYEKLKSS 81



 Score = 31.6 bits (72), Expect = 0.13
 Identities = 21/72 (29%), Positives = 51/72 (70%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
             +++ K EE++   +++E +E+E ++R+K EE+ E ++ ++EE  EE E++++ ++++E
Sbjct: 4   GAKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKE 63

Query: 144 EEEEEKEKVEMY 155
            EE+ +++ E Y
Sbjct: 64  REEQARKEQEEY 75



 Score = 31.6 bits (72), Expect = 0.14
 Identities = 29/72 (40%), Positives = 48/72 (66%)

Query: 82  EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
           E EE E+ K+ EE+ E E KE++E EE++ +KKEEEE K+++++  + +EE E+ K+   
Sbjct: 24  EEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQEEYEKLKSSFV 83

Query: 142 EEEEEEEKEKVE 153
            EEE  +K   +
Sbjct: 84  VEEEGTDKLSAD 95



 Score = 30.1 bits (68), Expect = 0.36
 Identities = 29/95 (30%), Positives = 53/95 (55%)

Query: 56  KPIRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKE 115
           K I A  R  +   QA       E  E EE +K +E+ E E +EE+E +E+ EKK+E++E
Sbjct: 1   KKIGAKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEE 60

Query: 116 EEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
            +E +++ +KE+E  E+ +     ++E  ++   +
Sbjct: 61  RKEREEQARKEQEEYEKLKSSFVVEEEGTDKLSAD 95



 Score = 30.1 bits (68), Expect = 0.41
 Identities = 28/76 (36%), Positives = 44/76 (57%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
           E  +  E + E + KEEEE EEE EK+K+E+E K+RE++  +E+++ +K +     EEE 
Sbjct: 29  EERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQEEYEKLKSSFVVEEEG 88

Query: 136 QKNKKKEEEEEEEKEK 151
                 +EE  E  E 
Sbjct: 89  TDKLSADEESNELLED 104



 Score = 29.3 bits (66), Expect = 0.73
 Identities = 23/75 (30%), Positives = 51/75 (68%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
           +    +  + E+++   ++ E +E++ +E KK E+K E E K++++ EEE E+++EE++ 
Sbjct: 2   KIGAKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEER 61

Query: 139 KKKEEEEEEEKEKVE 153
           K++EE+  +E+E+ E
Sbjct: 62  KEREEQARKEQEEYE 76



 Score = 26.2 bits (58), Expect = 7.2
 Identities = 18/60 (30%), Positives = 42/60 (70%)

Query: 97  EEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYR 156
             +++ K E+++ +R+++E EEE+++++K+ E + E E ++ ++ EEE E++KE+ E   
Sbjct: 4   GAKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKE 63


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 35.1 bits (81), Expect = 0.011
 Identities = 18/74 (24%), Positives = 36/74 (48%)

Query: 93  EEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKV 152
             EEE +       ++KK +K  E ++K + KK+++ E++E + + + K   +  +K K 
Sbjct: 42  FSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKK 101

Query: 153 EMYRNPVPMYRNPV 166
              + P P     V
Sbjct: 102 TKKKPPKPKPNEDV 115



 Score = 33.9 bits (78), Expect = 0.029
 Identities = 15/71 (21%), Positives = 35/71 (49%)

Query: 86  EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
           E+K  E +++ E+++K+K E +E K E + +   K   K ++  ++  + + N+  +   
Sbjct: 60  EDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAF 119

Query: 146 EEEKEKVEMYR 156
            +  E  E   
Sbjct: 120 NKIAELAEKSN 130



 Score = 33.5 bits (77), Expect = 0.036
 Identities = 21/68 (30%), Positives = 37/68 (54%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           E      EEE +      +KD+ E+K  E K++ E+K+K KKE++  + E E K   K  
Sbjct: 37  EILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTP 96

Query: 144 EEEEEKEK 151
           ++ ++ +K
Sbjct: 97  KKSKKTKK 104



 Score = 33.5 bits (77), Expect = 0.041
 Identities = 16/71 (22%), Positives = 36/71 (50%)

Query: 88  KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
           KE      EEE +      ++ K+E K  E +K+ +KK+++ +E++E +   + +   + 
Sbjct: 36  KEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKT 95

Query: 148 EKEKVEMYRNP 158
            K+  +  + P
Sbjct: 96  PKKSKKTKKKP 106



 Score = 32.8 bits (75), Expect = 0.069
 Identities = 15/69 (21%), Positives = 34/69 (49%)

Query: 75  SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
            E   N   ++ EK+K++++E++E + E +     K  KK ++ +K+  K +   + +  
Sbjct: 59  KEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNA 118

Query: 135 EQKNKKKEE 143
             K  +  E
Sbjct: 119 FNKIAELAE 127



 Score = 32.4 bits (74), Expect = 0.091
 Identities = 20/88 (22%), Positives = 43/88 (48%), Gaps = 1/88 (1%)

Query: 64  LTVILPQAIS-ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK 122
            ++IL   +S  S T + ++ +             +E      +EE K      +++K++
Sbjct: 1   FSLILCMILSFFSGTTQKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKE 60

Query: 123 DKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
           DK  E  ++ E+++K KK+++E + E E
Sbjct: 61  DKNNESKKKSEKKKKKKKEKKEPKSEGE 88



 Score = 31.6 bits (72), Expect = 0.17
 Identities = 13/68 (19%), Positives = 32/68 (47%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           + + +             ++      + E K      +KDKKE++  E +++ + KKK++
Sbjct: 18  KSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKK 77

Query: 144 EEEEEKEK 151
           +E++E + 
Sbjct: 78  KEKKEPKS 85



 Score = 30.8 bits (70), Expect = 0.32
 Identities = 17/86 (19%), Positives = 42/86 (48%), Gaps = 6/86 (6%)

Query: 77  TLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQ 136
           T    E +      +++++E++  + K + E+KK++KKE++E K + + +   +  ++ +
Sbjct: 41  TFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSK 100

Query: 137 KNK------KKEEEEEEEKEKVEMYR 156
           K K      K  E+ +    K+    
Sbjct: 101 KTKKKPPKPKPNEDVDNAFNKIAELA 126



 Score = 30.1 bits (68), Expect = 0.49
 Identities = 13/73 (17%), Positives = 36/73 (49%)

Query: 69  PQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
            +     +  E+ +  E++K+K++E++E + E E     +  ++ K+ +++  K K  E+
Sbjct: 55  KKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNED 114

Query: 129 VEEEEEEQKNKKK 141
           V+    +     +
Sbjct: 115 VDNAFNKIAELAE 127



 Score = 28.5 bits (64), Expect = 1.7
 Identities = 10/64 (15%), Positives = 29/64 (45%)

Query: 90  KEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEK 149
           ++ + +             K+      EEE +      + +++E++    KK+ E++++K
Sbjct: 17  QKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKK 76

Query: 150 EKVE 153
           +K +
Sbjct: 77  KKEK 80


>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
          Length = 1437

 Score = 35.2 bits (82), Expect = 0.011
 Identities = 16/58 (27%), Positives = 30/58 (51%)

Query: 92  EEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEK 149
              + + E  +  E+ EK   +KEEE+EK   +  E +++ E E+K + K  + +E  
Sbjct: 157 GILKIDFEIDDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGP 214



 Score = 31.7 bits (73), Expect = 0.15
 Identities = 21/79 (26%), Positives = 41/79 (51%), Gaps = 8/79 (10%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           + +E+ ++ E ++EEE+++  ++  +  +K E E++KQ    + + E   +  K   KEE
Sbjct: 167 DSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPK-EGPVQIGKKIDKEE 225

Query: 144 -------EEEEEKEKVEMY 155
                   EEE +  VE Y
Sbjct: 226 ITPMKEINEEERRVVVEGY 244



 Score = 29.0 bits (66), Expect = 1.1
 Identities = 17/74 (22%), Positives = 41/74 (55%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
           ++++ E E+ E ++EEE+E+  KE  E  +K   +K+++ +    K+      ++ +++ 
Sbjct: 166 DDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPVQIGKKIDKEE 225

Query: 139 KKKEEEEEEEKEKV 152
               +E  EE+ +V
Sbjct: 226 ITPMKEINEEERRV 239


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
           consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 35.0 bits (80), Expect = 0.011
 Identities = 18/66 (27%), Positives = 39/66 (59%), Gaps = 2/66 (3%)

Query: 89  EKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE--EEEQKNKKKEEEEE 146
             +E+++  E +K + E  + + +K +EE  K KD K  ++++E    E++ + KE E +
Sbjct: 268 SPKEDKQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQ 327

Query: 147 EEKEKV 152
           +++E V
Sbjct: 328 KKREPV 333



 Score = 32.3 bits (73), Expect = 0.11
 Identities = 19/67 (28%), Positives = 39/67 (58%)

Query: 87  EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEE 146
           E +K E E+ + E K+ DE+  K ++ K  + +++    E+E E++E E + K++   E+
Sbjct: 277 ENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKREPVAED 336

Query: 147 EEKEKVE 153
            +K K +
Sbjct: 337 LQKTKPQ 343



 Score = 31.5 bits (71), Expect = 0.17
 Identities = 22/81 (27%), Positives = 42/81 (51%), Gaps = 6/81 (7%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDED------EEKKREKKEEEEEKQKDKKEEEVEEE 132
           EN + E E+ + E ++ +EE  K KD        E K  EK+ E++E +  KK E V E+
Sbjct: 277 ENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKREPVAED 336

Query: 133 EEEQKNKKKEEEEEEEKEKVE 153
            ++ K + + +     ++ ++
Sbjct: 337 LQKTKPQVEAQPTSLNEDAID 357



 Score = 30.7 bits (69), Expect = 0.30
 Identities = 19/99 (19%), Positives = 50/99 (50%), Gaps = 7/99 (7%)

Query: 70  QAISASETLENNEWEEEEKEKEEEEEEE-------EEEKEKDEDEEKKREKKEEEEEKQK 122
           +A  A +  +    E  +K++E +   +       +E+K+  E+++++ EK + E +K  
Sbjct: 235 KADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKND 294

Query: 123 DKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYRNPVPM 161
           ++  +  + +  + K + K  E+E E +++E  +   P+
Sbjct: 295 EEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKREPV 333



 Score = 30.0 bits (67), Expect = 0.65
 Identities = 18/68 (26%), Positives = 37/68 (54%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           E E +E  +  ++ +EE +K + +  K ++K +  +   DK+ +EV ++++E KN  K  
Sbjct: 205 ERESQEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNLPKPA 264

Query: 144 EEEEEKEK 151
           +    KE 
Sbjct: 265 DTSSPKED 272



 Score = 29.2 bits (65), Expect = 1.1
 Identities = 15/62 (24%), Positives = 30/62 (48%)

Query: 74  ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
             E     +  +    K+E +  E+E ++K+ + +KKRE   E+ +K K + E +     
Sbjct: 293 NDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKREPVAEDLQKTKPQVEAQPTSLN 352

Query: 134 EE 135
           E+
Sbjct: 353 ED 354



 Score = 28.0 bits (62), Expect = 2.8
 Identities = 16/89 (17%), Positives = 47/89 (52%), Gaps = 5/89 (5%)

Query: 70  QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEE-----EKQKDK 124
           +A    E L+  + + ++ +++ +  ++  +K++DE  +K++E K   +       ++DK
Sbjct: 214 RAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDK 273

Query: 125 KEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
           +  E ++ E E+   + ++ +EE  +  +
Sbjct: 274 QVAENQKREIEKAQIEIKKNDEEALKAKD 302



 Score = 26.9 bits (59), Expect = 6.5
 Identities = 18/91 (19%), Positives = 49/91 (53%), Gaps = 11/91 (12%)

Query: 78  LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE---- 133
           L+  E +E+ K  ++ +EE ++++   +  ++K +  ++  +KQ+D+  ++ +E +    
Sbjct: 203 LKERESQEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNLPK 262

Query: 134 -------EEQKNKKKEEEEEEEKEKVEMYRN 157
                  +E K   + ++ E EK ++E+ +N
Sbjct: 263 PADTSSPKEDKQVAENQKREIEKAQIEIKKN 293


>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
          Length = 470

 Score = 35.1 bits (81), Expect = 0.012
 Identities = 17/71 (23%), Positives = 43/71 (60%), Gaps = 1/71 (1%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEE-EKQKDKKEEEVEEEEEEQKNKKKE 142
              E  + E ++++++ K+K   ++ K+ KK+ ++ +    +   E +E E +Q NKKK+
Sbjct: 1   AMNENGENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKK 60

Query: 143 EEEEEEKEKVE 153
           ++++++K+K  
Sbjct: 61  KKKKKKKKKNL 71



 Score = 33.5 bits (77), Expect = 0.040
 Identities = 14/71 (19%), Positives = 40/71 (56%), Gaps = 3/71 (4%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQ---KNKK 140
             E  E E ++++++ +++  + + KK +K  ++++    +   E +E E +Q   K KK
Sbjct: 2   MNENGENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKK 61

Query: 141 KEEEEEEEKEK 151
           K+++++++   
Sbjct: 62  KKKKKKKKNLG 72



 Score = 30.8 bits (70), Expect = 0.28
 Identities = 13/75 (17%), Positives = 41/75 (54%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
           EN   +++++ K++  +++ ++ +KD D++     +   E ++ + K+   +++++++K 
Sbjct: 7   ENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKK 66

Query: 139 KKKEEEEEEEKEKVE 153
           KKK   E  +     
Sbjct: 67  KKKNLGEAYDLAYDL 81



 Score = 27.8 bits (62), Expect = 3.5
 Identities = 14/91 (15%), Positives = 36/91 (39%), Gaps = 2/91 (2%)

Query: 48  QVGPAQYHKPIRANARLTVILPQAISASETLEN--NEWEEEEKEKEEEEEEEEEEKEKDE 105
           +    +  K  +        L + IS ++  EN  N  ++++K+K+++++   E  +   
Sbjct: 20  KGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKKKKKNLGEAYDLAY 79

Query: 106 DEEKKREKKEEEEEKQKDKKEEEVEEEEEEQ 136
           D          ++     K     E+E ++ 
Sbjct: 80  DLPVVWSSAAFQDNSHIRKLGNWPEQEWKQT 110



 Score = 26.2 bits (58), Expect = 9.9
 Identities = 9/68 (13%), Positives = 33/68 (48%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
           +N + +++  + +    E   E ++ E+++  ++KK+++++K+K    E  +   +    
Sbjct: 25  KNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKKKKKNLGEAYDLAYDLPVV 84

Query: 139 KKKEEEEE 146
                 ++
Sbjct: 85  WSSAAFQD 92


>gnl|CDD|224143 COG1222, RPT1, ATP-dependent 26S proteasome regulatory subunit
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 406

 Score = 34.9 bits (81), Expect = 0.013
 Identities = 17/64 (26%), Positives = 27/64 (42%)

Query: 86  EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
           EE +  +E   + E  E  E   K  + K +  EK+K     E +  E E    K+E + 
Sbjct: 1   EELDALDEILGDLESYEPQEYLNKLEDTKLKLLEKEKRLLLLEEQRLEAEGLRLKREVDR 60

Query: 146 EEEK 149
             E+
Sbjct: 61  LREE 64



 Score = 28.4 bits (64), Expect = 1.7
 Identities = 17/64 (26%), Positives = 31/64 (48%), Gaps = 3/64 (4%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEK---KREKKEEEEEKQKDKKEEEVEEE 132
           + L+    + E  E +E   + E+ K K  ++EK     E++  E E  + K+E +   E
Sbjct: 4   DALDEILGDLESYEPQEYLNKLEDTKLKLLEKEKRLLLLEEQRLEAEGLRLKREVDRLRE 63

Query: 133 EEEQ 136
           E E+
Sbjct: 64  EIER 67



 Score = 26.5 bits (59), Expect = 8.7
 Identities = 12/46 (26%), Positives = 24/46 (52%)

Query: 77  TLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK 122
            LE+ + +  EKEK     EE+  + +    +++ ++  EE E+ K
Sbjct: 24  KLEDTKLKLLEKEKRLLLLEEQRLEAEGLRLKREVDRLREEIERLK 69


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 34.7 bits (80), Expect = 0.014
 Identities = 20/73 (27%), Positives = 40/73 (54%)

Query: 75  SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
           SE L     ++ E+E ++E   E  E E +   +  R K++ + ++ K+K+ +E+E E +
Sbjct: 238 SEGLLEESDDDGEEESDDESAWEGFESEYEPINKPVRPKRKTKAQRNKEKRRKELEREAK 297

Query: 135 EQKNKKKEEEEEE 147
           E+K  KK+  +  
Sbjct: 298 EEKQLKKKLAQLA 310



 Score = 33.5 bits (77), Expect = 0.041
 Identities = 18/73 (24%), Positives = 34/73 (46%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           +E + EK+ +E E  EEK+ ++   +     E  E   ++  ++  EE ++E   +  E 
Sbjct: 205 KEVKAEKKRQELERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESAWEGFES 264

Query: 144 EEEEEKEKVEMYR 156
           E E   + V   R
Sbjct: 265 EYEPINKPVRPKR 277



 Score = 31.6 bits (72), Expect = 0.15
 Identities = 19/72 (26%), Positives = 36/72 (50%)

Query: 87  EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEE 146
           ++E E+E + E++ +E +  EEKK EK   E  +  +  E  +EE +++ + +  +E   
Sbjct: 200 QEEYEKEVKAEKKRQELERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESAW 259

Query: 147 EEKEKVEMYRNP 158
           E  E      N 
Sbjct: 260 EGFESEYEPINK 271



 Score = 31.2 bits (71), Expect = 0.24
 Identities = 17/58 (29%), Positives = 33/58 (56%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
           E+  KE E E +EE++ K+K     + +E  +E  +K+K +  ++ + +E  +K K K
Sbjct: 286 EKRRKELEREAKEEKQLKKKLAQLARLKEIAKEVAQKEKARARKKEQRKERGEKKKLK 343



 Score = 30.4 bits (69), Expect = 0.36
 Identities = 18/62 (29%), Positives = 38/62 (61%)

Query: 80  NNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
           N E   +E E+E +EE++ ++K       K+  K+  ++EK + +K+E+ +E  E++K K
Sbjct: 284 NKEKRRKELEREAKEEKQLKKKLAQLARLKEIAKEVAQKEKARARKKEQRKERGEKKKLK 343

Query: 140 KK 141
           ++
Sbjct: 344 RR 345



 Score = 28.5 bits (64), Expect = 1.9
 Identities = 18/91 (19%), Positives = 41/91 (45%), Gaps = 3/91 (3%)

Query: 60  ANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEE 119
                + +   +    E  +++  EE + E   E  E E E     ++  + ++K + + 
Sbjct: 227 MAPEASRLDEMSEGLLEESDDDGEEESDDESAWEGFESEYEP---INKPVRPKRKTKAQR 283

Query: 120 KQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
            ++ +++E   E +EE++ KKK  +    KE
Sbjct: 284 NKEKRRKELEREAKEEKQLKKKLAQLARLKE 314



 Score = 27.4 bits (61), Expect = 4.5
 Identities = 15/57 (26%), Positives = 34/57 (59%)

Query: 95  EEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
           E+  +E E++  EEK+ +KK  +  + K+  +E  ++E+   + K++ +E  E+K+ 
Sbjct: 286 EKRRKELEREAKEEKQLKKKLAQLARLKEIAKEVAQKEKARARKKEQRKERGEKKKL 342



 Score = 27.0 bits (60), Expect = 5.9
 Identities = 25/109 (22%), Positives = 43/109 (39%), Gaps = 23/109 (21%)

Query: 71  AISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEK--------------------- 109
           A  AS   E +E   EE + + EEE ++E   +  + E                      
Sbjct: 228 APEASRLDEMSEGLLEESDDDGEEESDDESAWEGFESEYEPINKPVRPKRKTKAQRNKEK 287

Query: 110 --KREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYR 156
             K  ++E +EEKQ  KK  ++   +E  K   ++E+    K++    R
Sbjct: 288 RRKELEREAKEEKQLKKKLAQLARLKEIAKEVAQKEKARARKKEQRKER 336



 Score = 26.6 bits (59), Expect = 7.3
 Identities = 19/72 (26%), Positives = 40/72 (55%)

Query: 80  NNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
           N  +E+ ++  +EE E+E + ++K ++ E+  EKK E+   +  + +E  E   EE  + 
Sbjct: 189 NPSFEDHQELLQEEYEKEVKAEKKRQELERVEEKKLEKMAPEASRLDEMSEGLLEESDDD 248

Query: 140 KKEEEEEEEKEK 151
            +EE ++E   +
Sbjct: 249 GEEESDDESAWE 260



 Score = 26.6 bits (59), Expect = 7.8
 Identities = 16/69 (23%), Positives = 32/69 (46%)

Query: 82  EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
           E + E+K +E E  EE++ ++   +  +  E  E   E+  D  EEE ++E   +  + +
Sbjct: 206 EVKAEKKRQELERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESAWEGFESE 265

Query: 142 EEEEEEEKE 150
            E   +   
Sbjct: 266 YEPINKPVR 274


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 35.0 bits (81), Expect = 0.014
 Identities = 21/66 (31%), Positives = 38/66 (57%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           E + +E E + EE E + ++  +E  + E+K EE +++ +  E E+EE E E +  +   
Sbjct: 315 ERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRL 374

Query: 144 EEEEEK 149
           EE EE+
Sbjct: 375 EELEEQ 380



 Score = 34.3 bits (79), Expect = 0.025
 Identities = 23/78 (29%), Positives = 38/78 (48%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
             LE  +    E+    E + EE E + +E E K  E  EE  E ++  +E + E E  E
Sbjct: 298 SRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLE 357

Query: 136 QKNKKKEEEEEEEKEKVE 153
            + ++ E E EE + ++E
Sbjct: 358 AELEELEAELEELESRLE 375



 Score = 34.3 bits (79), Expect = 0.027
 Identities = 24/72 (33%), Positives = 42/72 (58%)

Query: 86  EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
           E + +E E + EE E + DE  E+  E +E+ EE +++ +  E E EE E + ++ E   
Sbjct: 315 ERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRL 374

Query: 146 EEEKEKVEMYRN 157
           EE +E++E  R+
Sbjct: 375 EELEEQLETLRS 386



 Score = 32.7 bits (75), Expect = 0.073
 Identities = 24/100 (24%), Positives = 47/100 (47%), Gaps = 1/100 (1%)

Query: 59  RANARLTVILPQAISASETLENNEWEEEEKEKEEEE-EEEEEEKEKDEDEEKKREKKEEE 117
               RL  +  Q       LE  E + +E  +E  E EE+ EE +++ +  +   ++ E 
Sbjct: 306 ILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEA 365

Query: 118 EEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYRN 157
           E ++ + + EE+EE+ E  ++K  + E +      E+ R 
Sbjct: 366 ELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERL 405



 Score = 32.0 bits (73), Expect = 0.16
 Identities = 19/81 (23%), Positives = 35/81 (43%), Gaps = 2/81 (2%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVE--EEEEEQ 136
           E      E    E++++   E     E + ++ E + EE E + D+  EE+   EE+ E+
Sbjct: 289 ELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEE 348

Query: 137 KNKKKEEEEEEEKEKVEMYRN 157
             ++ E  E E +E       
Sbjct: 349 LKEELESLEAELEELEAELEE 369



 Score = 30.4 bits (69), Expect = 0.49
 Identities = 25/81 (30%), Positives = 38/81 (46%), Gaps = 8/81 (9%)

Query: 82  EWEEEEKEKEEEEEEEEEEK--------EKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
           + EE   E  E EEE EE +        E    E++K+  +E     ++  +E E + EE
Sbjct: 268 KLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEE 327

Query: 134 EEQKNKKKEEEEEEEKEKVEM 154
            E K  +  EE  E +EK+E 
Sbjct: 328 LESKLDELAEELAELEEKLEE 348



 Score = 29.3 bits (66), Expect = 0.97
 Identities = 25/94 (26%), Positives = 44/94 (46%), Gaps = 3/94 (3%)

Query: 63  RLTVILPQAISASETLENNEWEEEEKEKEEEEEEEE---EEKEKDEDEEKKREKKEEEEE 119
           RL  +  Q    SE +E+   E EE E+  EE E E      E+   EE     + E EE
Sbjct: 839 RLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEE 898

Query: 120 KQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
             ++ +E E +  E  ++ ++  E+  + + ++E
Sbjct: 899 LSEELRELESKRSELRRELEELREKLAQLELRLE 932



 Score = 28.1 bits (63), Expect = 3.0
 Identities = 25/76 (32%), Positives = 42/76 (55%)

Query: 77  TLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQ 136
            L   E  EE +E +EE +E EEE E+   E ++ E+K EE   +  + EEE+EE ++E 
Sbjct: 231 VLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKEL 290

Query: 137 KNKKKEEEEEEEKEKV 152
                E    E+++++
Sbjct: 291 YALANEISRLEQQKQI 306



 Score = 27.7 bits (62), Expect = 3.1
 Identities = 22/71 (30%), Positives = 35/71 (49%), Gaps = 1/71 (1%)

Query: 81  NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV-EEEEEEQKNK 139
            E EE+ +E +EE E  E E E+ E E ++ E + EE E+Q +    +V + E +     
Sbjct: 340 AELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLN 399

Query: 140 KKEEEEEEEKE 150
            + E  E   E
Sbjct: 400 NEIERLEARLE 410



 Score = 27.7 bits (62), Expect = 3.7
 Identities = 23/83 (27%), Positives = 37/83 (44%)

Query: 60  ANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEE 119
           A     V   +   A  + E  E E E +E EE  EE EEE  + E E ++ E + E+ +
Sbjct: 736 ARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLK 795

Query: 120 KQKDKKEEEVEEEEEEQKNKKKE 142
           ++     E ++E   E     +E
Sbjct: 796 EELKALREALDELRAELTLLNEE 818



 Score = 27.7 bits (62), Expect = 3.7
 Identities = 15/61 (24%), Positives = 29/61 (47%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
              K+    E E E+ +E+     K+  + E E E+ +++ EE  EE  E +   ++ E 
Sbjct: 730 ALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEA 789

Query: 145 E 145
           +
Sbjct: 790 Q 790



 Score = 27.3 bits (61), Expect = 5.2
 Identities = 21/68 (30%), Positives = 36/68 (52%)

Query: 80  NNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
             E EE E++ EE EE+  E ++   +  K+ E+ EEE E+ + + EE   +    +K+ 
Sbjct: 676 RREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDL 735

Query: 140 KKEEEEEE 147
            + E E E
Sbjct: 736 ARLEAEVE 743



 Score = 26.9 bits (60), Expect = 6.0
 Identities = 21/84 (25%), Positives = 40/84 (47%)

Query: 70  QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
           +A++          EE E+ ++E EE   +     +D  +   + E+ EE+     +E  
Sbjct: 698 KALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELT 757

Query: 130 EEEEEEQKNKKKEEEEEEEKEKVE 153
           E E E ++ +++ EE EEE  + E
Sbjct: 758 ELEAEIEELEERLEEAEEELAEAE 781



 Score = 26.9 bits (60), Expect = 6.5
 Identities = 25/75 (33%), Positives = 36/75 (48%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
           E +   E    E  KE EE EEE E+ + E EE  R+     ++  + + E E  EE   
Sbjct: 691 EKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIA 750

Query: 136 QKNKKKEEEEEEEKE 150
           Q +K+  E E E +E
Sbjct: 751 QLSKELTELEAEIEE 765



 Score = 26.9 bits (60), Expect = 6.9
 Identities = 19/94 (20%), Positives = 38/94 (40%)

Query: 57  PIRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEE 116
               + +++ +          +E  E    +  KE  E E E E+ ++  EE + E  E 
Sbjct: 721 LEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEA 780

Query: 117 EEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
           E E ++ + + E  +EE +   +  +E   E   
Sbjct: 781 EAEIEELEAQIEQLKEELKALREALDELRAELTL 814



 Score = 26.6 bits (59), Expect = 9.1
 Identities = 23/82 (28%), Positives = 39/82 (47%), Gaps = 9/82 (10%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK---------DKKE 126
           E L+    E EE+ +E   E +E E++ +E   +  E +EE EE QK          + E
Sbjct: 242 EELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLE 301

Query: 127 EEVEEEEEEQKNKKKEEEEEEE 148
           ++ +   E   N +++ EE E 
Sbjct: 302 QQKQILRERLANLERQLEELEA 323


>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
           [Transcription / Chromatin structure and dynamics].
          Length = 279

 Score = 34.6 bits (79), Expect = 0.014
 Identities = 29/84 (34%), Positives = 45/84 (53%)

Query: 74  ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
           A   ++N E E  E+    EEEE+EE       E  +   +EEEEE +     E+V + E
Sbjct: 170 AQPDVDNEEEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEEEAEGSDDGEDVVDYE 229

Query: 134 EEQKNKKKEEEEEEEKEKVEMYRN 157
            E+ +KK+ EEEE E+E + ++  
Sbjct: 230 GERIDKKQGEEEEMEEEVINLFEI 253



 Score = 28.0 bits (62), Expect = 2.3
 Identities = 34/86 (39%), Positives = 42/86 (48%), Gaps = 16/86 (18%)

Query: 80  NNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK----------------D 123
           +NE EE  +E +  EEEE+EE   D   E  RE  EEEEE+ +                D
Sbjct: 175 DNEEEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEEEAEGSDDGEDVVDYEGERID 234

Query: 124 KKEEEVEEEEEEQKNKKKEEEEEEEK 149
           KK+ E EE EEE  N  + E EEE  
Sbjct: 235 KKQGEEEEMEEEVINLFEIEWEEESP 260



 Score = 27.3 bits (60), Expect = 4.0
 Identities = 16/73 (21%), Positives = 38/73 (52%)

Query: 81  NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
            + +E    + + + EEEE  E+ +  E++ +++   +   +  +E   EEEEE + +  
Sbjct: 162 EDNDEAPPAQPDVDNEEEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEEEAEGSDD 221

Query: 141 KEEEEEEEKEKVE 153
            E+  + E E+++
Sbjct: 222 GEDVVDYEGERID 234



 Score = 26.1 bits (57), Expect = 8.8
 Identities = 19/64 (29%), Positives = 35/64 (54%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
              E   E  E+EEEE E  ++ E   D E +R  K++ EE++ +++   + E E E+++
Sbjct: 200 SYGEGNRELNEEEEEEAEGSDDGEDVVDYEGERIDKKQGEEEEMEEEVINLFEIEWEEES 259

Query: 139 KKKE 142
             +E
Sbjct: 260 PSEE 263


>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
          Length = 1832

 Score = 35.1 bits (81), Expect = 0.015
 Identities = 18/49 (36%), Positives = 30/49 (61%), Gaps = 5/49 (10%)

Query: 83  WEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEE 131
           W  ++ E +  +  EE+ K+K     KK+EKK+EEE K+++K   E+ E
Sbjct: 728 WNGKDAEFKISDSVEEKTKKK-----KKKEKKKEEEYKREEKARIEIAE 771



 Score = 34.3 bits (79), Expect = 0.021
 Identities = 18/70 (25%), Positives = 38/70 (54%)

Query: 79   ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
             N +   E + + +EE+E   + E + D+EKK   +     ++K+ +E+  E + +++KN
Sbjct: 1512 SNEKKSIEIENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAESDIKKRKN 1571

Query: 139  KKKEEEEEEE 148
            KK+ +   E 
Sbjct: 1572 KKQYKSNTEA 1581



 Score = 33.2 bits (76), Expect = 0.057
 Identities = 19/68 (27%), Positives = 33/68 (48%), Gaps = 5/68 (7%)

Query: 78   LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
            +EN   EE+E   + E E ++E+K   E     +EK  EE+  + D K     + + +++
Sbjct: 1520 IENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAESDIK-----KRKNKKQ 1574

Query: 138  NKKKEEEE 145
             K   E E
Sbjct: 1575 YKSNTEAE 1582



 Score = 33.2 bits (76), Expect = 0.058
 Identities = 20/70 (28%), Positives = 33/70 (47%), Gaps = 2/70 (2%)

Query: 85   EEEKEKEEEEEEEEEEKEKDEDEEKKREK-KEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
              EK+  E E   +EEKE     E + +K K+   E     +E+ +EE+  E    KK +
Sbjct: 1512 SNEKKSIEIENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAESD-IKKRK 1570

Query: 144  EEEEEKEKVE 153
             +++ K   E
Sbjct: 1571 NKKQYKSNTE 1580



 Score = 29.3 bits (66), Expect = 1.1
 Identities = 11/42 (26%), Positives = 27/42 (64%)

Query: 94  EEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
            ++ E +     +E+ K++KK+E++++++ K+EE+   E  E
Sbjct: 730 GKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKARIEIAE 771



 Score = 28.9 bits (65), Expect = 1.4
 Identities = 12/43 (27%), Positives = 26/43 (60%)

Query: 111 REKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
           R    ++ E +     EE  +++++++ KK+EE + EEK ++E
Sbjct: 726 RNWNGKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKARIE 768



 Score = 28.5 bits (64), Expect = 2.2
 Identities = 10/41 (24%), Positives = 24/41 (58%)

Query: 78  LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEE 118
             N   ++ E +  +  EE+ ++K+K E ++++  K+EE+ 
Sbjct: 725 FRNWNGKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKA 765



 Score = 28.1 bits (63), Expect = 2.5
 Identities = 21/71 (29%), Positives = 32/71 (45%), Gaps = 7/71 (9%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREK-------KEEEEEKQKDKKEEE 128
           ET E  E  E E+E + E E   E K   +++E   E+        EE E+  K  + EE
Sbjct: 244 ETSETEERVESEEETDVEIETASETKGTKQEQEGSTEEDPSPSLFSEEREDPDKIDETEE 303

Query: 129 VEEEEEEQKNK 139
           +    +E+  K
Sbjct: 304 IRVNGKEKIKK 314



 Score = 28.1 bits (63), Expect = 2.8
 Identities = 21/94 (22%), Positives = 43/94 (45%), Gaps = 3/94 (3%)

Query: 67   ILPQAISASETLEN-NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKK 125
            I P  I     L N N  E   K+  ++++      EK   E + R ++E+E   Q + +
Sbjct: 1478 IKPWIIPIKLLLLNLNGNENVNKKINQKKKGFIPSNEKKSIEIENRNQEEKEPAGQGELE 1537

Query: 126  EEEVEEEEEEQK--NKKKEEEEEEEKEKVEMYRN 157
             ++ ++   E    N++K  EE+  +  ++  +N
Sbjct: 1538 SDKEKKGNLESVLSNQEKNIEEDYAESDIKKRKN 1571



 Score = 27.4 bits (61), Expect = 4.3
 Identities = 21/62 (33%), Positives = 29/62 (46%), Gaps = 7/62 (11%)

Query: 88  KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
           KE  E EE  E E+E D + E   E K         K+E+E   EE+   +   EE E+ 
Sbjct: 243 KETSETEERVESEEETDVEIETASETK-------GTKQEQEGSTEEDPSPSLFSEEREDP 295

Query: 148 EK 149
           +K
Sbjct: 296 DK 297



 Score = 26.6 bits (59), Expect = 8.7
 Identities = 13/34 (38%), Positives = 24/34 (70%)

Query: 75  SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEE 108
           S+++E    ++++KEK++EEE + EEK + E  E
Sbjct: 738 SDSVEEKTKKKKKKEKKKEEEYKREEKARIEIAE 771


>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex.  This
           entry is characterized by proteins with alternating
           conserved and low-complexity regions. Bud13 together
           with Snu17p and a newly identified factor,
           Pml1p/Ylr016c, form a novel trimeric complex. called The
           RES complex, pre-mRNA retention and splicing complex.
           Subunits of this complex are not essential for viability
           of yeasts but they are required for efficient splicing
           in vitro and in vivo. Furthermore, inactivation of this
           complex causes pre-mRNA leakage from the nucleus. Bud13
           contains a unique, phylogenetically conserved C-terminal
           region of unknown function.
          Length = 141

 Score = 33.4 bits (77), Expect = 0.019
 Identities = 22/65 (33%), Positives = 42/65 (64%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           EE+E+EKEE+E +EE+EKE  +   +K E+++  EE +K K +      ++E  +++ +E
Sbjct: 19  EEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPLARYADDEDYDEELKE 78

Query: 144 EEEEE 148
           +E  +
Sbjct: 79  QERWD 83



 Score = 28.8 bits (65), Expect = 0.78
 Identities = 17/45 (37%), Positives = 31/45 (68%)

Query: 107 EEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
           EEK+ EK+ E+EEK++ +++E+   +   QK ++++  EE EK K
Sbjct: 15  EEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAK 59



 Score = 28.4 bits (64), Expect = 1.1
 Identities = 14/57 (24%), Positives = 38/57 (66%)

Query: 92  EEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEE 148
           EE+ EE+E ++++++ ++++EK+  +   QK+++E+ +EE E+ +        ++E+
Sbjct: 15  EEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPLARYADDED 71



 Score = 28.0 bits (63), Expect = 1.5
 Identities = 21/72 (29%), Positives = 43/72 (59%)

Query: 90  KEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEK 149
           +E+ EE+E E++EK+  EEK++E  +   +K++ +K  E  E+ + +   +  ++E+ ++
Sbjct: 15  EEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPLARYADDEDYDE 74

Query: 150 EKVEMYRNPVPM 161
           E  E  R   PM
Sbjct: 75  ELKEQERWDDPM 86



 Score = 27.3 bits (61), Expect = 2.6
 Identities = 14/48 (29%), Positives = 32/48 (66%)

Query: 106 DEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
           D+  +    EE+ E+++ +KEE+  +EE+E++  K   ++EE ++++E
Sbjct: 6   DKSGRIIDIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLE 53


>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 156 to 174 amino acids in length. This domain is
           found associated with pfam07780, pfam01728.
          Length = 154

 Score = 33.8 bits (78), Expect = 0.019
 Identities = 19/62 (30%), Positives = 37/62 (59%)

Query: 71  AISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVE 130
            +     L+  E EEEE+E+ E EE +EEE+  +  E++  + K E+  + + K++E ++
Sbjct: 89  KVRKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKEILK 148

Query: 131 EE 132
           E+
Sbjct: 149 EQ 150



 Score = 31.9 bits (73), Expect = 0.067
 Identities = 18/54 (33%), Positives = 37/54 (68%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
           +E+E+E+EEE E EE ++E+  DE  ++E  + + EK+++ + ++ E  +E+ K
Sbjct: 99  KEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKEILKEQMK 152



 Score = 31.5 bits (72), Expect = 0.10
 Identities = 21/58 (36%), Positives = 37/58 (63%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
             ++KEKEEEEEEE E +E DE+E+     ++E  + +++K+ E   +++E  K + K
Sbjct: 95  GLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKEILKEQMK 152



 Score = 28.4 bits (64), Expect = 1.3
 Identities = 15/50 (30%), Positives = 33/50 (66%)

Query: 107 EEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYR 156
           +    +KKE+EEE++++ + EE++EEE+  +  +KE  + + +++ E  R
Sbjct: 92  KLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENER 141


>gnl|CDD|236277 PRK08493, PRK08493, NADH dehydrogenase subunit G; Validated.
          Length = 819

 Score = 34.7 bits (80), Expect = 0.019
 Identities = 16/61 (26%), Positives = 30/61 (49%)

Query: 90  KEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEK 149
           +EE   +  EE ++   +E      EE  EK  ++ E+  E +EE +K   K+ ++  E 
Sbjct: 444 EEEAILKSLEEFKQSIVKEAALSILEEIREKVLEQAEQGCENQEEVKKEVPKKVKKIPEV 503

Query: 150 E 150
           +
Sbjct: 504 D 504



 Score = 33.9 bits (78), Expect = 0.027
 Identities = 21/101 (20%), Positives = 45/101 (44%), Gaps = 2/101 (1%)

Query: 54  YHKPIRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREK 113
            H+       L  +L + +     L++ E  ++   KE      EE +EK  ++ ++  +
Sbjct: 425 THEVGAEEIILYFLLKKFLEEEAILKSLEEFKQSIVKEAALSILEEIREKVLEQAEQGCE 484

Query: 114 KEEEEEKQKDKKEEEVEEEEEEQ--KNKKKEEEEEEEKEKV 152
            +EE +K+  KK +++ E +     +     EE  E+ E +
Sbjct: 485 NQEEVKKEVPKKVKKIPEVDTYLLLEELGINEETYEKLEAL 525



 Score = 29.3 bits (66), Expect = 1.1
 Identities = 13/64 (20%), Positives = 30/64 (46%)

Query: 93  EEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKV 152
           EEE   +  E+ +    K       EE ++   E+  +  E +++ KK+  ++ ++  +V
Sbjct: 444 EEEAILKSLEEFKQSIVKEAALSILEEIREKVLEQAEQGCENQEEVKKEVPKKVKKIPEV 503

Query: 153 EMYR 156
           + Y 
Sbjct: 504 DTYL 507



 Score = 28.5 bits (64), Expect = 1.8
 Identities = 11/54 (20%), Positives = 24/54 (44%)

Query: 106 DEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYRNPV 159
           +EE   +  EE ++    +    + EE  E+  ++ E+  E ++E  +     V
Sbjct: 444 EEEAILKSLEEFKQSIVKEAALSILEEIREKVLEQAEQGCENQEEVKKEVPKKV 497


>gnl|CDD|184696 PRK14474, PRK14474, F0F1 ATP synthase subunit B; Provisional.
          Length = 250

 Score = 34.0 bits (78), Expect = 0.020
 Identities = 18/87 (20%), Positives = 44/87 (50%), Gaps = 1/87 (1%)

Query: 70  QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDED-EEKKREKKEEEEEKQKDKKEEE 128
           Q +   +    N W++ E+ ++E  +E E  ++K +  E+++     + +E   ++++  
Sbjct: 32  QVMKKRQQRIANRWQDAEQRQQEAGQEAERYRQKQQSLEQQRASFMAQAQEAADEQRQHL 91

Query: 129 VEEEEEEQKNKKKEEEEEEEKEKVEMY 155
           + E  E+    + E  E+ E+EK E +
Sbjct: 92  LNEAREDVATARDEWLEQLEREKQEFF 118


>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
          Length = 619

 Score = 34.4 bits (80), Expect = 0.022
 Identities = 27/87 (31%), Positives = 46/87 (52%), Gaps = 7/87 (8%)

Query: 74  ASETLENNEWEEEEKEKEEEEEEEEEEKEKD------EDEEKKREK-KEEEEEKQKDKKE 126
           A E   +   E EE + +E+EEEEE+E +        E  EK  EK K   ++ +K +K 
Sbjct: 178 AEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADESELPEKVLEKFKALAKQYKKLRKA 237

Query: 127 EEVEEEEEEQKNKKKEEEEEEEKEKVE 153
           +E + E    ++KK  +  E+ KE+++
Sbjct: 238 QEKKVEGRLAQHKKYAKLREKLKEELK 264



 Score = 33.2 bits (77), Expect = 0.052
 Identities = 16/67 (23%), Positives = 28/67 (41%)

Query: 86  EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
           E  ++ E  E    E      +    E       + ++  ++E EEEEE++ +     +E
Sbjct: 154 EWYDRLENGERRLRELIDGFVDPNAEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADE 213

Query: 146 EEEKEKV 152
            E  EKV
Sbjct: 214 SELPEKV 220



 Score = 31.7 bits (73), Expect = 0.16
 Identities = 16/73 (21%), Positives = 35/73 (47%), Gaps = 1/73 (1%)

Query: 77  TLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQ-KDKKEEEVEEEEEE 135
               +   EE+      E EE ++ E +E+EE + +     +E +  +K  E+ +   ++
Sbjct: 171 DGFVDPNAEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADESELPEKVLEKFKALAKQ 230

Query: 136 QKNKKKEEEEEEE 148
            K  +K +E++ E
Sbjct: 231 YKKLRKAQEKKVE 243



 Score = 26.3 bits (59), Expect = 9.5
 Identities = 9/38 (23%), Positives = 16/38 (42%)

Query: 91  EEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
           EE  + ++    ++E + +  E  EEE        E E
Sbjct: 61  EEAPDADDLLLAENEADAQTDEDAEEEAAAALSSVESE 98


>gnl|CDD|237284 PRK13108, PRK13108, prolipoprotein diacylglyceryl transferase;
           Reviewed.
          Length = 460

 Score = 34.2 bits (78), Expect = 0.022
 Identities = 17/120 (14%), Positives = 36/120 (30%), Gaps = 8/120 (6%)

Query: 37  VSLTDDAEFQCQVGPAQYHKPIRANARLTVILPQAISASETLENNEWEE------EEKEK 90
           ++    A     VGP    +P + +     +  +    ++ +      +      E    
Sbjct: 303 LAAAAVASAASAVGPVGPGEPNQPDDVAEAVKAEVAEVTDEVAAESVVQVADRDGESTPA 362

Query: 91  EEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
            EE  E + E+E+  D     +     +   +       E      +   + E E  EK 
Sbjct: 363 VEETSEADIEREQPGDLAG--QAPAAHQVDAEAASAAPEEPAALASEAHDETEPEVPEKA 420



 Score = 32.3 bits (73), Expect = 0.12
 Identities = 6/52 (11%), Positives = 17/52 (32%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQ 136
             +     + + E      +E      E  +E E +  +K     +  + ++
Sbjct: 380 AGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPEVPEKAAPIPDPAKPDE 431



 Score = 31.5 bits (71), Expect = 0.19
 Identities = 9/60 (15%), Positives = 20/60 (33%)

Query: 87  EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEE 146
           E+E+  +   +     + + E      +E      +   E E E  E+        + +E
Sbjct: 372 EREQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPEVPEKAAPIPDPAKPDE 431



 Score = 31.1 bits (70), Expect = 0.25
 Identities = 6/51 (11%), Positives = 14/51 (27%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
           + + +      EE      +  DE +    ++        K +E       
Sbjct: 388 QVDAEAASAAPEEPAALASEAHDETEPEVPEKAAPIPDPAKPDELAVAGPG 438



 Score = 31.1 bits (70), Expect = 0.29
 Identities = 7/74 (9%), Positives = 23/74 (31%), Gaps = 1/74 (1%)

Query: 73  SASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
           ++   +E  +  +   +     + + E       EE      E  +E + +  E+     
Sbjct: 366 TSEADIEREQPGDLAGQAPAAHQVDAEA-ASAAPEEPAALASEAHDETEPEVPEKAAPIP 424

Query: 133 EEEQKNKKKEEEEE 146
           +  + ++       
Sbjct: 425 DPAKPDELAVAGPG 438



 Score = 29.2 bits (65), Expect = 0.97
 Identities = 11/69 (15%), Positives = 19/69 (27%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           E  E + E E+  +   +     +          EE      E   E E E  +      
Sbjct: 365 ETSEADIEREQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPEVPEKAAPIP 424

Query: 144 EEEEEKEKV 152
           +  +  E  
Sbjct: 425 DPAKPDELA 433



 Score = 29.2 bits (65), Expect = 1.2
 Identities = 11/94 (11%), Positives = 28/94 (29%)

Query: 52  AQYHKPIRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKR 111
           A+  K   A     V     +  ++    +    EE  + + E E+  +         + 
Sbjct: 330 AEAVKAEVAEVTDEVAAESVVQVADRDGESTPAVEETSEADIEREQPGDLAGQAPAAHQV 389

Query: 112 EKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
           + +      ++         +E E +  +K    
Sbjct: 390 DAEAASAAPEEPAALASEAHDETEPEVPEKAAPI 423



 Score = 27.6 bits (61), Expect = 3.2
 Identities = 8/77 (10%), Positives = 19/77 (24%)

Query: 71  AISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVE 130
             +  ET E +   E+  +   +     +   +      +       E   + + E   +
Sbjct: 360 TPAVEETSEADIEREQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPEVPEK 419

Query: 131 EEEEEQKNKKKEEEEEE 147
                   K  E     
Sbjct: 420 AAPIPDPAKPDELAVAG 436


>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
          Length = 1036

 Score = 34.5 bits (79), Expect = 0.023
 Identities = 18/54 (33%), Positives = 36/54 (66%), Gaps = 1/54 (1%)

Query: 86  EEKEKEEEE-EEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
           EEK +E E+  +EE E+E+  +E+++RE+++   E  + + + EVE+  E+ +N
Sbjct: 252 EEKRRELEKLAKEEAERERQAEEQRRREEEKAAMEADRAQAKAEVEKRREKLQN 305



 Score = 31.8 bits (72), Expect = 0.16
 Identities = 14/72 (19%), Positives = 31/72 (43%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
           E+    +EE + + +   +   EE + E + E E+ Q D  +     +EE         +
Sbjct: 50  EKTSNAKEEIQVDFQHNSESAVEEVEAEDEIEVEQNQSDVLKSSSIVKEESISTDMDGID 109

Query: 145 EEEEKEKVEMYR 156
           ++    K+++ R
Sbjct: 110 DDSLDRKLKLER 121



 Score = 30.6 bits (69), Expect = 0.45
 Identities = 20/55 (36%), Positives = 30/55 (54%), Gaps = 4/55 (7%)

Query: 95  EEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEK 149
           EE+  E EK   EE +RE++ EE+ +    +EEE    E ++   K E E+  EK
Sbjct: 252 EEKRRELEKLAKEEAERERQAEEQRR----REEEKAAMEADRAQAKAEVEKRREK 302



 Score = 27.1 bits (60), Expect = 5.8
 Identities = 14/45 (31%), Positives = 27/45 (60%)

Query: 78  LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK 122
           LE    EE E+E++ EE+   EE++   + ++ + K E E+ ++K
Sbjct: 258 LEKLAKEEAERERQAEEQRRREEEKAAMEADRAQAKAEVEKRREK 302



 Score = 26.8 bits (59), Expect = 6.7
 Identities = 18/52 (34%), Positives = 30/52 (57%), Gaps = 3/52 (5%)

Query: 107 EEKKRE-KKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYRN 157
           EEK+RE +K  +EE +++++ EE    EEE+     E +  + K +VE  R 
Sbjct: 252 EEKRRELEKLAKEEAERERQAEEQRRREEEK--AAMEADRAQAKAEVEKRRE 301



 Score = 26.4 bits (58), Expect = 9.0
 Identities = 15/59 (25%), Positives = 27/59 (45%), Gaps = 5/59 (8%)

Query: 67  ILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKK 125
           I+P  I      E   W EEE +   + +EE   +E+    + ++  + + E K+K  K
Sbjct: 417 IVPNNIP-----EELYWVEEEHQIYRKLQEERRLREEAIRAKAEKTARMKAEMKEKTLK 470



 Score = 26.4 bits (58), Expect = 9.4
 Identities = 16/78 (20%), Positives = 31/78 (39%), Gaps = 4/78 (5%)

Query: 81  NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKD----KKEEEVEEEEEEQ 136
                 EK    +EE + + +   E   ++ E ++E E +Q      K    V+EE    
Sbjct: 44  LSVSAVEKTSNAKEEIQVDFQHNSESAVEEVEAEDEIEVEQNQSDVLKSSSIVKEESIST 103

Query: 137 KNKKKEEEEEEEKEKVEM 154
                +++  + K K+E 
Sbjct: 104 DMDGIDDDSLDRKLKLER 121


>gnl|CDD|223649 COG0576, GrpE, Molecular chaperone GrpE (heat shock protein)
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 193

 Score = 33.8 bits (78), Expect = 0.024
 Identities = 22/75 (29%), Positives = 45/75 (60%)

Query: 75  SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
            +  E  + EE E+ ++ EEEE EEE+ ++E+E ++ +++  E E Q ++ +++    + 
Sbjct: 5   EQKTEEPDAEETEEAEKSEEEEAEEEEPEEENELEEEQQEIAELEAQLEELKDKYLRAQA 64

Query: 135 EQKNKKKEEEEEEEK 149
           E +N +K  E E E+
Sbjct: 65  EFENLRKRTEREREE 79



 Score = 31.1 bits (71), Expect = 0.16
 Identities = 22/57 (38%), Positives = 39/57 (68%)

Query: 96  EEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKV 152
           ++E++ E+ + EE +  +K EEEE ++++ EEE E EEE+Q+  + E + EE K+K 
Sbjct: 3   DKEQKTEEPDAEETEEAEKSEEEEAEEEEPEEENELEEEQQEIAELEAQLEELKDKY 59



 Score = 28.4 bits (64), Expect = 1.4
 Identities = 26/65 (40%), Positives = 43/65 (66%), Gaps = 1/65 (1%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE-EEVEEEEEEQKNKKKEE 143
           ++E++ EE + EE EE EK E+EE + E+ EEE E +++++E  E+E + EE K+K    
Sbjct: 3   DKEQKTEEPDAEETEEAEKSEEEEAEEEEPEEENELEEEQQEIAELEAQLEELKDKYLRA 62

Query: 144 EEEEE 148
           + E E
Sbjct: 63  QAEFE 67


>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
           (TAF4) is one of several TAFs that bind TBP and is
           involved in forming Transcription Factor IID (TFIID)
           complex.  The TATA Binding Protein (TBP) Associated
           Factor 4 (TAF4) is one of several TAFs that bind TBP and
           are involved in forming the Transcription Factor IID
           (TFIID) complex. TFIID is one of seven General
           Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
           TFIIF, and TFIID) that are involved in accurate
           initiation of transcription by RNA polymerase II in
           eukaryote. TFIID plays an important role in the
           recognition of promoter DNA and assembly of the
           pre-initiation complex. TFIID complex is composed of the
           TBP and at least 13 TAFs. TAFs from various species were
           originally named by their predicted molecular weight or
           their electrophoretic mobility in polyacrylamide gels. A
           new, unified nomenclature for the pol II TAFs has been
           suggested to show the relationship between TAF orthologs
           and paralogs. Several hypotheses are proposed for TAFs
           functions such as serving as activator-binding sites,
           core-promoter recognition or a role in essential
           catalytic activity. Each TAF, with the help of a
           specific activator, is required only for the expression
           of subset of genes and is not universally involved for
           transcription as are GTFs. In yeast and human cells,
           TAFs have been found as components of other complexes
           besides TFIID.   Several TAFs interact via histone-fold
           (HFD) motifs; HFD is the interaction motif involved in
           heterodimerization of the core histones and their
           assembly into nucleosome octamers. The minimal HFD
           contains three alpha-helices linked by two loops and is
           found in core histones, TAFS and many other
           transcription factors. TFIID has a histone octamer-like
           substructure. TAF4 domain interacts with TAF12 and makes
           a novel histone-like heterodimer that binds DNA and has
           a core promoter function of a subset of genes.
          Length = 212

 Score = 33.8 bits (78), Expect = 0.024
 Identities = 20/52 (38%), Positives = 32/52 (61%), Gaps = 2/52 (3%)

Query: 86  EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
           E+ E+EEEE+ +EEE+E+     K R   E+   KQK K+ ++ E+EE   +
Sbjct: 120 EQLEREEEEKRDEEERERLLRAAKSRS--EQSRLKQKAKEMQKEEDEEMRHR 169



 Score = 32.7 bits (75), Expect = 0.050
 Identities = 16/53 (30%), Positives = 33/53 (62%)

Query: 88  KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
           +  E+ E EEEE+++++E E   R  K   E+ +  +K +E+++EE+E+   +
Sbjct: 117 RFLEQLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEEMRHR 169



 Score = 29.6 bits (67), Expect = 0.61
 Identities = 14/45 (31%), Positives = 27/45 (60%)

Query: 102 EKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEE 146
           + + +EE+KR+++E E   +  K   E    +++ K  +KEE+EE
Sbjct: 121 QLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEE 165



 Score = 26.5 bits (59), Expect = 7.1
 Identities = 15/44 (34%), Positives = 23/44 (52%), Gaps = 5/44 (11%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREK-----KEEEEEKQK 122
           EEEEK  EEE E      +   ++ + ++K     KEE+EE + 
Sbjct: 125 EEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEEMRH 168


>gnl|CDD|218517 pfam05236, TAF4, Transcription initiation factor TFIID component
           TAF4 family.  This region of similarity is found in
           Transcription initiation factor TFIID component TAF4.
          Length = 255

 Score = 33.9 bits (78), Expect = 0.025
 Identities = 18/55 (32%), Positives = 25/55 (45%), Gaps = 6/55 (10%)

Query: 83  WEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
               +K+KEEEE   E         E   E  E+   KQK K+E++ E EE   +
Sbjct: 116 RFLAQKQKEEEERRVERR------RELGLEDPEQLRLKQKAKEEQKAESEETRHR 164



 Score = 30.1 bits (68), Expect = 0.47
 Identities = 13/43 (30%), Positives = 28/43 (65%)

Query: 112 EKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEM 154
            +K++EEE+++ ++  E+  E+ EQ   K++ +EE++ E  E 
Sbjct: 119 AQKQKEEEERRVERRRELGLEDPEQLRLKQKAKEEQKAESEET 161



 Score = 28.1 bits (63), Expect = 2.2
 Identities = 12/47 (25%), Positives = 26/47 (55%), Gaps = 6/47 (12%)

Query: 102 EKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEE 148
           +K ++EE++R ++  E   +        E+   +QK K++++ E EE
Sbjct: 120 QKQKEEEERRVERRRELGLEDP------EQLRLKQKAKEEQKAESEE 160



 Score = 27.7 bits (62), Expect = 2.4
 Identities = 15/48 (31%), Positives = 24/48 (50%), Gaps = 2/48 (4%)

Query: 78  LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKRE--KKEEEEEKQKD 123
           L   + EEEE+  E   E   E+ E+   ++K +E  K E EE + + 
Sbjct: 118 LAQKQKEEEERRVERRRELGLEDPEQLRLKQKAKEEQKAESEETRHRA 165


>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
           eukaryotic snRNP [Transcription].
          Length = 564

 Score = 34.3 bits (79), Expect = 0.025
 Identities = 17/74 (22%), Positives = 36/74 (48%), Gaps = 1/74 (1%)

Query: 87  EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE-EEVEEEEEEQKNKKKEEEE 145
            K+ +  +   + +  + ++     E  +   EK + KK  EE+ EE  E+   K ++++
Sbjct: 372 YKKLKGAKVNLDRQLSELKEAIAYYESAKTALEKAEGKKAIEEIREELIEEGLLKSKKKK 431

Query: 146 EEEKEKVEMYRNPV 159
            ++KE  E +R  V
Sbjct: 432 RKKKEWFEKFRWFV 445



 Score = 30.4 bits (69), Expect = 0.37
 Identities = 18/66 (27%), Positives = 31/66 (46%), Gaps = 4/66 (6%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           +     +  E +E     E  +   +K E K+  EE     +EE +EE   + K KK+++
Sbjct: 379 KVNLDRQLSELKEAIAYYESAKTALEKAEGKKAIEE----IREELIEEGLLKSKKKKRKK 434

Query: 144 EEEEEK 149
           +E  EK
Sbjct: 435 KEWFEK 440



 Score = 30.1 bits (68), Expect = 0.63
 Identities = 16/65 (24%), Positives = 33/65 (50%), Gaps = 1/65 (1%)

Query: 82  EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
            + + EK   E  +E+ E  +  +   +  +K E+E +K +  K E+ E+E EE +   +
Sbjct: 259 AYADLEKLFNEALDEKFERDKIKQLASELEKKLEKELKKLE-NKLEKQEDELEELEKAAE 317

Query: 142 EEEEE 146
           E  ++
Sbjct: 318 ELRQK 322



 Score = 29.3 bits (66), Expect = 1.00
 Identities = 14/79 (17%), Positives = 34/79 (43%), Gaps = 8/79 (10%)

Query: 83  WEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE-------- 134
            +    E+ + E ++ +   ++     K+ KK +  +   D++  E++E           
Sbjct: 343 ADFYGNEEIKIELDKSKTPSENAQRYFKKYKKLKGAKVNLDRQLSELKEAIAYYESAKTA 402

Query: 135 EQKNKKKEEEEEEEKEKVE 153
            +K + K+  EE  +E +E
Sbjct: 403 LEKAEGKKAIEEIREELIE 421


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 34.1 bits (78), Expect = 0.026
 Identities = 23/77 (29%), Positives = 51/77 (66%)

Query: 75  SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
            E  +    EE++K+KE+ +EE ++ K K+E +EK+  K++E+E+++K ++  + EEE++
Sbjct: 106 EEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKK 165

Query: 135 EQKNKKKEEEEEEEKEK 151
            ++ + K   ++  K+K
Sbjct: 166 RERVRAKSRPKKPPKKK 182



 Score = 33.3 bits (76), Expect = 0.045
 Identities = 26/68 (38%), Positives = 48/68 (70%)

Query: 86  EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
            ++ K E  +EEE+EKE+ ++E+KK+++K +EE K +  KEE  E+   ++K K+KE++ 
Sbjct: 95  AKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKV 154

Query: 146 EEEKEKVE 153
           EE +++ E
Sbjct: 155 EEPRDREE 162



 Score = 33.3 bits (76), Expect = 0.046
 Identities = 24/85 (28%), Positives = 52/85 (61%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
           E  E  + +EE+K+K+E+ +EE ++++  E+ ++KR  KE+E+EK+K  +E    EEE++
Sbjct: 106 EEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKK 165

Query: 136 QKNKKKEEEEEEEKEKVEMYRNPVP 160
           ++  + +   ++  +K    +   P
Sbjct: 166 RERVRAKSRPKKPPKKKPPNKKKEP 190



 Score = 32.9 bits (75), Expect = 0.068
 Identities = 21/65 (32%), Positives = 46/65 (70%)

Query: 87  EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEE 146
            K  +E + E  +E+EK++++ K+ +KK++E+ K++ K  +  EE +E++  K+KE+E+E
Sbjct: 92  TKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKE 151

Query: 147 EEKEK 151
           ++ E+
Sbjct: 152 KKVEE 156



 Score = 32.9 bits (75), Expect = 0.070
 Identities = 23/70 (32%), Positives = 48/70 (68%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
            E  KE+E+E+E+ +EEK+K +++ K+  K  + +E+ K+K+  + +E+E+E+K ++  +
Sbjct: 100 NESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRD 159

Query: 144 EEEEEKEKVE 153
            EEE+K +  
Sbjct: 160 REEEKKRERV 169



 Score = 32.6 bits (74), Expect = 0.074
 Identities = 20/70 (28%), Positives = 50/70 (71%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           +E + E  +EEE+E+E+ ++++ ++K++ K+E ++ K K++ +E+   +E+E++ +KK E
Sbjct: 96  KEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVE 155

Query: 144 EEEEEKEKVE 153
           E  + +E+ +
Sbjct: 156 EPRDREEEKK 165



 Score = 32.6 bits (74), Expect = 0.093
 Identities = 28/79 (35%), Positives = 54/79 (68%)

Query: 73  SASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
             ++  +N   +EEEKEKE+ +EE++++KEK ++E K R+ KEE +EK+  K++E+ +E+
Sbjct: 93  KPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEK 152

Query: 133 EEEQKNKKKEEEEEEEKEK 151
           + E+   ++EE++ E    
Sbjct: 153 KVEEPRDREEEKKRERVRA 171



 Score = 31.8 bits (72), Expect = 0.14
 Identities = 24/81 (29%), Positives = 49/81 (60%)

Query: 87  EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEE 146
            K K  +E + E  KE+++++E+ +E+K++++EK K++ ++   +EE ++K   KE+E+E
Sbjct: 90  AKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKE 149

Query: 147 EEKEKVEMYRNPVPMYRNPVP 167
           +EK+  E         R  V 
Sbjct: 150 KEKKVEEPRDREEEKKRERVR 170



 Score = 31.8 bits (72), Expect = 0.17
 Identities = 18/75 (24%), Positives = 40/75 (53%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
            EEEK++E    +   +K   +    K+++  EEE++++  +E    + EE   N+++E+
Sbjct: 160 REEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREK 219

Query: 144 EEEEEKEKVEMYRNP 158
           EE++ K++       
Sbjct: 220 EEDDGKDRETTTSPM 234



 Score = 29.9 bits (67), Expect = 0.63
 Identities = 20/70 (28%), Positives = 42/70 (60%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           + +E+ K+ + +EE +EK   +++EK++EKK EE   ++++K+ E    +   K   K++
Sbjct: 123 KPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKK 182

Query: 144 EEEEEKEKVE 153
              ++KE  E
Sbjct: 183 PPNKKKEPPE 192



 Score = 29.5 bits (66), Expect = 0.88
 Identities = 20/69 (28%), Positives = 46/69 (66%), Gaps = 1/69 (1%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREK-KEEEEEKQKDKKEEEVEEEEEEQKNKKKE 142
           E ++++ +EE +E+   KEK++++EKK E+ ++ EEEK++++   +   ++  +K    +
Sbjct: 127 EPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNK 186

Query: 143 EEEEEEKEK 151
           ++E  E+EK
Sbjct: 187 KKEPPEEEK 195



 Score = 27.9 bits (62), Expect = 3.1
 Identities = 20/78 (25%), Positives = 42/78 (53%)

Query: 74  ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
           A         +E  K  E+   +    K K   E K    KEEE+EK++ K+E++ ++E+
Sbjct: 64  AKCAESKLSSDEAVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEK 123

Query: 134 EEQKNKKKEEEEEEEKEK 151
            +++ K ++ +EE ++++
Sbjct: 124 PKEEPKDRKPKEEAKEKR 141



 Score = 27.5 bits (61), Expect = 3.5
 Identities = 15/70 (21%), Positives = 43/70 (61%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           E+E++++++ EE  + E+EK  +  + + + ++  +K+   K++E  EEE++++  ++  
Sbjct: 145 EKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAV 204

Query: 144 EEEEEKEKVE 153
           + + E+  V 
Sbjct: 205 KGKPEEPDVN 214



 Score = 27.2 bits (60), Expect = 5.6
 Identities = 21/87 (24%), Positives = 47/87 (54%), Gaps = 5/87 (5%)

Query: 69  PQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKK-----EEEEEKQKD 123
           P+        +    E+   +++E+E+E++ E+ +D +EEKKRE+       ++  K+K 
Sbjct: 124 PKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKP 183

Query: 124 KKEEEVEEEEEEQKNKKKEEEEEEEKE 150
             +++   EEE+Q+   +E  + + +E
Sbjct: 184 PNKKKEPPEEEKQRQAAREAVKGKPEE 210


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 34.3 bits (79), Expect = 0.026
 Identities = 17/71 (23%), Positives = 38/71 (53%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
           +  + +E+ K  E+E E    +KE+ E+E ++ E    + E +    ++E +E E + + 
Sbjct: 841 QRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRE 900

Query: 139 KKKEEEEEEEK 149
            +++ EE E +
Sbjct: 901 LERKIEELEAQ 911



 Score = 31.2 bits (71), Expect = 0.27
 Identities = 26/89 (29%), Positives = 44/89 (49%), Gaps = 1/89 (1%)

Query: 73  SASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE-EEVEE 131
             S+ L +   +  E EKE E+ E+EEEK K+  EE + +    E+E +  K E +E+E 
Sbjct: 706 ELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEA 765

Query: 132 EEEEQKNKKKEEEEEEEKEKVEMYRNPVP 160
             EE +    + EE     +  +  + +P
Sbjct: 766 RIEELEEDLHKLEEALNDLEARLSHSRIP 794



 Score = 29.7 bits (67), Expect = 0.92
 Identities = 19/81 (23%), Positives = 34/81 (41%), Gaps = 2/81 (2%)

Query: 82  EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK--NK 139
           E EEE +E E    + E      + E  + E +  E E++ ++ E ++E++ +       
Sbjct: 865 ELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKA 924

Query: 140 KKEEEEEEEKEKVEMYRNPVP 160
           K E  EEE  E  +       
Sbjct: 925 KLEALEEELSEIEDPKGEDEE 945



 Score = 29.3 bits (66), Expect = 1.2
 Identities = 24/68 (35%), Positives = 41/68 (60%)

Query: 82  EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
           E E E ++ EE   + E E +K   E ++ E++ EEE K++DK  EE  E +EE ++ + 
Sbjct: 312 EKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRA 371

Query: 142 EEEEEEEK 149
           E EE +++
Sbjct: 372 ELEEVDKE 379



 Score = 28.1 bits (63), Expect = 2.8
 Identities = 17/65 (26%), Positives = 31/65 (47%)

Query: 87  EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEE 146
           E    E E++      + E  EK+ ++ +E+    K++ +   +E E     K++ EEE 
Sbjct: 811 EARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEEL 870

Query: 147 EEKEK 151
           EE E 
Sbjct: 871 EELEA 875



 Score = 28.1 bits (63), Expect = 3.0
 Identities = 23/71 (32%), Positives = 34/71 (47%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
            E E E    E    E++ + ED E++  K E E +K   + EE   E EEE+K + K  
Sbjct: 297 GELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLT 356

Query: 144 EEEEEKEKVEM 154
           EE  E ++   
Sbjct: 357 EEYAELKEELE 367



 Score = 27.3 bits (61), Expect = 4.7
 Identities = 21/79 (26%), Positives = 39/79 (49%)

Query: 75  SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
            E  +  E    E+EK E  +   +EK + E  E  +EK+  E +K+  +++    EEE 
Sbjct: 194 DEKRQQLERLRREREKAERYQALLKEKREYEGYELLKEKEALERQKEAIERQLASLEEEL 253

Query: 135 EQKNKKKEEEEEEEKEKVE 153
           E+  ++  E E+  +E  +
Sbjct: 254 EKLTEEISELEKRLEEIEQ 272



 Score = 26.6 bits (59), Expect = 8.0
 Identities = 17/64 (26%), Positives = 33/64 (51%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           E +  E EEE+E++  E +K E + ++      + E++    +EE +  E+E    ++E 
Sbjct: 433 EAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQREL 492

Query: 144 EEEE 147
            E E
Sbjct: 493 AEAE 496



 Score = 26.6 bits (59), Expect = 8.8
 Identities = 18/73 (24%), Positives = 34/73 (46%), Gaps = 1/73 (1%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           +EEEK KE  EE EE+    +++ E  + + +E E +  ++ EE++ + EE   + +   
Sbjct: 730 QEEEKLKERLEELEEDLSSLEQEIENVKSELKELEAR-IEELEEDLHKLEEALNDLEARL 788

Query: 144 EEEEEKEKVEMYR 156
                 E      
Sbjct: 789 SHSRIPEIQAELS 801



 Score = 26.6 bits (59), Expect = 9.1
 Identities = 24/77 (31%), Positives = 39/77 (50%), Gaps = 5/77 (6%)

Query: 80  NNEWEEEEKEKEEEE----EEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVE-EEEE 134
             E  + E+E    E    E E++      ++E   ++ +E +E++ D KE+    E+E 
Sbjct: 797 QAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEI 856

Query: 135 EQKNKKKEEEEEEEKEK 151
           E  N KKEE EEE +E 
Sbjct: 857 ENLNGKKEELEEELEEL 873



 Score = 26.6 bits (59), Expect = 9.1
 Identities = 25/88 (28%), Positives = 43/88 (48%), Gaps = 4/88 (4%)

Query: 70  QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKRE----KKEEEEEKQKDKK 125
           + I         E E  EKE +E +E+  + KE+ +  EK+ E    KKEE EE+ ++ +
Sbjct: 815 REIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELE 874

Query: 126 EEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
               + E      KK+ +E E +  ++E
Sbjct: 875 AALRDLESRLGDLKKERDELEAQLRELE 902


>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
           [Transcription / DNA replication, recombination, and
           repair / Chromatin structure and dynamics].
          Length = 1001

 Score = 34.2 bits (78), Expect = 0.026
 Identities = 18/66 (27%), Positives = 35/66 (53%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
           E++E EEE  E E   ++E +E ++DE+ ++  E   E+E +     EE+ E+ +E +  
Sbjct: 930 ESDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEEDGEDWDELESK 989

Query: 139 KKKEEE 144
              +  
Sbjct: 990 AAYDSR 995



 Score = 33.1 bits (75), Expect = 0.062
 Identities = 17/63 (26%), Positives = 35/63 (55%)

Query: 86  EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
            + E +E EEE  E +   +DE  + ++ EE +E  +D  E+E E +  ++++ +  +E 
Sbjct: 927 SDDESDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEEDGEDWDEL 986

Query: 146 EEE 148
           E +
Sbjct: 987 ESK 989



 Score = 33.1 bits (75), Expect = 0.070
 Identities = 17/68 (25%), Positives = 33/68 (48%)

Query: 84   EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
            E EE+  E E   ++E  E DEDEE     ++  E++ ++   +E + E+ ++   K   
Sbjct: 933  ESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEEDGEDWDELESKAAY 992

Query: 144  EEEEEKEK 151
            +    K +
Sbjct: 993  DSRPGKRR 1000



 Score = 32.3 bits (73), Expect = 0.099
 Identities = 19/71 (26%), Positives = 35/71 (49%), Gaps = 1/71 (1%)

Query: 73   SASETLENNEWEEEEKEKEEEEEEEEEEKEKDED-EEKKREKKEEEEEKQKDKKEEEVEE 131
            S     E +E+E    ++ +E +E+EE  E  ED  E + E    +EE  +D  E E + 
Sbjct: 931  SDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEEDGEDWDELESKA 990

Query: 132  EEEEQKNKKKE 142
              + +  K+++
Sbjct: 991  AYDSRPGKRRK 1001



 Score = 31.5 bits (71), Expect = 0.22
 Identities = 15/65 (23%), Positives = 31/65 (47%)

Query: 89  EKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEE 148
           ++ +E EEE  E E   D+E     ++EE ++  +   E+  E +   +   ++ +E E 
Sbjct: 929 DESDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEEDGEDWDELES 988

Query: 149 KEKVE 153
           K   +
Sbjct: 989 KAAYD 993



 Score = 30.8 bits (69), Expect = 0.32
 Identities = 19/75 (25%), Positives = 41/75 (54%), Gaps = 2/75 (2%)

Query: 79   ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKE--EEEEKQKDKKEEEVEEEEEEQ 136
             ++E +E E+E  E E   ++E ++ +++E+  E  E   E+E + D  +EE  E+ +E 
Sbjct: 927  SDDESDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEEDGEDWDEL 986

Query: 137  KNKKKEEEEEEEKEK 151
            ++K   +    ++ K
Sbjct: 987  ESKAAYDSRPGKRRK 1001



 Score = 29.6 bits (66), Expect = 0.76
 Identities = 16/65 (24%), Positives = 33/65 (50%)

Query: 91  EEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
           ++E +E EEE  + E        + +E+E+  +  E+  E+E E   + +++ E+ +E E
Sbjct: 928 DDESDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEEDGEDWDELE 987

Query: 151 KVEMY 155
               Y
Sbjct: 988 SKAAY 992


>gnl|CDD|236410 PRK09202, nusA, transcription elongation factor NusA; Validated.
          Length = 470

 Score = 34.1 bits (79), Expect = 0.028
 Identities = 17/67 (25%), Positives = 31/67 (46%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           E  E+ KE  E E   ++EK  D+    E  + E   +  +K  +  E+  EQ   +  +
Sbjct: 404 ELRERAKEALETEALAQEEKLADDLLSLEGLDRELAFKLAEKGIKTLEDLAEQAVDELID 463

Query: 144 EEEEEKE 150
            E +E++
Sbjct: 464 IEGDEEK 470



 Score = 32.9 bits (76), Expect = 0.058
 Identities = 17/85 (20%), Positives = 38/85 (44%)

Query: 66  VILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKK 125
           +++ +  S+ E L     EE  + +  +EE  EE +E+ ++  +     +EE+       
Sbjct: 371 LLVEEGFSSLEELAYVPVEELLEIEGFDEETVEELRERAKEALETEALAQEEKLADDLLS 430

Query: 126 EEEVEEEEEEQKNKKKEEEEEEEKE 150
            E ++ E   +  +K  +  E+  E
Sbjct: 431 LEGLDRELAFKLAEKGIKTLEDLAE 455



 Score = 30.2 bits (69), Expect = 0.52
 Identities = 20/79 (25%), Positives = 33/79 (41%), Gaps = 1/79 (1%)

Query: 71  AISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVE 130
            I   +     E  E  KE  E E   +EEK  D+    +     E   K  +K  + +E
Sbjct: 393 EIEGFDEETVEELRERAKEALETEALAQEEKLADDLLSLE-GLDRELAFKLAEKGIKTLE 451

Query: 131 EEEEEQKNKKKEEEEEEEK 149
           +  E+  ++  + E +EEK
Sbjct: 452 DLAEQAVDELIDIEGDEEK 470



 Score = 28.7 bits (65), Expect = 1.7
 Identities = 15/68 (22%), Positives = 32/68 (47%)

Query: 86  EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
           EE      EE  E E   +E  E+ RE+ +E  E +   +EE++ ++    +   +E   
Sbjct: 381 EELAYVPVEELLEIEGFDEETVEELRERAKEALETEALAQEEKLADDLLSLEGLDRELAF 440

Query: 146 EEEKEKVE 153
           +  ++ ++
Sbjct: 441 KLAEKGIK 448



 Score = 27.5 bits (62), Expect = 3.3
 Identities = 15/84 (17%), Positives = 34/84 (40%)

Query: 70  QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
           +A    +   +   +   +  + +EE  +   E+     ++      EE  + +  +EE 
Sbjct: 342 EASEKRQAEFDAILDLFMEALDIDEEIAQLLVEEGFSSLEELAYVPVEELLEIEGFDEET 401

Query: 130 EEEEEEQKNKKKEEEEEEEKEKVE 153
            EE  E+  +  E E   ++EK+ 
Sbjct: 402 VEELRERAKEALETEALAQEEKLA 425



 Score = 27.1 bits (61), Expect = 4.6
 Identities = 17/80 (21%), Positives = 33/80 (41%)

Query: 75  SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
            E L+ +E   +   +E     EE      E+  +     EE  E+ +++ +E +E E  
Sbjct: 359 MEALDIDEEIAQLLVEEGFSSLEELAYVPVEELLEIEGFDEETVEELRERAKEALETEAL 418

Query: 135 EQKNKKKEEEEEEEKEKVEM 154
            Q+ K  ++    E    E+
Sbjct: 419 AQEEKLADDLLSLEGLDREL 438


>gnl|CDD|184287 PRK13735, PRK13735, conjugal transfer mating pair stabilization
           protein TraG; Provisional.
          Length = 942

 Score = 34.0 bits (78), Expect = 0.028
 Identities = 14/75 (18%), Positives = 27/75 (36%), Gaps = 6/75 (8%)

Query: 84  EEEEKEKEEEEEEEEEEKE-KDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKE 142
           E E    + +     EE   K +  E +   K E   +     EE     +  Q+     
Sbjct: 869 EYEGNIGDTQNSIRGEENTVKGQYSELQNHHKTEALSQNNKYNEE-----KSAQERMPGA 923

Query: 143 EEEEEEKEKVEMYRN 157
           +  EE  ++ + Y++
Sbjct: 924 DSPEELMKRAKEYQD 938



 Score = 33.6 bits (77), Expect = 0.035
 Identities = 9/56 (16%), Positives = 20/56 (35%)

Query: 70  QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKK 125
             +    +   N  + E   +  +  EE+  +E+    +   E  +  +E Q   K
Sbjct: 886 NTVKGQYSELQNHHKTEALSQNNKYNEEKSAQERMPGADSPEELMKRAKEYQDKHK 941



 Score = 33.2 bits (76), Expect = 0.055
 Identities = 12/60 (20%), Positives = 24/60 (40%)

Query: 90  KEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEK 149
           K + +    E +    D +     +E   + Q  + +   + E   Q NK  EE+  +E+
Sbjct: 860 KHQVDNMVTEYEGNIGDTQNSIRGEENTVKGQYSELQNHHKTEALSQNNKYNEEKSAQER 919



 Score = 32.4 bits (74), Expect = 0.10
 Identities = 13/58 (22%), Positives = 24/58 (41%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
           EE   + +  E +   + E      K  E+K  +E        EE+ +  +E ++K K
Sbjct: 884 EENTVKGQYSELQNHHKTEALSQNNKYNEEKSAQERMPGADSPEELMKRAKEYQDKHK 941



 Score = 28.9 bits (65), Expect = 1.6
 Identities = 9/51 (17%), Positives = 20/51 (39%)

Query: 75  SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKK 125
              L+N+   E   +  +  EE+  ++     +  +   K  +E + K K 
Sbjct: 892 YSELQNHHKTEALSQNNKYNEEKSAQERMPGADSPEELMKRAKEYQDKHKG 942



 Score = 27.8 bits (62), Expect = 3.4
 Identities = 9/57 (15%), Positives = 23/57 (40%)

Query: 70  QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE 126
            +I   E     ++ E +   + E   +  +  +++  +++    +  EE  K  KE
Sbjct: 879 NSIRGEENTVKGQYSELQNHHKTEALSQNNKYNEEKSAQERMPGADSPEELMKRAKE 935


>gnl|CDD|221250 pfam11831, Myb_Cef, pre-mRNA splicing factor component.  This
           family is a region of the Myb-Related Cdc5p/Cef1
           proteins, in fungi, and is part of the pre-mRNA splicing
           factor complex.
          Length = 363

 Score = 33.9 bits (78), Expect = 0.030
 Identities = 17/46 (36%), Positives = 30/46 (65%)

Query: 81  NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE 126
           NE+E E  E+EEEE EE EE+ +++  ++   K+  EE K++++  
Sbjct: 143 NEFELELPEEEEEEPEEMEEELEEDAADRDARKRAAEEAKEQEELR 188


>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34.  This family represents
           herpes virus protein U79 and cytomegalovirus early
           phosphoprotein P34 (UL112).
          Length = 238

 Score = 33.7 bits (77), Expect = 0.030
 Identities = 17/74 (22%), Positives = 33/74 (44%), Gaps = 7/74 (9%)

Query: 90  KEEEEEEEEEEKEKDEDEEKKREKKE-EEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEE 148
           K++E+   E+ ++  ED  KK+E+K   +E+K+              Q     ++E  +E
Sbjct: 165 KQKEKRRVEDSQKHKEDRRKKQEEKRRNDEDKRPGGG----GGSSGGQSGLSTKDEPPKE 220

Query: 149 KEK--VEMYRNPVP 160
           K +   +  R   P
Sbjct: 221 KRQKHHDPERRLEP 234



 Score = 33.3 bits (76), Expect = 0.033
 Identities = 13/65 (20%), Positives = 31/65 (47%)

Query: 86  EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
           E+ +K +E+  +++E+++  DE+K+          Q     ++   +E+ QK+   E   
Sbjct: 173 EDSQKHKEDRRKKQEEKRRNDEDKRPGGGGGSSGGQSGLSTKDEPPKEKRQKHHDPERRL 232

Query: 146 EEEKE 150
           E +  
Sbjct: 233 EPQSH 237


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 728

 Score = 33.9 bits (78), Expect = 0.031
 Identities = 18/70 (25%), Positives = 37/70 (52%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
             +++E + E EE   E E +E+ +++  ++  ++   + K   E  E+E E K  KKE 
Sbjct: 384 ARKKEENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKEN 443

Query: 144 EEEEEKEKVE 153
           + E +++K  
Sbjct: 444 KNEFKEKKES 453



 Score = 32.7 bits (75), Expect = 0.080
 Identities = 23/76 (30%), Positives = 37/76 (48%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
              E + E KEK+E +EEEE E E++   EK   K  +  EK + ++EEE  +EE     
Sbjct: 439 LKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDEENPWLK 498

Query: 139 KKKEEEEEEEKEKVEM 154
                 +  +K+  + 
Sbjct: 499 TTSSVGKSAKKQDSKK 514



 Score = 32.7 bits (75), Expect = 0.083
 Identities = 23/79 (29%), Positives = 44/79 (55%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
           E     E    E E EEE +EEE E+   ++  +++   E  E++ + KK ++  + E +
Sbjct: 389 ENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFK 448

Query: 136 QKNKKKEEEEEEEKEKVEM 154
           +K +  EEEE E++E+ ++
Sbjct: 449 EKKESDEEEELEDEEEAKV 467



 Score = 30.0 bits (68), Expect = 0.60
 Identities = 18/71 (25%), Positives = 42/71 (59%)

Query: 82  EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
           E  E+E E ++ ++E + E ++ ++ +++ E ++EEE K +    + ++  E+ QK +++
Sbjct: 428 ENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEE 487

Query: 142 EEEEEEEKEKV 152
           EE +EE     
Sbjct: 488 EELDEENPWLK 498



 Score = 30.0 bits (68), Expect = 0.66
 Identities = 19/67 (28%), Positives = 41/67 (61%)

Query: 87  EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEE 146
            +  E+E E ++ +KE   + ++K+E  EEEE + +++ + E    +  ++++K ++EEE
Sbjct: 427 PENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEE 486

Query: 147 EEKEKVE 153
           EE+   E
Sbjct: 487 EEELDEE 493



 Score = 29.6 bits (67), Expect = 0.83
 Identities = 24/71 (33%), Positives = 44/71 (61%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           E E +E+ +EEE EE  K+     +   E  E+E E +K KKE + E +E+++ ++++E 
Sbjct: 400 ELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEEL 459

Query: 144 EEEEEKEKVEM 154
           E+EEE +  ++
Sbjct: 460 EDEEEAKVEKV 470



 Score = 28.9 bits (65), Expect = 1.6
 Identities = 18/71 (25%), Positives = 41/71 (57%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
            EEE ++EE EE  ++   + +   +  EK+ E ++ +K+ K E  E++E +++ + ++E
Sbjct: 403 GEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDE 462

Query: 144 EEEEEKEKVEM 154
           EE + ++    
Sbjct: 463 EEAKVEKVANK 473



 Score = 28.1 bits (63), Expect = 2.7
 Identities = 19/74 (25%), Positives = 38/74 (51%), Gaps = 1/74 (1%)

Query: 81  NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
           +  EE+E E  + EEE+E++ E D+D E       ++  K K+ +++E        K  +
Sbjct: 322 SVSEEDEDEDSDSEEEDEDDDEDDDDGENPW-MLRKKLGKLKEGEDDEENSGLLSMKFMQ 380

Query: 141 KEEEEEEEKEKVEM 154
           + E  ++E+   E+
Sbjct: 381 RAEARKKEENDAEI 394



 Score = 27.3 bits (61), Expect = 4.1
 Identities = 20/75 (26%), Positives = 44/75 (58%)

Query: 78  LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
           LE  E  +EE+ +E  ++     K   E+ EK+ E K+ ++E + + KE++  +EEEE +
Sbjct: 401 LEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELE 460

Query: 138 NKKKEEEEEEEKEKV 152
           ++++ + E+   + +
Sbjct: 461 DEEEAKVEKVANKLL 475



 Score = 26.6 bits (59), Expect = 8.5
 Identities = 17/66 (25%), Positives = 33/66 (50%)

Query: 88  KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
           ++K  + +E E+++E       K  ++ E  +K+++  E E    E E + +  EEE EE
Sbjct: 355 RKKLGKLKEGEDDEENSGLLSMKFMQRAEARKKEENDAEIEELRRELEGEEESDEEENEE 414

Query: 148 EKEKVE 153
             +K  
Sbjct: 415 PSKKNV 420


>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 142

 Score = 32.7 bits (75), Expect = 0.032
 Identities = 22/74 (29%), Positives = 53/74 (71%)

Query: 78  LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
           L + +W++E +++E +++ EE++++ +E   KKR K++++++K+K KK+ +   ++EE++
Sbjct: 54  LMDEKWKKETEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKE 113

Query: 138 NKKKEEEEEEEKEK 151
             K  EE  +E+E+
Sbjct: 114 GSKSSEESSDEEEE 127


>gnl|CDD|237047 PRK12298, obgE, GTPase CgtA; Reviewed.
          Length = 390

 Score = 33.7 bits (78), Expect = 0.033
 Identities = 18/51 (35%), Positives = 31/51 (60%)

Query: 86  EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQ 136
           EE  +EE EE E  EK +   ++  RE+ EE EE+  D  +++ +E+++E 
Sbjct: 333 EENPREEAEEAEAPEKVEFMWDDYHREQLEEVEEEDDDDWDDDWDEDDDEG 383


>gnl|CDD|203444 pfam06424, PRP1_N, PRP1 splicing factor, N-terminal.  This domain
           is specific to the N-terminal part of the prp1 splicing
           factor, which is involved in mRNA splicing (and possibly
           also poly(A)+ RNA nuclear export and cell cycle
           progression). This domain is specific to the N terminus
           of the RNA splicing factor encoded by prp1. It is
           involved in mRNA splicing and possibly also poly(A)and
           RNA nuclear export and cell cycle progression.
          Length = 131

 Score = 32.7 bits (75), Expect = 0.036
 Identities = 17/72 (23%), Positives = 44/72 (61%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
            +  + ++EE+E+ +  +  ++E    + K ++E+++ D+  E ++E  +E++ K++E++
Sbjct: 26  RDGVDIDDEEDEDPKRYQDGDNEGLFSDGKYDDEDEEADRIYESIDERMDERRKKRREQK 85

Query: 145 EEEEKEKVEMYR 156
           E+EE EK     
Sbjct: 86  EKEEIEKYREEN 97


>gnl|CDD|237799 PRK14715, PRK14715, DNA polymerase II large subunit; Provisional.
          Length = 1627

 Score = 33.7 bits (77), Expect = 0.037
 Identities = 15/41 (36%), Positives = 30/41 (73%)

Query: 78  LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEE 118
           +E  +W +E KEK+EE++EE+ E+ K E+ +++ E++E+  
Sbjct: 270 IEGWDWLKELKEKKEEKDEEKSEEVKTEEVDEEFEEEEKGF 310



 Score = 28.3 bits (63), Expect = 2.3
 Identities = 17/47 (36%), Positives = 29/47 (61%)

Query: 111 REKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYRN 157
           +E KE++EEK ++K EE   EE +E+  ++++    E  EKV +  N
Sbjct: 277 KELKEKKEEKDEEKSEEVKTEEVDEEFEEEEKGFYYELYEKVNIEAN 323



 Score = 27.9 bits (62), Expect = 3.3
 Identities = 17/44 (38%), Positives = 25/44 (56%)

Query: 108 EKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
           EKK EK EE+ E+ K ++ +E  EEEE+    +  E+   E  K
Sbjct: 281 EKKEEKDEEKSEEVKTEEVDEEFEEEEKGFYYELYEKVNIEANK 324



 Score = 26.7 bits (59), Expect = 7.8
 Identities = 13/46 (28%), Positives = 27/46 (58%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
           E++E++ EE + EE +E+ ++E++    E  E+   +   K  +EV
Sbjct: 285 EKDEEKSEEVKTEEVDEEFEEEEKGFYYELYEKVNIEANKKFIKEV 330


>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.  This
           domain is found to the N-terminus of bacterial signal
           peptidases of the S49 family (pfam01343).
          Length = 154

 Score = 32.9 bits (76), Expect = 0.037
 Identities = 13/34 (38%), Positives = 21/34 (61%)

Query: 104 DEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
           D+ E K  EK E++ EK K K E++  ++EE + 
Sbjct: 64  DKKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKP 97



 Score = 30.6 bits (70), Expect = 0.23
 Identities = 19/48 (39%), Positives = 28/48 (58%)

Query: 81  NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
           NE  ++ KE  E    +++E +  E  EKK EK + + EK+K KKEE 
Sbjct: 48  NEEYKDLKESLEAALLDKKELKAWEKAEKKAEKAKAKAEKKKAKKEEP 95



 Score = 27.5 bits (62), Expect = 2.2
 Identities = 9/34 (26%), Positives = 20/34 (58%)

Query: 106 DEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
            +E K  +K E++ ++   K E+ + ++EE K +
Sbjct: 65  KKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKPR 98



 Score = 26.7 bits (60), Expect = 4.3
 Identities = 15/53 (28%), Positives = 31/53 (58%), Gaps = 2/53 (3%)

Query: 100 EKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKV 152
           E+ KD  E  +    +++E K  +K E+  + E+ + K +KK+ ++EE K ++
Sbjct: 49  EEYKDLKESLEAALLDKKELKAWEKAEK--KAEKAKAKAEKKKAKKEEPKPRL 99


>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein.  This
           family includes the radial spoke head proteins RSP4 and
           RSP6 from Chlamydomonas reinhardtii, and several
           eukaryotic homologues, including mammalian RSHL1, the
           protein product of a familial ciliary dyskinesia
           candidate gene.
          Length = 481

 Score = 33.5 bits (77), Expect = 0.038
 Identities = 25/85 (29%), Positives = 44/85 (51%), Gaps = 15/85 (17%)

Query: 82  EWEEEEKEKEEEEEEEEEEKEKDEDEEKKRE---------------KKEEEEEKQKDKKE 126
           E EEEE+E+EE  E +E E+  D +  + RE                 +        +++
Sbjct: 288 EEEEEEEEEEEPAERDELEENPDFEGLEVRELADSLDNWVHHVQHILPQGRCTWVNPEQK 347

Query: 127 EEVEEEEEEQKNKKKEEEEEEEKEK 151
           +E EE+E+E++ +++EE EE E E+
Sbjct: 348 DEEEEQEDEEEEEEEEEPEEPEPEE 372



 Score = 32.3 bits (74), Expect = 0.092
 Identities = 23/89 (25%), Positives = 42/89 (47%), Gaps = 27/89 (30%)

Query: 92  EEEEEEEEEKEKDEDEEKKREKKEEEEEKQKD---------------------------K 124
            EEEEEEEE+E+  + ++  E  + E  + ++                           +
Sbjct: 287 LEEEEEEEEEEEPAERDELEENPDFEGLEVRELADSLDNWVHHVQHILPQGRCTWVNPEQ 346

Query: 125 KEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
           K+EE E+E+EE++ +++E EE E +E   
Sbjct: 347 KDEEEEQEDEEEEEEEEEPEEPEPEEGPP 375



 Score = 28.5 bits (64), Expect = 1.6
 Identities = 14/28 (50%), Positives = 23/28 (82%)

Query: 81  NEWEEEEKEKEEEEEEEEEEKEKDEDEE 108
           N  +++E+E++E+EEEEEEE+E +E E 
Sbjct: 343 NPEQKDEEEEQEDEEEEEEEEEPEEPEP 370



 Score = 27.7 bits (62), Expect = 3.0
 Identities = 18/37 (48%), Positives = 27/37 (72%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEK 120
           E EE+E+EEE  E +EE E +E+EE++ E+  +EEE 
Sbjct: 169 EGEEEEEEEEVGEADEEDEGEEEEEEEPEEVPKEEEG 205



 Score = 27.7 bits (62), Expect = 3.6
 Identities = 15/33 (45%), Positives = 25/33 (75%)

Query: 80  NNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKRE 112
           N E ++EE+E+E+EEEEEEEE+ ++ + E+   
Sbjct: 343 NPEQKDEEEEQEDEEEEEEEEEPEEPEPEEGPP 375



 Score = 26.9 bits (60), Expect = 5.3
 Identities = 21/43 (48%), Positives = 29/43 (67%)

Query: 87  EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
           E E  E EEEEEEE+  + DEE + E++EEEE ++  K+EE  
Sbjct: 164 EVELREGEEEEEEEEVGEADEEDEGEEEEEEEPEEVPKEEEGT 206


>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413).  This is
           a family of proteins conserved in fungi. The function is
           not known.
          Length = 436

 Score = 33.6 bits (77), Expect = 0.040
 Identities = 16/75 (21%), Positives = 27/75 (36%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
           + LE +E  +  K+ +E          K   +  +      EE+  K +K  E       
Sbjct: 39  DELEQSEKAKPPKKPKEASRPGTPRNPKKSSKPTESSAASSEEKPAKPRKSAESTRSSHP 98

Query: 136 QKNKKKEEEEEEEKE 150
           +      E EEEE+ 
Sbjct: 99  KSKAPSTESEEEEEP 113


>gnl|CDD|211344 cd02867, PseudoU_synth_TruB_4, Pseudouridine synthase homolog 4.
           This group consists of Eukaryotic TruB proteins similar
           to Saccharomyces cerevisiae Pus4. S. cerevisiae Pus4,
           makes psi55 in the T loop of both cytoplasmic and
           mitochondrial tRNAs. Psi55 is almost universally
           conserved.  Pseudouridine synthases catalyze the
           isomerization of specific uridines in an RNA molecule to
           pseudouridines (5-ribosyluracil, psi).
          Length = 312

 Score = 33.2 bits (76), Expect = 0.040
 Identities = 29/103 (28%), Positives = 40/103 (38%), Gaps = 9/103 (8%)

Query: 56  KPIRANARLTVILPQAISASET----LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKR 111
           K +   AR    LP+ I   +     L   +W E         EEE ++ E+   +   +
Sbjct: 152 KRLYEYAREGKPLPRPIERRQVVVSELLVKDWIEPGPLFTRTVEEEGKQYERSVVKMLGK 211

Query: 112 EKKEEEEEKQKDKK-----EEEVEEEEEEQKNKKKEEEEEEEK 149
           E K   E  +          EEVE   EE K K + EEE  EK
Sbjct: 212 ELKTFAEVTELTATAEGDPVEEVEATHEESKRKSEVEEEANEK 254


>gnl|CDD|143165 cd00096, Ig, Immunoglobulin domain.  Ig: immunoglobulin (Ig)
          domain found in the Ig superfamily. The Ig superfamily
          is a heterogenous group of proteins, built on a common
          fold comprised of a sandwich of two beta sheets.
          Members of this group are components of immunoglobulin,
          neuroglia, cell surface glycoproteins, such as, T-cell
          receptors, CD2, CD4, CD8, and membrane glycoproteins,
          such as, butyrophilin and chondroitin sulfate
          proteoglycan core protein. A predominant feature of
          most Ig domains is a disulfide bridge connecting the
          two beta-sheets with a tryptophan residue packed
          against the disulfide bond.
          Length = 74

 Score = 31.7 bits (71), Expect = 0.040
 Identities = 9/34 (26%), Positives = 12/34 (35%)

Query: 16 FPSYMRVGDSSHGVYNLRIRNVSLTDDAEFQCQV 49
                   +S G   L I NV+L D   + C  
Sbjct: 29 LTRVRSSRGTSSGSSTLTISNVTLEDSGTYTCVA 62


>gnl|CDD|189762 pfam00901, Orbi_VP5, Orbivirus outer capsid protein VP5.
           cryoelectron microscopy indicates that VP5 is a trimer
           implying that there are 360 copies of VP5 per virion.
          Length = 507

 Score = 33.5 bits (77), Expect = 0.041
 Identities = 18/45 (40%), Positives = 29/45 (64%)

Query: 92  EEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQ 136
           E E++EEE +EK   +  ++  ++ EE  +  K E +VEEEEE+Q
Sbjct: 97  EREQKEEEVREKHNKKIIEKFGEDLEEVYKFMKGEAKVEEEEEKQ 141



 Score = 32.3 bits (74), Expect = 0.088
 Identities = 29/97 (29%), Positives = 50/97 (51%), Gaps = 15/97 (15%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDE--DEEKKREKKEEEEEKQKDKKE------E 127
           + LE  + EEE +EK  ++  E+  ++ +E     K   K EEEEEKQ +  E       
Sbjct: 94  KELEREQKEEEVREKHNKKIIEKFGEDLEEVYKFMKGEAKVEEEEEKQMEILEKALKSYL 153

Query: 128 EVEEEEEEQKNK-----KKEEEE--EEEKEKVEMYRN 157
           ++ +EE +   +     +KE EE  ++E + +E YR+
Sbjct: 154 KIVKEENKSLQRLAKALQKESEERTQDETKMIEEYRD 190


>gnl|CDD|162338 TIGR01398, FlhA, flagellar biosynthesis protein FlhA.  This model
           describes flagellar biosynthesis protein FlhA, one of a
           large number of genes associated with the biosynthesis
           of functional bacterial flagella. Homologs of many such
           proteins, including FlhA, function in type III protein
           secretion systems. A separate model describes InvA
           (Salmonella enterica), LcrD (Yersinia enterocolitica),
           HrcV (Xanthomonas), etc., all of which score below the
           noise cutoff for this model [Cellular processes,
           Chemotaxis and motility].
          Length = 678

 Score = 33.4 bits (77), Expect = 0.041
 Identities = 13/27 (48%), Positives = 18/27 (66%)

Query: 109 KKREKKEEEEEKQKDKKEEEVEEEEEE 135
           ++R K+EEE   +  K +EE  EEEEE
Sbjct: 310 RRRSKQEEEAAAEAAKAQEEAAEEEEE 336



 Score = 28.8 bits (65), Expect = 1.5
 Identities = 10/31 (32%), Positives = 18/31 (58%)

Query: 120 KQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
           +++ K+EEE   E  + + +  EEEEE   +
Sbjct: 310 RRRSKQEEEAAAEAAKAQEEAAEEEEESIND 340



 Score = 28.4 bits (64), Expect = 2.1
 Identities = 11/29 (37%), Positives = 16/29 (55%)

Query: 109 KKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
            +R  K+EEE   +  K +E   EEEE+ 
Sbjct: 309 LRRRSKQEEEAAAEAAKAQEEAAEEEEES 337



 Score = 28.0 bits (63), Expect = 2.6
 Identities = 10/30 (33%), Positives = 17/30 (56%)

Query: 124 KKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
           ++  + EEE   +  K +EE  EEE+E + 
Sbjct: 310 RRRSKQEEEAAAEAAKAQEEAAEEEEESIN 339



 Score = 26.8 bits (60), Expect = 7.1
 Identities = 8/27 (29%), Positives = 14/27 (51%)

Query: 128 EVEEEEEEQKNKKKEEEEEEEKEKVEM 154
             ++EEE      K +EE  E+E+  +
Sbjct: 312 RSKQEEEAAAEAAKAQEEAAEEEEESI 338


>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
          Length = 330

 Score = 33.3 bits (77), Expect = 0.042
 Identities = 13/46 (28%), Positives = 27/46 (58%)

Query: 74  ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEE 119
           A+   + +  +EE KE    + +    +E++E+EE++ E++  EEE
Sbjct: 275 AAALADKDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEEE 320



 Score = 31.4 bits (72), Expect = 0.18
 Identities = 18/37 (48%), Positives = 22/37 (59%)

Query: 64  LTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEE 100
           L   L + +SA       E EEEE+E+EEEEE  EEE
Sbjct: 284 LDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEEE 320



 Score = 31.0 bits (71), Expect = 0.21
 Identities = 14/41 (34%), Positives = 27/41 (65%)

Query: 92  EEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
           +++  +EE KE    + +    +EEEEE++++++EE  EEE
Sbjct: 280 DKDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEEE 320



 Score = 30.6 bits (70), Expect = 0.36
 Identities = 14/40 (35%), Positives = 22/40 (55%)

Query: 109 KKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEE 148
            K    EE +E    + +    EEEEE++ +++EEE  EE
Sbjct: 280 DKDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEE 319



 Score = 29.4 bits (67), Expect = 0.83
 Identities = 11/38 (28%), Positives = 22/38 (57%)

Query: 113 KKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
            K+  +E+ K+    + +    E++ +++EEEEEEE  
Sbjct: 280 DKDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPS 317



 Score = 28.3 bits (64), Expect = 1.7
 Identities = 14/41 (34%), Positives = 23/41 (56%)

Query: 106 DEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEE 146
           D++   E+ +E    Q      E EEEEEE++ +++  EEE
Sbjct: 280 DKDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEEE 320



 Score = 27.5 bits (62), Expect = 3.0
 Identities = 14/42 (33%), Positives = 22/42 (52%)

Query: 104 DEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
            + +    E KE    + +    EE EEEEEE++ ++  EEE
Sbjct: 279 ADKDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEEE 320



 Score = 27.5 bits (62), Expect = 3.3
 Identities = 12/34 (35%), Positives = 21/34 (61%)

Query: 117 EEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
           EE K+    + +    EEE++ +++EEEEE  +E
Sbjct: 286 EELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEE 319



 Score = 27.5 bits (62), Expect = 3.6
 Identities = 15/44 (34%), Positives = 26/44 (59%), Gaps = 3/44 (6%)

Query: 90  KEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
            ++  +EE +E    + +    E++EEEEE   +++EEE  EEE
Sbjct: 280 DKDALDEELKEVLSAQAQAAAAEEEEEEEE---EEEEEEPSEEE 320


>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6.  In yeast, 15 Apg proteins
           coordinate the formation of autophagosomes. Autophagy is
           a bulk degradation process induced by starvation in
           eukaryotic cells. Apg6/Vps30p has two distinct functions
           in the autophagic process, either associated with the
           membrane or in a retrieval step of the carboxypeptidase
           Y sorting pathway.
          Length = 356

 Score = 33.3 bits (76), Expect = 0.043
 Identities = 17/77 (22%), Positives = 40/77 (51%)

Query: 75  SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
           S+ +E + +E  + E +E ++EEE   ++ E+ EK+ +  + E  + +++KE+   EE +
Sbjct: 57  SQNVEISNYEALDSELDELKKEEERLLDELEELEKEDDDLDGELVELQEEKEQLENEELQ 116

Query: 135 EQKNKKKEEEEEEEKEK 151
             +     +    + E 
Sbjct: 117 YLREYNLFDRNNLQLED 133



 Score = 32.9 bits (75), Expect = 0.051
 Identities = 21/83 (25%), Positives = 38/83 (45%)

Query: 67  ILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE 126
            L          E  + E +E +KEEE   +E E+ + ED++   E  E +EEK++ + E
Sbjct: 54  KLESQNVEISNYEALDSELDELKKEEERLLDELEELEKEDDDLDGELVELQEEKEQLENE 113

Query: 127 EEVEEEEEEQKNKKKEEEEEEEK 149
           E     E    ++   + E+  +
Sbjct: 114 ELQYLREYNLFDRNNLQLEDNLQ 136



 Score = 31.0 bits (70), Expect = 0.22
 Identities = 16/70 (22%), Positives = 41/70 (58%), Gaps = 1/70 (1%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEE-EEEEQKNKKKEE 143
           E+E++  ++   + E +  +    +  + + +E +K++++  +E+EE E+E+     +  
Sbjct: 42  EKERDTYKQYLSKLESQNVEISNYEALDSELDELKKEEERLLDELEELEKEDDDLDGELV 101

Query: 144 EEEEEKEKVE 153
           E +EEKE++E
Sbjct: 102 ELQEEKEQLE 111



 Score = 31.0 bits (70), Expect = 0.26
 Identities = 24/84 (28%), Positives = 40/84 (47%), Gaps = 5/84 (5%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDE---DEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
            + E +  E    E  + E DE   +EE+  ++ EE E++  D   E VE +EE  K + 
Sbjct: 53  SKLESQNVEISNYEALDSELDELKKEEERLLDELEELEKEDDDLDGELVELQEE--KEQL 110

Query: 141 KEEEEEEEKEKVEMYRNPVPMYRN 164
           + EE +  +E     RN + +  N
Sbjct: 111 ENEELQYLREYNLFDRNNLQLEDN 134



 Score = 28.3 bits (63), Expect = 1.8
 Identities = 18/82 (21%), Positives = 37/82 (45%), Gaps = 3/82 (3%)

Query: 78  LENNEWEEEEKEKEEEEEEEEEEKEKD---EDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
           L+  + EEE    E EE E+E++       E +E+K + + EE +  ++    +    + 
Sbjct: 72  LDELKKEEERLLDELEELEKEDDDLDGELVELQEEKEQLENEELQYLREYNLFDRNNLQL 131

Query: 135 EQKNKKKEEEEEEEKEKVEMYR 156
           E   +  E + E    +++  R
Sbjct: 132 EDNLQSLELQYEYSLNQLDKLR 153



 Score = 27.9 bits (62), Expect = 2.7
 Identities = 17/89 (19%), Positives = 36/89 (40%), Gaps = 8/89 (8%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           E  + E  + E+E +  K+     E +  +    E    +  E + EEE    + ++ E+
Sbjct: 32  ERLDSELRDAEKERDTYKQYLSKLESQNVEISNYEALDSELDELKKEEERLLDELEELEK 91

Query: 144 EE--------EEEKEKVEMYRNPVPMYRN 164
           E+        E ++EK ++    +   R 
Sbjct: 92  EDDDLDGELVELQEEKEQLENEELQYLRE 120



 Score = 27.9 bits (62), Expect = 2.9
 Identities = 14/70 (20%), Positives = 38/70 (54%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
           E ++ K+EEE   +E +E +++++    +  E +E+++  + EE++   E     +   +
Sbjct: 71  ELDELKKEEERLLDELEELEKEDDDLDGELVELQEEKEQLENEELQYLREYNLFDRNNLQ 130

Query: 145 EEEEKEKVEM 154
            E+  + +E+
Sbjct: 131 LEDNLQSLEL 140


>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota).  DNA
           Topoisomerase I (eukaryota), DNA topoisomerase V,
           Vaccina virus topoisomerase, Variola virus
           topoisomerase, Shope fibroma virus topoisomeras.
          Length = 391

 Score = 33.1 bits (76), Expect = 0.045
 Identities = 17/71 (23%), Positives = 37/71 (52%)

Query: 83  WEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKE 142
              ++     E   + + K K + E    +   E +EK+K+KK+EE ++++ E+  ++ E
Sbjct: 294 KRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEERIE 353

Query: 143 EEEEEEKEKVE 153
           + E +  +K E
Sbjct: 354 KLEVQATDKEE 364



 Score = 32.3 bits (74), Expect = 0.087
 Identities = 24/87 (27%), Positives = 46/87 (52%), Gaps = 3/87 (3%)

Query: 59  RANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEE 118
           +   ++  +  Q     + +   E   + K K + + E + EK   E +EKK+EKK+EE+
Sbjct: 281 KLQEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEK 340

Query: 119 EKQKDKKEEEVEEEEEEQKNKKKEEEE 145
           +K   K+ E +EE  E+ + +  ++EE
Sbjct: 341 KK---KQIERLEERIEKLEVQATDKEE 364



 Score = 32.3 bits (74), Expect = 0.092
 Identities = 20/70 (28%), Positives = 36/70 (51%), Gaps = 1/70 (1%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
             + + K  ++     E   D   + K  K E + EK   + +E+ +E+++E+K KK+ E
Sbjct: 288 ALKYQLKRLKKMILLFEMISDLKRKLKS-KFERDNEKLDAEVKEKKKEKKKEEKKKKQIE 346

Query: 144 EEEEEKEKVE 153
             EE  EK+E
Sbjct: 347 RLEERIEKLE 356



 Score = 31.6 bits (72), Expect = 0.17
 Identities = 15/73 (20%), Positives = 37/73 (50%), Gaps = 1/73 (1%)

Query: 82  EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
             + + K  ++     E   +     + K E+  E+ + +  +K++E ++EE+++K + +
Sbjct: 288 ALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKK-QIE 346

Query: 142 EEEEEEEKEKVEM 154
             EE  EK +V+ 
Sbjct: 347 RLEERIEKLEVQA 359



 Score = 29.6 bits (67), Expect = 0.71
 Identities = 16/65 (24%), Positives = 39/65 (60%)

Query: 87  EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEE 146
             E   + + + + K + ++E+   E KE+++EK+K++K+++  E  EE+  K + +  +
Sbjct: 302 LFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEERIEKLEVQATD 361

Query: 147 EEKEK 151
           +E+ K
Sbjct: 362 KEENK 366


>gnl|CDD|240370 PTZ00342, PTZ00342, acyl-CoA synthetase; Provisional.
          Length = 746

 Score = 33.5 bits (77), Expect = 0.045
 Identities = 13/56 (23%), Positives = 24/56 (42%)

Query: 101 KEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYR 156
           KE + ++E+K           K+ KEE+   +   +         E +KEK+E  +
Sbjct: 214 KEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEYDKEKLEKIK 269



 Score = 30.5 bits (69), Expect = 0.40
 Identities = 16/68 (23%), Positives = 32/68 (47%), Gaps = 1/68 (1%)

Query: 88  KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
           K KE    +EE+    + +    +  KEE++      + E++     E  +K+K E+ ++
Sbjct: 212 KSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEY-DKEKLEKIKD 270

Query: 148 EKEKVEMY 155
            KEK +  
Sbjct: 271 LKEKAKKL 278



 Score = 27.8 bits (62), Expect = 3.7
 Identities = 15/73 (20%), Positives = 33/73 (45%), Gaps = 3/73 (4%)

Query: 66  VILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREK---KEEEEEKQK 122
           +IL   I + E   N E +           +  +E++K  D   + E       E +K+K
Sbjct: 205 IILDTLIKSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEYDKEK 264

Query: 123 DKKEEEVEEEEEE 135
            +K ++++E+ ++
Sbjct: 265 LEKIKDLKEKAKK 277


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 33.4 bits (76), Expect = 0.045
 Identities = 17/82 (20%), Positives = 41/82 (50%)

Query: 72  ISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEE 131
           IS S    N+  E E+K   E   + E E+++D +E ++ + +EE    +K KK ++++ 
Sbjct: 287 ISDSSASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKG 346

Query: 132 EEEEQKNKKKEEEEEEEKEKVE 153
           ++        +  ++ +   ++
Sbjct: 347 KKNGLDKDDSDSGDDSDDSDID 368



 Score = 32.2 bits (73), Expect = 0.092
 Identities = 17/76 (22%), Positives = 41/76 (53%), Gaps = 2/76 (2%)

Query: 78  LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKK--EEEEEKQKDKKEEEVEEEEEE 135
            + +E++ ++ + E  EE+   +     ++ ++RE K   E   K + +++E+ EE EEE
Sbjct: 267 DDADEYDSDDGDDEGREEDYISDSSASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEE 326

Query: 136 QKNKKKEEEEEEEKEK 151
           +  ++    ++ +K K
Sbjct: 327 KNEEEGGLSKKGKKLK 342



 Score = 31.1 bits (70), Expect = 0.26
 Identities = 15/69 (21%), Positives = 37/69 (53%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           E EE++ EEE    ++ K+  + + KK    +++ +   D  + +++ E+       K++
Sbjct: 322 ESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSGDDSDDSDIDGEDSVSLVTAKKQ 381

Query: 144 EEEEEKEKV 152
           +E +++E V
Sbjct: 382 KEPKKEEPV 390



 Score = 28.4 bits (63), Expect = 2.0
 Identities = 19/80 (23%), Positives = 43/80 (53%)

Query: 72  ISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEE 131
               E+ +  E  +EEK K+++++  + +K+ D+D++ KR   ++ +E   D  ++E  E
Sbjct: 224 DDGDESDKGGEDGDEEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDADEYDSDDGDDEGRE 283

Query: 132 EEEEQKNKKKEEEEEEEKEK 151
           E+    +     + EE ++K
Sbjct: 284 EDYISDSSASGNDPEEREDK 303



 Score = 27.6 bits (61), Expect = 3.6
 Identities = 13/70 (18%), Positives = 34/70 (48%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           +++  E + ++ ++E  +E    +        EE E +   +     E E+++ +++ EE
Sbjct: 266 DDDADEYDSDDGDDEGREEDYISDSSASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEE 325

Query: 144 EEEEEKEKVE 153
           E+ EE+  + 
Sbjct: 326 EKNEEEGGLS 335


>gnl|CDD|237622 PRK14140, PRK14140, heat shock protein GrpE; Provisional.
          Length = 191

 Score = 32.7 bits (75), Expect = 0.046
 Identities = 18/56 (32%), Positives = 31/56 (55%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
           + E+ E+E EE E EE  E + +EE   E+ E E   ++  K  E+E + +E + +
Sbjct: 4   KNEQVEEEVEETEVEEAVEDEVEEETVEEESEAELLDEEQAKIAELEAKLDELEER 59



 Score = 28.8 bits (65), Expect = 1.0
 Identities = 16/60 (26%), Positives = 36/60 (60%)

Query: 91  EEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
            EE+ E+ EE+ ++ + E+  E + EEE  +++ + E ++EE+ +    + + +E EE+ 
Sbjct: 1   MEEKNEQVEEEVEETEVEEAVEDEVEEETVEEESEAELLDEEQAKIAELEAKLDELEERY 60



 Score = 26.1 bits (58), Expect = 8.9
 Identities = 21/77 (27%), Positives = 41/77 (53%)

Query: 73  SASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
              E +E  E EE  +++ EEE  EEE + +  DEE+ +  + E +  + +++   ++ +
Sbjct: 7   QVEEEVEETEVEEAVEDEVEEETVEEESEAELLDEEQAKIAELEAKLDELEERYLRLQAD 66

Query: 133 EEEQKNKKKEEEEEEEK 149
            E  K + ++E E  EK
Sbjct: 67  FENYKRRIQKENEAAEK 83


>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32.  This family
           consists of several mammalian specific proacrosin
           binding protein sp32 sequences. sp32 is a sperm specific
           protein which is known to bind with with 55- and 53-kDa
           proacrosins and the 49-kDa acrosin intermediate. The
           exact function of sp32 is unclear, it is thought however
           that the binding of sp32 to proacrosin may be involved
           in packaging the acrosin zymogen into the acrosomal
           matrix.
          Length = 243

 Score = 32.7 bits (74), Expect = 0.052
 Identities = 18/79 (22%), Positives = 39/79 (49%)

Query: 72  ISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEE 131
           I+   T+  N+  +   E+     EE  +         + +  + ++E+   K +E ++E
Sbjct: 153 IAEHPTITENQSFQPWPERLHNNVEELLQSSLSLGGSVQVKAPKPKQEQLLSKLQEYLQE 212

Query: 132 EEEEQKNKKKEEEEEEEKE 150
            + E+K  ++E+EEEE +E
Sbjct: 213 HKTEEKQPQEEQEEEEVEE 231



 Score = 30.8 bits (69), Expect = 0.23
 Identities = 16/73 (21%), Positives = 37/73 (50%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
           E L NN  E  +         + +  +  +++   + ++  +E K ++K+ +E +EEEE 
Sbjct: 170 ERLHNNVEELLQSSLSLGGSVQVKAPKPKQEQLLSKLQEYLQEHKTEEKQPQEEQEEEEV 229

Query: 136 QKNKKKEEEEEEE 148
           ++  K+EE +  +
Sbjct: 230 EEEAKQEEGQGTD 242



 Score = 26.5 bits (58), Expect = 6.8
 Identities = 15/73 (20%), Positives = 42/73 (57%)

Query: 63  RLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK 122
           RL   + + + +S +L  +   +  K K+E+   + +E  ++   E+K+ ++E+EEE+ +
Sbjct: 171 RLHNNVEELLQSSLSLGGSVQVKAPKPKQEQLLSKLQEYLQEHKTEEKQPQEEQEEEEVE 230

Query: 123 DKKEEEVEEEEEE 135
           ++ ++E  +  ++
Sbjct: 231 EEAKQEEGQGTDD 243


>gnl|CDD|224188 COG1269, NtpI, Archaeal/vacuolar-type H+-ATPase subunit I [Energy
           production and conversion].
          Length = 660

 Score = 33.1 bits (76), Expect = 0.052
 Identities = 20/70 (28%), Positives = 27/70 (38%), Gaps = 4/70 (5%)

Query: 88  KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV----EEEEEEQKNKKKEE 143
           K  E       EE EK E E K  E+  +  EK   + EE      E   E  +  +  E
Sbjct: 74  KGLEGRLFILPEEVEKLEAELKSLEEVIKPAEKFSSEVEELTRKLEERLSELDEELEDLE 133

Query: 144 EEEEEKEKVE 153
           +  EE E + 
Sbjct: 134 DLLEELEPLA 143



 Score = 30.8 bits (70), Expect = 0.31
 Identities = 15/68 (22%), Positives = 26/68 (38%), Gaps = 2/68 (2%)

Query: 89  EKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEE 148
           E  E  E +    E   + E+   + ++E E  + + E   E+  EE       E  E E
Sbjct: 216 ELYEVPEFDGGPSELISELEEVIAEIQDELESLRSELEALAEKIAEELLA--VREILEIE 273

Query: 149 KEKVEMYR 156
           K   ++  
Sbjct: 274 KALGDVLS 281



 Score = 29.3 bits (66), Expect = 1.0
 Identities = 13/50 (26%), Positives = 19/50 (38%), Gaps = 3/50 (6%)

Query: 75  SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKK---REKKEEEEEKQ 121
           SE +   E    E + E E    E E   ++  E+    RE  E E+   
Sbjct: 228 SELISELEEVIAEIQDELESLRSELEALAEKIAEELLAVREILEIEKALG 277



 Score = 27.3 bits (61), Expect = 4.5
 Identities = 20/82 (24%), Positives = 32/82 (39%), Gaps = 14/82 (17%)

Query: 56  KPIRANARLTVILPQAISASETLEN--NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREK 113
           + ++       ILP+ +   E LE      EE  K  E+   E EE           R+ 
Sbjct: 71  RSVKGLEGRLFILPEEV---EKLEAELKSLEEVIKPAEKFSSEVEEL---------TRKL 118

Query: 114 KEEEEEKQKDKKEEEVEEEEEE 135
           +E   E  ++ ++ E   EE E
Sbjct: 119 EERLSELDEELEDLEDLLEELE 140


>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
           This family represents Cwf15/Cwc15 (from
           Schizosaccharomyces pombe and Saccharomyces cerevisiae
           respectively) and their homologues. The function of
           these proteins is unknown, but they form part of the
           spliceosome and are thus thought to be involved in mRNA
           splicing.
          Length = 241

 Score = 32.8 bits (75), Expect = 0.053
 Identities = 16/68 (23%), Positives = 39/68 (57%)

Query: 69  PQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
            +   A ++  +++ +  + + ++++ E+E      E E+ K+E+ EE+E ++++K  EE
Sbjct: 114 ARNSDADDSDSSSDSDSSDDDSDDDDSEDETAALLRELEKIKKERAEEKEREEEEKAAEE 173

Query: 129 VEEEEEEQ 136
            +  EEE 
Sbjct: 174 EKAREEEI 181



 Score = 30.5 bits (69), Expect = 0.30
 Identities = 20/75 (26%), Positives = 35/75 (46%), Gaps = 5/75 (6%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
            + +      + +  ++   D+D E +      E EK K ++ EE E EEEE+       
Sbjct: 117 SDADDSDSSSDSDSSDDDSDDDDSEDETAALLRELEKIKKERAEEKEREEEEKA-----A 171

Query: 144 EEEEEKEKVEMYRNP 158
           EEE+ +E+  +  NP
Sbjct: 172 EEEKAREEEILTGNP 186


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
           the C subunit of DNA polymerase delta. It carries the
           essential residues for binding to the Pol1 subunit of
           polymerase alpha, from residues 293-332, which are
           characterized by the motif D--G--VT, referred to as the
           DPIM motif. The first 160 residues of the protein form
           the minimal domain for binding to the B subunit, Cdc1,
           of polymerase delta, the final 10 C-terminal residues,
           362-372, being the DNA sliding clamp, PCNA, binding
           motif.
          Length = 427

 Score = 32.9 bits (75), Expect = 0.054
 Identities = 25/87 (28%), Positives = 45/87 (51%), Gaps = 7/87 (8%)

Query: 74  ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK-------DKKE 126
             E  E  E  E   ++E EEE  + +   +++  +     E+E+E +        D +E
Sbjct: 226 TKEKKEKKEASESTVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEE 285

Query: 127 EEVEEEEEEQKNKKKEEEEEEEKEKVE 153
           E  E+E+E++K  KK  E+E+E E++E
Sbjct: 286 ETEEKEKEKRKRLKKMMEDEDEDEEME 312



 Score = 32.5 bits (74), Expect = 0.085
 Identities = 29/87 (33%), Positives = 47/87 (54%), Gaps = 8/87 (9%)

Query: 76  ETLENNEWEEEEKEKE--------EEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEE 127
           E+ E    +E+E E E        + EEE EE++++     KK  + E+E+E+ +   E 
Sbjct: 258 ESAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKRKRLKKMMEDEDEDEEMEIVPES 317

Query: 128 EVEEEEEEQKNKKKEEEEEEEKEKVEM 154
            VEEEE E+       ++EEEKE+V +
Sbjct: 318 PVEEEESEEPEPPPLPKKEEEKEEVTV 344



 Score = 29.8 bits (67), Expect = 0.75
 Identities = 20/73 (27%), Positives = 38/73 (52%)

Query: 88  KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
           ++KE  E   +EE E++  +     + E  E    D+ E+E E +   +++  +EE EE+
Sbjct: 231 EKKEASESTVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETEEK 290

Query: 148 EKEKVEMYRNPVP 160
           EKEK +  +  + 
Sbjct: 291 EKEKRKRLKKMME 303



 Score = 29.8 bits (67), Expect = 0.75
 Identities = 17/73 (23%), Positives = 33/73 (45%)

Query: 75  SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
           SE          E E  E    +E+E E +     +R   EEE E+++ +K + +++  E
Sbjct: 244 SEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKRKRLKKMME 303

Query: 135 EQKNKKKEEEEEE 147
           ++   ++ E   E
Sbjct: 304 DEDEDEEMEIVPE 316



 Score = 29.0 bits (65), Expect = 1.3
 Identities = 20/74 (27%), Positives = 39/74 (52%)

Query: 81  NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
            E EE+EKEK +  ++  E++++DE+ E   E   EEEE ++ +     ++EEE+++   
Sbjct: 285 EETEEKEKEKRKRLKKMMEDEDEDEEMEIVPESPVEEEESEEPEPPPLPKKEEEKEEVTV 344

Query: 141 KEEEEEEEKEKVEM 154
             +       +  M
Sbjct: 345 SPDGGRRRGRRRVM 358



 Score = 29.0 bits (65), Expect = 1.3
 Identities = 24/82 (29%), Positives = 37/82 (45%)

Query: 73  SASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
           S  ET E  + + +  +K  E+E+E+EE E   +   + E+ EE E     KKEEE EE 
Sbjct: 283 SEEETEEKEKEKRKRLKKMMEDEDEDEEMEIVPESPVEEEESEEPEPPPLPKKEEEKEEV 342

Query: 133 EEEQKNKKKEEEEEEEKEKVEM 154
                  ++       K+K   
Sbjct: 343 TVSPDGGRRRGRRRVMKKKTFK 364


>gnl|CDD|215212 PLN02372, PLN02372, violaxanthin de-epoxidase.
          Length = 455

 Score = 32.9 bits (75), Expect = 0.055
 Identities = 29/81 (35%), Positives = 48/81 (59%), Gaps = 4/81 (4%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
             LE  E + EE EK   +E  + E+E +++ EK    KEEE   ++   EE ++E E++
Sbjct: 361 PLLERLEKDVEEGEKTIVKEARQIEEELEKEVEKLG--KEEESLFKRVALEEGLKELEQD 418

Query: 136 QKNKKKE--EEEEEEKEKVEM 154
           ++N  KE  +EE+E  EK++M
Sbjct: 419 EENFLKELSKEEKELLEKLKM 439



 Score = 32.9 bits (75), Expect = 0.058
 Identities = 24/70 (34%), Positives = 38/70 (54%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           EE EK   +E  + EEE EK+ ++  K E+   +    ++  +E  ++EE   K   KEE
Sbjct: 371 EEGEKTIVKEARQIEEELEKEVEKLGKEEESLFKRVALEEGLKELEQDEENFLKELSKEE 430

Query: 144 EEEEEKEKVE 153
           +E  EK K+E
Sbjct: 431 KELLEKLKME 440



 Score = 32.9 bits (75), Expect = 0.061
 Identities = 28/82 (34%), Positives = 44/82 (53%), Gaps = 4/82 (4%)

Query: 76  ETLENNEWEEEE---KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
           E LE +  E E+   KE  + EEE E+E EK   EE+   K+   EE  K+ +++E    
Sbjct: 364 ERLEKDVEEGEKTIVKEARQIEEELEKEVEKLGKEEESLFKRVALEEGLKELEQDEENFL 423

Query: 133 EEEQKNKKKEEEE-EEEKEKVE 153
           +E  K +K+  E+ + E  +VE
Sbjct: 424 KELSKEEKELLEKLKMEASEVE 445



 Score = 29.0 bits (65), Expect = 1.1
 Identities = 20/63 (31%), Positives = 34/63 (53%)

Query: 83  WEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKE 142
            EE EKE E+  +EEE   ++   EE  +E +++EE   K+  +EE E  E+ +    + 
Sbjct: 385 EEELEKEVEKLGKEEESLFKRVALEEGLKELEQDEENFLKELSKEEKELLEKLKMEASEV 444

Query: 143 EEE 145
           E+ 
Sbjct: 445 EKL 447


>gnl|CDD|240341 PTZ00272, PTZ00272, heat shock protein 83 kDa (Hsp83); Provisional.
          Length = 701

 Score = 33.1 bits (75), Expect = 0.058
 Identities = 21/55 (38%), Positives = 35/55 (63%)

Query: 87  EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
           EKE  +E+EE+ ++ ++D +E K  E KE +E K+K  K+ +   +E E +NK K
Sbjct: 212 EKEVTDEDEEDTKKADEDGEEPKVEEVKEGDEGKKKKTKKVKEVTKEYEVQNKHK 266



 Score = 30.8 bits (69), Expect = 0.38
 Identities = 22/65 (33%), Positives = 38/65 (58%), Gaps = 4/65 (6%)

Query: 95  EEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEM 154
           E+  E++  DEDEE  ++  E+ EE     K EEV+E +E +K K K+ +E  ++ +V+ 
Sbjct: 208 EKTTEKEVTDEDEEDTKKADEDGEE----PKVEEVKEGDEGKKKKTKKVKEVTKEYEVQN 263

Query: 155 YRNPV 159
              P+
Sbjct: 264 KHKPL 268


>gnl|CDD|227458 COG5129, MAK16, Nuclear protein with HMG-like acidic region
           [General function prediction only].
          Length = 303

 Score = 32.7 bits (74), Expect = 0.059
 Identities = 25/72 (34%), Positives = 42/72 (58%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
           E  + E+E   +EEEE + E E   D+ E++K +KK+ E+    D+  E  E EEEE   
Sbjct: 194 EKRQDEKERYVEEEEESDTELEAVTDDSEKEKTKKKDLEKWLGSDQSMETSESEEEESSE 253

Query: 139 KKKEEEEEEEKE 150
            + +E+E+E+ +
Sbjct: 254 SESDEDEDEDNK 265



 Score = 27.7 bits (61), Expect = 2.9
 Identities = 16/65 (24%), Positives = 34/65 (52%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           +  E  + EEEE  E E ++DEDE+ K + ++ + +  K  ++  +  E E+++  +K  
Sbjct: 239 QSMETSESEEEESSESESDEDEDEDNKGKIRKRKTDDAKKSRKPHIHIEYEQERENEKIP 298

Query: 144 EEEEE 148
             +  
Sbjct: 299 AVQHS 303


>gnl|CDD|215595 PLN03130, PLN03130, ABC transporter C family member; Provisional.
          Length = 1622

 Score = 33.2 bits (76), Expect = 0.059
 Identities = 14/52 (26%), Positives = 25/52 (48%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
           EE  +E  EEE+++   K          +K    ++K K+ K   +++EE E
Sbjct: 844 EEYVEENGEEEDDQTSSKPVANGNANNLKKDSSSKKKSKEGKSVLIKQEERE 895



 Score = 26.6 bits (59), Expect = 9.4
 Identities = 12/52 (23%), Positives = 22/52 (42%)

Query: 94  EEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
           EE  EE  E+++D+   +          K     + + +E +    K+EE E
Sbjct: 844 EEYVEENGEEEDDQTSSKPVANGNANNLKKDSSSKKKSKEGKSVLIKQEERE 895


>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987).  A
           family of uncharacterized proteins found by clustering
           human gut metagenomic sequences.
          Length = 379

 Score = 33.0 bits (76), Expect = 0.061
 Identities = 22/63 (34%), Positives = 36/63 (57%), Gaps = 1/63 (1%)

Query: 82  EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
           E EEE +E E EEE +E E EK+  E +K+  +++ ++  K  K+EE   EE  +   ++
Sbjct: 65  EIEEELRE-EYEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEELLELEAEE 123

Query: 142 EEE 144
            E 
Sbjct: 124 PEP 126



 Score = 32.6 bits (75), Expect = 0.080
 Identities = 21/67 (31%), Positives = 37/67 (55%)

Query: 88  KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
           +E EEE  EE EE+ K+ + EK+  + E++  ++K KK  +  ++EE    +  E E EE
Sbjct: 64  EEIEEELREEYEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEELLELEAEE 123

Query: 148 EKEKVEM 154
            +  +  
Sbjct: 124 PEPPLRP 130



 Score = 29.9 bits (68), Expect = 0.65
 Identities = 18/56 (32%), Positives = 32/56 (57%)

Query: 81  NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQ 136
            E  EE +E+ +E E E+E  E ++   +K+ KK  ++ K ++   EE+ E E E+
Sbjct: 68  EELREEYEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEELLELEAEE 123


>gnl|CDD|218598 pfam05470, eIF-3c_N, Eukaryotic translation initiation factor 3
           subunit 8 N-terminus.  The largest of the mammalian
           translation initiation factors, eIF3, consists of at
           least eight subunits ranging in mass from 35 to 170 kDa.
           eIF3 binds to the 40 S ribosome in an early step of
           translation initiation and promotes the binding of
           methionyl-tRNAi and mRNA.
          Length = 593

 Score = 32.8 bits (75), Expect = 0.061
 Identities = 19/67 (28%), Positives = 39/67 (58%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
           E E E+EEE+E+++++   DEDE++      EE     +   + V+E++EE ++    ++
Sbjct: 136 ESEDEEEEEDEDDDDDGSDDEDEDEDGVGATEEVAASSESGVDRVKEDDEEDEDADLSKK 195

Query: 145 EEEEKEK 151
           +  E+ K
Sbjct: 196 DVLEEPK 202



 Score = 32.1 bits (73), Expect = 0.14
 Identities = 19/62 (30%), Positives = 38/62 (61%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           +EEE+E E+++++  +++++DED     E+     E   D+ +E+ EE+E+   +KK   
Sbjct: 139 DEEEEEDEDDDDDGSDDEDEDEDGVGATEEVAASSESGVDRVKEDDEEDEDADLSKKDVL 198

Query: 144 EE 145
           EE
Sbjct: 199 EE 200



 Score = 28.2 bits (63), Expect = 2.2
 Identities = 22/83 (26%), Positives = 41/83 (49%)

Query: 80  NNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
           N ++E++     E+ E E+EE+E+DED++      E+E+E      EE     E      
Sbjct: 121 NKQFEDDITRYREDPESEDEEEEEDEDDDDDGSDDEDEDEDGVGATEEVAASSESGVDRV 180

Query: 140 KKEEEEEEEKEKVEMYRNPVPMY 162
           K+++EE+E+ +  +      P  
Sbjct: 181 KEDDEEDEDADLSKKDVLEEPKM 203



 Score = 27.1 bits (60), Expect = 5.9
 Identities = 19/73 (26%), Positives = 42/73 (57%)

Query: 80  NNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
           NN+  E++  +  E+ E E+E+E++++++      +E+E++      EEV    E   ++
Sbjct: 120 NNKQFEDDITRYREDPESEDEEEEEDEDDDDDGSDDEDEDEDGVGATEEVAASSESGVDR 179

Query: 140 KKEEEEEEEKEKV 152
            KE++EE+E   +
Sbjct: 180 VKEDDEEDEDADL 192


>gnl|CDD|237496 PRK13766, PRK13766, Hef nuclease; Provisional.
          Length = 773

 Score = 32.9 bits (76), Expect = 0.061
 Identities = 20/65 (30%), Positives = 41/65 (63%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
           E L+N +    +K +E +EE++ EE+EKDE        K + +E+++++++EE ++E EE
Sbjct: 498 EELKNLKGILNKKLQELDEEQKGEEEEKDEQLSLDDFVKSKGKEEEEEEEKEEKDKETEE 557

Query: 136 QKNKK 140
            + + 
Sbjct: 558 DEPEG 562


>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740).  This
           family consists of several uncharacterized plant
           proteins of unknown function.
          Length = 565

 Score = 33.1 bits (75), Expect = 0.061
 Identities = 23/59 (38%), Positives = 35/59 (59%), Gaps = 1/59 (1%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDED-EEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
            EEE+E E EE+EE  EKE  +  +EK  E++EEE +  KD  + E + ++   K+  K
Sbjct: 140 LEEEEEVEMEEDEEYYEKEPGKVVDEKSEEEEEEELKTMKDFIDLESQTKKPSVKDNGK 198



 Score = 31.9 bits (72), Expect = 0.14
 Identities = 19/56 (33%), Positives = 33/56 (58%)

Query: 87  EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKE 142
            +E+EE E EE+EE  + E  +   EK EEEEE++    ++ ++ E + +K   K+
Sbjct: 140 LEEEEEVEMEEDEEYYEKEPGKVVDEKSEEEEEEELKTMKDFIDLESQTKKPSVKD 195



 Score = 31.2 bits (70), Expect = 0.24
 Identities = 27/100 (27%), Positives = 43/100 (43%), Gaps = 3/100 (3%)

Query: 69  PQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEK---KREKKEEEEEKQKDKK 125
           PQ  S      +  W     + EE         E D +  K        EEEEE + ++ 
Sbjct: 92  PQRRSCDVRSRSTLWSLFHDDDEENLPSSIAPPEIDPEPRKPIVPDLVLEEEEEVEMEED 151

Query: 126 EEEVEEEEEEQKNKKKEEEEEEEKEKVEMYRNPVPMYRNP 165
           EE  E+E  +  ++K EEEEEEE + ++ + +     + P
Sbjct: 152 EEYYEKEPGKVVDEKSEEEEEEELKTMKDFIDLESQTKKP 191



 Score = 26.9 bits (59), Expect = 5.5
 Identities = 21/63 (33%), Positives = 32/63 (50%), Gaps = 7/63 (11%)

Query: 95  EEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEM 154
            EEEEE E +EDEE         E++     +E+ EEEEEE+    K+  + E + K   
Sbjct: 140 LEEEEEVEMEEDEEY-------YEKEPGKVVDEKSEEEEEEELKTMKDFIDLESQTKKPS 192

Query: 155 YRN 157
            ++
Sbjct: 193 VKD 195


>gnl|CDD|227504 COG5177, COG5177, Uncharacterized conserved protein [Function
           unknown].
          Length = 769

 Score = 33.1 bits (75), Expect = 0.062
 Identities = 15/84 (17%), Positives = 40/84 (47%), Gaps = 3/84 (3%)

Query: 73  SASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKE---EEEEKQKDKKEEEV 129
            A    +  E + +  ++E      +++  K+ D E+    +E   ++ E  ++   EE 
Sbjct: 389 QAKWAEDEEEEDGQCNDEESTMSAIDDDDPKENDNEEVAGDEESAIDDNEGFEELSPEEE 448

Query: 130 EEEEEEQKNKKKEEEEEEEKEKVE 153
           E +  E ++ +KE+ E  ++ +++
Sbjct: 449 ERQLREFRDMEKEDREFPDEAELQ 472



 Score = 31.2 bits (70), Expect = 0.27
 Identities = 21/78 (26%), Positives = 42/78 (53%), Gaps = 4/78 (5%)

Query: 77  TLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKE----EEEEKQKDKKEEEVEEE 132
             +    E+EE+E  +  +EE      D+D+ K+ + +E    EE     ++  EE+  E
Sbjct: 387 FYQAKWAEDEEEEDGQCNDEESTMSAIDDDDPKENDNEEVAGDEESAIDDNEGFEELSPE 446

Query: 133 EEEQKNKKKEEEEEEEKE 150
           EEE++ ++  + E+E++E
Sbjct: 447 EEERQLREFRDMEKEDRE 464



 Score = 28.1 bits (62), Expect = 2.5
 Identities = 17/77 (22%), Positives = 36/77 (46%), Gaps = 6/77 (7%)

Query: 83  WEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE------EEQ 136
           ++ +  E EEEE+ +  ++E         + KE + E+    +E  +++ E       E+
Sbjct: 388 YQAKWAEDEEEEDGQCNDEESTMSAIDDDDPKENDNEEVAGDEESAIDDNEGFEELSPEE 447

Query: 137 KNKKKEEEEEEEKEKVE 153
           + ++  E  + EKE  E
Sbjct: 448 EERQLREFRDMEKEDRE 464


>gnl|CDD|227563 COG5238, RNA1, Ran GTPase-activating protein (RanGAP) involved in
           mRNA processing and transport [Signal transduction
           mechanisms / RNA processing and modification].
          Length = 388

 Score = 33.0 bits (75), Expect = 0.064
 Identities = 19/58 (32%), Positives = 28/58 (48%), Gaps = 1/58 (1%)

Query: 76  ETLENNEWEEEEKEKEE-EEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
           E +E  E +E +   EE  +EE E+E E DE   ++  + E  E +  D  E   E E
Sbjct: 330 EVVEVVEKQEGDVVTEESTDEESEDEVEIDESVIEEVAEMELLEVQVDDLAERLAETE 387


>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase; Provisional.
          Length = 330

 Score = 32.9 bits (76), Expect = 0.064
 Identities = 14/45 (31%), Positives = 25/45 (55%)

Query: 81  NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKK 125
           NE  +E KE+ +    +++E +     +KK+EK+E +  K K K 
Sbjct: 46  NEQYKEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAKSKP 90



 Score = 29.4 bits (67), Expect = 0.73
 Identities = 12/40 (30%), Positives = 23/40 (57%)

Query: 89  EKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
           E+ +E +EE +    D+ E K   K ++++EKQ+ K  + 
Sbjct: 47  EQYKEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAKA 86



 Score = 26.7 bits (60), Expect = 6.9
 Identities = 11/45 (24%), Positives = 26/45 (57%)

Query: 93  EEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
           E+ +E +E+ K    +KK  K   + +K+K+K+E +  + + + +
Sbjct: 47  EQYKEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAKSKPR 91


>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457).  This is
           a family of uncharacterized proteins.
          Length = 449

 Score = 32.7 bits (74), Expect = 0.066
 Identities = 16/59 (27%), Positives = 34/59 (57%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           EEE  +EE+++EE+++ + DEDE+   +  +E++E + D      ++   +  N+   E
Sbjct: 46  EEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLHDDSSADDGNETDNE 104



 Score = 30.3 bits (68), Expect = 0.46
 Identities = 14/66 (21%), Positives = 35/66 (53%)

Query: 88  KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
           ++  +E EEE  E+E D++E+   +  E+E++   D  E++ +E++++         ++ 
Sbjct: 39  RKLGKEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLHDDSSADDG 98

Query: 148 EKEKVE 153
            +   E
Sbjct: 99  NETDNE 104



 Score = 27.3 bits (60), Expect = 5.0
 Identities = 15/74 (20%), Positives = 42/74 (56%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
           E  E E++++E ++++++E+E+ + D+D+E   ++ +++     D   ++  E + E   
Sbjct: 48  EAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLHDDSSADDGNETDNEAGF 107

Query: 139 KKKEEEEEEEKEKV 152
              ++E ++  E V
Sbjct: 108 ADSDDESDDGSEYV 121


>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2.  PPP4R2 (protein phosphatase 4 core
           regulatory subunit R2) is the regulatory subunit of the
           histone H2A phosphatase complex. It has been shown to
           confer resistance to the anticancer drug cisplatin in
           yeast, and may confer resistance in higher eukaryotes.
          Length = 285

 Score = 32.5 bits (74), Expect = 0.069
 Identities = 17/70 (24%), Positives = 39/70 (55%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           +++ K  EE  E+E  +    +D+  K    + ++   ++  +++ +  EE++  + +EE
Sbjct: 212 QDKNKSLEEYYEKESSDAAASQDDGPKGSDVKNKKSDDEEDDDQDGDYVEEKELKEDEEE 271

Query: 144 EEEEEKEKVE 153
           EE EE+E+ E
Sbjct: 272 EETEEEEEEE 281



 Score = 32.1 bits (73), Expect = 0.11
 Identities = 18/71 (25%), Positives = 40/71 (56%)

Query: 83  WEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKE 142
              ++K K  EE  E+E  +    ++   +  + + +K  D+++++ + +  E+K  K++
Sbjct: 209 DSSQDKNKSLEEYYEKESSDAAASQDDGPKGSDVKNKKSDDEEDDDQDGDYVEEKELKED 268

Query: 143 EEEEEEKEKVE 153
           EEEEE +E+ E
Sbjct: 269 EEEEETEEEEE 279



 Score = 30.2 bits (68), Expect = 0.41
 Identities = 14/58 (24%), Positives = 37/58 (63%)

Query: 69  PQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE 126
             A S  +  + ++ + ++ + EE+++++ +  E+ E +E + E++ EEEE+++D+ E
Sbjct: 228 DAAASQDDGPKGSDVKNKKSDDEEDDDQDGDYVEEKELKEDEEEEETEEEEEEEDEDE 285



 Score = 28.6 bits (64), Expect = 1.4
 Identities = 12/60 (20%), Positives = 41/60 (68%)

Query: 70  QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
            + +A+   +  +  + + +K ++EE+++++ +  E++E K +++EEE E+++++++E+ 
Sbjct: 226 SSDAAASQDDGPKGSDVKNKKSDDEEDDDQDGDYVEEKELKEDEEEEETEEEEEEEDEDE 285


>gnl|CDD|206063 pfam13892, DBINO, DNA-binding domain.  DBINO is a DNA-binding
           domain found on global transcription activator SNF2L1
           proteins and chromatin re-modelling proteins.
          Length = 140

 Score = 31.9 bits (73), Expect = 0.076
 Identities = 16/36 (44%), Positives = 24/36 (66%), Gaps = 3/36 (8%)

Query: 102 EKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
           EK+E E +KR +KE  E+    KKEEE+ E + +Q+
Sbjct: 82  EKEERELRKRAEKEALEQA---KKEEELREAKRQQR 114



 Score = 26.8 bits (60), Expect = 3.9
 Identities = 14/34 (41%), Positives = 22/34 (64%)

Query: 81  NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKK 114
           NE EE E  K  E+E  E+ K+++E  E KR+++
Sbjct: 81  NEKEERELRKRAEKEALEQAKKEEELREAKRQQR 114



 Score = 26.1 bits (58), Expect = 6.4
 Identities = 13/37 (35%), Positives = 24/37 (64%)

Query: 88  KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDK 124
           K+ E+EE E  +  EK+  E+ K+E++  E ++Q+ K
Sbjct: 79  KKNEKEERELRKRAEKEALEQAKKEEELREAKRQQRK 115



 Score = 26.1 bits (58), Expect = 6.7
 Identities = 16/41 (39%), Positives = 20/41 (48%), Gaps = 9/41 (21%)

Query: 109 KKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEK 149
           KK EK+E E  K+          E+E  +  KKEEE  E K
Sbjct: 79  KKNEKEERELRKR---------AEKEALEQAKKEEELREAK 110


>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
           chain; Provisional.
          Length = 1033

 Score = 32.5 bits (74), Expect = 0.086
 Identities = 20/78 (25%), Positives = 37/78 (47%), Gaps = 6/78 (7%)

Query: 75  SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVE---- 130
            E   N +  EEE E+E E        + D+DE    ++ E+EE+ ++ +   + E    
Sbjct: 1   EEEQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKAEISKR 60

Query: 131 --EEEEEQKNKKKEEEEE 146
                +E K +KK+E ++
Sbjct: 61  EKARLKELKKQKKQEIQK 78



 Score = 32.5 bits (74), Expect = 0.10
 Identities = 21/71 (29%), Positives = 33/71 (46%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           EEE+   +  EEE+EEE E            +E   + +D+ EE+ EE E   K +  + 
Sbjct: 1   EEEQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKAEISKR 60

Query: 144 EEEEEKEKVEM 154
           E+   KE  + 
Sbjct: 61  EKARLKELKKQ 71



 Score = 30.9 bits (70), Expect = 0.34
 Identities = 16/89 (17%), Positives = 43/89 (48%), Gaps = 8/89 (8%)

Query: 74  ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKD--------KK 125
             E +     EEE++E+ E           D++   + E ++EE++++ +        K+
Sbjct: 1   EEEQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKAEISKR 60

Query: 126 EEEVEEEEEEQKNKKKEEEEEEEKEKVEM 154
           E+   +E ++QK ++ ++  E++   ++ 
Sbjct: 61  EKARLKELKKQKKQEIQKILEQQNAAIDA 89



 Score = 30.5 bits (69), Expect = 0.42
 Identities = 15/72 (20%), Positives = 41/72 (56%)

Query: 69  PQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
            + + A      ++ +++E   E+E+E+EE+++E +   + +  K+E+   K+  K++++
Sbjct: 15  EEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKAEISKREKARLKELKKQKKQ 74

Query: 129 VEEEEEEQKNKK 140
             ++  EQ+N  
Sbjct: 75  EIQKILEQQNAA 86



 Score = 29.0 bits (65), Expect = 1.4
 Identities = 11/41 (26%), Positives = 22/41 (53%)

Query: 111 REKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
           +E +   +  QK + ++ ++  E E+       EE+EEKE+
Sbjct: 778 KEVRYLMQAHQKGQLKDTIDVAEPEEPGDPLTAEEQEEKEQ 818


>gnl|CDD|237875 PRK14974, PRK14974, cell division protein FtsY; Provisional.
          Length = 336

 Score = 32.3 bits (74), Expect = 0.087
 Identities = 24/53 (45%), Positives = 39/53 (73%), Gaps = 1/53 (1%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKE-EEEEKQKDKKEEEVEEEEEE 135
           EE+ +E+EEEE  E EE+E++EDEE+K+EK    ++ K  + KE+++E+  EE
Sbjct: 17  EEKIEEEEEEEAPEAEEEEEEEDEEEKKEKPGFFDKAKITEIKEKDIEDLLEE 69



 Score = 28.4 bits (64), Expect = 2.0
 Identities = 21/45 (46%), Positives = 32/45 (71%)

Query: 108 EKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKV 152
           +K +EK  +  EK ++K EEE EEE  E + +++EE+EEE+KEK 
Sbjct: 3   DKLKEKLSKFVEKVEEKIEEEEEEEAPEAEEEEEEEDEEEKKEKP 47



 Score = 28.0 bits (63), Expect = 2.8
 Identities = 16/31 (51%), Positives = 26/31 (83%)

Query: 123 DKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
           +K EE++EEEEEE+  + +EEEEEE++E+ +
Sbjct: 14  EKVEEKIEEEEEEEAPEAEEEEEEEDEEEKK 44



 Score = 27.6 bits (62), Expect = 3.3
 Identities = 23/67 (34%), Positives = 46/67 (68%)

Query: 88  KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
           KEK  +  E+ EEK ++E+EE+  E +EEEEE+ +++K+E+    ++ +  + KE++ E+
Sbjct: 6   KEKLSKFVEKVEEKIEEEEEEEAPEAEEEEEEEDEEEKKEKPGFFDKAKITEIKEKDIED 65

Query: 148 EKEKVEM 154
             E++E+
Sbjct: 66  LLEELEL 72



 Score = 26.1 bits (58), Expect = 9.0
 Identities = 20/48 (41%), Positives = 28/48 (58%)

Query: 73  SASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEK 120
              E  E  E  E E+E+EEE+EEE++EK    D+ K  E KE++ E 
Sbjct: 18  EKIEEEEEEEAPEAEEEEEEEDEEEKKEKPGFFDKAKITEIKEKDIED 65


>gnl|CDD|217049 pfam02459, Adeno_terminal, Adenoviral DNA terminal protein.  This
           protein is covalently attached to the terminii of
           replicating DNA in vivo.
          Length = 548

 Score = 32.3 bits (74), Expect = 0.087
 Identities = 18/34 (52%), Positives = 24/34 (70%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEE 118
               E EEEEEEEEE  E++E+EE++ E+  EEE
Sbjct: 300 PSPPEPEEEEEEEEEVPEEEEEEEEEEERTFEEE 333



 Score = 30.8 bits (70), Expect = 0.35
 Identities = 15/34 (44%), Positives = 23/34 (67%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEE 117
               + +EEEEEEEE  +E++E+EE++    EEE
Sbjct: 300 PSPPEPEEEEEEEEEVPEEEEEEEEEEERTFEEE 333



 Score = 30.8 bits (70), Expect = 0.36
 Identities = 18/33 (54%), Positives = 21/33 (63%)

Query: 75  SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDE 107
           S      E EEEE+  EEEEEEEEEE+   E+E
Sbjct: 301 SPPEPEEEEEEEEEVPEEEEEEEEEEERTFEEE 333



 Score = 30.4 bits (69), Expect = 0.47
 Identities = 18/27 (66%), Positives = 22/27 (81%)

Query: 82  EWEEEEKEKEEEEEEEEEEKEKDEDEE 108
           E EEEE+E  EEEEEEEEE+E+  +EE
Sbjct: 307 EEEEEEEEVPEEEEEEEEEEERTFEEE 333



 Score = 28.8 bits (65), Expect = 1.4
 Identities = 18/33 (54%), Positives = 21/33 (63%)

Query: 73  SASETLENNEWEEEEKEKEEEEEEEEEEKEKDE 105
           S  E  E  E EEE  E+EEEEEEEEE   ++E
Sbjct: 301 SPPEPEEEEEEEEEVPEEEEEEEEEEERTFEEE 333



 Score = 28.5 bits (64), Expect = 1.7
 Identities = 16/34 (47%), Positives = 24/34 (70%)

Query: 86  EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEE 119
               + EEEEEEEEE  ++E+EE++ E++  EEE
Sbjct: 300 PSPPEPEEEEEEEEEVPEEEEEEEEEEERTFEEE 333



 Score = 28.1 bits (63), Expect = 2.7
 Identities = 20/49 (40%), Positives = 28/49 (57%)

Query: 63  RLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKR 111
           RL +   +      +    E EEEE+E+  EEEEEEEE+E+   EE+ R
Sbjct: 287 RLPLRRRRRRRPPPSPPEPEEEEEEEEEVPEEEEEEEEEEERTFEEEVR 335



 Score = 26.9 bits (60), Expect = 6.1
 Identities = 19/43 (44%), Positives = 25/43 (58%), Gaps = 8/43 (18%)

Query: 87  EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
               E EEEEEEEE+  +E+        EEEEE+++   EEEV
Sbjct: 300 PSPPEPEEEEEEEEEVPEEE--------EEEEEEEERTFEEEV 334



 Score = 26.5 bits (59), Expect = 7.8
 Identities = 15/25 (60%), Positives = 20/25 (80%)

Query: 126 EEEVEEEEEEQKNKKKEEEEEEEKE 150
            E  EEEEEE++  ++EEEEEEE+E
Sbjct: 303 PEPEEEEEEEEEVPEEEEEEEEEEE 327



 Score = 26.5 bits (59), Expect = 9.1
 Identities = 15/25 (60%), Positives = 19/25 (76%)

Query: 126 EEEVEEEEEEQKNKKKEEEEEEEKE 150
            EE EEEEEE   +++EEEEEEE+ 
Sbjct: 305 PEEEEEEEEEVPEEEEEEEEEEERT 329



 Score = 26.5 bits (59), Expect = 9.2
 Identities = 17/40 (42%), Positives = 25/40 (62%), Gaps = 6/40 (15%)

Query: 109 KKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEE 148
           ++R      E ++++++EEEV EEEEE      EEEEEE 
Sbjct: 295 RRRPPPSPPEPEEEEEEEEEVPEEEEE------EEEEEER 328


>gnl|CDD|222160 pfam13476, AAA_23, AAA domain. 
          Length = 204

 Score = 32.1 bits (73), Expect = 0.089
 Identities = 20/73 (27%), Positives = 44/73 (60%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
           E L    +  +E+ +E + + +E+++  +E E++  E ++E++  +K  +E+E ++E EE
Sbjct: 132 EILPLLLYLGQERLEELKFKRKEKKERLEELEKELEELEDEKDLLEKLLEEKEKKKELEE 191

Query: 136 QKNKKKEEEEEEE 148
            K + +E  EE E
Sbjct: 192 LKEELEELLEELE 204



 Score = 29.8 bits (67), Expect = 0.47
 Identities = 20/64 (31%), Positives = 41/64 (64%)

Query: 87  EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEE 146
            +E+ EE + + +EK++  +E +K  ++ E+E+   +K  EE E+++E ++ K++ EE  
Sbjct: 141 GQERLEELKFKRKEKKERLEELEKELEELEDEKDLLEKLLEEKEKKKELEELKEELEELL 200

Query: 147 EEKE 150
           EE E
Sbjct: 201 EELE 204



 Score = 28.7 bits (64), Expect = 1.2
 Identities = 18/60 (30%), Positives = 39/60 (65%)

Query: 94  EEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
            +E  EE +    E+K+R ++ E+E ++ + +++ +E+  EE++ KK+ EE +EE E++ 
Sbjct: 141 GQERLEELKFKRKEKKERLEELEKELEELEDEKDLLEKLLEEKEKKKELEELKEELEELL 200



 Score = 27.1 bits (60), Expect = 4.3
 Identities = 22/70 (31%), Positives = 41/70 (58%), Gaps = 1/70 (1%)

Query: 68  LPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEE 127
           L   +      E     +E+KE+ EE E+E EE E ++D  +K  ++ +E++K+ ++ +E
Sbjct: 136 LLLYLGQERLEELKFKRKEKKERLEELEKELEELEDEKDLLEKLLEE-KEKKKELEELKE 194

Query: 128 EVEEEEEEQK 137
           E+EE  EE +
Sbjct: 195 ELEELLEELE 204


>gnl|CDD|237192 PRK12758, PRK12758, DNA topoisomerase IV subunit A; Provisional.
          Length = 869

 Score = 32.7 bits (75), Expect = 0.089
 Identities = 12/47 (25%), Positives = 23/47 (48%)

Query: 75  SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQ 121
           +  ++     E    +E EEE  EE +E D ++ K  +  ++E+  Q
Sbjct: 819 TYKVKTINELEPLPYEEPEEETAEEPEEVDPEDVKSEDDIDDEDTGQ 865



 Score = 28.0 bits (63), Expect = 2.7
 Identities = 13/51 (25%), Positives = 24/51 (47%)

Query: 72  ISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK 122
           ++  +    NE E    E+ EEE  EE E+   ED + + +  +E+  +  
Sbjct: 817 LTTYKVKTINELEPLPYEEPEEETAEEPEEVDPEDVKSEDDIDDEDTGQLG 867



 Score = 26.9 bits (60), Expect = 6.5
 Identities = 21/56 (37%), Positives = 29/56 (51%), Gaps = 1/56 (1%)

Query: 56  KPIRANA-RLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKK 110
           K I+A   RLT    + I+  E L   E EEE  E+ EE + E+ + E D D+E  
Sbjct: 808 KGIKAKGNRLTTYKVKTINELEPLPYEEPEEETAEEPEEVDPEDVKSEDDIDDEDT 863



 Score = 26.5 bits (59), Expect = 8.6
 Identities = 11/40 (27%), Positives = 19/40 (47%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKD 123
            E E    EE EEE  E+ ++ D E  + + + ++E    
Sbjct: 826 NELEPLPYEEPEEETAEEPEEVDPEDVKSEDDIDDEDTGQ 865


>gnl|CDD|219405 pfam07418, PCEMA1, Acidic phosphoprotein precursor PCEMA1.  This
           family consists of several acidic phosphoprotein
           precursor PCEMA1 sequences which appear to be found
           exclusively in Plasmodium chabaudi. PCEMA1 is an antigen
           that is associated with the membrane of the infected
           erythrocyte throughout the entire intraerythrocytic
           cycle. The exact function of this family is unclear.
          Length = 286

 Score = 32.2 bits (73), Expect = 0.093
 Identities = 22/79 (27%), Positives = 40/79 (50%), Gaps = 2/79 (2%)

Query: 75  SETLENNEWEEEEKEKEEEEEEEEEEKEKDE--DEEKKREKKEEEEEKQKDKKEEEVEEE 132
           +    + E EE + E E E     ++ E D    E+ +RE +E + +K    + E  +E 
Sbjct: 207 TSYFNDGENEENDDELEAEVISYLKDGENDNEVKEKIRREYREWKGDKANTNETEIEDES 266

Query: 133 EEEQKNKKKEEEEEEEKEK 151
           E+E + +  EE+E E+K +
Sbjct: 267 EDEYEEEAGEEQENEDKGE 285



 Score = 32.2 bits (73), Expect = 0.10
 Identities = 23/85 (27%), Positives = 39/85 (45%)

Query: 67  ILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE 126
                +    T   N+ E EE + E E E     K+ + D E K + + E  E + DK  
Sbjct: 197 SYLNELDDEVTSYFNDGENEENDDELEAEVISYLKDGENDNEVKEKIRREYREWKGDKAN 256

Query: 127 EEVEEEEEEQKNKKKEEEEEEEKEK 151
               E E+E +++ +EE  EE++ +
Sbjct: 257 TNETEIEDESEDEYEEEAGEEQENE 281



 Score = 29.1 bits (65), Expect = 0.95
 Identities = 19/84 (22%), Positives = 37/84 (44%), Gaps = 1/84 (1%)

Query: 70  QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
             +    T   NE ++E      + E EE + E + +     +  E + E  K+K   E 
Sbjct: 189 DELDEEVTSYLNELDDEVTSYFNDGENEENDDELEAEVISYLKDGENDNEV-KEKIRREY 247

Query: 130 EEEEEEQKNKKKEEEEEEEKEKVE 153
            E + ++ N  + E E+E +++ E
Sbjct: 248 REWKGDKANTNETEIEDESEDEYE 271


>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
           biogenesis [Translation, ribosomal structure and
           biogenesis].
          Length = 1077

 Score = 32.4 bits (73), Expect = 0.094
 Identities = 27/88 (30%), Positives = 39/88 (44%), Gaps = 4/88 (4%)

Query: 70  QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
           + I   E     + E+EE   + E EE        E+EE   E   E E ++  +K+EE+
Sbjct: 594 ELIQDDEKGNFEDLEDEENSSDNEMEESRGSSVTAENEESADEVDYETEREENARKKEEL 653

Query: 130 ----EEEEEEQKNKKKEEEEEEEKEKVE 153
               E EE     KK  +   EEK K+E
Sbjct: 654 RGNFELEERGDPEKKDVDWYTEEKRKIE 681



 Score = 29.7 bits (66), Expect = 0.74
 Identities = 23/83 (27%), Positives = 37/83 (44%), Gaps = 4/83 (4%)

Query: 81  NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
            E EE   E + E E EE  ++K    E+ R   E EE    +KK+ +   EE+ +  ++
Sbjct: 628 AENEESADEVDYETEREENARKK----EELRGNFELEERGDPEKKDVDWYTEEKRKIEEQ 683

Query: 141 KEEEEEEEKEKVEMYRNPVPMYR 163
            +    E +  V   R  +  YR
Sbjct: 684 LKINRSEFETMVPESRVVIEGYR 706



 Score = 29.3 bits (65), Expect = 0.99
 Identities = 18/72 (25%), Positives = 41/72 (56%), Gaps = 3/72 (4%)

Query: 85   EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
             E +EK E ++   +E+ KD++E+++ E  +  +E++  KKE   +E E+  +    +  
Sbjct: 1009 PECREKHEIKDRIVKERIKDQEEKERMESLQRAKEEEIGKKE---KEREQRIRKTIHDNY 1065

Query: 145  EEEEKEKVEMYR 156
            +E  K++++  R
Sbjct: 1066 KEMAKKRLKKKR 1077



 Score = 29.3 bits (65), Expect = 1.1
 Identities = 19/83 (22%), Positives = 38/83 (45%), Gaps = 4/83 (4%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEK----KREKKEEEEEKQKDKKEEEVEEEEE 134
           E+ E EE   + E EE        ++E+       + E++E   +K++ +   E+EE  +
Sbjct: 605 EDLEDEENSSDNEMEESRGSSVTAENEESADEVDYETEREENARKKEELRGNFELEERGD 664

Query: 135 EQKNKKKEEEEEEEKEKVEMYRN 157
            +K       EE+ K + ++  N
Sbjct: 665 PEKKDVDWYTEEKRKIEEQLKIN 687



 Score = 27.4 bits (60), Expect = 4.4
 Identities = 19/65 (29%), Positives = 28/65 (43%)

Query: 92  EEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
           E EEE  ++ EK   E+ + E+   + E ++ +      E EE       E E EE   K
Sbjct: 590 EGEEELIQDDEKGNFEDLEDEENSSDNEMEESRGSSVTAENEESADEVDYETEREENARK 649

Query: 152 VEMYR 156
            E  R
Sbjct: 650 KEELR 654



 Score = 27.0 bits (59), Expect = 5.8
 Identities = 20/79 (25%), Positives = 42/79 (53%), Gaps = 4/79 (5%)

Query: 63   RLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK 122
            R+ + +P        +++   +E  K++EE+E  E  ++ K    E++  KKE+E E++ 
Sbjct: 1002 RIELPVPPECREKHEIKDRIVKERIKDQEEKERMESLQRAK----EEEIGKKEKEREQRI 1057

Query: 123  DKKEEEVEEEEEEQKNKKK 141
             K   +  +E  +++ KKK
Sbjct: 1058 RKTIHDNYKEMAKKRLKKK 1076


>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
          Length = 646

 Score = 32.2 bits (74), Expect = 0.095
 Identities = 28/123 (22%), Positives = 50/123 (40%), Gaps = 31/123 (25%)

Query: 68  LPQAISASETLENNEWEEEEKEKEEEE--EEEEEEKEKDEDEEKK-REKKEEEEEKQK-- 122
           L +  + SE +   + E EE EKE E   EE +E  +K  +E     E  E E E+ +  
Sbjct: 207 LEEEGTPSELIREIKEELEEIEKERESLLEELKELAKKYLEELLALYEYLEIELERAEAL 266

Query: 123 -----------------DKKEEEVEEEEEEQKNKK---KEEEEEEEKEKVEMYRNPVPMY 162
                            + + ++++E  ++        +  E +EE+E+V     P    
Sbjct: 267 SKFLKTDKTFAIEGWVPEDRVKKLKELIDKATGGSAYVEFVEPDEEEEEV-----PT-KL 320

Query: 163 RNP 165
           +NP
Sbjct: 321 KNP 323



 Score = 31.4 bits (72), Expect = 0.20
 Identities = 26/80 (32%), Positives = 38/80 (47%), Gaps = 8/80 (10%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKE- 142
           +E E+E ++   E  E +E+    E  RE KEE EE +K++  E + EE +E   K  E 
Sbjct: 191 DEVEEELKKLGFERLELEEEGTPSELIREIKEELEEIEKER--ESLLEELKELAKKYLEE 248

Query: 143 -----EEEEEEKEKVEMYRN 157
                E  E E E+ E    
Sbjct: 249 LLALYEYLEIELERAEALSK 268



 Score = 30.3 bits (69), Expect = 0.43
 Identities = 13/56 (23%), Positives = 28/56 (50%)

Query: 94  EEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEK 149
                +     E+++K   K  EE  K  +++ E++E+E +E + +  E E E ++
Sbjct: 63  RSYLPKLNPLREEKKKVSVKSLEELIKDVEEELEKIEKEIKELEEEISELENEIKE 118



 Score = 29.5 bits (67), Expect = 0.76
 Identities = 18/59 (30%), Positives = 31/59 (52%)

Query: 87  EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
            K     EE+++   +  E+  K  E++ E+ EK+  + EEE+ E E E K  ++E E 
Sbjct: 67  PKLNPLREEKKKVSVKSLEELIKDVEEELEKIEKEIKELEEEISELENEIKELEQEIER 125



 Score = 28.7 bits (65), Expect = 1.5
 Identities = 21/67 (31%), Positives = 30/67 (44%), Gaps = 4/67 (5%)

Query: 78  LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
            E  E EEE    E   E +EE +E     EK+RE   EE ++   K  EE+    E  +
Sbjct: 202 FERLELEEEGTPSELIREIKEELEEI----EKERESLLEELKELAKKYLEELLALYEYLE 257

Query: 138 NKKKEEE 144
            + +  E
Sbjct: 258 IELERAE 264



 Score = 28.4 bits (64), Expect = 2.3
 Identities = 23/108 (21%), Positives = 40/108 (37%), Gaps = 26/108 (24%)

Query: 71  AISASETLENNEWEEEEKEKEEEEEEEEEE-KEKD------EDEEKKREKKEEEEEKQKD 123
                + +     EE  K+ EEE E+ E+E KE +      E+E K+ E++ E  E   +
Sbjct: 72  LREEKKKVSVKSLEELIKDVEEELEKIEKEIKELEEEISELENEIKELEQEIERLEPWGN 131

Query: 124 -------------------KKEEEVEEEEEEQKNKKKEEEEEEEKEKV 152
                                 E+  EE + + + +  E    +K  V
Sbjct: 132 FDLDLSLLLGFKYVSVFVGTVPEDKLEELKLESDVENVEYISTDKGYV 179



 Score = 27.2 bits (61), Expect = 5.5
 Identities = 17/85 (20%), Positives = 34/85 (40%), Gaps = 10/85 (11%)

Query: 75  SETLENNEWEEEEKEKEEEEEEEEE--------EKEKDEDEEKKREKKEEEEEKQKDKKE 126
            E        E + E  E    ++          KE  ++ E++ +K   E  + +++  
Sbjct: 153 PEDKLEELKLESDVENVEYISTDKGYVYVVVVVLKELSDEVEEELKKLGFERLELEEEGT 212

Query: 127 --EEVEEEEEEQKNKKKEEEEEEEK 149
             E + E +EE +  +KE E   E+
Sbjct: 213 PSELIREIKEELEEIEKERESLLEE 237


>gnl|CDD|223266 COG0188, GyrA, Type IIA topoisomerase (DNA gyrase/topo II,
           topoisomerase IV), A subunit [DNA replication,
           recombination, and repair].
          Length = 804

 Score = 32.2 bits (74), Expect = 0.099
 Identities = 26/79 (32%), Positives = 39/79 (49%), Gaps = 2/79 (2%)

Query: 78  LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKRE--KKEEEEEKQKDKKEEEVEEEEEE 135
           L   E E+ EKE +E E+E  + ++    EE+  +  KKE  E K+K   E   E  EEE
Sbjct: 431 LTGLEEEKIEKELKELEKEIADLEKILASEERLLDIIKKELLEIKKKFGDERRTEIVEEE 490

Query: 136 QKNKKKEEEEEEEKEKVEM 154
           +   + E+   EE   V +
Sbjct: 491 EDEIEDEDLIAEEDVVVTL 509


>gnl|CDD|149105 pfam07851, TMPIT, TMPIT-like protein.  A number of members of this
           family are annotated as being transmembrane proteins
           induced by tumour necrosis factor alpha, but no
           literature was found to support this.
          Length = 330

 Score = 32.0 bits (73), Expect = 0.10
 Identities = 18/79 (22%), Positives = 41/79 (51%)

Query: 81  NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
            EW+E EKE ++ +E     K+K E+ EK +E+      +Q+ + ++ +   ++ +K+  
Sbjct: 4   EEWKELEKEFQQLQETHRLYKQKLEEVEKLQEQCTSAIARQRKRLKQLIVSLKKLKKSLT 63

Query: 141 KEEEEEEEKEKVEMYRNPV 159
            E+ E  E+ + ++     
Sbjct: 64  PEDSELVEQLEEQIKERKN 82


>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572).  Family of
           eukaryotic proteins with undetermined function.
          Length = 321

 Score = 32.0 bits (73), Expect = 0.11
 Identities = 19/56 (33%), Positives = 30/56 (53%)

Query: 87  EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKE 142
           E     E++EEEEE+E+DE   K      E EE ++   +E+ E++EE+  N    
Sbjct: 174 EALFRREKKEEEEEEEEDEALIKSLSFGPETEEDRRRADDEDSEDDEEDNDNTPSP 229



 Score = 31.3 bits (71), Expect = 0.19
 Identities = 24/92 (26%), Positives = 39/92 (42%), Gaps = 22/92 (23%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE---------- 134
             E +K +EE+EE  EKE++E+      KK E       ++ E +E  EE          
Sbjct: 107 NYEADKLDEEQEERVEKEREEELAGDAMKKLENRTADSKREMEVLERLEELKELQSRRAD 166

Query: 135 ------------EQKNKKKEEEEEEEKEKVEM 154
                        +K +++EEEEE+E     +
Sbjct: 167 VDVNSMLEALFRREKKEEEEEEEEDEALIKSL 198



 Score = 26.2 bits (58), Expect = 9.2
 Identities = 14/68 (20%), Positives = 32/68 (47%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
            E+++E+EEEEE+E   K      E + +++  ++E  +D +E+       +  +    +
Sbjct: 179 REKKEEEEEEEEDEALIKSLSFGPETEEDRRRADDEDSEDDEEDNDNTPSPKSGSSSPAK 238

Query: 144 EEEEEKEK 151
                K+ 
Sbjct: 239 PTSILKKS 246



 Score = 26.2 bits (58), Expect = 9.5
 Identities = 17/72 (23%), Positives = 31/72 (43%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
           E L   E +EEE+E+EE+E   +      E EE +R   +E+ E  ++  +     +   
Sbjct: 174 EALFRREKKEEEEEEEEDEALIKSLSFGPETEEDRRRADDEDSEDDEEDNDNTPSPKSGS 233

Query: 136 QKNKKKEEEEEE 147
               K     ++
Sbjct: 234 SSPAKPTSILKK 245


>gnl|CDD|218556 pfam05327, RRN3, RNA polymerase I specific transcription initiation
           factor RRN3.  This family consists of several eukaryotic
           proteins which are homologous to the yeast RRN3 protein.
           RRN3 is one of the RRN genes specifically required for
           the transcription of rDNA by RNA polymerase I (Pol I) in
           Saccharomyces cerevisiae.
          Length = 554

 Score = 32.3 bits (74), Expect = 0.11
 Identities = 13/62 (20%), Positives = 30/62 (48%), Gaps = 12/62 (19%)

Query: 80  NNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
            NE ++ + E+EE    +E++ ++D+  +       ++++      EEE + E E     
Sbjct: 219 QNELDDIDDEEEERVLADEDDDDEDDMFD------MDDDD------EEESDPEVERTSTI 266

Query: 140 KK 141
           K+
Sbjct: 267 KE 268



 Score = 28.8 bits (65), Expect = 1.3
 Identities = 13/49 (26%), Positives = 31/49 (63%), Gaps = 2/49 (4%)

Query: 78  LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE 126
           L++ + EEEE+   +E++++E++    +D+++  E+ + E E+    KE
Sbjct: 222 LDDIDDEEEERVLADEDDDDEDDMFDMDDDDE--EESDPEVERTSTIKE 268


>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1.  All
           proteins in this family for which functions are known
           are cyclin dependent protein kinases that are components
           of TFIIH, a complex that is involved in nucleotide
           excision repair and transcription initiation. Also known
           as MAT1 (menage a trois 1). This family is based on the
           phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
           Stanford University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 309

 Score = 32.1 bits (73), Expect = 0.12
 Identities = 22/75 (29%), Positives = 37/75 (49%), Gaps = 2/75 (2%)

Query: 87  EKEKEEEEEEEEEEKEKDEDEEKKR--EKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
           EK   E+EE EE  + + E+EE++R   +KEEEE++   +K ++   +E E       E 
Sbjct: 138 EKSTREQEELEEALEFEKEEEEQRRLLLQKEEEEQQMNKRKNKQALLDELETSTLPAAEL 197

Query: 145 EEEEKEKVEMYRNPV 159
             + K+        V
Sbjct: 198 IAQHKKNSVKLEMQV 212



 Score = 30.5 bits (69), Expect = 0.37
 Identities = 21/74 (28%), Positives = 45/74 (60%), Gaps = 5/74 (6%)

Query: 78  LENNEWEEEEKEKEEEEEEEEEE-----KEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
           L NN   E  K+K E  ++E ++     KEK   E+++ E+  E E+++++++   +++E
Sbjct: 109 LTNNIDLENTKKKIETYQKENKDVIQKNKEKSTREQEELEEALEFEKEEEEQRRLLLQKE 168

Query: 133 EEEQKNKKKEEEEE 146
           EEEQ+  K++ ++ 
Sbjct: 169 EEEQQMNKRKNKQA 182


>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO.  Bacillus
           spores are protected by a protein shell consisting of
           over 50 different polypeptides, known as the coat. This
           family of proteins has an important morphogenetic role
           in coat assembly, it is involved in the assembly of at
           least 5 different coat proteins including CotB, CotG,
           CotS, CotSA and CotW. It is likely to act at a late
           stage of coat assembly.
          Length = 185

 Score = 31.7 bits (72), Expect = 0.12
 Identities = 20/73 (27%), Positives = 43/73 (58%)

Query: 88  KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
            +K +E+EEE+E  ++    +++ +K E EE +++ +  E   E+  EQ+ K++  +EEE
Sbjct: 33  IKKADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEEE 92

Query: 148 EKEKVEMYRNPVP 160
           ++E+ E  +    
Sbjct: 93  KEEEAEDVKQQEV 105



 Score = 30.2 bits (68), Expect = 0.38
 Identities = 23/85 (27%), Positives = 52/85 (61%)

Query: 69  PQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
               +  +T    + +E+E+EKE  +E  + ++E+ + E ++ EK++E  E +++   E+
Sbjct: 22  EAKANMQKTFIIKKADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQ 81

Query: 129 VEEEEEEQKNKKKEEEEEEEKEKVE 153
            E+EE  Q+ +K+EE E+ ++++V 
Sbjct: 82  QEKEEIAQEEEKEEEAEDVKQQEVF 106



 Score = 29.4 bits (66), Expect = 0.75
 Identities = 22/65 (33%), Positives = 40/65 (61%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
           + N  E  + ++EE++ E EE +++ E  E +RE   E++EK++  +EEE EEE E+ K 
Sbjct: 43  KENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEEEKEEEAEDVKQ 102

Query: 139 KKKEE 143
           ++   
Sbjct: 103 QEVFS 107



 Score = 28.6 bits (64), Expect = 1.3
 Identities = 21/71 (29%), Positives = 43/71 (60%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
           E  E  +E  + +EEE++ E E+ + E E  + E+++  E+++K++  +E E+EEE +  
Sbjct: 41  EEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEEEKEEEAEDV 100

Query: 139 KKKEEEEEEEK 149
           K++E    + K
Sbjct: 101 KQQEVFSFKRK 111



 Score = 28.6 bits (64), Expect = 1.3
 Identities = 18/75 (24%), Positives = 48/75 (64%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
            +EKE+E+E  +E  + +++E + +  E ++E+E  + ++++   ++E+EE   ++++EE
Sbjct: 36  ADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEEEKEE 95

Query: 145 EEEEKEKVEMYRNPV 159
           E E+ ++ E++    
Sbjct: 96  EAEDVKQQEVFSFKR 110



 Score = 28.3 bits (63), Expect = 1.4
 Identities = 24/85 (28%), Positives = 53/85 (62%)

Query: 69  PQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
            +    S+    ++ EE++ E EE E+E+E  + + ED  +++EK+E  +E++K+++ E+
Sbjct: 40  EEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEEEKEEEAED 99

Query: 129 VEEEEEEQKNKKKEEEEEEEKEKVE 153
           V+++E     +KK  +E   +EK++
Sbjct: 100 VKQQEVFSFKRKKPFKEMNLEEKID 124


>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
           complex component).  RNases H are enzymes that
           specifically hydrolyse RNA when annealed to a
           complementary DNA and are present in all living
           organisms. In yeast RNase H2 is composed of a complex of
           three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
           family represents the homologues of Ydr279p. It is not
           known whether non yeast proteins in this family fulfil
           the same function.
          Length = 287

 Score = 31.5 bits (72), Expect = 0.13
 Identities = 16/72 (22%), Positives = 36/72 (50%)

Query: 82  EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
           +  +E  +     E +  +K   E ++KKRE +E+ E  +   +++   +EE ++K  K+
Sbjct: 206 DLYKELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKE 265

Query: 142 EEEEEEEKEKVE 153
            +  +  K+ V 
Sbjct: 266 SKGVKALKKVVA 277



 Score = 29.6 bits (67), Expect = 0.76
 Identities = 18/59 (30%), Positives = 32/59 (54%)

Query: 93  EEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
            +  +E  K     E K  +K  +E +K+K + EE+VE  E   + K+K +EE ++K+ 
Sbjct: 205 PDLYKELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKP 263


>gnl|CDD|220838 pfam10659, Trypan_glycop_C, Trypanosome variant surface
           glycoprotein C-terminal domain.  The trypanosome
           parasite expresses these proteins to evade the immune
           response.
          Length = 98

 Score = 30.5 bits (69), Expect = 0.14
 Identities = 14/65 (21%), Positives = 33/65 (50%)

Query: 86  EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
            +K K + E +E+  K   ++++ K + KE + +K      +    E   +K K K++++
Sbjct: 2   NKKNKTKTECKEKGCKWDKKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKK 61

Query: 146 EEEKE 150
           + +K 
Sbjct: 62  DCKKG 66



 Score = 28.2 bits (63), Expect = 1.0
 Identities = 13/69 (18%), Positives = 35/69 (50%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
            ++ + + E +E+  + +K ED+ K + K+ + ++      +    E   E+   KK+++
Sbjct: 2   NKKNKTKTECKEKGCKWDKKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKK 61

Query: 145 EEEEKEKVE 153
           + ++  K E
Sbjct: 62  DCKKGCKWE 70



 Score = 27.0 bits (60), Expect = 2.1
 Identities = 8/46 (17%), Positives = 23/46 (50%)

Query: 82  EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEE 127
           +W+++E + + + +E + +K      +    +   E+ K K  K++
Sbjct: 17  KWDKKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKKD 62


>gnl|CDD|219514 pfam07686, V-set, Immunoglobulin V-set domain.  This domain is
           found in antibodies as well as neural protein P0 and
           CTL4 amongst others.
          Length = 114

 Score = 30.7 bits (69), Expect = 0.14
 Identities = 14/44 (31%), Positives = 20/44 (45%), Gaps = 1/44 (2%)

Query: 23  GDSSHGVYNLRIRNVSLTDDAEFQCQVGPAQYHKPIRANARLTV 66
           G+ S   ++L I N+ L+D   + C V          A  RLTV
Sbjct: 72  GNGSKNDFSLTISNLRLSDSGTYTCAVS-NPNELVFGAGTRLTV 114


>gnl|CDD|240327 PTZ00243, PTZ00243, ABC transporter; Provisional.
          Length = 1560

 Score = 32.1 bits (73), Expect = 0.15
 Identities = 13/63 (20%), Positives = 22/63 (34%), Gaps = 2/63 (3%)

Query: 74  ASETLENNEWEEEEKEKEEEEEEEEEEKEKD--EDEEKKREKKEEEEEKQKDKKEEEVEE 131
           A+E  EN + +E + + E  E +       D      K+    E  +    D     +  
Sbjct: 878 AAELKENKDSKEGDADAEVAEVDAAPGGAVDHEPPVAKQEGNAEGGDGAALDAAAGRLMT 937

Query: 132 EEE 134
            EE
Sbjct: 938 REE 940


>gnl|CDD|240413 PTZ00423, PTZ00423, glideosome-associated protein 45; Provisional.
          Length = 193

 Score = 31.2 bits (70), Expect = 0.16
 Identities = 22/56 (39%), Positives = 38/56 (67%)

Query: 98  EEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
           +E K +D DE  +REK ++E E+  ++K E++ EE E+Q  +  E+EEE E++K +
Sbjct: 11  KEPKRRDIDELAEREKLKKEVEEIPEQKPEDIVEELEDQPEEPPEQEEENEEQKPK 66



 Score = 27.0 bits (59), Expect = 4.0
 Identities = 24/65 (36%), Positives = 39/65 (60%)

Query: 86  EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
           +E ++ + +E  E EK K E EE   +K E+  E+ +D+ EE  E+EEE ++ K KEE +
Sbjct: 11  KEPKRRDIDELAEREKLKKEVEEIPEQKPEDIVEELEDQPEEPPEQEEENEEQKPKEEID 70

Query: 146 EEEKE 150
              +E
Sbjct: 71  YPIQE 75


>gnl|CDD|236278 PRK08506, PRK08506, replicative DNA helicase; Provisional.
          Length = 472

 Score = 31.5 bits (72), Expect = 0.17
 Identities = 13/33 (39%), Positives = 24/33 (72%)

Query: 101 KEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
           KE++E E++K+ KKE +EE++   + + +EE E
Sbjct: 391 KEREEKEKEKKAKKEGKEERRIHFQNKSIEEAE 423



 Score = 26.5 bits (59), Expect = 8.1
 Identities = 12/31 (38%), Positives = 20/31 (64%)

Query: 88  KEKEEEEEEEEEEKEKDEDEEKKREKKEEEE 118
           KE+EE+E+E++ +KE  E+     + K  EE
Sbjct: 391 KEREEKEKEKKAKKEGKEERRIHFQNKSIEE 421


>gnl|CDD|173534 PTZ00341, PTZ00341, Ring-infected erythrocyte surface antigen;
            Provisional.
          Length = 1136

 Score = 31.7 bits (71), Expect = 0.17
 Identities = 31/78 (39%), Positives = 40/78 (51%)

Query: 76   ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
            E +E N  E  E+  EE  EE  EE  ++  EE   E  EE  E+  ++  EEVEE  EE
Sbjct: 977  ENVEENVEENVEENVEENVEENVEENIEENVEENVEENIEENVEEYDEENVEEVEENVEE 1036

Query: 136  QKNKKKEEEEEEEKEKVE 153
               +  EE EE  +E VE
Sbjct: 1037 YDEENVEEIEENAEENVE 1054



 Score = 30.9 bits (69), Expect = 0.28
 Identities = 28/70 (40%), Positives = 40/70 (57%)

Query: 84   EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
            EE  +E +EE  EE EE  ++ DEE   E +E  EE  ++  EE +EE +EE   + +E 
Sbjct: 1016 EENVEEYDEENVEEVEENVEEYDEENVEEIEENAEENVEENIEENIEEYDEENVEEIEEN 1075

Query: 144  EEEEEKEKVE 153
             EE  +E VE
Sbjct: 1076 IEENIEENVE 1085



 Score = 30.9 bits (69), Expect = 0.30
 Identities = 29/72 (40%), Positives = 40/72 (55%)

Query: 82   EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
            E EE  +E +EE  EE EE  ++  EE   E  EE +E+  ++ EE +EE  EE   +  
Sbjct: 1029 EVEENVEEYDEENVEEIEENAEENVEENIEENIEEYDEENVEEIEENIEENIEENVEENV 1088

Query: 142  EEEEEEEKEKVE 153
            EE  EE +E VE
Sbjct: 1089 EENVEEIEENVE 1100



 Score = 30.9 bits (69), Expect = 0.31
 Identities = 29/78 (37%), Positives = 42/78 (53%)

Query: 76   ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
            E +E N  E  E+  EE  EE  EE  ++  EE   E  EE +E+  ++ EE VEE +EE
Sbjct: 981  ENVEENVEENVEENVEENVEENIEENVEENVEENIEENVEEYDEENVEEVEENVEEYDEE 1040

Query: 136  QKNKKKEEEEEEEKEKVE 153
               + +E  EE  +E +E
Sbjct: 1041 NVEEIEENAEENVEENIE 1058



 Score = 30.9 bits (69), Expect = 0.32
 Identities = 30/77 (38%), Positives = 40/77 (51%)

Query: 79   ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
            EN E   EE  +E  EE  EE  E++ +E  +   +E  EE  ++  EE VEE  EE   
Sbjct: 961  ENVEENVEENVEENVEENVEENVEENVEENVEENVEENVEENIEENVEENVEENIEENVE 1020

Query: 139  KKKEEEEEEEKEKVEMY 155
            +  EE  EE +E VE Y
Sbjct: 1021 EYDEENVEEVEENVEEY 1037



 Score = 30.5 bits (68), Expect = 0.39
 Identities = 28/70 (40%), Positives = 36/70 (51%)

Query: 84   EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
            EE  +E EE  EE  EE  ++  EE   E  EE EE  ++  EE VEE  EE   + +E 
Sbjct: 1039 EENVEEIEENAEENVEENIEENIEEYDEENVEEIEENIEENIEENVEENVEENVEEIEEN 1098

Query: 144  EEEEEKEKVE 153
             EE  +E  E
Sbjct: 1099 VEENVEENAE 1108



 Score = 30.5 bits (68), Expect = 0.40
 Identities = 28/70 (40%), Positives = 38/70 (54%)

Query: 84   EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
            EE  +E  EE  EE +E+  +E EE   E  EE  E+ ++  EE VEE  EE   +  EE
Sbjct: 1008 EENVEENIEENVEEYDEENVEEVEENVEEYDEENVEEIEENAEENVEENIEENIEEYDEE 1067

Query: 144  EEEEEKEKVE 153
              EE +E +E
Sbjct: 1068 NVEEIEENIE 1077



 Score = 30.5 bits (68), Expect = 0.45
 Identities = 29/78 (37%), Positives = 39/78 (50%)

Query: 76   ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
            E  +    EE E+  EE  EE  EE  ++ DEE   E +E  EE  ++  EE VEE  EE
Sbjct: 1035 EEYDEENVEEIEENAEENVEENIEENIEEYDEENVEEIEENIEENIEENVEENVEENVEE 1094

Query: 136  QKNKKKEEEEEEEKEKVE 153
             +   +E  EE  +E  E
Sbjct: 1095 IEENVEENVEENAEENAE 1112



 Score = 30.5 bits (68), Expect = 0.49
 Identities = 26/68 (38%), Positives = 36/68 (52%)

Query: 84   EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
            EE ++E  EE EE  EE  ++  EE   E  EE EE  ++  EE  EE  EE   +  EE
Sbjct: 1062 EEYDEENVEEIEENIEENIEENVEENVEENVEEIEENVEENVEENAEENAEENAEENAEE 1121

Query: 144  EEEEEKEK 151
             ++E  E+
Sbjct: 1122 YDDENPEE 1129



 Score = 29.8 bits (66), Expect = 0.69
 Identities = 29/78 (37%), Positives = 39/78 (50%)

Query: 76   ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
            E +E N  E  E+  EE  EE  EE  ++  EE   E  EE  E+  ++ +EE  EE EE
Sbjct: 973  ENVEENVEENVEENVEENVEENVEENVEENIEENVEENVEENIEENVEEYDEENVEEVEE 1032

Query: 136  QKNKKKEEEEEEEKEKVE 153
               +  EE  EE +E  E
Sbjct: 1033 NVEEYDEENVEEIEENAE 1050



 Score = 29.8 bits (66), Expect = 0.72
 Identities = 28/78 (35%), Positives = 41/78 (52%)

Query: 76   ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
            E +E N  E  E+  EE  EE  EE  ++  EE   E  EE  E+  ++  EE +EE  E
Sbjct: 969  ENVEENVEENVEENVEENVEENVEENVEENVEENIEENVEENVEENIEENVEEYDEENVE 1028

Query: 136  QKNKKKEEEEEEEKEKVE 153
            +  +  EE +EE  E++E
Sbjct: 1029 EVEENVEEYDEENVEEIE 1046



 Score = 29.8 bits (66), Expect = 0.76
 Identities = 29/80 (36%), Positives = 40/80 (50%)

Query: 76   ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
            E +E N  E  E+  EE  EE  EE  ++  EE   E  EE  E+ ++  EE  EE  EE
Sbjct: 985  ENVEENVEENVEENVEENIEENVEENVEENIEENVEEYDEENVEEVEENVEEYDEENVEE 1044

Query: 136  QKNKKKEEEEEEEKEKVEMY 155
             +   +E  EE  +E +E Y
Sbjct: 1045 IEENAEENVEENIEENIEEY 1064



 Score = 29.4 bits (65), Expect = 1.1
 Identities = 31/78 (39%), Positives = 42/78 (53%), Gaps = 1/78 (1%)

Query: 76   ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
            E +E N  E  E+  EE +EE  EE E++  EE   E  EE EE  ++  EE +EE  EE
Sbjct: 1005 ENVEENVEENIEENVEEYDEENVEEVEENV-EEYDEENVEEIEENAEENVEENIEENIEE 1063

Query: 136  QKNKKKEEEEEEEKEKVE 153
               +  EE EE  +E +E
Sbjct: 1064 YDEENVEEIEENIEENIE 1081



 Score = 29.0 bits (64), Expect = 1.5
 Identities = 30/82 (36%), Positives = 41/82 (50%), Gaps = 1/82 (1%)

Query: 84   EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
            EE  +E  EE  EE +E+  +E EE   E  EE  E+  ++  EE+EE  EE   +  EE
Sbjct: 1050 EENVEENIEENIEEYDEENVEEIEENIEENIEENVEENVEENVEEIEENVEENVEENAEE 1109

Query: 144  EEEEE-KEKVEMYRNPVPMYRN 164
              EE  +E  E Y +  P   N
Sbjct: 1110 NAEENAEENAEEYDDENPEEHN 1131



 Score = 28.6 bits (63), Expect = 1.8
 Identities = 30/81 (37%), Positives = 41/81 (50%), Gaps = 3/81 (3%)

Query: 76   ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKK---EEEVEEE 132
            E +E N  E  E+  EE  EE  EE  ++ DEE   E +E  EE  ++     EE  EE 
Sbjct: 993  ENVEENVEENIEENVEENVEENIEENVEEYDEENVEEVEENVEEYDEENVEEIEENAEEN 1052

Query: 133  EEEQKNKKKEEEEEEEKEKVE 153
             EE   +  EE +EE  E++E
Sbjct: 1053 VEENIEENIEEYDEENVEEIE 1073



 Score = 28.2 bits (62), Expect = 2.4
 Identities = 26/73 (35%), Positives = 38/73 (52%)

Query: 79   ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
            EN E + EE  +E  EE  EE  E++ +E  +   +E  EE  ++  EE +EE  EE   
Sbjct: 953  ENVEEDAEENVEENVEENVEENVEENVEENVEENVEENVEENVEENVEENIEENVEENVE 1012

Query: 139  KKKEEEEEEEKEK 151
            +  EE  EE  E+
Sbjct: 1013 ENIEENVEEYDEE 1025



 Score = 28.2 bits (62), Expect = 2.5
 Identities = 27/81 (33%), Positives = 39/81 (48%)

Query: 79   ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
            EN E   EE  +E  EE  EE  E++ +E  +   +E  EE  ++  EE +EE  EE   
Sbjct: 965  ENVEENVEENVEENVEENVEENVEENVEENVEENVEENIEENVEENVEENIEENVEEYDE 1024

Query: 139  KKKEEEEEEEKEKVEMYRNPV 159
            +  EE EE  +E  E     +
Sbjct: 1025 ENVEEVEENVEEYDEENVEEI 1045



 Score = 28.2 bits (62), Expect = 2.7
 Identities = 31/81 (38%), Positives = 41/81 (50%), Gaps = 2/81 (2%)

Query: 73   SASETLENNEWEEEEKEKEEEEEEEEEEKEKDED--EEKKREKKEEEEEKQKDKKEEEVE 130
            +  E  EN E   EE  +E  EE +EE  E+ E+  EE   E  EE  E+  ++ EE VE
Sbjct: 1041 NVEEIEENAEENVEENIEENIEEYDEENVEEIEENIEENIEENVEENVEENVEEIEENVE 1100

Query: 131  EEEEEQKNKKKEEEEEEEKEK 151
            E  EE   +  EE  EE  E+
Sbjct: 1101 ENVEENAEENAEENAEENAEE 1121



 Score = 28.2 bits (62), Expect = 2.8
 Identities = 28/75 (37%), Positives = 38/75 (50%), Gaps = 2/75 (2%)

Query: 79   ENNEWEEEEKEKEEEEEEEE--EEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQ 136
            EN E  +EE  +E EE  EE  EE  ++  EE   E +E  EE  ++  EE  EE  EE 
Sbjct: 1059 ENIEEYDEENVEEIEENIEENIEENVEENVEENVEEIEENVEENVEENAEENAEENAEEN 1118

Query: 137  KNKKKEEEEEEEKEK 151
              +  +E  EE  E+
Sbjct: 1119 AEEYDDENPEEHNEE 1133



 Score = 27.4 bits (60), Expect = 4.7
 Identities = 26/72 (36%), Positives = 38/72 (52%)

Query: 80   NNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
            N E + EE  +E+ EE  EE  E++ +E  +   +E  EE  ++  EE VEE  EE   +
Sbjct: 946  NIEEDAEENVEEDAEENVEENVEENVEENVEENVEENVEENVEENVEENVEENVEENIEE 1005

Query: 140  KKEEEEEEEKEK 151
              EE  EE  E+
Sbjct: 1006 NVEENVEENIEE 1017



 Score = 27.1 bits (59), Expect = 6.5
 Identities = 31/84 (36%), Positives = 42/84 (50%), Gaps = 9/84 (10%)

Query: 79   ENNEWEEEEKEKEEEEEEEEEEKEKDED---------EEKKREKKEEEEEKQKDKKEEEV 129
            EN E   EE ++E  EE EE  +E DE+         EE   E  EE  E+  ++  EE+
Sbjct: 1013 ENIEENVEEYDEENVEEVEENVEEYDEENVEEIEENAEENVEENIEENIEEYDEENVEEI 1072

Query: 130  EEEEEEQKNKKKEEEEEEEKEKVE 153
            EE  EE   +  EE  EE  E++E
Sbjct: 1073 EENIEENIEENVEENVEENVEEIE 1096



 Score = 26.3 bits (57), Expect = 9.7
 Identities = 21/60 (35%), Positives = 31/60 (51%)

Query: 76   ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
            E +E N  E  E+  EE  EE EE  E++ +E  +   +E  EE  ++  +E  EE  EE
Sbjct: 1074 ENIEENIEENVEENVEENVEEIEENVEENVEENAEENAEENAEENAEEYDDENPEEHNEE 1133


>gnl|CDD|240331 PTZ00254, PTZ00254, 40S ribosomal protein SA; Provisional.
          Length = 249

 Score = 31.2 bits (71), Expect = 0.18
 Identities = 13/44 (29%), Positives = 18/44 (40%), Gaps = 1/44 (2%)

Query: 83  WEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE 126
           W + E E EE+EE   E    ++      E +E EE       E
Sbjct: 204 WRDPE-EAEEKEEAAAETAGVEDAAAAAAEAEEGEEWVAAANNE 246



 Score = 30.0 bits (68), Expect = 0.42
 Identities = 11/42 (26%), Positives = 16/42 (38%)

Query: 101 KEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKE 142
           ++ +E EEK+    E    +       E EE EE       E
Sbjct: 205 RDPEEAEEKEEAAAETAGVEDAAAAAAEAEEGEEWVAAANNE 246


>gnl|CDD|233830 TIGR02350, prok_dnaK, chaperone protein DnaK.  Members of this
           family are the chaperone DnaK, of the DnaK-DnaJ-GrpE
           chaperone system. All members of the seed alignment were
           taken from completely sequenced bacterial or archaeal
           genomes and (except for Mycoplasma sequence) found
           clustered with other genes of this systems. This model
           excludes DnaK homologs that are not DnaK itself, such as
           the heat shock cognate protein HscA (TIGR01991).
           However, it is not designed to distinguish among DnaK
           paralogs in eukaryotes. Note that a number of dnaK genes
           have shadow ORFs in the same reverse (relative to dnaK)
           reading frame, a few of which have been assigned
           glutamate dehydrogenase activity. The significance of
           this observation is unclear; lengths of such shadow ORFs
           are highly variable as if the presumptive protein
           product is not conserved [Protein fate, Protein folding
           and stabilization].
          Length = 595

 Score = 31.5 bits (72), Expect = 0.19
 Identities = 26/90 (28%), Positives = 44/90 (48%), Gaps = 11/90 (12%)

Query: 72  ISASETLENNEWEEEEKEKEEEEEEEEEEKEKDE-----------DEEKKREKKEEEEEK 120
           I+AS  L   E E   KE E   EE+++ KE+ E            E+  +E  ++   +
Sbjct: 492 ITASSGLSEEEIERMVKEAEANAEEDKKRKEEIEARNNADSLAYQAEKTLKEAGDKLPAE 551

Query: 121 QKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
           +K+K E+ V E +E  K +  EE + + +E
Sbjct: 552 EKEKIEKAVAELKEALKGEDVEEIKAKTEE 581



 Score = 26.5 bits (59), Expect = 8.3
 Identities = 18/59 (30%), Positives = 29/59 (49%), Gaps = 10/59 (16%)

Query: 106 DEEKKREKKEEEEEKQKDKKEEEVEE----------EEEEQKNKKKEEEEEEEKEKVEM 154
           +EE +R  KE E   ++DKK +E  E          + E+   +  ++   EEKEK+E 
Sbjct: 500 EEEIERMVKEAEANAEEDKKRKEEIEARNNADSLAYQAEKTLKEAGDKLPAEEKEKIEK 558


>gnl|CDD|240227 PTZ00009, PTZ00009, heat shock 70 kDa protein; Provisional.
          Length = 653

 Score = 31.7 bits (72), Expect = 0.19
 Identities = 27/90 (30%), Positives = 41/90 (45%), Gaps = 13/90 (14%)

Query: 84  EEEEKEKEEEEEEEE--EEKEKDE----------DEEKKREKKEEEEEKQKDKKEEEVEE 131
            E EK K E+E   E  E K   E           +EK + K  + ++   +K  +E  E
Sbjct: 522 NEAEKYKAEDEANRERVEAKNGLENYCYSMKNTLQDEKVKGKLSDSDKATIEKAIDEALE 581

Query: 132 EEEEQKNKKKEEEEEEEKEKVEMYRNPVPM 161
             E+ +  +KEE E ++KE VE   NP+  
Sbjct: 582 WLEKNQLAEKEEFEHKQKE-VESVCNPIMT 610


>gnl|CDD|222613 pfam14235, DUF4337, Domain of unknown function (DUF4337).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 187 and 201 amino
           acids in length. There is a single completely conserved
           residue Q that may be functionally important.
          Length = 158

 Score = 30.6 bits (70), Expect = 0.20
 Identities = 12/62 (19%), Positives = 25/62 (40%)

Query: 70  QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
           QA S  + L     +  E E      E + +  + + E+ +   + +E E +  + E E 
Sbjct: 45  QAKSIKQHLAELAADLLELEAAAPRAELQAKIARYKKEKARYRSEAKELEAKAKEAEAES 104

Query: 130 EE 131
           + 
Sbjct: 105 DH 106



 Score = 28.3 bits (64), Expect = 1.5
 Identities = 14/67 (20%), Positives = 24/67 (35%), Gaps = 7/67 (10%)

Query: 89  EKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEE 148
           E   +  E E    + E + K           +K+K     E +E E K K+ E E +  
Sbjct: 55  ELAADLLELEAAAPRAELQAKI-------ARYKKEKARYRSEAKELEAKAKEAEAESDHA 107

Query: 149 KEKVEMY 155
             +   +
Sbjct: 108 LHQHHRF 114



 Score = 26.8 bits (60), Expect = 4.4
 Identities = 7/40 (17%), Positives = 18/40 (45%)

Query: 82  EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQ 121
           E + +    ++E+     E ++ E + K+ E + +    Q
Sbjct: 71  ELQAKIARYKKEKARYRSEAKELEAKAKEAEAESDHALHQ 110



 Score = 26.8 bits (60), Expect = 4.7
 Identities = 5/38 (13%), Positives = 18/38 (47%)

Query: 87  EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDK 124
           + +    ++E+   + + ++ E K ++ E E +    +
Sbjct: 73  QAKIARYKKEKARYRSEAKELEAKAKEAEAESDHALHQ 110


>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain.  Axonemal
           dynein light chain proteins play a dynamic role in
           flagellar and cilia motility. Eukaryotic cilia and
           flagella are complex organelles consisting of a core
           structure, the axoneme, which is composed of nine
           microtubule doublets forming a cylinder that surrounds a
           pair of central singlet microtubules. This
           ultra-structural arrangement seems to be one of the most
           stable micro-tubular assemblies known and is responsible
           for the flagellar and ciliary movement of a large number
           of organisms ranging from protozoan to mammals. This
           light chain interacts directly with the N-terminal half
           of the heavy chains.
          Length = 189

 Score = 30.6 bits (70), Expect = 0.21
 Identities = 15/61 (24%), Positives = 36/61 (59%), Gaps = 1/61 (1%)

Query: 98  EEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK-KKEEEEEEEKEKVEMYR 156
           + E+ K E E++ ++ +EE+EE +K   E E + E  E++ + +++ EE+   +++   +
Sbjct: 117 QAEQGKSELEQEIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHADEIAFLK 176

Query: 157 N 157
            
Sbjct: 177 K 177



 Score = 28.3 bits (64), Expect = 1.5
 Identities = 16/40 (40%), Positives = 26/40 (65%)

Query: 82  EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQ 121
           E ++ E+EKEE E+   E + K E  EK+ E++ + EEK+
Sbjct: 128 EIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKR 167



 Score = 27.6 bits (62), Expect = 2.5
 Identities = 21/61 (34%), Positives = 33/61 (54%), Gaps = 8/61 (13%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           E+ + E E+E ++ EEEKE+ E    + E K E  EK+        EEEE + + K+  +
Sbjct: 119 EQGKSELEQEIKKLEEEKEELEKRVAELEAKLEAIEKR--------EEEERQIEEKRHAD 170

Query: 144 E 144
           E
Sbjct: 171 E 171


>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
           subunit 1; Provisional.
          Length = 319

 Score = 31.2 bits (71), Expect = 0.21
 Identities = 14/43 (32%), Positives = 31/43 (72%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQ 121
           E+ E   E+ E+EEEE++  E +++DE++E + E+++++E  +
Sbjct: 277 EDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGDK 319



 Score = 30.8 bits (70), Expect = 0.29
 Identities = 15/46 (32%), Positives = 31/46 (67%)

Query: 105 EDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
            DEE   E  E+ EE++++    E E+E+EE +++++EE+++E  +
Sbjct: 274 GDEEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGDK 319



 Score = 29.6 bits (67), Expect = 0.65
 Identities = 14/34 (41%), Positives = 25/34 (73%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEK 109
           E  E  E E++  E E+E+EE+E+E+E+++D+E 
Sbjct: 284 EKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEG 317



 Score = 29.2 bits (66), Expect = 0.97
 Identities = 16/44 (36%), Positives = 30/44 (68%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK 122
           E +  E  EK +EEEEE++  E E +++E++  E++E+++E  K
Sbjct: 276 EEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGDK 319



 Score = 29.2 bits (66), Expect = 1.0
 Identities = 15/46 (32%), Positives = 31/46 (67%)

Query: 102 EKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
             +ED E+  EK EEEEE+    + E+ +EE+E+++ ++ ++E ++
Sbjct: 274 GDEEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGDK 319



 Score = 29.2 bits (66), Expect = 1.1
 Identities = 18/43 (41%), Positives = 31/43 (72%)

Query: 82  EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDK 124
           E  EE  EK EEEEEE++  E ++++E+  +++EEE++ + DK
Sbjct: 277 EDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGDK 319



 Score = 28.9 bits (65), Expect = 1.4
 Identities = 16/39 (41%), Positives = 26/39 (66%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKK 114
           E LE  E EEEE +  E E+E+EE+++++E+E+     K
Sbjct: 281 ELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGDK 319



 Score = 28.1 bits (63), Expect = 2.4
 Identities = 19/46 (41%), Positives = 32/46 (69%)

Query: 92  EEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
            +EE+ EE  EK E+EE++ +  E E+E ++D+ EEE E+++E  K
Sbjct: 274 GDEEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGDK 319



 Score = 28.1 bits (63), Expect = 2.5
 Identities = 15/51 (29%), Positives = 33/51 (64%), Gaps = 5/51 (9%)

Query: 86  EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQ 136
            ++E  EE  E+ EE+E+++D  +  ++ EE+E+     +EEE +++E ++
Sbjct: 274 GDEEDLEELLEKAEEEEEEDDYSESEDEDEEDED-----EEEEEDDDEGDK 319



 Score = 27.7 bits (62), Expect = 2.7
 Identities = 16/45 (35%), Positives = 31/45 (68%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
           +EE+ E+  E+ EEEEE++   + E + E+ E+EEE++ D + ++
Sbjct: 275 DEEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGDK 319


>gnl|CDD|236498 PRK09401, PRK09401, reverse gyrase; Reviewed.
          Length = 1176

 Score = 31.4 bits (72), Expect = 0.22
 Identities = 15/55 (27%), Positives = 26/55 (47%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
            +E K+ +      + E     + E +  +KEE E+   +  E E  EE+EE+ N
Sbjct: 826 YKEYKKSKGYVLVIKLENGGGLELEGEFSEKEEAEKFYNNLIEVEKVEEKEEELN 880



 Score = 29.1 bits (66), Expect = 1.3
 Identities = 18/71 (25%), Positives = 29/71 (40%), Gaps = 8/71 (11%)

Query: 93  EEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKV 152
           E  +E ++ +      K       E E +  +KEE  +      + +K EE+EEE     
Sbjct: 824 ERYKEYKKSKGYVLVIKLENGGGLELEGEFSEKEEAEKFYNNLIEVEKVEEKEEE----- 878

Query: 153 EMYRNPVPMYR 163
               NP+P Y 
Sbjct: 879 ---LNPLPPYT 886


>gnl|CDD|217834 pfam03998, Utp11, Utp11 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 239

 Score = 30.8 bits (70), Expect = 0.22
 Identities = 18/91 (19%), Positives = 46/91 (50%)

Query: 75  SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
           +  L +         + E+    +E+++K   ++K++  KE +E K+++KK ++VE+  E
Sbjct: 146 TPELLDRRENRPRISQLEKTSLVDEKQKKKSAKKKRKLYKELKERKEREKKLKKVEQRLE 205

Query: 135 EQKNKKKEEEEEEEKEKVEMYRNPVPMYRNP 165
            Q+   K+ + +++K   +     V  ++  
Sbjct: 206 LQRELMKKGKGKKKKIVKDKDGKVVYKWKKE 236



 Score = 26.2 bits (58), Expect = 8.5
 Identities = 12/67 (17%), Positives = 36/67 (53%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
           ++E+E++  +  E  +   +  + ++   +  + EK     E++ ++  ++++   KE +
Sbjct: 129 DDEEEQKSFDPAEYFDTTPELLDRRENRPRISQLEKTSLVDEKQKKKSAKKKRKLYKELK 188

Query: 145 EEEEKEK 151
           E +E+EK
Sbjct: 189 ERKEREK 195


>gnl|CDD|221952 pfam13166, AAA_13, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           conjugative transfer proteins. This family includes the
           PrrC protein that is thought to be the active component
           of the anticodon nuclease.
          Length = 713

 Score = 31.1 bits (71), Expect = 0.23
 Identities = 16/70 (22%), Positives = 36/70 (51%)

Query: 88  KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
           +E  E E + EE K++ +  E+K E+ E E EK++++ E+   +  ++   K  ++ +  
Sbjct: 88  EENIEIEAQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKYDSN 147

Query: 148 EKEKVEMYRN 157
             E ++    
Sbjct: 148 LSEALKGLNY 157



 Score = 28.1 bits (63), Expect = 3.0
 Identities = 14/48 (29%), Positives = 26/48 (54%)

Query: 95  EEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKE 142
            E +E+ +  + E+K  EK     EK+  + E E++  E+E K  +K+
Sbjct: 404 AELKEDIDAYQKEKKGLEKAINSLEKEIKQLEAEIKALEKEIKELEKQ 451



 Score = 27.3 bits (61), Expect = 5.2
 Identities = 19/56 (33%), Positives = 34/56 (60%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
           E E + EE ++E ++ +EK E  E + EKKEEE EK K+K  ++  ++  ++ +  
Sbjct: 92  EIEAQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKYDSN 147


>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family.  This family of proteins includes
           MND1 from S. cerevisiae. The mnd1 protein forms a
           complex with hop2 to promote homologous chromosome
           pairing and meiotic double-strand break repair.
          Length = 188

 Score = 30.7 bits (70), Expect = 0.23
 Identities = 21/59 (35%), Positives = 31/59 (52%)

Query: 96  EEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEM 154
           +   EK K E EE K+   E + + +K KK  E  EE  E   + K+ E+E +K K E+
Sbjct: 68  KTRLEKLKKELEELKQRIAELQAQIEKLKKGREETEERTELLEELKQLEKELKKLKAEL 126



 Score = 29.5 bits (67), Expect = 0.52
 Identities = 20/78 (25%), Positives = 41/78 (52%), Gaps = 3/78 (3%)

Query: 80  NNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
               E+ +KE EE ++   E + + E  +K RE+ EE  E  ++ K+    E+E ++   
Sbjct: 68  KTRLEKLKKELEELKQRIAELQAQIEKLKKGREETEERTELLEELKQ---LEKELKKLKA 124

Query: 140 KKEEEEEEEKEKVEMYRN 157
           + E+ E+ + E++E  + 
Sbjct: 125 ELEKYEKNDPERIEKLKE 142



 Score = 28.4 bits (64), Expect = 1.3
 Identities = 20/56 (35%), Positives = 31/56 (55%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
           E+ K+  EE EE  E  E+ +  EK+ +K + E EK +    E +E+ +EE K  K
Sbjct: 93  EKLKKGREETEERTELLEELKQLEKELKKLKAELEKYEKNDPERIEKLKEETKVAK 148



 Score = 28.0 bits (63), Expect = 2.1
 Identities = 18/68 (26%), Positives = 41/68 (60%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           EE ++   E + + E+ K+  E+ E++ E  EE ++ +K+ K+ + E E+ E+ + ++ E
Sbjct: 79  EELKQRIAELQAQIEKLKKGREETEERTELLEELKQLEKELKKLKAELEKYEKNDPERIE 138

Query: 144 EEEEEKEK 151
           + +EE + 
Sbjct: 139 KLKEETKV 146


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
           found at the N terminus of SMC proteins. The SMC
           (structural maintenance of chromosomes) superfamily
           proteins have ATP-binding domains at the N- and
           C-termini, and two extended coiled-coil domains
           separated by a hinge in the middle. The eukaryotic SMC
           proteins form two kind of heterodimers: the SMC1/SMC3
           and the SMC2/SMC4 types. These heterodimers constitute
           an essential part of higher order complexes, which are
           involved in chromatin and DNA dynamics. This family also
           includes the RecF and RecN proteins that are involved in
           DNA metabolism and recombination.
          Length = 1162

 Score = 31.5 bits (71), Expect = 0.23
 Identities = 26/86 (30%), Positives = 46/86 (53%)

Query: 68  LPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEE 127
             +           E +EEEKEK+ +EEE +   +++E+ + +  K E  +   ++K +E
Sbjct: 260 ELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKE 319

Query: 128 EVEEEEEEQKNKKKEEEEEEEKEKVE 153
             +E ++ +K  KKE+EE EE EK  
Sbjct: 320 SEKELKKLEKELKKEKEEIEELEKEL 345



 Score = 30.7 bits (69), Expect = 0.37
 Identities = 29/79 (36%), Positives = 49/79 (62%)

Query: 75  SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
            E  E  + E   KEKE  EEEE+ EK K E+E++++ K +EEE +  +++ +E  E  E
Sbjct: 760 KEEEEEEKSELSLKEKELAEEEEKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAELLE 819

Query: 135 EQKNKKKEEEEEEEKEKVE 153
           E++   ++EE+ +E+E  E
Sbjct: 820 EEQLLIEQEEKIKEEELEE 838



 Score = 30.3 bits (68), Expect = 0.58
 Identities = 22/71 (30%), Positives = 37/71 (52%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
                EE     +E   +E+EE E  + E +K E+   +  K+  ++E+E + +EEE K 
Sbjct: 232 YLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKL 291

Query: 139 KKKEEEEEEEK 149
             KEEEE + +
Sbjct: 292 LAKEEEELKSE 302



 Score = 29.9 bits (67), Expect = 0.65
 Identities = 31/71 (43%), Positives = 48/71 (67%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           E ++K  EE +  E++ KEK+E+EEK R KKEEEEE++ +   +E E  EEE+K +K + 
Sbjct: 730 EAQDKINEELKLLEQKIKEKEEEEEKSRLKKEEEEEEKSELSLKEKELAEEEEKTEKLKV 789

Query: 144 EEEEEKEKVEM 154
           EEE+E++    
Sbjct: 790 EEEKEEKLKAQ 800



 Score = 29.9 bits (67), Expect = 0.78
 Identities = 26/82 (31%), Positives = 46/82 (56%)

Query: 70  QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
                    E  + E  + + EE+ +E E+E +K E E KK +++ EE EK+  + E + 
Sbjct: 293 AKEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKELEIKR 352

Query: 130 EEEEEEQKNKKKEEEEEEEKEK 151
           E EEEE++  +K +E+ E+ E+
Sbjct: 353 EAEEEEEEQLEKLQEKLEQLEE 374



 Score = 29.6 bits (66), Expect = 0.98
 Identities = 29/81 (35%), Positives = 44/81 (54%)

Query: 70  QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
              +  E  E    EEE K   +EEEE + E  K E  +   E+K +E EK+  K E+E+
Sbjct: 272 LKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKEL 331

Query: 130 EEEEEEQKNKKKEEEEEEEKE 150
           ++E+EE +  +KE +E E K 
Sbjct: 332 KKEKEEIEELEKELKELEIKR 352



 Score = 29.2 bits (65), Expect = 1.1
 Identities = 29/88 (32%), Positives = 50/88 (56%)

Query: 70  QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
           +    +E  E  E  + E+EKEE+ + +EEE    E+E K+  +  EEE+   +++E+  
Sbjct: 773 KEKELAEEEEKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEKIK 832

Query: 130 EEEEEEQKNKKKEEEEEEEKEKVEMYRN 157
           EEE EE   + KEE++ E+  + E+ R 
Sbjct: 833 EEELEELALELKEEQKLEKLAEEELERL 860



 Score = 29.2 bits (65), Expect = 1.1
 Identities = 22/78 (28%), Positives = 43/78 (55%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
              E    E+EE E  ++E E+EEE      +E K E+KE++ ++++ K   + EEE + 
Sbjct: 242 LLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKS 301

Query: 136 QKNKKKEEEEEEEKEKVE 153
           +  K +  + ++E++  E
Sbjct: 302 ELLKLERRKVDDEEKLKE 319



 Score = 29.2 bits (65), Expect = 1.3
 Identities = 27/83 (32%), Positives = 45/83 (54%)

Query: 71   AISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVE 130
            A  A   L+     EE   +E +E+E+EE+ +++E+E  KR    +EE    +       
Sbjct: 925  AEEAIILLKYESEPEELLLEEADEKEKEEDNKEEEEERNKRLLLAKEELGNVNLMAIAEF 984

Query: 131  EEEEEQKNKKKEEEEEEEKEKVE 153
            EE+EE+ NK + ++E  E+EK E
Sbjct: 985  EEKEERYNKDELKKERLEEEKKE 1007



 Score = 28.8 bits (64), Expect = 1.4
 Identities = 27/81 (33%), Positives = 42/81 (51%)

Query: 78  LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
            E    + E++ K+E+EE EE EKE  E E K+  ++EEEE+ +K +++ E  EEE   K
Sbjct: 320 SEKELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAK 379

Query: 138 NKKKEEEEEEEKEKVEMYRNP 158
            K + E      +  E     
Sbjct: 380 KKLESERLSSAAKLKEEELEL 400



 Score = 28.8 bits (64), Expect = 1.4
 Identities = 29/82 (35%), Positives = 50/82 (60%)

Query: 72  ISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEE 131
            S     E    EEEEK ++ + EEE+EEK K ++EE +  ++E +EE +  ++E+ + E
Sbjct: 767 KSELSLKEKELAEEEEKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAELLEEEQLLIE 826

Query: 132 EEEEQKNKKKEEEEEEEKEKVE 153
           +EE+ K ++ EE   E KE+ +
Sbjct: 827 QEEKIKEEELEELALELKEEQK 848



 Score = 28.8 bits (64), Expect = 1.7
 Identities = 27/71 (38%), Positives = 47/71 (66%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
             E+K KE+EEEEE+   +K+E+EE+K E   +E+E  +++++ E  + EEE++ K K +
Sbjct: 741 LLEQKIKEKEEEEEKSRLKKEEEEEEKSELSLKEKELAEEEEKTEKLKVEEEKEEKLKAQ 800

Query: 144 EEEEEKEKVEM 154
           EEE    + E+
Sbjct: 801 EEELRALEEEL 811



 Score = 28.8 bits (64), Expect = 1.8
 Identities = 22/68 (32%), Positives = 41/68 (60%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           EEE +  EEE +EE E  E+++   ++ EK +EEE ++   + +E ++ E+  + + +  
Sbjct: 801 EEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEEELERL 860

Query: 144 EEEEEKEK 151
           EEE  KE+
Sbjct: 861 EEEITKEE 868



 Score = 28.4 bits (63), Expect = 1.9
 Identities = 22/96 (22%), Positives = 54/96 (56%)

Query: 58  IRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEE 117
           I     L     + I +S+     E E   +  +E +EEE+E+K ++E+ +   +++EE 
Sbjct: 240 IDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEEL 299

Query: 118 EEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
           + +    +  +V++EE+ ++++K+ ++ E+E +K +
Sbjct: 300 KSELLKLERRKVDDEEKLKESEKELKKLEKELKKEK 335



 Score = 28.4 bits (63), Expect = 2.5
 Identities = 26/73 (35%), Positives = 49/73 (67%)

Query: 81  NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
            E EE + E  + E  + +++EK ++ EK+ +K E+E +K+K++ EE  +E +E +  ++
Sbjct: 294 KEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKELEIKRE 353

Query: 141 KEEEEEEEKEKVE 153
            EEEEEE+ EK++
Sbjct: 354 AEEEEEEQLEKLQ 366



 Score = 28.0 bits (62), Expect = 2.6
 Identities = 31/124 (25%), Positives = 58/124 (46%), Gaps = 5/124 (4%)

Query: 39  LTDDAEFQCQVGPAQYHKPIRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEE 98
           L  D     +       + +R          Q +   E +     +E ++E++E++ +EE
Sbjct: 228 LYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEE 287

Query: 99  EEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK-----NKKKEEEEEEEKEKVE 153
           E K   ++EE+ + +  + E ++ D +E+  E E+E +K      K+KEE EE EKE  E
Sbjct: 288 ELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKE 347

Query: 154 MYRN 157
           +   
Sbjct: 348 LEIK 351



 Score = 28.0 bits (62), Expect = 2.6
 Identities = 27/79 (34%), Positives = 49/79 (62%)

Query: 75  SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
            E  E  E E+   E+EE+ +EEE E+   E +E+++ +K  EEE ++ ++E   EE  +
Sbjct: 812 KEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEEELERLEEEITKEELLQ 871

Query: 135 EQKNKKKEEEEEEEKEKVE 153
           E   K++E EE++ K+++E
Sbjct: 872 ELLLKEEELEEQKLKDELE 890



 Score = 28.0 bits (62), Expect = 2.8
 Identities = 29/82 (35%), Positives = 50/82 (60%)

Query: 70  QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
           +  S    LE  + ++EEK KE E+E ++ EKE  +++E+  E ++E +E +  ++ EE 
Sbjct: 298 ELKSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKELEIKREAEEE 357

Query: 130 EEEEEEQKNKKKEEEEEEEKEK 151
           EEE+ E+  +K E+ EEE   K
Sbjct: 358 EEEQLEKLQEKLEQLEEELLAK 379



 Score = 28.0 bits (62), Expect = 2.8
 Identities = 28/79 (35%), Positives = 44/79 (55%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
                 E E+EEK K +EEE    E+E  E+ E   E++   E+++K K+EE  E   E 
Sbjct: 784 TEKLKVEEEKEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALEL 843

Query: 136 QKNKKKEEEEEEEKEKVEM 154
           ++ +K E+  EEE E++E 
Sbjct: 844 KEEQKLEKLAEEELERLEE 862



 Score = 27.6 bits (61), Expect = 3.7
 Identities = 31/82 (37%), Positives = 46/82 (56%)

Query: 70  QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
           + +      E  E EE EKE +E E + E E+E++E  EK +EK E+ EE+   KK+ E 
Sbjct: 325 KKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLES 384

Query: 130 EEEEEEQKNKKKEEEEEEEKEK 151
           E      K K++E E + E+EK
Sbjct: 385 ERLSSAAKLKEEELELKNEEEK 406



 Score = 27.6 bits (61), Expect = 4.1
 Identities = 16/80 (20%), Positives = 35/80 (43%)

Query: 74  ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
           A+ + E  + +E  K+  EE E   E     E+ + +  K +E+ +K  +  + + + E 
Sbjct: 163 AAGSREKRKKKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLEL 222

Query: 134 EEQKNKKKEEEEEEEKEKVE 153
           EE+     +  +  E+    
Sbjct: 223 EEENLLYLDYLKLNEERIDL 242



 Score = 27.6 bits (61), Expect = 4.3
 Identities = 26/77 (33%), Positives = 46/77 (59%)

Query: 75  SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
            E  E  E    +KE+EEEE+ E   KEK+  EE+++ +K + EE++++K + + EE   
Sbjct: 747 KEKEEEEEKSRLKKEEEEEEKSELSLKEKELAEEEEKTEKLKVEEEKEEKLKAQEEELRA 806

Query: 135 EQKNKKKEEEEEEEKEK 151
            ++  K+E E  EE++ 
Sbjct: 807 LEEELKEEAELLEEEQL 823



 Score = 27.2 bits (60), Expect = 4.9
 Identities = 22/70 (31%), Positives = 40/70 (57%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           +EEE    EEE +EE E  ++E    ++E+K +EEE ++   E + E++ E+   ++ E 
Sbjct: 800 QEEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEEELER 859

Query: 144 EEEEEKEKVE 153
            EEE  ++  
Sbjct: 860 LEEEITKEEL 869



 Score = 27.2 bits (60), Expect = 5.0
 Identities = 19/73 (26%), Positives = 38/73 (52%)

Query: 79   ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
             + + +EE+ ++EEEE  +     K+E          E EEK++   ++E+++E  E++ 
Sbjct: 946  ADEKEKEEDNKEEEEERNKRLLLAKEELGNVNLMAIAEFEEKEERYNKDELKKERLEEEK 1005

Query: 139  KKKEEEEEEEKEK 151
            K+   E  EE  +
Sbjct: 1006 KELLREIIEETCQ 1018



 Score = 27.2 bits (60), Expect = 5.0
 Identities = 21/73 (28%), Positives = 31/73 (42%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
           E  +   EE E   E   + EE +  E + K++ KK  E  + K+K E E E        
Sbjct: 174 ERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEENLLYLDYL 233

Query: 139 KKKEEEEEEEKEK 151
           K  EE  +  +E 
Sbjct: 234 KLNEERIDLLQEL 246



 Score = 27.2 bits (60), Expect = 5.2
 Identities = 17/70 (24%), Positives = 33/70 (47%)

Query: 88  KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
           +EK +++E  ++  E+ E+  +     EE + ++   KE+  +  E  Q  +K E EEE 
Sbjct: 167 REKRKKKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEEN 226

Query: 148 EKEKVEMYRN 157
                 +  N
Sbjct: 227 LLYLDYLKLN 236



 Score = 27.2 bits (60), Expect = 5.2
 Identities = 21/79 (26%), Positives = 46/79 (58%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
           E L   ++ +  +E+ +  +E   +++++ +  K+  +KEEE   Q  K+ +E E+E++ 
Sbjct: 225 ENLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKL 284

Query: 136 QKNKKKEEEEEEEKEKVEM 154
           Q+ + K   +EEE+ K E+
Sbjct: 285 QEEELKLLAKEEEELKSEL 303



 Score = 27.2 bits (60), Expect = 5.4
 Identities = 28/96 (29%), Positives = 49/96 (51%)

Query: 58  IRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEE 117
           I           Q I         E EE   +K +E +++  E+ K  +++ K +++EEE
Sbjct: 695 ILRRQEEIKKKEQRIKEELKKLKLEKEELLADKVQEAQDKINEELKLLEQKIKEKEEEEE 754

Query: 118 EEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
           + + K ++EEE + E   ++ +  EEEE+ EK KVE
Sbjct: 755 KSRLKKEEEEEEKSELSLKEKELAEEEEKTEKLKVE 790



 Score = 26.5 bits (58), Expect = 8.9
 Identities = 20/71 (28%), Positives = 38/71 (53%)

Query: 81  NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
              E+ E E+E     +  +  ++  +  +   ++E+EE +  K+E E EEE   Q  K+
Sbjct: 215 QLKEKLELEEENLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKE 274

Query: 141 KEEEEEEEKEK 151
            +EEE+E+K +
Sbjct: 275 NKEEEKEKKLQ 285



 Score = 26.5 bits (58), Expect = 9.1
 Identities = 23/79 (29%), Positives = 40/79 (50%)

Query: 78  LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
           LE    ++   E++E E EE   +E     + + E +E   E+  +K++EE  +EEEE++
Sbjct: 903 LEEESQKDNLLEEKENEIEERIAEEAIILLKYESEPEELLLEEADEKEKEEDNKEEEEER 962

Query: 138 NKKKEEEEEEEKEKVEMYR 156
           NK+    +EE      M  
Sbjct: 963 NKRLLLAKEELGNVNLMAI 981


>gnl|CDD|234271 TIGR03591, polynuc_phos, polyribonucleotide nucleotidyltransferase.
            Members of this protein family are polyribonucleotide
           nucleotidyltransferase, also called polynucleotide
           phosphorylase. Some members have been shown also to have
           additional functions as guanosine pentaphosphate
           synthetase and as poly(A) polymerase (see model
           TIGR02696 for an exception clade, within this family)
           [Transcription, Degradation of RNA].
          Length = 684

 Score = 31.3 bits (72), Expect = 0.23
 Identities = 24/73 (32%), Positives = 36/73 (49%), Gaps = 3/73 (4%)

Query: 81  NEWEEE-EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE--EEVEEEEEEQK 137
            E  EE  KEK E    E +E+ K + +E   E   +E  +  +K+E    ++  +EE  
Sbjct: 211 EELAEEAGKEKWEFVAPEVDEELKAKVKELAYEAGLKEAYQITEKQERYAALDAIKEEVL 270

Query: 138 NKKKEEEEEEEKE 150
               EEEEEEE+ 
Sbjct: 271 EALAEEEEEEEEA 283



 Score = 28.2 bits (64), Expect = 2.5
 Identities = 22/78 (28%), Positives = 39/78 (50%), Gaps = 3/78 (3%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVE---EEEEE 135
           E  E+   E ++E + + +E   E    E  +  +K+E        KEE +E   EEEEE
Sbjct: 220 EKWEFVAPEVDEELKAKVKELAYEAGLKEAYQITEKQERYAALDAIKEEVLEALAEEEEE 279

Query: 136 QKNKKKEEEEEEEKEKVE 153
           ++   +E+E +E  + +E
Sbjct: 280 EEEADREKEIKEAFKDLE 297


>gnl|CDD|235551 PRK05667, dnaG, DNA primase; Validated.
          Length = 580

 Score = 31.4 bits (72), Expect = 0.23
 Identities = 17/87 (19%), Positives = 33/87 (37%), Gaps = 3/87 (3%)

Query: 58  IRANARLTVILPQAISASETLENNE-WEEEEKEKEEEEEEEEEE--KEKDEDEEKKREKK 114
           +  + R   +   A          E  EE+    E+E ++  E+   +  E+  ++   K
Sbjct: 494 LLEHLRDAGLEELAALLESLAVWEEISEEDIAALEKELKDALEKLRDQLLEERLEELIAK 553

Query: 115 EEEEEKQKDKKEEEVEEEEEEQKNKKK 141
           E   E      EE +E  +   + K+K
Sbjct: 554 ERLLEGHGLSSEERLELLQLLIELKRK 580



 Score = 29.4 bits (67), Expect = 0.87
 Identities = 19/95 (20%), Positives = 36/95 (37%), Gaps = 2/95 (2%)

Query: 59  RANARLTVILPQAISASETLENNEWEEEEKEKEE--EEEEEEEEKEKDEDEEKKREKKEE 116
           +        L + +  +   E     E     EE  EE+    EKE  +  EK R++  E
Sbjct: 485 QPGLTTGSQLLEHLRDAGLEELAALLESLAVWEEISEEDIAALEKELKDALEKLRDQLLE 544

Query: 117 EEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
           E  ++   KE  +E      + + +  +   E ++
Sbjct: 545 ERLEELIAKERLLEGHGLSSEERLELLQLLIELKR 579



 Score = 27.5 bits (62), Expect = 3.8
 Identities = 17/85 (20%), Positives = 37/85 (43%), Gaps = 1/85 (1%)

Query: 69  PQAISASETLENNEWEEEEKEKEE-EEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEE 127
           P   + S+ LE+      E+     E     EE  +++    ++E K+  E+ +    EE
Sbjct: 486 PGLTTGSQLLEHLRDAGLEELAALLESLAVWEEISEEDIAALEKELKDALEKLRDQLLEE 545

Query: 128 EVEEEEEEQKNKKKEEEEEEEKEKV 152
            +EE   +++  +      EE+ ++
Sbjct: 546 RLEELIAKERLLEGHGLSSEERLEL 570


>gnl|CDD|240285 PTZ00135, PTZ00135, 60S acidic ribosomal protein P0; Provisional.
          Length = 310

 Score = 31.1 bits (71), Expect = 0.23
 Identities = 9/40 (22%), Positives = 15/40 (37%)

Query: 59  RANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEE 98
           +    L      A +A+              +EEEEEE++
Sbjct: 264 KIKEALANPSAAAAAAAAAAAAAAAAAAAPAEEEEEEEDD 303



 Score = 28.4 bits (64), Expect = 1.7
 Identities = 11/48 (22%), Positives = 14/48 (29%), Gaps = 7/48 (14%)

Query: 66  VILPQAISASETLENNEWE-------EEEKEKEEEEEEEEEEKEKDED 106
              P A    E L N                       EEEE+E+D+ 
Sbjct: 257 FTFPLAEKIKEALANPSAAAAAAAAAAAAAAAAAAAPAEEEEEEEDDM 304



 Score = 26.5 bits (59), Expect = 7.8
 Identities = 6/20 (30%), Positives = 8/20 (40%)

Query: 129 VEEEEEEQKNKKKEEEEEEE 148
                       +EEEEEE+
Sbjct: 283 AAAAAAAAAAPAEEEEEEED 302


>gnl|CDD|237549 PRK13897, PRK13897, type IV secretion system component VirD4;
           Provisional.
          Length = 606

 Score = 31.3 bits (71), Expect = 0.24
 Identities = 12/29 (41%), Positives = 18/29 (62%)

Query: 80  NNEWEEEEKEKEEEEEEEEEEKEKDEDEE 108
           NNE  EE +     E  EE+E++ DED++
Sbjct: 578 NNEASEETETITVPENNEEDEEDDDEDDD 606


>gnl|CDD|237744 PRK14521, rpsP, 30S ribosomal protein S16; Provisional.
          Length = 186

 Score = 30.5 bits (69), Expect = 0.25
 Identities = 14/61 (22%), Positives = 27/61 (44%)

Query: 90  KEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEK 149
           K++  + ++  K+   + EKK  +   E   +K   E      EE    +++E EE   +
Sbjct: 118 KDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAAAVAAEEAAAAEEEEAEEAPAE 177

Query: 150 E 150
           E
Sbjct: 178 E 178



 Score = 28.6 bits (64), Expect = 1.3
 Identities = 16/69 (23%), Positives = 31/69 (44%), Gaps = 4/69 (5%)

Query: 86  EEKEKEEEEEEE----EEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
           ++K  + ++  +    E EK+ +E   +   +K+  E      +E    EEEE ++   +
Sbjct: 118 KDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAAAVAAEEAAAAEEEEAEEAPAE 177

Query: 142 EEEEEEEKE 150
           E   EE  E
Sbjct: 178 EAPAEESAE 186



 Score = 27.4 bits (61), Expect = 2.7
 Identities = 15/61 (24%), Positives = 27/61 (44%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           +  +K   E E++  E + +   E+K  E      E+    +EEE EE   E+   ++  
Sbjct: 126 KAAKKAALEAEKKVNEARAEAVAEKKAAEAAAVAAEEAAAAEEEEAEEAPAEEAPAEESA 185

Query: 144 E 144
           E
Sbjct: 186 E 186



 Score = 26.7 bits (59), Expect = 5.4
 Identities = 18/66 (27%), Positives = 29/66 (43%), Gaps = 6/66 (9%)

Query: 92  EEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKK------EEEVEEEEEEQKNKKKEEEE 145
           E + E + E  K+E E K   KK++  + +K  K      E++V E   E   +KK  E 
Sbjct: 96  EAQAEAKFEAWKEEKEGKVNAKKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEA 155

Query: 146 EEEKEK 151
                +
Sbjct: 156 AAVAAE 161



 Score = 26.7 bits (59), Expect = 6.0
 Identities = 13/53 (24%), Positives = 21/53 (39%)

Query: 74  ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE 126
            +E   N    E   EK+  E      +E    EE++ E+   EE   ++  E
Sbjct: 134 EAEKKVNEARAEAVAEKKAAEAAAVAAEEAAAAEEEEAEEAPAEEAPAEESAE 186


>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
           Provisional.
          Length = 695

 Score = 31.1 bits (71), Expect = 0.25
 Identities = 14/73 (19%), Positives = 27/73 (36%), Gaps = 6/73 (8%)

Query: 84  EEEEKEKEE------EEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
            E+EK+K E      E  +   E+EK   E + ++  E    K KD     +   + ++ 
Sbjct: 441 IEQEKKKAEEAKARFEARQARLEREKAAREARHKKAAEARAAKDKDAVAAALARVKAKKA 500

Query: 138 NKKKEEEEEEEKE 150
              +    +    
Sbjct: 501 AATQPIVIKAGAR 513



 Score = 28.4 bits (64), Expect = 2.4
 Identities = 12/51 (23%), Positives = 26/51 (50%), Gaps = 1/51 (1%)

Query: 102 EKDEDEEKKREKKEEEEEKQK-DKKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
            K E    ++EKK+ EE K + + ++  +E E+  ++ + K+  E    + 
Sbjct: 434 AKAEIRAIEQEKKKAEEAKARFEARQARLEREKAAREARHKKAAEARAAKD 484



 Score = 27.6 bits (62), Expect = 4.0
 Identities = 11/67 (16%), Positives = 21/67 (31%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
           E EK   E   ++  E    +D++         + K+    +  V +      N      
Sbjct: 463 EREKAAREARHKKAAEARAAKDKDAVAAALARVKAKKAAATQPIVIKAGARPDNSAVIAA 522

Query: 145 EEEEKEK 151
            E  K +
Sbjct: 523 REARKAQ 529



 Score = 26.8 bits (60), Expect = 5.8
 Identities = 14/48 (29%), Positives = 26/48 (54%), Gaps = 2/48 (4%)

Query: 96  EEEEEKEKDEDEEKKR-EKKEEEEEKQKDKKEEEVEEEEEEQKNKKKE 142
             E+EK+K  +E K R E ++   E++K  +E   ++  E +  K K+
Sbjct: 440 AIEQEKKK-AEEAKARFEARQARLEREKAAREARHKKAAEARAAKDKD 486


>gnl|CDD|221179 pfam11711, Tim54, Inner membrane protein import complex subunit
           Tim54.  Mitochondrial function depends on the import of
           hundreds of different proteins synthesised in the
           cytosol. Protein import is a multi-step pathway which
           includes the binding of precursor proteins to surface
           receptors, translocation of the precursor across one or
           both mitochondrial membranes, and folding and assembly
           of the imported protein inside the mitochondrion. Most
           precursor proteins carry amino-terminal targeting
           signals, called pre-sequences, and are imported into
           mitochondria via import complexes located in both the
           outer and the inner membrane (IM). The IM complex, TIM,
           is made up of at least two proteins which mediate
           translocation of proteins into the matrix by removing
           their signal peptide and another pair of proteins, Tim54
           and Tim22, that insert the polytopic proteins, that
           carry internal targetting information, into the inner
           membrane.
          Length = 377

 Score = 30.9 bits (70), Expect = 0.25
 Identities = 13/57 (22%), Positives = 21/57 (36%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
           +  +  E   +E   E E +     +   +  EE  +   +E E   EEE  K  K 
Sbjct: 196 DPPEPPEPTVDEAAPETEVEATPAAESPAEPAEETAETTPEETEDAPEEENNKPVKP 252



 Score = 30.5 bits (69), Expect = 0.43
 Identities = 12/50 (24%), Positives = 19/50 (38%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKK 125
            T++    E E +     E   E  +E  E   ++ E   EEE  +  K 
Sbjct: 203 PTVDEAAPETEVEATPAAESPAEPAEETAETTPEETEDAPEEENNKPVKP 252



 Score = 29.3 bits (66), Expect = 1.00
 Identities = 15/55 (27%), Positives = 20/55 (36%)

Query: 86  EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
           E  E   +E   E E E     E   E  EE  E   ++ E+  EEE  +     
Sbjct: 199 EPPEPTVDEAAPETEVEATPAAESPAEPAEETAETTPEETEDAPEEENNKPVKPP 253



 Score = 28.9 bits (65), Expect = 1.3
 Identities = 14/55 (25%), Positives = 21/55 (38%)

Query: 78  LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
           L+  E  E   ++   E E E     +   E   E  E   E+ +D  EEE  + 
Sbjct: 195 LDPPEPPEPTVDEAAPETEVEATPAAESPAEPAEETAETTPEETEDAPEEENNKP 249



 Score = 28.9 bits (65), Expect = 1.4
 Identities = 10/58 (17%), Positives = 22/58 (37%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
           +  E  +   +E   E + +     +   +  EE  +   ++ E+  EEE  +  K  
Sbjct: 196 DPPEPPEPTVDEAAPETEVEATPAAESPAEPAEETAETTPEETEDAPEEENNKPVKPP 253



 Score = 28.2 bits (63), Expect = 2.0
 Identities = 15/62 (24%), Positives = 22/62 (35%)

Query: 99  EEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYRNP 158
           +  E  E    +   + E E     +   E  EE  E   ++ E+  EEE  K      P
Sbjct: 196 DPPEPPEPTVDEAAPETEVEATPAAESPAEPAEETAETTPEETEDAPEEENNKPVKPPVP 255

Query: 159 VP 160
            P
Sbjct: 256 KP 257



 Score = 26.2 bits (58), Expect = 9.7
 Identities = 14/59 (23%), Positives = 22/59 (37%)

Query: 94  EEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKV 152
           +  E  E   DE   +   +     E   +  EE  E   EE ++  +EE  +  K  V
Sbjct: 196 DPPEPPEPTVDEAAPETEVEATPAAESPAEPAEETAETTPEETEDAPEEENNKPVKPPV 254


>gnl|CDD|236267 PRK08451, PRK08451, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 535

 Score = 31.2 bits (71), Expect = 0.26
 Identities = 17/85 (20%), Positives = 39/85 (45%), Gaps = 5/85 (5%)

Query: 61  NARLTVILPQAISASETLENNEWEEEEKEKEEE-----EEEEEEEKEKDEDEEKKREKKE 115
            A++ +      + +      E++   ++K+ +     E  EEE K+ DE+ ++ + K+ 
Sbjct: 447 GAKIKIQKALKSAENPLQSLKEFKPSNEKKKIDTESTAEMLEEEAKKDDEEVQETQLKEA 506

Query: 116 EEEEKQKDKKEEEVEEEEEEQKNKK 140
            E ++     E+ +EE +E    K 
Sbjct: 507 TELQEFMINNEDLIEEAKELFGIKS 531


>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6.  RPL6
           contains KOW motif that has an extra ribosomal role as
           an oncogenic. KOW domain is known as an RNA-binding
           motif that is shared so far among some families of
           ribosomal proteins, the essential bacterial
           transcriptional elongation factor NusG, the eukaryotic
           chromatin elongation factor Spt5, the higher eukaryotic
           KIN17 proteins and Mtr4. .
          Length = 152

 Score = 30.2 bits (69), Expect = 0.26
 Identities = 10/33 (30%), Positives = 24/33 (72%)

Query: 90  KEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK 122
           ++++++++++E E  E+++KK    EE +E QK
Sbjct: 83  RKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQK 115



 Score = 28.7 bits (65), Expect = 1.1
 Identities = 14/47 (29%), Positives = 28/47 (59%)

Query: 108 EKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEM 154
           +KK++KK++E E  ++KK++ V  EE ++  K  +       +KV +
Sbjct: 84  KKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKAVDAALLAAIKKVPL 130



 Score = 27.5 bits (62), Expect = 2.2
 Identities = 17/49 (34%), Positives = 26/49 (53%), Gaps = 6/49 (12%)

Query: 102 EKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
            K  D   KR+KK+      K KK+E    EE+++K    EE +E++K 
Sbjct: 74  PKLNDAYFKRKKKK------KKKKKEGEFFEEKKKKYVVSEERKEDQKA 116


>gnl|CDD|115196 pfam06524, NOA36, NOA36 protein.  This family consists of several
           NOA36 proteins which contain 29 highly conserved
           cysteine residues. The function of this protein is
           unknown.
          Length = 314

 Score = 30.7 bits (69), Expect = 0.27
 Identities = 10/46 (21%), Positives = 19/46 (41%)

Query: 80  NNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKK 125
           N  +E ++ E     + +EEE   D+D E   +      ++  D  
Sbjct: 268 NYGYESDDDEGSSSNDYDEEEDGDDDDNEDNDDTNTNHGDEDCDNG 313



 Score = 26.1 bits (57), Expect = 9.5
 Identities = 7/40 (17%), Positives = 18/40 (45%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKD 123
           ++E     + +EEE+ + + +ED +       +E+     
Sbjct: 275 DDEGSSSNDYDEEEDGDDDDNEDNDDTNTNHGDEDCDNGA 314


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 31.3 bits (71), Expect = 0.27
 Identities = 24/71 (33%), Positives = 38/71 (53%)

Query: 80  NNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
             E  E E+E  E  E EE  KE+ E++ +K E   EE E+ K+K + +  +EE  Q   
Sbjct: 507 EKELRELEEELIELLELEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLED 566

Query: 140 KKEEEEEEEKE 150
           + +E +E  +E
Sbjct: 567 RLQELKELLEE 577



 Score = 29.3 bits (66), Expect = 0.98
 Identities = 23/79 (29%), Positives = 47/79 (59%)

Query: 75  SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
            E LE  E  +EE E++ E+ E   E+ ++  E+ + ++ +EE  + +D+ +E  E  EE
Sbjct: 518 IELLELEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEE 577

Query: 135 EQKNKKKEEEEEEEKEKVE 153
            +  + ++EE EE +E+++
Sbjct: 578 LRLLRTRKEELEELRERLK 596



 Score = 29.0 bits (65), Expect = 1.5
 Identities = 26/88 (29%), Positives = 40/88 (45%), Gaps = 1/88 (1%)

Query: 67  ILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEK-QKDKK 125
            L + I   E       EE  +  E EE  +EE +EK E  E   E+ EE +EK Q  + 
Sbjct: 498 ELREEIEELEKELRELEEELIELLELEEALKEELEEKLEKLENLLEELEELKEKLQLQQL 557

Query: 126 EEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
           +EE+ + E+  +  K+  EE       +
Sbjct: 558 KEELRQLEDRLQELKELLEELRLLRTRK 585



 Score = 28.6 bits (64), Expect = 2.0
 Identities = 29/92 (31%), Positives = 50/92 (54%)

Query: 62  ARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQ 121
             L   L Q     ++LE +E E E +E EEE E E E+     + E+  +   EE E++
Sbjct: 603 KELEERLSQLEELLQSLELSEAENELEEAEEELESELEKLNLQAELEELLQAALEELEEK 662

Query: 122 KDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
            ++ E E+  E +  +N+++ EE+ EE E++E
Sbjct: 663 VEELEAEIRRELQRIENEEQLEEKLEELEQLE 694



 Score = 28.2 bits (63), Expect = 2.7
 Identities = 26/69 (37%), Positives = 38/69 (55%)

Query: 82  EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
           E EEE +      EE EE  EK +  E++ EK EE+ EK + + EE  EE+ E  K  ++
Sbjct: 302 ELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLEE 361

Query: 142 EEEEEEEKE 150
             +E EE+ 
Sbjct: 362 RLKELEERL 370



 Score = 27.8 bits (62), Expect = 3.3
 Identities = 26/75 (34%), Positives = 43/75 (57%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
            EE EKE  E  E E +E +E+  +++E+ E  EE ++ +KE    EEE  +  + +E  
Sbjct: 468 PEEHEKELLELYELELEELEEELSREKEEAELREEIEELEKELRELEEELIELLELEEAL 527

Query: 145 EEEEKEKVEMYRNPV 159
           +EE +EK+E   N +
Sbjct: 528 KEELEEKLEKLENLL 542



 Score = 27.4 bits (61), Expect = 4.8
 Identities = 26/84 (30%), Positives = 40/84 (47%), Gaps = 1/84 (1%)

Query: 70  QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
                    E  E  EE +     +EE EE +E+   E KK+ K+ EE   Q ++  + +
Sbjct: 561 LRQLEDRLQELKELLEELRLLRTRKEELEELRER-LKELKKKLKELEERLSQLEELLQSL 619

Query: 130 EEEEEEQKNKKKEEEEEEEKEKVE 153
           E  E E + ++ EEE E E EK+ 
Sbjct: 620 ELSEAENELEEAEEELESELEKLN 643



 Score = 27.0 bits (60), Expect = 5.1
 Identities = 30/112 (26%), Positives = 46/112 (41%)

Query: 43  AEFQCQVGPAQYHKPIRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKE 102
            E Q +    +  + I A       L +     E L+    E E  E E  +  EEE +E
Sbjct: 219 EEIQEEQEEEELEQEIEALEERLAELEEEKERLEELKARLLEIESLELEALKIREEELRE 278

Query: 103 KDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEM 154
            +   E+  EK E  EE +++ +E E E E      ++ EE  E+ K   E 
Sbjct: 279 LERLLEELEEKIERLEELEREIEELEEELEGLRALLEELEELLEKLKSLEER 330



 Score = 27.0 bits (60), Expect = 6.4
 Identities = 26/70 (37%), Positives = 42/70 (60%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           E E++  E  E E EE +E+   E+++ E +EE EE +K+ +E E E  E  +  +  +E
Sbjct: 470 EHEKELLELYELELEELEEELSREKEEAELREEIEELEKELRELEEELIELLELEEALKE 529

Query: 144 EEEEEKEKVE 153
           E EE+ EK+E
Sbjct: 530 ELEEKLEKLE 539



 Score = 27.0 bits (60), Expect = 6.8
 Identities = 23/84 (27%), Positives = 45/84 (53%)

Query: 70  QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
           +     E L++ E   E+ E++ E+ E E E+  +E  E  +  +E  +E ++  +E E 
Sbjct: 316 ELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLEERLKELEERLEELEK 375

Query: 130 EEEEEEQKNKKKEEEEEEEKEKVE 153
           E E+  ++ K+ EE  +E KE++ 
Sbjct: 376 ELEKALERLKQLEEAIQELKEELA 399



 Score = 26.6 bits (59), Expect = 7.0
 Identities = 22/75 (29%), Positives = 41/75 (54%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
           E  E  E+  E  EEE +E ++ E+ ++E+++ E ++E E  ++   E E E+E  E+  
Sbjct: 196 ELLEDIEDLLEALEEELKELKKLEEIQEEQEEEELEQEIEALEERLAELEEEKERLEELK 255

Query: 139 KKKEEEEEEEKEKVE 153
            +  E E  E E ++
Sbjct: 256 ARLLEIESLELEALK 270



 Score = 26.6 bits (59), Expect = 7.2
 Identities = 18/78 (23%), Positives = 40/78 (51%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
           E  E + E+ E   EE EE +EK + +  +++  + E+  ++ K+  EE       +++ 
Sbjct: 529 EELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRLLRTRKEEL 588

Query: 139 KKKEEEEEEEKEKVEMYR 156
           ++  E  +E K+K++   
Sbjct: 589 EELRERLKELKKKLKELE 606



 Score = 26.6 bits (59), Expect = 8.8
 Identities = 24/68 (35%), Positives = 42/68 (61%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           EE E +  E  E+ E+  E  E+E K+ +K EE +E+Q++++ E+  E  EE+  + +EE
Sbjct: 188 EELEGQLSELLEDIEDLLEALEEELKELKKLEEIQEEQEEEELEQEIEALEERLAELEEE 247

Query: 144 EEEEEKEK 151
           +E  E+ K
Sbjct: 248 KERLEELK 255



 Score = 26.3 bits (58), Expect = 9.7
 Identities = 26/95 (27%), Positives = 51/95 (53%), Gaps = 1/95 (1%)

Query: 62  ARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQ 121
            +L+ +L       E LE  E +E +K +E +EE+EEEE E++ +  ++R  + EEE+++
Sbjct: 192 GQLSELLEDIEDLLEALEE-ELKELKKLEEIQEEQEEEELEQEIEALEERLAELEEEKER 250

Query: 122 KDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYR 156
            ++ +  + E E  +    K  EEE  + +  +  
Sbjct: 251 LEELKARLLEIESLELEALKIREEELRELERLLEE 285


>gnl|CDD|185618 PTZ00438, PTZ00438, gamete antigen 27/25-like protein; Provisional.
          Length = 374

 Score = 30.8 bits (69), Expect = 0.28
 Identities = 22/68 (32%), Positives = 42/68 (61%)

Query: 81  NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
           NE E   +++EEE+E+ EE +E +E E  + E  ++E+ ++ D+KE + E +E E   + 
Sbjct: 99  NEEERGTQKEEEEDEDVEEIEEVEEVEVVEEEYDDDEDSEKDDEKESDAEGDENELAGEY 158

Query: 141 KEEEEEEE 148
             EE +++
Sbjct: 159 IIEEVDDD 166



 Score = 28.9 bits (64), Expect = 1.2
 Identities = 19/65 (29%), Positives = 38/65 (58%)

Query: 86  EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
           ++ + E + E E        +EE+  +K+EEE+E  ++ +E E  E  EE+ +  ++ E+
Sbjct: 80  DKSDNENDVELEGLNIIVKNEEERGTQKEEEEDEDVEEIEEVEEVEVVEEEYDDDEDSEK 139

Query: 146 EEEKE 150
           ++EKE
Sbjct: 140 DDEKE 144



 Score = 28.1 bits (62), Expect = 2.6
 Identities = 21/61 (34%), Positives = 34/61 (55%)

Query: 75  SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
           ++  E  + + EE E+ EE E  EEE + DED EK  EK+ + E  + +   E + EE +
Sbjct: 105 TQKEEEEDEDVEEIEEVEEVEVVEEEYDDDEDSEKDDEKESDAEGDENELAGEYIIEEVD 164

Query: 135 E 135
           +
Sbjct: 165 D 165



 Score = 27.7 bits (61), Expect = 3.0
 Identities = 24/83 (28%), Positives = 45/83 (54%), Gaps = 1/83 (1%)

Query: 74  ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
           A++  +N    E E      + EEE   +K+E+E++  E+ EE EE +   +EE  ++E+
Sbjct: 78  ANDKSDNENDVELEGLNIIVKNEEERGTQKEEEEDEDVEEIEEVEEVEV-VEEEYDDDED 136

Query: 134 EEQKNKKKEEEEEEEKEKVEMYR 156
            E+ ++K+ + E +E E    Y 
Sbjct: 137 SEKDDEKESDAEGDENELAGEYI 159



 Score = 27.3 bits (60), Expect = 3.7
 Identities = 21/59 (35%), Positives = 33/59 (55%)

Query: 72  ISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVE 130
           I      E    +EEE++++ EE EE EE E  E+E    E  E+++EK+ D + +E E
Sbjct: 95  IIVKNEEERGTQKEEEEDEDVEEIEEVEEVEVVEEEYDDDEDSEKDDEKESDAEGDENE 153



 Score = 26.6 bits (58), Expect = 7.5
 Identities = 21/65 (32%), Positives = 36/65 (55%)

Query: 89  EKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEE 148
           E+E   ++EEEE E  E+ E+  E +  EEE   D+  E+ +E+E + +  + E   E  
Sbjct: 100 EEERGTQKEEEEDEDVEEIEEVEEVEVVEEEYDDDEDSEKDDEKESDAEGDENELAGEYI 159

Query: 149 KEKVE 153
            E+V+
Sbjct: 160 IEEVD 164


>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
           Transcription of the anti-viral guanylate-binding
           protein (GBP) is induced by interferon-gamma during
           macrophage induction. This family contains GBP1 and
           GPB2, both GTPases capable of binding GTP, GDP and GMP.
          Length = 297

 Score = 30.7 bits (70), Expect = 0.28
 Identities = 24/76 (31%), Positives = 39/76 (51%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
           E  E  +    EK+KEEE+  E +E+   E  ++  EK E E EK   ++E  +E + +E
Sbjct: 214 EAAEAEQELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEAEREKLLAEQERMLEHKLQE 273

Query: 136 QKNKKKEEEEEEEKEK 151
           Q+   KE  + E +  
Sbjct: 274 QEELLKEGFKTEAESL 289



 Score = 28.0 bits (63), Expect = 2.3
 Identities = 14/64 (21%), Positives = 35/64 (54%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
            +E+  + E  + E  E E++   EK++E+++  E +++  +E   +  E+ +  ++K  
Sbjct: 201 AKEKAIEAERAKAEAAEAEQELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEAEREKLL 260

Query: 144 EEEE 147
            E+E
Sbjct: 261 AEQE 264



 Score = 27.3 bits (61), Expect = 3.7
 Identities = 22/95 (23%), Positives = 47/95 (49%)

Query: 70  QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
           QA++A E     E  + E  + E+E   E++KE+++  E +    +E  ++  +K E E 
Sbjct: 197 QALTAKEKAIEAERAKAEAAEAEQELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEAER 256

Query: 130 EEEEEEQKNKKKEEEEEEEKEKVEMYRNPVPMYRN 164
           E+   EQ+   + + +E+E+   E ++      + 
Sbjct: 257 EKLLAEQERMLEHKLQEQEELLKEGFKTEAESLQK 291


>gnl|CDD|224114 COG1193, COG1193, Mismatch repair ATPase (MutS family) [DNA
           replication, recombination, and repair].
          Length = 753

 Score = 31.2 bits (71), Expect = 0.28
 Identities = 23/73 (31%), Positives = 33/73 (45%), Gaps = 4/73 (5%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           EE + E  EE+E  EE  EK E+  K+ E++ EE EK      +EVE         K   
Sbjct: 493 EEAKTEFGEEKELLEELIEKLEEVRKELEEELEEVEK----LLDEVELLTGANSGGKTSL 548

Query: 144 EEEEEKEKVEMYR 156
            E + +  V  + 
Sbjct: 549 LELKAQIVVLAHM 561


>gnl|CDD|236335 PRK08724, fliD, flagellar capping protein; Validated.
          Length = 673

 Score = 31.0 bits (70), Expect = 0.28
 Identities = 15/68 (22%), Positives = 34/68 (50%)

Query: 88  KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
           KE + E E++  +++   D   ++ +   E+ KQ  + + E EE E  +K  K +   ++
Sbjct: 362 KEAQAEIEQKLAQEKAQLDAAVEKGELTPEQAKQIARAKLEPEERERLEKIDKAQAALKQ 421

Query: 148 EKEKVEMY 155
            +   ++Y
Sbjct: 422 AQSAFDLY 429


>gnl|CDD|221733 pfam12720, DUF3807, Protein of unknown function (DUF3807).  This is
           a family of conserved fungal proteins of unknown
           function.
          Length = 169

 Score = 30.5 bits (69), Expect = 0.29
 Identities = 15/70 (21%), Positives = 29/70 (41%)

Query: 83  WEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKE 142
            E E KE+ E EEE E +   D          + +E +Q+   ++     ++E+K+ K  
Sbjct: 67  RERELKEEAEAEEEGEVDASPDAGAVAGESSADRKEAEQQGAAQKRKSCRDKERKSAKDP 126

Query: 143 EEEEEEKEKV 152
               ++    
Sbjct: 127 RGGTQDVVDK 136


>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
           Genome duplication is precisely regulated by
           cyclin-dependent kinases CDKs, which bring about the
           onset of S phase by activating replication origins and
           then prevent relicensing of origins until mitosis is
           completed. The optimum sequence motif for CDK
           phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
           to have at least 11 potential phosphorylation sites.
           Drc1 is required for DNA synthesis and S-M replication
           checkpoint control. Drc1 associates with Cdc2 and is
           phosphorylated at the onset of S phase when Cdc2 is
           activated. Thus Cdc2 promotes DNA replication by
           phosphorylating Drc1 and regulating its association with
           Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
           substrates required for DNA replication.
          Length = 397

 Score = 30.9 bits (70), Expect = 0.30
 Identities = 15/61 (24%), Positives = 31/61 (50%), Gaps = 3/61 (4%)

Query: 93  EEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKV 152
           +  +E    E D  EE     K +E+   +        +E++E ++ ++ +EE E+K+KV
Sbjct: 304 KPSDEPSLPESDIHEE---IPKLDEKSLSEFLGYMGGIDEDDEDEDDEESKEEVEKKQKV 360

Query: 153 E 153
           +
Sbjct: 361 K 361



 Score = 30.1 bits (68), Expect = 0.47
 Identities = 15/60 (25%), Positives = 26/60 (43%), Gaps = 2/60 (3%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKD--KKEEEVEEEEEEQKNKKK 141
           +  ++    E +  EE  + DE    +        ++  +    EE  EE E++QK KKK
Sbjct: 304 KPSDEPSLPESDIHEEIPKLDEKSLSEFLGYMGGIDEDDEDEDDEESKEEVEKKQKVKKK 363



 Score = 27.1 bits (60), Expect = 4.6
 Identities = 16/69 (23%), Positives = 32/69 (46%), Gaps = 6/69 (8%)

Query: 73  SASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
           S   +L  ++  EE  + +E+   E        DE+ + E  EE +E      E E +++
Sbjct: 306 SDEPSLPESDIHEEIPKLDEKSLSEFLGYMGGIDEDDEDEDDEESKE------EVEKKQK 359

Query: 133 EEEQKNKKK 141
            +++  K+K
Sbjct: 360 VKKKPRKRK 368



 Score = 26.3 bits (58), Expect = 8.7
 Identities = 13/58 (22%), Positives = 27/58 (46%)

Query: 86  EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           +  ++    E +  E+    DE+   E         +D ++E+ EE +EE + K+K +
Sbjct: 304 KPSDEPSLPESDIHEEIPKLDEKSLSEFLGYMGGIDEDDEDEDDEESKEEVEKKQKVK 361


>gnl|CDD|233224 TIGR00993, 3a0901s04IAP86, chloroplast protein import component
           Toc86/159, G and M domains.  The long precursor of the
           86K protein originally described is proposed to have
           three domains. The N-terminal A-domain is acidic,
           repetitive, weakly conserved, readily removed by
           proteolysis during chloroplast isolation, and not
           required for protein translocation. The other domains
           are designated G (GTPase) and M (membrane anchor); this
           family includes most of the G domain and all of M
           [Transport and binding proteins, Amino acids, peptides
           and amines].
          Length = 763

 Score = 31.1 bits (70), Expect = 0.30
 Identities = 20/86 (23%), Positives = 38/86 (44%)

Query: 80  NNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
           + E  + E E   + +EE  E E D+    K   K +  +  K++++  +EE +   K  
Sbjct: 384 DEEDSDIELEDSSDSDEESGEDEYDQLPPFKPLTKAQMAKLSKEQRKAYLEEYDYRVKLL 443

Query: 140 KKEEEEEEEKEKVEMYRNPVPMYRNP 165
           +K++  EE K    M +    +   P
Sbjct: 444 QKKQWREELKRMKMMKKFGKEIGELP 469


>gnl|CDD|224530 COG1614, CdhC, CO dehydrogenase/acetyl-CoA synthase beta subunit
           [Energy production and conversion].
          Length = 470

 Score = 30.9 bits (70), Expect = 0.30
 Identities = 20/62 (32%), Positives = 32/62 (51%)

Query: 99  EEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYRNP 158
           EE     DE ++  K++     ++  +EEE EEEEEE++  + E   EE     E+   P
Sbjct: 377 EEDATTIDELREFLKEKGHPVVERWAEEEEEEEEEEEEEAAEAEAPMEEPVPGFEVPEMP 436

Query: 159 VP 160
           +P
Sbjct: 437 MP 438



 Score = 29.8 bits (67), Expect = 0.68
 Identities = 15/27 (55%), Positives = 18/27 (66%)

Query: 83  WEEEEKEKEEEEEEEEEEKEKDEDEEK 109
           W EEE+E+EEEEEEE  E E   +E  
Sbjct: 401 WAEEEEEEEEEEEEEAAEAEAPMEEPV 427


>gnl|CDD|219355 pfam07267, Nucleo_P87, Nucleopolyhedrovirus capsid protein P87.
           This family consists of several Nucleopolyhedrovirus
           capsid protein P87 sequences. P87 is expressed late in
           infection and concentrated in infected cell nuclei.
          Length = 606

 Score = 31.0 bits (70), Expect = 0.31
 Identities = 9/27 (33%), Positives = 21/27 (77%)

Query: 91  EEEEEEEEEEKEKDEDEEKKREKKEEE 117
            +E+E++ +E E D ++E+KR ++E++
Sbjct: 371 SDEDEDDSDEDEVDYEKERKRRREEDK 397


>gnl|CDD|226096 COG3566, COG3566, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 379

 Score = 30.7 bits (69), Expect = 0.32
 Identities = 10/65 (15%), Positives = 21/65 (32%)

Query: 89  EKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEE 148
             EE +  E E          ++    E++      K +  +   +E+ + KK E  +  
Sbjct: 198 SLEETKASEVEHLSASLKTATEKVDALEKDLHAAQAKLDSGQALTKEELDAKKAELSKAL 257

Query: 149 KEKVE 153
                
Sbjct: 258 AALEA 262



 Score = 26.1 bits (57), Expect = 9.8
 Identities = 13/69 (18%), Positives = 24/69 (34%)

Query: 64  LTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKD 123
           L+  L  A    + LE +    + K    +   +EE   K  +  K     E      +D
Sbjct: 210 LSASLKTATEKVDALEKDLHAAQAKLDSGQALTKEELDAKKAELSKALAALEAANAADED 269

Query: 124 KKEEEVEEE 132
            ++ +   E
Sbjct: 270 PQDRDAAVE 278


>gnl|CDD|235668 PRK06007, fliF, flagellar MS-ring protein; Reviewed.
          Length = 542

 Score = 30.6 bits (70), Expect = 0.33
 Identities = 19/69 (27%), Positives = 32/69 (46%)

Query: 67  ILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE 126
            L   +   E  E     EE   +E+EE   E E   DE+ E+++ ++E + E    +  
Sbjct: 461 RLRPLLPEEELAEEEAAAEEAALEEDEEAALEVELSDDEELEEEKAEEELKYEDLLKRLR 520

Query: 127 EEVEEEEEE 135
           E  +E+ EE
Sbjct: 521 ELAKEDPEE 529



 Score = 28.7 bits (65), Expect = 1.8
 Identities = 18/64 (28%), Positives = 38/64 (59%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
             E+E  EEE   EE   ++++E     +  ++EE +++K EEE++ E+  ++ ++  +E
Sbjct: 466 LPEEELAEEEAAAEEAALEEDEEAALEVELSDDEELEEEKAEEELKYEDLLKRLRELAKE 525

Query: 145 EEEE 148
           + EE
Sbjct: 526 DPEE 529


>gnl|CDD|221429 pfam12118, SprA-related, SprA-related family.  This protein is
           found in bacteria. Proteins in this family are typically
           between 234 to 465 amino acids in length. There is a
           conserved GEV sequence motif.Most members are annotated
           as being SprA-related.
          Length = 261

 Score = 30.5 bits (69), Expect = 0.34
 Identities = 17/56 (30%), Positives = 34/56 (60%)

Query: 69  PQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDK 124
             A SAS+ L+    E  ++E   EEE E+++ E  +   +K+++++++EEKQ  +
Sbjct: 60  TAAGSASQELQKQASESNDEEVVGEEEPEKDQAEAGKQLSQKQQEQQQQEEKQVRE 115



 Score = 28.2 bits (63), Expect = 1.8
 Identities = 13/51 (25%), Positives = 31/51 (60%)

Query: 89  EKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
             +E +++  E  +++   E++ EK + E  KQ  +K++E +++EE+Q  +
Sbjct: 65  ASQELQKQASESNDEEVVGEEEPEKDQAEAGKQLSQKQQEQQQQEEKQVRE 115



 Score = 26.7 bits (59), Expect = 6.4
 Identities = 10/53 (18%), Positives = 29/53 (54%)

Query: 87  EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
                   +E +++  +  DEE   E++ E+++ +  K+  + ++E+++Q+ K
Sbjct: 59  STAAGSASQELQKQASESNDEEVVGEEEPEKDQAEAGKQLSQKQQEQQQQEEK 111



 Score = 26.7 bits (59), Expect = 7.1
 Identities = 11/49 (22%), Positives = 28/49 (57%)

Query: 97  EEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
              ++ + +  E   E+   EEE +KD+ E   +  +++Q+ +++EE++
Sbjct: 64  SASQELQKQASESNDEEVVGEEEPEKDQAEAGKQLSQKQQEQQQQEEKQ 112


>gnl|CDD|215590 PLN03123, PLN03123, poly [ADP-ribose] polymerase; Provisional.
          Length = 981

 Score = 30.9 bits (70), Expect = 0.35
 Identities = 15/64 (23%), Positives = 30/64 (46%)

Query: 88  KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
           K   E +EE+ EE++++  +  KR+K    ++K K  K +           KK  + E +
Sbjct: 185 KSPSEAKEEKAEERKQESKKGAKRKKDASGDDKSKKAKTDRDVSTSTAASQKKSSDLESK 244

Query: 148 EKEK 151
            + +
Sbjct: 245 LEAQ 248


>gnl|CDD|237378 PRK13406, bchD, magnesium chelatase subunit D; Provisional.
          Length = 584

 Score = 30.8 bits (70), Expect = 0.35
 Identities = 17/63 (26%), Positives = 30/63 (47%), Gaps = 8/63 (12%)

Query: 58  IRANARLTVILPQA--ISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKE 115
           +   ARL V+ P+A  + A       +  EEE        E++++  +DE+E+   E + 
Sbjct: 243 LALAARL-VLAPRATRLPAPP-----QPPEEEPPPPPPPPEDDDDPPEDEEEQDDAEDRA 296

Query: 116 EEE 118
            EE
Sbjct: 297 LEE 299



 Score = 28.5 bits (64), Expect = 1.7
 Identities = 11/44 (25%), Positives = 21/44 (47%), Gaps = 3/44 (6%)

Query: 86  EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
              +  EEE        +D+D+     + EEE++  +D+  EE+
Sbjct: 260 APPQPPEEEPPPPPPPPEDDDDP---PEDEEEQDDAEDRALEEI 300



 Score = 27.3 bits (61), Expect = 4.5
 Identities = 10/40 (25%), Positives = 18/40 (45%)

Query: 92  EEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEE 131
              +  EEE        +  +   E+EE+Q D ++  +EE
Sbjct: 260 APPQPPEEEPPPPPPPPEDDDDPPEDEEEQDDAEDRALEE 299


>gnl|CDD|225962 COG3428, COG3428, Predicted membrane protein [Function unknown].
          Length = 494

 Score = 30.5 bits (69), Expect = 0.35
 Identities = 15/48 (31%), Positives = 19/48 (39%), Gaps = 1/48 (2%)

Query: 88  KEKEEEEEEEEEEKEKDED-EEKKREKKEEEEEKQKDKKEEEVEEEEE 134
            E E E  E E   E      E + +K +  E  + D  EEEV E   
Sbjct: 123 GEAEAELVEREIAFELAALVREARVKKLDALELAEADTPEEEVAEVLA 170


>gnl|CDD|233194 TIGR00930, 2a30, K-Cl cotransporter.  [Transport and binding
           proteins, Other].
          Length = 953

 Score = 30.8 bits (70), Expect = 0.37
 Identities = 16/56 (28%), Positives = 22/56 (39%), Gaps = 6/56 (10%)

Query: 88  KEKEEEEEEEEE--EKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
           K + E  E  EE     +    EK RE K+ +    K  K      + E Q N +K
Sbjct: 835 KPQTESMEAFEEMIRPFRLHKTEKDREAKDPKMTWTKPWK----ITDAELQSNVRK 886



 Score = 29.7 bits (67), Expect = 0.73
 Identities = 20/121 (16%), Positives = 44/121 (36%), Gaps = 33/121 (27%)

Query: 64  LTVILPQAISASETLENNEWE----------EEEKEKEEEEEE------------EEEEK 101
           LT++LP  ++  +      W+          +++   E+E+++            E    
Sbjct: 774 LTLLLPYLLTTKKV-----WKKCKIRIFVGAQKDDRSEQEKKDMATLLYKFRIDAEVIVV 828

Query: 102 EKDEDEEKKREKKEEEEEKQ------KDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMY 155
             D + + + E  E  EE        K +K+ E ++ +       K  + E +    + Y
Sbjct: 829 LMDINAKPQTESMEAFEEMIRPFRLHKTEKDREAKDPKMTWTKPWKITDAELQSNVRKSY 888

Query: 156 R 156
           R
Sbjct: 889 R 889


>gnl|CDD|116948 pfam08367, M16C_assoc, Peptidase M16C associated.  This domain
           appears in eukaryotes as well as bacteria and tends to
           be found near the C-terminus of the metalloprotease M16C
           (pfam05193).
          Length = 248

 Score = 30.3 bits (69), Expect = 0.39
 Identities = 11/45 (24%), Positives = 24/45 (53%)

Query: 104 DEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEE 148
           DE  E+K  ++E E+ ++K     E ++E+  ++  + EE +   
Sbjct: 4   DETLEEKLNEEEREKLEKKKSSLSEEDKEKIIERGLELEELQSTP 48


>gnl|CDD|240412 PTZ00420, PTZ00420, coronin; Provisional.
          Length = 568

 Score = 30.7 bits (69), Expect = 0.39
 Identities = 15/76 (19%), Positives = 29/76 (38%)

Query: 72  ISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEE 131
            S++ +LE    E+        E  EEE  + +E+E    E +   +   +   E  V +
Sbjct: 475 FSSANSLERGADEDYLIVNGTNEPYEEEVIKTNENENFPLENESPCDGISRGTSELPVND 534

Query: 132 EEEEQKNKKKEEEEEE 147
           ++    N+      E 
Sbjct: 535 DDNNNLNRGNTLRSEG 550


>gnl|CDD|237748 PRK14537, PRK14537, 50S ribosomal protein L20/unknown domain fusion
           protein; Provisional.
          Length = 230

 Score = 30.3 bits (68), Expect = 0.41
 Identities = 18/76 (23%), Positives = 36/76 (47%)

Query: 74  ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
           A + L   +     K K+    E  +EK  +++EEK   + E+    +  +++ + E E 
Sbjct: 113 AKKFLSQFQSTTITKPKKVLINEVLQEKTINQNEEKTSLQNEKVLSPELSEEKSDSELET 172

Query: 134 EEQKNKKKEEEEEEEK 149
           + QK + KE++   E 
Sbjct: 173 QPQKTQLKEKKPSIEH 188


>gnl|CDD|178439 PLN02847, PLN02847, triacylglycerol lipase.
          Length = 633

 Score = 30.6 bits (69), Expect = 0.42
 Identities = 23/73 (31%), Positives = 34/73 (46%), Gaps = 9/73 (12%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEE--------EEEKQKDKKEEEVE-EEEE 134
           E    +    ++EEEEE    ED       +EE        E EK+  ++E EV+ + +E
Sbjct: 461 ESSSSDGSGHDDEEEEEPLLSEDRVITSSVEEEVTEGELWYELEKELQRQETEVDAQAQE 520

Query: 135 EQKNKKKEEEEEE 147
           E+    KE  EEE
Sbjct: 521 EEAAAAKEITEEE 533



 Score = 26.4 bits (58), Expect = 8.8
 Identities = 20/80 (25%), Positives = 39/80 (48%), Gaps = 1/80 (1%)

Query: 65  TVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDK 124
           +V      S+S+   +++ EEEE    E+       +E+  + E   E  E+E ++Q+ +
Sbjct: 455 SVEHKSESSSSDGSGHDDEEEEEPLLSEDRVITSSVEEEVTEGELWYEL-EKELQRQETE 513

Query: 125 KEEEVEEEEEEQKNKKKEEE 144
            + + +EEE     +  EEE
Sbjct: 514 VDAQAQEEEAAAAKEITEEE 533


>gnl|CDD|114629 pfam05917, DUF874, Helicobacter pylori protein of unknown function
           (DUF874).  This family consists of several hypothetical
           proteins specific to Helicobacter pylori. The function
           of this family is unknown.
          Length = 417

 Score = 30.3 bits (67), Expect = 0.42
 Identities = 19/68 (27%), Positives = 43/68 (63%)

Query: 82  EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
           E E+EE++ E+E+++ E+E  +  + + K E+++++ E++K K E+E ++          
Sbjct: 150 ELEQEEQKTEQEKQKTEKEGIELANSQIKAEQEKQKTEQEKQKTEQEKQKTSNIANKNAI 209

Query: 142 EEEEEEEK 149
           E E+E++K
Sbjct: 210 ELEQEKQK 217



 Score = 29.1 bits (64), Expect = 1.1
 Identities = 23/95 (24%), Positives = 52/95 (54%), Gaps = 1/95 (1%)

Query: 64  LTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK- 122
              +L  A  A +  +  E  + +KE E   +   +   + E EE+K E+++++ EK+  
Sbjct: 111 AAALLLAACFADDQDKKIELAQAKKEAENARDRANKSGIELEQEEQKTEQEKQKTEKEGI 170

Query: 123 DKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYRN 157
           +    +++ E+E+QK ++++++ E+EK+K     N
Sbjct: 171 ELANSQIKAEQEKQKTEQEKQKTEQEKQKTSNIAN 205



 Score = 28.3 bits (62), Expect = 2.0
 Identities = 20/70 (28%), Positives = 45/70 (64%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           E E++E++ E+E+++ EKE  E    + + ++E+++ +++K++ E E+++      K   
Sbjct: 150 ELEQEEQKTEQEKQKTEKEGIELANSQIKAEQEKQKTEQEKQKTEQEKQKTSNIANKNAI 209

Query: 144 EEEEEKEKVE 153
           E E+EK+K E
Sbjct: 210 ELEQEKQKTE 219



 Score = 27.9 bits (61), Expect = 3.1
 Identities = 19/70 (27%), Positives = 45/70 (64%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
           E  + E+E  E    + + E+EK+K E E++K E+++++     +K   E+E+E+++ +N
Sbjct: 161 EKQKTEKEGIELANSQIKAEQEKQKTEQEKQKTEQEKQKTSNIANKNAIELEQEKQKTEN 220

Query: 139 KKKEEEEEEE 148
           +K++  +E++
Sbjct: 221 EKQDLIKEQK 230


>gnl|CDD|224739 COG1826, TatA, Sec-independent protein secretion pathway components
           [Intracellular trafficking and secretion].
          Length = 94

 Score = 28.9 bits (65), Expect = 0.43
 Identities = 9/45 (20%), Positives = 26/45 (57%)

Query: 110 KREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEM 154
           K+   + + E  ++ K EE++++++E   + +  +EE ++   E+
Sbjct: 40  KKAASDVKNELDEELKLEELDDKKKELTAELQATKEELDQLASEL 84



 Score = 28.1 bits (63), Expect = 1.0
 Identities = 16/55 (29%), Positives = 27/55 (49%)

Query: 90  KEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
           K+   + + E  E+ + EE   +KKE   E Q  K+E +    E ++  KKK + 
Sbjct: 40  KKAASDVKNELDEELKLEELDDKKKELTAELQATKEELDQLASELKEDLKKKAKP 94


>gnl|CDD|219069 pfam06512, Na_trans_assoc, Sodium ion transport-associated.
           Members of this family contain a region found
           exclusively in eukaryotic sodium channels or their
           subunits, many of which are voltage-gated. Members very
           often also contain between one and four copies of
           pfam00520 and, less often, one copy of pfam00612.
          Length = 230

 Score = 30.0 bits (68), Expect = 0.44
 Identities = 22/67 (32%), Positives = 27/67 (40%), Gaps = 6/67 (8%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
            +E+   E     +EE K+   +  K  EK     E      E   EEEEE        E
Sbjct: 118 NDEDTSSESSYGFKEESKKGSAETLKLEEKDSSSSEGSTVDLEPPEEEEEEI------AE 171

Query: 144 EEEEEKE 150
           EEEE KE
Sbjct: 172 EEEEVKE 178


>gnl|CDD|221581 pfam12446, DUF3682, Protein of unknown function (DUF3682).  This
           domain family is found in eukaryotes, and is typically
           between 125 and 136 amino acids in length.
          Length = 133

 Score = 29.4 bits (66), Expect = 0.45
 Identities = 13/32 (40%), Positives = 21/32 (65%)

Query: 86  EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEE 117
               +EEEEEEEE EK++  DE + ++ ++ E
Sbjct: 92  GHTRQEEEEEEEENEKQQQSDEAQVQQHQQHE 123



 Score = 27.1 bits (60), Expect = 3.2
 Identities = 12/36 (33%), Positives = 23/36 (63%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEE 119
               +E+EEEEEE E++++ DE + ++ ++ E   E
Sbjct: 92  GHTRQEEEEEEEENEKQQQSDEAQVQQHQQHEHPAE 127


>gnl|CDD|220605 pfam10156, Med17, Subunit 17 of Mediator complex.  This Mediator
           complex subunit was formerly known as Srb4 in yeasts or
           Trap80 in Drosophila and human. The Med17 subunit is
           located within the head domain and is essential for cell
           viability to the extent that a mutant strain of
           cerevisiae lacking it shows all RNA polymerase
           II-dependent transcription ceasing at non-permissive
           temperatures.
          Length = 454

 Score = 30.4 bits (69), Expect = 0.46
 Identities = 16/51 (31%), Positives = 28/51 (54%), Gaps = 1/51 (1%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE 126
           E+L   E  +E  + +  EE +EEE ++++D +   E K+  EE  K + E
Sbjct: 52  ESLRE-EIAKEAAKIDFSEESDEEEDDEEDDNDDSEENKDTVEEFPKARDE 101



 Score = 30.0 bits (68), Expect = 0.52
 Identities = 22/98 (22%), Positives = 36/98 (36%), Gaps = 6/98 (6%)

Query: 49  VGPAQYHKPIRANARLTVILPQAISASETLE------NNEWEEEEKEKEEEEEEEEEEKE 102
            G       I  + R  +  P       +LE      N E        EE   EE  ++ 
Sbjct: 3   GGSVNEDSSIPLSLRPLIDKPAEDEDPLSLEELIERINQERGSFRDLTEESLREEIAKEA 62

Query: 103 KDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
              D  ++ +++E++EE   D  EE  +  EE  K + 
Sbjct: 63  AKIDFSEESDEEEDDEEDDNDDSEENKDTVEEFPKARD 100



 Score = 26.9 bits (60), Expect = 6.6
 Identities = 16/50 (32%), Positives = 26/50 (52%)

Query: 101 KEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
             +D  EE  RE+  +E  K    +E + EE++EE  N   EE ++  +E
Sbjct: 45  SFRDLTEESLREEIAKEAAKIDFSEESDEEEDDEEDDNDDSEENKDTVEE 94



 Score = 26.6 bits (59), Expect = 6.9
 Identities = 14/42 (33%), Positives = 21/42 (50%)

Query: 112 EKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
           E   EE  K+  K +   E +EEE   +   ++ EE K+ VE
Sbjct: 52  ESLREEIAKEAAKIDFSEESDEEEDDEEDDNDDSEENKDTVE 93



 Score = 26.6 bits (59), Expect = 8.0
 Identities = 9/44 (20%), Positives = 25/44 (56%)

Query: 86  EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
            ++  + +  EE +E+E DE+++    ++ ++  ++  K  +EV
Sbjct: 59  AKEAAKIDFSEESDEEEDDEEDDNDDSEENKDTVEEFPKARDEV 102


>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2.  It is vital for
           effective cell-replication that replication is not
           stalled at any point by, for instance, damaged bases.
           Rtf2 stabilizes the replication fork stalled at the
           site-specific replication barrier RTS1 by preventing
           replication restart until completion of DNA synthesis by
           a converging replication fork initiated at a flanking
           origin. The RTS1 element terminates replication forks
           that are moving in the cen2-distal direction while
           allowing forks moving in the cen2-proximal direction to
           pass through the region. Rtf2 contains a C2HC2 motif
           related to the C3HC4 RING-finger motif, and would appear
           to fold up, creating a RING finger-like structure but
           forming only one functional Zn2+ ion-binding site.
          Length = 254

 Score = 30.0 bits (68), Expect = 0.52
 Identities = 19/77 (24%), Positives = 31/77 (40%), Gaps = 5/77 (6%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKR-----EKKEEEEEKQKDKKEEEVEEEEEEQKN 138
           EE E  K   EEE  ++K+K + ++ K+        E         +      +  E K 
Sbjct: 167 EEVELLKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAKK 226

Query: 139 KKKEEEEEEEKEKVEMY 155
            KK+     + EK E+Y
Sbjct: 227 LKKKRSIAPDNEKSEVY 243



 Score = 26.2 bits (58), Expect = 8.4
 Identities = 17/59 (28%), Positives = 27/59 (45%), Gaps = 3/59 (5%)

Query: 94  EEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKV 152
           EEE E  K + E+E   R KK+++++K+K KK        E   +     E      +V
Sbjct: 166 EEEVELLKARLEEE---RAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQV 221


>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812).  This
           family consists of several eukaryotic proteins of
           unknown function.
          Length = 536

 Score = 30.1 bits (68), Expect = 0.52
 Identities = 25/89 (28%), Positives = 41/89 (46%)

Query: 70  QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
            A +A+E        EE++ KE E E E  +++ DE E    E   E +      K+   
Sbjct: 246 AATAAAEVPPAEMDTEEDRTKEREAELEALQEQIDELESSIEEVLSEIKALASKIKQVNE 305

Query: 130 EEEEEEQKNKKKEEEEEEEKEKVEMYRNP 158
           E     Q+N++ EEE + +K  VE+  + 
Sbjct: 306 ELTTVRQENEELEEEYKIKKRTVELLPDA 334


>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
           component YidC; Validated.
          Length = 429

 Score = 30.2 bits (68), Expect = 0.53
 Identities = 14/105 (13%), Positives = 35/105 (33%), Gaps = 10/105 (9%)

Query: 56  KPIRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKE 115
             +R N    +I P               E  + K+    E+ E K +   +E  ++++ 
Sbjct: 306 WTLRRNRLRMIITPWRAPEL-------HAENAEIKKTRTAEKNEAKAR--KKEIAQKRRA 356

Query: 116 EEEEKQKDKKEEEVEE-EEEEQKNKKKEEEEEEEKEKVEMYRNPV 159
            E E  ++ ++E          +    + +++   +       P 
Sbjct: 357 AEREINREARQERAAAMARARARRAAVKAKKKGLIDASPNEDTPS 401


>gnl|CDD|216269 pfam01056, Myc_N, Myc amino-terminal region.  The myc family
           belongs to the basic helix-loop-helix leucine zipper
           class of transcription factors, see pfam00010. Myc forms
           a heterodimer with Max, and this complex regulates cell
           growth through direct activation of genes involved in
           cell replication. Mutations in the C-terminal 20
           residues of this domain cause unique changes in the
           induction of apoptosis, transformation, and G2 arrest.
          Length = 329

 Score = 29.9 bits (67), Expect = 0.53
 Identities = 16/45 (35%), Positives = 25/45 (55%)

Query: 73  SASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEE 117
             S +   ++ E EE E+EEEEEEEEEE +    E+++     + 
Sbjct: 216 PNSSSSSGSDSESEEDEEEEEEEEEEEEIDVVTVEKRRSSSNRKA 260



 Score = 28.7 bits (64), Expect = 1.3
 Identities = 15/43 (34%), Positives = 25/43 (58%)

Query: 56  KPIRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEE 98
             + +      +     S+S +  ++E EE+E+E+EEEEEEEE
Sbjct: 201 SKVASPTPRLGLRTPPNSSSSSGSDSESEEDEEEEEEEEEEEE 243


>gnl|CDD|148679 pfam07218, RAP1, Rhoptry-associated protein 1 (RAP-1).  This family
           consists of several rhoptry-associated protein 1 (RAP-1)
           sequences which appear to be specific to Plasmodium
           falciparum.
          Length = 790

 Score = 30.0 bits (67), Expect = 0.54
 Identities = 22/61 (36%), Positives = 34/61 (55%)

Query: 77  TLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQ 136
            LE  E E +  + + ++  EEEE E DE+ E+  E K+E  EK   + +EE  + +EE 
Sbjct: 222 PLEKLEEEIDILKNDGDKVAEEEEFELDEEHEEAEEDKKEALEKIGAEGDEEKFKFDEEI 281

Query: 137 K 137
           K
Sbjct: 282 K 282



 Score = 28.1 bits (62), Expect = 2.5
 Identities = 18/51 (35%), Positives = 34/51 (66%)

Query: 90  KEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
           +EEE E +EE +E +ED+++  EK   E +++K K +EE++  E + K++ 
Sbjct: 242 EEEEFELDEEHEEAEEDKKEALEKIGAEGDEEKFKFDEEIKFIEHDVKDRN 292



 Score = 27.7 bits (61), Expect = 3.9
 Identities = 19/58 (32%), Positives = 35/58 (60%)

Query: 81  NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
           N+ ++  +E+E E +EE EE E+D+ E  ++   E +EEK K  +E +  E + + +N
Sbjct: 235 NDGDKVAEEEEFELDEEHEEAEEDKKEALEKIGAEGDEEKFKFDEEIKFIEHDVKDRN 292


>gnl|CDD|217861 pfam04050, Upf2, Up-frameshift suppressor 2.  Transcripts
           harbouring premature signals for translation termination
           are recognised and rapidly degraded by eukaryotic cells
           through a pathway known as nonsense-mediated mRNA decay.
           In Saccharomyces cerevisiae, three trans-acting factors
           (Upf1 to Upf3) are required for nonsense-mediated mRNA
           decay.
          Length = 171

 Score = 29.7 bits (67), Expect = 0.57
 Identities = 19/52 (36%), Positives = 32/52 (61%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
           +EE  E++E++E  +EE+    D+E+  E   EEE+    ++EEEV+ E E 
Sbjct: 13  DEELPEEDEDDESSDEEEVDLPDDEQDEESDSEEEQIFVTRQEEEVDPEAEA 64


>gnl|CDD|226111 COG3583, COG3583, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 309

 Score = 29.8 bits (67), Expect = 0.59
 Identities = 19/70 (27%), Positives = 31/70 (44%), Gaps = 2/70 (2%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE--EEEQKNKKKE 142
             + +KE E+ + E    +D    K +EK  +E +    +   EV  E  +E  +   KE
Sbjct: 209 NSKTDKENEQVDFETVYVEDPSMNKGKEKVVQEGQPGTKEVTFEVVTENGKETSREVVKE 268

Query: 143 EEEEEEKEKV 152
              EE K+ V
Sbjct: 269 VVVEEAKDAV 278


>gnl|CDD|215120 PLN02188, PLN02188, polygalacturonase/glycoside hydrolase family
           protein.
          Length = 404

 Score = 29.8 bits (67), Expect = 0.59
 Identities = 13/37 (35%), Positives = 17/37 (45%)

Query: 49  VGPAQYHKPIRANARLTVILPQAISASETLENNEWEE 85
           +GP Q+H P    + LT  L  A   S     N+W E
Sbjct: 81  IGPVQFHGPCTNVSSLTFTLKAATDLSRYGSGNDWIE 117


>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin.  Nucleoplasmins are also
           known as chromatin decondensation proteins. They bind to
           core histones and transfer DNA to them in a reaction
           that requires ATP. This is thought to play a role in the
           assembly of regular nucleosomal arrays.
          Length = 146

 Score = 29.2 bits (66), Expect = 0.60
 Identities = 18/42 (42%), Positives = 28/42 (66%), Gaps = 5/42 (11%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKK 125
           EE+E + +EE+EEEE+++E D+++E      EEEE   K  K
Sbjct: 110 EEDESDDDEEDEEEEDDEEDDDEDE-----SEEEESPVKKVK 146



 Score = 28.4 bits (64), Expect = 0.97
 Identities = 17/47 (36%), Positives = 31/47 (65%)

Query: 69  PQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKE 115
           P  IS    + + E E ++ E++EEEE++EE+ ++DE EE++   K+
Sbjct: 98  PVHISGQHLVASEEDESDDDEEDEEEEDDEEDDDEDESEEEESPVKK 144



 Score = 28.4 bits (64), Expect = 1.2
 Identities = 12/37 (32%), Positives = 29/37 (78%)

Query: 78  LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKK 114
            E +E +++E+++EEE++EE++++++ E+EE   +K 
Sbjct: 109 SEEDESDDDEEDEEEEDDEEDDDEDESEEEESPVKKV 145



 Score = 28.1 bits (63), Expect = 1.4
 Identities = 18/45 (40%), Positives = 33/45 (73%)

Query: 67  ILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKR 111
           I  Q + ASE  E+++ EE+E+E+++EE+++E+E E++E   KK 
Sbjct: 101 ISGQHLVASEEDESDDDEEDEEEEDDEEDDDEDESEEEESPVKKV 145



 Score = 27.3 bits (61), Expect = 3.0
 Identities = 15/38 (39%), Positives = 27/38 (71%)

Query: 87  EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDK 124
            +E E +++EE+EE+E DE+++ + E +EEE   +K K
Sbjct: 109 SEEDESDDDEEDEEEEDDEEDDDEDESEEEESPVKKVK 146



 Score = 26.5 bits (59), Expect = 4.5
 Identities = 14/37 (37%), Positives = 25/37 (67%)

Query: 89  EKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKK 125
            +E+E +++EE++E+++DEE   E + EEEE    K 
Sbjct: 109 SEEDESDDDEEDEEEEDDEEDDDEDESEEEESPVKKV 145


>gnl|CDD|218010 pfam04293, SpoVR, SpoVR like protein.  Bacillus subtilis stage V
           sporulation protein R is involved in spore cortex
           formation. Little is known about cortex biosynthesis,
           except that it depends on several sigma E controlled
           genes, including spoVR.
          Length = 427

 Score = 29.9 bits (68), Expect = 0.60
 Identities = 14/49 (28%), Positives = 24/49 (48%)

Query: 83  WEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEE 131
            EEE+  +EE EE  + +          + +K EEE  ++ +  EE +E
Sbjct: 177 LEEEKARQEEREEYLQSQVNDLWRTIPPQREKLEEERARRKRFPEEPQE 225



 Score = 27.2 bits (61), Expect = 4.7
 Identities = 14/55 (25%), Positives = 25/55 (45%), Gaps = 4/55 (7%)

Query: 88  KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE----EEEQKN 138
             K   EEE+  ++E++E  + +          Q++K EEE        EE Q++
Sbjct: 172 PIKLSLEEEKARQEEREEYLQSQVNDLWRTIPPQREKLEEERARRKRFPEEPQED 226


>gnl|CDD|223599 COG0525, ValS, Valyl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 877

 Score = 29.9 bits (68), Expect = 0.61
 Identities = 19/68 (27%), Positives = 31/68 (45%), Gaps = 6/68 (8%)

Query: 87  EKEKEEEEEEEEEEKEKDE-DEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
               +   E    EKE ++ ++E  R +K+   E    K  EEV E+E     K+K  E 
Sbjct: 807 AGLIDLAAELARLEKELEKLEKEIDRIEKKLSNEGFVAKAPEEVVEKE-----KEKLAEY 861

Query: 146 EEEKEKVE 153
           + +  K+E
Sbjct: 862 QVKLAKLE 869



 Score = 29.1 bits (66), Expect = 1.3
 Identities = 27/116 (23%), Positives = 40/116 (34%), Gaps = 15/116 (12%)

Query: 49  VGPAQYHKPIRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEK------- 101
           VG  +    + AN          I     LE  E    + E+   E              
Sbjct: 754 VGSEELEDRLEANED-------DIKGLANLEELEILSPDPEEPPVEVTAVVGGAELFLPL 806

Query: 102 EKDEDEEKKREKKEEEEEK-QKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYR 156
               D   +  + E+E EK +K+    E +   E    K  EE  E+EKEK+  Y+
Sbjct: 807 AGLIDLAAELARLEKELEKLEKEIDRIEKKLSNEGFVAKAPEEVVEKEKEKLAEYQ 862


>gnl|CDD|227478 COG5149, TOA1, Transcription initiation factor IIA, large chain
           [Transcription].
          Length = 293

 Score = 29.6 bits (66), Expect = 0.62
 Identities = 15/81 (18%), Positives = 31/81 (38%), Gaps = 5/81 (6%)

Query: 73  SASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDE-----EKKREKKEEEEEKQKDKKEE 127
           SA  TL N +W E   +  + E + E  + +D  +     E+  E      + +    ++
Sbjct: 163 SAITTLRNTDWRERLIDDTQSEWDGERMRRRDGKQGIHQYERLSEGPAHAFKGKPTTAKD 222

Query: 128 EVEEEEEEQKNKKKEEEEEEE 148
           E    + +  +    + E E 
Sbjct: 223 EGMFSDLDDSDVDSGDSEIEG 243


>gnl|CDD|223003 PHA03169, PHA03169, hypothetical protein; Provisional.
          Length = 413

 Score = 29.9 bits (67), Expect = 0.63
 Identities = 9/81 (11%), Positives = 16/81 (19%)

Query: 73  SASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
           SA E       E       E                       E      +++     + 
Sbjct: 110 SAEELASGLSPENTSGSSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQP 169

Query: 133 EEEQKNKKKEEEEEEEKEKVE 153
             E   ++ E    E +    
Sbjct: 170 SHEDSPEEPEPPTSEPEPDSP 190



 Score = 29.2 bits (65), Expect = 1.2
 Identities = 14/84 (16%), Positives = 29/84 (34%)

Query: 82  EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
             E+  +E E    E E +       E        +    +  + +    ++    N ++
Sbjct: 170 SHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPPDEPGEPQSPTPQQAPSPNTQQ 229

Query: 142 EEEEEEEKEKVEMYRNPVPMYRNP 165
             E E+E  + E    P P +R+ 
Sbjct: 230 AVEHEDEPTEPEREGPPFPGHRSH 253



 Score = 27.6 bits (61), Expect = 3.6
 Identities = 12/77 (15%), Positives = 20/77 (25%)

Query: 74  ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
           A     + + E + +  EE    E+EE+ +        E             EE      
Sbjct: 60  AVAEQGHRQTESDTETAEESRHGEKEERGQGGPSGSGSESVGSPTPSPSGSAEELASGLS 119

Query: 134 EEQKNKKKEEEEEEEKE 150
            E  +    E       
Sbjct: 120 PENTSGSSPESPASHSP 136



 Score = 27.6 bits (61), Expect = 3.7
 Identities = 13/73 (17%), Positives = 23/73 (31%)

Query: 81  NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
           N+      +   E+  EE E    E E       + E        +   +E  E Q    
Sbjct: 160 NQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPPDEPGEPQSPTP 219

Query: 141 KEEEEEEEKEKVE 153
           ++      ++ VE
Sbjct: 220 QQAPSPNTQQAVE 232



 Score = 27.2 bits (60), Expect = 5.3
 Identities = 10/78 (12%), Positives = 17/78 (21%)

Query: 73  SASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
           S SE++ +         +E       E       E                  E    E 
Sbjct: 95  SGSESVGSPTPSPSGSAEELASGLSPENTSGSSPESPASHSPPPSPPSHPGPHEPAPPES 154

Query: 133 EEEQKNKKKEEEEEEEKE 150
                N++     +   E
Sbjct: 155 HNPSPNQQPSSFLQPSHE 172



 Score = 26.9 bits (59), Expect = 5.7
 Identities = 7/79 (8%), Positives = 16/79 (20%)

Query: 75  SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
           S +   +                 E    +       ++     +   +   EE E    
Sbjct: 124 SGSSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTS 183

Query: 135 EQKNKKKEEEEEEEKEKVE 153
           E +       + E      
Sbjct: 184 EPEPDSPGPPQSETPTSSP 202



 Score = 26.5 bits (58), Expect = 7.5
 Identities = 10/73 (13%), Positives = 24/73 (32%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
            + +  EE +    E E +     + E        +   +E  + +     +      + 
Sbjct: 170 SHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPPDEPGEPQSPTPQQAPSPNTQQ 229

Query: 139 KKKEEEEEEEKEK 151
             + E+E  E E+
Sbjct: 230 AVEHEDEPTEPER 242


>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon. 
          Length = 431

 Score = 30.0 bits (67), Expect = 0.65
 Identities = 25/70 (35%), Positives = 41/70 (58%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
              EN   EEEEKE+  EE EE EE E     E+K + ++ EE ++++K+ E  EEE+ +
Sbjct: 104 SGAENETVEEEEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKEPEPEEEEKPK 163

Query: 136 QKNKKKEEEE 145
           + + ++   E
Sbjct: 164 RGSLEENNGE 173



 Score = 29.2 bits (65), Expect = 1.2
 Identities = 23/72 (31%), Positives = 41/72 (56%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
           E++  E E  E+EE+EE  EE +E +E E   + +++ +    ++ ++EE E E EE++ 
Sbjct: 102 EDSGAENETVEEEEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKEPEPEEEEK 161

Query: 139 KKKEEEEEEEKE 150
            K+   EE   E
Sbjct: 162 PKRGSLEENNGE 173



 Score = 28.8 bits (64), Expect = 1.4
 Identities = 20/76 (26%), Positives = 42/76 (55%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
            E   + +E+   E E  ++E++E+ RE++EE EE +   K E+  +  + ++ +K+E+E
Sbjct: 94  SEPSRRMQEDSGAENETVEEEEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKE 153

Query: 145 EEEEKEKVEMYRNPVP 160
            E E+E+     +   
Sbjct: 154 PEPEEEEKPKRGSLEE 169



 Score = 28.1 bits (62), Expect = 2.3
 Identities = 22/70 (31%), Positives = 41/70 (58%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
            E  +  +E+   E E  E++E EE + E++E EE +   K E++ +  + E+  K+++E
Sbjct: 94  SEPSRRMQEDSGAENETVEEEEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKE 153

Query: 144 EEEEEKEKVE 153
            E EE+EK +
Sbjct: 154 PEPEEEEKPK 163



 Score = 28.1 bits (62), Expect = 2.3
 Identities = 27/90 (30%), Positives = 47/90 (52%)

Query: 68  LPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEE 127
            P      ++   NE  EEE+++E  EE EE E+ +   + +++    + EE QK++KE 
Sbjct: 95  EPSRRMQEDSGAENETVEEEEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKEP 154

Query: 128 EVEEEEEEQKNKKKEEEEEEEKEKVEMYRN 157
           E EEEE+ ++   +E   E    K++   N
Sbjct: 155 EPEEEEKPKRGSLEENNGEFMTHKLKHTEN 184



 Score = 27.3 bits (60), Expect = 4.3
 Identities = 18/67 (26%), Positives = 43/67 (64%)

Query: 82  EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
             E E+ +++++E   E E+ K + EE+++  +EEE+ +++++ + +  EEEE+++ K++
Sbjct: 199 GKEFEKLKQKQQEAALELEELKKKREERRKVLEEEEQRRKQEEADRKSREEEEKRRLKEE 258

Query: 142 EEEEEEE 148
            E    E
Sbjct: 259 IERRRAE 265



 Score = 27.3 bits (60), Expect = 4.5
 Identities = 25/67 (37%), Positives = 38/67 (56%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
           EE K+K EE  +  EE+E+   +E+   K  EEEEK++ K+E E    E  +K +K  E+
Sbjct: 217 EELKKKREERRKVLEEEEQRRKQEEADRKSREEEEKRRLKEEIERRRAEAAEKRQKVPED 276

Query: 145 EEEEKEK 151
              E +K
Sbjct: 277 GLSEDKK 283


>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
          Length = 259

 Score = 29.7 bits (67), Expect = 0.66
 Identities = 10/39 (25%), Positives = 25/39 (64%)

Query: 115 EEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
           + E+EK+      E E+ +++++ + K+++E EE + V+
Sbjct: 73  KPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQ 111



 Score = 28.9 bits (65), Expect = 0.97
 Identities = 21/61 (34%), Positives = 36/61 (59%), Gaps = 1/61 (1%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
           E+EKE      E E+ K+K+E E KK+++ EE +  QK K ++ ++E++     + K  E
Sbjct: 75  EDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQK-KIDQYIKEKQLSSSLQTKLTE 133

Query: 145 E 145
           E
Sbjct: 134 E 134



 Score = 28.9 bits (65), Expect = 1.0
 Identities = 10/34 (29%), Positives = 22/34 (64%)

Query: 93  EEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE 126
           + E+E+E      + E+ ++K+E E +K+K+ +E
Sbjct: 73  KPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEE 106



 Score = 28.6 bits (64), Expect = 1.4
 Identities = 13/49 (26%), Positives = 29/49 (59%)

Query: 101 KEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEK 149
           K +DE E      + E+ +K+++ + ++ +E EE +  +KK ++  +EK
Sbjct: 73  KPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKEK 121



 Score = 27.8 bits (62), Expect = 2.7
 Identities = 7/37 (18%), Positives = 22/37 (59%)

Query: 89  EKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKK 125
           + E+E+E      E ++ ++K+  + ++++E ++ K 
Sbjct: 73  KPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKA 109



 Score = 27.8 bits (62), Expect = 3.0
 Identities = 13/59 (22%), Positives = 32/59 (54%)

Query: 96  EEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEM 154
           + E+EKE      +  + K++EE + K KKE E  +  +++ ++  +E++     + ++
Sbjct: 73  KPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKEKQLSSSLQTKL 131



 Score = 26.2 bits (58), Expect = 9.5
 Identities = 12/31 (38%), Positives = 19/31 (61%)

Query: 121 QKDKKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
           QK + E+E+     E +  KK+EE E +K+K
Sbjct: 72  QKPEDEKELSASSLEAEQAKKKEEAEAKKKK 102


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 30.0 bits (68), Expect = 0.69
 Identities = 24/67 (35%), Positives = 46/67 (68%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
           E+E ++ EE +EE EE EK+ +  +  ++K EE+ ++ +++ EE+++E EE + K KE +
Sbjct: 227 EKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELK 286

Query: 145 EEEEKEK 151
           E +EK +
Sbjct: 287 ELKEKAE 293



 Score = 29.6 bits (67), Expect = 0.82
 Identities = 24/72 (33%), Positives = 39/72 (54%)

Query: 80  NNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
               EE +K+ +E E+  EE +E+ E  E+ + KKEE E  +K       E+ E+E +  
Sbjct: 337 EERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEEL 396

Query: 140 KKEEEEEEEKEK 151
           +K +EE EE+  
Sbjct: 397 EKAKEEIEEEIS 408



 Score = 27.0 bits (60), Expect = 6.0
 Identities = 26/73 (35%), Positives = 46/73 (63%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
           +E +E +EE EE E+E E  E  ++K E+K  E E++ ++ ++E+EE EE+ K  K+ +E
Sbjct: 231 KELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKE 290

Query: 145 EEEEKEKVEMYRN 157
           + EE  K+  +  
Sbjct: 291 KAEEYIKLSEFYE 303


>gnl|CDD|236154 PRK08119, PRK08119, flagellar motor switch protein; Validated.
          Length = 382

 Score = 29.8 bits (68), Expect = 0.69
 Identities = 16/78 (20%), Positives = 32/78 (41%), Gaps = 4/78 (5%)

Query: 81  NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK--- 137
            E EEEE+E EEEE +     E    +     K+E+++   + ++ E+  +    Q    
Sbjct: 225 GEEEEEEEEVEEEEAQASPAAEPATAQAAPAPKQEQQQAPPQRQEPEKEAQPVNVQPAQF 284

Query: 138 -NKKKEEEEEEEKEKVEM 154
            +         E   +++
Sbjct: 285 QSFDPPPLATTEPRNLDL 302



 Score = 28.3 bits (64), Expect = 2.1
 Identities = 15/74 (20%), Positives = 33/74 (44%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           EEEE+E+E EEEE +     +    +     ++E+++   +++E  +E +       + +
Sbjct: 226 EEEEEEEEVEEEEAQASPAAEPATAQAAPAPKQEQQQAPPQRQEPEKEAQPVNVQPAQFQ 285

Query: 144 EEEEEKEKVEMYRN 157
             +         RN
Sbjct: 286 SFDPPPLATTEPRN 299


>gnl|CDD|227391 COG5058, LAG1, Protein transporter of the TRAM (translocating
           chain-associating membrane) superfamily, longevity
           assurance factor [Intracellular trafficking and
           secretion].
          Length = 395

 Score = 29.8 bits (67), Expect = 0.72
 Identities = 16/71 (22%), Positives = 24/71 (33%), Gaps = 18/71 (25%)

Query: 51  PAQYHK--PIRANARLTVILPQAIS-------------ASETLENNEWEEEEKEKEEEEE 95
             Q +K         +  IL QA+              A   +   E ++E  + E EEE
Sbjct: 328 ATQQYKCWISLP---IVFILLQALQLVNIYWLFLIVRVAYRVIWEGELKDERSDDESEEE 384

Query: 96  EEEEEKEKDED 106
            + E  E   D
Sbjct: 385 SDLESSEDKND 395


>gnl|CDD|224337 COG1419, FlhF, Flagellar GTP-binding protein [Cell motility and
           secretion].
          Length = 407

 Score = 29.6 bits (67), Expect = 0.72
 Identities = 12/72 (16%), Positives = 25/72 (34%), Gaps = 3/72 (4%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKE---EEEEKQKDKKEEEVEEEEEE 135
              +  EE K  ++ E       E+   E K   ++      +EK   ++ EE+  E   
Sbjct: 83  PAEKKREERKAAKKIERSTPSLIERKTQEVKDSGEEIAEMMRDEKVPIRELEEIPPEFVA 142

Query: 136 QKNKKKEEEEEE 147
              ++ +     
Sbjct: 143 LYKQEIQSPTRL 154



 Score = 29.6 bits (67), Expect = 0.78
 Identities = 16/86 (18%), Positives = 39/86 (45%), Gaps = 10/86 (11%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREK--KEEEEEKQKDKKEEEVEEEE 133
           +T E  + ++ E +  ++  E++ E+ K   + ++      E + ++ KD  EE  E   
Sbjct: 65  QTREAAKVKDAEAQLLKDPAEKKREERKAAKKIERSTPSLIERKTQEVKDSGEEIAEMMR 124

Query: 134 EEQKNKKKEEE--------EEEEKEK 151
           +E+   ++ EE         ++E + 
Sbjct: 125 DEKVPIRELEEIPPEFVALYKQEIQS 150


>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein.  This entry is a highly
           conserved protein present in eukaryotes.
          Length = 680

 Score = 29.9 bits (67), Expect = 0.74
 Identities = 20/63 (31%), Positives = 31/63 (49%), Gaps = 1/63 (1%)

Query: 93  EEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQ-KNKKKEEEEEEEKEK 151
            E +     EK   EEKKR+K+EEE   +   +     EE  E  K  K++ E E +K +
Sbjct: 487 SEADSRVNAEKQLAEEKKRKKEEEETAARAAAQAAASREECAESLKQAKQDLEMEIKKLE 546

Query: 152 VEM 154
            ++
Sbjct: 547 HDL 549


>gnl|CDD|223626 COG0552, FtsY, Signal recognition particle GTPase [Intracellular
           trafficking and secretion].
          Length = 340

 Score = 29.5 bits (67), Expect = 0.75
 Identities = 24/67 (35%), Positives = 35/67 (52%), Gaps = 11/67 (16%)

Query: 81  NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKR-----------EKKEEEEEKQKDKKEEEV 129
            E EE E EKE+ EEE+EEE+++   E  K+             K    +K K+K +E++
Sbjct: 9   EELEELEAEKEKIEEEDEEEEKEGWFERLKQGLSKTKKNFGKGIKGLFLKKIKEKLDEDL 68

Query: 130 EEEEEEQ 136
            EE EE 
Sbjct: 69  LEELEEL 75



 Score = 29.1 bits (66), Expect = 1.1
 Identities = 22/74 (29%), Positives = 38/74 (51%), Gaps = 3/74 (4%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQ---KDKKEEEVEEEEEEQKNKK 140
               ++ EEE EE E EKEK E+E+++ EK+   E  +      K+   +  +     K 
Sbjct: 1   MGFFEKLEEELEELEAEKEKIEEEDEEEEKEGWFERLKQGLSKTKKNFGKGIKGLFLKKI 60

Query: 141 KEEEEEEEKEKVEM 154
           KE+ +E+  E++E 
Sbjct: 61  KEKLDEDLLEELEE 74


>gnl|CDD|113290 pfam04514, BTV_NS2, Bluetongue virus non-structural protein NS2.
           This family includes NS2 proteins from other members of
           the Orbivirus genus. NS2 is a non-specific
           single-stranded RNA-binding protein that forms large
           homomultimers and accumulates in viral inclusion bodies
           of infected cells. Three RNA binding regions have been
           identified in Bluetongue virus serotype 17 at residues
           2-11, 153-166 and 274-286. NS2 multimers also possess
           nucleotidyl phosphatase activity. The precise function
           of NS2 is not known, but it may be involved in the
           transport and condensation of viral mRNAs.
          Length = 363

 Score = 29.5 bits (66), Expect = 0.75
 Identities = 17/68 (25%), Positives = 33/68 (48%)

Query: 86  EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
            E  K++++EE     E+   E  +      EE ++  +KE+E+EE  E   + + E  E
Sbjct: 205 PETSKQDQKEERRAAVERRLAELVEMINWNLEERRRDLRKEQELEENVERDSDDEDEHGE 264

Query: 146 EEEKEKVE 153
           + E  + +
Sbjct: 265 DSEDGETK 272



 Score = 28.3 bits (63), Expect = 2.1
 Identities = 18/84 (21%), Positives = 36/84 (42%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           E    E  E      EE+ +D  +E++ E+  E +   +D+  E+ E+ E + ++    E
Sbjct: 221 ERRLAELVEMINWNLEERRRDLRKEQELEENVERDSDDEDEHGEDSEDGETKPESYITSE 280

Query: 144 EEEEEKEKVEMYRNPVPMYRNPVP 167
             E   E  +M    +    + +P
Sbjct: 281 YIERISEIRKMKDERLSSLASMMP 304



 Score = 28.3 bits (63), Expect = 2.2
 Identities = 12/68 (17%), Positives = 39/68 (57%)

Query: 81  NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
           N  +EE  +   E  ++++++E+    E++  +  E      +++  ++ +E+E ++N +
Sbjct: 194 NPLDEEAPDMTPETSKQDQKEERRAAVERRLAELVEMINWNLEERRRDLRKEQELEENVE 253

Query: 141 KEEEEEEE 148
           ++ ++E+E
Sbjct: 254 RDSDDEDE 261



 Score = 27.5 bits (61), Expect = 3.2
 Identities = 20/75 (26%), Positives = 34/75 (45%), Gaps = 2/75 (2%)

Query: 71  AISASETLENNEWEEEEKEKEEEEEEEEEEKEKDE--DEEKKREKKEEEEEKQKDKKEEE 128
               +E +E   W  EE+ ++  +E+E EE  + +  DE++  E  E+ E K +     E
Sbjct: 221 ERRLAELVEMINWNLEERRRDLRKEQELEENVERDSDDEDEHGEDSEDGETKPESYITSE 280

Query: 129 VEEEEEEQKNKKKEE 143
             E   E +  K E 
Sbjct: 281 YIERISEIRKMKDER 295


>gnl|CDD|234173 TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB.  Members
           of this protein family are the bacterial ATP-dependent
           chaperone ClpB. This protein belongs to the AAA family,
           ATPases associated with various cellular activities
           (pfam00004). This molecular chaperone does not act as a
           protease, but rather serves to disaggregate misfolded
           and aggregated proteins [Protein fate, Protein folding
           and stabilization].
          Length = 852

 Score = 29.5 bits (67), Expect = 0.75
 Identities = 21/65 (32%), Positives = 37/65 (56%), Gaps = 4/65 (6%)

Query: 93  EEEEEEEEKEKDEDEEKKREKKEEE----EEKQKDKKEEEVEEEEEEQKNKKKEEEEEEE 148
           E E E  +KEKDE  +++ E  E+E    EE+  D +E+   E+   Q  ++ +EE E+ 
Sbjct: 418 EIEREALKKEKDEASKERLEDLEKELAELEEEYADLEEQWKAEKAAIQGIQQIKEEIEQV 477

Query: 149 KEKVE 153
           + ++E
Sbjct: 478 RLELE 482


>gnl|CDD|216881 pfam02115, Rho_GDI, RHO protein GDP dissociation inhibitor. 
          Length = 200

 Score = 29.2 bits (66), Expect = 0.76
 Identities = 13/44 (29%), Positives = 26/44 (59%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEE 127
           E+E   ++ EE   E+E+++D    K   +K  +E ++ DK++E
Sbjct: 3   EQEPTAEQAEEIAAEDEEDEDSVNYKPPPQKSLKEIQELDKEDE 46


>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
           Provisional.
          Length = 1123

 Score = 29.9 bits (68), Expect = 0.77
 Identities = 17/73 (23%), Positives = 33/73 (45%), Gaps = 3/73 (4%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEE---EEEEQKNKK 140
             E+ E      ++E    K + E + REK + +   +  ++E    E    E E+K ++
Sbjct: 136 PPEDPENLLHALQQEVLTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAELEEKQQE 195

Query: 141 KEEEEEEEKEKVE 153
            E + E+ +EK  
Sbjct: 196 LEAQLEQLQEKAA 208



 Score = 26.8 bits (60), Expect = 7.0
 Identities = 15/84 (17%), Positives = 38/84 (45%), Gaps = 5/84 (5%)

Query: 69  PQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
           P+ +  +   E    +++ + +  E+ + +   E  + E    E    E E+++ + E +
Sbjct: 140 PENLLHALQQEVLTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAELEEKQQELEAQ 199

Query: 129 VEEEEEEQKNKKKEEEEEEEKEKV 152
           +E+ +E+       E  +E K+K 
Sbjct: 200 LEQLQEKA-----AETSQERKQKR 218


>gnl|CDD|227701 COG5414, COG5414, TATA-binding protein-associated factor
           [Transcription].
          Length = 392

 Score = 29.7 bits (66), Expect = 0.77
 Identities = 21/82 (25%), Positives = 45/82 (54%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
           E E ++  E ++E++++E +  E  K E + +  ++  ++KEE+ E EE E+  +   +E
Sbjct: 283 EIENKEVSEGDKEQQQEEVENAEAHKEEVQSDRPDEIGEEKEEDDENEENERHTELLADE 342

Query: 145 EEEEKEKVEMYRNPVPMYRNPV 166
             E ++ +E  R  +    NP+
Sbjct: 343 LNELEKGIEEKRRQMESATNPI 364



 Score = 29.3 bits (65), Expect = 0.88
 Identities = 19/65 (29%), Positives = 32/65 (49%)

Query: 82  EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
           E ++EE E  E  +EE +    DE  E+K E  E EE ++  +   +   E E+   +K+
Sbjct: 295 EQQQEEVENAEAHKEEVQSDRPDEIGEEKEEDDENEENERHTELLADELNELEKGIEEKR 354

Query: 142 EEEEE 146
            + E 
Sbjct: 355 RQMES 359



 Score = 27.4 bits (60), Expect = 3.9
 Identities = 24/70 (34%), Positives = 38/70 (54%)

Query: 82  EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
           E E EE   EE+ +    E E  E  E  +E+++EE E  +  KEE   +  +E   +K+
Sbjct: 265 EEEGEEGMSEEDLDVGAAEIENKEVSEGDKEQQQEEVENAEAHKEEVQSDRPDEIGEEKE 324

Query: 142 EEEEEEEKEK 151
           E++E EE E+
Sbjct: 325 EDDENEENER 334



 Score = 26.2 bits (57), Expect = 10.0
 Identities = 15/70 (21%), Positives = 37/70 (52%)

Query: 75  SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
           SE  +  + EE E  +  +EE + +  ++  +E+++ ++ EE E   +   +E  E E+ 
Sbjct: 290 SEGDKEQQQEEVENAEAHKEEVQSDRPDEIGEEKEEDDENEENERHTELLADELNELEKG 349

Query: 135 EQKNKKKEEE 144
            ++ +++ E 
Sbjct: 350 IEEKRRQMES 359


>gnl|CDD|179867 PRK04598, tatA, twin arginine translocase protein A; Provisional.
          Length = 81

 Score = 27.9 bits (62), Expect = 0.79
 Identities = 16/41 (39%), Positives = 23/41 (56%), Gaps = 3/41 (7%)

Query: 89  EKEEEEEEEEEEKEKDEDEEKK---REKKEEEEEKQKDKKE 126
           +K   EEE  +  +KD D E K   + K E   EK+KDK++
Sbjct: 40  KKAMSEEESAKANKKDADFEPKNLEQAKTEAAAEKKKDKEQ 80



 Score = 26.4 bits (58), Expect = 2.7
 Identities = 11/36 (30%), Positives = 22/36 (61%)

Query: 116 EEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
           EEE  + +KK+ + E +  EQ   +   E++++KE+
Sbjct: 45  EEESAKANKKDADFEPKNLEQAKTEAAAEKKKDKEQ 80


>gnl|CDD|234306 TIGR03674, fen_arch, flap structure-specific endonuclease.
           Endonuclease that cleaves the 5'-overhanging flap
           structure that is generated by displacement synthesis
           when DNA polymerase encounters the 5'-end of a
           downstream Okazaki fragment. Has 5'-endo-/exonuclease
           and 5'-pseudo-Y-endonuclease activities. Cleaves the
           junction between single and double-stranded regions of
           flap DNA.
          Length = 338

 Score = 29.5 bits (67), Expect = 0.81
 Identities = 13/30 (43%), Positives = 19/30 (63%)

Query: 103 KDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
           K E  E++RE +EE EEK ++  E+   EE
Sbjct: 87  KAETLEERREIREEAEEKWEEALEKGDLEE 116


>gnl|CDD|240419 PTZ00440, PTZ00440, reticulocyte binding protein 2-like protein;
            Provisional.
          Length = 2722

 Score = 29.8 bits (67), Expect = 0.81
 Identities = 20/81 (24%), Positives = 39/81 (48%)

Query: 78   LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
            LE  + EE  KE+E  E E +E+   +E   ++++++ ++EE+QK   +E +  E     
Sbjct: 2569 LEKAKIEESRKERERIESETQEDNTDEEQINRQQQERLQKEEEQKAYSQERLNREVSGTD 2628

Query: 138  NKKKEEEEEEEKEKVEMYRNP 158
            +  K      ++     Y N 
Sbjct: 2629 DTNKNHNTGHDESNYGRYSNK 2649


>gnl|CDD|222812 PHA00727, PHA00727, hypothetical protein.
          Length = 278

 Score = 29.4 bits (66), Expect = 0.81
 Identities = 22/75 (29%), Positives = 40/75 (53%), Gaps = 5/75 (6%)

Query: 83  WEEEEKEKEEEEE-----EEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
           WEEE ++ +  EE     EE +++  D    K+  K  E+ E +  K++ E  + E  +K
Sbjct: 10  WEEELRKAQSLEELKQKYEEAQKQIADGKTLKRLYKVYEKREFELKKQQFEQLKAELSKK 69

Query: 138 NKKKEEEEEEEKEKV 152
            KK ++E+ + + KV
Sbjct: 70  KKKFKKEKVDVRVKV 84


>gnl|CDD|216546 pfam01516, Orbi_VP6, Orbivirus helicase VP6.  The VP6 protein a
           minor protein in the core of the virion is probably the
           viral helicase.
          Length = 322

 Score = 29.3 bits (65), Expect = 0.82
 Identities = 13/49 (26%), Positives = 32/49 (65%)

Query: 82  EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVE 130
           +W E  + KE+E ++E+ ++++  D E ++++K  +EE  K+ ++  V+
Sbjct: 26  DWFESGESKEKEPKDEDGQEQRISDGEGEQKQKGGKEESDKETEDASVD 74



 Score = 28.5 bits (63), Expect = 1.7
 Identities = 18/48 (37%), Positives = 30/48 (62%)

Query: 93  EEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
           E  E +E++ KDED +++R    E E+KQK  KEE  +E E+   +++
Sbjct: 29  ESGESKEKEPKDEDGQEQRISDGEGEQKQKGGKEESDKETEDASVDRR 76


>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein. 
          Length = 198

 Score = 29.0 bits (65), Expect = 0.82
 Identities = 17/64 (26%), Positives = 34/64 (53%), Gaps = 5/64 (7%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
            ++ +K+++E+  EEEK+ + D++K      EE E   D  EE  + +E    N   + +
Sbjct: 65  TKKSKKKDKEKLTEEEKKPESDDDK-----TEENENDPDNNEESGDSQESASANSLSDID 119

Query: 145 EEEE 148
            E++
Sbjct: 120 NEDD 123



 Score = 29.0 bits (65), Expect = 0.93
 Identities = 12/51 (23%), Positives = 26/51 (50%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
           EEE+K + ++++ EE E + D +EE    ++        D   E+  ++ +
Sbjct: 78  EEEKKPESDDDKTEENENDPDNNEESGDSQESASANSLSDIDNEDDMDDSD 128


>gnl|CDD|226138 COG3611, DnaB, Replication initiation/membrane attachment protein
           [DNA replication, recombination, and repair].
          Length = 417

 Score = 29.7 bits (67), Expect = 0.83
 Identities = 15/67 (22%), Positives = 29/67 (43%)

Query: 88  KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
           K K   E  +E  K K+    KK+    + +E        E ++   +++  K E  + E
Sbjct: 346 KVKTAYEAIKENRKRKELALAKKQSSTTKRKENLPKWSNPEYKKTTSQEEQAKLEFIKLE 405

Query: 148 EKEKVEM 154
           E +++E 
Sbjct: 406 ELKRLEN 412



 Score = 27.0 bits (60), Expect = 5.6
 Identities = 12/58 (20%), Positives = 28/58 (48%), Gaps = 2/58 (3%)

Query: 62  ARLTVILPQAISASETLENN--EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEE 117
            R  + L +  S++   + N  +W   E +K   +EE+ + +    +E K+ E  +++
Sbjct: 359 KRKELALAKKQSSTTKRKENLPKWSNPEYKKTTSQEEQAKLEFIKLEELKRLENGKQK 416



 Score = 26.6 bits (59), Expect = 7.2
 Identities = 13/71 (18%), Positives = 32/71 (45%)

Query: 81  NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
           N+W  ++ +   E  +E  ++++    +K+    + +E   K    E  +   +E++ K 
Sbjct: 340 NDWARKKVKTAYEAIKENRKRKELALAKKQSSTTKRKENLPKWSNPEYKKTTSQEEQAKL 399

Query: 141 KEEEEEEEKEK 151
           +  + EE K  
Sbjct: 400 EFIKLEELKRL 410


>gnl|CDD|235033 PRK02363, PRK02363, DNA-directed RNA polymerase subunit delta;
           Reviewed.
          Length = 129

 Score = 28.8 bits (65), Expect = 0.83
 Identities = 14/55 (25%), Positives = 32/55 (58%)

Query: 94  EEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEE 148
           +E +EE    +E  +KK++K  + ++   D      ++ +EE  +++ +E+EE+E
Sbjct: 74  DEIDEEIIPLEEKFDKKKKKFMDGDDDIIDDDILPDDDFDEEDLDEEDDEDEEDE 128



 Score = 26.5 bits (59), Expect = 4.3
 Identities = 18/71 (25%), Positives = 37/71 (52%), Gaps = 1/71 (1%)

Query: 78  LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
           L +N+W        +E +EE    E+  D++KK+   + +++   D    + + +EE+  
Sbjct: 60  LGDNKWGLRSWYPVDEIDEEIIPLEEKFDKKKKKFM-DGDDDIIDDDILPDDDFDEEDLD 118

Query: 138 NKKKEEEEEEE 148
            +  E+EE+EE
Sbjct: 119 EEDDEDEEDEE 129


>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
          Length = 742

 Score = 29.4 bits (66), Expect = 0.84
 Identities = 15/86 (17%), Positives = 43/86 (50%)

Query: 75  SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
            E+   +     +  K+  E+ +++ K+ ++ ++ K  KK + ++K+K KK+  ++++ +
Sbjct: 35  LESDIKDSLLNLDINKKLHEKLDKKNKKFNKTDDLKDSKKTKLKQKKKIKKKLHIDDDYD 94

Query: 135 EQKNKKKEEEEEEEKEKVEMYRNPVP 160
              + K   ++      + + R P P
Sbjct: 95  NFFDSKNNSKQFAGPLAISLMRKPKP 120



 Score = 27.5 bits (61), Expect = 4.2
 Identities = 20/102 (19%), Positives = 50/102 (49%), Gaps = 7/102 (6%)

Query: 71  AISASET--LENNEWEEEEKEKEEEEEEEEEEKEK--DEDEEKKREKKEEEEEKQKDKKE 126
           +IS+  T  L+N  + +  K    +   E + K+   + D  KK  +K +++ K+ +K +
Sbjct: 8   SISSENTIDLKNPLFIKNLKHSSYKIRLESDIKDSLLNLDINKKLHEKLDKKNKKFNKTD 67

Query: 127 EEVEEEEEEQKNKKKEEEE---EEEKEKVEMYRNPVPMYRNP 165
           +  + ++ + K KKK +++   +++ +     +N    +  P
Sbjct: 68  DLKDSKKTKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFAGP 109


>gnl|CDD|227602 COG5277, COG5277, Actin and related proteins [Cytoskeleton].
          Length = 444

 Score = 29.3 bits (66), Expect = 0.87
 Identities = 20/76 (26%), Positives = 31/76 (40%), Gaps = 3/76 (3%)

Query: 75  SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
              L     E   + KEE  E ++E            E+ EEEEEK  +K  E   +  +
Sbjct: 214 KSELVEYSSEIVNEIKEEVCETDDESAY---VSLDAEEEFEEEEEKPAEKSTESTFQLSK 270

Query: 135 EQKNKKKEEEEEEEKE 150
           E    K+ +E  + +E
Sbjct: 271 ETSIAKESKELPDGEE 286


>gnl|CDD|215544 PLN03029, PLN03029, type-a response regulator protein; Provisional.
          Length = 222

 Score = 29.2 bits (65), Expect = 0.92
 Identities = 14/58 (24%), Positives = 35/58 (60%)

Query: 88  KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
           K K + +++E +EK++  +E + + +K+E+  +Q   + +  ++ ++   NK+K  EE
Sbjct: 147 KTKSKNQKQENQEKQEKLEESEIQSEKQEQPSQQPQSQPQPQQQPQQPNNNKRKAMEE 204



 Score = 26.1 bits (57), Expect = 9.0
 Identities = 11/42 (26%), Positives = 30/42 (71%)

Query: 109 KKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
           K + +K+E +EKQ+  +E E++ E++EQ +++ + + + +++
Sbjct: 149 KSKNQKQENQEKQEKLEESEIQSEKQEQPSQQPQSQPQPQQQ 190



 Score = 26.1 bits (57), Expect = 9.9
 Identities = 12/54 (22%), Positives = 33/54 (61%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
           +N + E +EK+++ EE E + EK++   ++ + + + +++ +Q +  + +  EE
Sbjct: 151 KNQKQENQEKQEKLEESEIQSEKQEQPSQQPQSQPQPQQQPQQPNNNKRKAMEE 204


>gnl|CDD|189027 cd09857, PIN_EXO1, PIN domain of Exonuclease-1, a
           structure-specific, divalent-metal-ion dependent, 5'
           nuclease and homologs.  Exonuclease-1 (EXO1) is involved
           in multiple, eukaryotic DNA metabolic pathways,
           including DNA replication processes (5' flap DNA
           endonuclease activity and double stranded DNA
           5'-exonuclease activity), DNA repair processes (DNA
           mismatch repair (MMR) and post-replication repair
           (PRR)), recombination, and telomere integrity. EXO1
           functions in the MMS2 error-free branch of the PRR
           pathway in the maintenance and repair of stalled
           replication forks. Studies also suggest that EXO1 plays
           both structural and catalytic roles during MMR-mediated
           mutation avoidance. EXO1 belongs to the FEN1-EXO1-like
           family of structure-specific, 5' nucleases. These
           nucleases contain a PIN (PilT N terminus) domain with a
           helical arch/clamp region (I domain) of variable length
           (approximately 43 residues in EXO1 PIN domains) and a
           H3TH (helix-3-turn-helix) domain, an atypical
           helix-hairpin-helix-2-like region. Both the H3TH domain
           (not included here) and the helical arch/clamp region
           are involved in DNA binding. Nucleases within this group
           also have a carboxylate-rich active site that is
           involved in binding essential divalent metal ion
           cofactors (Mg2+/Mn2+). EXO1 nucleases also have
           C-terminal Mlh1- and Msh2-binding domains which allow
           interaction with MMR and PRR proteins, respectively.
          Length = 210

 Score = 29.0 bits (66), Expect = 0.93
 Identities = 10/33 (30%), Positives = 14/33 (42%)

Query: 124 KKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYR 156
           KK  E E  E  ++N +K  E   E  + E   
Sbjct: 85  KKGTEEERRERREENLEKALELLREGNRSEARE 117


>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR).  This
           family consists of several bovine specific leukaemia
           virus receptors which are thought to function as
           transmembrane proteins, although their exact function is
           unknown.
          Length = 561

 Score = 29.3 bits (65), Expect = 0.98
 Identities = 21/104 (20%), Positives = 48/104 (46%), Gaps = 5/104 (4%)

Query: 56  KPIRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKE 115
           KP+  + +L V   +    S++ E  +    EK+ ++ +++E++EKEK+ D++KK+E + 
Sbjct: 165 KPLADSEKLPVQKHRNAETSKSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEG 224

Query: 116 EEE-----EKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEM 154
            +      +          E +E    N       + E ++ + 
Sbjct: 225 FKSLLLALDDSPASAASVAEADEASLANTVSGTAPDSEPDEPKD 268



 Score = 27.7 bits (61), Expect = 3.3
 Identities = 14/39 (35%), Positives = 24/39 (61%)

Query: 97  EEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
           EEE + +   E+ KREKK+ E+EK+  ++   +  E +E
Sbjct: 85  EEERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTESDE 123


>gnl|CDD|235302 PRK04456, PRK04456, acetyl-CoA decarbonylase/synthase complex
           subunit beta; Reviewed.
          Length = 463

 Score = 29.3 bits (66), Expect = 1.0
 Identities = 14/20 (70%), Positives = 16/20 (80%)

Query: 83  WEEEEKEKEEEEEEEEEEKE 102
           W  EE+E+EEEEEEEEEE  
Sbjct: 402 WAAEEEEEEEEEEEEEEEPV 421


>gnl|CDD|224217 COG1298, FlhA, Flagellar biosynthesis pathway, component FlhA [Cell
           motility and secretion / Intracellular trafficking and
           secretion].
          Length = 696

 Score = 29.1 bits (66), Expect = 1.0
 Identities = 13/31 (41%), Positives = 21/31 (67%)

Query: 121 QKDKKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
            K K+++  E+E+  ++ KK+EEEEEEE   
Sbjct: 324 SKRKEQQAEEKEKPAEEAKKEEEEEEEESVD 354



 Score = 28.0 bits (63), Expect = 2.4
 Identities = 12/37 (32%), Positives = 23/37 (62%)

Query: 111 REKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
           + K+++ EEK+K  +E + EEEEEE+++       + 
Sbjct: 325 KRKEQQAEEKEKPAEEAKKEEEEEEEESVDDVLLIDP 361



 Score = 28.0 bits (63), Expect = 2.8
 Identities = 13/31 (41%), Positives = 22/31 (70%)

Query: 120 KQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
           K+K+++ EE E+  EE K +++EEEEE   +
Sbjct: 325 KRKEQQAEEKEKPAEEAKKEEEEEEEESVDD 355



 Score = 26.8 bits (60), Expect = 6.1
 Identities = 16/40 (40%), Positives = 24/40 (60%)

Query: 93  EEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
            +E++ EEKEK  +E KK E++EEEE        + +E E
Sbjct: 326 RKEQQAEEKEKPAEEAKKEEEEEEEESVDDVLLIDPIELE 365



 Score = 26.8 bits (60), Expect = 7.3
 Identities = 12/43 (27%), Positives = 25/43 (58%)

Query: 108 EKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
             KR++++ EE+++  ++ ++ EEEEEE+        +  E E
Sbjct: 323 LSKRKEQQAEEKEKPAEEAKKEEEEEEEESVDDVLLIDPIELE 365


>gnl|CDD|233044 TIGR00600, rad2, DNA excision repair protein (rad2).  All proteins
           in this family for which functions are known are flap
           endonucleases that generate the 3' incision next to DNA
           damage as part of nucleotide excision repair. This
           family is related to many other flap endonuclease
           families including the fen1 family. This family is based
           on the phylogenomic analysis of JA Eisen (1999, Ph.D.
           Thesis, Stanford University) [DNA metabolism, DNA
           replication, recombination, and repair].
          Length = 1034

 Score = 29.5 bits (66), Expect = 1.0
 Identities = 16/88 (18%), Positives = 36/88 (40%)

Query: 65  TVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDK 124
             + P      E+  +  + E +      E +   + +  ++ E+  E K    E +  K
Sbjct: 642 MEVEPMESEKEESESDGSFIEVDSVSSTLELQVPSKSQPTDESEENAENKVASIEGEHRK 701

Query: 125 KEEEVEEEEEEQKNKKKEEEEEEEKEKV 152
           + E++  +E E+ N     EEE++ +  
Sbjct: 702 EIEDLLFDESEEDNIVGMIEEEKDADDF 729



 Score = 28.7 bits (64), Expect = 1.6
 Identities = 25/90 (27%), Positives = 39/90 (43%), Gaps = 6/90 (6%)

Query: 70  QAISASETLENNEWEEEEKEKEEEEE--EEEEEKEKDEDEEKKREKKEEEEEKQKDKKEE 127
           Q  S S+  + +E   E K    E E  +E E+   DE EE       EEE+   D K E
Sbjct: 673 QVPSKSQPTDESEENAENKVASIEGEHRKEIEDLLFDESEEDNIVGMIEEEKDADDFKNE 732

Query: 128 ----EVEEEEEEQKNKKKEEEEEEEKEKVE 153
                +EE E  + N   E+   + +++ +
Sbjct: 733 WQDISLEELEALEANLLAEQNSLKAQKQQQ 762


>gnl|CDD|235850 PRK06669, fliH, flagellar assembly protein H; Validated.
          Length = 281

 Score = 28.8 bits (65), Expect = 1.1
 Identities = 24/71 (33%), Positives = 37/71 (52%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
               KEKE   EEEEE+ E+  +E     K+  EE ++   +  E  EEE +++  KK +
Sbjct: 29  VLSIKEKERLREEEEEQVEQLREEANDEAKEIIEEAEEDAFEIVEAAEEEAKEELLKKTD 88

Query: 144 EEEEEKEKVEM 154
           E     EK++M
Sbjct: 89  EASSIIEKLQM 99



 Score = 28.1 bits (63), Expect = 2.2
 Identities = 25/91 (27%), Positives = 39/91 (42%)

Query: 63  RLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK 122
           R            E  +  +   EE E++  E  E  E+E  E+  KK ++     EK +
Sbjct: 39  REEEEEQVEQLREEANDEAKEIIEEAEEDAFEIVEAAEEEAKEELLKKTDEASSIIEKLQ 98

Query: 123 DKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
            + E E EE EEE +   +E + E  +E  E
Sbjct: 99  MQIEREQEEWEEELERLIEEAKAEGYEEGYE 129



 Score = 27.7 bits (62), Expect = 2.8
 Identities = 24/80 (30%), Positives = 40/80 (50%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
           E  E  E   EE   E +E  EE E++  E  E   E+ +EE  K+ D+    +E+ + +
Sbjct: 41  EEEEQVEQLREEANDEAKEIIEEAEEDAFEIVEAAEEEAKEELLKKTDEASSIIEKLQMQ 100

Query: 136 QKNKKKEEEEEEEKEKVEMY 155
            + +++E EEE E+   E  
Sbjct: 101 IEREQEEWEEELERLIEEAK 120



 Score = 26.9 bits (60), Expect = 5.8
 Identities = 21/70 (30%), Positives = 42/70 (60%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           +E+E+ +EEEEE+ E+ +E+  DE K+  ++ EE+  +  +  EE  +EE  +K  +   
Sbjct: 33  KEKERLREEEEEQVEQLREEANDEAKEIIEEAEEDAFEIVEAAEEEAKEELLKKTDEASS 92

Query: 144 EEEEEKEKVE 153
             E+ + ++E
Sbjct: 93  IIEKLQMQIE 102


>gnl|CDD|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region.  The domain
           is found in the primary vegetative sigma factor. The
           function of this domain is unclear and can be removed
           without loss of function.
          Length = 211

 Score = 28.7 bits (65), Expect = 1.1
 Identities = 14/47 (29%), Positives = 23/47 (48%)

Query: 80  NNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE 126
           N           E E +EE+ ++ D+D+E + E  EEE +   D +E
Sbjct: 33  NAAAAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEADLGPDPEE 79



 Score = 27.9 bits (63), Expect = 2.2
 Identities = 13/43 (30%), Positives = 27/43 (62%)

Query: 71  AISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREK 113
           A +    L+  + E+++ + E+E+E++EEE +   D E+ RE+
Sbjct: 41  AAAIESELDEEDLEDDDDDDEDEDEDDEEEADLGPDPEEARER 83



 Score = 27.2 bits (61), Expect = 3.8
 Identities = 14/40 (35%), Positives = 27/40 (67%)

Query: 69  PQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEE 108
           P A +A+ T    E E +E++ E++++++E+E E DE+E 
Sbjct: 32  PNAAAAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEA 71



 Score = 26.8 bits (60), Expect = 5.3
 Identities = 13/48 (27%), Positives = 22/48 (45%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEE 131
                     E E +EE  +D+D++ + E +++EEE       EE  E
Sbjct: 35  AAAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEADLGPDPEEARE 82


>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342).  This
           family of bacterial proteins has no known function. The
           proteins are in the region of 500-600 amino acid
           residues in length.
          Length = 450

 Score = 29.1 bits (66), Expect = 1.1
 Identities = 28/83 (33%), Positives = 47/83 (56%), Gaps = 3/83 (3%)

Query: 78  LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEE---EEEKQKDKKEEEVEEEEE 134
           ++  E +EE KE EEE +E EEE EK +   KK  KK       EK++  ++    +E+ 
Sbjct: 327 VDFPELKEELKELEEELKELEEELEKIKKLLKKLPKKARGQLPPEKREQLEKLLETKEKL 386

Query: 135 EQKNKKKEEEEEEEKEKVEMYRN 157
            ++ ++ EEE +E KE++E   +
Sbjct: 387 SEELEELEEELKELKEELESLYS 409


>gnl|CDD|224273 COG1354, scpA, Rec8/ScpA/Scc1-like protein (kleisin family)
           [Replication,    recombination, and repair].
          Length = 248

 Score = 28.8 bits (65), Expect = 1.1
 Identities = 21/79 (26%), Positives = 41/79 (51%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
           +EE+E E+EE EE  ++     EE +R K+  E   + +++  +V  + + +   KKE  
Sbjct: 77  KEEEEAEDEELEEPRDELVARLEEYERYKEAAELLAELEEERRDVFSKIKPEIKIKKERR 136

Query: 145 EEEEKEKVEMYRNPVPMYR 163
             EE   ++++R    + R
Sbjct: 137 PVEEISLIDLFRAYQKILR 155


>gnl|CDD|237869 PRK14962, PRK14962, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 472

 Score = 29.0 bits (65), Expect = 1.2
 Identities = 18/55 (32%), Positives = 32/55 (58%), Gaps = 1/55 (1%)

Query: 86  EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKK-EEEVEEEEEEQKNK 139
                +E + EE+ +    + +EKK+E+ + +EEKQ+D + E+  +E  EE K K
Sbjct: 335 SSPNVQENDVEEKNDNSNVQQKEKKKEESKAKEEKQEDIEFEKRFKELMEELKEK 389



 Score = 29.0 bits (65), Expect = 1.4
 Identities = 14/41 (34%), Positives = 26/41 (63%)

Query: 107 EEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
           E    EK +    +QK+KK+EE + +EE+Q++ + E+  +E
Sbjct: 341 ENDVEEKNDNSNVQQKEKKKEESKAKEEKQEDIEFEKRFKE 381


>gnl|CDD|218328 pfam04921, XAP5, XAP5, circadian clock regulator.  This protein is
           found in a wide range of eukaryotes. It is a nuclear
           protein and is suggested to be DNA binding. In plants,
           this family is essential for correct circadian clock
           functioning by acting as a light-quality regulator
           coordinating the activities of blue and red light
           signalling pathways during plant growth - inhibiting
           growth in red light but promoting growth in blue light.
          Length = 233

 Score = 28.9 bits (65), Expect = 1.2
 Identities = 20/76 (26%), Positives = 40/76 (52%), Gaps = 8/76 (10%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQ--------KDKKEEEVEEEEEE 135
           ++EE++++E E+E++  KE  E +E      +++  K          DK  EE E E  E
Sbjct: 16  DDEEEDEDEGEDEKKVPKESSEPDEANVNPNKKKIGKNPSVDTSFLPDKAREEKEAELRE 75

Query: 136 QKNKKKEEEEEEEKEK 151
           +  ++  +++E  KE+
Sbjct: 76  ELREEFLKKQEAVKEE 91


>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
           potential nuclease [General function prediction only].
          Length = 290

 Score = 28.7 bits (64), Expect = 1.2
 Identities = 21/75 (28%), Positives = 35/75 (46%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
           E  E E E +E +E  +  E E  + E+  KK   +  + +K+ D+ E  VE  EEE  +
Sbjct: 157 ELEELEAEYEEVQERLKRLEVENSRLEEMLKKLPGEVYDLKKRWDELEPGVELPEEELIS 216

Query: 139 KKKEEEEEEEKEKVE 153
              +E      + +E
Sbjct: 217 DLVKETLNLAPKDIE 231


>gnl|CDD|235396 PRK05299, rpsB, 30S ribosomal protein S2; Provisional.
          Length = 258

 Score = 28.6 bits (65), Expect = 1.2
 Identities = 14/36 (38%), Positives = 24/36 (66%)

Query: 112 EKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
           E ++    +  +++EEE EEEEEE++ ++ EE E E
Sbjct: 223 EGRQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258



 Score = 28.6 bits (65), Expect = 1.5
 Identities = 14/33 (42%), Positives = 23/33 (69%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEE 117
           +    +  EEEEEE E+E++E+EE++ E+ E E
Sbjct: 226 QGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258



 Score = 28.6 bits (65), Expect = 1.6
 Identities = 15/33 (45%), Positives = 22/33 (66%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEE 116
           +    E  EEEEEE EE+E++E+EE+  E + E
Sbjct: 226 QGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258



 Score = 28.2 bits (64), Expect = 1.9
 Identities = 21/57 (36%), Positives = 32/57 (56%), Gaps = 5/57 (8%)

Query: 58  IRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKK 114
           IR+    T  +  AI     LE  +    E  +EEEEE EEEE+E++E+E ++ E +
Sbjct: 207 IRSIKLYTSKIADAI-----LEGRQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258



 Score = 27.8 bits (63), Expect = 2.7
 Identities = 15/36 (41%), Positives = 23/36 (63%)

Query: 118 EEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
           E +Q    E   EEEEE ++ +++EEEEE E+ + E
Sbjct: 223 EGRQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258



 Score = 27.8 bits (63), Expect = 2.8
 Identities = 13/33 (39%), Positives = 23/33 (69%)

Query: 86  EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEE 118
           + +  E  EEEEEE +E++E+EE++  ++ E E
Sbjct: 226 QGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258



 Score = 27.1 bits (61), Expect = 4.5
 Identities = 12/36 (33%), Positives = 23/36 (63%)

Query: 99  EEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
           E ++    E  + E++E EEE++++++EE  E E E
Sbjct: 223 EGRQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258



 Score = 27.1 bits (61), Expect = 5.1
 Identities = 13/36 (36%), Positives = 24/36 (66%)

Query: 108 EKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           E ++ +  E  E+++++ EEE EEEEEE+  + + E
Sbjct: 223 EGRQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258



 Score = 27.1 bits (61), Expect = 5.3
 Identities = 15/31 (48%), Positives = 21/31 (67%)

Query: 89  EKEEEEEEEEEEKEKDEDEEKKREKKEEEEE 119
              E  EEEEEE E++E+EE++ E +E E E
Sbjct: 228 RLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258


>gnl|CDD|218688 pfam05680, ATP-synt_E, ATP synthase E chain.  This family consists
           of several ATP synthase E chain sequences which are
           components of the CF(0) subunit.
          Length = 83

 Score = 27.4 bits (61), Expect = 1.2
 Identities = 9/34 (26%), Positives = 22/34 (64%)

Query: 113 KKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEE 146
           KK+EE+ ++ + +E+ +E+ +     KK+ ++E 
Sbjct: 33  KKKEEKIREYEAQEKLIEKAKAAYAKKKQAKQEA 66


>gnl|CDD|227447 COG5117, NOC3, Protein involved in the nuclear export of
           pre-ribosomes [Translation, ribosomal structure and
           biogenesis / Intracellular trafficking and secretion].
          Length = 657

 Score = 29.2 bits (65), Expect = 1.2
 Identities = 15/50 (30%), Positives = 22/50 (44%), Gaps = 2/50 (4%)

Query: 81  NEWEEEEKEKEEEEEEEEEEKEKDE--DEEKKREKKEEEEEKQKDKKEEE 128
           N  EE   E E E   E +   KDE     ++++   E   KQ+   E+E
Sbjct: 89  NNGEEFLSESESEASLEIDSDIKDEKQKSLEEQKIAPEIPVKQQIDSEKE 138


>gnl|CDD|235604 PRK05778, PRK05778, 2-oxoglutarate ferredoxin oxidoreductase
           subunit beta; Validated.
          Length = 301

 Score = 28.7 bits (65), Expect = 1.3
 Identities = 12/51 (23%), Positives = 25/51 (49%), Gaps = 1/51 (1%)

Query: 113 KKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKV-EMYRNPVPMY 162
           ++  ++   K K EE+ +  + ++  +K  EEE   K  +   Y+N  P +
Sbjct: 221 REYYKKRVYKLKLEEDYDPTDRDKAAEKMLEEELGGKIPIGVFYKNERPTF 271


>gnl|CDD|223683 COG0610, COG0610, Type I site-specific restriction-modification
           system, R (restriction) subunit and related helicases
           [Defense mechanisms].
          Length = 962

 Score = 28.9 bits (65), Expect = 1.3
 Identities = 17/62 (27%), Positives = 31/62 (50%), Gaps = 1/62 (1%)

Query: 81  NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
            EW E+ ++K++  E   E   +    +K    ++ EE     KKEEE ++  EE+   +
Sbjct: 844 KEWIEDLRQKKKLIERLIEAINQYR-AKKLDTAEKLEELYILAKKEEEFKQFAEEEGLNE 902

Query: 141 KE 142
           +E
Sbjct: 903 EE 904



 Score = 27.4 bits (61), Expect = 4.7
 Identities = 14/73 (19%), Positives = 29/73 (39%)

Query: 81  NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
               E   +   ++ +  E+ E+     KK E+ ++  E++   +EE    ++      K
Sbjct: 858 ERLIEAINQYRAKKLDTAEKLEELYILAKKEEEFKQFAEEEGLNEEELAFYDDLALNGGK 917

Query: 141 KEEEEEEEKEKVE 153
             E   E  EK+ 
Sbjct: 918 LPENGTELVEKLA 930



 Score = 26.6 bits (59), Expect = 8.4
 Identities = 19/106 (17%), Positives = 43/106 (40%), Gaps = 11/106 (10%)

Query: 58  IRANARLTVILPQAISASETLENNEWEEEEKEKEEEE-----------EEEEEEKEKDED 106
           +  N      +    S +  L++ ++E E   K+E              ++    E  ++
Sbjct: 778 LIDNLNDAREIKDKESKAIDLDDIDFELELIGKQEINIDYILELLQTFNDKNGAYESLKE 837

Query: 107 EEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKV 152
             ++  K+  E+ +QK K  E + E   + + KK +  E+ E+  +
Sbjct: 838 LIERIIKEWIEDLRQKKKLIERLIEAINQYRAKKLDTAEKLEELYI 883


>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
          Length = 106

 Score = 28.0 bits (63), Expect = 1.3
 Identities = 13/25 (52%), Positives = 17/25 (68%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEE 108
                  EE++EEEEEE+EK+E EE
Sbjct: 71  AAAAAAAEEKKEEEEEEEEKEESEE 95



 Score = 27.2 bits (61), Expect = 1.8
 Identities = 13/24 (54%), Positives = 18/24 (75%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDE 107
                E+++EEEEEEEEKE+ E+E
Sbjct: 73  AAAAAEEKKEEEEEEEEKEESEEE 96



 Score = 26.8 bits (60), Expect = 2.5
 Identities = 10/25 (40%), Positives = 18/25 (72%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEE 108
                 +E++EEEEEEE++++ +EE
Sbjct: 72  AAAAAAEEKKEEEEEEEEKEESEEE 96



 Score = 26.1 bits (58), Expect = 4.6
 Identities = 12/25 (48%), Positives = 18/25 (72%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEE 108
              E++KEEEEEEEE+E+ ++E   
Sbjct: 75  AAAEEKKEEEEEEEEKEESEEEAAA 99


>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family.  This
           family consists of the 116kDa V-type ATPase (vacuolar
           (H+)-ATPases) subunits, as well as V-type ATP synthase
           subunit i. The V-type ATPases family are proton pumps
           that acidify intracellular compartments in eukaryotic
           cells for example yeast central vacuoles,
           clathrin-coated and synaptic vesicles. They have
           important roles in membrane trafficking processes. The
           116kDa subunit (subunit a) in the V-type ATPase is part
           of the V0 functional domain responsible for proton
           transport. The a subunit is a transmembrane glycoprotein
           with multiple putative transmembrane helices it has a
           hydrophilic amino terminal and a hydrophobic carboxy
           terminal. It has roles in proton transport and assembly
           of the V-type ATPase complex. This subunit is encoded by
           two homologous gene in yeast VPH1 and STV1.
          Length = 707

 Score = 28.9 bits (65), Expect = 1.3
 Identities = 13/65 (20%), Positives = 28/65 (43%), Gaps = 3/65 (4%)

Query: 92  EEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE---EEQKNKKKEEEEEEE 148
           +   + +    K+  + ++     E E K+ ++  E +E+E    EE  N   EE+   +
Sbjct: 56  DTGGKPDVPPSKEFLDLEEEILDLEAEIKEVEENLESLEKEINELEEWLNVLDEEKSFLD 115

Query: 149 KEKVE 153
           +   E
Sbjct: 116 ENLEE 120



 Score = 28.1 bits (63), Expect = 2.2
 Identities = 18/63 (28%), Positives = 30/63 (47%)

Query: 86  EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
            +    +E  + EEE    E E K+ E+  E  EK+ ++ EE +   +EE+    +  EE
Sbjct: 61  PDVPPSKEFLDLEEEILDLEAEIKEVEENLESLEKEINELEEWLNVLDEEKSFLDENLEE 120

Query: 146 EEE 148
             E
Sbjct: 121 LSE 123



 Score = 27.3 bits (61), Expect = 4.5
 Identities = 17/68 (25%), Positives = 29/68 (42%), Gaps = 2/68 (2%)

Query: 83  WEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKE 142
            +   K      +E  + +E+  D E   E KE EE  +  +KE    EE     +++K 
Sbjct: 55  KDTGGKPDVPPSKEFLDLEEEILDLEA--EIKEVEENLESLEKEINELEEWLNVLDEEKS 112

Query: 143 EEEEEEKE 150
             +E  +E
Sbjct: 113 FLDENLEE 120


>gnl|CDD|221893 pfam13010, pRN1_helical, Primase helical domain.  This alpha
           helical domain is found in a set of bacterial plasmid
           replication proteins. The domain is found to the
           C-terminus of the primase/polymerase domain. Mutants of
           this domain are defective in template binding,
           dinucleotide formation and conformation change prior to
           DNA extension.
          Length = 135

 Score = 28.2 bits (63), Expect = 1.4
 Identities = 20/57 (35%), Positives = 31/57 (54%)

Query: 99  EEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMY 155
           E K+++ED E+  EK +EE  K    K + VE   EE   K K+  EE +++K  + 
Sbjct: 1   EGKKEEEDSEEDFEKLKEEMAKYDRFKGKTVEAIREEVCKKIKKSLEELKEKKKAIL 57


>gnl|CDD|225177 COG2268, COG2268, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 548

 Score = 29.0 bits (65), Expect = 1.4
 Identities = 20/88 (22%), Positives = 39/88 (44%), Gaps = 8/88 (9%)

Query: 72  ISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKRE----KKEEEEEKQ----KD 123
           I A    E   W+ E + + E+ E   E+  ++E  + ++E    K  E  E +    + 
Sbjct: 270 ILAETEAEVAAWKAETRREAEQAEILAEQAIQEEKAQAEQEVQHAKALEAREMRVGLIER 329

Query: 124 KKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
           +KE E+E +E        + + +EE + 
Sbjct: 330 QKETELEPQERSYFINAAQRQAQEEAKA 357



 Score = 26.7 bits (59), Expect = 7.8
 Identities = 15/104 (14%), Positives = 34/104 (32%), Gaps = 5/104 (4%)

Query: 58  IRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEE 117
               A     +  A +  +        E++   +  E+  E +    E E +    K E 
Sbjct: 226 AENEAEKETEIAIAEANRDAKLVELEVEQQPAGKTAEQTREVKIILAETEAEVAAWKAET 285

Query: 118 EEKQKDKKEE-----EVEEEEEEQKNKKKEEEEEEEKEKVEMYR 156
             + +  +       + E+ + EQ+ +  +  E  E     + R
Sbjct: 286 RREAEQAEILAEQAIQEEKAQAEQEVQHAKALEAREMRVGLIER 329



 Score = 26.3 bits (58), Expect = 8.4
 Identities = 19/94 (20%), Positives = 34/94 (36%), Gaps = 5/94 (5%)

Query: 60  ANARLTVILPQAISASETLENNEWEEEEKEKEEEEE-----EEEEEKEKDEDEEKKREKK 114
              + T + PQ  S        + +EE K      E      E   +   E EE +R ++
Sbjct: 328 ERQKETELEPQERSYFINAAQRQAQEEAKAAANIAEAIGAQAEAAVETARETEEAERAEQ 387

Query: 115 EEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEE 148
                  +  ++E+VE     +  K + E +  E
Sbjct: 388 AALVAAAEAAEQEQVEIAVRAEAAKAEAEAQAAE 421


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 28.9 bits (65), Expect = 1.4
 Identities = 19/112 (16%), Positives = 47/112 (41%), Gaps = 6/112 (5%)

Query: 46   QCQVGPAQYHKPIRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDE 105
              +V   QY  P  +          +  +S T +  +   E      +++++ E+K   +
Sbjct: 1275 PKRVSAVQYSPPPPSKRPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARK 1334

Query: 106  DEEKKREKKEEEEE------KQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
             + K R K+    +      + + KK +   E++++ +    E+E++E+ E 
Sbjct: 1335 KKSKTRVKQASASQSSRLLRRPRKKKSDSSSEDDDDSEVDDSEDEDDEDDED 1386



 Score = 28.5 bits (64), Expect = 2.2
 Identities = 23/84 (27%), Positives = 38/84 (45%), Gaps = 12/84 (14%)

Query: 84   EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKK-----------EEEVEEE 132
            ++  K+K E+   EEEE   +ED+E   E  EEE                   +E+VE+ 
Sbjct: 1049 KDIIKKKSEKITAEEEEGA-EEDDEADDEDDEEELGAAVSYDYLLSMPIWSLTKEKVEKL 1107

Query: 133  EEEQKNKKKEEEEEEEKEKVEMYR 156
              E + K+KE E+ +     +M+ 
Sbjct: 1108 NAELEKKEKELEKLKNTTPKDMWL 1131



 Score = 28.1 bits (63), Expect = 2.5
 Identities = 26/93 (27%), Positives = 44/93 (47%), Gaps = 8/93 (8%)

Query: 62   ARLTVILPQAISASETLENNE----WEEE----EKEKEEEEEEEEEEKEKDEDEEKKREK 113
             +L   L +     E L+N      W E+    E+  EE+EE EE+E  K++  + K + 
Sbjct: 1105 EKLNAELEKKEKELEKLKNTTPKDMWLEDLDKFEEALEEQEEVEEKEIAKEQRLKSKTKG 1164

Query: 114  KEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEE 146
            K  +  K K KK+E+ +++    K+KK      
Sbjct: 1165 KASKLRKPKLKKKEKKKKKSSADKSKKASVVGN 1197



 Score = 27.7 bits (62), Expect = 3.8
 Identities = 22/100 (22%), Positives = 46/100 (46%), Gaps = 18/100 (18%)

Query: 76   ETLENNEWEEEEKEKEEEE-----------EEEEEEKEKDEDEEKKREKKEEEEEKQK-- 122
            E +E    E E+KEKE E+           E+ ++ +E  E++E+  EK+  +E++ K  
Sbjct: 1102 EKVEKLNAELEKKEKELEKLKNTTPKDMWLEDLDKFEEALEEQEEVEEKEIAKEQRLKSK 1161

Query: 123  -----DKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYRN 157
                  K  +   +++E++K K   ++ ++        R 
Sbjct: 1162 TKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRV 1201



 Score = 27.3 bits (61), Expect = 5.5
 Identities = 19/74 (25%), Positives = 35/74 (47%), Gaps = 1/74 (1%)

Query: 76   ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
            E LE  E E EEKE  +E+  + + K K     K + KK+E+++K+    + +       
Sbjct: 1139 EALEEQE-EVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGN 1197

Query: 136  QKNKKKEEEEEEEK 149
             K    +E+ + + 
Sbjct: 1198 SKRVDSDEKRKLDD 1211



 Score = 26.5 bits (59), Expect = 7.7
 Identities = 27/109 (24%), Positives = 47/109 (43%), Gaps = 26/109 (23%)

Query: 70   QAISASETLENNEWEEEEKEKEEEEEEEEEE------------------KEKDEDEEKKR 111
                + +     E   EE ++ ++E++EEE                   KEK E    + 
Sbjct: 1052 IKKKSEKITAEEEEGAEEDDEADDEDDEEELGAAVSYDYLLSMPIWSLTKEKVEKLNAEL 1111

Query: 112  EKKEEEEEKQKDKKEE--------EVEEEEEEQKNKKKEEEEEEEKEKV 152
            EKKE+E EK K+   +        + EE  EEQ+  +++E  +E++ K 
Sbjct: 1112 EKKEKELEKLKNTTPKDMWLEDLDKFEEALEEQEEVEEKEIAKEQRLKS 1160


>gnl|CDD|223363 COG0286, HsdM, Type I restriction-modification system
           methyltransferase subunit [Defense mechanisms].
          Length = 489

 Score = 28.9 bits (65), Expect = 1.5
 Identities = 17/73 (23%), Positives = 35/73 (47%), Gaps = 1/73 (1%)

Query: 81  NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
            E++E E   +    EE  E + + +  +   ++EEE+    D+  +E+ E + E    +
Sbjct: 416 REFKEIEGFSKSVSLEEIRENDYNLNPGRYVAEEEEEDIDL-DELLQELSEIDAELAELE 474

Query: 141 KEEEEEEEKEKVE 153
           KE EE  ++  + 
Sbjct: 475 KELEEILKELLLN 487


>gnl|CDD|235285 PRK04335, PRK04335, cell division protein ZipA; Provisional.
          Length = 313

 Score = 28.6 bits (64), Expect = 1.5
 Identities = 16/78 (20%), Positives = 34/78 (43%), Gaps = 1/78 (1%)

Query: 70  QAISASETLENNEWEEEEKEKEEEEEEEEE-EKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
           +        E      + + + + E+ E + E+ +DE+  ++ E    +    + + EEE
Sbjct: 99  EEEEDKFEQEEAPIPVQAQSQPQPEKVEPQVEEPRDEEVLEEPEPVAAKVPMAEVQPEEE 158

Query: 129 VEEEEEEQKNKKKEEEEE 146
            E E +E +  K E E +
Sbjct: 159 TEIEVDEPEEPKPEPELD 176


>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
           domain.  This domain is found in a number of different
           types of plant proteins including NAM-like proteins.
          Length = 147

 Score = 28.1 bits (63), Expect = 1.5
 Identities = 14/57 (24%), Positives = 38/57 (66%)

Query: 88  KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
           K KE+   ++ + K+++ ++EK++E++  +   + +K+  E+E+++ E K  K+E++
Sbjct: 67  KAKEKLRRDKLKAKKEEAEKEKEKEERFMKALAEAEKERAELEKKKAEAKLMKEEKK 123



 Score = 27.7 bits (62), Expect = 1.7
 Identities = 23/74 (31%), Positives = 42/74 (56%)

Query: 80  NNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
           +N        +E E+E++E   E    E +K+ K++   +K K KKEE  +E+E+E++  
Sbjct: 36  SNSSPGSTSNEENEDEDDESTAESKRPEGRKKAKEKLRRDKLKAKKEEAEKEKEKEERFM 95

Query: 140 KKEEEEEEEKEKVE 153
           K   E E+E+ ++E
Sbjct: 96  KALAEAEKERAELE 109


>gnl|CDD|223945 COG1013, PorB, Pyruvate:ferredoxin oxidoreductase and related
           2-oxoacid:ferredoxin oxidoreductases, beta subunit
           [Energy production and conversion].
          Length = 294

 Score = 28.4 bits (64), Expect = 1.5
 Identities = 12/74 (16%), Positives = 29/74 (39%), Gaps = 6/74 (8%)

Query: 78  LENNEW-EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQ 136
           +E   W     +  + EEE+ ++         ++  + EE  +       E +++ +++ 
Sbjct: 224 VETGYWPLYRYEPGKAEEEKGKDIPIGIFYPVEEYLEYEERFKHLTKSNPELIQKLQKD- 282

Query: 137 KNKKKEEEEEEEKE 150
                +E  EE K 
Sbjct: 283 ----VDERWEELKR 292


>gnl|CDD|201540 pfam00992, Troponin, Troponin.  Troponin (Tn) contains three
           subunits, Ca2+ binding (TnC), inhibitory (TnI), and
           tropomyosin binding (TnT). this Pfam contains members of
           the TnT subunit. Troponin is a complex of three
           proteins, Ca2+ binding (TnC), inhibitory (TnI), and
           tropomyosin binding (TnT). The troponin complex
           regulates Ca++ induced muscle contraction. This family
           includes troponin T and troponin I. Troponin I binds to
           actin and troponin T binds to tropomyosin.
          Length = 131

 Score = 27.9 bits (63), Expect = 1.6
 Identities = 18/68 (26%), Positives = 34/68 (50%), Gaps = 3/68 (4%)

Query: 81  NEWEEEEKEKEEEEE---EEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
            E E E+++KEEE+E    E     +      + ++  ++   + D+ +EE  + EE+  
Sbjct: 14  EELEFEQRKKEEEKEKYLAERCPPLRLSLSRAELQELCKKLHARIDRLDEERYDIEEKVA 73

Query: 138 NKKKEEEE 145
            K KE E+
Sbjct: 74  KKDKEIED 81


>gnl|CDD|184536 PRK14145, PRK14145, heat shock protein GrpE; Provisional.
          Length = 196

 Score = 28.3 bits (63), Expect = 1.6
 Identities = 17/59 (28%), Positives = 36/59 (61%)

Query: 88  KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEE 146
           +E E+E  +EEE+   +    ++ E   E+E+ Q+++ +++  +E EE K K +++E E
Sbjct: 2   EEVEKEINKEEEKDVNNLSSNEQMEGPPEDEQAQQNQPQQQTVDEIEELKQKLQQKEVE 60


>gnl|CDD|227666 COG5374, COG5374, Uncharacterized conserved protein [Function
           unknown].
          Length = 192

 Score = 28.2 bits (63), Expect = 1.6
 Identities = 9/51 (17%), Positives = 27/51 (52%)

Query: 100 EKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
           E +  + + + R+ +   E  QK+++E     ++  +  ++ ++E  ++KE
Sbjct: 142 EADSTDLKARLRKAQILLEGLQKNQEELFKLLDKYNELREQVQKESSKKKE 192



 Score = 27.1 bits (60), Expect = 3.5
 Identities = 11/53 (20%), Positives = 28/53 (52%)

Query: 99  EEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
           ++ E D  + K R +K +   +   K +EE+ +  ++    +++ ++E  K+K
Sbjct: 139 DKMEADSTDLKARLRKAQILLEGLQKNQEELFKLLDKYNELREQVQKESSKKK 191


>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like).  This family
           includes outer membrane proteins such as OmpH among
           others. Skp (OmpH) has been characterized as a molecular
           chaperone that interacts with unfolded proteins as they
           emerge in the periplasm from the Sec translocation
           machinery.
          Length = 157

 Score = 28.0 bits (63), Expect = 1.6
 Identities = 16/88 (18%), Positives = 52/88 (59%), Gaps = 3/88 (3%)

Query: 69  PQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
           P   +A + LE    E ++ + E +++E+E +KE+ + +++     EE  + ++ + +++
Sbjct: 31  PAGKAAQKQLEK---EFKKLQAELQKKEKELQKEEQKLQKQAATLSEEARKAKQQELQQK 87

Query: 129 VEEEEEEQKNKKKEEEEEEEKEKVEMYR 156
            +E +++Q+  ++E ++++++    +Y 
Sbjct: 88  QQELQQKQQAAQQELQQKQQELLQPIYD 115


>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6.  The surfeit locus
           protein SURF-6 is shown to be a component of the
           nucleolar matrix and has a strong binding capacity for
           nucleic acids.
          Length = 206

 Score = 28.4 bits (64), Expect = 1.6
 Identities = 16/59 (27%), Positives = 27/59 (45%)

Query: 92  EEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
           E   E+   K +     KK+++KE ++++   K E E  + EE +  KK    E  E  
Sbjct: 5   EALLEQRRRKREQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPIENAEGN 63


>gnl|CDD|224106 COG1185, Pnp, Polyribonucleotide nucleotidyltransferase
           (polynucleotide phosphorylase) [Translation, ribosomal
           structure and biogenesis].
          Length = 692

 Score = 28.7 bits (65), Expect = 1.7
 Identities = 14/64 (21%), Positives = 31/64 (48%), Gaps = 1/64 (1%)

Query: 88  KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
           K+K E E    +E+ + +  +   E + +E    ++K+E     +  ++K +++   EEE
Sbjct: 227 KKKWELEPPSLDEELEAKVRDL-AEDELKEAVGIREKQERSAALDAIKEKIEEELSGEEE 285

Query: 148 EKEK 151
              K
Sbjct: 286 SSLK 289


>gnl|CDD|223997 COG1071, AcoA, Pyruvate/2-oxoglutarate dehydrogenase complex,
           dehydrogenase (E1) component, eukaryotic type, alpha
           subunit [Energy production and conversion].
          Length = 358

 Score = 28.4 bits (64), Expect = 1.7
 Identities = 16/60 (26%), Positives = 26/60 (43%), Gaps = 6/60 (10%)

Query: 93  EEEEEEEEKEKD--EDEEKKREKK----EEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEE 146
            +EE EE K++D      K   +     EEE E  + + + EV+E  E  +     +  E
Sbjct: 277 SKEEVEEWKKRDPIVRLRKYLIEAGILSEEELEAIEAEAKAEVDEAVEFAEASPYPDVSE 336


>gnl|CDD|235638 PRK05896, PRK05896, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 605

 Score = 28.7 bits (64), Expect = 1.7
 Identities = 18/103 (17%), Positives = 41/103 (39%)

Query: 44  EFQCQVGPAQYHKPIRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEK 103
           EF+  +       P +   +   I  +    +         ++    EE  +        
Sbjct: 352 EFEIYLYKIINANPDKLENKKIEISIKPEKKTINNSQIITTKKINIVEESNKNSVHFDTL 411

Query: 104 DEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEE 146
            + +    +KK  +  K++D K+EE+ E+E  +K++K  + +E
Sbjct: 412 YKTKIFYHKKKINQNNKEQDIKKEELLEKEFVKKSEKIPKNDE 454



 Score = 26.3 bits (58), Expect = 9.8
 Identities = 15/77 (19%), Positives = 34/77 (44%), Gaps = 2/77 (2%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEE--KKREKKEEEEEKQKDKKEEEVEEEEEEQ 136
           +  E   + ++K     +    K+ +  EE  K     +   + +    ++++ +  +EQ
Sbjct: 371 KKIEISIKPEKKTINNSQIITTKKINIVEESNKNSVHFDTLYKTKIFYHKKKINQNNKEQ 430

Query: 137 KNKKKEEEEEEEKEKVE 153
             KK+E  E+E  +K E
Sbjct: 431 DIKKEELLEKEFVKKSE 447


>gnl|CDD|235672 PRK06012, flhA, flagellar biosynthesis protein FlhA; Validated.
          Length = 697

 Score = 28.5 bits (65), Expect = 1.8
 Identities = 12/27 (44%), Positives = 17/27 (62%)

Query: 109 KKREKKEEEEEKQKDKKEEEVEEEEEE 135
           +KREKK  E   ++ ++EE  E EEE 
Sbjct: 327 RKREKKAAELAAEEAEEEEAAEPEEES 353



 Score = 27.8 bits (63), Expect = 3.1
 Identities = 10/26 (38%), Positives = 15/26 (57%)

Query: 120 KQKDKKEEEVEEEEEEQKNKKKEEEE 145
           K++ K  E   EE EE++  + EEE 
Sbjct: 328 KREKKAAELAAEEAEEEEAAEPEEES 353


>gnl|CDD|225887 COG3351, FlaD, Putative archaeal flagellar protein D/E [Cell
           motility and secretion].
          Length = 214

 Score = 28.3 bits (63), Expect = 1.8
 Identities = 18/63 (28%), Positives = 32/63 (50%)

Query: 86  EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
              E+ +E+ EE   + K  D+E   E  E  E+ +++   E+V+  EEE   K+K   E
Sbjct: 18  FALEELKEKIEELPIQAKKSDDELVEELPERYEQTKENSLIEKVDSIEEEISEKEKVMSE 77

Query: 146 EEE 148
           + +
Sbjct: 78  KLK 80



 Score = 26.7 bits (59), Expect = 6.2
 Identities = 23/83 (27%), Positives = 38/83 (45%), Gaps = 1/83 (1%)

Query: 80  NNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
              + EE+ EK E+      E+ K++ EE   + K+ ++E  ++  E   + +E     K
Sbjct: 1   KKPYLEEKIEKAEKVTAFALEELKEKIEELPIQAKKSDDELVEELPERYEQTKENSLIEK 60

Query: 140 KKE-EEEEEEKEKVEMYRNPVPM 161
               EEE  EKEKV   +   P 
Sbjct: 61  VDSIEEEISEKEKVMSEKLKEPA 83


>gnl|CDD|234352 TIGR03779, Bac_Flav_CT_M, Bacteroides conjugative transposon TraM
           protein.  Members of this protein family are designated
           TraM and are found in a proposed transfer region of a
           class of conjugative transposon found in the Bacteroides
           lineage [Cellular processes, DNA transformation].
          Length = 410

 Score = 28.5 bits (64), Expect = 1.9
 Identities = 15/76 (19%), Positives = 28/76 (36%), Gaps = 1/76 (1%)

Query: 41  DDAEFQCQVGPAQYHKPIRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEE-EE 99
            D E       A   +PIR++A     + + + +       + E+E   + EE E     
Sbjct: 102 FDEEPDEPAETAGSLRPIRSSAAAYRDINRELGSFYEYPKTDEEKELLREVEELESRLAT 161

Query: 100 EKEKDEDEEKKREKKE 115
           E     + E++    E
Sbjct: 162 EPSPAPELEEQLALME 177



 Score = 27.8 bits (62), Expect = 3.2
 Identities = 14/68 (20%), Positives = 24/68 (35%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           EE ++  E                +  RE     E  + D+++E + E EE +     E 
Sbjct: 104 EEPDEPAETAGSLRPIRSSAAAYRDINRELGSFYEYPKTDEEKELLREVEELESRLATEP 163

Query: 144 EEEEEKEK 151
               E E+
Sbjct: 164 SPAPELEE 171



 Score = 27.4 bits (61), Expect = 4.5
 Identities = 11/79 (13%), Positives = 34/79 (43%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
           ++     ++ +  E+E  E+++K+K    +   +  E+ +++ +D    +   +EE  + 
Sbjct: 50  KDEGIIGDKMKAYEQEALEQKQKDKKRTLQDLADSFEKSDKESQDYSSSDSGFDEEPDEP 109

Query: 139 KKKEEEEEEEKEKVEMYRN 157
            +        +     YR+
Sbjct: 110 AETAGSLRPIRSSAAAYRD 128


>gnl|CDD|220297 pfam09581, Spore_III_AF, Stage III sporulation protein AF
           (Spore_III_AF).  This family represents the stage III
           sporulation protein AF (Spore_III_AF) of the bacterial
           endospore formation program, which exists in some but
           not all members of the Firmicutes (formerly called
           low-GC Gram-positives). The C-terminal region of these
           proteins is poorly conserved.
          Length = 185

 Score = 28.0 bits (63), Expect = 1.9
 Identities = 24/77 (31%), Positives = 40/77 (51%), Gaps = 6/77 (7%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEE------EEEKQKDKKEEEVEEEEEEQKN 138
           E++ ++E   + ++ E E DED E      +E      EE K+K K + E    + +   
Sbjct: 93  EKKLKEEYGVKVKDVEVEIDEDLESNNFDIKEVNVTLKEESKEKQKSKVEPVVIDTQTSK 152

Query: 139 KKKEEEEEEEKEKVEMY 155
            K+EEEE EE EK++ +
Sbjct: 153 PKEEEEESEEAEKIKNF 169


>gnl|CDD|178669 PLN03122, PLN03122, Poly [ADP-ribose] polymerase; Provisional.
          Length = 815

 Score = 28.6 bits (64), Expect = 1.9
 Identities = 18/80 (22%), Positives = 38/80 (47%)

Query: 75  SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
           S+       E +   ++++ E +E E E+   + KK +K+++         E+ V+E EE
Sbjct: 7   SQAHAPAAEEGKGGTRKQKAENKEHEGEQSPKKAKKEKKQDDSGNGNGKSAEDAVKEFEE 66

Query: 135 EQKNKKKEEEEEEEKEKVEM 154
             K  ++    E+ +E +E 
Sbjct: 67  FCKAIEEHLSIEQMREILEE 86


>gnl|CDD|220377 pfam09747, DUF2052, Coiled-coil domain containing protein
           (DUF2052).  This entry is of sequences of two conserved
           domains separated by a region of low complexity,
           spanning some 200 residues. The function is unknown.
          Length = 178

 Score = 28.2 bits (63), Expect = 1.9
 Identities = 27/61 (44%), Positives = 36/61 (59%)

Query: 78  LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
           LE  E +EEE+  EEEEE+ EEE E++E+E ++    EEEEE      E    EE EE +
Sbjct: 71  LELLEQQEEEESAEEEEEDPEEENEEEEEEYQRGPFGEEEEEDGDSYDELPTPEEREELR 130

Query: 138 N 138
            
Sbjct: 131 E 131



 Score = 27.4 bits (61), Expect = 3.4
 Identities = 25/78 (32%), Positives = 42/78 (53%), Gaps = 5/78 (6%)

Query: 73  SASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
             S  LE++   EE + +  E++EEEE  E++E++ ++  ++EEEE      +     EE
Sbjct: 55  GLSGVLESSLDREEARLELLEQQEEEESAEEEEEDPEEENEEEEEEY-----QRGPFGEE 109

Query: 133 EEEQKNKKKEEEEEEEKE 150
           EEE  +   E    EE+E
Sbjct: 110 EEEDGDSYDELPTPEERE 127


>gnl|CDD|148511 pfam06936, Selenoprotein_S, Selenoprotein S (SelS).  This family
           consists of several mammalian selenoprotein S (SelS)
           sequences. SelS is a plasma membrane protein and is
           present in a variety of tissues and cell types. The
           function of this family is unknown.
          Length = 190

 Score = 27.9 bits (62), Expect = 1.9
 Identities = 17/61 (27%), Positives = 36/61 (59%), Gaps = 3/61 (4%)

Query: 95  EEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEM 154
             E+E    D D+  +R+   E  E  + + +E+++ +  + K K+K+ EEE+ ++K+EM
Sbjct: 65  SSEKEITVFDPDKVVRRQ---EALEAARLRMQEDLDAKAAKFKEKQKQLEEEKRRQKIEM 121

Query: 155 Y 155
           +
Sbjct: 122 W 122


>gnl|CDD|165442 PHA03171, PHA03171, UL37 tegument protein; Provisional.
          Length = 499

 Score = 28.5 bits (63), Expect = 2.0
 Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 1/39 (2%)

Query: 67  ILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDE 105
           +  +A   +E  E+N+ E  + E EEE+EEEE E    E
Sbjct: 78  LFAEAGEEAEE-EDNDRECPDTEAEEEDEEEEIEAPDPE 115



 Score = 27.3 bits (60), Expect = 4.1
 Identities = 14/34 (41%), Positives = 19/34 (55%)

Query: 74  ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDE 107
           A E  E  + + E  + E EEE+EEEE E  + E
Sbjct: 82  AGEEAEEEDNDRECPDTEAEEEDEEEEIEAPDPE 115



 Score = 27.3 bits (60), Expect = 4.8
 Identities = 13/31 (41%), Positives = 19/31 (61%)

Query: 82  EWEEEEKEKEEEEEEEEEEKEKDEDEEKKRE 112
           E EEE+ ++E  + E EEE E++E E    E
Sbjct: 85  EAEEEDNDRECPDTEAEEEDEEEEIEAPDPE 115



 Score = 26.6 bits (58), Expect = 7.8
 Identities = 11/30 (36%), Positives = 20/30 (66%)

Query: 89  EKEEEEEEEEEEKEKDEDEEKKREKKEEEE 118
           E  EE EEE+ ++E  + E ++ +++EE E
Sbjct: 81  EAGEEAEEEDNDRECPDTEAEEEDEEEEIE 110



 Score = 26.6 bits (58), Expect = 9.1
 Identities = 12/27 (44%), Positives = 20/27 (74%)

Query: 127 EEVEEEEEEQKNKKKEEEEEEEKEKVE 153
           EE EEE+ +++    E EEE+E+E++E
Sbjct: 84  EEAEEEDNDRECPDTEAEEEDEEEEIE 110



 Score = 26.2 bits (57), Expect = 9.4
 Identities = 12/27 (44%), Positives = 16/27 (59%)

Query: 93  EEEEEEEEKEKDEDEEKKREKKEEEEE 119
           EE EEE+   +  D E + E +EEE E
Sbjct: 84  EEAEEEDNDRECPDTEAEEEDEEEEIE 110


>gnl|CDD|222508 pfam14033, DUF4246, Protein of unknown function (DUF4246).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria and fungi.
           Proteins in this family are typically between 392 and
           644 amino acids in length.
          Length = 496

 Score = 28.4 bits (64), Expect = 2.0
 Identities = 17/72 (23%), Positives = 34/72 (47%)

Query: 86  EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
           E   KE+E EE + E  +DED E   E+ EE  + ++++++    E       +      
Sbjct: 262 EYHPKEKEPEEPKPEPGEDEDYEDYWERLEEYLDWREERRKAIHPEPGTFFPYEPTPPHI 321

Query: 146 EEEKEKVEMYRN 157
           E+  +  + +R+
Sbjct: 322 EKTVDLQKDFRD 333


>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein.  This
           family includes archaebacterial L12, eukaryotic P0, P1
           and P2.
          Length = 88

 Score = 26.8 bits (60), Expect = 2.0
 Identities = 11/22 (50%), Positives = 16/22 (72%)

Query: 85  EEEKEKEEEEEEEEEEKEKDED 106
                 EEE++EEEEE+E+D+D
Sbjct: 61  AAAAAAEEEKKEEEEEEEEDDD 82



 Score = 26.0 bits (58), Expect = 4.3
 Identities = 11/17 (64%), Positives = 15/17 (88%)

Query: 84  EEEEKEKEEEEEEEEEE 100
             EE++KEEEEEEEE++
Sbjct: 65  AAEEEKKEEEEEEEEDD 81



 Score = 25.3 bits (56), Expect = 7.9
 Identities = 10/21 (47%), Positives = 15/21 (71%)

Query: 84  EEEEKEKEEEEEEEEEEKEKD 104
                E+E++EEEEEEE++ D
Sbjct: 62  AAAAAEEEKKEEEEEEEEDDD 82


>gnl|CDD|189037 cd09867, PIN_FEN1, PIN domain of Flap Endonuclease-1, a
           structure-specific, divalent-metal-ion dependent, 5'
           nuclease and homologs.  Flap endonuclease-1 (FEN1) is
           involved in multiple DNA metabolic pathways, including
           DNA replication processes (5' flap DNA endonuclease
           activity and double stranded DNA 5'-exonuclease
           activity) and DNA repair processes (long-patch base
           excision repair) in eukaryotes and archaea. Interaction
           between FEN1 and PCNA (Proliferating cell nuclear
           antigen) is an essential prerequisite to FEN1's DNA
           replication functionality and stimulates FEN1 nuclease
           activity by 10-50 fold. FEN1 belongs to the
           FEN1-EXO1-like family of structure-specific, 5'
           nucleases. These nucleases contain a PIN (PilT N
           terminus) domain with a helical arch/clamp region (I
           domain) of variable length (approximately 45 residues in
           FEN1 PIN domains) and a H3TH (helix-3-turn-helix)
           domain, an atypical helix-hairpin-helix-2-like region.
           Both the H3TH domain (not included here) and the helical
           arch/clamp region are involved in DNA binding. Nucleases
           within this group also have a carboxylate-rich active
           site that is involved in binding essential divalent
           metal ion cofactors (Mg2+/Mn2+).  FEN1 has a C-terminal
           extension containing residues forming the consensus
           PIP-box - Qxx(M/L/I)xxF(Y/F) which serves to anchor FEN1
           to PCNA.
          Length = 261

 Score = 28.2 bits (64), Expect = 2.0
 Identities = 15/30 (50%), Positives = 21/30 (70%)

Query: 103 KDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
           K  + EK+RE++EE EEK ++ KEE   EE
Sbjct: 88  KSGELEKRRERREEAEEKLEEAKEEGDAEE 117


>gnl|CDD|237028 PRK12267, PRK12267, methionyl-tRNA synthetase; Reviewed.
          Length = 648

 Score = 28.2 bits (64), Expect = 2.0
 Identities = 9/31 (29%), Positives = 17/31 (54%)

Query: 89  EKEEEEEEEEEEKEKDEDEEKKREKKEEEEE 119
            +EE    +E+ +     E +++EKK E+ E
Sbjct: 512 VEEEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542



 Score = 27.8 bits (63), Expect = 2.9
 Identities = 11/45 (24%), Positives = 25/45 (55%)

Query: 110 KREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEM 154
           + + +EE    ++  +    +E EE++K  +K E   ++ +KVE+
Sbjct: 509 RIDVEEEIAYIKEQMEGSAPKEPEEKEKKPEKPEITIDDFDKVEL 553



 Score = 27.8 bits (63), Expect = 3.1
 Identities = 9/32 (28%), Positives = 19/32 (59%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEE 116
           + E+E    +E+ E    K+ +E++K+ +K E
Sbjct: 511 DVEEEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542



 Score = 27.5 bits (62), Expect = 4.4
 Identities = 10/32 (31%), Positives = 19/32 (59%)

Query: 96  EEEEEKEKDEDEEKKREKKEEEEEKQKDKKEE 127
           + EEE    +++ +    KE EE+++K +K E
Sbjct: 511 DVEEEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542



 Score = 27.1 bits (61), Expect = 5.7
 Identities = 10/32 (31%), Positives = 18/32 (56%)

Query: 104 DEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
           D +EE    K++ E    K+ +E+E + E+ E
Sbjct: 511 DVEEEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542



 Score = 26.7 bits (60), Expect = 7.6
 Identities = 9/30 (30%), Positives = 18/30 (60%)

Query: 91  EEEEEEEEEEKEKDEDEEKKREKKEEEEEK 120
           + EEE    +++ +    K+ E+KE++ EK
Sbjct: 511 DVEEEIAYIKEQMEGSAPKEPEEKEKKPEK 540



 Score = 26.3 bits (59), Expect = 9.4
 Identities = 9/32 (28%), Positives = 18/32 (56%)

Query: 87  EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEE 118
           + E+E    +E+ E    ++ E+K +K E+ E
Sbjct: 511 DVEEEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542


>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein.  This family includes proteins
           related to Mpp10 (M phase phosphoprotein 10). The U3
           small nucleolar ribonucleoprotein (snoRNP) is required
           for three cleavage events that generate the mature 18S
           rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
           depletion of Mpp10, a U3 snoRNP-specific protein, halts
           18S rRNA production and impairs cleavage at the three U3
           snoRNP-dependent sites.
          Length = 613

 Score = 28.4 bits (63), Expect = 2.0
 Identities = 22/75 (29%), Positives = 42/75 (56%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
             LE++E ++E  +KE + + EEE++E DE E+ + E++  E    K K +E V E  + 
Sbjct: 273 AELEDDEPDKEAVKKEADSKPEEEDEEDDEQEDDQDEEEPPEAAMDKVKLDEPVLEGVDL 332

Query: 136 QKNKKKEEEEEEEKE 150
           +  K+    E+ + +
Sbjct: 333 ESPKELSSFEKRQAK 347



 Score = 26.9 bits (59), Expect = 7.2
 Identities = 22/83 (26%), Positives = 41/83 (49%)

Query: 75  SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
            E   +    + E   ++EEEEEE+E  +DE  + + E     E +   +   + E E++
Sbjct: 100 HEDGSDGSDMDSEDSADDEEEEEEDESLEDEMIDDEDEADLFNESESSLEDLSDDETEDD 159

Query: 135 EQKNKKKEEEEEEEKEKVEMYRN 157
           E+K  ++EE  EE++   +  R 
Sbjct: 160 EEKKMEEEEAGEEKESVEQATRE 182


>gnl|CDD|227657 COG5354, COG5354, Uncharacterized protein, contains Trp-Asp (WD)
           repeat [General function prediction only].
          Length = 561

 Score = 28.3 bits (63), Expect = 2.0
 Identities = 22/59 (37%), Positives = 29/59 (49%), Gaps = 3/59 (5%)

Query: 97  EEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE--EEEKEKVE 153
           E+  +  DED+ +   KK    E  K++     EE E  Q NK + EEE   E KE VE
Sbjct: 503 EQWPEYSDEDKIRSLLKKLRAIEALKERMRSG-EELEVIQVNKIETEEEVLSELKELVE 560


>gnl|CDD|237200 PRK12778, PRK12778, putative bifunctional 2-polyprenylphenol
           hydroxylase/glutamate synthase subunit beta;
           Provisional.
          Length = 752

 Score = 28.6 bits (64), Expect = 2.1
 Identities = 17/60 (28%), Positives = 24/60 (40%), Gaps = 1/60 (1%)

Query: 104 DEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYRNPVPMYR 163
           D DE  KR    +  E ++  K EE       +  K  + +E    E+V M     P YR
Sbjct: 249 DFDEMLKRMGAYKTIEGEELLKLEERTAAWRAELRKSMKPKERTAIERVPMPELD-PEYR 307



 Score = 28.2 bits (63), Expect = 2.3
 Identities = 16/66 (24%), Positives = 23/66 (34%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           E EE  K EE       + +   + K+R   E     + D +       EE      KE+
Sbjct: 264 EGEELLKLEERTAAWRAELRKSMKPKERTAIERVPMPELDPEYRAHNRFEEVNLGLTKEQ 323

Query: 144 EEEEEK 149
              E K
Sbjct: 324 AMTEAK 329



 Score = 27.8 bits (62), Expect = 3.3
 Identities = 9/53 (16%), Positives = 20/53 (37%)

Query: 90  KEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKE 142
             E EE  + E+       + R+  + +E    ++      + E    N+ +E
Sbjct: 262 TIEGEELLKLEERTAAWRAELRKSMKPKERTAIERVPMPELDPEYRAHNRFEE 314


>gnl|CDD|226908 COG4531, ZnuA, ABC-type Zn2+ transport system, periplasmic
           component/surface adhesin [Inorganic ion transport and
           metabolism].
          Length = 318

 Score = 28.2 bits (63), Expect = 2.1
 Identities = 24/101 (23%), Positives = 34/101 (33%), Gaps = 13/101 (12%)

Query: 23  GDSSHGVYNLRIRNVSLTDDAEFQCQVGP---AQYHKPIRANARLTVI-LPQAISASETL 78
           G S H  Y+LR  +V     A+    VGP   A   KP+       V+ L         L
Sbjct: 57  GASPH-DYSLRPSDVKRLQSADLVVWVGPDLEAFLDKPLSGLPGAKVVTLADLPGVKPLL 115

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEE 119
                    +E  + EEE +   +    + K       E E
Sbjct: 116 F--------REAHDHEEEHDHGHDHAGADHKGDHDHHHEGE 148


>gnl|CDD|240416 PTZ00432, PTZ00432, falcilysin; Provisional.
          Length = 1119

 Score = 28.2 bits (63), Expect = 2.2
 Identities = 21/80 (26%), Positives = 37/80 (46%), Gaps = 4/80 (5%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           E  + EKE  +  ++E KE+     K+ +  E E+  +K KKE E +++ E   +     
Sbjct: 575 ESSKYEKEFNKLVKDELKERLSHLTKE-QVDEMEKAYEKFKKEREADDDPEHLDSFPILS 633

Query: 144 EEEEEKEKVEMYRNPVPMYR 163
             +  KE  E    P  +Y+
Sbjct: 634 LSDLNKETEE---IPTKLYK 650



 Score = 26.7 bits (59), Expect = 7.8
 Identities = 19/57 (33%), Positives = 29/57 (50%), Gaps = 1/57 (1%)

Query: 97  EEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
           E  E  K E E  K  K E +E      K E+V+E E+  +  KKE E +++ E ++
Sbjct: 572 EAVESSKYEKEFNKLVKDELKERLSHLTK-EQVDEMEKAYEKFKKEREADDDPEHLD 627


>gnl|CDD|218421 pfam05086, Dicty_REP, Dictyostelium (Slime Mold) REP protein.  This
           family consists of REP proteins from Dictyostelium
           (Slime molds). REP protein is likely involved in
           transcription regulation and control of DNA replication,
           specifically amplification of plasmid at low copy
           numbers. The formation of homomultimers may be required
           for their regulatory activity.
          Length = 910

 Score = 28.3 bits (63), Expect = 2.2
 Identities = 10/38 (26%), Positives = 28/38 (73%)

Query: 72  ISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEK 109
           ++  E    ++++E E +  +E+E+++E++++DEDE++
Sbjct: 873 LTDLEEYLISQFQENEDDDADEDEDQDEDEDEDEDEDE 910


>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
          Length = 1465

 Score = 28.3 bits (63), Expect = 2.2
 Identities = 20/79 (25%), Positives = 34/79 (43%), Gaps = 8/79 (10%)

Query: 78   LENNEWEEEEKEKEEEEEEEEEEKEKDED--------EEKKREKKEEEEEKQKDKKEEEV 129
            L+  E E ++ +KE+ + EE  EK +           ++  R+  ++   K+  KK  E 
Sbjct: 1158 LDALEKELDKLDKEDAKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASES 1217

Query: 130  EEEEEEQKNKKKEEEEEEE 148
            E  EE   +   E E   E
Sbjct: 1218 ETTEETYGSSAMETENVAE 1236



 Score = 27.1 bits (60), Expect = 6.5
 Identities = 15/67 (22%), Positives = 31/67 (46%), Gaps = 6/67 (8%)

Query: 84   EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
            E+E  + ++E+ + EE +EK +    + E    ++  ++  K+         +K  KK  
Sbjct: 1162 EKELDKLDKEDAKAEEAREKLQRAAARGESGAAKKVSRQAPKKPA------PKKTTKKAS 1215

Query: 144  EEEEEKE 150
            E E  +E
Sbjct: 1216 ESETTEE 1222


>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
           includes archaeal L12p, the protein that is functionally
           equivalent to L7/L12 in bacteria and the P1 and P2
           proteins in eukaryotes. L12p is homologous to P1 and P2
           but is not homologous to bacterial L7/L12. It is located
           in the L12 stalk, with proteins L10, L11, and 23S rRNA.
           L12p is the only protein in the ribosome to occur as
           multimers, always appearing as sets of dimers. Recent
           data indicate that most archaeal species contain six
           copies of L12p (three homodimers), while eukaryotes have
           four copies (two heterodimers), and bacteria may have
           four or six copies (two or three homodimers), depending
           on the species. The organization of proteins within the
           stalk has been characterized primarily in bacteria,
           where L7/L12 forms either two or three homodimers and
           each homodimer binds to the extended C-terminal helix of
           L10. L7/L12 is attached to the ribosome through L10 and
           is the only ribosomal protein that does not directly
           interact with rRNA. Archaeal L12p is believed to
           function in a similar fashion. However, hybrid ribosomes
           containing the large subunit from E. coli with an
           archaeal stalk are able to bind archaeal and eukaryotic
           elongation factors but not bacterial elongation factors.
           In several mesophilic and thermophilic archaeal species,
           the binding of 23S rRNA to protein L11 and to the
           L10/L12p pentameric complex was found to be
           temperature-dependent and cooperative.
          Length = 106

 Score = 27.1 bits (60), Expect = 2.2
 Identities = 19/32 (59%), Positives = 24/32 (75%)

Query: 69  PQAISASETLENNEWEEEEKEKEEEEEEEEEE 100
           P A +A+   E  E +EEEK+KEEE+EEEEEE
Sbjct: 66  PAAAAAAAAEEKAEEKEEEKKKEEEKEEEEEE 97


>gnl|CDD|236842 PRK11091, PRK11091, aerobic respiration control sensor protein
           ArcB; Provisional.
          Length = 779

 Score = 28.4 bits (64), Expect = 2.2
 Identities = 16/42 (38%), Positives = 24/42 (57%), Gaps = 4/42 (9%)

Query: 80  NNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQ 121
           N E  E EK     EE  +E  E+ ++E K+RE+ + E E+Q
Sbjct: 116 NQEIAEREKA----EEARQEAFEQLKNEIKEREETQIELEQQ 153


>gnl|CDD|205286 pfam13105, DUF3959, Protein of unknown function (DUF3959).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are approximately 260 amino acids in length.
          Length = 242

 Score = 27.9 bits (62), Expect = 2.3
 Identities = 8/48 (16%), Positives = 20/48 (41%)

Query: 110 KREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYRN 157
           ++    ++  K +   EE V      +   + +E   + + ++E  RN
Sbjct: 195 RKPSNSKDPRKYEHNYEENVPMYHMTEPIDQYKEPTTQGQTRMERRRN 242


>gnl|CDD|218311 pfam04888, SseC, Secretion system effector C (SseC) like family.
           SseC is a secreted protein that forms a complex together
           with SecB and SecD on the surface of Salmonella. All
           these proteins are secreted by the type III secretion
           system. Many mucosal pathogens use type III secretion
           systems for the injection of effector proteins into
           target cells. SecB, SseC and SecD are inserted into the
           target cell membrane. where they form a small pore or
           translocon. In addition to SseC, this family includes
           the bacterial secreted proteins PopB, PepB, YopB and
           EspD which are thought to be directly involved in pore
           formation, and type III secretion system translocon.
          Length = 303

 Score = 28.2 bits (63), Expect = 2.4
 Identities = 16/47 (34%), Positives = 30/47 (63%), Gaps = 3/47 (6%)

Query: 105 EDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
           +  EK+ + K ++ E+ +DK+E++ EE +E+ K   K  E+ EE +K
Sbjct: 11  KLAEKQAKSKLQQLERARDKQEKKAEEYQEQIK---KAIEKAEEAKK 54


>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P.  This model
           represents the L12P protein of the large (50S) subunit
           of the archaeal ribosome.
          Length = 105

 Score = 26.9 bits (60), Expect = 2.4
 Identities = 14/24 (58%), Positives = 19/24 (79%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDE 107
               +E+EEEEEEEEEE+E+ E+E
Sbjct: 72  AAAAEEEEEEEEEEEEEEEESEEE 95



 Score = 26.5 bits (59), Expect = 3.1
 Identities = 14/24 (58%), Positives = 18/24 (75%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEE 108
                +EEEEEEEEEE+E++E EE
Sbjct: 71  AAAAAEEEEEEEEEEEEEEEESEE 94



 Score = 26.5 bits (59), Expect = 3.2
 Identities = 14/25 (56%), Positives = 19/25 (76%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEE 108
                E+EEEEEEEEEE+E++ +EE
Sbjct: 71  AAAAAEEEEEEEEEEEEEEEESEEE 95


>gnl|CDD|222259 pfam13608, Potyvirid-P3, Protein P3 of Potyviral polyprotein.  This
           is the P3 protein section of the Potyviridae
           polyproteins. The function is not known except that the
           protein is essential to viral survival.
          Length = 445

 Score = 28.3 bits (64), Expect = 2.4
 Identities = 18/51 (35%), Positives = 30/51 (58%), Gaps = 2/51 (3%)

Query: 108 EKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYRNP 158
           E ++ K+E  EEK+ +K+E+E+ +     K + K   EEE  E +E  +NP
Sbjct: 271 EHRKLKREAAEEKE-EKEEKEIRKLYLYSKLEGKLPTEEEFLEYIE-EKNP 319



 Score = 27.9 bits (63), Expect = 2.5
 Identities = 24/70 (34%), Positives = 35/70 (50%), Gaps = 12/70 (17%)

Query: 86  EEKEKEEEEEEEEEEK--EKDEDEEKKREKKEEEEE-----KQKDKK-----EEEVEEEE 133
           + K +  EE+EE+EEK   K     K   K   EEE     ++K+ +     EE +EEEE
Sbjct: 274 KLKREAAEEKEEKEEKEIRKLYLYSKLEGKLPTEEEFLEYIEEKNPELLETAEELIEEEE 333

Query: 134 EEQKNKKKEE 143
            E + K  +E
Sbjct: 334 VEHQAKTADE 343


>gnl|CDD|132364 TIGR03321, alt_F1F0_F0_B, alternate F1F0 ATPase, F0 subunit B.  A
           small number of taxonomically diverse prokaryotic
           species, including Methanosarcina barkeri, have what
           appears to be a second ATP synthase, in addition to the
           normal F1F0 ATPase in bacteria and A1A0 ATPase in
           archaea. These enzymes use ion gradients to synthesize
           ATP, CC and in principle may run in either direction.
           This model represents the F0 subunit B of this apparent
           second ATP synthase.
          Length = 246

 Score = 27.7 bits (62), Expect = 2.4
 Identities = 19/70 (27%), Positives = 36/70 (51%), Gaps = 3/70 (4%)

Query: 84  EEEEKEKEEEEEEEEEEK---EKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
           + + K++E E+E  E E+   E D+  E    K +EE + ++ +  +E  EE +E + K 
Sbjct: 47  DADTKKREAEQERREYEEKNEELDQQREVLLTKAKEEAQAERQRLLDEAREEADEIREKW 106

Query: 141 KEEEEEEEKE 150
           +E    E+  
Sbjct: 107 QEALRREQAA 116


>gnl|CDD|221490 pfam12253, CAF1A, Chromatin assembly factor 1 subunit A.  The CAF-1
           or chromatin assembly factor-1 consists of three
           subunits, and this is the first, or A. The A domain is
           uniquely required for the progression of S phase in
           mouse cells, independent of its ability to promote
           histone deposition but dependent on its ability to
           interact with HP1 - heterochromatin protein 1-rich
           heterochromatin domains next to centromeres that are
           crucial for chromosome segregation during mitosis. This
           HP1-CAF-1 interaction module functions as a built-in
           replication control for heterochromatin, which, like a
           control barrier, has an impact on S-phase progression in
           addition to DNA-based checkpoints.
          Length = 76

 Score = 26.4 bits (59), Expect = 2.4
 Identities = 14/27 (51%), Positives = 22/27 (81%)

Query: 82  EWEEEEKEKEEEEEEEEEEKEKDEDEE 108
           EWEEEE+ ++ E E+EE+E+E D+D+ 
Sbjct: 47  EWEEEEEGEDLESEDEEDEEEDDDDDM 73


>gnl|CDD|111875 pfam03032, Brevenin, Brevenin/esculentin/gaegurin/rugosin family.
           This family contains a number of defence peptides
           secreted from the skin of amphibians, including the
           opiate-like dermorphins and deltorphins, and the
           antimicrobial dermoseptins and temporins. The alignment
           for this family includes the signal peptide.
          Length = 46

 Score = 25.8 bits (57), Expect = 2.5
 Identities = 14/25 (56%), Positives = 19/25 (76%)

Query: 87  EKEKEEEEEEEEEEKEKDEDEEKKR 111
           E+EK E+EEE E+E+E +E  E KR
Sbjct: 22  EEEKREDEEENEDEEEGEEQSEVKR 46


>gnl|CDD|240292 PTZ00148, PTZ00148, 40S ribosomal protein S8; Provisional.
          Length = 205

 Score = 27.6 bits (62), Expect = 2.5
 Identities = 17/39 (43%), Positives = 23/39 (58%)

Query: 109 KKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
           KK+ KKEEE +K K K E  V + E  +KN K ++   E
Sbjct: 124 KKKYKKEEENKKAKKKSESLVAKLEVRKKNHKIDKALLE 162


>gnl|CDD|217830 pfam03986, Autophagy_N, Autophagocytosis associated protein (Atg3),
           N-terminal domain.  Autophagocytosis is a
           starvation-induced process responsible for transport of
           cytoplasmic proteins to the lysosome/vacuole. Atg3 is a
           ubiquitin like modifier that is topologically similar to
           the canonical E2 enzyme. It catalyzes the conjugation of
           Atg8 and phosphatidylethanolamine.
          Length = 146

 Score = 27.3 bits (61), Expect = 2.6
 Identities = 11/62 (17%), Positives = 32/62 (51%)

Query: 86  EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
           E  +  EE  E+E+E +        R+K++++   ++D  E   ++++    +   E+++
Sbjct: 83  EYGDGAEEIVEDEDEDDGWVTTHGNRDKQKDDIADEEDIPEIGDDDDDVVDSSDADEDDD 142

Query: 146 EE 147
           ++
Sbjct: 143 DD 144



 Score = 26.9 bits (60), Expect = 4.2
 Identities = 8/62 (12%), Positives = 30/62 (48%)

Query: 87  EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEE 146
           E     EE  E+E+++           K++++   ++   E  +++++   +   +E+++
Sbjct: 83  EYGDGAEEIVEDEDEDDGWVTTHGNRDKQKDDIADEEDIPEIGDDDDDVVDSSDADEDDD 142

Query: 147 EE 148
           ++
Sbjct: 143 DD 144



 Score = 25.8 bits (57), Expect = 9.1
 Identities = 9/68 (13%), Positives = 34/68 (50%), Gaps = 4/68 (5%)

Query: 81  NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
            + E  +  +E  E+E+E++        + ++K +  +E+       E+ +++++  +  
Sbjct: 80  KQMEYGDGAEEIVEDEDEDDGWVTTHGNRDKQKDDIADEEDIP----EIGDDDDDVVDSS 135

Query: 141 KEEEEEEE 148
             +E++++
Sbjct: 136 DADEDDDD 143



 Score = 25.8 bits (57), Expect = 9.8
 Identities = 12/63 (19%), Positives = 30/63 (47%)

Query: 88  KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
           K+ E  +  EE  +++DED+         +++K     EE++ E  ++  +     + +E
Sbjct: 80  KQMEYGDGAEEIVEDEDEDDGWVTTHGNRDKQKDDIADEEDIPEIGDDDDDVVDSSDADE 139

Query: 148 EKE 150
           + +
Sbjct: 140 DDD 142


>gnl|CDD|220102 pfam09073, BUD22, BUD22.  BUD22 has been shown in yeast to be a
           nuclear protein involved in bud-site selection. It plays
           a role in positioning the proximal bud pole signal. More
           recently it has been shown to be involved in ribosome
           biogenesis.
          Length = 424

 Score = 27.9 bits (62), Expect = 2.6
 Identities = 20/98 (20%), Positives = 41/98 (41%)

Query: 70  QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
           +A  +S+  +  E E E++ K EE  E++ + E++ED + +   + +        +EE  
Sbjct: 162 EAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSEDYSQYDGMLVDSSDEEEGE 221

Query: 130 EEEEEEQKNKKKEEEEEEEKEKVEMYRNPVPMYRNPVP 167
           E           E E +E   ++   R+      +  P
Sbjct: 222 EAPSINYNEDTSESESDESDSEISESRSVSDSEESSPP 259



 Score = 27.5 bits (61), Expect = 4.1
 Identities = 11/61 (18%), Positives = 37/61 (60%)

Query: 87  EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEE 146
           E + ++ + +++ +K K ++ ++  +K +EEE + +D+ + E   E++    ++++ + E
Sbjct: 143 ETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSE 202

Query: 147 E 147
           +
Sbjct: 203 D 203



 Score = 26.3 bits (58), Expect = 9.9
 Identities = 21/78 (26%), Positives = 37/78 (47%), Gaps = 6/78 (7%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV------EEE 132
           +  E +E   + +EEE E E+E + +E  E   + +EEE+   +D  + +       +EE
Sbjct: 159 KKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSEDYSQYDGMLVDSSDEE 218

Query: 133 EEEQKNKKKEEEEEEEKE 150
           E E+       E+  E E
Sbjct: 219 EGEEAPSINYNEDTSESE 236


>gnl|CDD|237147 PRK12586, PRK12586, putative monovalent cation/H+ antiporter
           subunit G; Reviewed.
          Length = 145

 Score = 27.4 bits (61), Expect = 2.6
 Identities = 7/38 (18%), Positives = 13/38 (34%), Gaps = 1/38 (2%)

Query: 112 EKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEK 149
            K +            E +   E  + + K+ EE  +K
Sbjct: 104 RKNDAHTHASILLSSNE-QNSTEALQLRAKKREEHRKK 140



 Score = 25.9 bits (57), Expect = 8.4
 Identities = 5/41 (12%), Positives = 14/41 (34%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEE 119
           +N+            E+   E  +    + ++  KK  + +
Sbjct: 105 KNDAHTHASILLSSNEQNSTEALQLRAKKREEHRKKWYQND 145


>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM).  This
           family consists of several Plasmodium falciparum SPAM
           (secreted polymorphic antigen associated with
           merozoites) proteins. Variation among SPAM alleles is
           the result of deletions and amino acid substitutions in
           non-repetitive sequences within and flanking the alanine
           heptad-repeat domain. Heptad repeats in which the a and
           d position contain hydrophobic residues generate
           amphipathic alpha-helices which give rise to helical
           bundles or coiled-coil structures in proteins. SPAM is
           an example of a P. falciparum antigen in which a
           repetitive sequence has features characteristic of a
           well-defined structural element.
          Length = 164

 Score = 27.5 bits (61), Expect = 2.6
 Identities = 24/68 (35%), Positives = 42/68 (61%)

Query: 82  EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
           E +E+++E+EEE+EEE EE E  EDEE+  E +EEEEE ++D  + +  E++        
Sbjct: 46  EKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDLKDIEKKNINDIFNS 105

Query: 142 EEEEEEEK 149
            +++  + 
Sbjct: 106 TQDDNAQN 113



 Score = 27.1 bits (60), Expect = 4.0
 Identities = 23/86 (26%), Positives = 53/86 (61%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
           E ++   W++E+  KE E+ ++E++++ +E+EE+  E+ EE E+ + +++  E EEEEEE
Sbjct: 24  EHVKITSWDKEDIIKENEDVKDEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEE 83

Query: 136 QKNKKKEEEEEEEKEKVEMYRNPVPM 161
            +    + ++ E+K   +++ +    
Sbjct: 84  DEEDNVDLKDIEKKNINDIFNSTQDD 109


>gnl|CDD|222863 PHA02540, 61, DNA primase; Provisional.
          Length = 337

 Score = 28.0 bits (63), Expect = 2.6
 Identities = 13/50 (26%), Positives = 24/50 (48%)

Query: 103 KDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKV 152
           KD + +  RE   E  +++   K   V + + E K +KK  E+    E++
Sbjct: 72  KDYEPDLYREYIMERFKERGTGKGRPVPKPKFEFKKEKKVIEKLPFCERL 121


>gnl|CDD|150531 pfam09871, DUF2098, Uncharacterized protein conserved in archaea
           (DUF2098).  This domain, found in various hypothetical
           prokaryotic proteins, has no known function.
          Length = 91

 Score = 26.9 bits (60), Expect = 2.6
 Identities = 14/46 (30%), Positives = 26/46 (56%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVE 130
            +  E  ++ +E++EE+E+D++E  +R KKEEE  +  D       
Sbjct: 43  TDYLEVTDKVKEKKEEREEDKEELIERIKKEEETFEDVDLGSAGCG 88


>gnl|CDD|225381 COG2825, HlpA, Outer membrane protein [Cell envelope biogenesis,
           outer membrane].
          Length = 170

 Score = 27.4 bits (61), Expect = 2.7
 Identities = 18/77 (23%), Positives = 37/77 (48%), Gaps = 2/77 (2%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREK--KEEEEEKQKDKKEEEVEEEEEEQKNKKKE 142
           +E  + ++   + E E +K + E +K +K  K +E + Q D K E + +  + +   KKE
Sbjct: 37  QESPQAKKVSADLESEFKKRQKELQKMQKELKAKEAKLQDDGKMEALSDRAKAEAEIKKE 96

Query: 143 EEEEEEKEKVEMYRNPV 159
           +      +K + Y   +
Sbjct: 97  KLVNAFNKKQQEYEKDL 113


>gnl|CDD|198139 smart01071, CDC37_N, Cdc37 N terminal kinase binding.  Cdc37 is a
           molecular chaperone required for the activity of
           numerous eukaryotic protein kinases. This domain
           corresponds to the N terminal domain which binds
           predominantly to protein kinases.and is found N terminal
           to the Hsp (Heat shocked protein) 90-binding domain.
           Expression of a construct consisting of only the
           N-terminal domain of Saccharomyces pombe Cdc37 results
           in cellular viability. This indicates that interactions
           with the cochaperone Hsp90 may not be essential for
           Cdc37 function.
          Length = 154

 Score = 27.4 bits (61), Expect = 2.8
 Identities = 16/46 (34%), Positives = 27/46 (58%)

Query: 80  NNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKK 125
             E EE   + E   EE ++ ++K + E+K+  KK +E EK++ KK
Sbjct: 96  KKELEEANGDSEGLLEELKKHRDKLKKEQKELRKKLDELEKEEKKK 141


>gnl|CDD|221408 pfam12072, DUF3552, Domain of unknown function (DUF3552).  This
           presumed domain is functionally uncharacterized. This
           domain is found in bacteria, archaea and eukaryotes.
           This domain is about 200 amino acids in length. This
           domain is found associated with pfam00013, pfam01966.
           This domain has a single completely conserved residue A
           that may be functionally important.
          Length = 201

 Score = 27.5 bits (62), Expect = 2.8
 Identities = 19/51 (37%), Positives = 35/51 (68%)

Query: 89  EKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
           +KEE  + ++E  EK E+  +++EK+    ++Q ++KEEE+EE  EEQ+ +
Sbjct: 89  QKEETLDRKDESLEKKEESLEEKEKELAARQQQLEEKEEELEELIEEQQQE 139


>gnl|CDD|235185 PRK03980, PRK03980, flap endonuclease-1; Provisional.
          Length = 292

 Score = 27.9 bits (63), Expect = 2.8
 Identities = 15/30 (50%), Positives = 21/30 (70%)

Query: 103 KDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
           K E+ E++RE +EE EEK ++ KEE   EE
Sbjct: 40  KAEEIEERREVREEAEEKYEEAKEEGDLEE 69


>gnl|CDD|234402 TIGR03928, T7_EssCb_Firm, type VII secretion protein EssC,
           C-terminal domain.  This model describes the C-terminal
           domain, or longer subunit, of the Firmicutes type VII
           secretion protein EssC. This protein (homologous to EccC
           in Actinobacteria) and the WXG100 target proteins are
           the only homologous parts of type VII secretion between
           Firmicutes and Actinobacteria [Protein fate, Protein and
           peptide secretion and trafficking].
          Length = 1296

 Score = 28.0 bits (63), Expect = 2.9
 Identities = 10/45 (22%), Positives = 23/45 (51%), Gaps = 1/45 (2%)

Query: 114 KEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYRNP 158
           +E+++ K+  +K         ++K K+ +   E+++  V  Y NP
Sbjct: 84  REKKKYKKDVEKRNRSYRLYLDKKRKELQALSEKQRH-VLHYHNP 127


>gnl|CDD|234715 PRK00290, dnaK, molecular chaperone DnaK; Provisional.
          Length = 627

 Score = 27.8 bits (63), Expect = 3.0
 Identities = 28/93 (30%), Positives = 43/93 (46%), Gaps = 15/93 (16%)

Query: 72  ISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKR----------EKK-EEEEEK 120
           I+AS  L + E E   K+ E   EE+++ KE  E     R          EK  +E  +K
Sbjct: 494 ITASSGLSDEEIERMVKDAEANAEEDKKRKELVE----ARNQADSLIYQTEKTLKELGDK 549

Query: 121 QKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
               ++E++E   +E K   K E++E  K K E
Sbjct: 550 VPADEKEKIEAAIKELKEALKGEDKEAIKAKTE 582


>gnl|CDD|235229 PRK04156, gltX, glutamyl-tRNA synthetase; Provisional.
          Length = 567

 Score = 27.9 bits (63), Expect = 3.0
 Identities = 15/62 (24%), Positives = 30/62 (48%), Gaps = 1/62 (1%)

Query: 58  IRANARLTV-ILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEE 116
           +R+ A+  + I+ + +    +L   E  E  +E   E  EEEEEK++++         E+
Sbjct: 40  LRSKAKEIIPIVKEVVEEVNSLSLEEQRERLEELAPELLEEEEEKKEEKKGLPPLPNAEK 99

Query: 117 EE 118
            +
Sbjct: 100 GK 101



 Score = 27.5 bits (62), Expect = 4.1
 Identities = 15/39 (38%), Positives = 22/39 (56%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK 122
            EE++E+ EE   E  E+E+++ EEKK        EK K
Sbjct: 63  LEEQRERLEELAPELLEEEEEKKEEKKGLPPLPNAEKGK 101



 Score = 27.1 bits (61), Expect = 4.8
 Identities = 11/36 (30%), Positives = 20/36 (55%)

Query: 118 EEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
            E+Q+++ EE   E  EE++ KK+E++        E
Sbjct: 63  LEEQRERLEELAPELLEEEEEKKEEKKGLPPLPNAE 98


>gnl|CDD|216249 pfam01025, GrpE, GrpE. 
          Length = 165

 Score = 27.2 bits (61), Expect = 3.0
 Identities = 20/56 (35%), Positives = 36/56 (64%)

Query: 96  EEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
           EE+EE+E+ EDEE+  E++ EE E++ ++ ++ +     E +N +K  E E E+ K
Sbjct: 1   EEKEEEEELEDEEEALEEELEELEEEIEELKDRLLRLLAEFENYRKRTEREREEAK 56


>gnl|CDD|220410 pfam09798, LCD1, DNA damage checkpoint protein.  This is a family
           of proteins which regulate checkpoint kinases. In
           Schizosaccharomyces pombe this protein is called Rad26
           and in Saccharomyces cerevisiae it is called LCD1.
          Length = 648

 Score = 27.9 bits (62), Expect = 3.0
 Identities = 13/38 (34%), Positives = 24/38 (63%), Gaps = 1/38 (2%)

Query: 114 KEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
           K+EE  KQK  +  E++E+ +++  K K+E +  E E+
Sbjct: 13  KQEERNKQK-SRVNELKEKHDQELQKLKQELQSLEDER 49


>gnl|CDD|219293 pfam07093, SGT1, SGT1 protein.  This family consists of several
           eukaryotic SGT1 proteins. Human SGT1 or hSGT1 is known
           to suppress GCR2 and is highly expressed in the muscle
           and heart. The function of this family is unknown
           although it has been speculated that SGT1 may be
           functionally analogous to the Gcr2p protein of
           Saccharomyces cerevisiae which is known to be a
           regulatory factor of glycolytic gene expression.
          Length = 557

 Score = 27.8 bits (62), Expect = 3.1
 Identities = 10/67 (14%), Positives = 29/67 (43%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           + +E + E+++E+E+   +   +E ++   + + E KQ D               +  ++
Sbjct: 487 DADEDDDEDDDEDEDSSSDSTLEELEEYMDQMDAELKQTDSSNNADISNSGSSGAEDDDD 546

Query: 144 EEEEEKE 150
           + E  + 
Sbjct: 547 DIEGVEP 553


>gnl|CDD|233692 TIGR02031, BchD-ChlD, magnesium chelatase ATPase subunit D.  This
           model represents one of two ATPase subunits of the
           trimeric magnesium chelatase responsible for insertion
           of magnesium ion into protoporphyrin IX. This is an
           essential step in the biosynthesis of both chlorophyll
           and bacteriochlorophyll. This subunit is found in green
           plants, photosynthetic algae, cyanobacteria and other
           photosynthetic bacteria. Unlike subunit I (TIGR02030),
           this subunit is not found in archaea [Biosynthesis of
           cofactors, prosthetic groups, and carriers, Chlorophyll
           and bacteriochlorphyll].
          Length = 589

 Score = 27.8 bits (62), Expect = 3.1
 Identities = 17/77 (22%), Positives = 29/77 (37%), Gaps = 4/77 (5%)

Query: 66  VILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEE-EEKQKDK 124
           V+LP+A   +   E            E  E EEE  E D+ +    E+ ++  EE   D 
Sbjct: 258 VLLPRA---TRLPEPEPQPPPPPPPPEPPEPEEEPDEPDQTDPDDGEETDQIPEELMFDA 314

Query: 125 KEEEVEEEEEEQKNKKK 141
            E ++ +         +
Sbjct: 315 VEADLPDNILATLQTVQ 331


>gnl|CDD|178493 PLN02905, PLN02905, beta-amylase.
          Length = 702

 Score = 28.0 bits (62), Expect = 3.1
 Identities = 10/25 (40%), Positives = 19/25 (76%)

Query: 89  EKEEEEEEEEEEKEKDEDEEKKREK 113
           E+++EEE + + KE+D+ + + REK
Sbjct: 13  EEDDEEEMDMDVKEEDDGDRRNREK 37


>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
           with the myosin-like domain [Function unknown].
          Length = 499

 Score = 27.7 bits (61), Expect = 3.1
 Identities = 16/103 (15%), Positives = 39/103 (37%), Gaps = 9/103 (8%)

Query: 64  LTVILPQAISAS---------ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKK 114
           +TV+    ISA+           L +  ++ ++   +      E    + E    + E++
Sbjct: 46  VTVLTGMLISAATLAILFLLNRNLRSGVFQLDDIRPQLRALRTELGTAQGEKRAAETERE 105

Query: 115 EEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYRN 157
               E QK ++E E   +E     +   + ++E     +  ++
Sbjct: 106 AARSELQKARQEREAVRQELAAARQNLAKAQQELARLTKQAQD 148


>gnl|CDD|223957 COG1026, COG1026, Predicted Zn-dependent peptidases,
           insulinase-like [General function prediction only].
          Length = 978

 Score = 27.7 bits (62), Expect = 3.1
 Identities = 14/47 (29%), Positives = 26/47 (55%)

Query: 82  EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
           E EE+ +++E E  ++   +  DED EK  +  ++ +E+Q     EE
Sbjct: 460 ELEEKLEKEERELLQKRSSELTDEDLEKIIKDSKKLKERQDQPDSEE 506



 Score = 27.3 bits (61), Expect = 4.3
 Identities = 11/49 (22%), Positives = 29/49 (59%), Gaps = 2/49 (4%)

Query: 104 DEDEEKKREKKEEEE--EKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
             + E+K EK+E E   ++  +  +E++E+  ++ K  K+ +++ + +E
Sbjct: 458 SPELEEKLEKEERELLQKRSSELTDEDLEKIIKDSKKLKERQDQPDSEE 506



 Score = 26.5 bits (59), Expect = 9.1
 Identities = 11/28 (39%), Positives = 18/28 (64%)

Query: 126 EEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
           EE++E+EE E   K+  E  +E+ EK+ 
Sbjct: 462 EEKLEKEERELLQKRSSELTDEDLEKII 489


>gnl|CDD|239572 cd03490, Topoisomer_IB_N_1, Topoisomer_IB_N_1: A subgroup of the
           N-terminal DNA binding fragment found in eukaryotic DNA
           topoisomerase (topo) IB. Topo IB proteins include the
           monomeric yeast and human topo I and heterodimeric topo
           I from Leishmania donvanni. Topo I enzymes are divided
           into:  topo type IA (bacterial) and type IB
           (eukaryotic). Topo I relaxes superhelical tension in
           duplex DNA by creating a single-strand nick, the broken
           strand can then rotate around the unbroken strand to
           remove DNA supercoils and, the nick is religated,
           liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit religation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  In addition to differences in
           structure and some biochemical properties,
           Trypanosomatid parasite topos I differ from human topo I
           in their sensitivity to CPTs and other classical topo I
           inhibitors. Trypanosomatid topos I have putative roles
           in organizing the kinetoplast DNA network unique to
           these parasites.  This family may represent more than
           one structural domain.
          Length = 217

 Score = 27.6 bits (61), Expect = 3.2
 Identities = 12/24 (50%), Positives = 20/24 (83%)

Query: 130 EEEEEEQKNKKKEEEEEEEKEKVE 153
           EEE+E++KN  KEE+E ++KE+ +
Sbjct: 94  EEEKEKKKNLNKEEKEAKKKERAK 117


>gnl|CDD|237631 PRK14162, PRK14162, heat shock protein GrpE; Provisional.
          Length = 194

 Score = 27.5 bits (61), Expect = 3.2
 Identities = 19/59 (32%), Positives = 34/59 (57%), Gaps = 4/59 (6%)

Query: 90  KEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE----EQKNKKKEEE 144
           KEE   E++  +E+  DE  K+E KE  +E+ ++K+    + E+E    + KNK  E++
Sbjct: 3   KEEFPSEKDLPQEETTDEAPKKEAKEAPKEEDQEKQNPVEDLEKEIADLKAKNKDLEDK 61


>gnl|CDD|237867 PRK14953, PRK14953, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 486

 Score = 27.5 bits (61), Expect = 3.5
 Identities = 30/100 (30%), Positives = 46/100 (46%), Gaps = 26/100 (26%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEK--------------------EKDEDEEKKREKKEEEE 118
              E +E+ ++KEEE+EE + +K                    E  E+E K   K E+ E
Sbjct: 368 SKQETKEQPEKKEEEKEELDIDKIILQIIKNEGKIISAILKNAEIKEEEGKITIKVEKSE 427

Query: 119 EKQKDKKEEEVE------EEEEEQKNKKKEEEEEEEKEKV 152
           E   D + + ++      E EE +K K+KE+E  E  EKV
Sbjct: 428 EDTLDLEIKSIKKYFPFIEFEEVKKEKEKEKERPEFVEKV 467


>gnl|CDD|221514 pfam12297, EVC2_like, Ellis van Creveld protein 2 like protein.
           This family of proteins is found in eukaryotes. Proteins
           in this family are typically between 571 and 1310 amino
           acids in length. There are two conserved sequence
           motifs: LPA and ELH. EVC2 is implicated in Ellis van
           Creveld chondrodysplastic dwarfism in humans. Mutations
           in this protein can give rise to this congenital
           condition. LIMBIN is a protein which shares around 80%
           sequence homology with EVC2 and it is implicated in a
           similar condition in bovine chondrodysplastic dwarfism.
          Length = 429

 Score = 27.5 bits (61), Expect = 3.5
 Identities = 17/79 (21%), Positives = 35/79 (44%), Gaps = 5/79 (6%)

Query: 71  AISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKRE-----KKEEEEEKQKDKK 125
           A++A   LE  +  E + ++E  E E+ EE  K   E +  E           E++  ++
Sbjct: 221 ALTAECNLETRKKMEAQHQREMAEMEQAEELLKRAPEREAVECSSLLDTLHGLEQEHLQR 280

Query: 126 EEEVEEEEEEQKNKKKEEE 144
              +++EE+  K  ++   
Sbjct: 281 SLLLQQEEDFAKAHRQLAV 299


>gnl|CDD|227371 COG5038, COG5038, Ca2+-dependent lipid-binding protein, contains C2
           domain [General function prediction only].
          Length = 1227

 Score = 27.8 bits (62), Expect = 3.5
 Identities = 12/54 (22%), Positives = 22/54 (40%)

Query: 104 DEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYRN 157
             +E +  +K   E+ K + +K    E+       +   EE  E +E  +MY  
Sbjct: 847 SLEEVRYVDKVSSEKRKSEKRKSALDEKTISLVDKEDSVEESIEVEELTDMYSL 900



 Score = 27.0 bits (60), Expect = 5.2
 Identities = 16/56 (28%), Positives = 25/56 (44%), Gaps = 4/56 (7%)

Query: 91  EEEEEEEEEEKEKDEDEEKK---REKKEEEEEKQKDKKEE-EVEEEEEEQKNKKKE 142
           EE    ++   EK + E++K    EK     +K+   +E  EVEE  +    K K 
Sbjct: 849 EEVRYVDKVSSEKRKSEKRKSALDEKTISLVDKEDSVEESIEVEELTDMYSLKPKL 904


>gnl|CDD|215406 PLN02761, PLN02761, lipase class 3 family protein.
          Length = 527

 Score = 27.7 bits (61), Expect = 3.6
 Identities = 9/37 (24%), Positives = 22/37 (59%)

Query: 96  EEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
           ++++  ++    + KRE++ EEE ++K+    E+  E
Sbjct: 57  QQKQSNKQTHVSDNKREEEPEEELEEKEVSLREIWRE 93


>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
           binding fragment.  Topoisomerase I promotes the
           relaxation of DNA superhelical tension by introducing a
           transient single-stranded break in duplex DNA and are
           vital for the processes of replication, transcription,
           and recombination. This family may be more than one
           structural domain.
          Length = 215

 Score = 27.2 bits (61), Expect = 3.6
 Identities = 13/41 (31%), Positives = 25/41 (60%), Gaps = 6/41 (14%)

Query: 95  EEEEEEKEKDEDEEKKREKKEEEEEKQK------DKKEEEV 129
           E E+E+K+    EEKK  K+E+++ ++       D ++E+V
Sbjct: 96  EAEKEKKKAMSKEEKKAIKEEKDKLEEPYGYCLVDGRKEKV 136


>gnl|CDD|221857 pfam12923, RRP7, Ribosomal RNA-processing protein 7 (RRP7).  RRP7
           is an essential protein in yeast that is involved in
           pre-rRNA processing and ribosome assembly. It is
           speculated to be required for correct assembly of rpS27
           into the pre-ribosomal particle.
          Length = 131

 Score = 26.8 bits (60), Expect = 3.7
 Identities = 21/65 (32%), Positives = 34/65 (52%), Gaps = 6/65 (9%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDED----EEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
           E+ +K +EE +EE +   E DED      +   K++    +   K  EE  + +E++K K
Sbjct: 28  EKYDKREEEAKEEAKARSEPDEDGWTTVTRGGRKRKAGASRN--KAAEERRKLKEKKKKK 85

Query: 140 KKEEE 144
           KKE E
Sbjct: 86  KKELE 90


>gnl|CDD|221466 pfam12220, U1snRNP70_N, U1 small nuclear ribonucleoprotein of 70kDa
           MW N terminal.  This domain is found in eukaryotes. This
           domain is about 90 amino acids in length. This domain is
           found associated with pfam00076. This domain is part of
           U1 snRNP, which is the pre-mRNA binding protein of the
           penta-snRNP spliceosome complex. It extends over a
           distance of 180 A from its RNA binding domain, wraps
           around the core domain of U1 snRNP consisting of the
           seven Sm proteins and finally contacts U1-C, which is
           crucial for 5'-splice-site recognition.
          Length = 94

 Score = 26.5 bits (59), Expect = 3.7
 Identities = 16/47 (34%), Positives = 28/47 (59%), Gaps = 4/47 (8%)

Query: 81  NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEE 127
           +E+++ + E   E  E   EK     E +KREKKE+ E+K +++ +E
Sbjct: 45  SEFKDYKDEPPPEPTETWLEKR----EREKREKKEKLEKKLEEELKE 87



 Score = 25.3 bits (56), Expect = 7.6
 Identities = 13/40 (32%), Positives = 23/40 (57%)

Query: 103 KDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKE 142
           KD  +E   E  E   EK++ +K E+ E+ E++ + + KE
Sbjct: 48  KDYKDEPPPEPTETWLEKREREKREKKEKLEKKLEEELKE 87


>gnl|CDD|167284 PRK01833, tatA, twin arginine translocase protein A; Provisional.
          Length = 74

 Score = 26.0 bits (57), Expect = 3.8
 Identities = 15/43 (34%), Positives = 24/43 (55%)

Query: 104 DEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEE 146
           D  E  K  KK   ++K KD + E+VE +E     +K +E+E+
Sbjct: 31  DLGESVKGFKKAMADDKPKDAEFEKVEAKEAASTEQKAKEKEQ 73


>gnl|CDD|222648 pfam14283, DUF4366, Domain of unknown function (DUF4366).  This
           family of proteins is found in bacteria and eukaryotes.
           Proteins in this family are typically between 227 and
           387 amino acids in length.
          Length = 213

 Score = 27.3 bits (61), Expect = 3.8
 Identities = 17/101 (16%), Positives = 35/101 (34%), Gaps = 25/101 (24%)

Query: 62  ARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEK------------ 109
           A L  ++ +  +A+ T            +    E E E + ++E E+K            
Sbjct: 108 ADLLALMEEEDAAAPTCSCCPVCSVNMTECTGPEPEPEPEPEEEPEKKSGMGPLLLVLAV 167

Query: 110 -------------KREKKEEEEEKQKDKKEEEVEEEEEEQK 137
                         + K++E+     D  E +  +E+EE+ 
Sbjct: 168 ALIGGGAYYYFKFYKPKQQEKGAPDDDLDEYDYGDEDEEED 208



 Score = 26.9 bits (60), Expect = 4.9
 Identities = 10/42 (23%), Positives = 15/42 (35%)

Query: 97  EEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
           EEE+                 E    + + E   EEE E+K+
Sbjct: 115 EEEDAAAPTCSCCPVCSVNMTECTGPEPEPEPEPEEEPEKKS 156


>gnl|CDD|239500 cd03406, Band_7_3, A subgroup of the band 7 domain of flotillin
           (reggie) like proteins. This subgroup contains proteins
           similar to stomatin, prohibitin, flotillin, HlfK/C and
           podicin.  Many of these band 7 domain-containing
           proteins are lipid raft-associated.  Individual proteins
           of this band 7 domain family may cluster to form
           membrane microdomains which may in turn recruit
           multiprotein complexes.  Microdomains formed from
           flotillin proteins may in addition be dynamic units with
           their own regulatory functions.  Flotillins have been
           implicated in signal transduction, vesicle trafficking,
           cytoskeleton rearrangement and are known to interact
           with a variety of proteins.  Stomatin interacts with and
           regulates members of the degenerin/epithelia Na+ channel
           family in mechanosensory cells of Caenorhabditis elegans
           and vertebrate neurons and participates in trafficking
           of Glut1 glucose transporters. Prohibitin may act as a
           chaperone for the stabilization of mitochondrial
           proteins.  Prokaryotic HflK/C plays a role in the
           decision between lysogenic and lytic cycle growth during
           lambda phage infection. Flotillins have been implicated
           in the progression of prion disease, in the pathogenesis
           of neurodegenerative diseases such as Parkinson's and
           Alzheimer's disease and, in cancer invasion and
           metastasis. Mutations in the podicin gene give rise to
           autosomal recessive steroid resistant nephritic
           syndrome.
          Length = 280

 Score = 27.4 bits (61), Expect = 3.8
 Identities = 22/74 (29%), Positives = 39/74 (52%), Gaps = 2/74 (2%)

Query: 80  NNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQ--KDKKEEEVEEEEEEQK 137
           N E  E EK K     ++++  EK+ + E+K+   E E+  Q  K    ++V E+E E++
Sbjct: 164 NYELMEAEKTKLLIAIQKQKVVEKEAETERKKAVIEAEKVAQVAKILFGQKVMEKETEKR 223

Query: 138 NKKKEEEEEEEKEK 151
             + E+E    +EK
Sbjct: 224 ISEIEDEAFLAREK 237


>gnl|CDD|184724 PRK14520, rpsP, 30S ribosomal protein S16; Provisional.
          Length = 155

 Score = 27.0 bits (60), Expect = 3.9
 Identities = 5/45 (11%), Positives = 10/45 (22%)

Query: 91  EEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
                E +     +    KK++   E    +      E       
Sbjct: 107 NAALAEADGGPTAEATTPKKKKAAAEAAAAEAAAPAAEAAAAAAA 151


>gnl|CDD|223250 COG0172, SerS, Seryl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 429

 Score = 27.6 bits (62), Expect = 3.9
 Identities = 20/66 (30%), Positives = 32/66 (48%), Gaps = 5/66 (7%)

Query: 96  EEEEEKEKDEDEEKKREKKEEEEE-----KQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
           +EE  K   E EE + E+ E  +E     K+ +   EE+  E +E K K KE E   ++ 
Sbjct: 35  DEERRKLLRELEELQAERNELSKEIGRALKRGEDDAEELIAEVKELKEKLKELEAALDEL 94

Query: 151 KVEMYR 156
           + E+  
Sbjct: 95  EAELDT 100


>gnl|CDD|225871 COG3334, COG3334, Uncharacterized conserved protein [Function
           unknown].
          Length = 192

 Score = 27.1 bits (60), Expect = 3.9
 Identities = 15/73 (20%), Positives = 30/73 (41%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
           E       E++   + E E+      +   D+    +K+  E+ K   +  E ++  E++
Sbjct: 40  EDAAAELAEKKAAAQSEIEKFCANIADAAADQLYALQKELLEKLKDLAEVNERLKALEKK 99

Query: 136 QKNKKKEEEEEEE 148
           +   K  EEE E 
Sbjct: 100 KAELKDLEEEREG 112


>gnl|CDD|225657 COG3115, ZipA, Cell division protein [Cell division and chromosome
           partitioning].
          Length = 324

 Score = 27.5 bits (61), Expect = 3.9
 Identities = 17/103 (16%), Positives = 42/103 (40%), Gaps = 10/103 (9%)

Query: 51  PAQYHKPIRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKE-------- 102
           P   ++P  A+A++ + +PQ    S+   + +  +   ++E+  EE  +           
Sbjct: 86  PQHQYQPEYASAQIKIPVPQPPQISDPPAHPQPTQPALDQEQPPEEARQPVLPQEAPAPQ 145

Query: 103 --KDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
                  +   +  +    +Q+ + EE VE   E ++  +K+ 
Sbjct: 146 PVHSAAPQPAVQTVQPAVPEQQVQPEEVVEPAPEVKRPPRKDT 188


>gnl|CDD|227441 COG5110, RPN1, 26S proteasome regulatory complex component
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 881

 Score = 27.6 bits (61), Expect = 3.9
 Identities = 15/48 (31%), Positives = 28/48 (58%)

Query: 97  EEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
           +E +K++   +E+ +   E++   +KDKK+EE E+  EE    K + E
Sbjct: 3   DESDKKQQTIDEQSQISPEKQTPNKKDKKKEEEEQLSEEDAMLKGDLE 50



 Score = 26.4 bits (58), Expect = 8.2
 Identities = 11/44 (25%), Positives = 27/44 (61%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEE 127
           +E +K+++  +E+ +   EK    +K ++K+EEE+  ++D   +
Sbjct: 3   DESDKKQQTIDEQSQISPEKQTPNKKDKKKEEEEQLSEEDAMLK 46


>gnl|CDD|227880 COG5593, COG5593, Nucleic-acid-binding protein possibly involved in
           ribosomal biogenesis [Translation, ribosomal structure
           and biogenesis].
          Length = 821

 Score = 27.7 bits (61), Expect = 4.0
 Identities = 16/65 (24%), Positives = 33/65 (50%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           +E + +  ++E+ + E  ++ + EE   E     + +Q +        EE+E   ++KEE
Sbjct: 724 DEPKLDAIDDEDAKSEGSQESDQEEGLDEIFYSFDGEQDNSDSFAESSEEDESSEEEKEE 783

Query: 144 EEEEE 148
           EE +E
Sbjct: 784 EENKE 788



 Score = 27.3 bits (60), Expect = 4.3
 Identities = 15/40 (37%), Positives = 22/40 (55%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKE 115
           +  ++     EE E  EEE+EEEE KE      KK+++K 
Sbjct: 762 DNSDSFAESSEEDESSEEEKEEEENKEVSAKRAKKKQRKN 801



 Score = 27.3 bits (60), Expect = 4.6
 Identities = 15/73 (20%), Positives = 40/73 (54%), Gaps = 4/73 (5%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEE----EKEKDEDEEKKREKKEEEEEKQKDKKEEEVEE 131
           + +++ + + E  ++ ++EE  +E       + ++ +   E  EE+E  +++K+EEE +E
Sbjct: 729 DAIDDEDAKSEGSQESDQEEGLDEIFYSFDGEQDNSDSFAESSEEDESSEEEKEEEENKE 788

Query: 132 EEEEQKNKKKEEE 144
              ++  KK+ + 
Sbjct: 789 VSAKRAKKKQRKN 801


>gnl|CDD|173502 PTZ00266, PTZ00266, NIMA-related protein kinase; Provisional.
          Length = 1021

 Score = 27.8 bits (61), Expect = 4.0
 Identities = 18/65 (27%), Positives = 35/65 (53%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
           E LE  E E  E+E+ E  E E  E+E+ E E  +R++ E +   + +++  +  E +  
Sbjct: 461 ERLEREERERLERERMERIERERLERERLERERLERDRLERDRLDRLERERVDRLERDRL 520

Query: 136 QKNKK 140
           +K ++
Sbjct: 521 EKARR 525



 Score = 27.4 bits (60), Expect = 4.6
 Identities = 17/64 (26%), Positives = 34/64 (53%)

Query: 86  EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
           E  E+EE E  E E  E+ E E  +RE+ E E  ++   + + ++  E E+ ++ + +  
Sbjct: 461 ERLEREERERLERERMERIERERLERERLERERLERDRLERDRLDRLERERVDRLERDRL 520

Query: 146 EEEK 149
           E+ +
Sbjct: 521 EKAR 524



 Score = 26.6 bits (58), Expect = 7.5
 Identities = 19/70 (27%), Positives = 39/70 (55%), Gaps = 2/70 (2%)

Query: 87  EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEE 146
           E+ + E+E    +  E    E+K+ E+ E EE ++ ++  E +E  E E+  +++ E E 
Sbjct: 436 ERARIEKENAHRKALEMKILEKKRIERLEREERERLER--ERMERIERERLERERLERER 493

Query: 147 EEKEKVEMYR 156
            E++++E  R
Sbjct: 494 LERDRLERDR 503



 Score = 26.6 bits (58), Expect = 8.1
 Identities = 15/64 (23%), Positives = 35/64 (54%)

Query: 86  EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
           E+K  E  E EE E  E++  E  +RE+ E E  +++  + + +E +  ++  +++ +  
Sbjct: 456 EKKRIERLEREERERLERERMERIERERLERERLERERLERDRLERDRLDRLERERVDRL 515

Query: 146 EEEK 149
           E ++
Sbjct: 516 ERDR 519



 Score = 26.6 bits (58), Expect = 8.5
 Identities = 19/71 (26%), Positives = 38/71 (53%), Gaps = 1/71 (1%)

Query: 87  EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE-EEVEEEEEEQKNKKKEEEE 145
           EKE    +  E +  EK   E  +RE++E  E ++ ++ E E +E E  E++  +++  E
Sbjct: 441 EKENAHRKALEMKILEKKRIERLEREERERLERERMERIERERLERERLERERLERDRLE 500

Query: 146 EEEKEKVEMYR 156
            +  +++E  R
Sbjct: 501 RDRLDRLERER 511


>gnl|CDD|184416 PRK13955, mscL, large-conductance mechanosensitive channel;
           Provisional.
          Length = 130

 Score = 26.7 bits (59), Expect = 4.0
 Identities = 12/36 (33%), Positives = 23/36 (63%), Gaps = 2/36 (5%)

Query: 109 KKREKKEEEEEKQKDKKEEEVEEEEE--EQKNKKKE 142
            K+E+++EEE  +  K+EE + E  +  +Q+N  K+
Sbjct: 95  SKKEEEKEEEIPEPTKEEELLGEIRDLLKQQNSSKD 130



 Score = 25.6 bits (56), Expect = 9.8
 Identities = 10/19 (52%), Positives = 14/19 (73%)

Query: 88  KEKEEEEEEEEEEKEKDED 106
            +KEEE+EEE  E  K+E+
Sbjct: 95  SKKEEEKEEEIPEPTKEEE 113


>gnl|CDD|130844 TIGR01784, T_den_put_tspse, conserved hypothetical protein
           (putative transposase or invertase).  Several lines of
           evidence suggest that members of this family (loaded as
           a fragment mode model to find part-length matches) are
           associated with transposition, inversion, or
           recombination. Members are found in small numbers of
           genomes, but in large copy numbers in many of those
           species, including over 30 full length and fragmentary
           members in Treponema denticola. The strongest
           similarities are usually within rather than between
           species. PSI-BLAST shows similarity to proteins
           designated as possible transposases, DNA invertases
           (resolvases), and recombinases. In the oral pathogenic
           spirochete Treponema denticola, full-length members are
           often found near transporters or other membrane
           proteins. This family includes members of the putative
           transposase family pfam04754.
          Length = 270

 Score = 27.4 bits (61), Expect = 4.2
 Identities = 16/89 (17%), Positives = 34/89 (38%), Gaps = 4/89 (4%)

Query: 70  QAISASETLENNEWEEEEKEKEEE----EEEEEEEKEKDEDEEKKREKKEEEEEKQKDKK 125
             I        + W    K K+ +       +    +K E+E K      +E E  + ++
Sbjct: 154 AIIDERLLDVLDRWLLLLKGKDNQSVNLIFLQIPFIQKAEEEIKTLLLSSKELELYEQRE 213

Query: 126 EEEVEEEEEEQKNKKKEEEEEEEKEKVEM 154
           +   +    E++ K++  EE   + K+E 
Sbjct: 214 KYMTDAISAEEEGKEEGIEEGILEAKLET 242


>gnl|CDD|217553 pfam03430, TATR, Trans-activating transcriptional regulator.  This
           family of trans-activating transcriptional regulator
           (TATR), also known as intermediate early protein 1, are
           common to the Nucleopolyhedroviruses.
          Length = 571

 Score = 27.6 bits (61), Expect = 4.2
 Identities = 25/126 (19%), Positives = 42/126 (33%), Gaps = 10/126 (7%)

Query: 36  NVSLTDDAEFQCQVGPAQYHKPIRANARLTVILPQAI-SASETLENNEWEEEEKEKEEEE 94
           N SLT  A +              +N  L   +PQ     S      E+ +     E   
Sbjct: 63  NASLTFFALYIP---------LTDSNKPLPRAVPQDYERESAVSPIVEYPQALSPGESLP 113

Query: 95  EEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEM 154
           E   E K+    + K  E   + +     K ++ V + ++ QK KK   +      + E 
Sbjct: 114 ESLSETKQVTVSKRKGSELDSDSDSSDSSKGKKLVNKPKKRQKYKKATIQSPTSLTEEEN 173

Query: 155 YRNPVP 160
           Y   + 
Sbjct: 174 YNTEIC 179


>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
           and chromosome partitioning].
          Length = 420

 Score = 27.4 bits (61), Expect = 4.2
 Identities = 9/55 (16%), Positives = 22/55 (40%)

Query: 88  KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKE 142
                E   E+ E      E++ ++ K  +  +++ K   ++  E    + K +E
Sbjct: 174 AAVRAEIAAEQAELTTLLSEQRAQQAKLAQLLEERKKTLAQLNSELSADQKKLEE 228


>gnl|CDD|218954 pfam06246, Isy1, Isy1-like splicing family.  Isy1 protein is
           important in the optimisation of splicing.
          Length = 253

 Score = 27.3 bits (61), Expect = 4.3
 Identities = 20/76 (26%), Positives = 35/76 (46%), Gaps = 11/76 (14%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV--------- 129
           E+    E E E+E+E  E+  E+ K+  +E  + +++E EE      +++          
Sbjct: 173 EDGVLLEYEAEREKELIEKLVEEWKEIQDEDLQAEEDEREEDIYIPLDDDAEFVAHVPVP 232

Query: 130 --EEEEEEQKNKKKEE 143
             EE EE    K+K E
Sbjct: 233 TQEEVEEWLLEKRKRE 248


>gnl|CDD|129705 TIGR00618, sbcc, exonuclease SbcC.  All proteins in this family for
           which functions are known are part of an exonuclease
           complex with sbcD homologs. This complex is involved in
           the initiation of recombination to regulate the levels
           of palindromic sequences in DNA. This family is based on
           the phylogenomic analysis of JA Eisen (1999, Ph.D.
           Thesis, Stanford University) [DNA metabolism, DNA
           replication, recombination, and repair].
          Length = 1042

 Score = 27.6 bits (61), Expect = 4.3
 Identities = 11/81 (13%), Positives = 29/81 (35%)

Query: 73  SASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
            + E + N +      +   E+  E E+    E     R+ + E++ +      ++  +E
Sbjct: 581 RSKEDIPNLQNITVRLQDLTEKLSEAEDMLACEQHALLRKLQPEQDLQDVRLHLQQCSQE 640

Query: 133 EEEQKNKKKEEEEEEEKEKVE 153
              +       +    +E+V 
Sbjct: 641 LALKLTALHALQLTLTQERVR 661


>gnl|CDD|219805 pfam08347, CTNNB1_binding, N-terminal CTNNB1 binding.  This region
           tends to appear at the N-terminus of proteins also
           containing DNA-binding HMG (high mobility group) boxes
           (pfam00505) and appears to bind the armadillo repeat of
           CTNNB1 (beta-catenin), forming a stable complex.
           Signaling by Wnt through TCF/LCF is involved in
           developmental patterning, induction of neural tissues,
           cell fate decisions and stem cell differentiation.
           Isoforms of HMG T-cell factors lacking the N-terminal
           CTNNB1-binding domain cannot fulfill their role as
           transcriptional activators in T-cell differentiation.
          Length = 200

 Score = 27.2 bits (60), Expect = 4.4
 Identities = 15/66 (22%), Positives = 25/66 (37%), Gaps = 1/66 (1%)

Query: 103 KDE-DEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYRNPVPM 161
           KDE ++E+K  +    E    D K   V E E  + +  +     +   + E YR+    
Sbjct: 22  KDEGEQEEKISENVGAERDLADVKSSLVNESESSESSSSEAISRRDHPPRYESYRDKSRE 81

Query: 162 YRNPVP 167
               V 
Sbjct: 82  KLEDVS 87


>gnl|CDD|224035 COG1110, COG1110, Reverse gyrase [DNA replication, recombination,
           and repair].
          Length = 1187

 Score = 27.3 bits (61), Expect = 4.4
 Identities = 9/33 (27%), Positives = 14/33 (42%)

Query: 72  ISASETLENNEWEEEEKEKEEEEEEEEEEKEKD 104
           I     L   +  E  +E+  E E E E+K + 
Sbjct: 243 IKLRRKLYGEKRAERVREELREVEREREKKRRK 275


>gnl|CDD|234055 TIGR02907, spore_VI_D, stage VI sporulation protein D.  SpoVID, the
           stage VI sporulation protein D, is restricted to
           endospore-forming members of the bacteria, all of which
           are found among the Firmicutes. It is widely distributed
           but not quite universal in this group. Between
           well-conserved N-terminal and C-terminal domains is a
           poorly conserved, low-complexity region of variable
           length, rich enough in glutamic acid to cause spurious
           BLAST search results unless a filter is used. The seed
           alignment for this model was trimmed, in effect, by
           choosing member sequences in which these regions are
           relatively short. SpoVID is involved in spore coat
           assembly by the mother cell compartment late in the
           process of sporulation [Cellular processes, Sporulation
           and germination].
          Length = 338

 Score = 27.2 bits (60), Expect = 4.4
 Identities = 22/72 (30%), Positives = 40/72 (55%)

Query: 72  ISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEE 131
           ISASE  +     E   E+E  EE E+E + + EDEEK  +++ E+ +++     +  + 
Sbjct: 202 ISASEPFQLESEVEASPEEENYEEYEDETELEVEDEEKALDEQTEDPQQEDALAGDAKKA 261

Query: 132 EEEEQKNKKKEE 143
            EEE++  ++ E
Sbjct: 262 LEEEEEKGERPE 273



 Score = 26.8 bits (59), Expect = 6.3
 Identities = 28/83 (33%), Positives = 43/83 (51%), Gaps = 3/83 (3%)

Query: 71  AISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVE 130
           A    +  + +  E  + E E E   EEE  E+ EDE    E + E+EEK  D++ E+ +
Sbjct: 193 ADDDPDEWKISASEPFQLESEVEASPEEENYEEYEDET---ELEVEDEEKALDEQTEDPQ 249

Query: 131 EEEEEQKNKKKEEEEEEEKEKVE 153
           +E+    + KK  EEEEEK +  
Sbjct: 250 QEDALAGDAKKALEEEEEKGERP 272


>gnl|CDD|234045 TIGR02876, spore_yqfD, sporulation protein YqfD.  YqfD is part of
           the sigma-E regulon in the sporulation program of
           endospore-forming Gram-positive bacteria. Mutation
           results in a sporulation defect in Bacillus subtilis.
           Members are found in all currently known
           endospore-forming bacteria, including the genera
           Bacillus, Symbiobacterium, Carboxydothermus,
           Clostridium, and Thermoanaerobacter [Cellular processes,
           Sporulation and germination].
          Length = 382

 Score = 27.3 bits (61), Expect = 4.6
 Identities = 22/69 (31%), Positives = 34/69 (49%), Gaps = 7/69 (10%)

Query: 87  EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEE 146
            KE   E +E+ E+  K+E  EK +EK E+E +K+ D   + V +       K   E  E
Sbjct: 312 VKETYYEVKEKVEKVTKEEAIEKAKEKAEKELKKELDPNAKIVSD-------KILSERVE 364

Query: 147 EEKEKVEMY 155
             K KV ++
Sbjct: 365 GGKVKVTVH 373


>gnl|CDD|218601 pfam05477, SURF2, Surfeit locus protein 2 (SURF2).  Surfeit locus
           protein 2 is part of a group of at least six sequence
           unrelated genes (Surf-1 to Surf-6). The six Surfeit
           genes have been classified as housekeeping genes, being
           expressed in all tissue types tested and not containing
           a TATA box in their promoter region. The exact function
           of SURF2 is unknown.
          Length = 244

 Score = 27.2 bits (60), Expect = 4.7
 Identities = 17/50 (34%), Positives = 29/50 (58%), Gaps = 1/50 (2%)

Query: 77  TLENNEWEEEEKEKEEEEEEEEEEKEKDEDE-EKKREKKEEEEEKQKDKK 125
           TL+N   E+   E ++ E ++E+E E +  E ++KR KK+     +K KK
Sbjct: 181 TLKNPGKEQNGDEDDDFETDDEDEMEVESPELQQKRSKKQSGSLTKKFKK 230


>gnl|CDD|220113 pfam09111, SLIDE, SLIDE.  The SLIDE domain adopts a secondary
           structure comprising a main core of three alpha-helices.
           It has a role in DNA binding, contacting DNA target
           sites similar to c-Myb (pfam00249) repeats or
           homeodomains.
          Length = 118

 Score = 26.4 bits (59), Expect = 4.8
 Identities = 9/27 (33%), Positives = 19/27 (70%)

Query: 132 EEEEQKNKKKEEEEEEEKEKVEMYRNP 158
           E  E+K +K++ +++  + K+E Y+NP
Sbjct: 9   ERGEEKIEKRKNQQKALRRKLEQYKNP 35


>gnl|CDD|184900 PRK14907, rplD, 50S ribosomal protein L4; Provisional.
          Length = 295

 Score = 27.2 bits (60), Expect = 4.8
 Identities = 17/72 (23%), Positives = 34/72 (47%)

Query: 82  EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
           E ++  K+K  EE++   +K     E  K +K  +    +  KK  +V++ +  +   KK
Sbjct: 3   ETKKTTKKKTTEEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKK 62

Query: 142 EEEEEEEKEKVE 153
              + E+ E V+
Sbjct: 63  VTVKFEKTESVK 74


>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207).  This
           family is found in eukaryotes; it has several conserved
           tryptophan residues. The function is not known.
          Length = 261

 Score = 27.0 bits (60), Expect = 4.9
 Identities = 16/72 (22%), Positives = 46/72 (63%)

Query: 90  KEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEK 149
            + E    +EE +K   E + ++ K+++++++++++++  +++EEE++ +K EE  ++  
Sbjct: 169 AKPERNVSQEEAKKRLQEWELKKLKQQQQKREEERRKQRKKQQEEEERKQKAEEAWQKWM 228

Query: 150 EKVEMYRNPVPM 161
           + V+    PVP+
Sbjct: 229 KNVKQRPKPVPL 240



 Score = 27.0 bits (60), Expect = 5.5
 Identities = 15/49 (30%), Positives = 31/49 (63%)

Query: 89  EKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
           +  EE++++E EKE++E E ++R  KE+ EE  + K ++  ++   + K
Sbjct: 106 KLLEEKQKQEREKEREEAELRQRLAKEKYEEWCRQKAQQAAKQRTPKHK 154



 Score = 26.2 bits (58), Expect = 8.4
 Identities = 13/61 (21%), Positives = 40/61 (65%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           + ++K ++  EE++++E+EK+ +E + R++  +E+ ++  +++ +   ++   K+KK+  
Sbjct: 99  QRQKKLQKLLEEKQKQEREKEREEAELRQRLAKEKYEEWCRQKAQQAAKQRTPKHKKEAA 158

Query: 144 E 144
           E
Sbjct: 159 E 159


>gnl|CDD|118278 pfam09746, Membralin, Tumour-associated protein.  Membralin is
           evolutionarily highly conserved; though it seems to
           represent a unique protein family. The protein appears
           to contain several transmembrane regions. In humans it
           is expressed in certain cancers, particularly ovarian
           cancers. Membralin-like gene homologues have been
           identified in plants including grape, cotton and tomato.
          Length = 375

 Score = 27.1 bits (60), Expect = 4.9
 Identities = 7/56 (12%), Positives = 19/56 (33%), Gaps = 8/56 (14%)

Query: 120 KQKDKKEEEVEEEEEEQKNKKK-----EEEEEEEKEKVEMYRNP---VPMYRNPVP 167
           +++        ++E  +  K         +  +  E +E Y+N    + +   P  
Sbjct: 88  EKRAIWLGIKRDQEIVRSLKDSYYYGIGPQTRQNHETLERYQNILGKLGLPVRPTF 143


>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain.  This
           family represents the C-terminus (approximately 300
           residues) of proteins that are involved as binding
           partners for Prp19 as part of the nuclear pore complex.
           The family in Drosophila is necessary for pre-mRNA
           splicing, and the human protein has been found in
           purifications of the spliceosome. In the past this
           family was thought, erroneously, to be associated with
           microfibrillin.
          Length = 277

 Score = 27.2 bits (60), Expect = 5.0
 Identities = 26/76 (34%), Positives = 42/76 (55%)

Query: 78  LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
            E  E EEE++  EEEEEE EEE+E D +++ +   K     K+     +E E E  ++K
Sbjct: 2   TEVLELEEEDESGEEEEEESEEEEETDSEDDMEPRLKPVFTRKKDRITIQEREREAAKEK 61

Query: 138 NKKKEEEEEEEKEKVE 153
             ++E + + E+ K E
Sbjct: 62  ALEEEAKRKAEERKRE 77


>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
          Length = 191

 Score = 26.7 bits (59), Expect = 5.0
 Identities = 21/78 (26%), Positives = 38/78 (48%)

Query: 81  NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
           +  E  +K  ++   E+ E   K   ++KK   ++++  K+  KK++E EE  E    + 
Sbjct: 26  STTESVDKVNDDIFPEDVEIPSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEEL 85

Query: 141 KEEEEEEEKEKVEMYRNP 158
            + EE EE +K   Y  P
Sbjct: 86  SDSEENEENDKKVDYELP 103



 Score = 26.3 bits (58), Expect = 7.7
 Identities = 10/46 (21%), Positives = 24/46 (52%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
           +E+E+  E   EE  + +E +E+++K   +  + +    +   E+V
Sbjct: 72  KEKEEVPELAAEELSDSEENEENDKKVDYELPKVQNTAAEVNHEDV 117



 Score = 25.9 bits (57), Expect = 9.1
 Identities = 14/42 (33%), Positives = 27/42 (64%)

Query: 109 KKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
           KK +KK++E+E+  +   EE+ + EE ++N KK + E  + +
Sbjct: 65  KKTKKKKKEKEEVPELAAEELSDSEENEENDKKVDYELPKVQ 106


>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
           and conversion].
          Length = 161

 Score = 26.5 bits (59), Expect = 5.0
 Identities = 18/64 (28%), Positives = 36/64 (56%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
            E E+ KEE +    E +++ E+  ++  +  E+ +K+ ++  EE++ E EE+  + KE 
Sbjct: 47  AEAERLKEEAQALLAEYEQELEEAREQASEIIEQAKKEAEQIAEEIKAEAEEELERIKEA 106

Query: 144 EEEE 147
            E E
Sbjct: 107 AEAE 110


>gnl|CDD|145361 pfam02161, Prog_receptor, Progesterone receptor. 
          Length = 456

 Score = 27.3 bits (60), Expect = 5.1
 Identities = 10/34 (29%), Positives = 18/34 (52%)

Query: 93  EEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE 126
           + +EE +EK +D+      E+ E E E  + + E
Sbjct: 54  QGQEEPDEKTQDQQSLSDVERAEPEVEASRGRGE 87


>gnl|CDD|240274 PTZ00112, PTZ00112, origin recognition complex 1 protein;
           Provisional.
          Length = 1164

 Score = 27.3 bits (60), Expect = 5.1
 Identities = 11/55 (20%), Positives = 26/55 (47%)

Query: 90  KEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
            + +E+ +E++K   +D +  ++ K   E+ +      +V       K  KK+E+
Sbjct: 220 DKNKEKNKEKDKNIKKDRDGDKQTKRNSEKSKVQNSHFDVRILRSYTKENKKDEK 274


>gnl|CDD|240578 cd12951, RRP7_Rrp7A, RRP7 domain ribosomal RNA-processing protein 7
           homolog A (Rrp7A) and similar proteins.  The family
           corresponds to the RRP7 domain of Rrp7A, also termed
           gastric cancer antigen Zg14, and similar proteins which
           are yeast ribosomal RNA-processing protein 7 (Rrp7p)
           homologs mainly found in Metazoans. The cellular
           function of Rrp7A remains unclear currently. Rrp7A
           harbors an N-terminal RNA recognition motif (RRM), also
           termed RBD (RNA binding domain) or RNP
           (ribonucleoprotein domain), and a C-terminal RRP7
           domain.
          Length = 129

 Score = 26.5 bits (59), Expect = 5.2
 Identities = 24/59 (40%), Positives = 35/59 (59%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKE 142
           EE +KE+EEE+EE+E+E E DED      KK    +  + +       E+E++K KKKE
Sbjct: 28  EEYDKEEEEEKEEKEKEAEPDEDGWVTVTKKGRRPKTARKESVAAKAAEKEKKKKKKKE 86


>gnl|CDD|221756 pfam12757, DUF3812, Protein of unknown function (DUF3812).  This is
           a family of fungal proteins whose function is not known.
          Length = 126

 Score = 26.5 bits (59), Expect = 5.3
 Identities = 12/34 (35%), Positives = 24/34 (70%)

Query: 89  EKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK 122
           E+ E +   +EEK+ DE+E K++ ++ +E E++K
Sbjct: 91  ERAEAQRARDEEKKLDEEEAKRQHEEAKEREREK 124


>gnl|CDD|221371 pfam12004, DUF3498, Domain of unknown function (DUF3498).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is typically
           between 433 to 538 amino acids in length. This domain is
           found associated with pfam00616, pfam00168. This domain
           has two conserved sequence motifs: DLQ and PLSFQNP.
          Length = 489

 Score = 27.0 bits (59), Expect = 5.3
 Identities = 14/51 (27%), Positives = 28/51 (54%)

Query: 95  EEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
           E+ E E E++ +E  + EK E+E  + K++    V   EE ++    +E++
Sbjct: 349 EDIEYETEENREEGTQAEKYEQEIARLKERLRVSVRRLEEYERRLLGQEQQ 399


>gnl|CDD|219358 pfam07271, Cytadhesin_P30, Cytadhesin P30/P32.  This family
           consists of several Mycoplasma species specific
           Cytadhesin P32 and P30 proteins. P30 has been found to
           be membrane associated and localised on the tip
           organelle. It is thought that it is important in
           cytadherence and virulence.
          Length = 279

 Score = 27.0 bits (59), Expect = 5.3
 Identities = 14/60 (23%), Positives = 29/60 (48%), Gaps = 3/60 (5%)

Query: 109 KKREKK---EEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYRNPVPMYRNP 165
           K++EK+   E+E ++Q  ++ + + E+ E+Q  +    EE   +E  +     V     P
Sbjct: 96  KRKEKRLLEEKERQEQMAEQLQRISEQNEQQAIEIDPTEEVNTQEPTQPAGVNVANNPQP 155


>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
          Length = 246

 Score = 27.0 bits (60), Expect = 5.4
 Identities = 18/83 (21%), Positives = 28/83 (33%), Gaps = 2/83 (2%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
           E   E E E E   E  ++      K E K + + K K K  ++VEE+ +  +  K  E 
Sbjct: 69  EPVVEPEPEPEPIPEPPKEAPVVIPKPEPKPKPKPKPKPKPVKKVEEQPK--REVKPVEP 126

Query: 145 EEEEKEKVEMYRNPVPMYRNPVP 167
                 +      P         
Sbjct: 127 RPASPFENTAPARPTSSTATAAA 149


>gnl|CDD|191249 pfam05279, Asp-B-Hydro_N, Aspartyl beta-hydroxylase N-terminal
           region.  This family includes the N-terminal regions of
           the junctin, junctate and aspartyl beta-hydroxylase
           proteins. Junctate is an integral ER/SR membrane calcium
           binding protein, which comes from an alternatively
           spliced form of the same gene that generates aspartyl
           beta-hydroxylase and junctin. Aspartyl beta-hydroxylase
           catalyzes the post-translational hydroxylation of
           aspartic acid or asparagine residues contained within
           epidermal growth factor (EGF) domains of proteins.
          Length = 240

 Score = 26.8 bits (59), Expect = 5.5
 Identities = 14/68 (20%), Positives = 36/68 (52%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           E E   +EE +    + K+K E+E K++ +   E+     ++E+   +E +  ++K    
Sbjct: 81  EAEPHAEEEGQLAVRKTKQKVEEEVKEQLQSLLEKIVVSKQEEDGPGKEPQLDEDKFLLA 140

Query: 144 EEEEEKEK 151
           E+ +++++
Sbjct: 141 EDSDDRQE 148


>gnl|CDD|220972 pfam11081, DUF2890, Protein of unknown function (DUF2890).  This
           family is conserved in dsDNA adenoviruses of
           vertebrates. The function is not known.
          Length = 172

 Score = 26.4 bits (58), Expect = 5.5
 Identities = 15/49 (30%), Positives = 29/49 (59%)

Query: 112 EKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYRNPVP 160
           + +EE+ + Q ++ EE+ EE E+ + +  +E+EE EE E+     +  P
Sbjct: 18  KDEEEDWDSQAEEVEEDEEEMEDWEDSLDEEDEEAEEVEEETAASSKAP 66


>gnl|CDD|221244 pfam11818, DUF3340, C-terminal domain of tail specific protease
           (DUF3340).  This presumed domain is found at the
           C-terminus of tail specific proteases. Its function is
           unknown. This family is found in bacteria and
           eukaryotes. This presumed domain is typically between 88
           to 187 amino acids in length.
          Length = 139

 Score = 26.4 bits (59), Expect = 5.5
 Identities = 15/45 (33%), Positives = 26/45 (57%)

Query: 89  EKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
           E E + E EE+EK++ + E ++R+   EE  K  D   ++ EE +
Sbjct: 77  EAERKAEREEDEKKRLKRENERRKALGEEPLKSLDDLPKDKEEPD 121


>gnl|CDD|223642 COG0568, RpoD, DNA-directed RNA polymerase, sigma subunit
           (sigma70/sigma32) [Transcription].
          Length = 342

 Score = 26.9 bits (60), Expect = 5.6
 Identities = 11/73 (15%), Positives = 23/73 (31%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
               + + E  ++ E+E+   E  E   + K   E  ++ ++                  
Sbjct: 23  LLVREAEVELAKQLEDEQLLVELGEDLTDLKLGREPSERARRPAGRLSFYIRAIEAAPLL 82

Query: 144 EEEEEKEKVEMYR 156
             EEEK      +
Sbjct: 83  TPEEEKALARRLK 95


>gnl|CDD|217787 pfam03910, Adeno_PV, Adenovirus minor core protein PV. 
          Length = 336

 Score = 27.1 bits (60), Expect = 5.6
 Identities = 20/64 (31%), Positives = 31/64 (48%), Gaps = 9/64 (14%)

Query: 110 KREKKEE-------EEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYRNPVPMY 162
           KR+ KEE       E    +  K+E    + +  K +KK EE++E  ++VE  R+  P  
Sbjct: 3   KRKIKEEMLQILAPEIYGPRPVKDEAKPRKIKRVKKRKKREEKDELDDEVEFVRSFAP-- 60

Query: 163 RNPV 166
           R  V
Sbjct: 61  RRRV 64


>gnl|CDD|184539 PRK14150, PRK14150, heat shock protein GrpE; Provisional.
          Length = 193

 Score = 26.4 bits (59), Expect = 5.7
 Identities = 16/56 (28%), Positives = 28/56 (50%)

Query: 73  SASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
           S  E     E   EE E E E++EE +E E +++ ++   +  E E +  + + EE
Sbjct: 2   SNEEQKTPQEQVSEEIEAEVEQQEEADEAELEDELDEADARIAELEAQLAEAQAEE 57



 Score = 26.0 bits (58), Expect = 9.3
 Identities = 16/59 (27%), Positives = 33/59 (55%), Gaps = 2/59 (3%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKE 142
            EE+K  +E+  EE E +   E +E+  E + E+E  + D +  E+E +  E + ++++
Sbjct: 3   NEEQKTPQEQVSEEIEAEV--EQQEEADEAELEDELDEADARIAELEAQLAEAQAEERD 59


>gnl|CDD|225087 COG2176, PolC, DNA polymerase III, alpha subunit (gram-positive
           type) [DNA replication, recombination, and repair].
          Length = 1444

 Score = 27.3 bits (61), Expect = 5.7
 Identities = 21/94 (22%), Positives = 39/94 (41%), Gaps = 3/94 (3%)

Query: 58  IRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEE 117
            +   +L+V  P        L   + + +      E+     E +  +++  + +KK + 
Sbjct: 99  KKIIEQLSVKSP---IFKSLLNKLKLKVKGNNILIEQVLNNPEFDHFKNKSPELQKKLQS 155

Query: 118 EEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
               +   E EV +  EEQ+ +K EE   EE EK
Sbjct: 156 FGFPQLLIEFEVNDISEEQEFEKFEEAINEEVEK 189


>gnl|CDD|221389 pfam12037, DUF3523, Domain of unknown function (DUF3523).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is typically
           between 257 to 277 amino acids in length. This domain is
           found associated with pfam00004. This domain has a
           conserved LER sequence motif.
          Length = 276

 Score = 26.7 bits (59), Expect = 5.7
 Identities = 24/93 (25%), Positives = 46/93 (49%), Gaps = 5/93 (5%)

Query: 70  QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
           +A  A   LE    E EE+ K  +E+ ++E++     +E  R++ ++E E+Q+ + EE +
Sbjct: 76  EAQQAQAKLERARVEAEERRKTLQEQTQQEQQRAQYQDELARKRYQKELEQQRRQNEELL 135

Query: 130 EEEEE-----EQKNKKKEEEEEEEKEKVEMYRN 157
           + +EE     E   +  EEE  E + +      
Sbjct: 136 KMQEESVLRQEAMRRATEEEILEMRRETIEEEA 168


>gnl|CDD|206307 pfam14138, COX16, Cytochrome c oxidase assembly protein COX16.
           This family represents homologues of COX16 which has
           been shown to be involved in assembly of cytochrome
           oxidase. Protein in this family are typically between
           106 and 134 amino acids in length.
          Length = 79

 Score = 25.6 bits (57), Expect = 5.8
 Identities = 12/41 (29%), Positives = 23/41 (56%)

Query: 109 KKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEK 149
           +K ++ EEEE  +  KK  +V+ EEE  + +  + ++ E  
Sbjct: 31  RKVQQLEEEEALKLLKKRRKVDLEEEYYRLQGLDIDDWENV 71


>gnl|CDD|227509 COG5182, CUS1, Splicing factor 3b, subunit 2 [RNA processing and
           modification].
          Length = 429

 Score = 26.9 bits (59), Expect = 5.9
 Identities = 15/71 (21%), Positives = 28/71 (39%), Gaps = 2/71 (2%)

Query: 79  ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEE--EEEEQ 136
           E  E E  + E EE   +EE+      + E   E+ ++       +      E   +E+ 
Sbjct: 351 ETAEGELLQFEAEEIYNDEEQYTRPVYENEDFEERVDDVSNGGLAEFAPAAVEVAPKEDA 410

Query: 137 KNKKKEEEEEE 147
           K  K+ + + E
Sbjct: 411 KVTKRGKAKRE 421


>gnl|CDD|100109 cd05831, Ribosomal_P1, Ribosomal protein P1. This subfamily
           represents the eukaryotic large ribosomal protein P1.
           Eukaryotic P1 and P2 are functionally equivalent to the
           bacterial protein L7/L12, but are not homologous to
           L7/L12. P1 is located in the L12 stalk, with proteins
           P2, P0, L11, and 28S rRNA. P1 and P2 are the only
           proteins in the ribosome to occur as multimers, always
           appearing as sets of heterodimers. Recent data indicate
           that eukaryotes have four copies (two heterodimers),
           while most archaeal species contain six copies of L12p
           (three homodimers) and bacteria may have four or six
           copies (two or three homodimers), depending on the
           species. Experiments using S. cerevisiae P1 and P2
           indicate that P1 proteins are positioned more internally
           with limited reactivity in the C-terminal domains, while
           P2 proteins seem to be more externally located and are
           more likely to interact with other cellular components.
           In lower eukaryotes, P1 and P2 are further subdivided
           into P1A, P1B, P2A, and P2B, which form P1A/P2B and
           P1B/P2A heterodimers. Some plant species have a third
           P-protein, called P3, which is not homologous to P1 and
           P2. In humans, P1 and P2 are strongly autoimmunogenic.
           They play a significant role in the etiology and
           pathogenesis of systemic lupus erythema (SLE). In
           addition, the ribosome-inactivating protein
           trichosanthin (TCS) interacts with human P0, P1, and P2,
           with its primary binding site located in the C-terminal
           region of P2. TCS inactivates the ribosome by
           depurinating a specific adenine in the sarcin-ricin loop
           of 28S rRNA.
          Length = 103

 Score = 25.7 bits (57), Expect = 5.9
 Identities = 10/20 (50%), Positives = 16/20 (80%)

Query: 87  EKEKEEEEEEEEEEKEKDED 106
             E ++EE++EEEE+E D+D
Sbjct: 78  AAEAKKEEKKEEEEEESDDD 97



 Score = 25.7 bits (57), Expect = 6.5
 Identities = 9/17 (52%), Positives = 15/17 (88%)

Query: 84  EEEEKEKEEEEEEEEEE 100
           E +++EK+EEEEEE ++
Sbjct: 80  EAKKEEKKEEEEEESDD 96



 Score = 25.7 bits (57), Expect = 6.7
 Identities = 11/20 (55%), Positives = 15/20 (75%)

Query: 85  EEEKEKEEEEEEEEEEKEKD 104
             E +KEE++EEEEEE + D
Sbjct: 78  AAEAKKEEKKEEEEEESDDD 97



 Score = 25.4 bits (56), Expect = 7.9
 Identities = 14/32 (43%), Positives = 21/32 (65%), Gaps = 1/32 (3%)

Query: 69  PQAISASETLENNEWEEEEKEKEEEEEEEEEE 100
           P A +A+      E ++EEK KEEEEEE +++
Sbjct: 67  PAAAAAAAAAAAAEAKKEEK-KEEEEEESDDD 97


>gnl|CDD|225880 COG3343, RpoE, DNA-directed RNA polymerase, delta subunit
           [Transcription].
          Length = 175

 Score = 26.6 bits (59), Expect = 6.1
 Identities = 21/69 (30%), Positives = 42/69 (60%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
            E+K+ + +++E +  +E DEDE    E KEEEE+ + D  ++E ++E+E+     +   
Sbjct: 95  TEKKDIKAKDKEVDAFEEGDEDELDYDEDKEEEEDDEVDSLDDENDDEDEDDDEIVEILI 154

Query: 145 EEEEKEKVE 153
           E++E ++ E
Sbjct: 155 EDDEVDEDE 163


>gnl|CDD|150420 pfam09744, Jnk-SapK_ap_N, JNK_SAPK-associated protein-1.  This is
           the N-terminal 200 residues of a set of proteins
           conserved from yeasts to humans. Most of the proteins in
           this entry have an RhoGEF pfam00621 domain at their
           C-terminal end.
          Length = 158

 Score = 26.2 bits (58), Expect = 6.2
 Identities = 26/91 (28%), Positives = 48/91 (52%), Gaps = 3/91 (3%)

Query: 67  ILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE 126
           ++P  I   E L++   E +E E E E   E+ E+   + E   REK+  ++ +QK  + 
Sbjct: 27  LMPLVIRVLENLDSLASENQELEVELELLREDNERLSTQYE---REKELRKQAEQKLLEF 83

Query: 127 EEVEEEEEEQKNKKKEEEEEEEKEKVEMYRN 157
           E+  E+E+++  KK E+ EE  ++     +N
Sbjct: 84  EDELEQEKKELQKKIEDLEENVRQLELKAKN 114


>gnl|CDD|191675 pfam07047, OPA3, Optic atrophy 3 protein (OPA3).  This family
           consists of several optic atrophy 3 (OPA3) proteins.
           OPA3 deficiency causes type III 3-methylglutaconic
           aciduria (MGA) in humans. This disease manifests with
           early bilateral optic atrophy, spasticity,
           extrapyramidal dysfunction, ataxia, and cognitive
           deficits, but normal longevity.
          Length = 134

 Score = 26.1 bits (58), Expect = 6.3
 Identities = 10/33 (30%), Positives = 22/33 (66%)

Query: 107 EEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
           +E K+E++ ++E ++ + +  E+E E E QK +
Sbjct: 102 KEAKKEEELQQELEELEARVGELELEIERQKAR 134


>gnl|CDD|184060 PRK13453, PRK13453, F0F1 ATP synthase subunit B; Provisional.
          Length = 173

 Score = 26.4 bits (58), Expect = 6.3
 Identities = 16/68 (23%), Positives = 38/68 (55%)

Query: 80  NNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
           N + ++ E+ K   ++ EEE K+K ++ +++ +K  E+ + Q  +++E++  E   + N 
Sbjct: 55  NRDIDDAEQAKLNAQKLEEENKQKLKETQEEVQKILEDAKVQARQQQEQIIHEANVRANG 114

Query: 140 KKEEEEEE 147
             E  + E
Sbjct: 115 MIETAQSE 122


>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355).  This
           family of proteins is found in bacteria and viruses.
           Proteins in this family are typically between 180 and
           214 amino acids in length.
          Length = 125

 Score = 26.1 bits (58), Expect = 6.3
 Identities = 24/68 (35%), Positives = 39/68 (57%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
            EEEK   ++E ++   KEK + E+K+ EKK E E+  K   EE+ E E E+ + + +E 
Sbjct: 2   PEEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEEL 61

Query: 144 EEEEEKEK 151
           E E  + +
Sbjct: 62  EAELARRE 69



 Score = 26.1 bits (58), Expect = 6.6
 Identities = 21/63 (33%), Positives = 39/63 (61%)

Query: 89  EKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEE 148
           E EEE+   ++E +K   +EK + +K++EE+K + +K  ++  EE+ +   +K E+E EE
Sbjct: 1   EPEEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEE 60

Query: 149 KEK 151
            E 
Sbjct: 61  LEA 63


>gnl|CDD|224236 COG1317, FliH, Flagellar biosynthesis/type III secretory pathway
           protein [Cell motility and secretion / Intracellular
           trafficking and secretion].
          Length = 234

 Score = 26.6 bits (59), Expect = 6.3
 Identities = 24/88 (27%), Positives = 38/88 (43%), Gaps = 1/88 (1%)

Query: 63  RLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEE-EKEKDEDEEKKREKKEEEEEKQ 121
           R   +  Q I   +          EK   EEEE E+  E +++E E   +E +E  EE  
Sbjct: 9   RDDALPFQVIEFKDLANAGPKVAPEKAVLEEEELEQALEAKEEELESAAQELQEGIEEGA 68

Query: 122 KDKKEEEVEEEEEEQKNKKKEEEEEEEK 149
           ++  EE  +   EE   + +EE    E+
Sbjct: 69  REGYEEGFQLGYEEGFEEGQEEGRVLER 96



 Score = 26.2 bits (58), Expect = 9.1
 Identities = 17/69 (24%), Positives = 31/69 (44%)

Query: 83  WEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKE 142
            E ++      +   E+   ++E+ E+  E KEEE E    + +E +EE   E   +  +
Sbjct: 18  IEFKDLANAGPKVAPEKAVLEEEELEQALEAKEEELESAAQELQEGIEEGAREGYEEGFQ 77

Query: 143 EEEEEEKEK 151
              EE  E+
Sbjct: 78  LGYEEGFEE 86


>gnl|CDD|236356 PRK08944, motB, flagellar motor protein MotB; Reviewed.
          Length = 302

 Score = 26.9 bits (60), Expect = 6.3
 Identities = 12/69 (17%), Positives = 32/69 (46%)

Query: 86  EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
           +  ++  E +  E +      +++ +++  E   + K +K    E+ E+  K   ++ E+
Sbjct: 98  DMTQQTLEFQAGESDSAGGTQQQRGKQRGGEASAEVKAEKAAAQEQIEDLVKKLAQQLEK 157

Query: 146 EEEKEKVEM 154
           E E   +E+
Sbjct: 158 EIEDGAIEI 166



 Score = 26.1 bits (58), Expect = 9.7
 Identities = 15/74 (20%), Positives = 36/74 (48%), Gaps = 1/74 (1%)

Query: 64  LTVILPQAISASE-TLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK 122
           + VI+ Q I  ++ TLE    E +     +++  ++   E   + + ++   +E+ E   
Sbjct: 89  IEVIMQQTIDMTQQTLEFQAGESDSAGGTQQQRGKQRGGEASAEVKAEKAAAQEQIEDLV 148

Query: 123 DKKEEEVEEEEEEQ 136
            K  +++E+E E+ 
Sbjct: 149 KKLAQQLEKEIEDG 162


>gnl|CDD|240317 PTZ00217, PTZ00217, flap endonuclease-1; Provisional.
          Length = 393

 Score = 26.9 bits (60), Expect = 6.3
 Identities = 14/34 (41%), Positives = 23/34 (67%)

Query: 103 KDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQ 136
           K  + EK+RE++EE EE+ +   EE  +EE ++Q
Sbjct: 95  KSGELEKRRERREEAEEELEKAIEEGDDEEIKKQ 128


>gnl|CDD|227061 COG4717, COG4717, Uncharacterized conserved protein [Function
           unknown].
          Length = 984

 Score = 27.1 bits (60), Expect = 6.3
 Identities = 16/72 (22%), Positives = 37/72 (51%)

Query: 82  EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
           E E EE+ ++   ++EEE +  DE  E+K     +  +K++     +  +   +++   +
Sbjct: 399 EDEAEEEARQRLADDEEEVRAGDEAREEKIAANSQVIDKEEVCNLYDRRDTAWQKQRFLR 458

Query: 142 EEEEEEEKEKVE 153
           E++   E++K E
Sbjct: 459 EKQTAFERQKTE 470


>gnl|CDD|178635 PLN03086, PLN03086, PRLI-interacting factor K; Provisional.
          Length = 567

 Score = 26.8 bits (59), Expect = 6.4
 Identities = 14/63 (22%), Positives = 33/63 (52%), Gaps = 3/63 (4%)

Query: 88  KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
           +EK E E+ E +++ K + E   RE+K +EE  ++ +  E  +        + + + +++
Sbjct: 9   REKLEREQRERKQRAKLKLE---RERKAKEEAAKQREAIEAAQRSRRLDAIEAQIKADQQ 65

Query: 148 EKE 150
            +E
Sbjct: 66  MQE 68



 Score = 26.4 bits (58), Expect = 10.0
 Identities = 12/42 (28%), Positives = 25/42 (59%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE 126
           E  + +E+ E E+ E K++ + + ++  K +EE  KQ++  E
Sbjct: 4   ELRRAREKLEREQRERKQRAKLKLERERKAKEEAAKQREAIE 45


>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
           [Translation, ribosomal structure and biogenesis].
          Length = 109

 Score = 25.8 bits (57), Expect = 6.4
 Identities = 18/49 (36%), Positives = 24/49 (48%)

Query: 58  IRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDED 106
           I+  A        A +A+      E   E  E EEEE+EEE E+E D+D
Sbjct: 54  IKNAAEAPAAAAAAGAAAAAAAGAEAAAEADEAEEEEKEEEAEEESDDD 102


>gnl|CDD|240253 PTZ00068, PTZ00068, 60S ribosomal protein L13a; Provisional.
          Length = 202

 Score = 26.6 bits (59), Expect = 6.4
 Identities = 9/43 (20%), Positives = 20/43 (46%)

Query: 108 EKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
           E+KR+++     K+K K  +  +E  ++   K  +      K+
Sbjct: 156 EEKRKERAAAYYKKKVKLRKAWKEARKKALAKLPKAIVAVLKK 198


>gnl|CDD|173635 cd05054, PTKc_VEGFR, Catalytic domain of the Protein Tyrosine
           Kinases, Vascular Endothelial Growth Factor Receptors.
           Protein Tyrosine Kinase (PTK) family; Vascular
           Endothelial Growth Factor Receptor (VEGFR) subfamily;
           catalytic (c) domain. The VEGFR subfamily consists of
           VEGFR1 (Flt1), VEGFR2 (Flk1), VEGFR3 (Flt4), and similar
           proteins. The PTKc family is part of a larger
           superfamily that includes the catalytic domains of other
           kinases such as protein serine/threonine kinases, RIO
           kinases, and phosphoinositide 3-kinase (PI3K). PTKs
           catalyze the transfer of the gamma-phosphoryl group from
           ATP to tyrosine (tyr) residues in protein substrates.
           VEGFR subfamily members are receptor tyr kinases (RTKs)
           containing an extracellular ligand-binding region with
           seven immunoglobulin (Ig)-like domains, a transmembrane
           segment, and an intracellular catalytic domain. In
           VEGFR3, the fifth Ig-like domain is replaced by a
           disulfide bridge. The binding of VEGFRs to their
           ligands, the VEGFs, leads to receptor dimerization,
           activation, and intracellular signaling. There are five
           VEGF ligands in mammals, which bind, in an overlapping
           pattern to the three VEGFRs, which can form homo or
           heterodimers. VEGFRs regulate the cardiovascular system.
           They are critical for vascular development during
           embryogenesis and blood vessel formation in adults. They
           induce cellular functions common to other growth factor
           receptors such as cell migration, survival, and
           proliferation. VEGFR1 binds VEGFA, VEGFB, and placenta
           growth factor (PLGF). It regulates monocyte and
           macrophage migration, vascular permeability,
           haematopoiesis, and the recruitment of haematopietic
           progenitor cells from the bone marrow.
          Length = 337

 Score = 26.7 bits (59), Expect = 6.5
 Identities = 9/34 (26%), Positives = 23/34 (67%)

Query: 71  AISASETLENNEWEEEEKEKEEEEEEEEEEKEKD 104
           ++S+S++  ++ + E++   + EE+EE +E  K+
Sbjct: 135 SVSSSQSSASSGFIEDKSLSDVEEDEEGDELYKE 168


>gnl|CDD|221288 pfam11882, DUF3402, Domain of unknown function (DUF3402).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 350 to 473 amino acids in length. This domain is
           found associated with pfam07923.
          Length = 402

 Score = 26.5 bits (59), Expect = 6.7
 Identities = 21/79 (26%), Positives = 35/79 (44%), Gaps = 4/79 (5%)

Query: 83  WEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKE 142
           WEE EK  + E   + E    D  +  +   KE + E      E++ ++EE E+ +   E
Sbjct: 25  WEEREKFMKYERGWDTESLVGDPLDISE-SVKELKLEMYTSLAEDQPKKEEIERLSTDSE 83

Query: 143 EEEEEEKEKVE-MYRNPVP 160
           E      ++VE  Y   +P
Sbjct: 84  EIPRL--DRVESFYSEALP 100


>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
          Length = 880

 Score = 26.9 bits (60), Expect = 6.9
 Identities = 25/69 (36%), Positives = 38/69 (55%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           E E++ +  EE  E+ E+   E  E   EK+E  EE ++   E E E EE+ +   + EE
Sbjct: 506 EAEDRIERLEERREDLEELIAERRETIEEKRERAEELRERAAELEAEAEEKREAAAEAEE 565

Query: 144 EEEEEKEKV 152
           E EE +E+V
Sbjct: 566 EAEEAREEV 574



 Score = 26.9 bits (60), Expect = 6.9
 Identities = 18/65 (27%), Positives = 31/65 (47%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           EE  +E  EE  E E E E+  +  + R ++ EE E++ ++  E   +   +  N +   
Sbjct: 355 EERAEELREEAAELESELEEAREAVEDRREEIEELEEEIEELRERFGDAPVDLGNAEDFL 414

Query: 144 EEEEE 148
           EE  E
Sbjct: 415 EELRE 419


>gnl|CDD|218806 pfam05911, DUF869, Plant protein of unknown function (DUF869).
           This family consists of a number of sequences found in
           Arabidopsis thaliana, Oryza sativa and Lycopersicon
           esculentum (Tomato). The function of this family is
           unknown.
          Length = 767

 Score = 26.8 bits (59), Expect = 6.9
 Identities = 16/82 (19%), Positives = 40/82 (48%)

Query: 73  SASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
           S +ET      E  E  +   ++ E E K   E      ++ E+E +  ++   +  E E
Sbjct: 646 SLAETQLKCMVESYESLELRAKDLEAEHKSLQEKISSLEDELEKERQNHQELIAKCRELE 705

Query: 133 EEEQKNKKKEEEEEEEKEKVEM 154
           E+ ++ +++E  ++ ++E+ ++
Sbjct: 706 EKIERAEQEENMQKLDEEEQKI 727


>gnl|CDD|214360 CHL00094, dnaK, heat shock protein 70.
          Length = 621

 Score = 26.6 bits (59), Expect = 6.9
 Identities = 23/92 (25%), Positives = 45/92 (48%), Gaps = 13/92 (14%)

Query: 72  ISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKR---------EKK-EEEEEKQ 121
           I  + TL     +E E+  +E E+   E+KEK E  + K          EK+ +E ++K 
Sbjct: 496 IQGASTLPK---DEVERMVKEAEKNAAEDKEKREKIDLKNQAESLCYQAEKQLKELKDKI 552

Query: 122 KDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
            ++K+E++E   ++ +   + +  E  K  +E
Sbjct: 553 SEEKKEKIENLIKKLRQALQNDNYESIKSLLE 584


>gnl|CDD|177447 PHA02664, PHA02664, hypothetical protein; Provisional.
          Length = 534

 Score = 26.9 bits (59), Expect = 6.9
 Identities = 18/70 (25%), Positives = 32/70 (45%), Gaps = 3/70 (4%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVE---EEEEEQKNKK 140
           E++  E+E  +E  + E + D          E+E +   D   +E +   E ++    + 
Sbjct: 454 EDDMDEQESGDERADGEDDSDSSYSYSTTSSEDESDSADDSWGDESDSGIEHDDGGVGQA 513

Query: 141 KEEEEEEEKE 150
            EEEEEEE+ 
Sbjct: 514 IEEEEEEERA 523


>gnl|CDD|223336 COG0258, Exo, 5'-3' exonuclease (including N-terminal domain of
           PolI) [DNA replication, recombination, and repair].
          Length = 310

 Score = 26.5 bits (59), Expect = 7.1
 Identities = 9/39 (23%), Positives = 15/39 (38%)

Query: 109 KKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
           K    + E  E+ K  +E+E+ +E   Q     E     
Sbjct: 70  KPPTFRHELLEEYKANREKEMPDELAPQIPILTELLVAL 108


>gnl|CDD|218555 pfam05320, Pox_RNA_Pol_19, Poxvirus DNA-directed RNA polymerase 19
           kDa subunit.  This family contains several DNA-directed
           RNA polymerase 19 kDa polypeptides. The Poxvirus
           DNA-directed RNA polymerase (EC: 2.7.7.6) catalyzes
           DNA-template-directed extension of the 3'-end of an RNA
           strand by one nucleotide at a time.
          Length = 167

 Score = 26.2 bits (58), Expect = 7.1
 Identities = 14/62 (22%), Positives = 30/62 (48%), Gaps = 5/62 (8%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEE-----KQKDKKEEEVEEEEEEQKNK 139
           E+  +  + E ++++ +E +E+EE + + +  E       KQ + K E   E  E+  + 
Sbjct: 2   EDSDDIIDYESDDDDSEEYEEEEEDEEDAESLESSDVSSLKQSNYKIESASENIEDATSN 61

Query: 140 KK 141
            K
Sbjct: 62  PK 63


>gnl|CDD|204614 pfam11221, Med21, Subunit 21 of Mediator complex.  Med21 has been
           known as Srb7 in yeasts, hSrb7 in humans and Trap 19 in
           Drosophila. The heterodimer of the two subunits Med7 and
           Med21 appears to act as a hinge between the middle and
           the tail regions of Mediator.
          Length = 132

 Score = 26.1 bits (58), Expect = 7.2
 Identities = 14/39 (35%), Positives = 26/39 (66%), Gaps = 2/39 (5%)

Query: 115 EEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
           EEE+ ++  + EEE+ E E E+  ++  +E+E+  +KVE
Sbjct: 87  EEEQLRRIKELEEELREVEAER--EEAVKEKEKLLKKVE 123


>gnl|CDD|227038 COG4694, COG4694, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 758

 Score = 26.8 bits (59), Expect = 7.2
 Identities = 15/68 (22%), Positives = 35/68 (51%)

Query: 88  KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
           K+  ++ E+ E +KE    + +K+ K E   + +  ++E+E ++   +      ++ EE 
Sbjct: 111 KKTNDDLEKIESKKESINKKNEKKIKNEASLQVKTQREEKEEKDFIADCWRNLYKKNEER 170

Query: 148 EKEKVEMY 155
            KE +E +
Sbjct: 171 FKEYLENF 178


>gnl|CDD|217417 pfam03194, LUC7, LUC7 N_terminus.  This family contains the N
           terminal region of several LUC7 protein homologues and
           only contains eukaryotic proteins. LUC7 has been shown
           to be a U1 snRNA associated protein with a role in
           splice site recognition. The family also contains human
           and mouse LUC7 like (LUC7L) proteins and human cisplatin
           resistance-associated overexpressed protein (CROP).
          Length = 252

 Score = 26.6 bits (59), Expect = 7.3
 Identities = 23/71 (32%), Positives = 37/71 (52%)

Query: 78  LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
            +  E  +EE+ K   + + EE  E DE+  K   + E   E+ K  +  ++ +E EE K
Sbjct: 103 KQRLELTQEEQTKIAADSKAEELAELDEEIGKLLAEAEALGEEGKVDEAMKLMKEVEELK 162

Query: 138 NKKKEEEEEEE 148
            KKKE E+ +E
Sbjct: 163 AKKKELEDSDE 173


>gnl|CDD|219342 pfam07227, DUF1423, Protein of unknown function (DUF1423).  This
           family represents a conserved region approximately 500
           residues long within a number of Arabidopsis thaliana
           proteins of unknown function.
          Length = 446

 Score = 26.6 bits (59), Expect = 7.3
 Identities = 29/126 (23%), Positives = 48/126 (38%), Gaps = 34/126 (26%)

Query: 68  LPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEE-----------------KK 110
           L +A SA +  +    E EEK +E  E + E +++K + +E                 K 
Sbjct: 317 LKKARSALDACDR---ELEEKAREVSELKMERQRKKPQIDELESIVRLKQAEADMFQLKA 373

Query: 111 REKKEEEEEKQ------KDKKEEEVE--------EEEEEQKNKKKEEEEEEEKEKVEMYR 156
            E + E E  Q       +K EEE           E EE++    EE + +E+   +   
Sbjct: 374 DEARREAERLQRIALAKTEKSEEEYASKYLKLRLSEAEEERQYLFEELKLQEESHRDSDS 433

Query: 157 NPVPMY 162
             + M 
Sbjct: 434 MKMRME 439


>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
           [General function prediction only].
          Length = 239

 Score = 26.6 bits (59), Expect = 7.4
 Identities = 18/69 (26%), Positives = 36/69 (52%)

Query: 81  NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
            E E +E  +  +  EE+    KDE E +    + +  +++ +  E+E+ E  EE +  +
Sbjct: 64  LESEIQEIRERIKRAEEKLSAVKDERELRALNIEIQIAKERINSLEDELAELMEEIEKLE 123

Query: 141 KEEEEEEEK 149
           KE E+ +E+
Sbjct: 124 KEIEDLKER 132


>gnl|CDD|233935 TIGR02577, cas_TM1794_Cmr2, CRISPR-associated protein Cas10/Cmr2,
           subtype III-B.  This model represent a Crm2 family of
           the CRISPR-associated RAMP module, a set of six genes
           recurring found together in prokaryotic genomes. This
           gene cluster is found only in species with CRISPR
           repeats, usually near the repeats themselves. Because
           most of the six (but not this family) contain RAMP
           domains, and because its appearance in a genome appears
           to depend on other CRISPR-associated Cas genes, the set
           is designated the CRISPR RAMP module. This protein,
           typified by TM1794 from Thermotoga maritima, is
           designated Crm2, for CRISPR RAMP Module protein 2
           [Mobile and extrachromosomal element functions, Other].
          Length = 483

 Score = 26.6 bits (59), Expect = 7.4
 Identities = 18/107 (16%), Positives = 37/107 (34%), Gaps = 6/107 (5%)

Query: 56  KPIRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEE-----KEKDEDEEKK 110
              R            +     L   + +  EK +E    + ++E     K+  +D  + 
Sbjct: 71  LEFRVKGSEEYAELPNL-FLAILPAKDEKILEKLQETIRIKIKDELAELLKKAVKDLIEG 129

Query: 111 REKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYRN 157
                + EE  K K+E   +E E+    +       E++E  +  +N
Sbjct: 130 PAVITDLEEGLKQKEEALKKELEKYADKRLSAPSLLEKEEVYKDVKN 176


>gnl|CDD|219009 pfam06404, PSK, Phytosulfokine precursor protein (PSK).  This
           family consists of several plant specific phytosulfokine
           precursor proteins. Phytosulfokines, are active as
           either a pentapeptide or a C-terminally truncated
           tetrapeptide. These compounds were first isolated
           because of their ability to stimulate cell division in
           somatic embryo cultures of Asparagus officinalis.
          Length = 74

 Score = 25.0 bits (55), Expect = 7.7
 Identities = 9/34 (26%), Positives = 17/34 (50%)

Query: 74  ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDE 107
                E+++ E  E  K +E+E   E + + E+E
Sbjct: 20  TDTGHESSKIEAAESSKVDEDEVVSECEGEGEEE 53


>gnl|CDD|218055 pfam04386, SspB, Stringent starvation protein B.  Escherichia coli
           stringent starvation protein B (SspB), is thought to
           enhance the specificity of degradation of tmRNA-tagged
           proteins by the ClpXP protease. The tmRNA tag, also
           known as ssrA, is an 11-aa peptide added to the C
           terminus of proteins stalled during translation, targets
           proteins for degradation by ClpXP and ClpAP. SspB a
           cytoplasmic protein that specifically binds to residues
           1-4 and 7 of the tag. Binding of SspB enhances
           degradation of tagged proteins by ClpX, and masks
           sequence elements important for ClpA interactions,
           inhibiting degradation by ClpA. However, more recent
           work has cast doubt on the importance of SspB in
           wild-type cells. SspB is encoded in an operon whose
           synthesis is stimulated by carbon, amino acid, and
           phosphate starvation. SspB may play a special role
           during nutrient stress, for example by ensuring rapid
           degradation of the products of stalled translation,
           without causing a global increase in degradation of all
           ClpXP substrates.
          Length = 153

 Score = 26.0 bits (58), Expect = 7.7
 Identities = 13/36 (36%), Positives = 21/36 (58%)

Query: 76  ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKR 111
           E  E +E EEEE +  ++  E+E+ + KD  + K R
Sbjct: 111 EPEEADEEEEEEDDDADDSPEDEDPEPKDPPKTKGR 146



 Score = 25.6 bits (57), Expect = 9.7
 Identities = 12/40 (30%), Positives = 23/40 (57%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDK 124
           E E+  EEEEEE+++  +  EDE+ + +   + + +   K
Sbjct: 111 EPEEADEEEEEEDDDADDSPEDEDPEPKDPPKTKGRPSLK 150


>gnl|CDD|220821 pfam10602, RPN7, 26S proteasome subunit RPN7.  RPN7 (known as the
           non ATPase regulatory subunit 6 in higher eukaryotes) is
           one of the lid subunits of the 26S proteasome and has
           been shown in Saccharomyces cerevisiae to be required
           for structural integrity. The 26S proteasome is is
           involved in the ATP-dependent degradation of
           ubiquitinated proteins.
          Length = 174

 Score = 26.0 bits (58), Expect = 7.7
 Identities = 12/34 (35%), Positives = 19/34 (55%)

Query: 99  EEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
           +E   +E  +K  E+ EE + + KD KE   +EE
Sbjct: 1   DEAWVEELRKKNEEELEELDAELKDAKENLGKEE 34


>gnl|CDD|215278 PLN02502, PLN02502, lysyl-tRNA synthetase.
          Length = 553

 Score = 26.5 bits (59), Expect = 7.8
 Identities = 11/53 (20%), Positives = 24/53 (45%), Gaps = 2/53 (3%)

Query: 107 EEKKREK--KEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYRN 157
             KKR K  + EEE+  K++ +        + +++K    ++E  +  +   N
Sbjct: 12  ALKKRLKAKQAEEEKAAKEEAKAAAAAAAAKGRSRKSAAADDETMDPTQYRAN 64


>gnl|CDD|234032 TIGR02855, spore_yabG, sporulation peptidase YabG.  Members of this
           family are the protein YabG, demonstrated for Bacillus
           subtilis to be an endopeptidase able to release
           N-terminal peptides from a number of sporulation
           proteins, including CotT, CotF, and SpoIVA. It appears
           to be expressed under control of sigma-K [Cellular
           processes, Sporulation and germination].
          Length = 283

 Score = 26.5 bits (59), Expect = 7.8
 Identities = 11/51 (21%), Positives = 23/51 (45%)

Query: 104 DEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEM 154
           DE E KKR+K+ +E  +   +   +  +  +E+K  +     +  +    M
Sbjct: 53  DESERKKRKKRFKERMETSYRLFRQDYKLMKEKKEYRATGGYKNNQLYFGM 103


>gnl|CDD|226123 COG3595, COG3595, Uncharacterized conserved protein [Function
           unknown].
          Length = 318

 Score = 26.3 bits (58), Expect = 7.9
 Identities = 14/76 (18%), Positives = 24/76 (31%), Gaps = 5/76 (6%)

Query: 94  EEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE-----EEEQKNKKKEEEEEEE 148
             E  E  EKDE + K+  + +EE        EE    +      E  +     E   ++
Sbjct: 19  TFEAGELIEKDEKDVKEISEDKEEINVDVSGSEETTVPDVEKNRPELLELSANVESSVQK 78

Query: 149 KEKVEMYRNPVPMYRN 164
           +  V +          
Sbjct: 79  ENDVSLDGFEGDTLIV 94


>gnl|CDD|147685 pfam05663, DUF809, Protein of unknown function (DUF809).  This
           family consists of several proteins of unknown function
           Raphanus sativus (Radish) and Brassica napus (Rape).
          Length = 138

 Score = 25.9 bits (56), Expect = 7.9
 Identities = 19/41 (46%), Positives = 26/41 (63%)

Query: 82  EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK 122
           E +EE+KE + E E +EE+KE   + E K EKKE E   +K
Sbjct: 98  EGKEEKKEGKGEIEGKEEKKEGKGEIEGKEEKKEVENGPRK 138


>gnl|CDD|220284 pfam09538, FYDLN_acid, Protein of unknown function (FYDLN_acid).
           Members of this family are bacterial proteins with a
           conserved motif [KR]FYDLN, sometimes flanked by a pair
           of CXXC motifs, followed by a long region of low
           complexity sequence in which roughly half the residues
           are Asp and Glu, including multiple runs of five or more
           acidic residues. The function of members of this family
           is unknown.
          Length = 104

 Score = 25.7 bits (57), Expect = 7.9
 Identities = 11/55 (20%), Positives = 32/55 (58%), Gaps = 3/55 (5%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEE---EVEEEEEE 135
            E+  +K+E+EE+E++    D+D++   +   + ++   D  ++    +E++++E
Sbjct: 50  AEDAAKKDEDEEDEDDVVLDDDDDDDDDDDLPDLDDDDVDLDDDDDDFLEDDDDE 104


>gnl|CDD|218391 pfam05029, TIMELESS_C, Timeless protein C terminal region.  The
           timeless (tim) gene is essential for circadian function
           in Drosophila. Putative homologues of Drosophila tim
           have been identified in both mice and humans (mTim and
           hTIM, respectively). Mammalian TIM is not the true
           orthologue of Drosophila TIM, but is the likely
           orthologue of a fly gene, timeout (also called tim-2).
           mTim has been shown to be essential for embryonic
           development, but does not have substantiated circadian
           function. Some family members contain a SANT domain in
           this region.
          Length = 507

 Score = 26.6 bits (58), Expect = 8.0
 Identities = 19/87 (21%), Positives = 43/87 (49%)

Query: 59  RANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEE 118
           R  A+L+    +  +AS + E  E E+E K+  + E+ +E E E     +  R++K    
Sbjct: 361 RIPAKLSSTQLRRRAASLSGEEEEPEDELKDDVDGEQADESEHETLALRKNARQRKAGLA 420

Query: 119 EKQKDKKEEEVEEEEEEQKNKKKEEEE 145
             +++   EE ++   ++K   ++ ++
Sbjct: 421 SPEEEALGEEEQKAPPKKKQLNQKNKQ 447


>gnl|CDD|235246 PRK04184, PRK04184, DNA topoisomerase VI subunit B; Validated.
          Length = 535

 Score = 26.4 bits (59), Expect = 8.1
 Identities = 18/75 (24%), Positives = 34/75 (45%), Gaps = 13/75 (17%)

Query: 87  EKEKEEEEEEEEEEKEK-------------DEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
            K KEEE +++ +  EK             +++E    E   +   K+ +++E E EEEE
Sbjct: 461 RKRKEEEAKKKAKTFEKYIPEIARKLAEILEKEEPDIEEVLAKLIGKKLEEEEVEEEEEE 520

Query: 134 EEQKNKKKEEEEEEE 148
           E    +  +   E++
Sbjct: 521 EAVVVESAKNYTEKD 535


>gnl|CDD|227594 COG5269, ZUO1, Ribosome-associated chaperone zuotin [Translation,
           ribosomal structure and biogenesis / Posttranslational
           modification, protein turnover, chaperones].
          Length = 379

 Score = 26.5 bits (58), Expect = 8.2
 Identities = 15/65 (23%), Positives = 32/65 (49%)

Query: 89  EKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEE 148
           +++E+E ++  + E++     K     + + + K+K E E E        KKK +E  ++
Sbjct: 236 KEQEKEMKKIRKWEREAGARLKALAALKGKAEAKNKAEIEAEALASATAVKKKAKEVMKK 295

Query: 149 KEKVE 153
             K+E
Sbjct: 296 ALKME 300


>gnl|CDD|224712 COG1799, COG1799, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 167

 Score = 26.2 bits (58), Expect = 8.2
 Identities = 11/45 (24%), Positives = 18/45 (40%)

Query: 91  EEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
            ++EE+  EE  + E  +   E +E    K   +K E    E   
Sbjct: 16  LDDEEDYYEEDPRQEPRDPAVEPEEYAPPKYNFRKREPARMEMRG 60


>gnl|CDD|131942 TIGR02896, spore_III_AF, stage III sporulation protein AF.  This
           family represents the stage III sporulation protein AF
           of the bacterial endospore formation program, which
           exists in some but not all members of the Firmicutes
           (formerly called low-GC Gram-positives). The C-terminal
           region of this protein is poorly conserved, so only the
           N-terminal region, which includes two predicted
           transmembrane domains, is included in the seed alignment
           [Cellular processes, Sporulation and germination].
          Length = 106

 Score = 25.3 bits (56), Expect = 8.3
 Identities = 10/47 (21%), Positives = 22/47 (46%)

Query: 67  ILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREK 113
           I      ASE    N+ EE++K+ + +      E+   + E++  ++
Sbjct: 59  IDIYVFKASELSVKNQIEEQKKKIQAKTRAYILEEMAVQLEKQVEKE 105


>gnl|CDD|237259 PRK12904, PRK12904, preprotein translocase subunit SecA; Reviewed.
          Length = 830

 Score = 26.6 bits (60), Expect = 8.3
 Identities = 21/66 (31%), Positives = 33/66 (50%), Gaps = 3/66 (4%)

Query: 70  QAISASETLENNEWEEEEKEKEEE---EEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE 126
             I      E+ + E  E+  + +   E   EE  E+  DEE+ RE+  E  E+  ++KE
Sbjct: 672 AYIPPGSYEEDWDLEGLEEALKTDFGLELPIEEWLEEGLDEEELRERILEAAEEAYEEKE 731

Query: 127 EEVEEE 132
           EE+ EE
Sbjct: 732 EELGEE 737


>gnl|CDD|222440 pfam13897, GOLD_2, Golgi-dynamics membrane-trafficking.  Sec14-like
           Golgi-trafficking domain The GOLD domain is always found
           combined with lipid- or membrane-association domains.
          Length = 136

 Score = 25.8 bits (57), Expect = 8.3
 Identities = 12/42 (28%), Positives = 22/42 (52%)

Query: 70  QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKR 111
              S + ++  +E  +EE+E+E EEEE E    +   + + R
Sbjct: 41  PPTSDAVSVHVSESSDEEEEEEAEEEEAETGDVEAGSKSQSR 82


>gnl|CDD|215094 PLN00188, PLN00188, enhanced disease resistance protein (EDR2);
           Provisional.
          Length = 719

 Score = 26.7 bits (59), Expect = 8.6
 Identities = 19/86 (22%), Positives = 40/86 (46%), Gaps = 7/86 (8%)

Query: 63  RLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK 122
           R+ V++  A ++  + +N + +E     ++           DED +       ++EE Q 
Sbjct: 401 RIPVMVNMASASVSSKKNQKPQESSPSLDQTNAASRNSVMMDEDSD-------DDEEFQI 453

Query: 123 DKKEEEVEEEEEEQKNKKKEEEEEEE 148
            + E+E E  + E K+   EEE +++
Sbjct: 454 PESEQEPETTKNETKDTAMEEEPQDK 479


>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
           fragment found in eukaryotic DNA topoisomerase (topo) IB
           proteins similar to the monomeric yeast and human topo I
           and heterodimeric topo I from Leishmania donvanni. Topo
           I enzymes are divided into:  topo type IA (bacterial)
           and type IB (eukaryotic). Topo I relaxes superhelical
           tension in duplex DNA by creating a single-strand nick,
           the broken strand can then rotate around the unbroken
           strand to remove DNA supercoils and, the nick is
           religated, liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit re-ligation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  In addition to differences in
           structure and some biochemical properties,
           Trypanosomatid parasite topo I differ from human topo I
           in their sensitivity to CPTs and other classical topo I
           inhibitors. Trypanosomatid topos I play putative roles
           in organizing the kinetoplast DNA network unique to
           these parasites.  This family may represent more than
           one structural domain.
          Length = 215

 Score = 26.1 bits (58), Expect = 8.6
 Identities = 12/28 (42%), Positives = 20/28 (71%)

Query: 95  EEEEEEKEKDEDEEKKREKKEEEEEKQK 122
           EEE+E+K+    EEKK  K+E+E+ ++ 
Sbjct: 95  EEEKEKKKAMSKEEKKAIKEEKEKLEEP 122


>gnl|CDD|223562 COG0488, Uup, ATPase components of ABC transporters with duplicated
           ATPase domains [General function prediction only].
          Length = 530

 Score = 26.4 bits (59), Expect = 8.7
 Identities = 5/27 (18%), Positives = 18/27 (66%)

Query: 108 EKKREKKEEEEEKQKDKKEEEVEEEEE 134
            +++ ++  +E    +K+++E+ +E+E
Sbjct: 239 LEQKAERLRQEAAAYEKQQKELAKEQE 265


>gnl|CDD|217956 pfam04194, PDCD2_C, Programmed cell death protein 2, C-terminal
           putative domain. 
          Length = 165

 Score = 25.8 bits (57), Expect = 8.8
 Identities = 21/66 (31%), Positives = 34/66 (51%)

Query: 82  EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
           E EE E        EE EE+E D  EE ++E    +E+++ D+ E+   +++  QK KK+
Sbjct: 9   EEEEYETLHAPSLPEEPEEREIDAFEELEKEGGGGKEDEETDESEKAKSDDKTFQKFKKR 68

Query: 142 EEEEEE 147
                E
Sbjct: 69  VARNPE 74


>gnl|CDD|223136 COG0058, GlgP, Glucan phosphorylase [Carbohydrate transport and
           metabolism].
          Length = 750

 Score = 26.5 bits (59), Expect = 8.8
 Identities = 12/51 (23%), Positives = 23/51 (45%)

Query: 82  EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
           EW  +    +E     +++  ++   E KRE K+   E+  D+   EV+  
Sbjct: 436 EWLNDLDILDELLWFADDKAFRELIAEIKRENKKRLAEEIADRTGIEVDPN 486


>gnl|CDD|221049 pfam11262, Tho2, Transcription factor/nuclear export subunit
           protein 2.  THO and TREX form a eukaryotic complex which
           functions in messenger ribonucleoprotein metabolism and
           plays a role in preventing the transcription-associated
           genetic instability. Tho2, along with four other
           subunits forms THO.
          Length = 296

 Score = 26.1 bits (58), Expect = 8.9
 Identities = 17/59 (28%), Positives = 31/59 (52%)

Query: 87  EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
           +KE E  E++ +E        +KK+++K+  +   K  +EE  +  E  +K KK+  EE
Sbjct: 30  DKEIERLEKQIKELDSSSSGIDKKKKEKKRLKSLIKKLEEELKKHIEHNEKTKKRLSEE 88


>gnl|CDD|220634 pfam10220, DUF2146, Uncharacterized conserved protein (DUF2146).
           This is a family of proteins conserved from plants to
           humans. In Dictyostelium it is annotated as Mss11p but
           this could not be confirmed. Mss11p is required for the
           activation of pseudo-hyphal and invasive growth by
           Ste12p in yeast.
          Length = 890

 Score = 26.4 bits (58), Expect = 9.1
 Identities = 21/124 (16%), Positives = 40/124 (32%), Gaps = 8/124 (6%)

Query: 30  YNLRIRNVSLTDDAEFQCQVGPAQYHKPIRANARLTVILPQAISASETLENNEWEEEEKE 89
           + ++  N        F+C V   +Y  PI              +A     +    ++ ++
Sbjct: 510 FTIKQANYDFYQHIAFKCCVKLWRYQFPI-----FQPSSSDIRAADFENNSLSAAKKMEQ 564

Query: 90  KEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKD---KKEEEVEEEEEEQKNKKKEEEEE 146
            E+E  +EE ++E+ E  E + +          D         E   EE  N  +     
Sbjct: 565 AEDELADEETDQEQPESLEPQLQGSSTSPSDASDLNFSTASSSEASSEESDNYARPTSRS 624

Query: 147 EEKE 150
              E
Sbjct: 625 GTDE 628


>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen.  This
           family consists of several Theileria P67 surface
           antigens. A stage specific surface antigen of Theileria
           parva, p67, is the basis for the development of an
           anti-sporozoite vaccine for the control of East Coast
           fever (ECF) in cattle. The antigen has been shown to
           contain five distinct linear peptide sequences
           recognised by sporozoite-neutralising murine monoclonal
           antibodies.
          Length = 727

 Score = 26.6 bits (58), Expect = 9.1
 Identities = 11/44 (25%), Positives = 22/44 (50%)

Query: 85  EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
            +E E ++  E+ ++ K    D E+  +  EEE+ K    K+ +
Sbjct: 94  SQESEVQDNTEQNQDTKGSKTDSEEDDDDSEEEDNKSTSSKDGK 137


>gnl|CDD|117592 pfam09026, Cenp-B_dimeris, Centromere protein B dimerisation
           domain.  The centromere protein B (CENP-B) dimerisation
           domain is composed of two alpha-helices, which are
           folded into an antiparallel configuration. Dimerisation
           of CENP-B is mediated by this domain, in which monomers
           dimerise to form a symmetrical, antiparallel, four-helix
           bundle structure with a large hydrophobic patch in which
           23 residues of one monomer form van der Waals contacts
           with the other monomer. This CENP-B dimer configuration
           may be suitable for capturing two distant CENP-B boxes
           during centromeric heterochromatin formation.
          Length = 101

 Score = 25.5 bits (55), Expect = 9.2
 Identities = 14/39 (35%), Positives = 24/39 (61%)

Query: 77  TLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKE 115
           TL   E EE+     +EEE++++E E+D+DE+   +  E
Sbjct: 2   TLHFLEGEEDSDSDSDEEEDDDDEDEEDDDEDDDEDDDE 40


>gnl|CDD|187810 cd09679, Cas10_III, CRISPR/Cas system-associated protein Cas10.
           CRISPR (Clustered Regularly Interspaced Short
           Palindromic Repeats) and associated Cas proteins
           comprise a system for heritable host defense by
           prokaryotic cells against phage and other foreign DNA;
           Multidomain protein with permuted HD nuclease domain,
           palm domain and Zn-ribbon; MTH326-like has inactivated
           polymerase catalytic domain; alr1562 and slr7011 -
           predicted only on the basis of size, presence of HD
           domain, and location with RAMPs in one operon; signature
           gene for type III; also known as Crm2 family.
          Length = 475

 Score = 26.3 bits (58), Expect = 9.4
 Identities = 21/78 (26%), Positives = 39/78 (50%), Gaps = 3/78 (3%)

Query: 84  EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
           EEE +E +EE EEE  E   ++ +E    +  E    + D  E   +  E+  +++++ E
Sbjct: 90  EEEAEELKEEIEEETWESILNKVKEYLDIENVEPFLLKSDALERISDRIEDGNEDREEAE 149

Query: 144 EEEEEKEKVEMYRNPVPM 161
           E E  ++ V   R+  P+
Sbjct: 150 ELEAARKNV---RDFAPV 164


>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like).  This family
           includes outer membrane proteins such as OmpH among
           others. Skp (OmpH) has been characterized as a molecular
           chaperone that interacts with unfolded proteins as they
           emerge in the periplasm from the Sec translocation
           machinery.
          Length = 140

 Score = 25.6 bits (57), Expect = 9.5
 Identities = 18/69 (26%), Positives = 44/69 (63%)

Query: 86  EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
           E++ K+ + E E+ EKE  + +EK ++      E  ++KKE+E++++ +E + K+++ ++
Sbjct: 24  EKEFKKRQAELEKLEKELQKLKEKLQKDAATLSEAAREKKEKELQKKVQEFQRKQQKLQQ 83

Query: 146 EEEKEKVEM 154
           + +K + E 
Sbjct: 84  DLQKRQQEE 92


>gnl|CDD|233496 TIGR01622, SF-CC1, splicing factor, CC1-like family.  This model
           represents a subfamily of RNA splicing factors including
           the Pad-1 protein (N. crassa), CAPER (M. musculus) and
           CC1.3 (H.sapiens). These proteins are characterized by
           an N-terminal arginine-rich, low complexity domain
           followed by three (or in the case of 4 H. sapiens
           paralogs, two) RNA recognition domains (rrm: pfam00706).
           These splicing factors are closely related to the U2AF
           splicing factor family (TIGR01642). A homologous gene
           from Plasmodium falciparum was identified in the course
           of the analysis of that genome at TIGR and was included
           in the seed.
          Length = 457

 Score = 26.4 bits (58), Expect = 9.5
 Identities = 8/68 (11%), Positives = 22/68 (32%), Gaps = 3/68 (4%)

Query: 87  EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQ---KDKKEEEVEEEEEEQKNKKKEE 143
           E+ +      +   + +D D  + R  +          + + +     ++       KE 
Sbjct: 22  ERSRRRSRSRDRSRRRRDRDYYRGRRGRSRSRSPNRYYRPRGDRSYRRDDRRSGRNTKEP 81

Query: 144 EEEEEKEK 151
             E E++ 
Sbjct: 82  LTEAERDD 89


>gnl|CDD|235505 PRK05563, PRK05563, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 559

 Score = 26.4 bits (59), Expect = 9.6
 Identities = 14/36 (38%), Positives = 24/36 (66%)

Query: 107 EEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKE 142
           E++ ++ +EE  +K K+++EEE EEEE     + KE
Sbjct: 513 EDEWQKIREEFLQKHKNEEEEEEEEEELPLIPEAKE 548


>gnl|CDD|143202 cd05725, Ig3_Robo, Third immunoglobulin (Ig)-like domain in Robo
          (roundabout) receptors.  Ig3_Robo: domain similar to
          the third immunoglobulin (Ig)-like domain in Robo
          (roundabout) receptors. Robo receptors play a role in
          the development of the central nervous system (CNS),
          and are receptors of Slit protein. Slit is a repellant
          secreted by the neural cells in the midline. Slit acts
          through Robo to prevent most neurons from crossing the
          midline from either side. Three mammalian Robo homologs
          (robo1, -2, and -3), and three mammalian Slit homologs
          (Slit-1,-2, -3), have been identified. Commissural
          axons, which cross the midline, express low levels of
          Robo; longitudinal axons, which avoid the midline,
          express high levels of Robo. robo1, -2, and -3 are
          expressed by commissural neurons in the vertebrate
          spinal cord and Slits 1, -2, -3 are expressed at the
          ventral midline. Robo-3 is a divergent member of the
          Robo family which instead of being a positive regulator
          of slit responsiveness, antagonizes slit responsiveness
          in precrossing axons.  The Slit-Robo interaction is
          mediated by the second leucine-rich repeat (LRR) domain
          of Slit and the two N-terminal Ig domains of Robo, Ig1
          and Ig2. The primary Robo binding site for Slit2 has
          been shown by surface plasmon resonance experiments and
          mutational analysis to be is the Ig1 domain, while the
          Ig2 domain has been proposed to harbor a weak secondary
          binding site.
          Length = 69

 Score = 24.7 bits (54), Expect = 9.7
 Identities = 15/39 (38%), Positives = 21/39 (53%), Gaps = 10/39 (25%)

Query: 32 LRIRNVSLTDDAEFQCQ----VGPAQYHKPIRANARLTV 66
          L+IRNV+  D+  + C+    VG       I A+A LTV
Sbjct: 37 LKIRNVTAGDEGSYTCEAENMVG------KIEASASLTV 69


>gnl|CDD|224510 COG1594, RPB9, DNA-directed RNA polymerase, subunit M/Transcription
           elongation factor TFIIS [Transcription].
          Length = 113

 Score = 25.5 bits (56), Expect = 9.8
 Identities = 13/52 (25%), Positives = 23/52 (44%)

Query: 89  EKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
           E+E   ++      K+  E+KK      E+E Q  K     +E+  +  NK+
Sbjct: 31  EEEASNKKVYRYSVKEAVEKKKEVVLVVEDETQGAKTLPTAKEKCPKCGNKE 82


>gnl|CDD|224403 COG1486, CelF, Alpha-galactosidases/6-phospho-beta-glucosidases,
           family 4 of glycosyl hydrolases [Carbohydrate transport
           and metabolism].
          Length = 442

 Score = 26.1 bits (58), Expect = 9.9
 Identities = 16/64 (25%), Positives = 31/64 (48%), Gaps = 5/64 (7%)

Query: 76  ETLENNE--WEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
             L ++   +     + +E++ +E E+     +E  KREK+  E  K   K E + + EE
Sbjct: 251 PVLPSSYHRYYYYPYDMKEKKPDEIEKFGTRANEVMKREKELFELYK---KPELKEKPEE 307

Query: 134 EEQK 137
            E++
Sbjct: 308 LEKR 311


>gnl|CDD|224553 COG1638, DctP, TRAP-type C4-dicarboxylate transport system,
           periplasmic component [Carbohydrate transport and
           metabolism].
          Length = 332

 Score = 26.1 bits (58), Expect = 10.0
 Identities = 19/76 (25%), Positives = 31/76 (40%), Gaps = 3/76 (3%)

Query: 67  ILPQAISASETLENNEWEEEEKE-KEEEEEEEEEEKEKDEDEEKK--REKKEEEEEKQKD 123
           IL +A   +   +    EE E E  E+ +E   E  E D  E  +   +   +E  K+  
Sbjct: 257 ILLEAAKEAAEEQRKLVEELEDELLEKLKEAGVEVVEPDAAEAFREAAKPVYDEFAKKAG 316

Query: 124 KKEEEVEEEEEEQKNK 139
           +   E+ E  E  + K
Sbjct: 317 EIGAELIEAIEAARKK 332


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.299    0.123    0.324 

Gapped
Lambda     K      H
   0.267   0.0812    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 8,921,396
Number of extensions: 903211
Number of successful extensions: 34604
Number of sequences better than 10.0: 1
Number of HSP's gapped: 16476
Number of HSP's successfully gapped: 5512
Length of query: 167
Length of database: 10,937,602
Length adjustment: 90
Effective length of query: 77
Effective length of database: 6,945,742
Effective search space: 534822134
Effective search space used: 534822134
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 17 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 44 (22.0 bits)
S2: 55 (24.8 bits)