RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy9773
         (252 letters)



>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 51.8 bits (125), Expect = 9e-08
 Identities = 27/99 (27%), Positives = 53/99 (53%)

Query: 143 VMYPYKCILHSAPSYSPEIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEE 202
           V+     I    P  +  +  F E    E E     +   ++ ++  E  +KK +EE++E
Sbjct: 370 VLPFLSIIFKHNPELAARLAAFLELTEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKE 429

Query: 203 EEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEE 241
           +++K    + ++EE E E+E++EE  ++++ E EEE+EE
Sbjct: 430 KKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEE 468



 Score = 46.1 bits (110), Expect = 7e-06
 Identities = 23/62 (37%), Positives = 44/62 (70%)

Query: 165 WENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEE 224
            + I  + EKK E+E +E++ +     K+++ +EEE+E++E+++E+E E+ E E EEEEE
Sbjct: 412 IKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEE 471

Query: 225 EE 226
           ++
Sbjct: 472 KK 473



 Score = 40.7 bits (96), Expect = 5e-04
 Identities = 25/92 (27%), Positives = 46/92 (50%), Gaps = 1/92 (1%)

Query: 160 EIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGEN 219
           E+  F   I  +   +    +       EEE++     ++  ++ +K  E   +K E E 
Sbjct: 369 EVLPFLSIIF-KHNPELAARLAAFLELTEEEIEFLTGSKKATKKIKKIVEKAEKKREEEK 427

Query: 220 EEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEE 251
           +E++++    +KK EEEEEE+E   + +EEEE
Sbjct: 428 KEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEE 459



 Score = 30.3 bits (69), Expect = 0.93
 Identities = 17/44 (38%), Positives = 32/44 (72%)

Query: 163 KFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEK 206
            F      EEE++E+++ EEE+ EEEEE +++K +EEE+++++ 
Sbjct: 434 AFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKKQA 477



 Score = 29.9 bits (68), Expect = 1.2
 Identities = 15/57 (26%), Positives = 29/57 (50%)

Query: 196 NKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEEN 252
            +EE E     K+  +  K+  E  E++ EE  K+KK +    +++   + EE+E+ 
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKK 451


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 51.1 bits (122), Expect = 2e-07
 Identities = 29/81 (35%), Positives = 45/81 (55%), Gaps = 6/81 (7%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQK 231
           E + E    + E  +E  E +     + E +++EK  +     + G++EEEEEEE  +++
Sbjct: 816 EGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEE 875

Query: 232 KNEEEEEEEEVLIKLEEEEEN 252
           + EEEEEEE      EEEEEN
Sbjct: 876 EEEEEEEEE------EEEEEN 890



 Score = 49.2 bits (117), Expect = 8e-07
 Identities = 32/94 (34%), Positives = 46/94 (48%), Gaps = 18/94 (19%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEE-------EEKKEEDEIEKEEG------ 217
           E E +  ++ E E   E +    +   E  E+E       E K++E  ++   G      
Sbjct: 804 EGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDS 863

Query: 218 ENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEE 251
           E EEEEEEE  ++++ EEEEEEEE     EE EE
Sbjct: 864 EEEEEEEEEEEEEEEEEEEEEEEE-----EENEE 892



 Score = 49.2 bits (117), Expect = 9e-07
 Identities = 28/81 (34%), Positives = 46/81 (56%), Gaps = 3/81 (3%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEE---EEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEE 222
           E+    E + ++ E+++E  E+E   E   + K  E+  +     +  + E+EE E EEE
Sbjct: 814 EHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEE 873

Query: 223 EEEEVVKQKKNEEEEEEEEVL 243
           EEEE  ++++ EEEEE EE L
Sbjct: 874 EEEEEEEEEEEEEEEENEEPL 894



 Score = 49.2 bits (117), Expect = 1e-06
 Identities = 30/100 (30%), Positives = 45/100 (45%), Gaps = 19/100 (19%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGEN------ 219
                E E + E   ++E   + E        ++E  E+E   E++ E ++ E       
Sbjct: 797 AEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGG 856

Query: 220 --------EEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEE 251
                   EEEEEEE  ++++ EEEEEEEE     EEE E
Sbjct: 857 GSDGGDSEEEEEEEEEEEEEEEEEEEEEEE-----EEENE 891



 Score = 46.5 bits (110), Expect = 7e-06
 Identities = 28/83 (33%), Positives = 40/83 (48%)

Query: 170 GEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVK 229
           GE   + + E E E   E +E   K   E EE E E + E E  ++EGE E  EE E V+
Sbjct: 683 GEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVE 742

Query: 230 QKKNEEEEEEEEVLIKLEEEEEN 252
            +   E E + EV  + + +E  
Sbjct: 743 DEGEGEAEGKHEVETEGDRKETE 765



 Score = 46.5 bits (110), Expect = 8e-06
 Identities = 28/81 (34%), Positives = 40/81 (49%), Gaps = 7/81 (8%)

Query: 158 SPEIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEG 217
           S       E      E++   E + E  ++E+ V      +  + EEE++EE+E E+EE 
Sbjct: 819 SETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEE 878

Query: 218 ENEEEEEEEVVKQKKNEEEEE 238
           E EEEEEEE       EE EE
Sbjct: 879 EEEEEEEEE-------EENEE 892



 Score = 42.7 bits (100), Expect = 1e-04
 Identities = 27/83 (32%), Positives = 42/83 (50%), Gaps = 1/83 (1%)

Query: 170 GEEEKKEEDEIEEEENEEE-EEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
            + E++ E EIE +E + + E   ++   E E E E  ++E EIE  E   E E+E E  
Sbjct: 689 RKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGE 748

Query: 229 KQKKNEEEEEEEEVLIKLEEEEE 251
            + K+E E E +    + E E E
Sbjct: 749 AEGKHEVETEGDRKETEHEGETE 771



 Score = 42.7 bits (100), Expect = 1e-04
 Identities = 29/83 (34%), Positives = 42/83 (50%), Gaps = 1/83 (1%)

Query: 170 GEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEE-EEEEVV 228
           GE E  EE E  E+E E E E K +   E + +E E + E E E +E E+E E +  E  
Sbjct: 730 GEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDG 789

Query: 229 KQKKNEEEEEEEEVLIKLEEEEE 251
           + K +E  E + E   + E  E+
Sbjct: 790 EMKGDEGAEGKVEHEGETEAGEK 812



 Score = 41.9 bits (98), Expect = 2e-04
 Identities = 25/47 (53%), Positives = 31/47 (65%), Gaps = 7/47 (14%)

Query: 170 GEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEE 216
           G+ E++EE+E EEEE EEEEE       EEEEEEEE +E   +E  E
Sbjct: 861 GDSEEEEEEEEEEEEEEEEEE-------EEEEEEEENEEPLSLEWPE 900



 Score = 41.1 bits (96), Expect = 3e-04
 Identities = 26/81 (32%), Positives = 40/81 (49%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           E E + E E E +   E E  +++   E E E E K++EDE E + GE+ E + +E  + 
Sbjct: 740 EVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAEG 799

Query: 231 KKNEEEEEEEEVLIKLEEEEE 251
           K   E E E     + E + E
Sbjct: 800 KVEHEGETEAGEKDEHEGQSE 820



 Score = 40.7 bits (95), Expect = 6e-04
 Identities = 23/54 (42%), Positives = 34/54 (62%), Gaps = 8/54 (14%)

Query: 170 GEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEE 223
           G + ++EE+E EEEE EEEEE        EEEEEEE++ E+ +  E  E  +++
Sbjct: 860 GGDSEEEEEEEEEEEEEEEEE--------EEEEEEEEENEEPLSLEWPETRQKQ 905



 Score = 40.4 bits (94), Expect = 7e-04
 Identities = 27/80 (33%), Positives = 42/80 (52%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQK 231
           E K+ + + E E  E E E + +    E+E E E  EE E  ++EGE E E + EV  + 
Sbjct: 700 EAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEG 759

Query: 232 KNEEEEEEEEVLIKLEEEEE 251
             +E E E E   + +E+E+
Sbjct: 760 DRKETEHEGETEAEGKEDED 779



 Score = 39.2 bits (91), Expect = 0.002
 Identities = 27/68 (39%), Positives = 37/68 (54%), Gaps = 11/68 (16%)

Query: 169 AGEEEKKEEDEIE----EEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEE 224
           A ++EK  +        + E EEEEE       EEEEEEEE++EE+E E+EE E     E
Sbjct: 845 AKQDEKGVDGGGGSDGGDSEEEEEEE-------EEEEEEEEEEEEEEEEEEENEEPLSLE 897

Query: 225 EEVVKQKK 232
               +QK+
Sbjct: 898 WPETRQKQ 905



 Score = 38.4 bits (89), Expect = 0.003
 Identities = 28/85 (32%), Positives = 44/85 (51%), Gaps = 3/85 (3%)

Query: 170 GEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDE--IEKEEGENE-EEEEEE 226
            E E  EE   E E+  E E   + +++ E   E + ++E E  IE +E +++ E E EE
Sbjct: 655 AEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEE 714

Query: 227 VVKQKKNEEEEEEEEVLIKLEEEEE 251
           V  + + E E  E+E  I+  EE E
Sbjct: 715 VEHEGETEAEGTEDEGEIETGEEGE 739



 Score = 38.4 bits (89), Expect = 0.003
 Identities = 29/90 (32%), Positives = 45/90 (50%), Gaps = 4/90 (4%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKE----EGENEE 221
           E  A E E + E E E  E+E E E  ++  + E+E E E + + E+E E    E E+E 
Sbjct: 709 ETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEG 768

Query: 222 EEEEEVVKQKKNEEEEEEEEVLIKLEEEEE 251
           E E E  + +   E +  E+  +K +E  E
Sbjct: 769 ETEAEGKEDEDEGEIQAGEDGEMKGDEGAE 798



 Score = 37.3 bits (86), Expect = 0.007
 Identities = 25/81 (30%), Positives = 34/81 (41%), Gaps = 4/81 (4%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           E E   E   EE E   E E +  +    E E+E + E     + EGE   E + E    
Sbjct: 637 EAEHTGERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGE---- 692

Query: 231 KKNEEEEEEEEVLIKLEEEEE 251
           ++ E E E +E   K E E E
Sbjct: 693 QEGEGEIEAKEADHKGETEAE 713



 Score = 37.3 bits (86), Expect = 0.007
 Identities = 23/83 (27%), Positives = 40/83 (48%)

Query: 168 IAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEV 227
            +G E ++E +   + ENE E E+  ++  E+E E E + +E + + E    E E E E 
Sbjct: 662 ESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGET 721

Query: 228 VKQKKNEEEEEEEEVLIKLEEEE 250
             +   +E E E     +  E+E
Sbjct: 722 EAEGTEDEGEIETGEEGEEVEDE 744



 Score = 37.3 bits (86), Expect = 0.007
 Identities = 27/85 (31%), Positives = 42/85 (49%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           E     +E   + E E EE E E E + +  ++E E E  ++ E+  ++ EGE E + E 
Sbjct: 696 EGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEV 755

Query: 226 EVVKQKKNEEEEEEEEVLIKLEEEE 250
           E    +K  E E E E   K +E+E
Sbjct: 756 ETEGDRKETEHEGETEAEGKEDEDE 780



 Score = 36.9 bits (85), Expect = 0.009
 Identities = 28/87 (32%), Positives = 41/87 (47%), Gaps = 2/87 (2%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKE-EGENEEEEE 224
           E+      ++ E   E E    EE   + + + E E + E + E EI  E +GE E E E
Sbjct: 639 EHTGERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGE 698

Query: 225 EEVVKQKKNEEEEEEEEVLIKLEEEEE 251
            E  K+  ++ E E EEV  + E E E
Sbjct: 699 IE-AKEADHKGETEAEEVEHEGETEAE 724



 Score = 36.5 bits (84), Expect = 0.012
 Identities = 18/72 (25%), Positives = 37/72 (51%)

Query: 167 NIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEE 226
              GE E + +++ +E E +  E+ + K ++  E + E + E +  EK+E E + E + +
Sbjct: 765 EHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQAD 824

Query: 227 VVKQKKNEEEEE 238
             + K    E+E
Sbjct: 825 DTEVKDETGEQE 836



 Score = 36.1 bits (83), Expect = 0.017
 Identities = 24/84 (28%), Positives = 41/84 (48%), Gaps = 2/84 (2%)

Query: 170 GEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEE--KKEEDEIEKEEGENEEEEEEEV 227
           GE E K E E E +  E E E + +   +E+E+E E    E+ E++ +EG   + E E  
Sbjct: 747 GEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGE 806

Query: 228 VKQKKNEEEEEEEEVLIKLEEEEE 251
            +  + +E E + E      E ++
Sbjct: 807 TEAGEKDEHEGQSETQADDTEVKD 830



 Score = 35.0 bits (80), Expect = 0.038
 Identities = 25/81 (30%), Positives = 44/81 (54%), Gaps = 1/81 (1%)

Query: 172 EEKKEEDEIEEEENEEEEEVK-QKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
            E + E E  EE  E E+E + + + K E E E ++KE +   + E E +E+E+E  ++ 
Sbjct: 726 TEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQA 785

Query: 231 KKNEEEEEEEEVLIKLEEEEE 251
            ++ E + +E    K+E E E
Sbjct: 786 GEDGEMKGDEGAEGKVEHEGE 806


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 48.8 bits (117), Expect = 1e-06
 Identities = 26/78 (33%), Positives = 50/78 (64%)

Query: 163 KFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEE 222
            F      E+E++EED +++E+ E++++  +++ ++ +  +EE+ EEDE   +E + EEE
Sbjct: 313 NFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEE 372

Query: 223 EEEEVVKQKKNEEEEEEE 240
           EEE+  K+KK+ E    E
Sbjct: 373 EEEKEKKKKKSAESTRSE 390



 Score = 46.1 bits (110), Expect = 8e-06
 Identities = 29/81 (35%), Positives = 51/81 (62%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           E+ K+  D++++E   ++++       EE+EEEEE   +DE E+++ ++ EEEEE+V   
Sbjct: 293 EDSKESADDLDDEFEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLS 352

Query: 231 KKNEEEEEEEEVLIKLEEEEE 251
            + E+EE+E+      EEEEE
Sbjct: 353 DEEEDEEDEDSDDEDDEEEEE 373



 Score = 38.8 bits (91), Expect = 0.002
 Identities = 28/86 (32%), Positives = 50/86 (58%), Gaps = 4/86 (4%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           E    +++     + EE+E EEE+ V    + E+EE++++  EE+E + +  + EE+EE+
Sbjct: 305 EFEPDDDDNFGLGQGEEDEEEEEDGV----DDEDEEDDDDDLEEEEEDVDLSDEEEDEED 360

Query: 226 EVVKQKKNEEEEEEEEVLIKLEEEEE 251
           E    + +EEEEEEE+   K +  E 
Sbjct: 361 EDSDDEDDEEEEEEEKEKKKKKSAES 386



 Score = 36.9 bits (86), Expect = 0.008
 Identities = 24/85 (28%), Positives = 51/85 (60%), Gaps = 5/85 (5%)

Query: 172 EEKKEEDEIEEEENE-----EEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEE 226
           +  K E+E+ +EE E     E E +++ + +EE++EEEE  +E   + ++    ++++  
Sbjct: 255 DRTKTEEELAKEEAERLKKLEAERLRRMRGEEEDDEEEEDSKESADDLDDEFEPDDDDNF 314

Query: 227 VVKQKKNEEEEEEEEVLIKLEEEEE 251
            + Q + +EEEEE+ V  + EE+++
Sbjct: 315 GLGQGEEDEEEEEDGVDDEDEEDDD 339



 Score = 36.1 bits (84), Expect = 0.017
 Identities = 18/83 (21%), Positives = 50/83 (60%), Gaps = 1/83 (1%)

Query: 170 GEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVK 229
            E++++EED  E  ++ ++E      +     + EE +EE+E +  + E+EE++++++ +
Sbjct: 286 EEDDEEEEDSKESADDLDDEFEPDDDDNFGLGQGEEDEEEEE-DGVDDEDEEDDDDDLEE 344

Query: 230 QKKNEEEEEEEEVLIKLEEEEEN 252
           ++++ +  +EEE     + ++E+
Sbjct: 345 EEEDVDLSDEEEDEEDEDSDDED 367



 Score = 35.4 bits (82), Expect = 0.024
 Identities = 27/97 (27%), Positives = 48/97 (49%), Gaps = 15/97 (15%)

Query: 171 EEEKKEEDE---------------IEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKE 215
           EE  KEE E                EE++ EEE+  +   + ++E E ++       + E
Sbjct: 261 EELAKEEAERLKKLEAERLRRMRGEEEDDEEEEDSKESADDLDDEFEPDDDDNFGLGQGE 320

Query: 216 EGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEEN 252
           E E EEE+  +   ++ ++++ EEEE  + L +EEE+
Sbjct: 321 EDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEED 357



 Score = 30.0 bits (68), Expect = 1.4
 Identities = 16/62 (25%), Positives = 41/62 (66%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
            +   EE+  ++ E EEE+ +  +E + +++++ ++E++E++EE+E EK++ ++ E    
Sbjct: 330 VDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRS 389

Query: 226 EV 227
           E+
Sbjct: 390 EL 391



 Score = 28.8 bits (65), Expect = 3.3
 Identities = 15/43 (34%), Positives = 26/43 (60%)

Query: 165 WENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKK 207
            +    EE++++ED  +E++ EEEEE K+KK K+  E    + 
Sbjct: 349 VDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSEL 391



 Score = 27.7 bits (62), Expect = 8.3
 Identities = 16/49 (32%), Positives = 30/49 (61%)

Query: 204 EEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEEN 252
           +  K E+E+ KEE E  ++ E E +++ + EEE++EEE   K   ++ +
Sbjct: 255 DRTKTEEELAKEEAERLKKLEAERLRRMRGEEEDDEEEEDSKESADDLD 303


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 47.9 bits (115), Expect = 2e-06
 Identities = 28/82 (34%), Positives = 51/82 (62%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
           A EE  K  +E E+E  E   E+++ + +  ++EE   ++ + +EK E E E++E+E   
Sbjct: 62  AKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQ 121

Query: 229 KQKKNEEEEEEEEVLIKLEEEE 250
           KQ++ E++EEE E LI+ + +E
Sbjct: 122 KQQELEKKEEELEELIEEQLQE 143



 Score = 40.5 bits (96), Expect = 4e-04
 Identities = 27/79 (34%), Positives = 45/79 (56%), Gaps = 1/79 (1%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           +E ++E   I EE  +E E +K++   E +EE  + + E E E  E  NE ++ E+ + Q
Sbjct: 34  KEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQ 93

Query: 231 KKNEEEEEEEEVLIKLEEE 249
           K+ E  + + E+L K EEE
Sbjct: 94  KE-ENLDRKLELLEKREEE 111



 Score = 39.8 bits (94), Expect = 8e-04
 Identities = 30/80 (37%), Positives = 51/80 (63%), Gaps = 1/80 (1%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           E  +K E+E+E++E E E++ ++ + KEEE EE  +++  E+E+  G   EE +E ++++
Sbjct: 103 ELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQELERISGLTAEEAKEILLEK 162

Query: 231 KKNEEEEEEEEVLIKLEEEE 250
            + EE   E  VLIK  EEE
Sbjct: 163 VE-EEARHEAAVLIKEIEEE 181


>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
           [Transcription / Chromatin structure and dynamics].
          Length = 279

 Score = 44.6 bits (105), Expect = 2e-05
 Identities = 30/84 (35%), Positives = 46/84 (54%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
              EE++  +E +  E EE+EEV      E   E  E++EE+    ++GE+  + E E +
Sbjct: 174 VDNEEEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEEEAEGSDDGEDVVDYEGERI 233

Query: 229 KQKKNEEEEEEEEVLIKLEEEEEN 252
            +K+ EEEE EEEV+   E E E 
Sbjct: 234 DKKQGEEEEMEEEVINLFEIEWEE 257



 Score = 28.0 bits (62), Expect = 4.3
 Identities = 18/81 (22%), Positives = 36/81 (44%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           +EE  E + + E+       +    +++ +E    + + D  E+E  E  +  EEE  ++
Sbjct: 136 DEEDVEREILAEKPRVTRFNIVWDNDEDNDEAPPAQPDVDNEEEERLEESDGREEEEDEE 195

Query: 231 KKNEEEEEEEEVLIKLEEEEE 251
             ++   E    L + EEEE 
Sbjct: 196 VGSDSYGEGNRELNEEEEEEA 216


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
           Fas-binding protein) is thought to play a role in
           apoptosis, but precise role played by Daxx remains to be
           determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 45.3 bits (107), Expect = 2e-05
 Identities = 28/69 (40%), Positives = 43/69 (62%)

Query: 173 EKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKK 232
           ++ EE+E  EEE EEEEE ++++ + EEEE E+++EE+E+E + G  EE E         
Sbjct: 437 QESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDG 496

Query: 233 NEEEEEEEE 241
            E EE+ E 
Sbjct: 497 EEPEEDAER 505



 Score = 44.9 bits (106), Expect = 2e-05
 Identities = 31/70 (44%), Positives = 46/70 (65%), Gaps = 4/70 (5%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQK 231
           +E +EE+ +EEEE EEEEE ++    E+E EEEE ++E+E E+ E +N  EEE E   + 
Sbjct: 437 QESEEEESVEEEEEEEEEEEEE----EQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEG 492

Query: 232 KNEEEEEEEE 241
             + EE EE+
Sbjct: 493 DGDGEEPEED 502



 Score = 43.0 bits (101), Expect = 8e-05
 Identities = 32/66 (48%), Positives = 41/66 (62%)

Query: 187 EEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKL 246
           +E E ++   +EEEEEEEE++EE E E+EEGE+EEEEEE        EE E   E     
Sbjct: 437 QESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDG 496

Query: 247 EEEEEN 252
           EE EE+
Sbjct: 497 EEPEED 502



 Score = 43.0 bits (101), Expect = 9e-05
 Identities = 30/73 (41%), Positives = 39/73 (53%), Gaps = 8/73 (10%)

Query: 170 GEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVK 229
            EE  +EE+E EEEE EEE+        E EEEE E +EE+E  + +  +EEE E     
Sbjct: 441 EEESVEEEEEEEEEEEEEEQ--------ESEEEEGEDEEEEEEVEADNGSEEEMEGSSEG 492

Query: 230 QKKNEEEEEEEEV 242
               EE EE+ E 
Sbjct: 493 DGDGEEPEEDAER 505



 Score = 38.7 bits (90), Expect = 0.002
 Identities = 29/82 (35%), Positives = 43/82 (52%), Gaps = 3/82 (3%)

Query: 162 FKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKE---EGE 218
               E+   E  ++EE+E EEEE EE+E  +++   EEEEEE E     E E E   EG+
Sbjct: 434 MASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGD 493

Query: 219 NEEEEEEEVVKQKKNEEEEEEE 240
            + EE EE  +++ +E      
Sbjct: 494 GDGEEPEEDAERRNSEMAGISR 515



 Score = 37.2 bits (86), Expect = 0.006
 Identities = 31/82 (37%), Positives = 45/82 (54%), Gaps = 7/82 (8%)

Query: 170 GEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVK 229
           G   +  +       + E   +  ++++EEE  EEE++EE+E E+EE E+EEEE E+   
Sbjct: 413 GTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGED--- 469

Query: 230 QKKNEEEEEEEEVLIKLEEEEE 251
               EEEEEE E     EEE E
Sbjct: 470 ----EEEEEEVEADNGSEEEME 487



 Score = 36.8 bits (85), Expect = 0.008
 Identities = 21/71 (29%), Positives = 36/71 (50%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           EEE++EE+E E+E  EEE E ++++ + E +   E++ E   E +    E EE+ E    
Sbjct: 449 EEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNS 508

Query: 231 KKNEEEEEEEE 241
           +        E 
Sbjct: 509 EMAGISRMSEG 519



 Score = 36.0 bits (83), Expect = 0.017
 Identities = 25/78 (32%), Positives = 37/78 (47%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           EEE++EE E EEEE E+EEE ++ +     EEE E   E + + EE E + E     +  
Sbjct: 453 EEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEMAG 512

Query: 231 KKNEEEEEEEEVLIKLEE 248
                E ++        E
Sbjct: 513 ISRMSEGQQPRGSSVQPE 530



 Score = 34.5 bits (79), Expect = 0.053
 Identities = 19/72 (26%), Positives = 32/72 (44%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
             E E +EE+E  E +N  EEE++     + + EE E+  E    +  G +   E ++  
Sbjct: 464 EEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEMAGISRMSEGQQPR 523

Query: 229 KQKKNEEEEEEE 240
                 E  +EE
Sbjct: 524 GSSVQPESPQEE 535



 Score = 33.7 bits (77), Expect = 0.076
 Identities = 25/87 (28%), Positives = 43/87 (49%), Gaps = 9/87 (10%)

Query: 175 KEEDEIEEEENEEEEEVKQKKNKEEEEEEEE---------KKEEDEIEKEEGENEEEEEE 225
           K++D  EEE  + +E  +Q  +    +  +             ++  E+E  E EEEEEE
Sbjct: 394 KQDDTEEEERRKRQERERQGTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEEE 453

Query: 226 EVVKQKKNEEEEEEEEVLIKLEEEEEN 252
           E  ++++  EEEE E+   + E E +N
Sbjct: 454 EEEEEEQESEEEEGEDEEEEEEVEADN 480



 Score = 29.5 bits (66), Expect = 2.0
 Identities = 27/84 (32%), Positives = 43/84 (51%), Gaps = 20/84 (23%)

Query: 188 EEEVKQKKNKEEEEEEEEKKEEDEIEKEEGE--------------------NEEEEEEEV 227
           EE + +   K+++ EEEE+++  E E++                       ++E EEEE 
Sbjct: 385 EEVISKYAMKQDDTEEEERRKRQERERQGTSSRSSDPSKASSTSGESPSMASQESEEEES 444

Query: 228 VKQKKNEEEEEEEEVLIKLEEEEE 251
           V++++ EEEEEEEE     EEE E
Sbjct: 445 VEEEEEEEEEEEEEEQESEEEEGE 468


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 45.1 bits (106), Expect = 2e-05
 Identities = 33/83 (39%), Positives = 55/83 (66%), Gaps = 3/83 (3%)

Query: 169  AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
            A E +K EE +I+ EE ++ EE K+K  + +++E EEKK+ +E++K E EN+ +  EE  
Sbjct: 1609 AEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEE-- 1666

Query: 229  KQKKNEEEEEEEEVLIKLEEEEE 251
              KK EE++++ E   K EE+E+
Sbjct: 1667 -AKKAEEDKKKAEEAKKAEEDEK 1688



 Score = 44.7 bits (105), Expect = 3e-05
 Identities = 29/86 (33%), Positives = 50/86 (58%), Gaps = 4/86 (4%)

Query: 171  EEEKKEEDEIEEEENEEEEEVKQKKNKEEEE----EEEEKKEEDEIEKEEGENEEEEEEE 226
             EEKK+ +E+++ E E + +  ++  K EE+    EE +K EEDE +  E   +E EE +
Sbjct: 1643 AEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAK 1702

Query: 227  VVKQKKNEEEEEEEEVLIKLEEEEEN 252
              ++ K +E EE+++     + EEEN
Sbjct: 1703 KAEELKKKEAEEKKKAEELKKAEEEN 1728



 Score = 42.4 bits (99), Expect = 1e-04
 Identities = 29/87 (33%), Positives = 51/87 (58%), Gaps = 5/87 (5%)

Query: 171  EEEKKEEDEIEEEENEEEEEVKQKKNKEEEE---EEEEKKEEDEIEKEEGENEEEEEE-- 225
            EE+KK+ +E ++ E +E++  +  K + EE    EE +KKE +E +K E   + EEE   
Sbjct: 1671 EEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKI 1730

Query: 226  EVVKQKKNEEEEEEEEVLIKLEEEEEN 252
            +  + KK  EE++++    K +EEE+ 
Sbjct: 1731 KAEEAKKEAEEDKKKAEEAKKDEEEKK 1757



 Score = 40.9 bits (95), Expect = 5e-04
 Identities = 32/86 (37%), Positives = 55/86 (63%), Gaps = 2/86 (2%)

Query: 169  AGEEEKKEEDEIE--EEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEE 226
            A E +KKE +E +  EE  + EEE K K  + ++E EE+KK+ +E +K+E E ++    +
Sbjct: 1704 AEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLK 1763

Query: 227  VVKQKKNEEEEEEEEVLIKLEEEEEN 252
              ++KK EE  +E+E +I+ E +EE+
Sbjct: 1764 KEEEKKAEEIRKEKEAVIEEELDEED 1789



 Score = 40.5 bits (94), Expect = 7e-04
 Identities = 34/97 (35%), Positives = 54/97 (55%), Gaps = 11/97 (11%)

Query: 166  ENIAGEEEKKEEDE---------IEEEENEEEEEVK--QKKNKEEEEEEEEKKEEDEIEK 214
              I  EE KK E+E          E EE ++ EE+K  +++NK +  EE +K EED+ + 
Sbjct: 1618 AKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKA 1677

Query: 215  EEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEE 251
            EE +  EE+E++  +  K E EE ++   +K +E EE
Sbjct: 1678 EEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEE 1714



 Score = 40.1 bits (93), Expect = 0.001
 Identities = 31/85 (36%), Positives = 59/85 (69%), Gaps = 2/85 (2%)

Query: 169  AGEEEKKEEDEI--EEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEE 226
            A E+EKK  + +  E EE ++ EE+K+K+ +E+++ EE KK E+E + +  E ++E EE+
Sbjct: 1683 AEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEED 1742

Query: 227  VVKQKKNEEEEEEEEVLIKLEEEEE 251
              K ++ +++EEE++ +  L++EEE
Sbjct: 1743 KKKAEEAKKDEEEKKKIAHLKKEEE 1767



 Score = 37.8 bits (87), Expect = 0.004
 Identities = 27/85 (31%), Positives = 45/85 (52%)

Query: 166  ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
            E     EE K+ +E +     + EE K+ +    EE  +  +EE +++ EE +  EE + 
Sbjct: 1561 EEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKI 1620

Query: 226  EVVKQKKNEEEEEEEEVLIKLEEEE 250
            +  + KK EEE+++ E L K E EE
Sbjct: 1621 KAEELKKAEEEKKKVEQLKKKEAEE 1645



 Score = 37.8 bits (87), Expect = 0.005
 Identities = 29/85 (34%), Positives = 49/85 (57%), Gaps = 2/85 (2%)

Query: 166  ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
            E +   EE+ +    EE +  EE++ K ++ K+ EE+E++  E  + E EE +  EE ++
Sbjct: 1650 EELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKK 1709

Query: 226  EVVKQKKNEEE--EEEEEVLIKLEE 248
            +  ++KK  EE  + EEE  IK EE
Sbjct: 1710 KEAEEKKKAEELKKAEEENKIKAEE 1734



 Score = 36.3 bits (83), Expect = 0.015
 Identities = 23/74 (31%), Positives = 43/74 (58%)

Query: 169  AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
            A E +KK E+  + +E +++ E  +KK  E ++  E KK+ DE +K E   + +E ++  
Sbjct: 1472 ADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAE 1531

Query: 229  KQKKNEEEEEEEEV 242
            + KK +E ++ EE 
Sbjct: 1532 EAKKADEAKKAEEK 1545



 Score = 36.3 bits (83), Expect = 0.017
 Identities = 22/81 (27%), Positives = 46/81 (56%)

Query: 171  EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
            EE KK+ D  +++  E ++  +  K + E   +E +  E++ E  E + EE +++    +
Sbjct: 1325 EEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAK 1384

Query: 231  KKNEEEEEEEEVLIKLEEEEE 251
            KK EE+++ +E   K EE+++
Sbjct: 1385 KKAEEKKKADEAKKKAEEDKK 1405



 Score = 35.5 bits (81), Expect = 0.029
 Identities = 25/81 (30%), Positives = 49/81 (60%), Gaps = 3/81 (3%)

Query: 172  EEKKEEDEIE---EEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
            EEKK+ DE +   EE+ ++ +E+K+    +++ +E +KK E++ + +E + + EE ++  
Sbjct: 1388 EEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKAD 1447

Query: 229  KQKKNEEEEEEEEVLIKLEEE 249
            + KK  EE ++ E   K  EE
Sbjct: 1448 EAKKKAEEAKKAEEAKKKAEE 1468



 Score = 35.1 bits (80), Expect = 0.031
 Identities = 23/81 (28%), Positives = 48/81 (59%), Gaps = 2/81 (2%)

Query: 171  EEEKKEEDEIE--EEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
            EE+KK+ DE++      ++ +E K+K  ++++ +E +KK E+  + +E + + EE ++  
Sbjct: 1401 EEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAE 1460

Query: 229  KQKKNEEEEEEEEVLIKLEEE 249
            + KK  EE ++ +   K  EE
Sbjct: 1461 EAKKKAEEAKKADEAKKKAEE 1481



 Score = 35.1 bits (80), Expect = 0.033
 Identities = 22/77 (28%), Positives = 47/77 (61%)

Query: 166  ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
            E    +E KK+ +E +++ +E ++  + KK  +E ++ EE K+ DE +K E   + +E +
Sbjct: 1481 EAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAK 1540

Query: 226  EVVKQKKNEEEEEEEEV 242
            +  ++KK +E ++ EE+
Sbjct: 1541 KAEEKKKADELKKAEEL 1557



 Score = 34.7 bits (79), Expect = 0.043
 Identities = 21/73 (28%), Positives = 40/73 (54%)

Query: 169  AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
            A E +KK E+  + +E +++ E  +KK    +++ EE K+  E  K E E   +E E   
Sbjct: 1304 ADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAE 1363

Query: 229  KQKKNEEEEEEEE 241
            ++ +  E+++EE 
Sbjct: 1364 EKAEAAEKKKEEA 1376



 Score = 34.7 bits (79), Expect = 0.043
 Identities = 24/83 (28%), Positives = 47/83 (56%), Gaps = 2/83 (2%)

Query: 171  EEEKKEEDEIEE--EENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
               KK+ DE ++  EE ++ +E K+K  + ++ +E +KK E+  + EE + + EE ++  
Sbjct: 1414 AAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKAD 1473

Query: 229  KQKKNEEEEEEEEVLIKLEEEEE 251
            + KK  EE ++ +   K  EE +
Sbjct: 1474 EAKKKAEEAKKADEAKKKAEEAK 1496



 Score = 34.3 bits (78), Expect = 0.066
 Identities = 22/83 (26%), Positives = 46/83 (55%)

Query: 169  AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
            A E +K EE + + EE ++ +E K+K  + ++ +E +KK E+  +K +   +  E ++  
Sbjct: 1453 AEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKA 1512

Query: 229  KQKKNEEEEEEEEVLIKLEEEEE 251
             + K  EE ++ +   K EE ++
Sbjct: 1513 DEAKKAEEAKKADEAKKAEEAKK 1535



 Score = 34.0 bits (77), Expect = 0.091
 Identities = 23/83 (27%), Positives = 48/83 (57%)

Query: 169  AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
            A E++K +E + + EE ++ +E K+K  + ++ EE +KK E+  + +E + + EE ++  
Sbjct: 1427 AEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKAD 1486

Query: 229  KQKKNEEEEEEEEVLIKLEEEEE 251
            + KK  EE +++    K   E +
Sbjct: 1487 EAKKKAEEAKKKADEAKKAAEAK 1509



 Score = 33.6 bits (76), Expect = 0.10
 Identities = 19/83 (22%), Positives = 43/83 (51%)

Query: 166  ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
            E    +E KK+ +E +++ +  +++ ++ K   E  + E +   DE E  E + E  E++
Sbjct: 1313 EAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKK 1372

Query: 226  EVVKQKKNEEEEEEEEVLIKLEE 248
            +   +KK +  +++ E   K +E
Sbjct: 1373 KEEAKKKADAAKKKAEEKKKADE 1395



 Score = 33.2 bits (75), Expect = 0.14
 Identities = 21/82 (25%), Positives = 48/82 (58%), Gaps = 1/82 (1%)

Query: 169  AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKK-EEDEIEKEEGENEEEEEEEV 227
            A EE+ +  ++ +EE  ++ +  K+K  ++++ +E +KK EED+ + +E +     +++ 
Sbjct: 1361 AAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKA 1420

Query: 228  VKQKKNEEEEEEEEVLIKLEEE 249
             + KK  EE+++ +   K  EE
Sbjct: 1421 DEAKKKAEEKKKADEAKKKAEE 1442



 Score = 33.2 bits (75), Expect = 0.14
 Identities = 23/71 (32%), Positives = 44/71 (61%), Gaps = 4/71 (5%)

Query: 171  EEEKKEEDEIEEEEN----EEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEE 226
             E KK+ DE ++ E     +E ++ ++ K  +E ++ EEKK+ DE++K E   + EE+++
Sbjct: 1506 AEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKK 1565

Query: 227  VVKQKKNEEEE 237
              + KK EE++
Sbjct: 1566 AEEAKKAEEDK 1576



 Score = 33.2 bits (75), Expect = 0.15
 Identities = 24/85 (28%), Positives = 48/85 (56%), Gaps = 4/85 (4%)

Query: 171  EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
            EE + EE     EE ++ +  + KK +E + + EE K+ +E +K+  + +++E EE  K 
Sbjct: 1590 EEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKA 1649

Query: 231  KKNEEEEEEEEV----LIKLEEEEE 251
            ++ ++ EEE ++      K  EE++
Sbjct: 1650 EELKKAEEENKIKAAEEAKKAEEDK 1674



 Score = 33.2 bits (75), Expect = 0.16
 Identities = 21/81 (25%), Positives = 43/81 (53%)

Query: 171  EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
            EE+KK ++  + EE ++ +E K+K  + ++ +E +KK E+  +K +   ++ EE +   +
Sbjct: 1287 EEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAE 1346

Query: 231  KKNEEEEEEEEVLIKLEEEEE 251
                E E   +     EE+ E
Sbjct: 1347 AAKAEAEAAADEAEAAEEKAE 1367



 Score = 32.8 bits (74), Expect = 0.21
 Identities = 24/82 (29%), Positives = 48/82 (58%), Gaps = 3/82 (3%)

Query: 171  EEEKKEEDEIEE--EENEEEEEVKQKKNKEEEEEEEEKKEEDEIEK-EEGENEEEEEEEV 227
            EE KK+ D  ++  EE ++ +E K+K  +++++ +E KK     +K +E + + EE+++ 
Sbjct: 1374 EEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKA 1433

Query: 228  VKQKKNEEEEEEEEVLIKLEEE 249
             + KK  EE ++ +   K  EE
Sbjct: 1434 DEAKKKAEEAKKADEAKKKAEE 1455



 Score = 32.4 bits (73), Expect = 0.25
 Identities = 23/84 (27%), Positives = 49/84 (58%), Gaps = 1/84 (1%)

Query: 169  AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
            A E +K +E + + EE ++ EE K+K  + ++ +E +KK E+  + +E + + EE ++  
Sbjct: 1440 AEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKA 1499

Query: 229  KQ-KKNEEEEEEEEVLIKLEEEEE 251
             + KK  E +++ +   K EE ++
Sbjct: 1500 DEAKKAAEAKKKADEAKKAEEAKK 1523



 Score = 32.0 bits (72), Expect = 0.32
 Identities = 22/83 (26%), Positives = 50/83 (60%), Gaps = 2/83 (2%)

Query: 171  EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEE--EEVV 228
            EE KK+ DE ++    +++  + KK +E ++ +E KK E+  + +E +  EE++  +E+ 
Sbjct: 1493 EEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELK 1552

Query: 229  KQKKNEEEEEEEEVLIKLEEEEE 251
            K ++ ++ EE+++     + EE+
Sbjct: 1553 KAEELKKAEEKKKAEEAKKAEED 1575



 Score = 31.6 bits (71), Expect = 0.45
 Identities = 23/82 (28%), Positives = 44/82 (53%)

Query: 168  IAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEV 227
            I  EE +K ++  + EE ++ +E K+ + K++ +E ++K EE +   E  +  EE +++ 
Sbjct: 1272 IKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKA 1331

Query: 228  VKQKKNEEEEEEEEVLIKLEEE 249
               KK  EE ++     K E E
Sbjct: 1332 DAAKKKAEEAKKAAEAAKAEAE 1353



 Score = 31.3 bits (70), Expect = 0.52
 Identities = 30/94 (31%), Positives = 48/94 (51%), Gaps = 3/94 (3%)

Query: 160  EIFKFWENIAGEEEKKEEDEIEEEENEEEEEVK---QKKNKEEEEEEEEKKEEDEIEKEE 216
            E  K  E    +E KK E++ + +E ++ EE+K   +KK  EE ++ EE K     + EE
Sbjct: 1526 EAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEE 1585

Query: 217  GENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEE 250
             +  EE   E V +   EE++ + E   K EE +
Sbjct: 1586 AKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAK 1619



 Score = 31.3 bits (70), Expect = 0.56
 Identities = 19/83 (22%), Positives = 42/83 (50%)

Query: 168  IAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEV 227
             A  E +   DE E  E + E   K+K+  +++ +  +KK E++ + +E + + EE+++ 
Sbjct: 1347 AAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKK 1406

Query: 228  VKQKKNEEEEEEEEVLIKLEEEE 250
              + K     +++    K + EE
Sbjct: 1407 ADELKKAAAAKKKADEAKKKAEE 1429



 Score = 31.3 bits (70), Expect = 0.64
 Identities = 32/91 (35%), Positives = 53/91 (58%), Gaps = 11/91 (12%)

Query: 172  EEKKEEDEIEEEEN----EEEEEVKQKKNKEEEE-------EEEEKKEEDEIEKEEGENE 220
            EEKK+ DE+++ E     EE+++ ++ K  EE++       EE +K EE  IE+     E
Sbjct: 1543 EEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYE 1602

Query: 221  EEEEEEVVKQKKNEEEEEEEEVLIKLEEEEE 251
            EE++ +  + KK EE + + E L K EEE++
Sbjct: 1603 EEKKMKAEEAKKAEEAKIKAEELKKAEEEKK 1633



 Score = 30.9 bits (69), Expect = 0.70
 Identities = 21/82 (25%), Positives = 44/82 (53%)

Query: 160  EIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGEN 219
            E+ K  E    +E KK E++ + +E +++ E  +K ++ +++ EE KK+ D  +K+  E 
Sbjct: 1282 ELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEA 1341

Query: 220  EEEEEEEVVKQKKNEEEEEEEE 241
            ++  E    + +   +E E  E
Sbjct: 1342 KKAAEAAKAEAEAAADEAEAAE 1363



 Score = 30.9 bits (69), Expect = 0.71
 Identities = 27/78 (34%), Positives = 48/78 (61%), Gaps = 2/78 (2%)

Query: 175  KEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKK--EEDEIEKEEGENEEEEEEEVVKQKK 232
            ++ +E ++ E    EEV +   +E++ + EE K  EE +I+ EE +  EEE+++V + KK
Sbjct: 1581 RKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKK 1640

Query: 233  NEEEEEEEEVLIKLEEEE 250
             E EE+++   +K  EEE
Sbjct: 1641 KEAEEKKKAEELKKAEEE 1658



 Score = 30.1 bits (67), Expect = 1.4
 Identities = 23/63 (36%), Positives = 42/63 (66%)

Query: 171  EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
            EE+KK+ +E +++E E+++    KK +E++ EE  K++E  IE+E  E +E+   EV K+
Sbjct: 1740 EEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKK 1799

Query: 231  KKN 233
             K+
Sbjct: 1800 IKD 1802



 Score = 29.3 bits (65), Expect = 2.1
 Identities = 18/92 (19%), Positives = 50/92 (54%)

Query: 160  EIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGEN 219
            E  + +E        + +  I+ EE  + +E+K+ + K++ +E ++ +E+ + ++ + + 
Sbjct: 1252 EEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKA 1311

Query: 220  EEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEE 251
            EE ++ +  K+K  E +++ +    K EE ++
Sbjct: 1312 EEAKKADEAKKKAEEAKKKADAAKKKAEEAKK 1343



 Score = 29.0 bits (64), Expect = 3.6
 Identities = 23/81 (28%), Positives = 48/81 (59%), Gaps = 1/81 (1%)

Query: 169  AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEK-EEGENEEEEEEEV 227
            A E +KK E+  + EE +++ E  +K ++ +++ EE KK ++  +K EE + + +E ++ 
Sbjct: 1446 ADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKA 1505

Query: 228  VKQKKNEEEEEEEEVLIKLEE 248
             + KK  +E ++ E   K +E
Sbjct: 1506 AEAKKKADEAKKAEEAKKADE 1526



 Score = 29.0 bits (64), Expect = 3.6
 Identities = 32/96 (33%), Positives = 51/96 (53%), Gaps = 7/96 (7%)

Query: 160  EIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQ----KKNKEEEEEEEEKKEEDEIEKE 215
            E  K  E    +E KK E+  + +E ++ EE K+    KK +E ++ EE+KK E+  + E
Sbjct: 1514 EAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAE 1573

Query: 216  EGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEE 251
            E +N    + E  K+    EE   EEV+   EEE++
Sbjct: 1574 EDKNMALRKAEEAKKA---EEARIEEVMKLYEEEKK 1606



 Score = 27.8 bits (61), Expect = 6.6
 Identities = 16/66 (24%), Positives = 32/66 (48%)

Query: 177  EDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEE 236
            +DE  +   ++ +   ++ N+ +E  EE   + +E +K E    EE  +    +KK E+ 
Sbjct: 1068 QDEGLKPSYKDFDFDAKEDNRADEATEEAFGKAEEAKKTETGKAEEARKAEEAKKKAEDA 1127

Query: 237  EEEEEV 242
             + EE 
Sbjct: 1128 RKAEEA 1133



 Score = 27.4 bits (60), Expect = 9.8
 Identities = 18/87 (20%), Positives = 39/87 (44%)

Query: 164  FWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEE 223
            + +     +E    DE  EE   + EE K+ +  + EE  + ++ + + E      E  +
Sbjct: 1076 YKDFDFDAKEDNRADEATEEAFGKAEEAKKTETGKAEEARKAEEAKKKAEDARKAEEARK 1135

Query: 224  EEEVVKQKKNEEEEEEEEVLIKLEEEE 250
             E+  K ++  + E+ + V I  + E+
Sbjct: 1136 AEDARKAEEARKAEDAKRVEIARKAED 1162


>gnl|CDD|217829 pfam03985, Paf1, Paf1.  Members of this family are components of
           the RNA polymerase II associated Paf1 complex. The Paf1
           complex functions during the elongation phase of
           transcription in conjunction with Spt4-Spt5 and
           Spt16-Pob3i.
          Length = 431

 Score = 44.3 bits (105), Expect = 3e-05
 Identities = 13/65 (20%), Positives = 31/65 (47%)

Query: 175 KEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNE 234
            EE + +E+E EE+   + ++ + E+ EEE  +  ++   E   +   + E    ++   
Sbjct: 367 FEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESAS 426

Query: 235 EEEEE 239
           + + E
Sbjct: 427 DSDSE 431



 Score = 42.8 bits (101), Expect = 8e-05
 Identities = 15/66 (22%), Positives = 33/66 (50%)

Query: 176 EEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEE 235
           + +E++E+E+EEEE+   +  +EE E+ EE+  +   +     + +   +   K  K   
Sbjct: 366 DFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESA 425

Query: 236 EEEEEE 241
            + + E
Sbjct: 426 SDSDSE 431



 Score = 37.4 bits (87), Expect = 0.005
 Identities = 17/57 (29%), Positives = 30/57 (52%)

Query: 196 NKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEEN 252
             +E+E+EEE++  DE E+EEGE+ EEE  +  +   +E   +         ++E  
Sbjct: 369 EVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESA 425



 Score = 35.9 bits (83), Expect = 0.016
 Identities = 11/56 (19%), Positives = 25/56 (44%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEE 226
           E+E++E+   E EE E E+  ++     E+   E   +     + + + E   + +
Sbjct: 374 EDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESASDSD 429



 Score = 33.6 bits (77), Expect = 0.089
 Identities = 17/56 (30%), Positives = 27/56 (48%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEE 226
           EEE++  DE EEEE E+ EE   +  ++   E       D   K + E+  + + E
Sbjct: 376 EEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESASDSDSE 431



 Score = 33.2 bits (76), Expect = 0.11
 Identities = 13/55 (23%), Positives = 28/55 (50%)

Query: 170 GEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEE 224
            EE++ +E E EE E+ EEE  + +++   E   +   + +    +E  ++ + E
Sbjct: 377 EEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESASDSDSE 431



 Score = 32.8 bits (75), Expect = 0.14
 Identities = 15/57 (26%), Positives = 32/57 (56%)

Query: 185 NEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEE 241
           + +E +++ K+    +  + E+ +EDE E+EE  ++E EEEE    ++   +  E+ 
Sbjct: 348 STKESKMRDKRRARLDPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDG 404



 Score = 27.8 bits (62), Expect = 6.0
 Identities = 9/47 (19%), Positives = 19/47 (40%)

Query: 165 WENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDE 211
            E+   E E  EE+  +  E+   E      +  E + ++E   + +
Sbjct: 383 DEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESASDSD 429


>gnl|CDD|218484 pfam05185, PRMT5, PRMT5 arginine-N-methyltransferase.  The human
           homologue of yeast Skb1 (Shk1 kinase-binding protein 1)
           is PRMT5, an arginine-N-methyltransferase. These
           proteins appear to be key mitotic regulators. They play
           a role in Jak signalling in higher eukaryotes.
          Length = 445

 Score = 44.2 bits (105), Expect = 3e-05
 Identities = 24/79 (30%), Positives = 39/79 (49%), Gaps = 12/79 (15%)

Query: 94  EVIQNKIENVELPEKVDIIVSEWMGFY----LLHESMIDSVIFARDKFLKPEGVMYPYKC 149
            +I + +   + PEK DI+VSE +G +    L  E + D       +FLKP+G+  P   
Sbjct: 244 TIISSDMREWKGPEKADILVSELLGSFGDNELSPECL-DGA----QRFLKPDGISIPQSY 298

Query: 150 ILHSAPSYSPEIFKFWENI 168
             + AP  SP   K ++ +
Sbjct: 299 TSYLAPISSP---KLYQKV 314


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 44.0 bits (104), Expect = 4e-05
 Identities = 15/73 (20%), Positives = 45/73 (61%), Gaps = 1/73 (1%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQ-KKNKEEEEEEEEKKEEDEIEKEEGENEEEEE 224
            +++G++ ++   E  ++  +  +++ + K   +  EE E K +E+++++EE E +++E+
Sbjct: 360 TDLSGDKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQ 419

Query: 225 EEVVKQKKNEEEE 237
            +  K+K+ ++E 
Sbjct: 420 ADEDKEKRQKDER 432



 Score = 33.2 bits (76), Expect = 0.13
 Identities = 15/48 (31%), Positives = 31/48 (64%), Gaps = 1/48 (2%)

Query: 171 EEEKKEEDEIEEEENE-EEEEVKQKKNKEEEEEEEEKKEEDEIEKEEG 217
            E K E D  EE E + +EE++KQ++N+++++E+ ++ +E   + E  
Sbjct: 386 GEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERK 433



 Score = 32.1 bits (73), Expect = 0.29
 Identities = 12/37 (32%), Positives = 29/37 (78%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKK 207
            E K +E+++++EENE++++ +  ++KE+ +++E KK
Sbjct: 398 AEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434



 Score = 30.1 bits (68), Expect = 1.1
 Identities = 17/80 (21%), Positives = 44/80 (55%), Gaps = 1/80 (1%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQK 231
           EE K   ++  ++ +E  +   KK ++  ++  E K+E +  +E  E + +EE+   ++ 
Sbjct: 354 EEVKSNTDLSGDKRQELLKEYNKKLQDYTKKLGEVKDETDASEEA-EAKAKEEKLKQEEN 412

Query: 232 KNEEEEEEEEVLIKLEEEEE 251
           + +++E+ +E   K +++E 
Sbjct: 413 EKKQKEQADEDKEKRQKDER 432



 Score = 27.5 bits (61), Expect = 7.2
 Identities = 14/56 (25%), Positives = 28/56 (50%), Gaps = 5/56 (8%)

Query: 197 KEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEEN 252
           K +E  +E  K+  +  K+ GE ++E +       +  E + +EE L + E E++ 
Sbjct: 366 KRQELLKEYNKKLQDYTKKLGEVKDETDAS-----EEAEAKAKEEKLKQEENEKKQ 416


>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
           Provisional.
          Length = 2849

 Score = 43.5 bits (102), Expect = 6e-05
 Identities = 32/116 (27%), Positives = 60/116 (51%), Gaps = 7/116 (6%)

Query: 128 DSVIFARDKFLKPEGVMYPYKCILHSAPSYSPEIFKFWENIAGEEEKKEEDEI----EEE 183
           D++   +D  +K +    P+     S    S  I  F+  I G+ E  + D I       
Sbjct: 79  DAIRMGKDTSVKIKQRTCPFHKEGSSFEMGSKNITCFYP-IHGKAEIGDLDMIIIKRRRA 137

Query: 184 ENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEE 239
            +  EE++  + N   ++++E++ E+D+ E+++ E EEEEEE  +K   +E+EE+E
Sbjct: 138 RHLAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEE--IKGFDDEDEEDE 191



 Score = 36.9 bits (85), Expect = 0.010
 Identities = 19/51 (37%), Positives = 35/51 (68%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEE 221
           +E++ E+D+ EE++ EEEEE ++ K  ++E+EE+E  E+   EK E +  +
Sbjct: 157 DEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGEDFTYEKSEVDKTD 207



 Score = 36.6 bits (84), Expect = 0.011
 Identities = 28/88 (31%), Positives = 50/88 (56%), Gaps = 9/88 (10%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           +N   +++ ++EDE +++E ++EEE       EEEEEE +  ++++ E E GE+   E+ 
Sbjct: 149 DNFVIDDDDEDEDEDDDDEEDDEEE-------EEEEEEIKGFDDEDEEDEGGEDFTYEKS 201

Query: 226 EVVKQK--KNEEEEEEEEVLIKLEEEEE 251
           EV K    K  E    ++V  K++E  E
Sbjct: 202 EVDKTDCFKFIEAGAGDDVFDKIDEAFE 229


>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein.  This is a family of
           fungal and plant proteins and contains many hypothetical
           proteins. VID27 is a cytoplasmic protein that plays a
           potential role in vacuolar protein degradation.
          Length = 794

 Score = 43.2 bits (102), Expect = 8e-05
 Identities = 20/69 (28%), Positives = 36/69 (52%)

Query: 165 WENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEE 224
           W      E+    D     E E+    +  + +E+EEEEEE+ E++   KE  ++EE EE
Sbjct: 361 WTKAKETEQDYILDAFSALEIEDANTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEE 420

Query: 225 EEVVKQKKN 233
           ++V  + ++
Sbjct: 421 DDVESKYED 429



 Score = 33.6 bits (77), Expect = 0.087
 Identities = 19/61 (31%), Positives = 30/61 (49%), Gaps = 8/61 (13%)

Query: 170 GEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVK 229
            +   + +DE EE+E EEEEE        +E+E   K+  D+ E EE + E + E+    
Sbjct: 382 EDANTERDDEEEEDEEEEEEE--------DEDEGPSKEHSDDEEFEEDDVESKYEDSDGN 433

Query: 230 Q 230
            
Sbjct: 434 S 434



 Score = 33.6 bits (77), Expect = 0.11
 Identities = 21/60 (35%), Positives = 32/60 (53%), Gaps = 7/60 (11%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQK 231
           E+   E + EEEE+EEEEE       EE+E+E   KE  + E+ E ++ E + E+     
Sbjct: 382 EDANTERDDEEEEDEEEEE-------EEDEDEGPSKEHSDDEEFEEDDVESKYEDSDGNS 434



 Score = 32.0 bits (73), Expect = 0.29
 Identities = 22/57 (38%), Positives = 33/57 (57%), Gaps = 3/57 (5%)

Query: 199 EEEEEEEKKEEDEIEKEEGENEEEEEEEV---VKQKKNEEEEEEEEVLIKLEEEEEN 252
              E E+   E + E+EE E EEEEE+E     K+  ++EE EE++V  K E+ + N
Sbjct: 377 SALEIEDANTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVESKYEDSDGN 433



 Score = 31.6 bits (72), Expect = 0.44
 Identities = 15/49 (30%), Positives = 32/49 (65%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEK 214
            N   ++E++E++E EEEE+E+E   K+  + EE EE++ + + ++ + 
Sbjct: 384 ANTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVESKYEDSDG 432



 Score = 30.9 bits (70), Expect = 0.70
 Identities = 20/49 (40%), Positives = 27/49 (55%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEG 217
           A  E   EE+E EEEE EE+E+    K   ++EE EE   E + E  +G
Sbjct: 384 ANTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVESKYEDSDG 432


>gnl|CDD|218538 pfam05285, SDA1, SDA1.  This family consists of several SDA1
           protein homologues. SDA1 is a Saccharomyces cerevisiae
           protein which is involved in the control of the actin
           cytoskeleton. The protein is essential for cell
           viability and is localised in the nucleus.
          Length = 317

 Score = 42.3 bits (100), Expect = 1e-04
 Identities = 23/87 (26%), Positives = 51/87 (58%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           +    E +KKE ++  E +++++EE + +  ++E+ ++E +  + E +KE   ++ E+EE
Sbjct: 77  KWKEEERKKKEAEQGLESDDDDDEEEEWEVEEDEDSDDEGEWIDVESDKEIESSDSEDEE 136

Query: 226 EVVKQKKNEEEEEEEEVLIKLEEEEEN 252
           E  +  K  +E+ +EE+  + EEE   
Sbjct: 137 EKDEAAKKAKEDSDEELSEEDEEEAAE 163



 Score = 40.0 bits (94), Expect = 6e-04
 Identities = 19/57 (33%), Positives = 32/57 (56%)

Query: 170 GEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEE 226
           GE    E D+  E  + E+EE K +  K+ +E+ +E+  E++ E+   E E E E+E
Sbjct: 116 GEWIDVESDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKE 172



 Score = 39.6 bits (93), Expect = 7e-04
 Identities = 24/83 (28%), Positives = 47/83 (56%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           + +  +++  EE+E E EE+E+ ++  +  + E ++E E    EDE EK+E   + +E+ 
Sbjct: 90  QGLESDDDDDEEEEWEVEEDEDSDDEGEWIDVESDKEIESSDSEDEEEKDEAAKKAKEDS 149

Query: 226 EVVKQKKNEEEEEEEEVLIKLEE 248
           +    +++EEE  EEE     +E
Sbjct: 150 DEELSEEDEEEAAEEEEAEAEKE 172



 Score = 37.7 bits (88), Expect = 0.003
 Identities = 22/74 (29%), Positives = 39/74 (52%), Gaps = 5/74 (6%)

Query: 165 WENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEE 224
           WE    E+   E + I+ E ++E E         E+EEE+++  +   E  + E  EE+E
Sbjct: 104 WEVEEDEDSDDEGEWIDVESDKEIESSD-----SEDEEEKDEAAKKAKEDSDEELSEEDE 158

Query: 225 EEVVKQKKNEEEEE 238
           EE  ++++ E E+E
Sbjct: 159 EEAAEEEEAEAEKE 172



 Score = 33.9 bits (78), Expect = 0.056
 Identities = 24/73 (32%), Positives = 41/73 (56%), Gaps = 1/73 (1%)

Query: 170 GEEEKKEEDE-IEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
           GEEE  +    +E  E  +EEE K+K+ ++  E +++  EE+E E EE E+ ++E E + 
Sbjct: 61  GEEEVVDGIPGLELLEKWKEEERKKKEAEQGLESDDDDDEEEEWEVEEDEDSDDEGEWID 120

Query: 229 KQKKNEEEEEEEE 241
            +   E E  + E
Sbjct: 121 VESDKEIESSDSE 133



 Score = 28.1 bits (63), Expect = 4.2
 Identities = 17/52 (32%), Positives = 27/52 (51%), Gaps = 3/52 (5%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEE---EEKKEEDEIEK 214
           E+    E    EDE E++E  ++ +    +   EE+EE   EE++ E E EK
Sbjct: 122 ESDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEK 173



 Score = 26.9 bits (60), Expect = 9.7
 Identities = 17/46 (36%), Positives = 26/46 (56%), Gaps = 2/46 (4%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDE 211
           E    E  KK +++ +EE +EE+EE  +   +EE E E+EK  E  
Sbjct: 135 EEEKDEAAKKAKEDSDEELSEEDEE--EAAEEEEAEAEKEKASELA 178


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 42.1 bits (99), Expect = 1e-04
 Identities = 19/80 (23%), Positives = 41/80 (51%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           + E+ E+    E+  ++E E +    K  ++ E+  K+ +E +K+  E + ++  E   +
Sbjct: 76  QAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAK 135

Query: 231 KKNEEEEEEEEVLIKLEEEE 250
            + E E++ +E   K  EEE
Sbjct: 136 AEAEAEKKAKEEAKKQAEEE 155



 Score = 41.4 bits (97), Expect = 3e-04
 Identities = 20/76 (26%), Positives = 38/76 (50%)

Query: 174 KKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKN 233
           K E+   E E+    E+ +QK+ ++    E+  K+ ++  K+  E +++ EE   KQ   
Sbjct: 72  KLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAE 131

Query: 234 EEEEEEEEVLIKLEEE 249
            + + E E   K +EE
Sbjct: 132 AKAKAEAEAEKKAKEE 147



 Score = 36.4 bits (84), Expect = 0.010
 Identities = 16/84 (19%), Positives = 40/84 (47%), Gaps = 4/84 (4%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQK 231
           ++K    + +E + + E++ ++ + +   E+  +K+ E     E+   + E+  +  ++K
Sbjct: 58  QKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEK 117

Query: 232 KNEEEE----EEEEVLIKLEEEEE 251
           + + EE    +  E   K E E E
Sbjct: 118 QKQAEEAKAKQAAEAKAKAEAEAE 141



 Score = 36.0 bits (83), Expect = 0.012
 Identities = 24/93 (25%), Positives = 46/93 (49%), Gaps = 9/93 (9%)

Query: 166 ENIAGEEEKKEEDE---IEEEENEEEEEVKQKKNKE----EEEEEEEKKEEDEIEKEEGE 218
           +  A E+  K+ ++     EE+ ++ EE K K+  E     E E E+K +E+  ++ E E
Sbjct: 96  QRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEE 155

Query: 219 NEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEE 251
            + +   E   +KK  E +++ E   K + E +
Sbjct: 156 AKAKAAAE--AKKKAAEAKKKAEAEAKAKAEAK 186



 Score = 35.2 bits (81), Expect = 0.024
 Identities = 17/78 (21%), Positives = 38/78 (48%)

Query: 173 EKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKK 232
           E ++    E+   + E+  KQ + K+++ EE + K+  E + +     E++ +E  K++ 
Sbjct: 93  ELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQA 152

Query: 233 NEEEEEEEEVLIKLEEEE 250
            EE + +     K +  E
Sbjct: 153 EEEAKAKAAAEAKKKAAE 170



 Score = 33.3 bits (76), Expect = 0.11
 Identities = 22/83 (26%), Positives = 39/83 (46%), Gaps = 2/83 (2%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKN--KEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
            EE K +   E +   E E  K+ K   K++ EEE + K   E +K+  E +++ E E  
Sbjct: 121 AEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAK 180

Query: 229 KQKKNEEEEEEEEVLIKLEEEEE 251
            + + + + + EE   K E  + 
Sbjct: 181 AKAEAKAKAKAEEAKAKAEAAKA 203



 Score = 32.5 bits (74), Expect = 0.16
 Identities = 12/80 (15%), Positives = 43/80 (53%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
           A   +++++   ++E+  +++  +Q +  E++   E+ ++++  ++   E   ++ E+  
Sbjct: 52  ANRIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAA 111

Query: 229 KQKKNEEEEEEEEVLIKLEE 248
           KQ + ++++ EE    +  E
Sbjct: 112 KQAEEKQKQAEEAKAKQAAE 131



 Score = 29.8 bits (67), Expect = 1.5
 Identities = 18/75 (24%), Positives = 27/75 (36%), Gaps = 5/75 (6%)

Query: 169 AGEEEKKEEDEI-----EEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEE 223
           A EE K +          E + + E E K K   + + + EE K + E  K +   E   
Sbjct: 152 AEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKAKAEAAKAKAAAEAAA 211

Query: 224 EEEVVKQKKNEEEEE 238
           + E         E E
Sbjct: 212 KAEAEAAAAAAAEAE 226



 Score = 28.6 bits (64), Expect = 3.3
 Identities = 10/61 (16%), Positives = 34/61 (55%)

Query: 188 EEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLE 247
            ++  + + +++   ++E++ + ++E++  E E++   E  +QK+ E+    E+   + E
Sbjct: 49  AQQANRIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAE 108

Query: 248 E 248
           +
Sbjct: 109 Q 109



 Score = 28.3 bits (63), Expect = 4.7
 Identities = 10/80 (12%), Positives = 38/80 (47%)

Query: 173 EKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKK 232
                 +    + +++   K+++ ++++ E++ ++ E +   E+   +E E+    ++  
Sbjct: 45  PGAVAQQANRIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAA 104

Query: 233 NEEEEEEEEVLIKLEEEEEN 252
            + E+  ++   K ++ EE 
Sbjct: 105 KQAEQAAKQAEEKQKQAEEA 124



 Score = 27.5 bits (61), Expect = 6.4
 Identities = 21/73 (28%), Positives = 35/73 (47%), Gaps = 1/73 (1%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIE-KEEGENEEEEE 224
           E  A  E + E+   EE + + EEE K K   E +++  E K++ E E K + E + + +
Sbjct: 131 EAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAK 190

Query: 225 EEVVKQKKNEEEE 237
            E  K K    + 
Sbjct: 191 AEEAKAKAEAAKA 203


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 41.7 bits (98), Expect = 2e-04
 Identities = 21/84 (25%), Positives = 43/84 (51%), Gaps = 6/84 (7%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ- 230
           +E+++ +E+++++  E+E +KQ + KE    +E+KK+ +E  K+    +++ EE   K  
Sbjct: 84  KEQQQAEELQQKQAAEQERLKQLE-KERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAA 142

Query: 231 ----KKNEEEEEEEEVLIKLEEEE 250
                K E E +      K    E
Sbjct: 143 AAAKAKAEAEAKRAAAAAKKAAAE 166



 Score = 40.2 bits (94), Expect = 7e-04
 Identities = 18/83 (21%), Positives = 41/83 (49%), Gaps = 2/83 (2%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
              +  +E+ + +E++  EE + KQ   +E  ++ E+++   + +K++ E   ++     
Sbjct: 72  KSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAAL-- 129

Query: 229 KQKKNEEEEEEEEVLIKLEEEEE 251
           KQK+ EE   +     K + E E
Sbjct: 130 KQKQAEEAAAKAAAAAKAKAEAE 152



 Score = 38.6 bits (90), Expect = 0.002
 Identities = 17/89 (19%), Positives = 39/89 (43%), Gaps = 4/89 (4%)

Query: 166 ENIAGEEEKKEEDE----IEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEE 221
           +  A E+E+ ++ E      +E+ ++ EE  ++   ++++ EE   +     K + E E 
Sbjct: 94  QKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEA 153

Query: 222 EEEEEVVKQKKNEEEEEEEEVLIKLEEEE 250
           +      K+   E +++ E    K    E
Sbjct: 154 KRAAAAAKKAAAEAKKKAEAEAAKKAAAE 182



 Score = 36.3 bits (84), Expect = 0.012
 Identities = 22/92 (23%), Positives = 42/92 (45%), Gaps = 6/92 (6%)

Query: 166 ENIAGEEEKKEEDE-----IEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENE 220
           E +A +E+KK+ +E       +++  EE   K     + + E E K+     +K   E +
Sbjct: 109 ERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAK 168

Query: 221 EEEEEEVVKQKKNEEEEE-EEEVLIKLEEEEE 251
           ++ E E  K+   E +++ E E   K   E +
Sbjct: 169 KKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAK 200



 Score = 34.8 bits (80), Expect = 0.038
 Identities = 14/76 (18%), Positives = 41/76 (53%)

Query: 175 KEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNE 234
           ++ +  ++++   +   +Q+K KE+++ EE ++++   ++   + E+E      ++K+ E
Sbjct: 62  EQYNRQQQQQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAE 121

Query: 235 EEEEEEEVLIKLEEEE 250
           E  ++  +  K  EE 
Sbjct: 122 EAAKQAALKQKQAEEA 137



 Score = 34.4 bits (79), Expect = 0.040
 Identities = 17/84 (20%), Positives = 34/84 (40%), Gaps = 3/84 (3%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           +  A E ++K+  E E  +  E+E +  ++ K++ EE  ++    + + EE      +  
Sbjct: 86  QQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEE---AAAKAA 142

Query: 226 EVVKQKKNEEEEEEEEVLIKLEEE 249
              K K   E +       K   E
Sbjct: 143 AAAKAKAEAEAKRAAAAAKKAAAE 166



 Score = 34.4 bits (79), Expect = 0.052
 Identities = 12/61 (19%), Positives = 37/61 (60%)

Query: 192 KQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEE 251
            +++ K++E+++ E+ ++ +  ++E   + E+E    +++K + EE  ++  +K ++ EE
Sbjct: 77  AEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEE 136

Query: 252 N 252
            
Sbjct: 137 A 137



 Score = 30.9 bits (70), Expect = 0.59
 Identities = 14/71 (19%), Positives = 30/71 (42%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           EE   +     + + E E +      K+   E ++K E +  +K   E +++ E E   +
Sbjct: 135 EEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAK 194

Query: 231 KKNEEEEEEEE 241
              E +++ E 
Sbjct: 195 AAAEAKKKAEA 205



 Score = 30.2 bits (68), Expect = 1.0
 Identities = 16/67 (23%), Positives = 32/67 (47%), Gaps = 3/67 (4%)

Query: 186 EEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIK 245
           E+    +Q++   +  EE+ KK+E +  +E  + +  E+E   + K+ E+E    +   K
Sbjct: 62  EQYNRQQQQQKSAKRAEEQRKKKEQQQAEELQQKQAAEQE---RLKQLEKERLAAQEQKK 118

Query: 246 LEEEEEN 252
             EE   
Sbjct: 119 QAEEAAK 125



 Score = 27.5 bits (61), Expect = 8.2
 Identities = 12/61 (19%), Positives = 39/61 (63%), Gaps = 4/61 (6%)

Query: 191 VKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEE 250
           V ++ N+++++++  K+ E++ +K+E +  EE +++   +++  ++ E+E    +L  +E
Sbjct: 60  VVEQYNRQQQQQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKE----RLAAQE 115

Query: 251 E 251
           +
Sbjct: 116 Q 116



 Score = 27.1 bits (60), Expect = 9.9
 Identities = 14/73 (19%), Positives = 28/73 (38%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
           A    KK   E +++   E  +    + K++ E E   K   E +K+     +++     
Sbjct: 156 AAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEA 215

Query: 229 KQKKNEEEEEEEE 241
           K+K   E +    
Sbjct: 216 KKKAAAEAKAAAA 228


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 41.5 bits (98), Expect = 2e-04
 Identities = 21/62 (33%), Positives = 39/62 (62%), Gaps = 3/62 (4%)

Query: 180 IEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEE 239
           +++   EEEE++ +   +E +EE +EKKEE + E+ E +  +   EE   Q+K EE+E +
Sbjct: 261 VDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEE---QRKLEEKERK 317

Query: 240 EE 241
           ++
Sbjct: 318 KQ 319



 Score = 37.6 bits (88), Expect = 0.004
 Identities = 22/62 (35%), Positives = 39/62 (62%), Gaps = 1/62 (1%)

Query: 189 EEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEE 248
            EV +K +K  EEEEE+  +  E E++E E +E++EE+  ++++ +  +   E   KLEE
Sbjct: 255 PEVLRKVDKTREEEEEKILKAAEEERQE-EAQEKKEEKKKEEREAKLAKLSPEEQRKLEE 313

Query: 249 EE 250
           +E
Sbjct: 314 KE 315



 Score = 34.9 bits (81), Expect = 0.025
 Identities = 22/54 (40%), Positives = 31/54 (57%), Gaps = 1/54 (1%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEK-EEGENEEEE 223
           EEE+K     EEE  EE +E K++K KEE E +  K   +E  K EE E +++ 
Sbjct: 267 EEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQA 320



 Score = 30.3 bits (69), Expect = 0.97
 Identities = 18/52 (34%), Positives = 30/52 (57%), Gaps = 4/52 (7%)

Query: 160 EIFKFWENIAGEE--EKKEEDEIEEEENEEEEEV--KQKKNKEEEEEEEEKK 207
           +I K  E    EE  EKKEE + EE E +  +    +Q+K +E+E +++ +K
Sbjct: 271 KILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQARK 322


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
           family consists of several hypothetical bacterial
           proteins of around 200 residues in length. The function
           of this family is unknown.
          Length = 214

 Score = 40.9 bits (96), Expect = 2e-04
 Identities = 24/70 (34%), Positives = 42/70 (60%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQK 231
           E KK +D+   E  E +EE K+  N E++E++ + ++EDE  +EE E E+EE  +  +++
Sbjct: 50  EAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKE 109

Query: 232 KNEEEEEEEE 241
             E+ E   E
Sbjct: 110 TEEKTESNVE 119



 Score = 40.9 bits (96), Expect = 2e-04
 Identities = 24/71 (33%), Positives = 41/71 (57%), Gaps = 1/71 (1%)

Query: 170 GEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVK 229
            ++E  E +E++EEE E      ++   + E+E+EE +EE+E E EE  ++E E+E   K
Sbjct: 55  DDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEE-SSDENEKETEEK 113

Query: 230 QKKNEEEEEEE 240
            + N E+E   
Sbjct: 114 TESNVEKEITN 124



 Score = 40.1 bits (94), Expect = 4e-04
 Identities = 23/75 (30%), Positives = 40/75 (53%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           +    ++   +E    EE  EEE+E    ++KE++ + E++ EE E E EE + E  +E 
Sbjct: 47  DEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDEN 106

Query: 226 EVVKQKKNEEEEEEE 240
           E   ++K E   E+E
Sbjct: 107 EKETEEKTESNVEKE 121



 Score = 39.7 bits (93), Expect = 5e-04
 Identities = 22/76 (28%), Positives = 41/76 (53%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           E  A + + +E  EIEE + EE+E    +  +++ + E+E +E +E  +EE E   +E E
Sbjct: 48  EQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENE 107

Query: 226 EVVKQKKNEEEEEEEE 241
           +  ++K     E+E  
Sbjct: 108 KETEEKTESNVEKEIT 123



 Score = 39.3 bits (92), Expect = 8e-04
 Identities = 28/84 (33%), Positives = 52/84 (61%), Gaps = 3/84 (3%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEE---EEEKKEEDEIEKEEGENEEEEEEEVV 228
            ++   DE E ++++++E  + ++ KEEE+E    E+K+++ + EKE+ E+EEE EEE  
Sbjct: 41  SDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDE 100

Query: 229 KQKKNEEEEEEEEVLIKLEEEEEN 252
           +     E+E EE+    +E+E  N
Sbjct: 101 ESSDENEKETEEKTESNVEKEITN 124



 Score = 39.3 bits (92), Expect = 8e-04
 Identities = 23/78 (29%), Positives = 45/78 (57%)

Query: 164 FWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEE 223
           F+ +   ++   +E E ++ +++E  E+++ K +E+E    E KE+    ++E E  EEE
Sbjct: 35  FFPSSPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEE 94

Query: 224 EEEVVKQKKNEEEEEEEE 241
            EE  ++  +E E+E EE
Sbjct: 95  NEEEDEESSDENEKETEE 112



 Score = 37.8 bits (88), Expect = 0.003
 Identities = 19/48 (39%), Positives = 29/48 (60%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGE 218
           + + ++EDE  EEENEEE+E    +N++E EE+ E   E EI     +
Sbjct: 81  KGDAEKEDEESEEENEEEDEESSDENEKETEEKTESNVEKEITNPSWK 128



 Score = 36.6 bits (85), Expect = 0.006
 Identities = 21/75 (28%), Positives = 44/75 (58%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           E +K ++ E  E E  +EEE +   ++++E++ + +KE++E E+E  E +EE  +E  K+
Sbjct: 50  EAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKE 109

Query: 231 KKNEEEEEEEEVLIK 245
            + + E   E+ +  
Sbjct: 110 TEEKTESNVEKEITN 124



 Score = 36.3 bits (84), Expect = 0.008
 Identities = 25/77 (32%), Positives = 44/77 (57%), Gaps = 4/77 (5%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           +  A E+E K+ D+ E  E EE +E +++    E++E++   E+++ E E    EE EEE
Sbjct: 43  QAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESE----EENEEE 98

Query: 226 EVVKQKKNEEEEEEEEV 242
           +     +NE+E EE+  
Sbjct: 99  DEESSDENEKETEEKTE 115



 Score = 27.8 bits (62), Expect = 4.4
 Identities = 17/67 (25%), Positives = 34/67 (50%), Gaps = 7/67 (10%)

Query: 192 KQKKNKEEEEEEEEKKEEDEIEKEEGENEEEE-------EEEVVKQKKNEEEEEEEEVLI 244
              +   +E+E ++  +++  E EE + EE+E       E++   +K++EE EEE E   
Sbjct: 40  PSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEED 99

Query: 245 KLEEEEE 251
           +   +E 
Sbjct: 100 EESSDEN 106


>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97 family.
            This model family represents the major capsid protein
           component of the heads (capsids) of bacteriophage HK97,
           phi-105, P27, and related phage. This model represents
           one of several analogous families lacking detectable
           sequence similarity. The gene encoding this component is
           typically located in an operon encoding the small and
           large terminase subunits, the portal protein and the
           prohead or maturation protease [Mobile and
           extrachromosomal element functions, Prophage functions].
          Length = 384

 Score = 41.6 bits (98), Expect = 2e-04
 Identities = 19/76 (25%), Positives = 33/76 (43%)

Query: 177 EDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEE 236
             E+ E   E  +E+++    E+  E EE+K E +  KEE +  + E + + +     E 
Sbjct: 1   LKELREALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEA 60

Query: 237 EEEEEVLIKLEEEEEN 252
           +          EEEE 
Sbjct: 61  KPAASGEGGGGEEEEE 76



 Score = 41.2 bits (97), Expect = 3e-04
 Identities = 17/72 (23%), Positives = 34/72 (47%), Gaps = 1/72 (1%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKE-EDEIEKEEGENEEEEEEEVVK 229
           E   +   E+ +   +E+    +++  E +  +EE  + + EI++ E   +E E +    
Sbjct: 6   EALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAAS 65

Query: 230 QKKNEEEEEEEE 241
            +    EEEEEE
Sbjct: 66  GEGGGGEEEEEE 77



 Score = 35.8 bits (83), Expect = 0.018
 Identities = 20/72 (27%), Positives = 34/72 (47%), Gaps = 2/72 (2%)

Query: 171 EEEKKEEDEIEEEENEEEEEVK--QKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
           EEEK E D ++EE ++ + E+   ++   E E +     E    E+EE E + E  E   
Sbjct: 28  EEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAASGEGGGGEEEEEEAKAEAAEFRA 87

Query: 229 KQKKNEEEEEEE 240
             +  ++   EE
Sbjct: 88  YLRGGDDALAEE 99



 Score = 33.9 bits (78), Expect = 0.060
 Identities = 19/73 (26%), Positives = 35/73 (47%), Gaps = 3/73 (4%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           E+EK  E    EEE  E + +K++ +K + E +  ++  DE+E +   + E    E  ++
Sbjct: 20  EDEKLAE---AEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAASGEGGGGEEEEE 76

Query: 231 KKNEEEEEEEEVL 243
           +   E  E    L
Sbjct: 77  EAKAEAAEFRAYL 89



 Score = 32.3 bits (74), Expect = 0.22
 Identities = 17/73 (23%), Positives = 31/73 (42%), Gaps = 2/73 (2%)

Query: 165 WENIAGEEEKKEEDEIEEEENEEEEEVK--QKKNKEEEEEEEEKKEEDEIEKEEGENEEE 222
              +  +E+  E +E + E +  +EE+     +    EE  +E + +     E G  EEE
Sbjct: 15  LRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAASGEGGGGEEE 74

Query: 223 EEEEVVKQKKNEE 235
           EEE   +  +   
Sbjct: 75  EEEAKAEAAEFRA 87


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 40.4 bits (95), Expect = 3e-04
 Identities = 20/83 (24%), Positives = 48/83 (57%), Gaps = 3/83 (3%)

Query: 171 EEEKKEEDEIEEE---ENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEV 227
            E KK++ E+ EE     +E EE ++ K K+++ ++++ K++D+ + ++ +  E+++E+ 
Sbjct: 58  TEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKE 117

Query: 228 VKQKKNEEEEEEEEVLIKLEEEE 250
            + K  +  +   E L  L E +
Sbjct: 118 AEDKLEDLTKSYSETLSTLSELK 140



 Score = 39.3 bits (92), Expect = 5e-04
 Identities = 16/71 (22%), Positives = 45/71 (63%)

Query: 178 DEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEE 237
           D    E  ++++E+ ++  K ++E EE++K + + +K + + +++++++  K+    E++
Sbjct: 54  DAEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKK 113

Query: 238 EEEEVLIKLEE 248
           +E+E   KLE+
Sbjct: 114 DEKEAEDKLED 124



 Score = 34.7 bits (80), Expect = 0.021
 Identities = 13/65 (20%), Positives = 35/65 (53%)

Query: 160 EIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGEN 219
           EI K  +    +++ K + +  +++ +++++ K  K  ++ E+++EK+ ED++E      
Sbjct: 70  EIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSY 129

Query: 220 EEEEE 224
            E   
Sbjct: 130 SETLS 134



 Score = 33.1 bits (76), Expect = 0.075
 Identities = 11/63 (17%), Positives = 37/63 (58%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           EE++K + + ++ + +++++  +K +K++++ E++ ++E E + E+      E    + +
Sbjct: 79  EEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTLSE 138

Query: 231 KKN 233
            K 
Sbjct: 139 LKP 141



 Score = 28.5 bits (64), Expect = 2.4
 Identities = 11/61 (18%), Positives = 30/61 (49%), Gaps = 6/61 (9%)

Query: 165 WENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEE 224
           W+    +++K ++ +      +++++   K  K++E+E E+K E+      E  +   E 
Sbjct: 86  WKKKKSKKKKDKDKD------KKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTLSEL 139

Query: 225 E 225
           +
Sbjct: 140 K 140



 Score = 28.1 bits (63), Expect = 3.7
 Identities = 11/64 (17%), Positives = 36/64 (56%)

Query: 163 KFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEE 222
           K +E     + KK++ + ++++++++++ K+    E+++E+E + + +++ K   E    
Sbjct: 76  KEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLST 135

Query: 223 EEEE 226
             E 
Sbjct: 136 LSEL 139



 Score = 27.3 bits (61), Expect = 5.8
 Identities = 17/59 (28%), Positives = 37/59 (62%), Gaps = 4/59 (6%)

Query: 198 EEEEEEEEKKEE--DEIEKEEGENEEEEE--EEVVKQKKNEEEEEEEEVLIKLEEEEEN 252
            E  E ++KK+E  +EIEK + E EE+++   +  K KK ++++++++   K ++ E+ 
Sbjct: 55  AEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKK 113



 Score = 27.0 bits (60), Expect = 8.4
 Identities = 10/56 (17%), Positives = 34/56 (60%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEE 221
           E I   +++ EE +  + + ++ ++ K K   ++++++++K E+ + ++ E + E+
Sbjct: 69  EEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLED 124


>gnl|CDD|218734 pfam05758, Ycf1, Ycf1.  The chloroplast genomes of most higher
           plants contain two giant open reading frames designated
           ycf1 and ycf2. Although the function of Ycf1 is unknown,
           it is known to be an essential gene.
          Length = 832

 Score = 41.1 bits (97), Expect = 4e-04
 Identities = 19/68 (27%), Positives = 37/68 (54%), Gaps = 1/68 (1%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQK 231
           +E  E +E EEE + E E   + K   ++E+E   +E+  +  EE E+ ++ E+    + 
Sbjct: 227 KETSETEEREEETDVEIETTSETK-GTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLEI 285

Query: 232 KNEEEEEE 239
             E+++EE
Sbjct: 286 LKEKKDEE 293



 Score = 40.8 bits (96), Expect = 5e-04
 Identities = 18/62 (29%), Positives = 34/62 (54%), Gaps = 6/62 (9%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEE------EEEEKKEEDEIEKEEGENEEEEEE 225
           EE++EE ++E E   E +  KQ++    EE      EE+E  ++ E   +    +E+++E
Sbjct: 233 EEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLEILKEKKDE 292

Query: 226 EV 227
           E+
Sbjct: 293 EL 294



 Score = 38.1 bits (89), Expect = 0.004
 Identities = 16/49 (32%), Positives = 23/49 (46%)

Query: 192 KQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEE 240
           K K+  E EE EEE   E E   E    ++E+E    +      EE+E+
Sbjct: 225 KLKETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKED 273



 Score = 36.1 bits (84), Expect = 0.014
 Identities = 16/65 (24%), Positives = 31/65 (47%)

Query: 181 EEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEE 240
           E  E EE EE    + +   E +  K+E++   +E+     EE+E+  K +  ++ E  +
Sbjct: 228 ETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLEILK 287

Query: 241 EVLIK 245
           E   +
Sbjct: 288 EKKDE 292



 Score = 34.2 bits (79), Expect = 0.056
 Identities = 18/63 (28%), Positives = 28/63 (44%), Gaps = 4/63 (6%)

Query: 194 KKNKE----EEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEE 249
           KK KE    EE EEE   E +   + +G  +E+E           EE+E+ +    L++ 
Sbjct: 224 KKLKETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKL 283

Query: 250 EEN 252
           E  
Sbjct: 284 EIL 286



 Score = 31.1 bits (71), Expect = 0.66
 Identities = 17/67 (25%), Positives = 37/67 (55%), Gaps = 4/67 (5%)

Query: 189 EEVKQKKNKEEEEEEE-EKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEE---EVLI 244
           +E  + + +EEE + E E   E +  K+E E   EE+  +  ++K + ++ E+     ++
Sbjct: 227 KETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLEIL 286

Query: 245 KLEEEEE 251
           K +++EE
Sbjct: 287 KEKKDEE 293



 Score = 29.6 bits (67), Expect = 1.9
 Identities = 13/43 (30%), Positives = 28/43 (65%), Gaps = 2/43 (4%)

Query: 160 EIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEE 202
           +I + + N   ++++ +  +  EE+ ++EE  K++KNK EE+E
Sbjct: 638 KIKQIFRNWIEKKKEFKILDYTEEKTKKEE--KKEKNKREEKE 678



 Score = 29.2 bits (66), Expect = 2.8
 Identities = 8/49 (16%), Positives = 26/49 (53%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGEN 219
              + +  + E+E + EE+     + KE+ ++ E+  + + +++++ E 
Sbjct: 245 TTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLEILKEKKDEE 293



 Score = 27.7 bits (62), Expect = 7.5
 Identities = 11/32 (34%), Positives = 23/32 (71%)

Query: 180 IEEEENEEEEEVKQKKNKEEEEEEEEKKEEDE 211
           IE+++  +  +  ++K K+EE++E+ K+EE E
Sbjct: 647 IEKKKEFKILDYTEEKTKKEEKKEKNKREEKE 678



 Score = 27.3 bits (61), Expect = 9.0
 Identities = 12/52 (23%), Positives = 24/52 (46%), Gaps = 5/52 (9%)

Query: 163 KFWENIAGEEEKKEEDEIEEEE-----NEEEEEVKQKKNKEEEEEEEEKKEE 209
           +       +  K+E++   EE+      E+E+  K +   + E  +E+K EE
Sbjct: 242 EIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLEILKEKKDEE 293


>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
          Length = 746

 Score = 40.6 bits (96), Expect = 6e-04
 Identities = 18/73 (24%), Positives = 27/73 (36%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
            G E K     +EEEE  +EE  ++ + + + E EE    E E E +         EE  
Sbjct: 27  LGIEVKSHSSTVEEEEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAAAAPAEEAA 86

Query: 229 KQKKNEEEEEEEE 241
           +     E      
Sbjct: 87  EAAAAAEAAARPA 99



 Score = 28.3 bits (64), Expect = 5.2
 Identities = 15/60 (25%), Positives = 25/60 (41%)

Query: 193 QKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEEN 252
           + K+     EEEE ++E+   + E E + E EE    + + E + E        E  E  
Sbjct: 30  EVKSHSSTVEEEEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAAAAPAEEAAEAA 89


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 40.1 bits (94), Expect = 7e-04
 Identities = 26/62 (41%), Positives = 43/62 (69%)

Query: 184 ENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVL 243
           E E E+EV  ++ +EE+EE++E++E+   ++EE + EEE+EE+  K KK +E   E E+L
Sbjct: 28  EKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELL 87

Query: 244 IK 245
            K
Sbjct: 88  NK 89



 Score = 37.8 bits (88), Expect = 0.004
 Identities = 24/62 (38%), Positives = 41/62 (66%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           E EK+  DE EEEE EE++E ++K   +EEE +EE+++E++ +K +   E   E E++ +
Sbjct: 30  EVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELLNK 89

Query: 231 KK 232
            K
Sbjct: 90  TK 91



 Score = 32.4 bits (74), Expect = 0.24
 Identities = 22/55 (40%), Positives = 35/55 (63%), Gaps = 2/55 (3%)

Query: 198 EEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEEN 252
           E+E E+E   EE+E EKEE + EEE+  +  ++   +EEEE+EE   K ++ +E 
Sbjct: 28  EKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEV--DEEEEKEEKKKKTKKVKET 80



 Score = 31.3 bits (71), Expect = 0.55
 Identities = 17/58 (29%), Positives = 39/58 (67%)

Query: 194 KKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEE 251
           K+ ++E  +EEE++E++E ++EE +  ++EEE   +++K E++++ ++V     E E 
Sbjct: 29  KEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWEL 86



 Score = 30.1 bits (68), Expect = 1.1
 Identities = 24/75 (32%), Positives = 35/75 (46%), Gaps = 3/75 (4%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEE---EEEEEKKEEDEIEKEEGENEEE 222
           E    EEEK  + E E +E EE+EE K+K  K +E   E E   K +    +   +  +E
Sbjct: 45  EEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELLNKTKPIWTRNPKDVTKE 104

Query: 223 EEEEVVKQKKNEEEE 237
           E     K   N+ E+
Sbjct: 105 EYAAFYKSLTNDWED 119



 Score = 29.0 bits (65), Expect = 2.7
 Identities = 15/49 (30%), Positives = 30/49 (61%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEK 214
           E    E++++EE   ++EE  +EEE K++K K+ ++ +E   E + + K
Sbjct: 41  EEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELLNK 89


>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein.  The YqfQ-like protein family
           includes the B. subtilis YqfQ protein, also known as
           VrrA, which is functionally uncharacterized. This family
           of proteins is found in bacteria. Proteins in this
           family are typically between 146 and 237 amino acids in
           length. There are two conserved sequence motifs: QYGP
           and PKLY.
          Length = 155

 Score = 38.6 bits (90), Expect = 9e-04
 Identities = 14/49 (28%), Positives = 28/49 (57%), Gaps = 1/49 (2%)

Query: 175 KEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEE 223
            +E+E  EEE+ +E E +     + E +E++K+E  + + E+ E  + E
Sbjct: 95  DDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEK-EKPKTE 142



 Score = 37.4 bits (87), Expect = 0.002
 Identities = 15/59 (25%), Positives = 31/59 (52%)

Query: 173 EKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQK 231
           E    D+ EEE  EE  +  ++++  E + E ++K++ E+ K + E E+ + E    + 
Sbjct: 90  ELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKP 148



 Score = 37.4 bits (87), Expect = 0.002
 Identities = 14/52 (26%), Positives = 25/52 (48%)

Query: 181 EEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKK 232
           EEEE EEE   + ++    E + E K+++     +    +E+ + E  K K 
Sbjct: 97  EEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKP 148



 Score = 36.3 bits (84), Expect = 0.005
 Identities = 18/56 (32%), Positives = 30/56 (53%)

Query: 161 IFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEE 216
           IF+   +   EEE+ EE+  +E E E+  E K +  ++++ E  + K E E  K E
Sbjct: 87  IFRELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTE 142



 Score = 35.9 bits (83), Expect = 0.006
 Identities = 16/59 (27%), Positives = 30/59 (50%)

Query: 174 KKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKK 232
              +DE EE E E  +E +Q+   E + E +EKK+ +  + +  + + + E +  K  K
Sbjct: 92  SSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSK 150



 Score = 35.9 bits (83), Expect = 0.006
 Identities = 17/59 (28%), Positives = 33/59 (55%), Gaps = 5/59 (8%)

Query: 192 KQKKNKEEEEEEEEKKEEDEIEKEEGEN-----EEEEEEEVVKQKKNEEEEEEEEVLIK 245
           ++  + ++EEEE E++  DE E+E+        +E+++ EV K K  +E+ + E    K
Sbjct: 89  RELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPK 147



 Score = 32.8 bits (75), Expect = 0.078
 Identities = 11/44 (25%), Positives = 22/44 (50%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEE 209
           E    E E+++  E + E  E+++    K   E+E+ + E K+ 
Sbjct: 103 EESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKP 146



 Score = 30.5 bits (69), Expect = 0.42
 Identities = 14/47 (29%), Positives = 27/47 (57%)

Query: 206 KKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEEN 252
              +DE E+ E E+ +E E+E   + K E +E+++  + K + E+E 
Sbjct: 92  SSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEK 138



 Score = 28.6 bits (64), Expect = 2.2
 Identities = 12/45 (26%), Positives = 27/45 (60%)

Query: 207 KEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEE 251
            +E+E  +EE  +E E+E+    + +++E+++ E    K E+E+ 
Sbjct: 95  DDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKP 139


>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein.  Function of MutS2 is
           unknown. It should not be considered a DNA mismatch
           repair protein. It is likely a DNA mismatch binding
           protein of unknown cellular function [DNA metabolism,
           Other].
          Length = 771

 Score = 39.8 bits (93), Expect = 0.001
 Identities = 29/84 (34%), Positives = 47/84 (55%), Gaps = 2/84 (2%)

Query: 171 EEEKKEEDEIEEEENE--EEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
           E+ K    E +EE N   E+    +K+ +++ E  E+  +E E  K+E E E EE +E  
Sbjct: 500 EQAKTFYGEFKEEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERE 559

Query: 229 KQKKNEEEEEEEEVLIKLEEEEEN 252
           + KK E E+E +E L  L++E E+
Sbjct: 560 RNKKLELEKEAQEALKALKKEVES 583



 Score = 35.6 bits (82), Expect = 0.021
 Identities = 17/87 (19%), Positives = 52/87 (59%), Gaps = 5/87 (5%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           E++ +  +++ +E+ + ++E++Q+  + +E E  +K E ++  +E  +  ++E E ++++
Sbjct: 528 EQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEAQEALKALKKEVESIIRE 587

Query: 231 KKNEEEEEEE-----EVLIKLEEEEEN 252
            K ++  + +     E L+KL+E ++ 
Sbjct: 588 LKEKKIHKAKEIKSIEDLVKLKETKQK 614


>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
           chain; Provisional.
          Length = 1033

 Score = 39.8 bits (93), Expect = 0.001
 Identities = 16/72 (22%), Positives = 38/72 (52%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
                EE+++E + +      + ++ +     E+E+EE++++ E   + E  + E+   +
Sbjct: 7   TQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKAEISKREKARLK 66

Query: 226 EVVKQKKNEEEE 237
           E+ KQKK E ++
Sbjct: 67  ELKKQKKQEIQK 78



 Score = 37.9 bits (88), Expect = 0.004
 Identities = 21/78 (26%), Positives = 45/78 (57%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           EEE+      EEE+ EE E V +    + +++E   ++EDE E+++ E E   + E+ K+
Sbjct: 1   EEEQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKAEISKR 60

Query: 231 KKNEEEEEEEEVLIKLEE 248
           +K   +E +++   ++++
Sbjct: 61  EKARLKELKKQKKQEIQK 78


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 39.5 bits (93), Expect = 0.001
 Identities = 26/85 (30%), Positives = 42/85 (49%), Gaps = 5/85 (5%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEE-EKKEEDEIEKEEG----ENEEEE 223
           A EE  K   E+E E  E   E+++ + +  + EE  ++K E   +KEE     E E   
Sbjct: 56  AKEEVHKLRAELERELKERRNELQRLERRLLQREETLDRKMESLDKKEENLEKKEKELSN 115

Query: 224 EEEVVKQKKNEEEEEEEEVLIKLEE 248
           +E+ + +K+ E EE   E   +LE 
Sbjct: 116 KEKNLDEKEEELEELIAEQREELER 140



 Score = 36.1 bits (84), Expect = 0.015
 Identities = 28/80 (35%), Positives = 46/80 (57%), Gaps = 1/80 (1%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           E   K+E+ +E++E E   + K    KEEE EE   ++ +E+E+  G  +EE +E ++++
Sbjct: 97  ESLDKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQREELERISGLTQEEAKEILLEE 156

Query: 231 KKNEEEEEEEEVLIKLEEEE 250
            + EE   E   LIK  EEE
Sbjct: 157 VE-EEARHEAAKLIKEIEEE 175



 Score = 33.4 bits (77), Expect = 0.10
 Identities = 31/99 (31%), Positives = 53/99 (53%), Gaps = 18/99 (18%)

Query: 171 EEEKKEEDEIEEE-ENEEEEEVKQKKNKEEEEEEEEKKEEDEIE-----KEE-------- 216
           EE KKE + +++E   E +EEV + + + E E +E + E   +E     +EE        
Sbjct: 39  EEAKKEAETLKKEALLEAKEEVHKLRAELERELKERRNELQRLERRLLQREETLDRKMES 98

Query: 217 ----GENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEE 251
                EN E++E+E+  ++KN +E+EEE   +  E+ EE
Sbjct: 99  LDKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQREE 137



 Score = 31.8 bits (73), Expect = 0.35
 Identities = 25/85 (29%), Positives = 42/85 (49%), Gaps = 6/85 (7%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
              ++    I EE  +E E +K++   E +EE  + + E E E +E  NE +  E  + Q
Sbjct: 28  GSAEELAKRIIEEAKKEAETLKKEALLEAKEEVHKLRAELERELKERRNELQRLERRLLQ 87

Query: 231 K------KNEEEEEEEEVLIKLEEE 249
           +      K E  +++EE L K E+E
Sbjct: 88  REETLDRKMESLDKKEENLEKKEKE 112


>gnl|CDD|224188 COG1269, NtpI, Archaeal/vacuolar-type H+-ATPase subunit I [Energy
           production and conversion].
          Length = 660

 Score = 39.7 bits (93), Expect = 0.001
 Identities = 19/77 (24%), Positives = 33/77 (42%), Gaps = 3/77 (3%)

Query: 179 EIEEEENEEEEEVKQKKNKEEEEEEEEKKEED---EIEKEEGENEEEEEEEVVKQKKNEE 235
            +E       EEV++ + + +  EE  K  E    E+E+   + EE   E   + +  E+
Sbjct: 75  GLEGRLFILPEEVEKLEAELKSLEEVIKPAEKFSSEVEELTRKLEERLSELDEELEDLED 134

Query: 236 EEEEEEVLIKLEEEEEN 252
             EE E L  L+ +   
Sbjct: 135 LLEELEPLAYLDFDLSL 151



 Score = 27.3 bits (61), Expect = 9.1
 Identities = 19/92 (20%), Positives = 32/92 (34%), Gaps = 5/92 (5%)

Query: 163 KFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKE---EEEEEEEKKEEDEIEKEEGEN 219
              E   G +E ++    E  +      + +  +     +  E       +E+EK   E 
Sbjct: 35  DLEEGEKGLKELEKLKVAEVAQISLSSLLSEVLDYLRSVKGLEGRLFILPEEVEKL--EA 92

Query: 220 EEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEE 251
           E +  EEV+K  +    E EE      E   E
Sbjct: 93  ELKSLEEVIKPAEKFSSEVEELTRKLEERLSE 124


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 39.6 bits (92), Expect = 0.001
 Identities = 18/74 (24%), Positives = 39/74 (52%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
             + E+ EE++ EEE    ++  K KK K ++   ++   +   + ++ + + E+   +V
Sbjct: 317 DEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSGDDSDDSDIDGEDSVSLV 376

Query: 229 KQKKNEEEEEEEEV 242
             KK +E ++EE V
Sbjct: 377 TAKKQKEPKKEEPV 390



 Score = 37.2 bits (86), Expect = 0.007
 Identities = 21/64 (32%), Positives = 34/64 (53%), Gaps = 2/64 (3%)

Query: 170 GEEEKKEEDEIEEEE--NEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEV 227
           G++E +EED I +      + EE + K + E   + E +++ED  E EE +NEEE     
Sbjct: 277 GDDEGREEDYISDSSASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSK 336

Query: 228 VKQK 231
             +K
Sbjct: 337 KGKK 340



 Score = 33.8 bits (77), Expect = 0.073
 Identities = 20/90 (22%), Positives = 47/90 (52%), Gaps = 6/90 (6%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKK-----EEDEIEKEEGENE 220
           +   G+E  K  ++  +EE  ++++ K  KNK++ +++++ K     + DE + ++G++E
Sbjct: 222 DEDDGDESDKGGED-GDEEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDADEYDSDDGDDE 280

Query: 221 EEEEEEVVKQKKNEEEEEEEEVLIKLEEEE 250
             EE+ +     +  + EE E  +  E   
Sbjct: 281 GREEDYISDSSASGNDPEEREDKLSPEIPA 310



 Score = 31.9 bits (72), Expect = 0.30
 Identities = 21/78 (26%), Positives = 37/78 (47%), Gaps = 5/78 (6%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIE--KEEGENEEEEEEEVV 228
           ++  + + +  ++E  EE+ +        + EE E K   EI    E  ++E+ EE E  
Sbjct: 267 DDADEYDSDDGDDEGREEDYISDSSASGNDPEEREDKLSPEIPAKPEIEQDEDSEESE-- 324

Query: 229 KQKKNEEEEEEEEVLIKL 246
            ++KNEEE    +   KL
Sbjct: 325 -EEKNEEEGGLSKKGKKL 341



 Score = 31.5 bits (71), Expect = 0.44
 Identities = 15/80 (18%), Positives = 34/80 (42%), Gaps = 11/80 (13%)

Query: 167 NIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEE-----EEKKEEDEIEKEEGENEE 221
            +  +++ K   + + +E + +         +E  EE           D  E+E+  + E
Sbjct: 254 KLDDDKKGKRGGDDDADEYDSD------DGDDEGREEDYISDSSASGNDPEEREDKLSPE 307

Query: 222 EEEEEVVKQKKNEEEEEEEE 241
              +  ++Q ++ EE EEE+
Sbjct: 308 IPAKPEIEQDEDSEESEEEK 327



 Score = 28.4 bits (63), Expect = 4.8
 Identities = 15/60 (25%), Positives = 30/60 (50%), Gaps = 1/60 (1%)

Query: 175 KEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIE-KEEGENEEEEEEEVVKQKKN 233
           K       +E+EE E  K    + ++ + ++ + +DE +  E  +  E+ +EE  K+KK 
Sbjct: 187 KNGPAAFGDEDEETEGEKGGGGRGKDLKIKDLEGDDEDDGDESDKGGEDGDEEKSKKKKK 246


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 39.6 bits (92), Expect = 0.001
 Identities = 24/79 (30%), Positives = 46/79 (58%), Gaps = 5/79 (6%)

Query: 169  AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEK---KEEDEIEKEEG--ENEEEE 223
            A +E K+E D  ++E  ++E+ +++    +E+ ++++     E+DE   E+G  EN +E 
Sbjct: 4015 AADENKEEADAEKDEPMQDEDPLEENNTLDEDIQQDDFSDLAEDDEKMNEDGFEENVQEN 4074

Query: 224  EEEVVKQKKNEEEEEEEEV 242
            EE      K++EE E+ EV
Sbjct: 4075 EESTEDGVKSDEELEQGEV 4093



 Score = 39.2 bits (91), Expect = 0.002
 Identities = 19/99 (19%), Positives = 51/99 (51%)

Query: 153  SAPSYSPEIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEI 212
            S P    E     E++  +E++ +  +  + E+ + E   + K + + E++E  ++ED +
Sbjct: 3978 SQPPPENEDLDLPEDLKLDEKEGDVSKDSDLEDMDMEAADENKEEADAEKDEPMQDEDPL 4037

Query: 213  EKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEE 251
            E+    +E+ ++++     +++E+  E+     ++E EE
Sbjct: 4038 EENNTLDEDIQQDDFSDLAEDDEKMNEDGFEENVQENEE 4076



 Score = 38.4 bits (89), Expect = 0.003
 Identities = 17/67 (25%), Positives = 30/67 (44%)

Query: 167  NIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEE 226
            ++A ++EK  ED  EE   E EE  +     +EE E+ E  E+  I+     + +     
Sbjct: 4054 DLAEDDEKMNEDGFEENVQENEESTEDGVKSDEELEQGEVPEDQAIDNHPKMDAKSTFAS 4113

Query: 227  VVKQKKN 233
                ++N
Sbjct: 4114 AEADEEN 4120



 Score = 37.7 bits (87), Expect = 0.006
 Identities = 23/79 (29%), Positives = 41/79 (51%), Gaps = 2/79 (2%)

Query: 171  EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
            +E  ++ED +EE    +E+   Q+ +  +  E++EK  ED  E+   ENEE  E+ V   
Sbjct: 4028 DEPMQDEDPLEENNTLDEDI--QQDDFSDLAEDDEKMNEDGFEENVQENEESTEDGVKSD 4085

Query: 231  KKNEEEEEEEEVLIKLEEE 249
            ++ E+ E  E+  I    +
Sbjct: 4086 EELEQGEVPEDQAIDNHPK 4104



 Score = 36.9 bits (85), Expect = 0.010
 Identities = 26/79 (32%), Positives = 43/79 (54%), Gaps = 11/79 (13%)

Query: 178  DEIEEEENEEEEEVKQKKNKEEEE---EEEEKKEE--DEIEKEEGENEEEEEEEVVKQKK 232
            DE +EE + E++E  Q ++  EE    +E+ ++++  D  E +E  NE+  EE V     
Sbjct: 4017 DENKEEADAEKDEPMQDEDPLEENNTLDEDIQQDDFSDLAEDDEKMNEDGFEENV----- 4071

Query: 233  NEEEEEEEEVLIKLEEEEE 251
             +E EE  E  +K +EE E
Sbjct: 4072 -QENEESTEDGVKSDEELE 4089



 Score = 36.1 bits (83), Expect = 0.018
 Identities = 18/78 (23%), Positives = 40/78 (51%), Gaps = 8/78 (10%)

Query: 171  EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKE----EDEIEKEEGENEE----E 222
             E+    +E +    E++ +  + K+++E+E+EEE  +    +DEI+ +  EN      E
Sbjct: 3924 NEQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSDDVGIDDEIQPDIQENNSQPPPE 3983

Query: 223  EEEEVVKQKKNEEEEEEE 240
             E+  + +    +E+E +
Sbjct: 3984 NEDLDLPEDLKLDEKEGD 4001



 Score = 36.1 bits (83), Expect = 0.019
 Identities = 19/89 (21%), Positives = 41/89 (46%), Gaps = 13/89 (14%)

Query: 166  ENIAGEEEKKEEDEIEEEENEEE-------------EEVKQKKNKEEEEEEEEKKEEDEI 212
            ++    E +  + E E ++ +EE             E++  + N+E+  E E+K  E   
Sbjct: 3869 KDSVVSENENSDSEEENQDLDEEVNDIPEDLSNSLNEKLWDEPNEEDLLETEQKSNEQSA 3928

Query: 213  EKEEGENEEEEEEEVVKQKKNEEEEEEEE 241
               E +   +E++    + K+ +E+E+EE
Sbjct: 3929 ANNESDLVSKEDDNKALEDKDRQEKEDEE 3957



 Score = 35.4 bits (81), Expect = 0.035
 Identities = 19/91 (20%), Positives = 38/91 (41%), Gaps = 7/91 (7%)

Query: 166  ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEG----ENEE 221
            +    EE    +   E+ + ++  ++ +   K  E+  EE  +E+E   E+G    E  E
Sbjct: 4033 DEDPLEENNTLD---EDIQQDDFSDLAEDDEKMNEDGFEENVQENEESTEDGVKSDEELE 4089

Query: 222  EEEEEVVKQKKNEEEEEEEEVLIKLEEEEEN 252
            + E    +   N  + + +      E +EEN
Sbjct: 4090 QGEVPEDQAIDNHPKMDAKSTFASAEADEEN 4120



 Score = 32.7 bits (74), Expect = 0.21
 Identities = 21/79 (26%), Positives = 41/79 (51%), Gaps = 9/79 (11%)

Query: 177  EDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEE 236
             +++ +E NEE+    ++K+ E+        E D + KE+     E+++    Q+K +EE
Sbjct: 3904 NEKLWDEPNEEDLLETEQKSNEQSAANN---ESDLVSKEDDNKALEDKDR---QEKEDEE 3957

Query: 237  EEEEEVLIKLE---EEEEN 252
            E  ++V I  E   + +EN
Sbjct: 3958 EMSDDVGIDDEIQPDIQEN 3976



 Score = 32.3 bits (73), Expect = 0.33
 Identities = 27/98 (27%), Positives = 41/98 (41%), Gaps = 10/98 (10%)

Query: 164  FWENIAGEEEKKEEDEIEEEENEEEEEVK-------QKKNKEEEEEEEEKKEEDEIEKEE 216
            F EN+   EE  E+    +EE E+ E  +        K + +      E  EE+  +   
Sbjct: 4067 FEENVQENEESTEDGVKSDEELEQGEVPEDQAIDNHPKMDAKSTFASAEADEENTDKGIV 4126

Query: 217  GENEEEEEEEVVK---QKKNEEEEEEEEVLIKLEEEEE 251
            GENEE  EE+ V+       E E+ +E+     E   E
Sbjct: 4127 GENEELGEEDGVRGNGTADGEFEQVQEDTSTPKEAMSE 4164



 Score = 31.9 bits (72), Expect = 0.41
 Identities = 16/74 (21%), Positives = 33/74 (44%), Gaps = 6/74 (8%)

Query: 170  GEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVK 229
            G   +   + + E+++ EE         EE+   +   +E E  + E +     ++ VV 
Sbjct: 3821 GVGAEDITNTLNEDDDLEELA------NEEDTANQSDLDESEARELESDMNGVTKDSVVS 3874

Query: 230  QKKNEEEEEEEEVL 243
            + +N + EEE + L
Sbjct: 3875 ENENSDSEEENQDL 3888



 Score = 31.1 bits (70), Expect = 0.70
 Identities = 13/70 (18%), Positives = 32/70 (45%), Gaps = 3/70 (4%)

Query: 186  EEEEEVKQKKNKEEEEEEEEKKEEDEIEKE---EGENEEEEEEEVVKQKKNEEEEEEEEV 242
             E++++++  N+E+   + +  E +  E E    G  ++    E       EE ++ +E 
Sbjct: 3832 NEDDDLEELANEEDTANQSDLDESEARELESDMNGVTKDSVVSENENSDSEEENQDLDEE 3891

Query: 243  LIKLEEEEEN 252
            +  + E+  N
Sbjct: 3892 VNDIPEDLSN 3901



 Score = 30.7 bits (69), Expect = 0.84
 Identities = 29/124 (23%), Positives = 50/124 (40%), Gaps = 37/124 (29%)

Query: 166  ENIAGEEEKKEEDEIEEEENEEEEE----------VKQKKNKEEEEEEE----------- 204
            E +A EE+   + +++E E  E E           V + +N + EEE +           
Sbjct: 3838 EELANEEDTANQSDLDESEARELESDMNGVTKDSVVSENENSDSEEENQDLDEEVNDIPE 3897

Query: 205  ------------EKKEEDEIEKEEGENEE----EEEEEVVKQKKNEEEEEEEEVLIKLEE 248
                        E  EED +E E+  NE+     E + V K+  N+  E+++    + EE
Sbjct: 3898 DLSNSLNEKLWDEPNEEDLLETEQKSNEQSAANNESDLVSKEDDNKALEDKDRQEKEDEE 3957

Query: 249  EEEN 252
            E  +
Sbjct: 3958 EMSD 3961



 Score = 30.4 bits (68), Expect = 1.1
 Identities = 19/87 (21%), Positives = 45/87 (51%), Gaps = 2/87 (2%)

Query: 166  ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
            +N A E++ ++E E EEE +++     + +   +E   +   E ++++  E    +E+E 
Sbjct: 3941 DNKALEDKDRQEKEDEEEMSDDVGIDDEIQPDIQENNSQPPPENEDLDLPEDLKLDEKEG 4000

Query: 226  EVVKQKKNEEEEEEEEVLIKLEEEEEN 252
            +V   K ++ E+ + E   + +EE + 
Sbjct: 4001 DV--SKDSDLEDMDMEAADENKEEADA 4025



 Score = 30.0 bits (67), Expect = 1.6
 Identities = 18/70 (25%), Positives = 40/70 (57%), Gaps = 4/70 (5%)

Query: 185  NEEEEEVKQKKNKEEEEEE--EEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEV 242
            +E+E +V +  + E+ + E  +E KEE + EK+E   +E+  EE      +E+ ++++  
Sbjct: 3996 DEKEGDVSKDSDLEDMDMEAADENKEEADAEKDEPMQDEDPLEE--NNTLDEDIQQDDFS 4053

Query: 243  LIKLEEEEEN 252
             +  ++E+ N
Sbjct: 4054 DLAEDDEKMN 4063



 Score = 29.6 bits (66), Expect = 2.4
 Identities = 27/120 (22%), Positives = 50/120 (41%), Gaps = 16/120 (13%)

Query: 149  CILHSAPSYSPEIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEE-------- 200
            C  H   +     F+    +          E  EEE+E+++E  +++   ++        
Sbjct: 2897 CKQHGRDTELAGAFEDGTGMKYGTGNINVSEQIEEEDEKDDEYGEQEKVSDKNDYLKDFI 2956

Query: 201  --EEEEEKKEEDEIEKEEGENE------EEEEEEVVKQKKNEEEEEEEEVLIKLEEEEEN 252
              +++     E E E+E+ E E       + EEE    +KNE+E E E+    L E+E+ 
Sbjct: 2957 DYDDDGSISTESEPEEEKRECESGFDGWNDYEEEEDDDEKNEKECELEQNAEILIEDEKI 3016


>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
           envelope biogenesis, outer membrane].
          Length = 387

 Score = 39.2 bits (91), Expect = 0.001
 Identities = 20/83 (24%), Positives = 45/83 (54%), Gaps = 2/83 (2%)

Query: 171 EEEKKEEDEIEEEENEE--EEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
           + +KKEE   EE + ++  E+E  ++  KE  + +E++K+ +E EK+    ++++EE+  
Sbjct: 80  QRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQAR 139

Query: 229 KQKKNEEEEEEEEVLIKLEEEEE 251
           K    ++++ E        E  +
Sbjct: 140 KAAAEQKKKAEAAKAKAAAEAAK 162



 Score = 37.6 bits (87), Expect = 0.004
 Identities = 21/81 (25%), Positives = 46/81 (56%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
              KK E + +++E +  EE+K K+  E+E  ++ +KE  + ++++ + EE E++  ++Q
Sbjct: 72  SSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQ 131

Query: 231 KKNEEEEEEEEVLIKLEEEEE 251
           K+ EE+  +     K + E  
Sbjct: 132 KQQEEQARKAAAEQKKKAEAA 152



 Score = 36.9 bits (85), Expect = 0.008
 Identities = 19/92 (20%), Positives = 43/92 (46%), Gaps = 2/92 (2%)

Query: 159 PEIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGE 218
               +     A E+E+ ++  +E+E  + +E+ KQ +  E++ + E+K++E++  K   E
Sbjct: 87  QVAEELKPKQAAEQERLKQ--LEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAE 144

Query: 219 NEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEE 250
            +++ E    K      + +      K  EE 
Sbjct: 145 QKKKAEAAKAKAAAEAAKLKAAAEAKKKAEEA 176



 Score = 33.4 bits (76), Expect = 0.085
 Identities = 20/86 (23%), Positives = 40/86 (46%), Gaps = 5/86 (5%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEE-----EEEKKEEDEIEKEEGENEEEEEE 225
           E+E+ +  E +++  E E++ + ++ ++EE+      E++KK E    K   E  + +  
Sbjct: 107 EKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKLKAA 166

Query: 226 EVVKQKKNEEEEEEEEVLIKLEEEEE 251
              K+K  E  +  EE   K E    
Sbjct: 167 AEAKKKAEEAAKAAEEAKAKAEAAAA 192



 Score = 33.4 bits (76), Expect = 0.092
 Identities = 18/87 (20%), Positives = 37/87 (42%), Gaps = 5/87 (5%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEE-----EEEEKKEEDEIEKEEGENEEEE 223
           A E EK+ + E +++E +  +   ++K K E        E  K +     K++ E   + 
Sbjct: 120 AEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAKA 179

Query: 224 EEEVVKQKKNEEEEEEEEVLIKLEEEE 250
            EE   + +    +++ E   K   E+
Sbjct: 180 AEEAKAKAEAAAAKKKAEAEAKAAAEK 206



 Score = 31.8 bits (72), Expect = 0.32
 Identities = 20/91 (21%), Positives = 43/91 (47%), Gaps = 5/91 (5%)

Query: 166 ENIAGEEEKKEEDEIE-----EEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENE 220
           E +  +E++K+ +E E     E++ +EE+  K    ++++ E  + K   E  K +   E
Sbjct: 109 ERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKLKAAAE 168

Query: 221 EEEEEEVVKQKKNEEEEEEEEVLIKLEEEEE 251
            +++ E   +   E + + E    K + E E
Sbjct: 169 AKKKAEEAAKAAEEAKAKAEAAAAKKKAEAE 199



 Score = 31.1 bits (70), Expect = 0.59
 Identities = 17/72 (23%), Positives = 32/72 (44%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
           A E  K +     +++ EE  +  ++   + E    +KK E E +    + + E E +  
Sbjct: 157 AAEAAKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAK 216

Query: 229 KQKKNEEEEEEE 240
            +KK E   EE+
Sbjct: 217 AEKKAEAAAEEK 228



 Score = 30.7 bits (69), Expect = 0.64
 Identities = 18/80 (22%), Positives = 41/80 (51%), Gaps = 3/80 (3%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQK 231
           ++       +    + E++ K+K+ +  EE + ++  E E  K+    E+E  +   +QK
Sbjct: 62  QQYGRIQSQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQL---EKERLKAQEQQK 118

Query: 232 KNEEEEEEEEVLIKLEEEEE 251
           + EE E++ ++  K +EE+ 
Sbjct: 119 QAEEAEKQAQLEQKQQEEQA 138



 Score = 30.7 bits (69), Expect = 0.70
 Identities = 19/85 (22%), Positives = 37/85 (43%), Gaps = 6/85 (7%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKKNKEE--EEEEEEKKEEDEIEKEEGENEEEEEEE 226
           A E++KK E      + +   E  + K   E  ++ EE  K  +E + +      +++ E
Sbjct: 142 AAEQKKKAE----AAKAKAAAEAAKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAE 197

Query: 227 VVKQKKNEEEEEEEEVLIKLEEEEE 251
              +   E+ + E E   K E++ E
Sbjct: 198 AEAKAAAEKAKAEAEAKAKAEKKAE 222



 Score = 29.5 bits (66), Expect = 1.9
 Identities = 15/78 (19%), Positives = 46/78 (58%), Gaps = 4/78 (5%)

Query: 168 IAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEE----EEEKKEEDEIEKEEGENEEEE 223
           +  +  + +  +   ++ E++ + K+++  EE +     E+E+ ++ E E+ + + ++++
Sbjct: 60  VVQQYGRIQSQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQ 119

Query: 224 EEEVVKQKKNEEEEEEEE 241
            EE  KQ + E++++EE+
Sbjct: 120 AEEAEKQAQLEQKQQEEQ 137



 Score = 28.4 bits (63), Expect = 3.9
 Identities = 22/79 (27%), Positives = 33/79 (41%), Gaps = 6/79 (7%)

Query: 169 AGEEEKKEEDEI---EEEENEEEEEVKQKKNKEEE---EEEEEKKEEDEIEKEEGENEEE 222
           A  E KK+ +E     EE   + E    KK  E E     E+ K E +   K E + E  
Sbjct: 165 AAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKAEAA 224

Query: 223 EEEEVVKQKKNEEEEEEEE 241
            EE+   +KK    + + +
Sbjct: 225 AEEKAAAEKKKAAAKAKAD 243



 Score = 27.6 bits (61), Expect = 6.2
 Identities = 17/75 (22%), Positives = 31/75 (41%), Gaps = 2/75 (2%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEE--EKKEEDEIEKEEGENEEEEEEE 226
           A E + K E    +++ E E +   +K K E E +   EKK E   E++    +++   +
Sbjct: 180 AEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKAEAAAEEKAAAEKKKAAAK 239

Query: 227 VVKQKKNEEEEEEEE 241
               K     +  E 
Sbjct: 240 AKADKAAAAAKAAER 254


>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
           (nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
           phosphoribosyltransferase) [Coenzyme metabolism].
          Length = 620

 Score = 39.4 bits (92), Expect = 0.001
 Identities = 26/74 (35%), Positives = 40/74 (54%), Gaps = 6/74 (8%)

Query: 173 EKKEEDEIEEEENEEEEEVKQKKNKE------EEEEEEEKKEEDEIEKEEGENEEEEEEE 226
           E+  +D IEE+ +EE+ +  Q  N E      EE E  ++ EEDE E  +GE EE E + 
Sbjct: 220 EETGDDGIEEDADEEDGDDDQPDNNEDSEAGREESEGSDESEEDEAEATDGEGEEGEMDA 279

Query: 227 VVKQKKNEEEEEEE 240
               + +E +E +E
Sbjct: 280 AEASEDSESDESDE 293



 Score = 31.7 bits (72), Expect = 0.40
 Identities = 15/54 (27%), Positives = 26/54 (48%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEE 224
            E  +EE E  +E  E+E E    + +E E +  E  E+ E ++ + + E   E
Sbjct: 247 SEAGREESEGSDESEEDEAEATDGEGEEGEMDAAEASEDSESDESDEDTETPGE 300



 Score = 28.7 bits (64), Expect = 4.0
 Identities = 14/54 (25%), Positives = 25/54 (46%), Gaps = 1/54 (1%)

Query: 189 EEVKQKKNKEEEEEEE-EKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEE 241
           EE      +E+ +EE+ +  + D  E  E   EE E  +  ++ + E  + E E
Sbjct: 220 EETGDDGIEEDADEEDGDDDQPDNNEDSEAGREESEGSDESEEDEAEATDGEGE 273


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
           the C subunit of DNA polymerase delta. It carries the
           essential residues for binding to the Pol1 subunit of
           polymerase alpha, from residues 293-332, which are
           characterized by the motif D--G--VT, referred to as the
           DPIM motif. The first 160 residues of the protein form
           the minimal domain for binding to the B subunit, Cdc1,
           of polymerase delta, the final 10 C-terminal residues,
           362-372, being the DNA sliding clamp, PCNA, binding
           motif.
          Length = 427

 Score = 39.0 bits (91), Expect = 0.002
 Identities = 26/72 (36%), Positives = 41/72 (56%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           E+E K   E  + E E EE+ K+K+ + ++  E+E ++E+     E   EEEE EE    
Sbjct: 271 EDEPKPSGERSDSEEETEEKEKEKRKRLKKMMEDEDEDEEMEIVPESPVEEEESEEPEPP 330

Query: 231 KKNEEEEEEEEV 242
              ++EEE+EEV
Sbjct: 331 PLPKKEEEKEEV 342



 Score = 38.7 bits (90), Expect = 0.002
 Identities = 29/84 (34%), Positives = 47/84 (55%), Gaps = 3/84 (3%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKK--NKEEEEEEEEKKEEDEIEK-EEGENEEEEEEEV 227
            E++  E    +E+ +E+E     +  + EEE EE+EK++   ++K  E E+E+EE E V
Sbjct: 255 LEDESAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKRKRLKKMMEDEDEDEEMEIV 314

Query: 228 VKQKKNEEEEEEEEVLIKLEEEEE 251
            +    EEE EE E     ++EEE
Sbjct: 315 PESPVEEEESEEPEPPPLPKKEEE 338



 Score = 37.9 bits (88), Expect = 0.003
 Identities = 29/91 (31%), Positives = 46/91 (50%), Gaps = 13/91 (14%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKK-------------NKEEEEEEEEKKEEDEIEKEEG 217
            +EKKE+ E  E   +EE E +  K               +E+E+E+E K   E    E 
Sbjct: 226 TKEKKEKKEASESTVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEE 285

Query: 218 ENEEEEEEEVVKQKKNEEEEEEEEVLIKLEE 248
           E EE+E+E+  + KK  E+E+E+E +  + E
Sbjct: 286 ETEEKEKEKRKRLKKMMEDEDEDEEMEIVPE 316



 Score = 31.0 bits (70), Expect = 0.67
 Identities = 22/86 (25%), Positives = 39/86 (45%), Gaps = 3/86 (3%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           E    E+EK++   +++   +E+E+ + +   E   EEEE +E    E      +EEE+E
Sbjct: 284 EEETEEKEKEKRKRLKKMMEDEDEDEEMEIVPESPVEEEESEEP---EPPPLPKKEEEKE 340

Query: 226 EVVKQKKNEEEEEEEEVLIKLEEEEE 251
           EV              V+ K   ++E
Sbjct: 341 EVTVSPDGGRRRGRRRVMKKKTFKDE 366



 Score = 27.1 bits (60), Expect = 8.9
 Identities = 16/52 (30%), Positives = 31/52 (59%)

Query: 192 KQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVL 243
           K+ K K+E++E  E   ++E E+E G+ +   E+E  +    +E+E+E+E  
Sbjct: 224 KKTKEKKEKKEASESTVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPK 275


>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413).  This is
           a family of proteins conserved in fungi. The function is
           not known.
          Length = 436

 Score = 39.0 bits (91), Expect = 0.002
 Identities = 16/86 (18%), Positives = 31/86 (36%)

Query: 160 EIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGEN 219
           +I +F + +   E+ K   + +E          +K +K  E      +E+    ++  E+
Sbjct: 33  DILEFLDELEQSEKAKPPKKPKEASRPGTPRNPKKSSKPTESSAASSEEKPAKPRKSAES 92

Query: 220 EEEEEEEVVKQKKNEEEEEEEEVLIK 245
                 +        EEEEE E    
Sbjct: 93  TRSSHPKSKAPSTESEEEEEPEETPD 118



 Score = 29.4 bits (66), Expect = 1.8
 Identities = 17/88 (19%), Positives = 30/88 (34%), Gaps = 15/88 (17%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEE---------------EEEKKEEDEIE 213
           + +  + +ED +E  +  E+ E  +   K +E                 E      +E  
Sbjct: 24  SKDSTEDDEDILEFLDELEQSEKAKPPKKPKEASRPGTPRNPKKSSKPTESSAASSEEKP 83

Query: 214 KEEGENEEEEEEEVVKQKKNEEEEEEEE 241
            +  ++ E       K K    E EEEE
Sbjct: 84  AKPRKSAESTRSSHPKSKAPSTESEEEE 111


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 38.8 bits (91), Expect = 0.002
 Identities = 10/69 (14%), Positives = 42/69 (60%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQK 231
           +   + D+ ++++++++ +     + +++E+++E  ++D+++ E+ E +E +E E +   
Sbjct: 125 DVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDD 184

Query: 232 KNEEEEEEE 240
            +   +E++
Sbjct: 185 DDFVWDEDD 193



 Score = 37.7 bits (88), Expect = 0.005
 Identities = 13/56 (23%), Positives = 39/56 (69%), Gaps = 1/56 (1%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEE 226
           +++  ++D+I++++++E+++     + + ++E+EEKKE  E+EK   +++   +E+
Sbjct: 138 DDDDLDDDDIDDDDDDEDDDEDDDDD-DVDDEDEEKKEAKELEKLSDDDDFVWDED 192



 Score = 37.7 bits (88), Expect = 0.005
 Identities = 13/81 (16%), Positives = 47/81 (58%), Gaps = 3/81 (3%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           +  KK E +   +++++   VK      + +++++  ++D+++ ++ ++++++E++    
Sbjct: 101 DSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDD 160

Query: 231 KKNEEEEEEEEVLIKLEEEEE 251
             ++ ++E+EE   K E +E 
Sbjct: 161 DDDDVDDEDEE---KKEAKEL 178



 Score = 35.7 bits (83), Expect = 0.018
 Identities = 8/70 (11%), Positives = 46/70 (65%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQK 231
           ++    ++ ++++++++++     + ++++++E+  E+D+ +  + E+EE++E + +++ 
Sbjct: 122 KDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKL 181

Query: 232 KNEEEEEEEE 241
            ++++   +E
Sbjct: 182 SDDDDFVWDE 191



 Score = 35.7 bits (83), Expect = 0.021
 Identities = 12/84 (14%), Positives = 49/84 (58%), Gaps = 2/84 (2%)

Query: 160 EIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGEN 219
           +   + ++I    +  ++D+ +++++ +++++    + ++E+++E+  ++D  +++E + 
Sbjct: 116 DDLNYVKDIDVLNQADDDDDDDDDDDLDDDDID--DDDDDEDDDEDDDDDDVDDEDEEKK 173

Query: 220 EEEEEEEVVKQKKNEEEEEEEEVL 243
           E +E E++        +E++ E L
Sbjct: 174 EAKELEKLSDDDDFVWDEDDSEAL 197



 Score = 35.4 bits (82), Expect = 0.023
 Identities = 13/81 (16%), Positives = 51/81 (62%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
            E+K   D+ ++    ++ +V  + + ++++++++  ++D+I+ ++ + +++E+++    
Sbjct: 106 AEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDV 165

Query: 231 KKNEEEEEEEEVLIKLEEEEE 251
              +EE++E + L KL ++++
Sbjct: 166 DDEDEEKKEAKELEKLSDDDD 186



 Score = 34.6 bits (80), Expect = 0.044
 Identities = 10/85 (11%), Positives = 46/85 (54%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           +N   +++     +  +  N+ +++     + + ++++ +  ++DE + E+ ++++ ++E
Sbjct: 109 KNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDE 168

Query: 226 EVVKQKKNEEEEEEEEVLIKLEEEE 250
           +  K++  E E+  ++     +E++
Sbjct: 169 DEEKKEAKELEKLSDDDDFVWDEDD 193



 Score = 33.4 bits (77), Expect = 0.091
 Identities = 12/62 (19%), Positives = 39/62 (62%), Gaps = 1/62 (1%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           + +  + D+ +++E+++E++     + E+EE++E  KE +++  ++    +E++ E ++Q
Sbjct: 141 DLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEA-KELEKLSDDDDFVWDEDDSEALRQ 199

Query: 231 KK 232
            +
Sbjct: 200 AR 201



 Score = 30.7 bits (70), Expect = 0.65
 Identities = 8/59 (13%), Positives = 34/59 (57%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVK 229
           ++E  ++D++++E+ E++E  + +K  ++++   ++ + + + +   + +     + VK
Sbjct: 156 DDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQARKDAKLTATADPVK 214



 Score = 30.3 bits (69), Expect = 1.1
 Identities = 8/79 (10%), Positives = 42/79 (53%), Gaps = 7/79 (8%)

Query: 173 EKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKK 232
              ++ ++  + ++++++       +++ ++++  E+D+ + ++ + ++E+EE+      
Sbjct: 119 NYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEK------ 172

Query: 233 NEEEEEEEEVLIKLEEEEE 251
            +E +E E++    +   +
Sbjct: 173 -KEAKELEKLSDDDDFVWD 190


>gnl|CDD|239286 cd02988, Phd_like_VIAF, Phosducin (Phd)-like family, Viral
           inhibitor of apoptosis (IAP)-associated factor (VIAF)
           subfamily; VIAF is a Phd-like protein that functions in
           caspase activation during apoptosis. It was identified
           as an IAP binding protein through a screen of a human
           B-cell library using a prototype IAP. VIAF lacks a
           consensus IAP binding motif and while it does not
           function as an IAP antagonist, it still plays a
           regulatory role in the complete activation of caspases.
           VIAF itself is a substrate for IAP-mediated
           ubiquitination, suggesting that it may be a target of
           IAPs in the prevention of cell death. The similarity of
           VIAF to Phd points to a potential role distinct from
           apoptosis regulation. Phd functions as a cytosolic
           regulator of G protein by specifically binding to G
           protein betagamma (Gbg)-subunits. The C-terminal domain
           of Phd adopts a thioredoxin fold, but it does not
           contain a CXXC motif. Phd interacts with G protein beta
           mostly through the N-terminal helical domain.
          Length = 192

 Score = 38.0 bits (89), Expect = 0.002
 Identities = 19/69 (27%), Positives = 31/69 (44%), Gaps = 6/69 (8%)

Query: 173 EKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKK 232
            K    + EEEE  E    +  +N        EKK  DE+++E  E E++   E  ++K+
Sbjct: 15  PKPPSPKEEEEEALELAIQEAHEN------ALEKKLLDELDEELDEEEDDRFLEEYRRKR 68

Query: 233 NEEEEEEEE 241
             E +   E
Sbjct: 69  LAEMKALAE 77



 Score = 31.9 bits (73), Expect = 0.18
 Identities = 22/76 (28%), Positives = 31/76 (40%), Gaps = 7/76 (9%)

Query: 152 HSAPSYSPEIFKFWENIAGEEEKKEEDEIEEEENEEE-----EEVKQKKNKEEEEEEEEK 206
                         E      EKK  DE++EE +EEE     EE ++K+  E +   E+ 
Sbjct: 20  PKEEEEEALELAIQEAHENALEKKLLDELDEELDEEEDDRFLEEYRRKRLAEMKALAEKS 79

Query: 207 K--EEDEIEKEEGENE 220
           K  E  EI K +   E
Sbjct: 80  KFGEVYEISKPDYVRE 95



 Score = 28.4 bits (64), Expect = 2.9
 Identities = 14/55 (25%), Positives = 25/55 (45%)

Query: 187 EEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEE 241
           E  ++ +KK     +    K+EE+E  +   +   E   E     + +EE +EEE
Sbjct: 2   EWNDILRKKGILPPKPPSPKEEEEEALELAIQEAHENALEKKLLDELDEELDEEE 56



 Score = 26.8 bits (60), Expect = 9.0
 Identities = 12/48 (25%), Positives = 24/48 (50%), Gaps = 2/48 (4%)

Query: 197 KEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLI 244
           K    +EEE  EE      +  +E   E++++ +   E +EEE++  +
Sbjct: 16  KPPSPKEEE--EEALELAIQEAHENALEKKLLDELDEELDEEEDDRFL 61


>gnl|CDD|237063 PRK12329, nusA, transcription elongation factor NusA; Provisional.
          Length = 449

 Score = 39.0 bits (91), Expect = 0.002
 Identities = 18/71 (25%), Positives = 33/71 (46%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
             E+ K  E   + EE E  +   +++ + E+ E  E+           E+E E+E+E+ 
Sbjct: 379 EAEDAKVAELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDELE 438

Query: 229 KQKKNEEEEEE 239
           + +  EEEE  
Sbjct: 439 EAQPEEEEEAR 449



 Score = 38.6 bits (90), Expect = 0.002
 Identities = 19/71 (26%), Positives = 35/71 (49%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           E E  +  E+  +  EEE   ++ + + E E+ E  +E+  + +     E+E E+E   +
Sbjct: 379 EAEDAKVAELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDELE 438

Query: 231 KKNEEEEEEEE 241
           +   EEEEE  
Sbjct: 439 EAQPEEEEEAR 449



 Score = 38.6 bits (90), Expect = 0.002
 Identities = 19/72 (26%), Positives = 32/72 (44%), Gaps = 2/72 (2%)

Query: 169 AGEEEKKEEDEIEEEENEE--EEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEE 226
              E+ K  + I + E EE  + E +++   E+ E  EE     E+     +  E+E+E 
Sbjct: 378 QEAEDAKVAELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDEL 437

Query: 227 VVKQKKNEEEEE 238
              Q + EEE  
Sbjct: 438 EEAQPEEEEEAR 449



 Score = 38.2 bits (89), Expect = 0.003
 Identities = 17/71 (23%), Positives = 33/71 (46%)

Query: 181 EEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEE 240
           +E E+ +  E+  ++ +EE  + E ++  +  + E  E +    E     +   E+E+E 
Sbjct: 378 QEAEDAKVAELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDEL 437

Query: 241 EVLIKLEEEEE 251
           E     EEEE 
Sbjct: 438 EEAQPEEEEEA 448



 Score = 37.1 bits (86), Expect = 0.006
 Identities = 22/74 (29%), Positives = 41/74 (55%)

Query: 178 DEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEE 237
           D  E ++  E+ +V +  ++ EEEE  +++ E+ +E E+ E  EE+          E+E 
Sbjct: 372 DSAEYDQEAEDAKVAELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEF 431

Query: 238 EEEEVLIKLEEEEE 251
           E+E+ L + + EEE
Sbjct: 432 EDEDELEEAQPEEE 445



 Score = 36.3 bits (84), Expect = 0.010
 Identities = 20/77 (25%), Positives = 38/77 (49%)

Query: 175 KEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNE 234
             E + E E+ +  E + Q++ +E  + E E++ E E  +   E+    E   + + + E
Sbjct: 373 SAEYDQEAEDAKVAELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFE 432

Query: 235 EEEEEEEVLIKLEEEEE 251
           +E+E EE   + EEE  
Sbjct: 433 DEDELEEAQPEEEEEAR 449



 Score = 33.6 bits (77), Expect = 0.080
 Identities = 21/70 (30%), Positives = 32/70 (45%)

Query: 157 YSPEIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEE 216
              E  K  E I+  EE++      EE  E E+  + +++    E     ++E E E E 
Sbjct: 378 QEAEDAKVAELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDEL 437

Query: 217 GENEEEEEEE 226
            E + EEEEE
Sbjct: 438 EEAQPEEEEE 447



 Score = 33.6 bits (77), Expect = 0.093
 Identities = 18/65 (27%), Positives = 31/65 (47%), Gaps = 4/65 (6%)

Query: 161 IFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENE 220
           I +  E  A + E +E  E E+ E  EE+     + +E     E++ E+++  +E    E
Sbjct: 389 ISQREEEEALQREAEERLEAEQAERAEEDA----RLRELYPLPEDEFEDEDELEEAQPEE 444

Query: 221 EEEEE 225
           EEE  
Sbjct: 445 EEEAR 449



 Score = 30.9 bits (70), Expect = 0.56
 Identities = 18/62 (29%), Positives = 28/62 (45%), Gaps = 4/62 (6%)

Query: 195 KNKEEEEEEEEKKEEDEI----EKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEE 250
           K+  E ++E E  +  E+    E+EE    E EE    +Q +  EE+     L  L E+E
Sbjct: 371 KDSAEYDQEAEDAKVAELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDE 430

Query: 251 EN 252
             
Sbjct: 431 FE 432


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
           found at the N terminus of SMC proteins. The SMC
           (structural maintenance of chromosomes) superfamily
           proteins have ATP-binding domains at the N- and
           C-termini, and two extended coiled-coil domains
           separated by a hinge in the middle. The eukaryotic SMC
           proteins form two kind of heterodimers: the SMC1/SMC3
           and the SMC2/SMC4 types. These heterodimers constitute
           an essential part of higher order complexes, which are
           involved in chromatin and DNA dynamics. This family also
           includes the RecF and RecN proteins that are involved in
           DNA metabolism and recombination.
          Length = 1162

 Score = 39.2 bits (91), Expect = 0.002
 Identities = 32/82 (39%), Positives = 49/82 (59%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           EEEK    + EEEE + E  +K+K+  EEEE+ E+ K E+E E++    EEE      + 
Sbjct: 752 EEEKSRLKKEEEEEEKSELSLKEKELAEEEEKTEKLKVEEEKEEKLKAQEEELRALEEEL 811

Query: 231 KKNEEEEEEEEVLIKLEEEEEN 252
           K+  E  EEE++LI+ EE+ + 
Sbjct: 812 KEEAELLEEEQLLIEQEEKIKE 833



 Score = 36.9 bits (85), Expect = 0.009
 Identities = 25/81 (30%), Positives = 39/81 (48%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
              ++E  E  E   EE+  ++Q++  +EEE EE   E  E +K E   EEE E    + 
Sbjct: 805 RALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEEELERLEEEI 864

Query: 231 KKNEEEEEEEEVLIKLEEEEE 251
            K E  +E      +LEE++ 
Sbjct: 865 TKEELLQELLLKEEELEEQKL 885



 Score = 33.8 bits (77), Expect = 0.079
 Identities = 28/81 (34%), Positives = 45/81 (55%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
            +  KE  E E+E+  +EEE+K    +EEE + E  K E     +E + +E E+E    +
Sbjct: 269 AQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKELKKLE 328

Query: 231 KKNEEEEEEEEVLIKLEEEEE 251
           K+ ++E+EE E L K  +E E
Sbjct: 329 KELKKEKEEIEELEKELKELE 349



 Score = 33.8 bits (77), Expect = 0.091
 Identities = 26/82 (31%), Positives = 41/82 (50%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
            EE+KEE    +EE     E + K+  E  EEE+   E++E  KEE   E   E +  ++
Sbjct: 789 VEEEKEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQK 848

Query: 231 KKNEEEEEEEEVLIKLEEEEEN 252
            +   EEE E +  ++ +EE  
Sbjct: 849 LEKLAEEELERLEEEITKEELL 870



 Score = 33.4 bits (76), Expect = 0.11
 Identities = 24/92 (26%), Positives = 52/92 (56%)

Query: 160 EIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGEN 219
           E     + +  +E+++ E   +E E EEE   +  K  +EEE+E++ +EE+     + E 
Sbjct: 238 ERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEE 297

Query: 220 EEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEE 251
           E + E   ++++K ++EE+ +E   +L++ E+
Sbjct: 298 ELKSELLKLERRKVDDEEKLKESEKELKKLEK 329



 Score = 33.4 bits (76), Expect = 0.13
 Identities = 28/82 (34%), Positives = 49/82 (59%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
            E+ K E+E EE+   +EEE++  + + +EE E  ++E+  IE+EE   EEE EE  ++ 
Sbjct: 784 TEKLKVEEEKEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALEL 843

Query: 231 KKNEEEEEEEEVLIKLEEEEEN 252
           K+ ++ E+  E  ++  EEE  
Sbjct: 844 KEEQKLEKLAEEELERLEEEIT 865



 Score = 33.4 bits (76), Expect = 0.14
 Identities = 27/87 (31%), Positives = 50/87 (57%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           E  +    K++E   EEE+ E+ +  ++K+ K + +EEE +  E+E+++E    EEE+  
Sbjct: 765 EEKSELSLKEKELAEEEEKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAELLEEEQLL 824

Query: 226 EVVKQKKNEEEEEEEEVLIKLEEEEEN 252
              ++K  EEE EE  + +K E++ E 
Sbjct: 825 IEQEEKIKEEELEELALELKEEQKLEK 851



 Score = 33.0 bits (75), Expect = 0.15
 Identities = 26/93 (27%), Positives = 49/93 (52%)

Query: 160 EIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGEN 219
           +  K  E       K+EE+   E    E  +V  ++  +E E+E +K E++  +++E   
Sbjct: 280 KEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIE 339

Query: 220 EEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEEN 252
           E E+E + ++ K+  EEEEEE++    E+ E+ 
Sbjct: 340 ELEKELKELEIKREAEEEEEEQLEKLQEKLEQL 372



 Score = 33.0 bits (75), Expect = 0.15
 Identities = 28/82 (34%), Positives = 40/82 (48%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           E +K    E +E E EE    +     + E E EE   E+  EKE+ E+ +EEEEE  K+
Sbjct: 906 ESQKDNLLEEKENEIEERIAEEAIILLKYESEPEELLLEEADEKEKEEDNKEEEEERNKR 965

Query: 231 KKNEEEEEEEEVLIKLEEEEEN 252
               +EE     L+ + E EE 
Sbjct: 966 LLLAKEELGNVNLMAIAEFEEK 987



 Score = 32.6 bits (74), Expect = 0.21
 Identities = 23/93 (24%), Positives = 47/93 (50%)

Query: 160 EIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGEN 219
           E  K       E   +   E +EEE E++ + ++ K   +EEEE + +      ++  + 
Sbjct: 255 ESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDE 314

Query: 220 EEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEEN 252
           E+ +E E   +K  +E ++E+E + +LE+E + 
Sbjct: 315 EKLKESEKELKKLEKELKKEKEEIEELEKELKE 347



 Score = 32.6 bits (74), Expect = 0.22
 Identities = 24/82 (29%), Positives = 43/82 (52%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           +E KK E E+++E+ E EE  K+ K  E + E EE++EE   + +E   + EEE    K+
Sbjct: 322 KELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKKK 381

Query: 231 KKNEEEEEEEEVLIKLEEEEEN 252
            ++E      ++  +  E +  
Sbjct: 382 LESERLSSAAKLKEEELELKNE 403



 Score = 32.2 bits (73), Expect = 0.25
 Identities = 28/92 (30%), Positives = 55/92 (59%)

Query: 160 EIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGEN 219
           +  +  ++   + E+++ D+ E+ +  E+E  K +K  ++E+EE E+ E++  E E    
Sbjct: 294 KEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKELEIKRE 353

Query: 220 EEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEE 251
            EEEEEE +++ + + E+ EEE+L K + E E
Sbjct: 354 AEEEEEEQLEKLQEKLEQLEEELLAKKKLESE 385



 Score = 32.2 bits (73), Expect = 0.29
 Identities = 31/86 (36%), Positives = 51/86 (59%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           E  + EE++KEE +  EEE++++  +++K+N+ EE   EE     + E E  E   EE +
Sbjct: 888 ELESKEEKEKEEKKELEEESQKDNLLEEKENEIEERIAEEAIILLKYESEPEELLLEEAD 947

Query: 226 EVVKQKKNEEEEEEEEVLIKLEEEEE 251
           E  K++ N+EEEEE    + L +EE 
Sbjct: 948 EKEKEEDNKEEEEERNKRLLLAKEEL 973



 Score = 32.2 bits (73), Expect = 0.29
 Identities = 23/81 (28%), Positives = 45/81 (55%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQK 231
             ++  D ++E   +E+EE++  K + E+EEE   +   E ++EE E + +EEE  +  K
Sbjct: 235 LNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAK 294

Query: 232 KNEEEEEEEEVLIKLEEEEEN 252
           + EE + E   L + + ++E 
Sbjct: 295 EEEELKSELLKLERRKVDDEE 315



 Score = 31.9 bits (72), Expect = 0.36
 Identities = 28/85 (32%), Positives = 48/85 (56%)

Query: 167 NIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEE 226
                  ++EE   EEE  E   E+K+++  E+  EEE ++ E+EI KEE   E   +EE
Sbjct: 819 EEEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEEELERLEEEITKEELLQELLLKEE 878

Query: 227 VVKQKKNEEEEEEEEVLIKLEEEEE 251
            ++++K ++E E +E   K E++E 
Sbjct: 879 ELEEQKLKDELESKEEKEKEEKKEL 903



 Score = 31.5 bits (71), Expect = 0.46
 Identities = 20/87 (22%), Positives = 43/87 (49%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
             +  + E +EE+ +  +  +  EE      +   +E+EE +   +  ++E E   +  +
Sbjct: 214 YQLKEKLELEEENLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLK 273

Query: 226 EVVKQKKNEEEEEEEEVLIKLEEEEEN 252
           E  +++K ++ +EEE  L+  EEEE  
Sbjct: 274 ENKEEEKEKKLQEEELKLLAKEEEELK 300



 Score = 31.5 bits (71), Expect = 0.47
 Identities = 23/82 (28%), Positives = 48/82 (58%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           +EE+KE+   EEE     +E ++ K++  + E  +  +E+++++ E E ++ E+E   ++
Sbjct: 276 KEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKELKKEK 335

Query: 231 KKNEEEEEEEEVLIKLEEEEEN 252
           ++ EE E+E + L    E EE 
Sbjct: 336 EEIEELEKELKELEIKREAEEE 357



 Score = 31.5 bits (71), Expect = 0.51
 Identities = 26/82 (31%), Positives = 48/82 (58%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           E +KKE+   EE +  + E+ +   +K +E +++  +E   +E++  E EEEEE+  +K+
Sbjct: 701 EIKKKEQRIKEELKKLKLEKEELLADKVQEAQDKINEELKLLEQKIKEKEEEEEKSRLKK 760

Query: 231 KKNEEEEEEEEVLIKLEEEEEN 252
           ++ EEE+ E  +  K   EEE 
Sbjct: 761 EEEEEEKSELSLKEKELAEEEE 782



 Score = 31.1 bits (70), Expect = 0.58
 Identities = 25/93 (26%), Positives = 56/93 (60%)

Query: 160 EIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGEN 219
           ++ +    +  +EE++ + E+ + E  + ++ ++ K  E+E ++ EK+ + E E+ E   
Sbjct: 283 KLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELE 342

Query: 220 EEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEEN 252
           +E +E E+ ++ + EEEE+ E++  KLE+ EE 
Sbjct: 343 KELKELEIKREAEEEEEEQLEKLQEKLEQLEEE 375



 Score = 31.1 bits (70), Expect = 0.65
 Identities = 28/81 (34%), Positives = 46/81 (56%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           EEE+ E+ + + E+ EEE   K+K   E      + KEE+   K E E E +   E+ +Q
Sbjct: 358 EEEQLEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELELKNEEEKEAKLLLELSEQ 417

Query: 231 KKNEEEEEEEEVLIKLEEEEE 251
           +++  +EE++E L  +EE EE
Sbjct: 418 EEDLLKEEKKEELKIVEELEE 438



 Score = 31.1 bits (70), Expect = 0.66
 Identities = 30/91 (32%), Positives = 48/91 (52%), Gaps = 5/91 (5%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEV-----KQKKNKEEEEEEEEKKEEDEIEKEEGENE 220
           E    E   KE++  EEEE  E+ +V     ++ K +EEE    E++ ++E E  E E  
Sbjct: 764 EEEKSELSLKEKELAEEEEKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAELLEEEQL 823

Query: 221 EEEEEEVVKQKKNEEEEEEEEVLIKLEEEEE 251
             E+EE +K+++ EE   E +   KLE+  E
Sbjct: 824 LIEQEEKIKEEELEELALELKEEQKLEKLAE 854



 Score = 31.1 bits (70), Expect = 0.69
 Identities = 25/85 (29%), Positives = 41/85 (48%)

Query: 166  ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
            +    EE   EE  I  +   E EE+  ++  E+E+EE+ K+EE+E  K     +EE   
Sbjct: 916  KENEIEERIAEEAIILLKYESEPEELLLEEADEKEKEEDNKEEEEERNKRLLLAKEELGN 975

Query: 226  EVVKQKKNEEEEEEEEVLIKLEEEE 250
              +      EE+EE     +L++E 
Sbjct: 976  VNLMAIAEFEEKEERYNKDELKKER 1000



 Score = 30.7 bits (69), Expect = 0.79
 Identities = 29/86 (33%), Positives = 45/86 (52%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           E     E++++  E E EE   E + +QK  K  EEE E  +EE   E+   E   +EEE
Sbjct: 820 EEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEEELERLEEEITKEELLQELLLKEEE 879

Query: 226 EVVKQKKNEEEEEEEEVLIKLEEEEE 251
              ++ K+E E +EE+   + +E EE
Sbjct: 880 LEEQKLKDELESKEEKEKEEKKELEE 905



 Score = 30.7 bits (69), Expect = 0.82
 Identities = 17/82 (20%), Positives = 37/82 (45%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           E+ KK  +  + +E  E EE         +  EE      E+ ++E E  E  ++E+ K+
Sbjct: 205 EQAKKALEYYQLKEKLELEEENLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKE 264

Query: 231 KKNEEEEEEEEVLIKLEEEEEN 252
           ++   +  +E    + E++ + 
Sbjct: 265 EEILAQVLKENKEEEKEKKLQE 286



 Score = 30.7 bits (69), Expect = 0.89
 Identities = 27/87 (31%), Positives = 51/87 (58%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           E I  +  K+ ++E +E++ +EEE     K +EE + E  K E  +++ EE   E E+E 
Sbjct: 265 EEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKEL 324

Query: 226 EVVKQKKNEEEEEEEEVLIKLEEEEEN 252
           + ++++  +E+EE EE+  +L+E E  
Sbjct: 325 KKLEKELKKEKEEIEELEKELKELEIK 351



 Score = 30.7 bits (69), Expect = 0.92
 Identities = 17/77 (22%), Positives = 35/77 (45%), Gaps = 4/77 (5%)

Query: 176 EEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEE 235
            E   ++E  ++  E  +   +   + EE K +E +++++     ++  E    ++K E 
Sbjct: 167 REKRKKKERLKKLIEETENLAELIIDLEELKLQELKLKEQ----AKKALEYYQLKEKLEL 222

Query: 236 EEEEEEVLIKLEEEEEN 252
           EEE    L  L+  EE 
Sbjct: 223 EEENLLYLDYLKLNEER 239



 Score = 30.3 bits (68), Expect = 0.99
 Identities = 26/88 (29%), Positives = 40/88 (45%)

Query: 165 WENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEE 224
                 EE      E+  +E EE E  KQ+  KEEE   +  KE  E EKE+   EEE +
Sbjct: 231 DYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELK 290

Query: 225 EEVVKQKKNEEEEEEEEVLIKLEEEEEN 252
               ++++ + E  + E     +EE+  
Sbjct: 291 LLAKEEEELKSELLKLERRKVDDEEKLK 318



 Score = 30.3 bits (68), Expect = 1.1
 Identities = 24/77 (31%), Positives = 46/77 (59%)

Query: 176 EEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEE 235
            +D+I EE    E+++K+K+ +EE+   ++++EE+E  +   + +E  EEE   +K   E
Sbjct: 731 AQDKINEELKLLEQKIKEKEEEEEKSRLKKEEEEEEKSELSLKEKELAEEEEKTEKLKVE 790

Query: 236 EEEEEEVLIKLEEEEEN 252
           EE+EE++  + EE    
Sbjct: 791 EEKEEKLKAQEEELRAL 807



 Score = 30.3 bits (68), Expect = 1.1
 Identities = 16/85 (18%), Positives = 35/85 (41%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
                E + +E    E+ +   E    ++K + EEE           E+     +E   +
Sbjct: 190 IIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEENLLYLDYLKLNEERIDLLQELLRD 249

Query: 226 EVVKQKKNEEEEEEEEVLIKLEEEE 250
           E  + + +++E E+EE ++    +E
Sbjct: 250 EQEEIESSKQELEKEEEILAQVLKE 274



 Score = 29.9 bits (67), Expect = 1.3
 Identities = 24/89 (26%), Positives = 40/89 (44%), Gaps = 3/89 (3%)

Query: 163  KFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEE 222
            +           K E E EE   EE +E +++++ +EEEEE  K+     E+    N   
Sbjct: 921  EERIAEEAIILLKYESEPEELLLEEADEKEKEEDNKEEEEERNKRLLLAKEELGNVNLMA 980

Query: 223  EEEEVVKQKKNEEEEEEEEVLIKLEEEEE 251
              E     +K E   ++E    +LEEE++
Sbjct: 981  IAEFE---EKEERYNKDELKKERLEEEKK 1006



 Score = 29.9 bits (67), Expect = 1.3
 Identities = 19/84 (22%), Positives = 46/84 (54%)

Query: 168 IAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEV 227
           +  +  K  E+ I+  +    +E ++ ++ ++E E+EE+     +++ + E +E++ +E 
Sbjct: 228 LYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEE 287

Query: 228 VKQKKNEEEEEEEEVLIKLEEEEE 251
             +   +EEEE +  L+KLE  + 
Sbjct: 288 ELKLLAKEEEELKSELLKLERRKV 311



 Score = 29.9 bits (67), Expect = 1.4
 Identities = 23/87 (26%), Positives = 44/87 (50%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
             +  EEEK+ +  +E  E EE+   ++KK + +  EE E+  E +  K   E EE E++
Sbjct: 398 LELKNEEEKEAKLLLELSEQEEDLLKEEKKEELKIVEELEESLETKQGKLTEEKEELEKQ 457

Query: 226 EVVKQKKNEEEEEEEEVLIKLEEEEEN 252
            +   K   E ++ E++L + +  +  
Sbjct: 458 ALKLLKDKLELKKSEDLLKETKLVKLL 484



 Score = 29.9 bits (67), Expect = 1.4
 Identities = 24/81 (29%), Positives = 39/81 (48%)

Query: 171  EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
             E + EE  +EE + +E+EE  +++ +E  +     KEE          E EE+EE   +
Sbjct: 934  YESEPEELLLEEADEKEKEEDNKEEEEERNKRLLLAKEELGNVNLMAIAEFEEKEERYNK 993

Query: 231  KKNEEEEEEEEVLIKLEEEEE 251
             + ++E  EEE    L E  E
Sbjct: 994  DELKKERLEEEKKELLREIIE 1014



 Score = 29.9 bits (67), Expect = 1.5
 Identities = 30/90 (33%), Positives = 50/90 (55%), Gaps = 3/90 (3%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKK---NKEEEEEEEEKKEEDEIEKEEGENEEE 222
           E    EEE ++++ +EE+ENE EE + ++     K E E EE   EE + +++E +N+EE
Sbjct: 899 EKKELEEESQKDNLLEEKENEIEERIAEEAIILLKYESEPEELLLEEADEKEKEEDNKEE 958

Query: 223 EEEEVVKQKKNEEEEEEEEVLIKLEEEEEN 252
           EEE   +    +EE     ++   E EE+ 
Sbjct: 959 EEERNKRLLLAKEELGNVNLMAIAEFEEKE 988



 Score = 29.6 bits (66), Expect = 2.0
 Identities = 18/79 (22%), Positives = 40/79 (50%)

Query: 171  EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
               ++ +++ +EE+N+EEEE + K+    +EE          E EE E    ++E   ++
Sbjct: 941  LLLEEADEKEKEEDNKEEEEERNKRLLLAKEELGNVNLMAIAEFEEKEERYNKDELKKER 1000

Query: 231  KKNEEEEEEEEVLIKLEEE 249
             + E++E   E++ +  + 
Sbjct: 1001 LEEEKKELLREIIEETCQR 1019



 Score = 28.0 bits (62), Expect = 6.1
 Identities = 33/87 (37%), Positives = 54/87 (62%), Gaps = 6/87 (6%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEK------EEGENEEEEE 224
           E+E +E+ E E  +NE     ++ K KE+  +EE KK + E E+      +E +++  EE
Sbjct: 679 EQELQEKAESELAKNEILRRQEEIKKKEQRIKEELKKLKLEKEELLADKVQEAQDKINEE 738

Query: 225 EEVVKQKKNEEEEEEEEVLIKLEEEEE 251
            ++++QK  E+EEEEE+  +K EEEEE
Sbjct: 739 LKLLEQKIKEKEEEEEKSRLKKEEEEE 765



 Score = 28.0 bits (62), Expect = 6.1
 Identities = 28/79 (35%), Positives = 47/79 (59%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           +E KKE++EIEE E E +E   +++ +EEEEE+ EK +E   + EE    +++ E     
Sbjct: 329 KELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLESERLS 388

Query: 231 KKNEEEEEEEEVLIKLEEE 249
              + +EEE E+  + E+E
Sbjct: 389 SAAKLKEEELELKNEEEKE 407



 Score = 28.0 bits (62), Expect = 6.4
 Identities = 20/87 (22%), Positives = 48/87 (55%)

Query: 165 WENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEE 224
            E +  + E+++E ++  E +E+EE++ +++ KEE +  EE +E  E ++ +   E+EE 
Sbjct: 395 EEELELKNEEEKEAKLLLELSEQEEDLLKEEKKEELKIVEELEESLETKQGKLTEEKEEL 454

Query: 225 EEVVKQKKNEEEEEEEEVLIKLEEEEE 251
           E+   +   ++ E ++   +  E +  
Sbjct: 455 EKQALKLLKDKLELKKSEDLLKETKLV 481



 Score = 27.2 bits (60), Expect = 9.2
 Identities = 23/85 (27%), Positives = 41/85 (48%), Gaps = 3/85 (3%)

Query: 171 EEEKKEEDEIEEEE---NEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEV 227
           E+EK+E+ E+EEE    N  EE+  + + +  EE     K E E E+   E  +E+E+E 
Sbjct: 894 EKEKEEKKELEEESQKDNLLEEKENEIEERIAEEAIILLKYESEPEELLLEEADEKEKEE 953

Query: 228 VKQKKNEEEEEEEEVLIKLEEEEEN 252
             +++ EE  +   +  +       
Sbjct: 954 DNKEEEEERNKRLLLAKEELGNVNL 978


>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
          Length = 211

 Score = 38.2 bits (89), Expect = 0.002
 Identities = 22/73 (30%), Positives = 39/73 (53%), Gaps = 1/73 (1%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEG-ENEEEEEEEVVK 229
            ++ K E+  E+   E E + + K  +E+ E EE +KEE   + EE  E + EE ++   
Sbjct: 5   CKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDENN 64

Query: 230 QKKNEEEEEEEEV 242
           + K E ++ E E+
Sbjct: 65  KLKEENKKLENEL 77



 Score = 34.3 bits (79), Expect = 0.031
 Identities = 29/77 (37%), Positives = 46/77 (59%), Gaps = 7/77 (9%)

Query: 175 KEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNE 234
           ++E +  + EN EE+  K+ +NKEE++ +EE  E +EIEKEE   + EE  EV       
Sbjct: 2   EKECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEV------- 54

Query: 235 EEEEEEEVLIKLEEEEE 251
           + EE ++   KL+EE +
Sbjct: 55  KIEELKDENNKLKEENK 71



 Score = 33.2 bits (76), Expect = 0.077
 Identities = 23/64 (35%), Positives = 38/64 (59%)

Query: 189 EEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEE 248
            E + K  K E  EE+  KE +  E+++G+ E+ E EE+ K++  E+ EE  EV I+  +
Sbjct: 1   MEKECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELK 60

Query: 249 EEEN 252
           +E N
Sbjct: 61  DENN 64



 Score = 31.3 bits (71), Expect = 0.34
 Identities = 17/73 (23%), Positives = 38/73 (52%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
            E++ ++ + E  E +  +E + K+  + +EE+ E +E ++ E  E   E  E +    +
Sbjct: 1   MEKECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELK 60

Query: 231 KKNEEEEEEEEVL 243
            +N + +EE + L
Sbjct: 61  DENNKLKEENKKL 73



 Score = 27.4 bits (61), Expect = 6.2
 Identities = 19/56 (33%), Positives = 31/56 (55%), Gaps = 2/56 (3%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEE 221
           E+   EE+ + E+  +EE  E+ EE    + K EE ++E  K ++E +K E E E 
Sbjct: 26  EDKGKEEDLEFEEIEKEEIIEDSEE--SNEVKIEELKDENNKLKEENKKLENELEA 79


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 38.9 bits (91), Expect = 0.002
 Identities = 31/82 (37%), Positives = 43/82 (52%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           EE K E +E+ EE  E +EE+ + K + EE E E     + +E+ E E EE EE     +
Sbjct: 270 EELKSELEELREELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELK 329

Query: 231 KKNEEEEEEEEVLIKLEEEEEN 252
           +K E  +EE E    L EE E 
Sbjct: 330 EKIEALKEELEERETLLEELEQ 351



 Score = 38.5 bits (90), Expect = 0.003
 Identities = 25/87 (28%), Positives = 41/87 (47%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           E +    E+ +E     +E  EE E   ++ ++   E EE KEE E +      E EE  
Sbjct: 319 EELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELF 378

Query: 226 EVVKQKKNEEEEEEEEVLIKLEEEEEN 252
           E ++++  E E E  E+  +LEE +  
Sbjct: 379 EALREELAELEAELAEIRNELEELKRE 405



 Score = 35.5 bits (82), Expect = 0.023
 Identities = 26/87 (29%), Positives = 38/87 (43%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           E + GE     E   E E   EE E + ++ KE+ E  +E+ EE E   EE E    E E
Sbjct: 298 EELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELE 357

Query: 226 EVVKQKKNEEEEEEEEVLIKLEEEEEN 252
           E  ++ + +     EE+    E   E 
Sbjct: 358 EAKEELEEKLSALLEELEELFEALREE 384



 Score = 35.1 bits (81), Expect = 0.031
 Identities = 29/83 (34%), Positives = 47/83 (56%)

Query: 170 GEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVK 229
            E+E +E  E  EE   E+EE++ +  + EEE+EE ++E  E+E E  E +EE E+   +
Sbjct: 854 LEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRER 913

Query: 230 QKKNEEEEEEEEVLIKLEEEEEN 252
            ++ E + E  EV +   EEE  
Sbjct: 914 LEELEAKLERLEVELPELEEELE 936



 Score = 35.1 bits (81), Expect = 0.036
 Identities = 28/86 (32%), Positives = 53/86 (61%), Gaps = 1/86 (1%)

Query: 167 NIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE- 225
               EE +++ +E++ E    EEE++Q +++ EE EEE ++ E+E+E+ +   EE EEE 
Sbjct: 708 RRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEEL 767

Query: 226 EVVKQKKNEEEEEEEEVLIKLEEEEE 251
           E +++   + +EE EE+  K +  +E
Sbjct: 768 ESLEEALAKLKEEIEELEEKRQALQE 793



 Score = 34.7 bits (80), Expect = 0.041
 Identities = 29/81 (35%), Positives = 47/81 (58%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           EE K+E +E+E E+ E E+E+K+ + ++EE EEE ++ E E+ + + E E+  E     +
Sbjct: 859 EELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEELE 918

Query: 231 KKNEEEEEEEEVLIKLEEEEE 251
            K E  E E   L +  EEE 
Sbjct: 919 AKLERLEVELPELEEELEEEY 939



 Score = 34.7 bits (80), Expect = 0.042
 Identities = 25/83 (30%), Positives = 47/83 (56%), Gaps = 1/83 (1%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEE-EEEEEVVK 229
           EE ++  + + EE  E E E+ + +N+ EE + E +  E+ +E+     E+ +EE + ++
Sbjct: 372 EELEELFEALREELAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELE 431

Query: 230 QKKNEEEEEEEEVLIKLEEEEEN 252
            +  E + E EE+  +LEE EE 
Sbjct: 432 AELEELQTELEELNEELEELEEQ 454



 Score = 34.7 bits (80), Expect = 0.052
 Identities = 25/83 (30%), Positives = 42/83 (50%), Gaps = 1/83 (1%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE-EVVK 229
           EE + E +E+EE   E +E+++  K + EE E   ++ E  + + E   EE EE+   + 
Sbjct: 312 EELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALL 371

Query: 230 QKKNEEEEEEEEVLIKLEEEEEN 252
           ++  E  E   E L +LE E   
Sbjct: 372 EELEELFEALREELAELEAELAE 394



 Score = 34.3 bits (79), Expect = 0.054
 Identities = 32/83 (38%), Positives = 55/83 (66%), Gaps = 1/83 (1%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEK-EEGENEEEEEEEVVK 229
           EE +++ DE+EEE  E E+E+++ K + EE E E+++ EDE+++ EE + E EEE   ++
Sbjct: 838 EELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELE 897

Query: 230 QKKNEEEEEEEEVLIKLEEEEEN 252
            +  E +EE E++  +LEE E  
Sbjct: 898 SELAELKEEIEKLRERLEELEAK 920



 Score = 33.9 bits (78), Expect = 0.089
 Identities = 28/82 (34%), Positives = 49/82 (59%), Gaps = 1/82 (1%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEE-EEEEEVVK 229
           +  ++E +E+EEE  E E  +   + + E  E+  ++ E EIE+ E E EE EE+ + ++
Sbjct: 789 QALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELE 848

Query: 230 QKKNEEEEEEEEVLIKLEEEEE 251
           ++  E E+E EE+  +LEE E 
Sbjct: 849 EELEELEKELEELKEELEELEA 870



 Score = 33.1 bits (76), Expect = 0.15
 Identities = 32/83 (38%), Positives = 47/83 (56%), Gaps = 1/83 (1%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           E+  +E     EE   + EE+K++    EEE E+ +   +E+E+E  E EEE EE   + 
Sbjct: 701 EDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERL 760

Query: 231 KKNEEE-EEEEEVLIKLEEEEEN 252
           ++ EEE E  EE L KL+EE E 
Sbjct: 761 EELEEELESLEEALAKLKEEIEE 783



 Score = 33.1 bits (76), Expect = 0.15
 Identities = 23/75 (30%), Positives = 35/75 (46%)

Query: 168 IAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEV 227
           I   +E+ EE E   EE E+     ++  +E EE+     EE E   E    E  E E  
Sbjct: 332 IEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAE 391

Query: 228 VKQKKNEEEEEEEEV 242
           + + +NE EE + E+
Sbjct: 392 LAEIRNELEELKREI 406



 Score = 32.8 bits (75), Expect = 0.18
 Identities = 28/86 (32%), Positives = 41/86 (47%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           + +A  EE KEE E +     EE E   +  +EE  E E +  E   E EE + E E  E
Sbjct: 351 QLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIESLE 410

Query: 226 EVVKQKKNEEEEEEEEVLIKLEEEEE 251
           E +++     E+ +EE+     E EE
Sbjct: 411 ERLERLSERLEDLKEELKELEAELEE 436



 Score = 32.4 bits (74), Expect = 0.24
 Identities = 25/86 (29%), Positives = 41/86 (47%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
             +   EE+ EE + + E  +EE E ++   +E E+   E +E  E  +E+     EE E
Sbjct: 316 NELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELE 375

Query: 226 EVVKQKKNEEEEEEEEVLIKLEEEEE 251
           E+ +  + E  E E E+     E EE
Sbjct: 376 ELFEALREELAELEAELAEIRNELEE 401



 Score = 32.4 bits (74), Expect = 0.25
 Identities = 27/86 (31%), Positives = 36/86 (41%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           E +   E   EE E    E EE +E  ++K     EE EE  E    E  E E E  E  
Sbjct: 337 EELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIR 396

Query: 226 EVVKQKKNEEEEEEEEVLIKLEEEEE 251
             +++ K E E  EE +    E  E+
Sbjct: 397 NELEELKREIESLEERLERLSERLED 422



 Score = 32.4 bits (74), Expect = 0.26
 Identities = 25/87 (28%), Positives = 43/87 (49%), Gaps = 3/87 (3%)

Query: 165 WENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEE 224
                  E + E +E++ E    EE +++   + E+ +EE K+ E E+E+ + E EE  E
Sbjct: 387 ELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNE 446

Query: 225 EEVVKQKKNEEEEEEEEVLIKLEEEEE 251
           E     +  E+ EE  + L +LE E  
Sbjct: 447 ELE---ELEEQLEELRDRLKELERELA 470



 Score = 32.4 bits (74), Expect = 0.27
 Identities = 31/82 (37%), Positives = 50/82 (60%), Gaps = 2/82 (2%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           EE+++   E  EE  EE EE +++ +  E E E  ++  + +E+E  E EEE EE  +++
Sbjct: 785 EEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEE--LEE 842

Query: 231 KKNEEEEEEEEVLIKLEEEEEN 252
           K +E EEE EE+  +LEE +E 
Sbjct: 843 KLDELEEELEELEKELEELKEE 864



 Score = 32.0 bits (73), Expect = 0.31
 Identities = 21/82 (25%), Positives = 39/82 (47%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
            E ++ ++E+EE+ +   EE+++      EE  E + E  EI  E  E + E E    + 
Sbjct: 354 AELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIESLEERL 413

Query: 231 KKNEEEEEEEEVLIKLEEEEEN 252
           ++  E  E+ +  +K  E E  
Sbjct: 414 ERLSERLEDLKEELKELEAELE 435



 Score = 32.0 bits (73), Expect = 0.31
 Identities = 27/86 (31%), Positives = 51/86 (59%), Gaps = 3/86 (3%)

Query: 167 NIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEE 226
               +  ++E + +E+     E+E+++ + + EE EE+  + E+E+E+ E E EE +EE 
Sbjct: 806 ERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEEL 865

Query: 227 VVKQKKNEEEEEEEEVLIKLEEEEEN 252
              ++   E+EE E+ L +LEEE+E 
Sbjct: 866 ---EELEAEKEELEDELKELEEEKEE 888



 Score = 32.0 bits (73), Expect = 0.32
 Identities = 25/71 (35%), Positives = 43/71 (60%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           EE ++E  E+E E  E +EE+++ + + EE E + ++ E E+ + E E EEE E+ +  +
Sbjct: 887 EELEEELRELESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEYEDTLETE 946

Query: 231 KKNEEEEEEEE 241
            + E E  EEE
Sbjct: 947 LEREIERLEEE 957



 Score = 31.6 bits (72), Expect = 0.38
 Identities = 34/100 (34%), Positives = 46/100 (46%), Gaps = 2/100 (2%)

Query: 155 PSYSPEIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEK 214
                E+ +  E +   +E+ EE E E     E  E  + + +E EE  EE KE+ E  K
Sbjct: 277 EELREELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALK 336

Query: 215 EEGENEEEEEEEVVKQKK--NEEEEEEEEVLIKLEEEEEN 252
           EE E  E   EE+ +      E +EE EE L  L EE E 
Sbjct: 337 EELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEE 376



 Score = 31.6 bits (72), Expect = 0.49
 Identities = 31/84 (36%), Positives = 46/84 (54%), Gaps = 2/84 (2%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEK--EEGENEEEEEEEVV 228
           EE + E++E+E+E  E EEE ++ + +  E E E  + ++EIEK  E  E  E + E + 
Sbjct: 866 EELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEELEAKLERLE 925

Query: 229 KQKKNEEEEEEEEVLIKLEEEEEN 252
            +    EEE EEE    LE E E 
Sbjct: 926 VELPELEEELEEEYEDTLETELER 949



 Score = 31.2 bits (71), Expect = 0.52
 Identities = 33/79 (41%), Positives = 46/79 (58%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           +  ++ E EIEE E E EE  ++    EEE EE EK+ E+  E+ E    E+EE E   +
Sbjct: 821 QRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELK 880

Query: 231 KKNEEEEEEEEVLIKLEEE 249
           +  EE+EE EE L +LE E
Sbjct: 881 ELEEEKEELEEELRELESE 899



 Score = 31.2 bits (71), Expect = 0.56
 Identities = 28/83 (33%), Positives = 47/83 (56%), Gaps = 1/83 (1%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           E  ++   +++EE  E EE+ +  + + EE EEE ++ E  ++  E E E  E+     +
Sbjct: 768 ESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLE 827

Query: 231 KKNEE-EEEEEEVLIKLEEEEEN 252
           ++ EE EEE EE+  KL+E EE 
Sbjct: 828 QEIEELEEEIEELEEKLDELEEE 850



 Score = 31.2 bits (71), Expect = 0.61
 Identities = 27/82 (32%), Positives = 40/82 (48%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           EE ++ ++ +EE E E E   +     +EE EE E+K +   E+ E   EE EE E    
Sbjct: 751 EELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLD 810

Query: 231 KKNEEEEEEEEVLIKLEEEEEN 252
               E E  E+   +LE+E E 
Sbjct: 811 ALERELESLEQRRERLEQEIEE 832



 Score = 30.8 bits (70), Expect = 0.74
 Identities = 26/75 (34%), Positives = 44/75 (58%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
             +A  EE+ E+ +   EE EEE E  +++ +E +E  EE +EE E  +E     +EE E
Sbjct: 723 RELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIE 782

Query: 226 EVVKQKKNEEEEEEE 240
           E+ ++++  +EE EE
Sbjct: 783 ELEEKRQALQEELEE 797



 Score = 30.5 bits (69), Expect = 0.93
 Identities = 26/76 (34%), Positives = 39/76 (51%)

Query: 177 EDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEE 236
           E+E+EE E E EE  ++ +  EEE E  E+      E+ E   E+ +  +   ++  EE 
Sbjct: 743 EEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEEL 802

Query: 237 EEEEEVLIKLEEEEEN 252
           EE E  L  LE E E+
Sbjct: 803 EEAERRLDALERELES 818



 Score = 30.5 bits (69), Expect = 0.96
 Identities = 27/90 (30%), Positives = 51/90 (56%), Gaps = 8/90 (8%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEE---EEEKKEEDEIEKEEGENEEEEEE-- 225
            E + + +++EEE    + E++  ++  EE     EE +++ +E+++E    EEE E+  
Sbjct: 677 AELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQ 736

Query: 226 ---EVVKQKKNEEEEEEEEVLIKLEEEEEN 252
              E ++++  E EEE EE+  +LEE EE 
Sbjct: 737 SRLEELEEELEELEEELEELQERLEELEEE 766



 Score = 30.5 bits (69), Expect = 1.0
 Identities = 28/83 (33%), Positives = 48/83 (57%), Gaps = 1/83 (1%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEE-EEEEVVK 229
           EEE +  +E   +  EE EE+++K+   +EE EE ++E +E E+     E E E  E  +
Sbjct: 764 EEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRR 823

Query: 230 QKKNEEEEEEEEVLIKLEEEEEN 252
           ++  +E EE EE + +LEE+ + 
Sbjct: 824 ERLEQEIEELEEEIEELEEKLDE 846



 Score = 30.1 bits (68), Expect = 1.3
 Identities = 29/80 (36%), Positives = 52/80 (65%), Gaps = 1/80 (1%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           +E +KE +E+EEE +  EEE+++ + + EE E+E ++ + E+E+   E  EE +EE+++ 
Sbjct: 235 KELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELR-EELEELQEELLEL 293

Query: 231 KKNEEEEEEEEVLIKLEEEE 250
           K+  EE E E  L++   EE
Sbjct: 294 KEEIEELEGEISLLRERLEE 313



 Score = 30.1 bits (68), Expect = 1.3
 Identities = 24/88 (27%), Positives = 44/88 (50%), Gaps = 6/88 (6%)

Query: 171 EEEKKEEDEIEEEENE------EEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEE 224
            E ++  +E+EE + E      E EE++ + +   E  EE + E +E+E+   E +E+ E
Sbjct: 274 SELEELREELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIE 333

Query: 225 EEVVKQKKNEEEEEEEEVLIKLEEEEEN 252
               + ++ E   EE E L+   EE + 
Sbjct: 334 ALKEELEERETLLEELEQLLAELEEAKE 361



 Score = 29.7 bits (67), Expect = 1.6
 Identities = 27/81 (33%), Positives = 44/81 (54%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
            E+++ EDE++E E E+EE  ++ +  E E  E +++ E   E+ E    + E  EV   
Sbjct: 870 AEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEELEAKLERLEVELP 929

Query: 231 KKNEEEEEEEEVLIKLEEEEE 251
           +  EE EEE E  ++ E E E
Sbjct: 930 ELEEELEEEYEDTLETELERE 950



 Score = 29.3 bits (66), Expect = 2.1
 Identities = 26/86 (30%), Positives = 45/86 (52%), Gaps = 5/86 (5%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEE-----EEE 225
           EE +++   + EE  E  E ++++  + E E  E + E +E+++E    EE      E  
Sbjct: 361 EELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIESLEERLERLSERL 420

Query: 226 EVVKQKKNEEEEEEEEVLIKLEEEEE 251
           E +K++  E E E EE+  +LEE  E
Sbjct: 421 EDLKEELKELEAELEELQTELEELNE 446



 Score = 29.3 bits (66), Expect = 2.3
 Identities = 29/82 (35%), Positives = 49/82 (59%), Gaps = 3/82 (3%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           E  ++  + +E+E  E EEE+++ + K +E EEE ++ E E+E+ + E EE E E   K+
Sbjct: 817 ESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAE---KE 873

Query: 231 KKNEEEEEEEEVLIKLEEEEEN 252
           +  +E +E EE   +LEEE   
Sbjct: 874 ELEDELKELEEEKEELEEELRE 895



 Score = 28.1 bits (63), Expect = 5.6
 Identities = 19/70 (27%), Positives = 36/70 (51%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
            E + E  E++EE  +  E +++ + K E  E E  + E+E+E+E  +  E E E  +++
Sbjct: 894 RELESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEYEDTLETELEREIER 953

Query: 231 KKNEEEEEEE 240
            + E E    
Sbjct: 954 LEEEIEALGP 963



 Score = 27.8 bits (62), Expect = 6.8
 Identities = 23/81 (28%), Positives = 41/81 (50%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           EE ++ E E+ E + E E+  ++ +  E + E  E +  +  E+ E E E+  E E+ ++
Sbjct: 891 EELRELESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEYEDTLETELERE 950

Query: 231 KKNEEEEEEEEVLIKLEEEEE 251
            +  EEE E    + L   EE
Sbjct: 951 IERLEEEIEALGPVNLRAIEE 971


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
           Rpc31.  RNA polymerase III contains seventeen subunits
           in yeasts and in human cells. Twelve of these are akin
           to RNA polymerase I or II and the other five are RNA pol
           III-specific, and form the functionally distinct groups
           (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
           Rpc34 and Rpc82 form a cluster of enzyme-specific
           subunits that contribute to transcription initiation in
           S.cerevisiae and H.sapiens. There is evidence that these
           subunits are anchored at or near the N-terminal Zn-fold
           of Rpc1, itself prolonged by a highly conserved but RNA
           polymerase III-specific domain.
          Length = 221

 Score = 37.8 bits (88), Expect = 0.002
 Identities = 16/55 (29%), Positives = 42/55 (76%)

Query: 173 EKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEV 227
             +EE++I+E+ +  E+++K+ + ++ +EE+E+ +EE+E E+EE E+ ++++++ 
Sbjct: 142 FTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDD 196



 Score = 35.9 bits (83), Expect = 0.009
 Identities = 21/68 (30%), Positives = 42/68 (61%), Gaps = 6/68 (8%)

Query: 174 KKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKN 233
           K++     EEE + +E++   + K +E E E+  EEDE ++EE E EEEE+E+       
Sbjct: 136 KRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDF------ 189

Query: 234 EEEEEEEE 241
           ++++++++
Sbjct: 190 DDDDDDDD 197



 Score = 35.1 bits (81), Expect = 0.019
 Identities = 19/56 (33%), Positives = 36/56 (64%)

Query: 197 KEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEEN 252
            EEEE+ +EK    E + +E E E+ +EE+   +++ EEEEEE+E     ++++++
Sbjct: 143 TEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDDDD 198



 Score = 33.2 bits (76), Expect = 0.093
 Identities = 18/53 (33%), Positives = 31/53 (58%)

Query: 164 FWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEE 216
           F E     +EK    E + +E E E+  ++ +  EEEEEEEE+++ED  + ++
Sbjct: 142 FTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDD 194



 Score = 32.0 bits (73), Expect = 0.22
 Identities = 18/57 (31%), Positives = 33/57 (57%)

Query: 161 IFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEG 217
           +F   E    E+    E +++E E E+ +E  +K  +EEEEEEEE ++ D+ + ++ 
Sbjct: 141 LFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDDD 197



 Score = 31.3 bits (71), Expect = 0.33
 Identities = 17/54 (31%), Positives = 34/54 (62%), Gaps = 4/54 (7%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGEN 219
            +      +K+  E+E E+ +EE+E    K++EEEEEEEE+ E+ + + ++ ++
Sbjct: 149 IDEKLSMLEKKLKELEAEDVDEEDE----KDEEEEEEEEEEDEDFDDDDDDDDD 198



 Score = 31.3 bits (71), Expect = 0.35
 Identities = 15/52 (28%), Positives = 31/52 (59%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEG 217
           E+I  +    E+   E E  + +EE ++ + +EEEEEEE++  +D+ + ++ 
Sbjct: 147 EDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDDDD 198



 Score = 27.0 bits (60), Expect = 8.8
 Identities = 15/49 (30%), Positives = 33/49 (67%), Gaps = 2/49 (4%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGEN 219
           ++ K+ E E  +EE+E++EE  +++ +EEE+E+ +  ++D+ +    EN
Sbjct: 158 KKLKELEAEDVDEEDEKDEE--EEEEEEEEDEDFDDDDDDDDDDYNAEN 204


>gnl|CDD|237258 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
          Length = 925

 Score = 38.9 bits (91), Expect = 0.002
 Identities = 22/93 (23%), Positives = 41/93 (44%), Gaps = 11/93 (11%)

Query: 159 PEIFKFWENIAGEEEKKEEDEIEEEENEEEEEV----KQKKNKEEEEEEEEKKEEDEIEK 214
           PE  +       +  K+E      E  E EEE+    + K    E + + +K  +  I K
Sbjct: 833 PEELELINEE-QKNLKQEIKLELSEIQEAEEEIQNINENKNEFVEFKNDPKKLNKLIIAK 891

Query: 215 E------EGENEEEEEEEVVKQKKNEEEEEEEE 241
           +         +E +++E+  K+KK + E+ +EE
Sbjct: 892 DVLIKLVISSDEIKQDEKTTKKKKKDLEKTDEE 924



 Score = 35.0 bits (81), Expect = 0.033
 Identities = 20/91 (21%), Positives = 38/91 (41%), Gaps = 1/91 (1%)

Query: 163 KFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKE-EEEEEEEKKEEDEIEKEEGENEE 221
           +  E I    E +EE         EE  ++ ++ +   EE++  K+E      E  E EE
Sbjct: 803 RPQELIDQIIESEEERLKALRIQREEMLMRPEELELINEEQKNLKQEIKLELSEIQEAEE 862

Query: 222 EEEEEVVKQKKNEEEEEEEEVLIKLEEEEEN 252
           E +     + +  E + + + L KL   ++ 
Sbjct: 863 EIQNINENKNEFVEFKNDPKKLNKLIIAKDV 893



 Score = 28.5 bits (64), Expect = 4.2
 Identities = 21/87 (24%), Positives = 36/87 (41%), Gaps = 3/87 (3%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           E I    E   +     + N+  +E+  +  + EEE  +  + + E   E     EE E 
Sbjct: 782 EKILIITEILSDGINNSDINDRPQELIDQIIESEEERLKALRIQRE---EMLMRPEELEL 838

Query: 226 EVVKQKKNEEEEEEEEVLIKLEEEEEN 252
              +QK  ++E + E   I+  EEE  
Sbjct: 839 INEEQKNLKQEIKLELSEIQEAEEEIQ 865


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 38.3 bits (89), Expect = 0.003
 Identities = 20/76 (26%), Positives = 44/76 (57%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           E    E+ K+E+ + +E+  EE ++ K K+  +E+   +EK++E E + EE  + EEE++
Sbjct: 106 EEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKK 165

Query: 226 EVVKQKKNEEEEEEEE 241
               + K+  ++  ++
Sbjct: 166 RERVRAKSRPKKPPKK 181



 Score = 37.2 bits (86), Expect = 0.006
 Identities = 27/86 (31%), Positives = 50/86 (58%), Gaps = 3/86 (3%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEK---KEEDEIEKEEGENEEEEEE 225
              +E K E   EEE+ +E+ + ++KK KE+ +EE +    KEE + ++   E E+E+E+
Sbjct: 93  KPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEK 152

Query: 226 EVVKQKKNEEEEEEEEVLIKLEEEEE 251
           +V + +  EEE++ E V  K   ++ 
Sbjct: 153 KVEEPRDREEEKKRERVRAKSRPKKP 178



 Score = 33.7 bits (77), Expect = 0.077
 Identities = 19/72 (26%), Positives = 39/72 (54%)

Query: 170 GEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVK 229
           G  +         +E + E   +++K KE+ +EE++KK+E   E+ +    +EE +E   
Sbjct: 83  GGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRP 142

Query: 230 QKKNEEEEEEEE 241
            K+ E+E+E++ 
Sbjct: 143 PKEKEKEKEKKV 154



 Score = 33.7 bits (77), Expect = 0.090
 Identities = 21/82 (25%), Positives = 44/82 (53%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           ++ K +E+  E+   +E+E+ K+KK +E  + EEEKK E    K   +   +++    K+
Sbjct: 129 KDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKK 188

Query: 231 KKNEEEEEEEEVLIKLEEEEEN 252
           +  EEE++ +     ++ + E 
Sbjct: 189 EPPEEEKQRQAAREAVKGKPEE 210



 Score = 32.9 bits (75), Expect = 0.13
 Identities = 17/75 (22%), Positives = 45/75 (60%)

Query: 167 NIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEE 226
              G +    + +  +E   E  + ++K+ ++ +EE+++KKE+ + E ++ + +EE +E+
Sbjct: 81  EKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEK 140

Query: 227 VVKQKKNEEEEEEEE 241
              ++K +E+E++ E
Sbjct: 141 RPPKEKEKEKEKKVE 155



 Score = 32.6 bits (74), Expect = 0.21
 Identities = 17/81 (20%), Positives = 43/81 (53%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
            +EK++E E + EE  + EE K+++    +   ++  ++    K++   EEE++ +  ++
Sbjct: 143 PKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAARE 202

Query: 231 KKNEEEEEEEEVLIKLEEEEE 251
               + EE +    + +EE++
Sbjct: 203 AVKGKPEEPDVNEEREKEEDD 223



 Score = 32.2 bits (73), Expect = 0.25
 Identities = 17/70 (24%), Positives = 40/70 (57%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
            EE+K+ + +  +   ++   K+  NK++E  EEEK+ +   E  +G+ EE +  E  ++
Sbjct: 160 REEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREK 219

Query: 231 KKNEEEEEEE 240
           ++++ ++ E 
Sbjct: 220 EEDDGKDRET 229



 Score = 30.2 bits (68), Expect = 0.94
 Identities = 24/84 (28%), Positives = 43/84 (51%), Gaps = 4/84 (4%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENE----EEEEEEV 227
           E K   DE  +   +   +    K K  +E + E  +E+E EKE+ + E    +E+ +E 
Sbjct: 68  ESKLSSDEAVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEE 127

Query: 228 VKQKKNEEEEEEEEVLIKLEEEEE 251
            K +K +EE +E+    + E+E+E
Sbjct: 128 PKDRKPKEEAKEKRPPKEKEKEKE 151



 Score = 30.2 bits (68), Expect = 0.96
 Identities = 21/81 (25%), Positives = 43/81 (53%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           E+EK++E ++EE  + EEE+ +++   +   ++  KK+    +KE  E E++ +      
Sbjct: 145 EKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAV 204

Query: 231 KKNEEEEEEEEVLIKLEEEEE 251
           K   EE +  E   K E++ +
Sbjct: 205 KGKPEEPDVNEEREKEEDDGK 225



 Score = 27.9 bits (62), Expect = 6.1
 Identities = 20/83 (24%), Positives = 39/83 (46%), Gaps = 5/83 (6%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEE-----VKQKKNKEEEEEEEEKKEEDEIEKEEGENE 220
           E    +E+K EE    EEE + E        K+   K+   +++E  EE++  +   E  
Sbjct: 145 EKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAV 204

Query: 221 EEEEEEVVKQKKNEEEEEEEEVL 243
           + + EE    ++ E+EE++ +  
Sbjct: 205 KGKPEEPDVNEEREKEEDDGKDR 227


>gnl|CDD|189762 pfam00901, Orbi_VP5, Orbivirus outer capsid protein VP5.
           cryoelectron microscopy indicates that VP5 is a trimer
           implying that there are 360 copies of VP5 per virion.
          Length = 507

 Score = 38.1 bits (89), Expect = 0.003
 Identities = 27/90 (30%), Positives = 48/90 (53%), Gaps = 15/90 (16%)

Query: 171 EEEKKEE-------DEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEK------EEG 217
           E E+KEE        +I E+  E+ EEV +    E + EEEE+K+ + +EK      +  
Sbjct: 97  EREQKEEEVREKHNKKIIEKFGEDLEEVYKFMKGEAKVEEEEEKQMEILEKALKSYLKIV 156

Query: 218 ENEEEEEEEVVK--QKKNEEEEEEEEVLIK 245
           + E +  + + K  QK++EE  ++E  +I+
Sbjct: 157 KEENKSLQRLAKALQKESEERTQDETKMIE 186


>gnl|CDD|237284 PRK13108, PRK13108, prolipoprotein diacylglyceryl transferase;
           Reviewed.
          Length = 460

 Score = 38.0 bits (88), Expect = 0.003
 Identities = 18/90 (20%), Positives = 30/90 (33%), Gaps = 1/90 (1%)

Query: 153 SAPSYSPEIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEI 212
           +A S      +  E+    EE  E D IE E+  +        ++ + E      EE   
Sbjct: 345 AAESVVQVADRDGESTPAVEETSEAD-IEREQPGDLAGQAPAAHQVDAEAASAAPEEPAA 403

Query: 213 EKEEGENEEEEEEEVVKQKKNEEEEEEEEV 242
              E  +E E E         +  + +E  
Sbjct: 404 LASEAHDETEPEVPEKAAPIPDPAKPDELA 433



 Score = 31.5 bits (71), Expect = 0.47
 Identities = 14/75 (18%), Positives = 27/75 (36%), Gaps = 1/75 (1%)

Query: 168 IAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEE-EEEKKEEDEIEKEEGENEEEEEEE 226
           +A E   +  D   E     EE  +    +E+  +   +     +++ E      EE   
Sbjct: 344 VAAESVVQVADRDGESTPAVEETSEADIEREQPGDLAGQAPAAHQVDAEAASAAPEEPAA 403

Query: 227 VVKQKKNEEEEEEEE 241
           +  +  +E E E  E
Sbjct: 404 LASEAHDETEPEVPE 418


>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain.  This is a family of proteins of
           approximately 300 residues, found in plants and
           vertebrates. They contain a highly conserved DDRGK
           motif.
          Length = 189

 Score = 37.0 bits (86), Expect = 0.003
 Identities = 31/77 (40%), Positives = 44/77 (57%)

Query: 176 EEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEE 235
           EE+E EE +  EE+   ++K +EE EEE EKK+E+E  KE  E   +E+EE  K K +  
Sbjct: 24  EEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQEEYEKLKSSFV 83

Query: 236 EEEEEEVLIKLEEEEEN 252
            EEE    +  +EE   
Sbjct: 84  VEEEGTDKLSADEESNE 100



 Score = 34.3 bits (79), Expect = 0.030
 Identities = 28/67 (41%), Positives = 44/67 (65%)

Query: 186 EEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIK 245
            EE++ ++++ + EEEE EE+K+ +E  + E + EEE EEE  K+K+ EE +E EE   K
Sbjct: 11  LEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARK 70

Query: 246 LEEEEEN 252
            +EE E 
Sbjct: 71  EQEEYEK 77



 Score = 34.3 bits (79), Expect = 0.030
 Identities = 23/78 (29%), Positives = 49/78 (62%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           E E++++ E + E   +EEE  +++ ++++EEEE K+ E++  KE+ E E+ +   VV++
Sbjct: 27  EREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQEEYEKLKSSFVVEE 86

Query: 231 KKNEEEEEEEEVLIKLEE 248
           +  ++   +EE    LE+
Sbjct: 87  EGTDKLSADEESNELLED 104



 Score = 33.5 bits (77), Expect = 0.050
 Identities = 33/84 (39%), Positives = 47/84 (55%), Gaps = 3/84 (3%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
           A  EEK+   +  E E EE EE   +K  EE+ E E K+EE+  E+ E + EEEE +E  
Sbjct: 9   AKLEEKQARRQQREAEEEEREE---RKKLEEKREGERKEEEELEEEREKKKEEEERKERE 65

Query: 229 KQKKNEEEEEEEEVLIKLEEEEEN 252
           +Q + E+EE E+     + EEE  
Sbjct: 66  EQARKEQEEYEKLKSSFVVEEEGT 89



 Score = 32.7 bits (75), Expect = 0.11
 Identities = 28/73 (38%), Positives = 45/73 (61%)

Query: 179 EIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEE 238
           +I  ++  + EE + ++ + E EEEE ++ +   EK EGE +EEEE E  ++KK EEEE 
Sbjct: 2   KIGAKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEER 61

Query: 239 EEEVLIKLEEEEE 251
           +E      +E+EE
Sbjct: 62  KEREEQARKEQEE 74



 Score = 30.1 bits (68), Expect = 0.84
 Identities = 21/64 (32%), Positives = 37/64 (57%)

Query: 163 KFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEE 222
           K  E   GE +++EE E E E+ +EEEE K+++ +  +E+EE +K +     EE   ++ 
Sbjct: 33  KLEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQEEYEKLKSSFVVEEEGTDKL 92

Query: 223 EEEE 226
             +E
Sbjct: 93  SADE 96



 Score = 29.3 bits (66), Expect = 1.4
 Identities = 25/61 (40%), Positives = 37/61 (60%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           E     EEK+E +  EEEE EEE E K+++ + +E EE+ +KE++E EK +     EEE 
Sbjct: 29  EERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQEEYEKLKSSFVVEEEG 88

Query: 226 E 226
            
Sbjct: 89  T 89



 Score = 27.4 bits (61), Expect = 6.5
 Identities = 21/59 (35%), Positives = 36/59 (61%)

Query: 194 KKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEEN 252
            K + + EE++ ++++ E E+EE E  ++ EE+   ++K EEE EEE    K EEE + 
Sbjct: 5   AKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKE 63


>gnl|CDD|221804 pfam12847, Methyltransf_18, Methyltransferase domain.  Protein in
           this family function as methyltransferases.
          Length = 104

 Score = 35.7 bits (83), Expect = 0.004
 Identities = 24/108 (22%), Positives = 41/108 (37%), Gaps = 21/108 (19%)

Query: 49  AGKTVLDVGTGTGKSIL--------LQGHGVDHHSFC---TKVHPLDHYAPQYLILEVIQ 97
            G  VLD+G GTG   +         +  GVD         + +      P+   +  +Q
Sbjct: 1   PGARVLDIGCGTGSLAIELARLFPGARVTGVDLSPEMLELARENAKLALGPR---ITFVQ 57

Query: 98  NKI-ENVELPEKVDIIVSEWMGFYLLHESMIDSVIFARDKFLKPEGVM 144
               + ++L E  D +     G  LL   ++D++       LKP G +
Sbjct: 58  GDAPDALDLLEGFDAVFIGGGGGDLLE--LLDALA----SLLKPGGRL 99


>gnl|CDD|236410 PRK09202, nusA, transcription elongation factor NusA; Validated.
          Length = 470

 Score = 37.9 bits (89), Expect = 0.004
 Identities = 27/85 (31%), Positives = 41/85 (48%), Gaps = 3/85 (3%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
              EE  E +  +EE  EE  E + K+  E E   +E+K  D++   EG + E   +   
Sbjct: 386 VPVEELLEIEGFDEETVEELRE-RAKEALETEALAQEEKLADDLLSLEGLDRELAFKLAE 444

Query: 229 KQKKNEEEEEEEEV--LIKLEEEEE 251
           K  K  E+  E+ V  LI +E +EE
Sbjct: 445 KGIKTLEDLAEQAVDELIDIEGDEE 469



 Score = 34.5 bits (80), Expect = 0.041
 Identities = 21/92 (22%), Positives = 41/92 (44%), Gaps = 6/92 (6%)

Query: 160 EIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKK-EEDEIEKEEGE 218
           E     E IA    ++    +EE      EE+ + +  +EE  EE ++  ++ +E E   
Sbjct: 360 EALDIDEEIAQLLVEEGFSSLEELAYVPVEELLEIEGFDEETVEELRERAKEALETEALA 419

Query: 219 NEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEE 250
            EE+  ++++       E  + E+  KL E+ 
Sbjct: 420 QEEKLADDLL-----SLEGLDRELAFKLAEKG 446



 Score = 32.1 bits (74), Expect = 0.23
 Identities = 20/89 (22%), Positives = 36/89 (40%), Gaps = 7/89 (7%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQ-KKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE---- 225
            E     D   E  + +EE  +   +      EE      +E+ + EG +EE  EE    
Sbjct: 349 AEFDAILDLFMEALDIDEEIAQLLVEEGFSSLEELAYVPVEELLEIEGFDEETVEELRER 408

Query: 226 --EVVKQKKNEEEEEEEEVLIKLEEEEEN 252
             E ++ +   +EE+  + L+ LE  +  
Sbjct: 409 AKEALETEALAQEEKLADDLLSLEGLDRE 437



 Score = 29.1 bits (66), Expect = 2.4
 Identities = 13/56 (23%), Positives = 25/56 (44%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEE 221
                +EEK  +D +  E  + E   K  +   +  E+  ++  DE+   EG+ E+
Sbjct: 415 TEALAQEEKLADDLLSLEGLDRELAFKLAEKGIKTLEDLAEQAVDELIDIEGDEEK 470


>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
           biogenesis [Translation, ribosomal structure and
           biogenesis].
          Length = 1077

 Score = 38.2 bits (88), Expect = 0.004
 Identities = 28/104 (26%), Positives = 49/104 (47%), Gaps = 8/104 (7%)

Query: 155 PSYSPEIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEK 214
            S   E+ K W ++A  + + ++D   +   E EEE+ Q   K   E+ E+++   + E 
Sbjct: 560 ESEFEELKKKWSSLAQLKSRFQKDATLDS-IEGEEELIQDDEKGNFEDLEDEENSSDNEM 618

Query: 215 EE-------GENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEE 251
           EE        ENEE  +E   + ++ E   ++EE+    E EE 
Sbjct: 619 EESRGSSVTAENEESADEVDYETEREENARKKEELRGNFELEER 662



 Score = 33.2 bits (75), Expect = 0.15
 Identities = 23/91 (25%), Positives = 39/91 (42%), Gaps = 7/91 (7%)

Query: 167 NIAGEEEKKEEDEIEEEE-------NEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGEN 219
           N    E+++   + E EE        E EE   +   + E EE   KKEE     E  E 
Sbjct: 603 NFEDLEDEENSSDNEMEESRGSSVTAENEESADEVDYETEREENARKKEELRGNFELEER 662

Query: 220 EEEEEEEVVKQKKNEEEEEEEEVLIKLEEEE 250
            + E+++V    + + + EE+  + + E E 
Sbjct: 663 GDPEKKDVDWYTEEKRKIEEQLKINRSEFET 693



 Score = 29.3 bits (65), Expect = 2.2
 Identities = 18/78 (23%), Positives = 35/78 (44%)

Query: 167 NIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEE 226
           ++  E E+  ++   E E EE    K++     E EE    E+ +++    E  + EE+ 
Sbjct: 625 SVTAENEESADEVDYETEREENARKKEELRGNFELEERGDPEKKDVDWYTEEKRKIEEQL 684

Query: 227 VVKQKKNEEEEEEEEVLI 244
            + + + E    E  V+I
Sbjct: 685 KINRSEFETMVPESRVVI 702



 Score = 27.8 bits (61), Expect = 7.7
 Identities = 15/66 (22%), Positives = 35/66 (53%), Gaps = 7/66 (10%)

Query: 168  IAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEV 227
            +    E +E+ EI++        + +++ K++EE+E  +  +   E+E G+ E+E E+ +
Sbjct: 1005 LPVPPECREKHEIKDR-------IVKERIKDQEEKERMESLQRAKEEEIGKKEKEREQRI 1057

Query: 228  VKQKKN 233
             K   +
Sbjct: 1058 RKTIHD 1063


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
           subunit.  This is a family of proteins which are
           subunits of the eukaryotic translation initiation factor
           3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
           cerevisiae protein eIF3j (HCR1) has been shown to be
           required for processing of 20S pre-rRNA and binds to 18S
           rRNA and eIF3 subunits Rpg1p and Prt1p.
          Length = 242

 Score = 36.9 bits (86), Expect = 0.005
 Identities = 23/81 (28%), Positives = 51/81 (62%), Gaps = 2/81 (2%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           ++E +++D  +  + EE+EE +++K K   + + +K  + +IE++E    E+EE+ + + 
Sbjct: 25  DDEDEDDDVKDSWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLREL 84

Query: 231 KKNEEEEE--EEEVLIKLEEE 249
           +++  E+E  E+  L KL+EE
Sbjct: 85  EEDTPEDELAEKLRLRKLQEE 105



 Score = 31.9 bits (73), Expect = 0.26
 Identities = 19/82 (23%), Positives = 38/82 (46%), Gaps = 7/82 (8%)

Query: 178 DEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEV-------VKQ 230
           D+ E E      +   K   ++E+E+++ K+  + E++E + EE+ +          +K 
Sbjct: 5   DDEEFEPPAPPAKAVVKDKWDDEDEDDDVKDSWDEEEDEEKEEEKAKVAAKAKAKKALKA 64

Query: 231 KKNEEEEEEEEVLIKLEEEEEN 252
           K  E+E+ + E   K   E E 
Sbjct: 65  KIEEKEKAKREKEEKGLRELEE 86



 Score = 28.1 bits (63), Expect = 4.7
 Identities = 16/73 (21%), Positives = 36/73 (49%), Gaps = 10/73 (13%)

Query: 184 ENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEE------GENEEEEEEEVVKQKKNEEEE 237
           ++E+E++      K+  +EEE++++E+E  K           + + EE+   +++ EE+ 
Sbjct: 25  DDEDEDD----DVKDSWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKG 80

Query: 238 EEEEVLIKLEEEE 250
             E      E+E 
Sbjct: 81  LRELEEDTPEDEL 93


>gnl|CDD|227447 COG5117, NOC3, Protein involved in the nuclear export of
           pre-ribosomes [Translation, ribosomal structure and
           biogenesis / Intracellular trafficking and secretion].
          Length = 657

 Score = 37.3 bits (86), Expect = 0.006
 Identities = 21/78 (26%), Positives = 35/78 (44%), Gaps = 2/78 (2%)

Query: 175 KEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNE 234
           K +D +  + N  EE     +++ E   E +   +DE +K   E +   E  V +Q  +E
Sbjct: 79  KTKDNVVADVNNGEE--FLSESESEASLEIDSDIKDEKQKSLEEQKIAPEIPVKQQIDSE 136

Query: 235 EEEEEEEVLIKLEEEEEN 252
           +E         +EE EEN
Sbjct: 137 KERIASICTKIIEEPEEN 154



 Score = 30.0 bits (67), Expect = 1.4
 Identities = 15/68 (22%), Positives = 32/68 (47%), Gaps = 5/68 (7%)

Query: 170 GEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVK 229
           GEE   E +     E + +      K+++++  EE+K   +   K++ ++E+E    +  
Sbjct: 91  GEEFLSESESEASLEIDSDI-----KDEKQKSLEEQKIAPEIPVKQQIDSEKERIASICT 145

Query: 230 QKKNEEEE 237
           +   E EE
Sbjct: 146 KIIEEPEE 153


>gnl|CDD|223616 COG0542, clpA, ATP-binding subunits of Clp protease and DnaK/DnaJ
           chaperones [Posttranslational modification, protein
           turnover, chaperones].
          Length = 786

 Score = 37.3 bits (87), Expect = 0.006
 Identities = 25/88 (28%), Positives = 47/88 (53%), Gaps = 4/88 (4%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           E +   E +  + EIE+E  E E++ K+KK  +E  + +E +  +  ++ E E +E++  
Sbjct: 401 EELDELERELAQLEIEKEALEREQDEKEKKLIDEIIKLKEGRIPELEKELEAEVDEDDIA 460

Query: 226 EVVKQ----KKNEEEEEEEEVLIKLEEE 249
           EVV +       +  E+E+E L+ LE  
Sbjct: 461 EVVARWTGIPVAKLLEDEKEKLLNLERR 488



 Score = 28.8 bits (65), Expect = 2.9
 Identities = 17/66 (25%), Positives = 40/66 (60%), Gaps = 2/66 (3%)

Query: 179 EIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKE--EGENEEEEEEEVVKQKKNEEE 236
            +E ++ EE +E++++  + E E+E  ++E+DE EK+  +   + +E      +K+ E E
Sbjct: 394 RLEIDKPEELDELERELAQLEIEKEALEREQDEKEKKLIDEIIKLKEGRIPELEKELEAE 453

Query: 237 EEEEEV 242
            +E+++
Sbjct: 454 VDEDDI 459


>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
           subunit G; Reviewed.
          Length = 197

 Score = 36.6 bits (84), Expect = 0.006
 Identities = 20/70 (28%), Positives = 45/70 (64%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQK 231
           +E+ E+   E EE EE  E ++++ K +E E++E++E +  E+   E  ++ E E+++Q 
Sbjct: 125 QEQIEKARQEREELEERMEWERREEKIDEREDQEEQEREREEQTIEEQSDDSEHEIIEQD 184

Query: 232 KNEEEEEEEE 241
           ++E E ++++
Sbjct: 185 ESETESDDDK 194



 Score = 32.0 bits (72), Expect = 0.20
 Identities = 19/63 (30%), Positives = 39/63 (61%)

Query: 188 EEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLE 247
           +E++++ + + EE EE  + E  E + +E E++EE+E E  +Q   E+ ++ E  +I+ +
Sbjct: 125 QEQIEKARQEREELEERMEWERREEKIDEREDQEEQEREREEQTIEEQSDDSEHEIIEQD 184

Query: 248 EEE 250
           E E
Sbjct: 185 ESE 187



 Score = 29.6 bits (66), Expect = 1.2
 Identities = 17/58 (29%), Positives = 34/58 (58%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEE 223
           E +  E  +++ DE E++E +E E  +Q   ++ ++ E E  E+DE E E  +++ E+
Sbjct: 140 ERMEWERREEKIDEREDQEEQEREREEQTIEEQSDDSEHEIIEQDESETESDDDKTEK 197



 Score = 28.1 bits (62), Expect = 3.8
 Identities = 19/66 (28%), Positives = 41/66 (62%), Gaps = 1/66 (1%)

Query: 187 EEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKL 246
           +++++K+KK+    +E+ EK  ++  E EE   E E  EE + +++++EE+E E     +
Sbjct: 111 KKDDIKKKKSLIIRQEQIEKARQEREELEE-RMEWERREEKIDEREDQEEQEREREEQTI 169

Query: 247 EEEEEN 252
           EE+ ++
Sbjct: 170 EEQSDD 175


>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO.  Bacillus
           spores are protected by a protein shell consisting of
           over 50 different polypeptides, known as the coat. This
           family of proteins has an important morphogenetic role
           in coat assembly, it is involved in the assembly of at
           least 5 different coat proteins including CotB, CotG,
           CotS, CotSA and CotW. It is likely to act at a late
           stage of coat assembly.
          Length = 185

 Score = 36.3 bits (84), Expect = 0.007
 Identities = 26/69 (37%), Positives = 44/69 (63%)

Query: 182 EEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEE 241
            +E EEE+E   +  K +EEE++ + EE E EKE GE E E+  E  ++++  +EEE+EE
Sbjct: 36  ADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEEEKEE 95

Query: 242 VLIKLEEEE 250
               ++++E
Sbjct: 96  EAEDVKQQE 104



 Score = 34.0 bits (78), Expect = 0.032
 Identities = 24/68 (35%), Positives = 45/68 (66%)

Query: 174 KKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKN 233
           K +E E E+E ++E  + K+++ K E EE E++KE  E E+E+   ++E+EE   +++K 
Sbjct: 35  KADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEEEKE 94

Query: 234 EEEEEEEE 241
           EE E+ ++
Sbjct: 95  EEAEDVKQ 102



 Score = 33.3 bits (76), Expect = 0.075
 Identities = 22/68 (32%), Positives = 40/68 (58%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           EEEK+  DE  + + EE++   ++  KE+E  E E+++  E +++E   +EEE+EE  + 
Sbjct: 40  EEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEEEKEEEAED 99

Query: 231 KKNEEEEE 238
            K +E   
Sbjct: 100 VKQQEVFS 107



 Score = 31.0 bits (70), Expect = 0.36
 Identities = 19/71 (26%), Positives = 43/71 (60%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           + +++E+    EE  +E+E  + ++    E++E+E+  ++E ++EE E+ +++E    K+
Sbjct: 51  KSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEEEKEEEAEDVKQQEVFSFKR 110

Query: 231 KKNEEEEEEEE 241
           KK  +E   EE
Sbjct: 111 KKPFKEMNLEE 121



 Score = 31.0 bits (70), Expect = 0.37
 Identities = 23/68 (33%), Positives = 38/68 (55%), Gaps = 4/68 (5%)

Query: 185 NEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLI 244
            +E+EE K+  ++  + +EEE+K    IE EE E E+E  E   +    ++E+EE     
Sbjct: 36  ADEKEEEKENSDEHVKSKEEEQK----IEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEE 91

Query: 245 KLEEEEEN 252
           + EEE E+
Sbjct: 92  EKEEEAED 99



 Score = 30.6 bits (69), Expect = 0.50
 Identities = 25/76 (32%), Positives = 41/76 (53%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           EEE+K E E  E+E E  E  ++   +++E+EE  ++EE E E E+ + +E    +  K 
Sbjct: 54  EEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEEEKEEEAEDVKQQEVFSFKRKKP 113

Query: 231 KKNEEEEEEEEVLIKL 246
            K    EE+ + L  L
Sbjct: 114 FKEMNLEEKIDFLAHL 129



 Score = 29.4 bits (66), Expect = 1.4
 Identities = 19/67 (28%), Positives = 38/67 (56%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           +  + E  K +E+E + E  E E+E +  + + E+  E+++KEE   E+E+ E  E+ ++
Sbjct: 43  KENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEEEKEEEAEDVKQ 102

Query: 226 EVVKQKK 232
           + V   K
Sbjct: 103 QEVFSFK 109


>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
          Length = 470

 Score = 37.0 bits (86), Expect = 0.008
 Identities = 15/77 (19%), Positives = 41/77 (53%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
             E  + E  + +++  ++  + K KK+K++ ++++    E   E +E EN++  +++  
Sbjct: 2   MNENGENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKK 61

Query: 229 KQKKNEEEEEEEEVLIK 245
           K+KK +++   E   + 
Sbjct: 62  KKKKKKKKNLGEAYDLA 78



 Score = 33.5 bits (77), Expect = 0.087
 Identities = 15/83 (18%), Positives = 39/83 (46%), Gaps = 1/83 (1%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
             ++ KK+ D+ +    E   E ++ +NK+  +++++KK++ + +K  GE  +   +  V
Sbjct: 25  KNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKK-KKKKNLGEAYDLAYDLPV 83

Query: 229 KQKKNEEEEEEEEVLIKLEEEEE 251
                  ++      +    E+E
Sbjct: 84  VWSSAAFQDNSHIRKLGNWPEQE 106



 Score = 28.9 bits (65), Expect = 2.7
 Identities = 12/70 (17%), Positives = 35/70 (50%), Gaps = 3/70 (4%)

Query: 175 KEEDEIEEEENEEEEEVKQKKNKEEEE---EEEEKKEEDEIEKEEGENEEEEEEEVVKQK 231
              +  E E  +++++ KQK  K++ +   ++ +  +    E      E E ++   K+K
Sbjct: 1   AMNENGENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKK 60

Query: 232 KNEEEEEEEE 241
           K +++++++ 
Sbjct: 61  KKKKKKKKKN 70


>gnl|CDD|227504 COG5177, COG5177, Uncharacterized conserved protein [Function
           unknown].
          Length = 769

 Score = 37.0 bits (85), Expect = 0.008
 Identities = 23/82 (28%), Positives = 39/82 (47%), Gaps = 3/82 (3%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEG---ENEEEEEEEVV 228
           E+++EED    +E      +     KE + EE    EE  I+  EG    + EEEE ++ 
Sbjct: 394 EDEEEEDGQCNDEESTMSAIDDDDPKENDNEEVAGDEESAIDDNEGFEELSPEEEERQLR 453

Query: 229 KQKKNEEEEEEEEVLIKLEEEE 250
           + +  E+E+ E     +L+  E
Sbjct: 454 EFRDMEKEDREFPDEAELQPSE 475



 Score = 28.5 bits (63), Expect = 3.8
 Identities = 17/60 (28%), Positives = 29/60 (48%), Gaps = 1/60 (1%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
            +   +++ KE D  EE   +EE  +   +  EE   EEE+++  E    E E+ E  +E
Sbjct: 410 MSAIDDDDPKENDN-EEVAGDEESAIDDNEGFEELSPEEEERQLREFRDMEKEDREFPDE 468



 Score = 28.1 bits (62), Expect = 5.4
 Identities = 20/115 (17%), Positives = 39/115 (33%), Gaps = 16/115 (13%)

Query: 153 SAPSYSPEIFKFWENIAGEEEKKEE-----DEIEEEENEEEEEVKQKKNK---------- 197
           +   Y+PE      +   + E         D+       E+   K    K          
Sbjct: 334 TLDEYAPEGEDLRSDYDEDFEYDGLTTVRIDDHGFLPGREQTSKKAAVPKGTSFYQAKWA 393

Query: 198 -EEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEE 251
            +EEEE+ +  +E+       +++ +E +        E   ++ E   +L  EEE
Sbjct: 394 EDEEEEDGQCNDEESTMSAIDDDDPKENDNEEVAGDEESAIDDNEGFEELSPEEE 448



 Score = 27.4 bits (60), Expect = 8.9
 Identities = 19/52 (36%), Positives = 33/52 (63%), Gaps = 1/52 (1%)

Query: 166 ENIAGEEEKKEED-EIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEE 216
           E +AG+EE   +D E  EE + EEEE + ++ ++ E+E+ E  +E E++  E
Sbjct: 424 EEVAGDEESAIDDNEGFEELSPEEEERQLREFRDMEKEDREFPDEAELQPSE 475


>gnl|CDD|223003 PHA03169, PHA03169, hypothetical protein; Provisional.
          Length = 413

 Score = 36.5 bits (84), Expect = 0.009
 Identities = 11/76 (14%), Positives = 22/76 (28%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
            +      +   +E E   +E E +       E        +   +   E      ++  
Sbjct: 164 SSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPPDEPGEPQSPTPQQAP 223

Query: 226 EVVKQKKNEEEEEEEE 241
               Q+  E E+E  E
Sbjct: 224 SPNTQQAVEHEDEPTE 239



 Score = 36.5 bits (84), Expect = 0.010
 Identities = 13/72 (18%), Positives = 22/72 (30%)

Query: 170 GEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVK 229
           G  E    +      N++     Q  +++  EE E    E E +       E        
Sbjct: 145 GPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPP 204

Query: 230 QKKNEEEEEEEE 241
           Q   +E  E + 
Sbjct: 205 QSPPDEPGEPQS 216



 Score = 34.6 bits (79), Expect = 0.036
 Identities = 5/71 (7%), Positives = 9/71 (12%)

Query: 170 GEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVK 229
                       EE            +  E                  E    E      
Sbjct: 100 VGSPTPSPSGSAEELASGLSPENTSGSSPESPASHSPPPSPPSHPGPHEPAPPESHNPSP 159

Query: 230 QKKNEEEEEEE 240
            ++     +  
Sbjct: 160 NQQPSSFLQPS 170



 Score = 33.8 bits (77), Expect = 0.076
 Identities = 8/81 (9%), Positives = 18/81 (22%)

Query: 170 GEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVK 229
               ++    +  E            +             +    E       ++     
Sbjct: 108 SGSAEELASGLSPENTSGSSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFL 167

Query: 230 QKKNEEEEEEEEVLIKLEEEE 250
           Q  +E+  EE E      E +
Sbjct: 168 QPSHEDSPEEPEPPTSEPEPD 188



 Score = 31.9 bits (72), Expect = 0.28
 Identities = 11/87 (12%), Positives = 29/87 (33%)

Query: 154 APSYSPEIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIE 213
           +P+  P  F    +    EE +      E ++    + +   +    +   ++  E +  
Sbjct: 158 SPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPPDEPGEPQSP 217

Query: 214 KEEGENEEEEEEEVVKQKKNEEEEEEE 240
             +       ++ V  + +  E E E 
Sbjct: 218 TPQQAPSPNTQQAVEHEDEPTEPEREG 244



 Score = 31.9 bits (72), Expect = 0.29
 Identities = 9/70 (12%), Positives = 21/70 (30%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQK 231
           E    E E +     + E        +   +E  + +    ++    N ++  E   +  
Sbjct: 179 EPPTSEPEPDSPGPPQSETPTSSPPPQSPPDEPGEPQSPTPQQAPSPNTQQAVEHEDEPT 238

Query: 232 KNEEEEEEEE 241
           + E E     
Sbjct: 239 EPEREGPPFP 248



 Score = 31.9 bits (72), Expect = 0.29
 Identities = 8/66 (12%), Positives = 16/66 (24%)

Query: 176 EEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEE 235
           +     E + E  EE +  + +E  +        + +           EE          
Sbjct: 64  QGHRQTESDTETAEESRHGEKEERGQGGPSGSGSESVGSPTPSPSGSAEELASGLSPENT 123

Query: 236 EEEEEE 241
                E
Sbjct: 124 SGSSPE 129



 Score = 31.5 bits (71), Expect = 0.36
 Identities = 8/71 (11%), Positives = 16/71 (22%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           E       E     +            E    E      ++      +   E+  E  + 
Sbjct: 121 ENTSGSSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEP 180

Query: 231 KKNEEEEEEEE 241
             +E E +   
Sbjct: 181 PTSEPEPDSPG 191



 Score = 31.5 bits (71), Expect = 0.38
 Identities = 6/82 (7%), Positives = 18/82 (21%)

Query: 170 GEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVK 229
                +         +   EE+    + E       +            +     E    
Sbjct: 93  SGSGSESVGSPTPSPSGSAEELASGLSPENTSGSSPESPASHSPPPSPPSHPGPHEPAPP 152

Query: 230 QKKNEEEEEEEEVLIKLEEEEE 251
           +  N    ++    ++   E+ 
Sbjct: 153 ESHNPSPNQQPSSFLQPSHEDS 174



 Score = 31.5 bits (71), Expect = 0.38
 Identities = 9/73 (12%), Positives = 22/73 (30%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
           A  E+   + E + E  EE    ++++  +        +           + EE    + 
Sbjct: 60  AVAEQGHRQTESDTETAEESRHGEKEERGQGGPSGSGSESVGSPTPSPSGSAEELASGLS 119

Query: 229 KQKKNEEEEEEEE 241
            +  +    E   
Sbjct: 120 PENTSGSSPESPA 132



 Score = 31.5 bits (71), Expect = 0.44
 Identities = 8/76 (10%), Positives = 20/76 (26%), Gaps = 1/76 (1%)

Query: 165 WENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEE 224
            EN +G   +                  +    E       ++    ++    E+  EE 
Sbjct: 120 PENTSGSSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQ-PSHEDSPEEP 178

Query: 225 EEVVKQKKNEEEEEEE 240
           E    + + +     +
Sbjct: 179 EPPTSEPEPDSPGPPQ 194



 Score = 31.1 bits (70), Expect = 0.50
 Identities = 9/72 (12%), Positives = 17/72 (23%), Gaps = 2/72 (2%)

Query: 171 EEEKKEEDEIEEEENEE--EEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
                         +E    E      N++     +   E+   E E   +E E +    
Sbjct: 133 SHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGP 192

Query: 229 KQKKNEEEEEEE 240
            Q +        
Sbjct: 193 PQSETPTSSPPP 204



 Score = 30.7 bits (69), Expect = 0.78
 Identities = 7/76 (9%), Positives = 13/76 (17%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
                     E       E+                 E    E       +  +   +  
Sbjct: 111 AEELASGLSPENTSGSSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQPS 170

Query: 226 EVVKQKKNEEEEEEEE 241
                ++ E    E E
Sbjct: 171 HEDSPEEPEPPTSEPE 186



 Score = 29.9 bits (67), Expect = 1.1
 Identities = 8/71 (11%), Positives = 11/71 (15%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
            E   E  E      +EE           E             +E       E       
Sbjct: 69  TESDTETAEESRHGEKEERGQGGPSGSGSESVGSPTPSPSGSAEELASGLSPENTSGSSP 128

Query: 231 KKNEEEEEEEE 241
           +          
Sbjct: 129 ESPASHSPPPS 139



 Score = 29.6 bits (66), Expect = 1.6
 Identities = 8/71 (11%), Positives = 15/71 (21%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
              + E D    EE+   E+ ++ +        E             E            
Sbjct: 65  GHRQTESDTETAEESRHGEKEERGQGGPSGSGSESVGSPTPSPSGSAEELASGLSPENTS 124

Query: 231 KKNEEEEEEEE 241
             + E      
Sbjct: 125 GSSPESPASHS 135



 Score = 29.6 bits (66), Expect = 1.6
 Identities = 8/76 (10%), Positives = 22/76 (28%), Gaps = 4/76 (5%)

Query: 170 GEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEED----EIEKEEGENEEEEEE 225
            E+  +E +    E   +     Q +        +   +E         ++  +   ++ 
Sbjct: 171 HEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPPDEPGEPQSPTPQQAPSPNTQQA 230

Query: 226 EVVKQKKNEEEEEEEE 241
              + +  E E E   
Sbjct: 231 VEHEDEPTEPEREGPP 246



 Score = 29.6 bits (66), Expect = 1.9
 Identities = 10/88 (11%), Positives = 26/88 (29%), Gaps = 5/88 (5%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEE-----EEEKKEEDEIEKEEGENEEEE 223
                 ++     +  +E+  E  +    E E +     + E        +   +   E 
Sbjct: 155 HNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPPDEPGEP 214

Query: 224 EEEVVKQKKNEEEEEEEEVLIKLEEEEE 251
           +    +Q  +   ++  E   +  E E 
Sbjct: 215 QSPTPQQAPSPNTQQAVEHEDEPTEPER 242



 Score = 28.0 bits (62), Expect = 4.6
 Identities = 6/72 (8%), Positives = 12/72 (16%), Gaps = 1/72 (1%)

Query: 170 GEEEKKEEDEIEEEENEEEEEVKQKKNKEE-EEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
             + +  E+    E+ E  +        E             E        E        
Sbjct: 70  ESDTETAEESRHGEKEERGQGGPSGSGSESVGSPTPSPSGSAEELASGLSPENTSGSSPE 129

Query: 229 KQKKNEEEEEEE 240
               +       
Sbjct: 130 SPASHSPPPSPP 141



 Score = 28.0 bits (62), Expect = 5.1
 Identities = 8/71 (11%), Positives = 11/71 (15%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           +EE+ +                      EE       E       E              
Sbjct: 84  KEERGQGGPSGSGSESVGSPTPSPSGSAEELASGLSPENTSGSSPESPASHSPPPSPPSH 143

Query: 231 KKNEEEEEEEE 241
               E    E 
Sbjct: 144 PGPHEPAPPES 154



 Score = 27.2 bits (60), Expect = 9.4
 Identities = 10/62 (16%), Positives = 18/62 (29%), Gaps = 3/62 (4%)

Query: 182 EEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKE---EGENEEEEEEEVVKQKKNEEEEE 238
            +     E+  ++   + E  EE +  E E   +    G   E           + EE  
Sbjct: 56  PQVRAVAEQGHRQTESDTETAEESRHGEKEERGQGGPSGSGSESVGSPTPSPSGSAEELA 115

Query: 239 EE 240
             
Sbjct: 116 SG 117


>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi.  This family includes
           the fifth essential DNA polymerase in yeast EC:2.7.7.7.
           Pol5p is localised exclusively to the nucleolus and
           binds near or at the enhancer region of rRNA-encoding
           DNA repeating units.
          Length = 784

 Score = 36.8 bits (85), Expect = 0.010
 Identities = 23/84 (27%), Positives = 42/84 (50%), Gaps = 2/84 (2%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
            GEEE  +ED++EE +++E+E    + ++ E E + E  EEDE E +   NE     +  
Sbjct: 646 EGEEE--DEDDLEETDDDEDECEAIEDSESESESDGEDGEEDEQEDDAEANEGVVPIDKA 703

Query: 229 KQKKNEEEEEEEEVLIKLEEEEEN 252
            ++   +     + L   + E+E 
Sbjct: 704 VRRALPKVLNLPDALDGGDSEDEE 727



 Score = 32.5 bits (74), Expect = 0.22
 Identities = 19/78 (24%), Positives = 41/78 (52%), Gaps = 2/78 (2%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEE-EEKKEEDEIEKEEGENEEEEEE-EVVK 229
           E K    ++ E E E+E+++++  + E+E E  E+ + E E + E+GE +E+E++ E  +
Sbjct: 636 ENKSRHQQLFEGEEEDEDDLEETDDDEDECEAIEDSESESESDGEDGEEDEQEDDAEANE 695

Query: 230 QKKNEEEEEEEEVLIKLE 247
                ++     +   L 
Sbjct: 696 GVVPIDKAVRRALPKVLN 713



 Score = 32.5 bits (74), Expect = 0.22
 Identities = 18/58 (31%), Positives = 34/58 (58%), Gaps = 1/58 (1%)

Query: 185 NEEEEEVKQKKNKEEEEEEE-EKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEE 241
           +E +   +Q    EEE+E++ E+ ++DE E E  E+ E E E   +  + +E+E++ E
Sbjct: 635 DENKSRHQQLFEGEEEDEDDLEETDDDEDECEAIEDSESESESDGEDGEEDEQEDDAE 692



 Score = 31.0 bits (70), Expect = 0.59
 Identities = 22/76 (28%), Positives = 42/76 (55%), Gaps = 7/76 (9%)

Query: 180 IEEEENEEEEEV----KQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEE 235
           IE+  NE  + +    K  +NK   ++  E +EEDE + EE +++E+E E +   + +E 
Sbjct: 617 IEDVTNEGLQLILDVLKADENKSRHQQLFEGEEEDEDDLEETDDDEDECEAI---EDSES 673

Query: 236 EEEEEEVLIKLEEEEE 251
           E E +    + +E+E+
Sbjct: 674 ESESDGEDGEEDEQED 689


>gnl|CDD|219069 pfam06512, Na_trans_assoc, Sodium ion transport-associated.
           Members of this family contain a region found
           exclusively in eukaryotic sodium channels or their
           subunits, many of which are voltage-gated. Members very
           often also contain between one and four copies of
           pfam00520 and, less often, one copy of pfam00612.
          Length = 230

 Score = 36.2 bits (84), Expect = 0.010
 Identities = 25/75 (33%), Positives = 34/75 (45%), Gaps = 13/75 (17%)

Query: 175 KEEDEIEEEENEEEEEVKQKKNKEEEEE---EEEKKEEDEIEKEEG-----ENEEEEEEE 226
             E ++E   +E+         KEE ++   E  K EE +    EG     E  EEEEEE
Sbjct: 109 SGESDLENLNDEDTSSESSYGFKEESKKGSAETLKLEEKDSSSSEGSTVDLEPPEEEEEE 168

Query: 227 VVKQKKNEEEEEEEE 241
           +      EEEEE +E
Sbjct: 169 IA-----EEEEEVKE 178



 Score = 32.3 bits (74), Expect = 0.15
 Identities = 18/63 (28%), Positives = 29/63 (46%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           EN+  E+   E     +EE+++      K  +++    E    + E  +EE E   EEEE
Sbjct: 115 ENLNDEDTSSESSYGFKEESKKGSAETLKLEEKDSSSSEGSTVDLEPPEEEEEEIAEEEE 174

Query: 226 EVV 228
           EV 
Sbjct: 175 EVK 177


>gnl|CDD|130712 TIGR01651, CobT, cobaltochelatase, CobT subunit.  This model
           describes Pseudomonas denitrificans CobT gene product,
           which is a cobalt chelatase subunit that functions in
           cobalamin biosynthesis. Cobalamin (vitamin B12) can be
           synthesized via several pathways, including an aerobic
           pathway (found in Pseudomonas denitrificans) and an
           anaerobic pathway (found in P. shermanii and Salmonella
           typhimurium). These pathways differ in the point of
           cobalt insertion during corrin ring formation. There are
           apparently a number of variations on these two pathways,
           where the major differences seem to be concerned with
           the process of ring contraction. Confusion regarding the
           functions of enzymes found in the aerobic vs. anaerobic
           pathways has arisen because nonhomologous genes in these
           different pathways were given the same gene symbols.
           Thus, cobT in the aerobic pathway (P. denitrificans) is
           not a homolog of cobT in the anaerobic pathway (S.
           typhimurium). It should be noted that E. coli
           synthesizes cobalamin only when it is supplied with the
           precursor cobinamide, which is a complex intermediate.
           Additionally, all E. coli cobalamin synthesis genes
           (cobU, cobS and cobT) were named after their Salmonella
           typhimurium homologs which function in the anaerobic
           cobalamin synthesis pathway. This model describes the
           aerobic cobalamin pathway Pseudomonas denitrificans CobT
           gene product, which is a cobalt chelatase subunit, with
           a MW ~70 kDa. The aerobic pathway cobalt chelatase is a
           heterotrimeric, ATP-dependent enzyme that catalyzes
           cobalt insertion during cobalamin biosynthesis. The
           other two subunits are the P. denitrificans CobS
           (TIGR01650) and CobN (pfam02514 CobN/Magnesium
           Chelatase) proteins. To avoid potential confusion with
           the nonhomologous Salmonella typhimurium/E.coli cobT
           gene product, the P. denitrificans gene symbol is not
           used in the name of this model [Biosynthesis of
           cofactors, prosthetic groups, and carriers, Heme,
           porphyrin, and cobalamin].
          Length = 600

 Score = 36.5 bits (84), Expect = 0.010
 Identities = 19/73 (26%), Positives = 39/73 (53%), Gaps = 2/73 (2%)

Query: 171 EEEKKEEDEIEEEENEEEE--EVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
           EE   + +  +EE+ ++++  E +Q++  E E E +E     E E  + E+E  EEE V 
Sbjct: 201 EEMGDDTESEDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRESESGEEEMVQ 260

Query: 229 KQKKNEEEEEEEE 241
             + +  +E +++
Sbjct: 261 SDQDDLPDESDDD 273



 Score = 34.1 bits (78), Expect = 0.066
 Identities = 22/80 (27%), Positives = 37/80 (46%), Gaps = 4/80 (5%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEI----EKEEGENEE 221
           E +   E  +E  +  E E+EE+ +  Q    E+EE+ E + E  E     E E  + E 
Sbjct: 192 EMLRSMELAEEMGDDTESEDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRES 251

Query: 222 EEEEEVVKQKKNEEEEEEEE 241
           E  EE + Q   ++  +E +
Sbjct: 252 ESGEEEMVQSDQDDLPDESD 271



 Score = 31.5 bits (71), Expect = 0.40
 Identities = 16/60 (26%), Positives = 29/60 (48%)

Query: 190 EVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEE 249
           E+ ++   + E E+EE  ++D+  + E E + E E E  +    +E E  +      EEE
Sbjct: 198 ELAEEMGDDTESEDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRESESGEEE 257



 Score = 31.1 bits (70), Expect = 0.54
 Identities = 14/59 (23%), Positives = 29/59 (49%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVK 229
           E+E++ E E E +E    +E +    + E  EEE  + + +   +E +++ E   E  +
Sbjct: 224 EQEEQGEGEGEGQEGSAPQESEATDRESESGEEEMVQSDQDDLPDESDDDSETPGEGAR 282


>gnl|CDD|223649 COG0576, GrpE, Molecular chaperone GrpE (heat shock protein)
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 193

 Score = 35.4 bits (82), Expect = 0.012
 Identities = 21/66 (31%), Positives = 42/66 (63%)

Query: 185 NEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLI 244
           +++E++ ++   +E EE E+ ++EE E E+ E ENE EEE++ + + + + EE +++ L 
Sbjct: 2   SDKEQKTEEPDAEETEEAEKSEEEEAEEEEPEEENELEEEQQEIAELEAQLEELKDKYLR 61

Query: 245 KLEEEE 250
              E E
Sbjct: 62  AQAEFE 67



 Score = 32.3 bits (74), Expect = 0.15
 Identities = 20/54 (37%), Positives = 35/54 (64%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEE 224
           +E+K EE + EE E  E+ E ++ + +E EEE E ++E+ EI + E + EE ++
Sbjct: 4   KEQKTEEPDAEETEEAEKSEEEEAEEEEPEEENELEEEQQEIAELEAQLEELKD 57



 Score = 27.7 bits (62), Expect = 4.6
 Identities = 22/76 (28%), Positives = 38/76 (50%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           +    EE   EE E  E+  EEE E ++ + + E EEE+++  E E + EE +++    +
Sbjct: 4   KEQKTEEPDAEETEEAEKSEEEEAEEEEPEEENELEEEQQEIAELEAQLEELKDKYLRAQ 63

Query: 226 EVVKQKKNEEEEEEEE 241
              +  +   E E EE
Sbjct: 64  AEFENLRKRTEREREE 79



 Score = 27.3 bits (61), Expect = 6.7
 Identities = 24/58 (41%), Positives = 34/58 (58%), Gaps = 7/58 (12%)

Query: 194 KKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEE 251
           K+ K EE + EE +E ++ E+EE E EE EEE       NE EEE++E+     + EE
Sbjct: 4   KEQKTEEPDAEETEEAEKSEEEEAEEEEPEEE-------NELEEEQQEIAELEAQLEE 54


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
           consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 36.1 bits (83), Expect = 0.012
 Identities = 17/78 (21%), Positives = 39/78 (50%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           +E K      +    +E+++V + + +E E+ + E K+ DE   +  +++  + ++  K 
Sbjct: 255 QEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKA 314

Query: 231 KKNEEEEEEEEVLIKLEE 248
            + E E++E E   K E 
Sbjct: 315 SEKEAEDKELEAQKKREP 332



 Score = 31.5 bits (71), Expect = 0.45
 Identities = 19/74 (25%), Positives = 36/74 (48%), Gaps = 4/74 (5%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
           A     KE+ ++ E +  E E+ + +  K +EE  + K  +    K+E +  E+E E   
Sbjct: 264 ADTSSPKEDKQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAE--- 320

Query: 229 KQKKNEEEEEEEEV 242
             K+ E +++ E V
Sbjct: 321 -DKELEAQKKREPV 333



 Score = 30.4 bits (68), Expect = 0.94
 Identities = 17/91 (18%), Positives = 41/91 (45%), Gaps = 8/91 (8%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEE----- 223
             ++++ E  + ++E     +       KE+++  E +K E E  + E +  +EE     
Sbjct: 242 NADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEEALKAK 301

Query: 224 ---EEEVVKQKKNEEEEEEEEVLIKLEEEEE 251
                ++ ++ K  E+E E++ L   ++ E 
Sbjct: 302 DHKAFDLKQESKASEKEAEDKELEAQKKREP 332



 Score = 30.4 bits (68), Expect = 1.0
 Identities = 12/71 (16%), Positives = 38/71 (53%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           E +K+E ++ + E  + +EE  + K+ +  + ++E K  ++  +++    +++ E V + 
Sbjct: 277 ENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKREPVAED 336

Query: 231 KKNEEEEEEEE 241
            +  + + E +
Sbjct: 337 LQKTKPQVEAQ 347



 Score = 30.0 bits (67), Expect = 1.3
 Identities = 19/84 (22%), Positives = 42/84 (50%), Gaps = 2/84 (2%)

Query: 170 GEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEE--DEIEKEEGENEEEEEEEV 227
            +    +ED+   E  + E E  Q + K+ +EE  + K+    ++++E   +E+E E++ 
Sbjct: 264 ADTSSPKEDKQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKE 323

Query: 228 VKQKKNEEEEEEEEVLIKLEEEEE 251
           ++ +K  E   E+    K + E +
Sbjct: 324 LEAQKKREPVAEDLQKTKPQVEAQ 347



 Score = 29.6 bits (66), Expect = 1.5
 Identities = 14/81 (17%), Positives = 42/81 (51%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           +E + +ED    ++ +EE + KQ    + +++ +  ++  + +++E   +++E + + K 
Sbjct: 204 KERESQEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNLPKP 263

Query: 231 KKNEEEEEEEEVLIKLEEEEE 251
                 +E+++V    + E E
Sbjct: 264 ADTSSPKEDKQVAENQKREIE 284



 Score = 28.0 bits (62), Expect = 5.5
 Identities = 14/88 (15%), Positives = 40/88 (45%), Gaps = 5/88 (5%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
           A ++    +D  +++ +E  ++ ++ KN  +  +    KE+ ++ + +    E+ + E+ 
Sbjct: 232 AQQKADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIK 291

Query: 229 K-----QKKNEEEEEEEEVLIKLEEEEE 251
           K      K  + +  + +   K  E+E 
Sbjct: 292 KNDEEALKAKDHKAFDLKQESKASEKEA 319


>gnl|CDD|236277 PRK08493, PRK08493, NADH dehydrogenase subunit G; Validated.
          Length = 819

 Score = 36.2 bits (84), Expect = 0.012
 Identities = 21/64 (32%), Positives = 36/64 (56%), Gaps = 3/64 (4%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE-EVVK 229
           EE +++  E  E+  E +EEVK++  K+ ++  E   +   + +E G NEE  E+ E + 
Sbjct: 469 EEIREKVLEQAEQGCENQEEVKKEVPKKVKKIPE--VDTYLLLEELGINEETYEKLEALL 526

Query: 230 QKKN 233
            KKN
Sbjct: 527 AKKN 530



 Score = 34.3 bits (79), Expect = 0.053
 Identities = 24/74 (32%), Positives = 37/74 (50%), Gaps = 2/74 (2%)

Query: 180 IEEEENEEEEEVKQKKNKEEEE--EEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEE 237
            EE   +  EE KQ   KE      EE +++  E  ++  EN+EE ++EV K+ K   E 
Sbjct: 444 EEEAILKSLEEFKQSIVKEAALSILEEIREKVLEQAEQGCENQEEVKKEVPKKVKKIPEV 503

Query: 238 EEEEVLIKLEEEEE 251
           +   +L +L   EE
Sbjct: 504 DTYLLLEELGINEE 517



 Score = 30.1 bits (68), Expect = 1.2
 Identities = 14/59 (23%), Positives = 30/59 (50%)

Query: 193 QKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEE 251
           +K  +EE   +  ++ +  I KE   +  EE  E V ++  +  E +EEV  ++ ++ +
Sbjct: 440 KKFLEEEAILKSLEEFKQSIVKEAALSILEEIREKVLEQAEQGCENQEEVKKEVPKKVK 498



 Score = 28.9 bits (65), Expect = 3.3
 Identities = 21/85 (24%), Positives = 37/85 (43%), Gaps = 8/85 (9%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQK--KNKEEEEEEEEKKEEDEIEKEEGENEEEE 223
           E I    E+ ++  ++E      EE+++K  +  E+  E +E+       K+E   + ++
Sbjct: 446 EAILKSLEEFKQSIVKEAALSILEEIREKVLEQAEQGCENQEEV------KKEVPKKVKK 499

Query: 224 EEEVVKQKKNEEEEEEEEVLIKLEE 248
             EV      EE    EE   KLE 
Sbjct: 500 IPEVDTYLLLEELGINEETYEKLEA 524


>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
           Mitofilin controls mitochondrial cristae morphology.
           Mitofilin is enriched in the narrow space between the
           inner boundary and the outer membranes, where it forms a
           homotypic interaction and assembles into a large
           multimeric protein complex. The first 78 amino acids
           contain a typical amino-terminal-cleavable mitochondrial
           presequence rich in positive-charged and hydroxylated
           residues and a membrane anchor domain. In addition, it
           has three centrally located coiled coil domains.
          Length = 493

 Score = 36.2 bits (84), Expect = 0.013
 Identities = 19/82 (23%), Positives = 46/82 (56%), Gaps = 2/82 (2%)

Query: 173 EKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEE--DEIEKEEGENEEEEEEEVVKQ 230
           E++    +E +E   E++++ +  +E+EE  ++ +E+   E+E++   +E++ + E+  Q
Sbjct: 206 EEELLARLESKEAALEKQLRLEFEREKEELRKKYEEKLRQELERQAEAHEQKLKNELALQ 265

Query: 231 KKNEEEEEEEEVLIKLEEEEEN 252
               + E  +E+  K+EEE   
Sbjct: 266 AIELQREFNKEIKEKVEEERNG 287



 Score = 35.0 bits (81), Expect = 0.029
 Identities = 22/81 (27%), Positives = 41/81 (50%), Gaps = 6/81 (7%)

Query: 171 EEEKKEEDEIEEEENEEEEEV--KQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
           EEE++ E  ++E+  E   ++  +     E +E   EK+   E E+E+ E  ++ EE++ 
Sbjct: 185 EEEEELERALKEKREELLSKLEEELLARLESKEAALEKQLRLEFEREKEELRKKYEEKL- 243

Query: 229 KQKKNEEEEEEEEVLIKLEEE 249
              + E E + E    KL+ E
Sbjct: 244 ---RQELERQAEAHEQKLKNE 261



 Score = 33.1 bits (76), Expect = 0.15
 Identities = 20/70 (28%), Positives = 39/70 (55%), Gaps = 2/70 (2%)

Query: 182 EEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEE 241
           EEE E E  +K+K+  EE   + E++    +E +E   E++   E  ++K+   ++ EE+
Sbjct: 185 EEEEELERALKEKR--EELLSKLEEELLARLESKEAALEKQLRLEFEREKEELRKKYEEK 242

Query: 242 VLIKLEEEEE 251
           +  +LE + E
Sbjct: 243 LRQELERQAE 252



 Score = 28.5 bits (64), Expect = 3.9
 Identities = 26/77 (33%), Positives = 45/77 (58%), Gaps = 5/77 (6%)

Query: 178 DEIEEEENEEEEEVKQKKNKEEEEEE---EEKKEE--DEIEKEEGENEEEEEEEVVKQKK 232
              +EE ++  +++ + K +EEEE E   +EK+EE   ++E+E     E +E  + KQ +
Sbjct: 166 ASAKEELDQLSKKLAELKAEEEEELERALKEKREELLSKLEEELLARLESKEAALEKQLR 225

Query: 233 NEEEEEEEEVLIKLEEE 249
            E E E+EE+  K EE+
Sbjct: 226 LEFEREKEELRKKYEEK 242


>gnl|CDD|184287 PRK13735, PRK13735, conjugal transfer mating pair stabilization
           protein TraG; Provisional.
          Length = 942

 Score = 36.3 bits (84), Expect = 0.014
 Identities = 13/67 (19%), Positives = 29/67 (43%), Gaps = 4/67 (5%)

Query: 174 KKEEDEIEEEENEEEEEVK--QKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQK 231
              ++ I  EEN  + +    Q  +K E   +  K  E++  +E     +  EE  + ++
Sbjct: 875 GDTQNSIRGEENTVKGQYSELQNHHKTEALSQNNKYNEEKSAQERMPGADSPEE--LMKR 932

Query: 232 KNEEEEE 238
             E +++
Sbjct: 933 AKEYQDK 939



 Score = 35.9 bits (83), Expect = 0.018
 Identities = 12/72 (16%), Positives = 32/72 (44%), Gaps = 7/72 (9%)

Query: 179 EIEEEENEEEEEVKQKKN--KEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEE 236
           E E    + +  ++ ++N  K +  E +   + + + +    NEE+       Q++    
Sbjct: 869 EYEGNIGDTQNSIRGEENTVKGQYSELQNHHKTEALSQNNKYNEEKSA-----QERMPGA 923

Query: 237 EEEEEVLIKLEE 248
           +  EE++ + +E
Sbjct: 924 DSPEELMKRAKE 935



 Score = 31.6 bits (72), Expect = 0.42
 Identities = 15/73 (20%), Positives = 26/73 (35%), Gaps = 9/73 (12%)

Query: 189 EEVKQK-KNKEEEEEEEEKKEEDEIEKEEGE--------NEEEEEEEVVKQKKNEEEEEE 239
            +VK +  N   E E      ++ I  EE              + E + +  K  EE+  
Sbjct: 857 NDVKHQVDNMVTEYEGNIGDTQNSIRGEENTVKGQYSELQNHHKTEALSQNNKYNEEKSA 916

Query: 240 EEVLIKLEEEEEN 252
           +E +   +  EE 
Sbjct: 917 QERMPGADSPEEL 929



 Score = 31.6 bits (72), Expect = 0.43
 Identities = 12/88 (13%), Positives = 38/88 (43%), Gaps = 1/88 (1%)

Query: 166 ENIAGEEEKKEEDEIEEE-ENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEE 224
            + A  E++ ++  I  + +++ +  V + +    + +   + EE+ ++ +  E +   +
Sbjct: 841 GHQAIIEQRTQDSGIRNDVKHQVDNMVTEYEGNIGDTQNSIRGEENTVKGQYSELQNHHK 900

Query: 225 EEVVKQKKNEEEEEEEEVLIKLEEEEEN 252
            E + Q     EE+  +  +   +  E 
Sbjct: 901 TEALSQNNKYNEEKSAQERMPGADSPEE 928



 Score = 31.3 bits (71), Expect = 0.56
 Identities = 8/43 (18%), Positives = 17/43 (39%)

Query: 167 NIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEE 209
           +      K E      + NEE+   ++    +  EE  ++ +E
Sbjct: 893 SELQNHHKTEALSQNNKYNEEKSAQERMPGADSPEELMKRAKE 935



 Score = 30.9 bits (70), Expect = 0.84
 Identities = 15/63 (23%), Positives = 26/63 (41%), Gaps = 12/63 (19%)

Query: 167 NIAGEEE--KKEEDEIEEEENEEEEEVKQKKNKEEEE----------EEEEKKEEDEIEK 214
           +I GEE   K +  E++     E      K N+E+            EE  K+ ++  +K
Sbjct: 880 SIRGEENTVKGQYSELQNHHKTEALSQNNKYNEEKSAQERMPGADSPEELMKRAKEYQDK 939

Query: 215 EEG 217
            +G
Sbjct: 940 HKG 942


>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT.  This family
           consists of several bacterial cobalamin biosynthesis
           (CobT) proteins. CobT is involved in the transformation
           of precorrin-3 into cobyrinic acid.
          Length = 282

 Score = 35.6 bits (82), Expect = 0.014
 Identities = 17/71 (23%), Positives = 38/71 (53%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           EE   E +  + E+NE+E++ K+ ++ ++ EEEE    +   E  +  +EE E  E+   
Sbjct: 210 EELGDEPESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEMEAA 269

Query: 231 KKNEEEEEEEE 241
           + + ++  + +
Sbjct: 270 EASADDTPDSD 280



 Score = 34.0 bits (78), Expect = 0.058
 Identities = 12/56 (21%), Positives = 25/56 (44%)

Query: 186 EEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEE 241
           +  EE+  +    + E+ E++ +  E E ++   EEE        + ++   EE E
Sbjct: 207 DMAEELGDEPESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEME 262



 Score = 34.0 bits (78), Expect = 0.058
 Identities = 12/76 (15%), Positives = 34/76 (44%), Gaps = 5/76 (6%)

Query: 176 EEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEE 235
             +E+ +E    + E  + ++  +E+E++++ EE+E    +  +E+ +        +  E
Sbjct: 208 MAEELGDEPESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASS-----EEME 262

Query: 236 EEEEEEVLIKLEEEEE 251
             E E      ++  +
Sbjct: 263 SGEMEAAEASADDTPD 278



 Score = 32.9 bits (75), Expect = 0.13
 Identities = 16/73 (21%), Positives = 34/73 (46%), Gaps = 6/73 (8%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
              A  E+ ++ED+ +E+E++++ E      +EE    +   E+ +   EE E+ E E  
Sbjct: 216 PESADSEDNEDEDDPKEDEDDDQGE------EEESGSSDSLSEDSDASSEEMESGEMEAA 269

Query: 226 EVVKQKKNEEEEE 238
           E       + ++ 
Sbjct: 270 EASADDTPDSDDA 282



 Score = 28.2 bits (63), Expect = 3.7
 Identities = 11/76 (14%), Positives = 32/76 (42%), Gaps = 7/76 (9%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEE-------DEIEKEEGENEEEEE 224
           +  +E  +  E  + E+ E +    ++E++++ E++E        ++ +    E E  E 
Sbjct: 207 DMAEELGDEPESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEM 266

Query: 225 EEVVKQKKNEEEEEEE 240
           E       +  + ++ 
Sbjct: 267 EAAEASADDTPDSDDA 282


>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein.  Rotavirus particles
           consist of three concentric proteinaceous capsid layers.
           The innermost capsid (core) is made of VP2. The genomic
           RNA and the two minor proteins VP1 and VP3 are
           encapsidated within this layer. The N-terminus of
           rotavirus VP2 is necessary for the encapsidation of VP1
           and VP3.
          Length = 887

 Score = 36.1 bits (83), Expect = 0.015
 Identities = 16/79 (20%), Positives = 40/79 (50%), Gaps = 1/79 (1%)

Query: 173 EKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEE-EEEEVVKQK 231
              +  + +++E ++++   + K K  +++EE   +  +   +E  ++E  +  + VK+ 
Sbjct: 12  NNNDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQENLKIADEVKKS 71

Query: 232 KNEEEEEEEEVLIKLEEEE 250
             EE ++  EVL   EE +
Sbjct: 72  TKEESKQLLEVLKTKEEHQ 90



 Score = 35.7 bits (82), Expect = 0.024
 Identities = 24/85 (28%), Positives = 44/85 (51%), Gaps = 11/85 (12%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKN----------KEEEEEEEEKKEEDEIEKEEGENE 220
            +EK +E + ++   E +E+V  KK            +E+  +E  K  DE++K   E E
Sbjct: 17  MQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQENLKIADEVKKSTKE-E 75

Query: 221 EEEEEEVVKQKKNEEEEEEEEVLIK 245
            ++  EV+K K+  ++E + E+L K
Sbjct: 76  SKQLLEVLKTKEEHQKEIQYEILQK 100



 Score = 33.4 bits (76), Expect = 0.12
 Identities = 22/90 (24%), Positives = 46/90 (51%), Gaps = 2/90 (2%)

Query: 160 EIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKE-EEEEEEEKKEEDEIEKEEGE 218
              +  E +  ++E+   D ++    E+  +   K   E ++  +EE K+  E+ K + E
Sbjct: 29  NRMELKEKVLDKKEEVVTDNVDSPVKEQSSQENLKIADEVKKSTKEESKQLLEVLKTKEE 88

Query: 219 NEEEEEEEVVKQKKNEEEEEEEEVLIKLEE 248
           +++E + E++ QK     E +E +L KLE+
Sbjct: 89  HQKEIQYEIL-QKTIPSFEPKESILKKLED 117


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 728

 Score = 36.2 bits (84), Expect = 0.015
 Identities = 38/102 (37%), Positives = 50/102 (49%), Gaps = 15/102 (14%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKN--------------KEEEEEEEEKKEEDE 211
           EN A  EE + E E EEE +EEE E   KKN               E ++ ++E K E +
Sbjct: 389 ENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFK 448

Query: 212 IEKEEGENEEEEEEEVVKQKKNEEEEEE-EEVLIKLEEEEEN 252
            +KE  E EE E+EE  K +K   +  +  E   K EEEEE 
Sbjct: 449 EKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEEL 490



 Score = 35.0 bits (81), Expect = 0.032
 Identities = 24/74 (32%), Positives = 43/74 (58%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQK 231
              + +   E  E E E +  +K+NK E +E++E  EE+E+E EE    E+   +++K+ 
Sbjct: 419 NVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRS 478

Query: 232 KNEEEEEEEEVLIK 245
           +  ++EEEEE L +
Sbjct: 479 EKAQKEEEEEELDE 492



 Score = 35.0 bits (81), Expect = 0.037
 Identities = 26/93 (27%), Positives = 43/93 (46%), Gaps = 8/93 (8%)

Query: 168 IAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEE----DEIEKEEGENEEEE 223
           +  ++  K ++  ++EEN     +K  +  E  ++EE   E      E+E EE  +EEE 
Sbjct: 353 MLRKKLGKLKEGEDDEENSGLLSMKFMQRAEARKKEENDAEIEELRRELEGEEESDEEEN 412

Query: 224 EEEVVKQKKN----EEEEEEEEVLIKLEEEEEN 252
           EE   K         E  E+E    KL++E +N
Sbjct: 413 EEPSKKNVGRRKFGPENGEKEAESKKLKKENKN 445



 Score = 33.9 bits (78), Expect = 0.089
 Identities = 27/86 (31%), Positives = 46/86 (53%), Gaps = 3/86 (3%)

Query: 170 GEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEI---EKEEGENEEEEEEE 226
            E  KKEE++ E EE   E E +++ ++EE EE  +K         E  E E E ++ ++
Sbjct: 382 AEARKKEENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKK 441

Query: 227 VVKQKKNEEEEEEEEVLIKLEEEEEN 252
             K +  E++E +EE  ++ EEE + 
Sbjct: 442 ENKNEFKEKKESDEEEELEDEEEAKV 467



 Score = 32.7 bits (75), Expect = 0.17
 Identities = 24/83 (28%), Positives = 45/83 (54%), Gaps = 1/83 (1%)

Query: 170 GEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKE-EGENEEEEEEEVV 228
            EE++ E+ + EEE+ +++E+    +N     ++  K +E E ++E  G    +  +   
Sbjct: 324 SEEDEDEDSDSEEEDEDDDEDDDDGENPWMLRKKLGKLKEGEDDEENSGLLSMKFMQRAE 383

Query: 229 KQKKNEEEEEEEEVLIKLEEEEE 251
            +KK E + E EE+  +LE EEE
Sbjct: 384 ARKKEENDAEIEELRRELEGEEE 406



 Score = 30.4 bits (69), Expect = 0.90
 Identities = 24/62 (38%), Positives = 33/62 (53%), Gaps = 5/62 (8%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQK 231
           +EKKE DE EE E+EEE +V++  NK  +  E+ +KEE     EE E +EE         
Sbjct: 448 KEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEE-----EEEELDEENPWLKTTSS 502

Query: 232 KN 233
             
Sbjct: 503 VG 504



 Score = 29.6 bits (67), Expect = 1.6
 Identities = 22/75 (29%), Positives = 39/75 (52%), Gaps = 4/75 (5%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
               EK+ E +  ++EN+ E     K+ KE +EEEE + EE+   ++      +  E+  
Sbjct: 427 PENGEKEAESKKLKKENKNEF----KEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQ 482

Query: 229 KQKKNEEEEEEEEVL 243
           K+++ EE +EE   L
Sbjct: 483 KEEEEEELDEENPWL 497



 Score = 29.6 bits (67), Expect = 1.7
 Identities = 27/94 (28%), Positives = 42/94 (44%), Gaps = 7/94 (7%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEE-------GE 218
           E  +G    K     E  + EE +   ++  +E E EEE  +EE+E   ++       G 
Sbjct: 368 EENSGLLSMKFMQRAEARKKEENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGP 427

Query: 219 NEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEEN 252
              E+E E  K KK  + E +E+     EEE E+
Sbjct: 428 ENGEKEAESKKLKKENKNEFKEKKESDEEEELED 461



 Score = 28.1 bits (63), Expect = 6.1
 Identities = 18/85 (21%), Positives = 39/85 (45%), Gaps = 2/85 (2%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGEN--EEEE 223
            N   +  +K + E EEEE +EE    +  +   +  +++  ++    K +       + 
Sbjct: 471 ANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQDSKKKSSSKLDKAANKISKA 530

Query: 224 EEEVVKQKKNEEEEEEEEVLIKLEE 248
             +V K+KK E+  + ++ LI  E+
Sbjct: 531 AVKVKKKKKKEKSIDLDDDLIDEED 555



 Score = 27.7 bits (62), Expect = 7.3
 Identities = 19/69 (27%), Positives = 31/69 (44%)

Query: 170 GEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVK 229
            ++ KKE     +E+ E +EE + +  +E + E+   K     EK + E EEEE +E   
Sbjct: 436 SKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDEENP 495

Query: 230 QKKNEEEEE 238
             K      
Sbjct: 496 WLKTTSSVG 504


>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
           subunit 1; Provisional.
          Length = 319

 Score = 35.8 bits (83), Expect = 0.016
 Identities = 19/57 (33%), Positives = 35/57 (61%), Gaps = 8/57 (14%)

Query: 168 IAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEE 224
           + G +E+  E+ +E+ E EEEE        ++  E E++ EEDE E+EE +++E ++
Sbjct: 271 VVGGDEEDLEELLEKAEEEEEE--------DDYSESEDEDEEDEDEEEEEDDDEGDK 319



 Score = 32.7 bits (75), Expect = 0.13
 Identities = 20/75 (26%), Positives = 39/75 (52%), Gaps = 6/75 (8%)

Query: 178 DEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEE 237
           + I+E   ++  + K K   E    +EE   E+ +EK E E EE++  E       +E+E
Sbjct: 250 EAIKEVIKKKGGDFKVKGEPEVVGGDEED-LEELLEKAEEEEEEDDYSES-----EDEDE 303

Query: 238 EEEEVLIKLEEEEEN 252
           E+E+   + +++E +
Sbjct: 304 EDEDEEEEEDDDEGD 318



 Score = 28.1 bits (63), Expect = 4.8
 Identities = 19/49 (38%), Positives = 29/49 (59%), Gaps = 7/49 (14%)

Query: 163 KFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDE 211
              E +   EE++EED+  E E+E+EE+       E+EEEEE+  E D+
Sbjct: 278 DLEELLEKAEEEEEEDDYSESEDEDEED-------EDEEEEEDDDEGDK 319


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 36.0 bits (84), Expect = 0.016
 Identities = 18/81 (22%), Positives = 48/81 (59%), Gaps = 2/81 (2%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           E++ +E + + +E  + +EE+++KK K +EEE++  +E ++  ++  +  ++E +E++K+
Sbjct: 533 EQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKE 592

Query: 231 KKNEEEEEEEEVLIKLEEEEE 251
            +  ++       +K  E  E
Sbjct: 593 LRQLQKGGYAS--VKAHELIE 611



 Score = 34.0 bits (79), Expect = 0.073
 Identities = 24/83 (28%), Positives = 46/83 (55%), Gaps = 2/83 (2%)

Query: 171 EEEKKEEDEIEEEENE--EEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
           EE KK   E +E+ NE     E  +++ +++ EE E   +E E  KEE E ++E+ +E  
Sbjct: 505 EEAKKLIGEDKEKLNELIASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEE 564

Query: 229 KQKKNEEEEEEEEVLIKLEEEEE 251
            +   E E+E ++ + + ++E +
Sbjct: 565 DKLLEEAEKEAQQAIKEAKKEAD 587



 Score = 32.9 bits (76), Expect = 0.17
 Identities = 25/83 (30%), Positives = 45/83 (54%), Gaps = 7/83 (8%)

Query: 168 IAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEV 227
           IA  EE + E E + EE E   +  +K  +E EE++E+ +EE++   EE E E       
Sbjct: 522 IASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKE------- 574

Query: 228 VKQKKNEEEEEEEEVLIKLEEEE 250
            +Q   E ++E +E++ +L + +
Sbjct: 575 AQQAIKEAKKEADEIIKELRQLQ 597



 Score = 32.5 bits (75), Expect = 0.22
 Identities = 22/83 (26%), Positives = 47/83 (56%), Gaps = 2/83 (2%)

Query: 172 EEKKE--EDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVK 229
           EEKKE  ++E ++   E E+E +Q   + ++E +E  KE  +++K    + +  E    +
Sbjct: 554 EEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKGGYASVKAHELIEAR 613

Query: 230 QKKNEEEEEEEEVLIKLEEEEEN 252
           ++ N+  E++E+   K +E++E 
Sbjct: 614 KRLNKANEKKEKKKKKQKEKQEE 636



 Score = 31.7 bits (73), Expect = 0.35
 Identities = 28/102 (27%), Positives = 53/102 (51%), Gaps = 16/102 (15%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEE--------KKEEDEIEKEEG 217
           +    E   KE ++++EE  E++E+++++++K  EE E+E        KKE DEI KE  
Sbjct: 535 KAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKELR 594

Query: 218 ENE--------EEEEEEVVKQKKNEEEEEEEEVLIKLEEEEE 251
           + +          E  E  K+     E++E++   + E++EE
Sbjct: 595 QLQKGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEE 636


>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
           RPA34.5.  This is a family of proteins conserved from
           yeasts to human. Subunit A34.5 of RNA polymerase I is a
           non-essential subunit which is thought to help Pol I
           overcome topological constraints imposed on ribosomal
           DNA during the process of transcription.
          Length = 193

 Score = 35.1 bits (81), Expect = 0.017
 Identities = 17/70 (24%), Positives = 42/70 (60%)

Query: 170 GEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVK 229
           G   +   +    E+    +  K+ + +EEE++E++KK+E + EK+E ++++E+  E   
Sbjct: 123 GPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKG 182

Query: 230 QKKNEEEEEE 239
            KK ++++++
Sbjct: 183 SKKKKKKKKK 192



 Score = 28.1 bits (63), Expect = 3.7
 Identities = 18/44 (40%), Positives = 26/44 (59%)

Query: 198 EEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEE 241
            E   E E  E++   K E E E EEEE+  K+KK E ++E++E
Sbjct: 126 SELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKE 169



 Score = 27.0 bits (60), Expect = 7.6
 Identities = 13/64 (20%), Positives = 29/64 (45%)

Query: 178 DEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEE 237
                      E    +K    + E+E + EE+E ++++ + E ++E++  K KK +  E
Sbjct: 120 APDGPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVE 179

Query: 238 EEEE 241
            +  
Sbjct: 180 PKGS 183


>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon. 
          Length = 431

 Score = 35.8 bits (82), Expect = 0.017
 Identities = 31/85 (36%), Positives = 43/85 (50%), Gaps = 8/85 (9%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           EN   EEE+KEE   E EE EE E V + + K +  + EE ++E        E E E EE
Sbjct: 107 ENETVEEEEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKE--------EKEPEPEE 158

Query: 226 EVVKQKKNEEEEEEEEVLIKLEEEE 250
           E   ++ + EE   E +  KL+  E
Sbjct: 159 EEKPKRGSLEENNGEFMTHKLKHTE 183



 Score = 35.8 bits (82), Expect = 0.019
 Identities = 24/68 (35%), Positives = 39/68 (57%)

Query: 182 EEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEE 241
           E+   E E V++++ +E  EE EE +E + + K E +N+  + EE  K++K  E EEEE+
Sbjct: 102 EDSGAENETVEEEEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKEPEPEEEEK 161

Query: 242 VLIKLEEE 249
                 EE
Sbjct: 162 PKRGSLEE 169



 Score = 30.0 bits (67), Expect = 1.2
 Identities = 20/73 (27%), Positives = 35/73 (47%)

Query: 179 EIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEE 238
           E +    ++      ++ +E+   E E  EE+E E+   E EE EE E V + + + +  
Sbjct: 83  EFKPTSTDQSLSEPSRRMQEDSGAENETVEEEEKEESREEREEVEETEGVTKSEQKNDWR 142

Query: 239 EEEVLIKLEEEEE 251
           + E   K E+E E
Sbjct: 143 DAEECQKEEKEPE 155



 Score = 29.6 bits (66), Expect = 1.7
 Identities = 22/71 (30%), Positives = 35/71 (49%)

Query: 181 EEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEE 240
           + E  +E E++KQK+ +   E EE KK+ +E  K   E E+  ++E   +K  EEEE+  
Sbjct: 195 QVEAGKEFEKLKQKQQEAALELEELKKKREERRKVLEEEEQRRKQEEADRKSREEEEKRR 254

Query: 241 EVLIKLEEEEE 251
                     E
Sbjct: 255 LKEEIERRRAE 265



 Score = 28.1 bits (62), Expect = 4.5
 Identities = 18/81 (22%), Positives = 46/81 (56%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           E EK ++ + E     EE + K+++ ++  EEEE++++++E +++  E EE+   +   +
Sbjct: 201 EFEKLKQKQQEAALELEELKKKREERRKVLEEEEQRRKQEEADRKSREEEEKRRLKEEIE 260

Query: 231 KKNEEEEEEEEVLIKLEEEEE 251
           ++  E  E+ + + +    E+
Sbjct: 261 RRRAEAAEKRQKVPEDGLSED 281



 Score = 28.1 bits (62), Expect = 5.0
 Identities = 21/79 (26%), Positives = 37/79 (46%)

Query: 173 EKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKK 232
           E+++E +    +    E  ++ +     E E  ++EE E  +EE E  EE E     ++K
Sbjct: 79  ERQKEFKPTSTDQSLSEPSRRMQEDSGAENETVEEEEKEESREEREEVEETEGVTKSEQK 138

Query: 233 NEEEEEEEEVLIKLEEEEE 251
           N+  + EE    + E E E
Sbjct: 139 NDWRDAEECQKEEKEPEPE 157


>gnl|CDD|100107 cd02440, AdoMet_MTases, S-adenosylmethionine-dependent
           methyltransferases (SAM or AdoMet-MTase), class I;
           AdoMet-MTases are enzymes that use
           S-adenosyl-L-methionine (SAM or AdoMet) as a substrate
           for methyltransfer, creating the product
           S-adenosyl-L-homocysteine (AdoHcy). There are at least
           five structurally distinct families of AdoMet-MTases,
           class I being the largest and most diverse. Within this
           class enzymes can be classified by different substrate
           specificities (small molecules, lipids, nucleic acids,
           etc.) and different target atoms for methylation
           (nitrogen, oxygen, carbon, sulfur, etc.).
          Length = 107

 Score = 33.9 bits (78), Expect = 0.018
 Identities = 23/108 (21%), Positives = 35/108 (32%), Gaps = 21/108 (19%)

Query: 52  TVLDVGTGTGKSILLQGHGVDHHSFCTKVHPLDHYAPQYLILEVIQNKIENVEL------ 105
            VLD+G GTG   L    G         +      A +           +NVE+      
Sbjct: 1   RVLDLGCGTGALALALASGPGARVTGVDI---SPVALELARKAAAALLADNVEVLKGDAE 57

Query: 106 ------PEKVDIIVSEWMGFYL--LHESMIDSVIFARDKFLKPEGVMY 145
                  E  D+I+S+    +L       ++         LKP GV+ 
Sbjct: 58  ELPPEADESFDVIISDPPLHHLVEDLARFLEEARRL----LKPGGVLV 101


>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
           (TAF4) is one of several TAFs that bind TBP and is
           involved in forming Transcription Factor IID (TFIID)
           complex.  The TATA Binding Protein (TBP) Associated
           Factor 4 (TAF4) is one of several TAFs that bind TBP and
           are involved in forming the Transcription Factor IID
           (TFIID) complex. TFIID is one of seven General
           Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
           TFIIF, and TFIID) that are involved in accurate
           initiation of transcription by RNA polymerase II in
           eukaryote. TFIID plays an important role in the
           recognition of promoter DNA and assembly of the
           pre-initiation complex. TFIID complex is composed of the
           TBP and at least 13 TAFs. TAFs from various species were
           originally named by their predicted molecular weight or
           their electrophoretic mobility in polyacrylamide gels. A
           new, unified nomenclature for the pol II TAFs has been
           suggested to show the relationship between TAF orthologs
           and paralogs. Several hypotheses are proposed for TAFs
           functions such as serving as activator-binding sites,
           core-promoter recognition or a role in essential
           catalytic activity. Each TAF, with the help of a
           specific activator, is required only for the expression
           of subset of genes and is not universally involved for
           transcription as are GTFs. In yeast and human cells,
           TAFs have been found as components of other complexes
           besides TFIID.   Several TAFs interact via histone-fold
           (HFD) motifs; HFD is the interaction motif involved in
           heterodimerization of the core histones and their
           assembly into nucleosome octamers. The minimal HFD
           contains three alpha-helices linked by two loops and is
           found in core histones, TAFS and many other
           transcription factors. TFIID has a histone octamer-like
           substructure. TAF4 domain interacts with TAF12 and makes
           a novel histone-like heterodimer that binds DNA and has
           a core promoter function of a subset of genes.
          Length = 212

 Score = 35.0 bits (81), Expect = 0.018
 Identities = 20/51 (39%), Positives = 32/51 (62%), Gaps = 1/51 (1%)

Query: 192 KQKKNKEE-EEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEE 241
           KQ +  E+ E EEEEK++E+E E+     +   E+  +KQK  E ++EE+E
Sbjct: 114 KQLRFLEQLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDE 164



 Score = 30.8 bits (70), Expect = 0.45
 Identities = 15/44 (34%), Positives = 24/44 (54%)

Query: 197 KEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEE 240
            E EEEE+  +EE E      ++  E+     K K+ ++EE+EE
Sbjct: 122 LEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEE 165



 Score = 28.1 bits (63), Expect = 3.8
 Identities = 11/50 (22%), Positives = 27/50 (54%)

Query: 182 EEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQK 231
           E+   EEEE + ++ +E      + + E    K++ +  ++EE+E ++ +
Sbjct: 120 EQLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEEMRHR 169



 Score = 27.3 bits (61), Expect = 6.8
 Identities = 16/47 (34%), Positives = 27/47 (57%), Gaps = 2/47 (4%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGE 218
           E+ + E+  EE+ +EEE E   +  K   E+   K++  E++KEE E
Sbjct: 120 EQLEREE--EEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDE 164



 Score = 26.9 bits (60), Expect = 8.9
 Identities = 13/44 (29%), Positives = 27/44 (61%), Gaps = 1/44 (2%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEK 214
           EEEK++E+E E      +   +Q + K ++ +E +K+E++E+  
Sbjct: 126 EEEKRDEEERERLLRAAKSRSEQSRLK-QKAKEMQKEEDEEMRH 168


>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
           [Transcription / DNA replication, recombination, and
           repair / Chromatin structure and dynamics].
          Length = 1001

 Score = 36.1 bits (83), Expect = 0.018
 Identities = 17/67 (25%), Positives = 33/67 (49%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           +E  + E+E+ E E   ++E  +    EE +E  E   EDE E +  + E+ E+ + ++ 
Sbjct: 929 DESDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEEDGEDWDELES 988

Query: 231 KKNEEEE 237
           K   +  
Sbjct: 989 KAAYDSR 995



 Score = 33.8 bits (77), Expect = 0.093
 Identities = 16/66 (24%), Positives = 34/66 (51%)

Query: 176 EEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEE 235
           +E +  EEE  E E     ++ E +E+EE  +  +++ ++E EN+  +EE+     + E 
Sbjct: 929 DESDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEEDGEDWDELES 988

Query: 236 EEEEEE 241
           +   + 
Sbjct: 989 KAAYDS 994



 Score = 33.4 bits (76), Expect = 0.11
 Identities = 17/60 (28%), Positives = 35/60 (58%)

Query: 181 EEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEE 240
           ++E +E EEEV + +   ++E +E  ++E+  E  E  +E+E E +   ++  E+ +E E
Sbjct: 928 DDESDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEEDGEDWDELE 987



 Score = 31.1 bits (70), Expect = 0.54
 Identities = 17/62 (27%), Positives = 32/62 (51%)

Query: 171  EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
            E E   +DE +E + +EE +   +   E+E E +   EED  + +E E++   +    K+
Sbjct: 940  EYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEEDGEDWDELESKAAYDSRPGKR 999

Query: 231  KK 232
            +K
Sbjct: 1000 RK 1001



 Score = 29.6 bits (66), Expect = 2.1
 Identities = 12/56 (21%), Positives = 33/56 (58%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEE 221
           E  A  +++ +E + +EE +E  E++ + +++ +  +EE+ ++ DE+E +   +  
Sbjct: 940 EYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEEDGEDWDELESKAAYDSR 995



 Score = 28.8 bits (64), Expect = 3.1
 Identities = 14/61 (22%), Positives = 33/61 (54%)

Query: 181 EEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEE 240
            ++E++E EE   +     ++E +E  E++E ++   +  E+E E     +++ E+ +E 
Sbjct: 927 SDDESDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEEDGEDWDEL 986

Query: 241 E 241
           E
Sbjct: 987 E 987


>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein.  CDC45 is an essential gene
           required for initiation of DNA replication in S.
           cerevisiae, forming a complex with MCM5/CDC46.
           Homologues of CDC45 have been identified in human, mouse
           and smut fungus among others.
          Length = 583

 Score = 35.7 bits (83), Expect = 0.021
 Identities = 14/82 (17%), Positives = 40/82 (48%), Gaps = 7/82 (8%)

Query: 168 IAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEV 227
               ++   + E +++++EE +E       E+EE  + + +ED+ + ++ ++    E  +
Sbjct: 113 EPRYDDAYRDLEEDDDDDEESDE-------EDEESSKSEDDEDDDDDDDDDDIATRERSL 165

Query: 228 VKQKKNEEEEEEEEVLIKLEEE 249
            ++++  E EE+   L     +
Sbjct: 166 ERRRRRREWEEKRAELEFYYYQ 187



 Score = 32.3 bits (74), Expect = 0.26
 Identities = 15/84 (17%), Positives = 47/84 (55%), Gaps = 7/84 (8%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           +N+ G ++    D+ + EE  ++E       ++ EE++++ +E DE ++E  ++E++E++
Sbjct: 90  DNVFGSDQVVIFDDGDIEEELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDD 149

Query: 226 EVVKQKKNEEEEEEEEVLIKLEEE 249
           +       +++++++    +   E
Sbjct: 150 D-------DDDDDDDIATRERSLE 166



 Score = 31.9 bits (73), Expect = 0.32
 Identities = 10/58 (17%), Positives = 31/58 (53%)

Query: 195 KNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEEN 252
               EEE ++E + +D     E +++++EE +   ++ ++ E++E++     +++   
Sbjct: 103 DGDIEEELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIAT 160



 Score = 28.0 bits (63), Expect = 5.0
 Identities = 16/85 (18%), Positives = 41/85 (48%), Gaps = 16/85 (18%)

Query: 159 PEIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGE 218
           P     + ++  +++  EE + E+EE+ +          E++E++++  ++D+I   E  
Sbjct: 114 PRYDDAYRDLEEDDDDDEESDEEDEESSKS---------EDDEDDDDDDDDDDIATRERS 164

Query: 219 NEEEEEEEVVKQKKNEEEEEEEEVL 243
            E        ++++ E EE+  E+ 
Sbjct: 165 LER-------RRRRREWEEKRAELE 182


>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein.  This
           family includes the radial spoke head proteins RSP4 and
           RSP6 from Chlamydomonas reinhardtii, and several
           eukaryotic homologues, including mammalian RSHL1, the
           protein product of a familial ciliary dyskinesia
           candidate gene.
          Length = 481

 Score = 35.4 bits (82), Expect = 0.021
 Identities = 23/105 (21%), Positives = 44/105 (41%), Gaps = 26/105 (24%)

Query: 162 FKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKN------------------------- 196
           ++  E    EEE++  +  E EEN + E ++ ++                          
Sbjct: 285 YQLEEEEEEEEEEEPAERDELEENPDFEGLEVRELADSLDNWVHHVQHILPQGRCTWVNP 344

Query: 197 -KEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEE 240
            +++EEEE+E +EE+E E+E  E E EE   ++     +     +
Sbjct: 345 EQKDEEEEQEDEEEEEEEEEPEEPEPEEGPPLLTPISEDAPLPND 389



 Score = 34.3 bits (79), Expect = 0.050
 Identities = 23/85 (27%), Positives = 38/85 (44%), Gaps = 22/85 (25%)

Query: 179 EIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDE----------------------IEKEE 216
           E EEEE EEEE  ++ + +E  + E  +  E                          +++
Sbjct: 288 EEEEEEEEEEEPAERDELEENPDFEGLEVRELADSLDNWVHHVQHILPQGRCTWVNPEQK 347

Query: 217 GENEEEEEEEVVKQKKNEEEEEEEE 241
            E EE+E+EE  ++++  EE E EE
Sbjct: 348 DEEEEQEDEEEEEEEEEPEEPEPEE 372



 Score = 33.1 bits (76), Expect = 0.12
 Identities = 27/83 (32%), Positives = 35/83 (42%), Gaps = 27/83 (32%)

Query: 197 KEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV---------------------------K 229
           + EEEEEEE++EE     E  EN + E  EV                            +
Sbjct: 286 QLEEEEEEEEEEEPAERDELEENPDFEGLEVRELADSLDNWVHHVQHILPQGRCTWVNPE 345

Query: 230 QKKNEEEEEEEEVLIKLEEEEEN 252
           QK  EEE+E+EE   + EE EE 
Sbjct: 346 QKDEEEEQEDEEEEEEEEEPEEP 368



 Score = 29.3 bits (66), Expect = 2.4
 Identities = 20/40 (50%), Positives = 29/40 (72%)

Query: 177 EDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEE 216
           E E+ E E EEEEE   + ++E+E EEEE++E +E+ KEE
Sbjct: 164 EVELREGEEEEEEEEVGEADEEDEGEEEEEEEPEEVPKEE 203



 Score = 27.7 bits (62), Expect = 5.8
 Identities = 19/41 (46%), Positives = 25/41 (60%)

Query: 209 EDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEE 249
           E E+ + E E EEEE  E  ++ + EEEEEEE   +  EEE
Sbjct: 164 EVELREGEEEEEEEEVGEADEEDEGEEEEEEEPEEVPKEEE 204



 Score = 27.3 bits (61), Expect = 9.0
 Identities = 18/32 (56%), Positives = 24/32 (75%)

Query: 197 KEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
           +E EEEEEE++  +  E++EGE EEEEE E V
Sbjct: 168 REGEEEEEEEEVGEADEEDEGEEEEEEEPEEV 199



 Score = 27.3 bits (61), Expect = 9.5
 Identities = 19/35 (54%), Positives = 23/35 (65%)

Query: 196 NKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
            + EEEEEEE+  E + E E  E EEEE EEV K+
Sbjct: 168 REGEEEEEEEEVGEADEEDEGEEEEEEEPEEVPKE 202


>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
          Length = 1066

 Score = 35.6 bits (82), Expect = 0.022
 Identities = 19/69 (27%), Positives = 35/69 (50%), Gaps = 9/69 (13%)

Query: 182 EEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKE---------EGENEEEEEEEVVKQKK 232
           E++   EEE+++KK KEE+ +E+E K+    +KE           +     ++   K +K
Sbjct: 9   EKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKKSRK 68

Query: 233 NEEEEEEEE 241
            + E+E  E
Sbjct: 69  RDVEDENPE 77



 Score = 27.9 bits (62), Expect = 6.9
 Identities = 16/44 (36%), Positives = 25/44 (56%)

Query: 188 EEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQK 231
           E E ++K   EEE E ++KKEE   EKE  + +  ++E   K +
Sbjct: 5   ESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQ 48


>gnl|CDD|220972 pfam11081, DUF2890, Protein of unknown function (DUF2890).  This
           family is conserved in dsDNA adenoviruses of
           vertebrates. The function is not known.
          Length = 172

 Score = 34.5 bits (79), Expect = 0.022
 Identities = 18/56 (32%), Positives = 29/56 (51%)

Query: 186 EEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEE 241
           +EEE+   +  + EE+EEE +  ED +++E+ E EE EEE     K      +   
Sbjct: 19  DEEEDWDSQAEEVEEDEEEMEDWEDSLDEEDEEAEEVEEETAASSKAPSSSSKSSS 74



 Score = 32.2 bits (73), Expect = 0.14
 Identities = 16/45 (35%), Positives = 24/45 (53%)

Query: 201 EEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIK 245
           ++EEE  +    E EE E E E+ E+ + ++  E EE EEE    
Sbjct: 18  KDEEEDWDSQAEEVEEDEEEMEDWEDSLDEEDEEAEEVEEETAAS 62



 Score = 30.3 bits (68), Expect = 0.65
 Identities = 18/40 (45%), Positives = 23/40 (57%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEED 210
           EEE  +    E EE+EEE E  +    EE+EE EE +EE 
Sbjct: 20  EEEDWDSQAEEVEEDEEEMEDWEDSLDEEDEEAEEVEEET 59



 Score = 30.3 bits (68), Expect = 0.70
 Identities = 15/57 (26%), Positives = 29/57 (50%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
           + + EE E +EEE E+ E+   ++++E EE EEE     +      ++  +E   + 
Sbjct: 25  DSQAEEVEEDEEEMEDWEDSLDEEDEEAEEVEEETAASSKAPSSSSKSSSQETISIP 81



 Score = 29.1 bits (65), Expect = 1.5
 Identities = 16/42 (38%), Positives = 23/42 (54%)

Query: 187 EEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
           ++EE       EE EE+EE+ E+ E   +E + E EE EE  
Sbjct: 18  KDEEEDWDSQAEEVEEDEEEMEDWEDSLDEEDEEAEEVEEET 59



 Score = 28.7 bits (64), Expect = 2.1
 Identities = 17/52 (32%), Positives = 31/52 (59%), Gaps = 3/52 (5%)

Query: 191 VKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEV 242
           +K +    ++EEE+   + +E+E++E   EE E+ E    +++EE EE EE 
Sbjct: 10  LKVRPPPTKDEEEDWDSQAEEVEEDE---EEMEDWEDSLDEEDEEAEEVEEE 58



 Score = 27.6 bits (61), Expect = 4.9
 Identities = 15/53 (28%), Positives = 29/53 (54%)

Query: 174 KKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEE 226
           K    +++      ++E +   ++ EE EE+E++ ED  +  + E+EE EE E
Sbjct: 4   KGNAKKLKVRPPPTKDEEEDWDSQAEEVEEDEEEMEDWEDSLDEEDEEAEEVE 56


>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
           This family represents Cwf15/Cwc15 (from
           Schizosaccharomyces pombe and Saccharomyces cerevisiae
           respectively) and their homologues. The function of
           these proteins is unknown, but they form part of the
           spliceosome and are thus thought to be involved in mRNA
           splicing.
          Length = 241

 Score = 34.7 bits (80), Expect = 0.025
 Identities = 15/56 (26%), Positives = 30/56 (53%), Gaps = 5/56 (8%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEE-----EEVKQKKNKEEEEEEEEKKEEDEIEKEE 216
            +   +    + D+ + E+         E++K+++ +E+E EEEEK  E+E  +EE
Sbjct: 124 SSSDSDSSDDDSDDDDSEDETAALLRELEKIKKERAEEKEREEEEKAAEEEKAREE 179



 Score = 33.5 bits (77), Expect = 0.059
 Identities = 20/98 (20%), Positives = 43/98 (43%), Gaps = 12/98 (12%)

Query: 167 NIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE- 225
            I   ++    D   E + +++EE + K+ + EE+      ++ +   +   ++++ ++ 
Sbjct: 79  AIEDADKSTNLDASNEGDEDDDEEDEIKRKRIEEDARNSDADDSDSSSDSDSSDDDSDDD 138

Query: 226 -----------EVVKQKKNEEEEEEEEVLIKLEEEEEN 252
                      E+ K KK   EE+E E   K  EEE+ 
Sbjct: 139 DSEDETAALLRELEKIKKERAEEKEREEEEKAAEEEKA 176



 Score = 32.0 bits (73), Expect = 0.25
 Identities = 21/82 (25%), Positives = 39/82 (47%), Gaps = 13/82 (15%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDE----------IEKEEGENEE 221
           E K++  E +   ++ ++      +   +++ ++   EDE          I+KE  E +E
Sbjct: 104 EIKRKRIEEDARNSDADDSDSSSDSDSSDDDSDDDDSEDETAALLRELEKIKKERAEEKE 163

Query: 222 EEEEEVVKQKKNEEEEEEEEVL 243
            EEEE   +   EE+  EEE+L
Sbjct: 164 REEEE---KAAEEEKAREEEIL 182



 Score = 29.7 bits (67), Expect = 1.1
 Identities = 14/59 (23%), Positives = 32/59 (54%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEE 224
            + + + +  ++D  +++  +E   + ++  K ++E  EEK+ E+E +  E E   EEE
Sbjct: 122 SDSSSDSDSSDDDSDDDDSEDETAALLRELEKIKKERAEEKEREEEEKAAEEEKAREEE 180



 Score = 27.8 bits (62), Expect = 5.1
 Identities = 16/82 (19%), Positives = 36/82 (43%)

Query: 170 GEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVK 229
           G E++  + ++  E  E E   K KK  +   E+ +K    +   E  E+++EE+E   K
Sbjct: 49  GTEDELRKRDLRAELEEAERAHKSKKENKLAIEDADKSTNLDASNEGDEDDDEEDEIKRK 108

Query: 230 QKKNEEEEEEEEVLIKLEEEEE 251
           + + +    + +      + + 
Sbjct: 109 RIEEDARNSDADDSDSSSDSDS 130


>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
          Length = 619

 Score = 35.2 bits (82), Expect = 0.026
 Identities = 16/54 (29%), Positives = 26/54 (48%)

Query: 198 EEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEE 251
              EE+      +  E ++ E+EEEEE+E       +E E  E+VL K +   +
Sbjct: 176 PNAEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADESELPEKVLEKFKALAK 229



 Score = 31.7 bits (73), Expect = 0.39
 Identities = 18/85 (21%), Positives = 40/85 (47%)

Query: 168 IAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEV 227
             GE   +E  +   + N EE+        EE +++E+++EE++   +    +E E  E 
Sbjct: 160 ENGERRLRELIDGFVDPNAEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADESELPEK 219

Query: 228 VKQKKNEEEEEEEEVLIKLEEEEEN 252
           V +K     ++ +++    E++ E 
Sbjct: 220 VLEKFKALAKQYKKLRKAQEKKVEG 244



 Score = 30.9 bits (71), Expect = 0.70
 Identities = 17/83 (20%), Positives = 40/83 (48%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
            ++  E E+ ++DE EEEE +E ++       E  E+  EK +    + ++    +E++ 
Sbjct: 183 AHVGSELEELDDDEDEEEEEDENDDSLAADESELPEKVLEKFKALAKQYKKLRKAQEKKV 242

Query: 226 EVVKQKKNEEEEEEEEVLIKLEE 248
           E    +  +  +  E++  +L+ 
Sbjct: 243 EGRLAQHKKYAKLREKLKEELKS 265



 Score = 30.9 bits (71), Expect = 0.75
 Identities = 6/38 (15%), Positives = 13/38 (34%)

Query: 198 EEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEE 235
           EE  + ++    +     + + + EEE          E
Sbjct: 61  EEAPDADDLLLAENEADAQTDEDAEEEAAAALSSVESE 98



 Score = 27.8 bits (63), Expect = 5.6
 Identities = 19/92 (20%), Positives = 39/92 (42%), Gaps = 8/92 (8%)

Query: 160 EIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGEN 219
           E+            +++   +  E  E +++       EEEEE+E        E E  E 
Sbjct: 168 ELID---GFVDPNAEEDPAHVGSELEELDDD-----EDEEEEEDENDDSLAADESELPEK 219

Query: 220 EEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEE 251
             E+ + + KQ K   + +E++V  +L + ++
Sbjct: 220 VLEKFKALAKQYKKLRKAQEKKVEGRLAQHKK 251


>gnl|CDD|217861 pfam04050, Upf2, Up-frameshift suppressor 2.  Transcripts
           harbouring premature signals for translation termination
           are recognised and rapidly degraded by eukaryotic cells
           through a pathway known as nonsense-mediated mRNA decay.
           In Saccharomyces cerevisiae, three trans-acting factors
           (Upf1 to Upf3) are required for nonsense-mediated mRNA
           decay.
          Length = 171

 Score = 34.3 bits (79), Expect = 0.027
 Identities = 16/74 (21%), Positives = 36/74 (48%)

Query: 176 EEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEE 235
            E E ++ E +EE   + + ++  +EEE +  ++++ E+ + E E+       ++   E 
Sbjct: 3   SESESDDGEEDEELPEEDEDDESSDEEEVDLPDDEQDEESDSEEEQIFVTRQEEEVDPEA 62

Query: 236 EEEEEEVLIKLEEE 249
           E E +    K+  E
Sbjct: 63  EAEFDREFEKMMAE 76



 Score = 28.5 bits (64), Expect = 2.3
 Identities = 18/55 (32%), Positives = 35/55 (63%), Gaps = 3/55 (5%)

Query: 199 EEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEE---EEEEVLIKLEEEE 250
            E E ++ +E++E+ +E+ ++E  +EEEV      ++EE   EEE++ +  +EEE
Sbjct: 3   SESESDDGEEDEELPEEDEDDESSDEEEVDLPDDEQDEESDSEEEQIFVTRQEEE 57



 Score = 27.7 bits (62), Expect = 4.5
 Identities = 14/60 (23%), Positives = 34/60 (56%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           E+    EE ++++  +EEE +  ++ + +++  EEE+    ++E+E++ E     + E E
Sbjct: 12  EDEELPEEDEDDESSDEEEVDLPDDEQDEESDSEEEQIFVTRQEEEVDPEAEAEFDREFE 71


>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572).  Family of
           eukaryotic proteins with undetermined function.
          Length = 321

 Score = 35.1 bits (81), Expect = 0.027
 Identities = 28/111 (25%), Positives = 46/111 (41%), Gaps = 21/111 (18%)

Query: 162 FKFWENIAGEEEKKEEDEIEEEENE-----EEEEVKQKKNKEEEEEEEEKKE----EDEI 212
           ++  +    +EE+ E++  EE   +     E      K+  E  E  EE KE      ++
Sbjct: 108 YEADKLDEEQEERVEKEREEELAGDAMKKLENRTADSKREMEVLERLEELKELQSRRADV 167

Query: 213 ------------EKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEE 251
                       EK+E E EEEE+E ++K      E EE+      E+ E+
Sbjct: 168 DVNSMLEALFRREKKEEEEEEEEDEALIKSLSFGPETEEDRRRADDEDSED 218



 Score = 33.2 bits (76), Expect = 0.11
 Identities = 14/49 (28%), Positives = 28/49 (57%)

Query: 192 KQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEE 240
              + +++EEEEEE+++E  I+      E EE+      + +E++EE+ 
Sbjct: 175 ALFRREKKEEEEEEEEDEALIKSLSFGPETEEDRRRADDEDSEDDEEDN 223



 Score = 29.3 bits (66), Expect = 1.6
 Identities = 17/75 (22%), Positives = 33/75 (44%)

Query: 176 EEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEE 235
           E+ E EEEE E+E  +K      E EE+  + ++++ E +E +N+     +       + 
Sbjct: 180 EKKEEEEEEEEDEALIKSLSFGPETEEDRRRADDEDSEDDEEDNDNTPSPKSGSSSPAKP 239

Query: 236 EEEEEEVLIKLEEEE 250
               ++   K  E  
Sbjct: 240 TSILKKSAAKRSEAP 254



 Score = 28.6 bits (64), Expect = 2.8
 Identities = 19/53 (35%), Positives = 31/53 (58%), Gaps = 2/53 (3%)

Query: 201 EEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEE--EEEEEVLIKLEEEEE 251
             E +K +E++ E+ E E EEE   + +K+ +N     + E EVL +LEE +E
Sbjct: 107 NYEADKLDEEQEERVEKEREEELAGDAMKKLENRTADSKREMEVLERLEELKE 159


>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
           Genome duplication is precisely regulated by
           cyclin-dependent kinases CDKs, which bring about the
           onset of S phase by activating replication origins and
           then prevent relicensing of origins until mitosis is
           completed. The optimum sequence motif for CDK
           phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
           to have at least 11 potential phosphorylation sites.
           Drc1 is required for DNA synthesis and S-M replication
           checkpoint control. Drc1 associates with Cdc2 and is
           phosphorylated at the onset of S phase when Cdc2 is
           activated. Thus Cdc2 promotes DNA replication by
           phosphorylating Drc1 and regulating its association with
           Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
           substrates required for DNA replication.
          Length = 397

 Score = 35.2 bits (81), Expect = 0.028
 Identities = 17/66 (25%), Positives = 36/66 (54%), Gaps = 1/66 (1%)

Query: 171 EEEKKEEDEIEEEENE-EEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVK 229
           +E    E +I EE  + +E+ + +        +E+++ E+DE  KEE E +++ +++  K
Sbjct: 307 DEPSLPESDIHEEIPKLDEKSLSEFLGYMGGIDEDDEDEDDEESKEEVEKKQKVKKKPRK 366

Query: 230 QKKNEE 235
           +K N  
Sbjct: 367 RKVNPV 372



 Score = 31.3 bits (71), Expect = 0.40
 Identities = 16/41 (39%), Positives = 20/41 (48%)

Query: 196 NKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEE 236
            +EEE  ++E     EIE EE      EEE V  Q  N+E 
Sbjct: 240 VQEEESIDDELDVLREIEAEEAGIGPIEEEVVDSQAANDEP 280



 Score = 30.9 bits (70), Expect = 0.54
 Identities = 17/87 (19%), Positives = 32/87 (36%), Gaps = 16/87 (18%)

Query: 175 KEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEE---------EE 225
           +EE+ I++E +   E        E EE      EE+ ++ +   +E              
Sbjct: 241 QEEESIDDELDVLREI-------EAEEAGIGPIEEEVVDSQAANDEPRRVFKKKGQKRTT 293

Query: 226 EVVKQKKNEEEEEEEEVLIKLEEEEEN 252
             VK +    +  +E  L + +  EE 
Sbjct: 294 RRVKMRPVRAKPSDEPSLPESDIHEEI 320



 Score = 28.6 bits (64), Expect = 3.5
 Identities = 19/97 (19%), Positives = 34/97 (35%), Gaps = 24/97 (24%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKK----------------NKEEEEEEEEKKEEDEI 212
           AG    +EE    +  N+E   V +KK                   +E    E    +EI
Sbjct: 261 AGIGPIEEEVVDSQAANDEPRRVFKKKGQKRTTRRVKMRPVRAKPSDEPSLPESDIHEEI 320

Query: 213 EKEE--------GENEEEEEEEVVKQKKNEEEEEEEE 241
            K +        G     +E++  +  +  +EE E++
Sbjct: 321 PKLDEKSLSEFLGYMGGIDEDDEDEDDEESKEEVEKK 357


>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457).  This is
           a family of uncharacterized proteins.
          Length = 449

 Score = 35.0 bits (80), Expect = 0.028
 Identities = 15/63 (23%), Positives = 39/63 (61%)

Query: 189 EEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEE 248
           E   +K  KE EEE  E++++DE + ++ ++E+E++++    + +E+E++++  L     
Sbjct: 35  ENAIRKLGKEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLHDDSS 94

Query: 249 EEE 251
            ++
Sbjct: 95  ADD 97



 Score = 31.5 bits (71), Expect = 0.37
 Identities = 13/50 (26%), Positives = 32/50 (64%)

Query: 191 VKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEE 240
            K+   ++  +E EE+  E+E + EE ++++++E+E      ++E++E+E
Sbjct: 33  KKENAIRKLGKEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDE 82



 Score = 31.1 bits (70), Expect = 0.48
 Identities = 13/43 (30%), Positives = 31/43 (72%)

Query: 174 KKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEE 216
           K+ E+E  EEE+++EE+     +++E++++++  E+DE E ++
Sbjct: 43  KEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDD 85



 Score = 30.7 bits (69), Expect = 0.76
 Identities = 10/40 (25%), Positives = 31/40 (77%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDE 211
           +E +EE   EE+++EE+++    ++++++++++++ +EDE
Sbjct: 43  KEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDE 82



 Score = 30.3 bits (68), Expect = 0.87
 Identities = 12/52 (23%), Positives = 37/52 (71%)

Query: 175 KEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEE 226
           K+E+ I +   E EEE  ++++ +EE+++++  E+++ + ++ + ++E+E++
Sbjct: 33  KKENAIRKLGKEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDD 84



 Score = 30.0 bits (67), Expect = 1.1
 Identities = 12/50 (24%), Positives = 33/50 (66%)

Query: 174 KKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEE 223
           +K   E EEE  EEE++ ++  + +++E+E++  ++D+ + E+ ++++  
Sbjct: 39  RKLGKEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDST 88



 Score = 30.0 bits (67), Expect = 1.1
 Identities = 10/62 (16%), Positives = 38/62 (61%)

Query: 173 EKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKK 232
           ++ EE+ +EEE+++EE++       E+++++++ +++++ + ++    ++   +   +  
Sbjct: 43  KEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLHDDSSADDGNETD 102

Query: 233 NE 234
           NE
Sbjct: 103 NE 104



 Score = 29.6 bits (66), Expect = 1.7
 Identities = 12/42 (28%), Positives = 32/42 (76%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEED 210
           A EE  +EED+ EE+++++++E +   + +++E++E++ ++D
Sbjct: 45  AEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDD 86



 Score = 29.6 bits (66), Expect = 1.7
 Identities = 9/55 (16%), Positives = 34/55 (61%)

Query: 188 EEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEV 242
           E  +++   + EEE  EE+ +++E + ++ + +E+++++   +   +E++++  +
Sbjct: 35  ENAIRKLGKEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTL 89



 Score = 28.8 bits (64), Expect = 3.1
 Identities = 10/58 (17%), Positives = 38/58 (65%)

Query: 183 EENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEE 240
           +E EEE   ++  ++E++++++++ E+D+ + ++ ++E+E++++      +  ++  E
Sbjct: 43  KEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLHDDSSADDGNE 100



 Score = 28.4 bits (63), Expect = 4.2
 Identities = 13/54 (24%), Positives = 34/54 (62%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEE 224
           E   ++  +  EEE  EEE+  ++ + ++++E+E+  ++D+ E +E E++++  
Sbjct: 35  ENAIRKLGKEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDST 88



 Score = 27.7 bits (61), Expect = 6.8
 Identities = 8/56 (14%), Positives = 34/56 (60%)

Query: 170 GEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           G+E ++E  E E+++ E++++   +   +++++++E  E+++ +     ++   ++
Sbjct: 42  GKEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLHDDSSADD 97



 Score = 27.7 bits (61), Expect = 7.2
 Identities = 9/48 (18%), Positives = 30/48 (62%)

Query: 194 KKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEE 241
           KK     +  +E +EE   E+++ E +++++++  +   +++++E++E
Sbjct: 33  KKENAIRKLGKEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDE 80


>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
           [Intracellular trafficking and secretion].
          Length = 776

 Score = 35.4 bits (81), Expect = 0.030
 Identities = 18/77 (23%), Positives = 34/77 (44%)

Query: 165 WENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEE 224
           W N+  E+   EE E ++   +      +K+  +    E+ + E    E+ E E E E+ 
Sbjct: 333 WRNLNNEKWGNEEAERKDYILDSSSVPLEKQFDDILYFEKMEIENRNPEESEHEEEVEDY 392

Query: 225 EEVVKQKKNEEEEEEEE 241
           E+     K   +++E E
Sbjct: 393 EDENDHSKRICDDDELE 409



 Score = 33.0 bits (75), Expect = 0.15
 Identities = 20/72 (27%), Positives = 33/72 (45%)

Query: 181 EEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEE 240
           E+  NEE E      +      E++  +    EK E EN   EE E  ++ ++ E+E + 
Sbjct: 339 EKWGNEEAERKDYILDSSSVPLEKQFDDILYFEKMEIENRNPEESEHEEEVEDYEDENDH 398

Query: 241 EVLIKLEEEEEN 252
              I  ++E EN
Sbjct: 399 SKRICDDDELEN 410



 Score = 31.5 bits (71), Expect = 0.45
 Identities = 15/61 (24%), Positives = 29/61 (47%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
              +K+ D+I   E  E E    ++++ EEE E+ + E D  ++   ++E E       +
Sbjct: 358 VPLEKQFDDILYFEKMEIENRNPEESEHEEEVEDYEDENDHSKRICDDDELENHFRAADE 417

Query: 231 K 231
           K
Sbjct: 418 K 418


>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62. 
          Length = 217

 Score = 34.4 bits (79), Expect = 0.032
 Identities = 14/65 (21%), Positives = 33/65 (50%), Gaps = 1/65 (1%)

Query: 184 ENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEE-EEEEEVVKQKKNEEEEEEEEV 242
           E+E+ +  K K N E   +   + +  E  K   + +  E  +++  Q+K EE+++ ++ 
Sbjct: 17  ESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKKK 76

Query: 243 LIKLE 247
            + L+
Sbjct: 77  KVPLQ 81



 Score = 27.8 bits (62), Expect = 5.3
 Identities = 10/60 (16%), Positives = 27/60 (45%)

Query: 175 KEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNE 234
            E ++ +  +++   E+  K N +++  E+ K        E  +    +E++  K+K  +
Sbjct: 16  LESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKK 75



 Score = 27.1 bits (60), Expect = 8.6
 Identities = 12/53 (22%), Positives = 23/53 (43%)

Query: 199 EEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEE 251
           E E+ +  K++   E     N +++  E  K     +  E  + L   E++EE
Sbjct: 17  ESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEE 69



 Score = 27.1 bits (60), Expect = 9.1
 Identities = 10/55 (18%), Positives = 28/55 (50%), Gaps = 1/55 (1%)

Query: 173 EKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEV 227
           +K  E      + +  E VK+  ++E++EE+++ K++  +  +    +   ++E 
Sbjct: 40  DKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKKKK-VPLQVNPAQLFVDDEY 93


>gnl|CDD|237744 PRK14521, rpsP, 30S ribosomal protein S16; Provisional.
          Length = 186

 Score = 34.4 bits (79), Expect = 0.032
 Identities = 18/70 (25%), Positives = 25/70 (35%), Gaps = 7/70 (10%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
           A +  KK   E E++ NE   E   +K   E      ++     E+E  E   EE     
Sbjct: 124 AKKAAKKAALEAEKKVNEARAEAVAEKKAAEAAAVAAEEAAAAEEEEAEEAPAEEA---- 179

Query: 229 KQKKNEEEEE 238
                EE  E
Sbjct: 180 ---PAEESAE 186



 Score = 33.2 bits (76), Expect = 0.063
 Identities = 24/79 (30%), Positives = 35/79 (44%), Gaps = 4/79 (5%)

Query: 173 EKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKK 232
           E + E + E  + E+E +V  KK+K  + ++  KK   E EK+  E   E     V +KK
Sbjct: 96  EAQAEAKFEAWKEEKEGKVNAKKDKLSKAKKAAKKAALEAEKKVNEARAEA----VAEKK 151

Query: 233 NEEEEEEEEVLIKLEEEEE 251
             E            EEEE
Sbjct: 152 AAEAAAVAAEEAAAAEEEE 170



 Score = 31.7 bits (72), Expect = 0.19
 Identities = 16/68 (23%), Positives = 26/68 (38%)

Query: 174 KKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKN 233
           K +  + ++   +   E ++K N+   E   EKK  +       E    EEEE  +    
Sbjct: 118 KDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAAAVAAEEAAAAEEEEAEEAPAE 177

Query: 234 EEEEEEEE 241
           E   EE  
Sbjct: 178 EAPAEESA 185



 Score = 28.2 bits (63), Expect = 3.5
 Identities = 18/68 (26%), Positives = 33/68 (48%), Gaps = 2/68 (2%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQK 231
           EEK+ +   ++++  + ++  +K   E E++  E + E   EK+  E      EE    +
Sbjct: 108 EEKEGKVNAKKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAAAVAAEEAAAAE 167

Query: 232 KNEEEEEE 239
             EEE EE
Sbjct: 168 --EEEAEE 173


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 35.0 bits (81), Expect = 0.034
 Identities = 22/82 (26%), Positives = 41/82 (50%), Gaps = 1/82 (1%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEE-EEEEEVVK 229
           EE ++E +++ +E  E   ++   +      E E ++ E+ I +   E  E E E E ++
Sbjct: 708 EELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELE 767

Query: 230 QKKNEEEEEEEEVLIKLEEEEE 251
           ++  E EEE  E   ++EE E 
Sbjct: 768 ERLEEAEEELAEAEAEIEELEA 789



 Score = 34.3 bits (79), Expect = 0.060
 Identities = 23/86 (26%), Positives = 38/86 (44%), Gaps = 1/86 (1%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
             +   E K +E   E  E EE+ E  ++   E  E E E+ E +  E E    E EE+ 
Sbjct: 323 AQLEELESKLDELAEELAELEEKLEELKE-ELESLEAELEELEAELEELESRLEELEEQL 381

Query: 226 EVVKQKKNEEEEEEEEVLIKLEEEEE 251
           E ++ K  + E +   +  ++E  E 
Sbjct: 382 ETLRSKVAQLELQIASLNNEIERLEA 407



 Score = 32.3 bits (74), Expect = 0.24
 Identities = 24/87 (27%), Positives = 40/87 (45%), Gaps = 4/87 (4%)

Query: 165 WENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEE 224
              +A E  + E+ +    E     E + ++ + + EE E K +E   E      E EE+
Sbjct: 290 LYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEEL----AELEEK 345

Query: 225 EEVVKQKKNEEEEEEEEVLIKLEEEEE 251
            E +K++    E E EE+  +LEE E 
Sbjct: 346 LEELKEELESLEAELEELEAELEELES 372



 Score = 31.6 bits (72), Expect = 0.42
 Identities = 30/93 (32%), Positives = 45/93 (48%), Gaps = 6/93 (6%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
             +   EEK EE  +E  E EEE E  QK+      E    +++ +I +E   N E + E
Sbjct: 260 AELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLE 319

Query: 226 EV------VKQKKNEEEEEEEEVLIKLEEEEEN 252
           E+      ++ K +E  EE  E+  KLEE +E 
Sbjct: 320 ELEAQLEELESKLDELAEELAELEEKLEELKEE 352



 Score = 31.6 bits (72), Expect = 0.47
 Identities = 25/94 (26%), Positives = 45/94 (47%), Gaps = 12/94 (12%)

Query: 171 EEEKKEEDEIEEEENEEEEEV------------KQKKNKEEEEEEEEKKEEDEIEKEEGE 218
           E  + +  ++E +      E+            ++++ ++E EE  +K EE E+++ + E
Sbjct: 382 ETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKELQAE 441

Query: 219 NEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEEN 252
            EE EEE    Q++ E  EE  E L +  EE E 
Sbjct: 442 LEELEEELEELQEELERLEEALEELREELEEAEQ 475



 Score = 31.6 bits (72), Expect = 0.51
 Identities = 26/81 (32%), Positives = 38/81 (46%)

Query: 160 EIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGEN 219
           EI +  +      E+    E + EE E + E  + K  E  EE  E +E+ E  KEE E+
Sbjct: 296 EISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELES 355

Query: 220 EEEEEEEVVKQKKNEEEEEEE 240
            E E EE+  + +  E   EE
Sbjct: 356 LEAELEELEAELEELESRLEE 376



 Score = 31.2 bits (71), Expect = 0.61
 Identities = 22/80 (27%), Positives = 41/80 (51%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           EEE ++  +  EE + +   +++   + E E E+ ++   ++ KE  E E E EE   + 
Sbjct: 711 EEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERL 770

Query: 231 KKNEEEEEEEEVLIKLEEEE 250
           ++ EEE  E E  I+  E +
Sbjct: 771 EEAEEELAEAEAEIEELEAQ 790



 Score = 30.8 bits (70), Expect = 0.70
 Identities = 22/82 (26%), Positives = 44/82 (53%), Gaps = 1/82 (1%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEE-EEEEEVVK 229
           EE +KE   +  E +  E++ +  + +    E + ++ E ++E+ E + +E  EE   ++
Sbjct: 284 EELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELE 343

Query: 230 QKKNEEEEEEEEVLIKLEEEEE 251
           +K  E +EE E +  +LEE E 
Sbjct: 344 EKLEELKEELESLEAELEELEA 365



 Score = 29.6 bits (67), Expect = 1.6
 Identities = 28/80 (35%), Positives = 43/80 (53%), Gaps = 2/80 (2%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           EE ++E +E++EE  E EEE+++     E +E EEK EE  +E  E E E EE ++ +  
Sbjct: 235 EELREELEELQEELKEAEEELEEL--TAELQELEEKLEELRLEVSELEEEIEELQKELYA 292

Query: 231 KKNEEEEEEEEVLIKLEEEE 250
             NE    E++  I  E   
Sbjct: 293 LANEISRLEQQKQILRERLA 312



 Score = 29.6 bits (67), Expect = 2.1
 Identities = 18/84 (21%), Positives = 27/84 (32%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
             E  +  E  I   E   E+  +Q +   E+ E    + E+  E  E    E E     
Sbjct: 822 LRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNE 881

Query: 229 KQKKNEEEEEEEEVLIKLEEEEEN 252
           +    E        L +L EE   
Sbjct: 882 RASLEEALALLRSELEELSEELRE 905



 Score = 29.3 bits (66), Expect = 2.5
 Identities = 22/86 (25%), Positives = 35/86 (40%), Gaps = 4/86 (4%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE----E 226
           E    E +E+EE   E E E++   N+    EE       E+E+   E  E E +     
Sbjct: 855 ESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELR 914

Query: 227 VVKQKKNEEEEEEEEVLIKLEEEEEN 252
              ++  E+  + E  L  LE   +N
Sbjct: 915 RELEELREKLAQLELRLEGLEVRIDN 940



 Score = 28.9 bits (65), Expect = 3.0
 Identities = 24/71 (33%), Positives = 40/71 (56%), Gaps = 6/71 (8%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQK 231
           + ++ E ++EE E++ +E        EE  E EEK EE + E E  E E EE E  +++ 
Sbjct: 317 QLEELEAQLEELESKLDEL------AEELAELEEKLEELKEELESLEAELEELEAELEEL 370

Query: 232 KNEEEEEEEEV 242
           ++  EE EE++
Sbjct: 371 ESRLEELEEQL 381



 Score = 28.1 bits (63), Expect = 4.9
 Identities = 21/85 (24%), Positives = 36/85 (42%), Gaps = 5/85 (5%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEE-----EE 226
             +K+   +E E  + EE + Q   +  E E E ++ E+ +E+ E E  E E      E 
Sbjct: 730 ALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEA 789

Query: 227 VVKQKKNEEEEEEEEVLIKLEEEEE 251
            ++Q K E +   E +     E   
Sbjct: 790 QIEQLKEELKALREALDELRAELTL 814



 Score = 28.1 bits (63), Expect = 5.4
 Identities = 20/80 (25%), Positives = 31/80 (38%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           E E +E +E  EE  EE  E + +  + E + E+ K+E   + +   E   E      + 
Sbjct: 760 EAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEA 819

Query: 231 KKNEEEEEEEEVLIKLEEEE 250
               E  E  E  I   E  
Sbjct: 820 ANLRERLESLERRIAATERR 839



 Score = 28.1 bits (63), Expect = 6.0
 Identities = 27/103 (26%), Positives = 46/103 (44%), Gaps = 16/103 (15%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIE------------ 213
                EE + E +E+E    E EE+++  ++K  + E +     +EIE            
Sbjct: 356 LEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDR 415

Query: 214 ----KEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEEN 252
               ++E E   ++ EE   ++   E EE EE L +L+EE E 
Sbjct: 416 RERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELER 458


>gnl|CDD|225137 COG2227, UbiG,
           2-polyprenyl-3-methyl-5-hydroxy-6-metoxy-1,4-benzoquinol
           methylase [Coenzyme metabolism].
          Length = 243

 Score = 34.6 bits (80), Expect = 0.035
 Identities = 33/149 (22%), Positives = 57/149 (38%), Gaps = 16/149 (10%)

Query: 8   DHDEEEYFES----YTDLEVHKLMLDDTVR-NEAYRAAICDNPNIFAGKTVLDVGTGTG- 61
           D+ E + FE+    + D E     L         Y   +        G  VLDVG G G 
Sbjct: 13  DYKELDKFEALASRWWDPEGEFKPLHKINPLRLDYIREVARLRFDLPGLRVLDVGCGGGI 72

Query: 62  --KSILLQG---HGVDHHSFCTKVHPLDHYAPQYLILEVIQNKIENV-ELPEKVDIIVSE 115
             + +   G    G+D      +V  L H     + ++  Q  +E++     + D++   
Sbjct: 73  LSEPLARLGASVTGIDASEKPIEVAKL-HALESGVNIDYRQATVEDLASAGGQFDVVTCM 131

Query: 116 WMGFYLLHESMIDSVIFARDKFLKPEGVM 144
            +   L H    +S + A  K +KP G++
Sbjct: 132 EV---LEHVPDPESFLRACAKLVKPGGIL 157


>gnl|CDD|221250 pfam11831, Myb_Cef, pre-mRNA splicing factor component.  This
           family is a region of the Myb-Related Cdc5p/Cef1
           proteins, in fungi, and is part of the pre-mRNA splicing
           factor complex.
          Length = 363

 Score = 34.6 bits (80), Expect = 0.035
 Identities = 21/54 (38%), Positives = 29/54 (53%), Gaps = 10/54 (18%)

Query: 175 KEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEED--EIEKEEGENEEEEEEE 226
           K ++E E E  EEEEE        E EE EE+ EED  + +  +   EE +E+E
Sbjct: 140 KPKNEFELELPEEEEE--------EPEEMEEELEEDAADRDARKRAAEEAKEQE 185



 Score = 31.2 bits (71), Expect = 0.53
 Identities = 13/36 (36%), Positives = 23/36 (63%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKK 207
           EE++EE E  EEE EE+   +  + +  EE +E+++
Sbjct: 151 EEEEEEPEEMEEELEEDAADRDARKRAAEEAKEQEE 186



 Score = 29.6 bits (67), Expect = 1.7
 Identities = 13/38 (34%), Positives = 25/38 (65%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKE 208
           EEE+ EE E E EE+  + + +++  +E +E+EE ++ 
Sbjct: 153 EEEEPEEMEEELEEDAADRDARKRAAEEAKEQEELRRR 190



 Score = 28.5 bits (64), Expect = 3.2
 Identities = 24/77 (31%), Positives = 35/77 (45%), Gaps = 11/77 (14%)

Query: 176 EEDEIEEEENEEEEEVKQKKN----------KEEEEEEEEKKEEDEIEKEEGENE-EEEE 224
              E+ E   EE+    + K           K + E E E  EE+E E EE E E EE+ 
Sbjct: 109 AAMEVGETPREEKLREDEAKLSLKSGLASLPKPKNEFELELPEEEEEEPEEMEEELEEDA 168

Query: 225 EEVVKQKKNEEEEEEEE 241
            +   +K+  EE +E+E
Sbjct: 169 ADRDARKRAAEEAKEQE 185


>gnl|CDD|220102 pfam09073, BUD22, BUD22.  BUD22 has been shown in yeast to be a
           nuclear protein involved in bud-site selection. It plays
           a role in positioning the proximal bud pole signal. More
           recently it has been shown to be involved in ribosome
           biogenesis.
          Length = 424

 Score = 34.8 bits (80), Expect = 0.036
 Identities = 20/86 (23%), Positives = 39/86 (45%), Gaps = 4/86 (4%)

Query: 170 GEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVK 229
            ++ K ++   + ++ E +E   +   +E E E+E K EE   +  + E EE+ + E   
Sbjct: 146 AKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSEDYS 205

Query: 230 QKK----NEEEEEEEEVLIKLEEEEE 251
           Q      +  +EEE E    +   E+
Sbjct: 206 QYDGMLVDSSDEEEGEEAPSINYNED 231



 Score = 34.4 bits (79), Expect = 0.049
 Identities = 22/105 (20%), Positives = 45/105 (42%), Gaps = 4/105 (3%)

Query: 152 HSAPSYSPEIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKK----NKEEEEEEEEKK 207
            S+     E  +  +    EE  +++ + EEEE+ + E+  Q      +  +EEE EE  
Sbjct: 165 ESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSEDYSQYDGMLVDSSDEEEGEEAP 224

Query: 208 EEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEEN 252
             +  E       +E + E+ + +   + EE      K +E++ +
Sbjct: 225 SINYNEDTSESESDESDSEISESRSVSDSEESSPPSKKPKEKKTS 269



 Score = 31.7 bits (72), Expect = 0.39
 Identities = 22/82 (26%), Positives = 35/82 (42%), Gaps = 11/82 (13%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEE-----------G 217
             + +KKE  E  ++++EEE E + +   EE  E++   EE+E    E            
Sbjct: 155 TKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSEDYSQYDGMLVDS 214

Query: 218 ENEEEEEEEVVKQKKNEEEEEE 239
            +EEE EE        +  E E
Sbjct: 215 SDEEEGEEAPSINYNEDTSESE 236



 Score = 31.0 bits (70), Expect = 0.65
 Identities = 17/83 (20%), Positives = 39/83 (46%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
              ++ K+++  E  + ++EEE + +   + EE  E+  +++E E  + E+  + +  +V
Sbjct: 153 KKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSEDYSQYDGMLV 212

Query: 229 KQKKNEEEEEEEEVLIKLEEEEE 251
                EE EE   +    +  E 
Sbjct: 213 DSSDEEEGEEAPSINYNEDTSES 235



 Score = 28.6 bits (64), Expect = 3.7
 Identities = 17/74 (22%), Positives = 29/74 (39%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           E+   E+  + +  + +  +EEE E     N  E+  E E  E D    E     + EE 
Sbjct: 197 EDSDSEDYSQYDGMLVDSSDEEEGEEAPSINYNEDTSESESDESDSEISESRSVSDSEES 256

Query: 226 EVVKQKKNEEEEEE 239
               +K  E++   
Sbjct: 257 SPPSKKPKEKKTSS 270



 Score = 27.9 bits (62), Expect = 5.1
 Identities = 16/61 (26%), Positives = 36/61 (59%)

Query: 190 EVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEE 249
           E K KK K +++ ++ KK+E +   ++ + EE E E+  K +++ E++ ++E     + E
Sbjct: 143 ETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSE 202

Query: 250 E 250
           +
Sbjct: 203 D 203


>gnl|CDD|178867 PRK00106, PRK00106, hypothetical protein; Provisional.
          Length = 535

 Score = 34.8 bits (80), Expect = 0.039
 Identities = 24/82 (29%), Positives = 45/82 (54%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
           A EE +K  +EIE+E   E +E+KQ +++  E      ++++ +  +E   E +E+    
Sbjct: 77  AKEEARKYREEIEQEFKSERQELKQIESRLTERATSLDRKDENLSSKEKTLESKEQSLTD 136

Query: 229 KQKKNEEEEEEEEVLIKLEEEE 250
           K K  +E EE+ E L + ++ E
Sbjct: 137 KSKHIDEREEQVEKLEEQKKAE 158


>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
           unknown].
          Length = 652

 Score = 34.7 bits (80), Expect = 0.041
 Identities = 32/94 (34%), Positives = 43/94 (45%), Gaps = 12/94 (12%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEE------------DEIEKEEGE 218
           EEE+  E E  EEE   E  V +K+ K+ EE  E  +EE             EIEK E E
Sbjct: 399 EEERPREKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELEELKREIEKLESE 458

Query: 219 NEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEEN 252
            E    E   K +K+ E    +  + +LE+E E 
Sbjct: 459 LERFRREVRDKVRKDREIRARDRRIERLEKELEE 492



 Score = 32.8 bits (75), Expect = 0.17
 Identities = 20/87 (22%), Positives = 40/87 (45%), Gaps = 2/87 (2%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           E ++  +E++   E E  E EE  E+   + + ++ EE  ++ E+E  + + E EE + E
Sbjct: 392 EALSKVKEEERPREKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELEELKRE 451

Query: 226 EVVKQKKNEE--EEEEEEVLIKLEEEE 250
               + + E    E  ++V    E   
Sbjct: 452 IEKLESELERFRREVRDKVRKDREIRA 478



 Score = 28.5 bits (64), Expect = 3.5
 Identities = 19/80 (23%), Positives = 36/80 (45%), Gaps = 5/80 (6%)

Query: 177 EDEIEEEENEEEEEVKQKKNKEEEEEE-----EEKKEEDEIEKEEGENEEEEEEEVVKQK 231
            + + + + EE    K+   +EE  E        KK E+ +E+ E EN E + E    ++
Sbjct: 391 AEALSKVKEEERPREKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELEELKR 450

Query: 232 KNEEEEEEEEVLIKLEEEEE 251
           + E+ E E E   +   ++ 
Sbjct: 451 EIEKLESELERFRREVRDKV 470



 Score = 28.1 bits (63), Expect = 4.9
 Identities = 18/84 (21%), Positives = 36/84 (42%)

Query: 157 YSPEIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEE 216
              EI  + + I   EE  E  E E  E + E E  +++ ++ E E E  + E   +  +
Sbjct: 413 ERREITVYEKRIKKLEETVERLEEENSELKRELEELKREIEKLESELERFRREVRDKVRK 472

Query: 217 GENEEEEEEEVVKQKKNEEEEEEE 240
                  +  + + +K  EE+++ 
Sbjct: 473 DREIRARDRRIERLEKELEEKKKR 496



 Score = 27.4 bits (61), Expect = 8.7
 Identities = 19/65 (29%), Positives = 33/65 (50%), Gaps = 1/65 (1%)

Query: 188 EEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLE 247
            E + + K +E   E+E  +EE+  E    E   ++ EE V++ + E  E + E L +L+
Sbjct: 391 AEALSKVKEEERPREKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRE-LEELK 449

Query: 248 EEEEN 252
            E E 
Sbjct: 450 REIEK 454


>gnl|CDD|212055 cd11486, SLC5sbd_SGLT1, Na(+)/glucose cotransporter SGLT1;solute
           binding domain.  Human SGLT1 (hSGLT1) is a
           high-affinity/low-capacity glucose transporter, which
           can also transport galactose. In the transport
           mechanism, two Na+ ions first bind to the extracellular
           side of the transporter and induce a conformational
           change in the glucose binding site. This results in an
           increased affinity for glucose. A second conformational
           change in the transporter follows, bringing the Na+ and
           glucose binding sites to the inner surface of the
           membrane. Glucose is then released, followed by the Na+
           ions. In the process, hSGLT1 is also able to transport
           water and urea and may be a major pathway for transport
           of these across the intestinal brush-border membrane.
           hSGLT1 is encoded by the SLC5A1 gene and expressed
           mostly in the intestine, but also in the trachea,
           kidney, heart, brain, testis, and prostate. The
           WHO/UNICEF oral rehydration solution (ORS) for the
           treatment of secretory diarrhea contains salt and
           glucose. The glucose, along with sodium ions, is
           transported by hSGLT1 and water is either co-transported
           along with these or follows by osmosis. Mutations in
           SGLT1 are associated with intestinal glucose galactose
           malabsorption (GGM). Up-regulation of intestinal SGLT1
           may protect against enteric infections. SGLT1 is
           expressed in colorectal, head and neck, and prostate
           tumors. Epidermal growth factor receptor (EGFR)
           functions in cell survival by stabilizing SGLT1, and
           thereby maintaining intracellular glucose levels. SGLT1
           is predicted to have 14 membrane-spanning regions. This
           subgroup belongs to the solute carrier 5
           (SLC5)transporter family.
          Length = 635

 Score = 34.8 bits (80), Expect = 0.042
 Identities = 18/70 (25%), Positives = 26/70 (37%), Gaps = 17/70 (24%)

Query: 185 NEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEG-----------------ENEEEEEEEV 227
           N  EE +    +   E+E+E + E DE  K+ G                     EEEE  
Sbjct: 539 NSTEERIDLDADDWTEDEDENEMETDEERKKPGCCRKAYNWFCGFDQGKAPKLTEEEEAA 598

Query: 228 VKQKKNEEEE 237
           +K K  +  E
Sbjct: 599 LKMKMTDTSE 608



 Score = 32.1 bits (73), Expect = 0.30
 Identities = 17/65 (26%), Positives = 27/65 (41%), Gaps = 16/65 (24%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEV----------------KQKKNKEEEEEEEEKKEE 209
            ++  ++  ++EDE E E +EE ++                 K  K  EEEE   + K  
Sbjct: 545 IDLDADDWTEDEDENEMETDEERKKPGCCRKAYNWFCGFDQGKAPKLTEEEEAALKMKMT 604

Query: 210 DEIEK 214
           D  EK
Sbjct: 605 DTSEK 609



 Score = 31.8 bits (72), Expect = 0.37
 Identities = 14/71 (19%), Positives = 30/71 (42%), Gaps = 13/71 (18%)

Query: 194 KKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV-----------KQKKNEE--EEEEE 240
           + + EE  + +     ++ ++ E E +EE ++               Q K  +  EEEE 
Sbjct: 538 RNSTEERIDLDADDWTEDEDENEMETDEERKKPGCCRKAYNWFCGFDQGKAPKLTEEEEA 597

Query: 241 EVLIKLEEEEE 251
            + +K+ +  E
Sbjct: 598 ALKMKMTDTSE 608


>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2.  PPP4R2 (protein phosphatase 4 core
           regulatory subunit R2) is the regulatory subunit of the
           histone H2A phosphatase complex. It has been shown to
           confer resistance to the anticancer drug cisplatin in
           yeast, and may confer resistance in higher eukaryotes.
          Length = 285

 Score = 34.4 bits (79), Expect = 0.044
 Identities = 20/67 (29%), Positives = 39/67 (58%)

Query: 173 EKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKK 232
           E+  E E  +    +++  K    K ++ ++EE  ++D    EE E +E+EEEE  ++++
Sbjct: 219 EEYYEKESSDAAASQDDGPKGSDVKNKKSDDEEDDDQDGDYVEEKELKEDEEEEETEEEE 278

Query: 233 NEEEEEE 239
            EE+E+E
Sbjct: 279 EEEDEDE 285



 Score = 34.0 bits (78), Expect = 0.051
 Identities = 23/77 (29%), Positives = 45/77 (58%), Gaps = 5/77 (6%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           +N + EE  ++E        ++  +    KNK+ ++EE++ ++ D +E++E + +EEEEE
Sbjct: 214 KNKSLEEYYEKESSDAAASQDDGPKGSDVKNKKSDDEEDDDQDGDYVEEKELKEDEEEEE 273

Query: 226 EVVKQKKNEEEEEEEEV 242
                 + EEEEE+E+ 
Sbjct: 274 T-----EEEEEEEDEDE 285


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 34.3 bits (79), Expect = 0.045
 Identities = 20/97 (20%), Positives = 44/97 (45%), Gaps = 11/97 (11%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEE-----------VKQKKNKEEEEEEEEKKEEDEIEK 214
           E    + E++ +DE   E  E E E            K ++NKE+  +E E++ ++E + 
Sbjct: 243 EESDDDGEEESDDESAWEGFESEYEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQL 302

Query: 215 EEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEE 251
           ++   +    +E+ K+   +E+    +   + E  E+
Sbjct: 303 KKKLAQLARLKEIAKEVAQKEKARARKKEQRKERGEK 339



 Score = 33.1 bits (76), Expect = 0.13
 Identities = 16/81 (19%), Positives = 33/81 (40%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
            EE  ++ E E ++    E  + +     +    ++K + +  KE+   E E E +  KQ
Sbjct: 242 LEESDDDGEEESDDESAWEGFESEYEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQ 301

Query: 231 KKNEEEEEEEEVLIKLEEEEE 251
            K +  +      I  E  ++
Sbjct: 302 LKKKLAQLARLKEIAKEVAQK 322



 Score = 32.8 bits (75), Expect = 0.15
 Identities = 28/127 (22%), Positives = 52/127 (40%), Gaps = 13/127 (10%)

Query: 138 LKPEGVMYPYKCILHSAPSYSPEIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNK 197
           L   G   P   + H+  SY+P  F+  + +  EE +KE    ++ +  E  E K+ +  
Sbjct: 169 LAANGKPVPAVEVPHAGASYNPS-FEDHQELLQEEYEKEVKAEKKRQELERVEEKKLEKM 227

Query: 198 EEE------------EEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIK 245
             E            EE ++  EE+  ++   E  E E E + K  + + + + +    K
Sbjct: 228 APEASRLDEMSEGLLEESDDDGEEESDDESAWEGFESEYEPINKPVRPKRKTKAQRNKEK 287

Query: 246 LEEEEEN 252
             +E E 
Sbjct: 288 RRKELER 294


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 34.7 bits (80), Expect = 0.045
 Identities = 27/91 (29%), Positives = 43/91 (47%)

Query: 162 FKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEE 221
               E I   + K EE E +  E  E+ E   +  +EE +E ++ +E  E ++EE   +E
Sbjct: 174 ELLKEVIKEAKAKIEELEGQLSELLEDIEDLLEALEEELKELKKLEEIQEEQEEEELEQE 233

Query: 222 EEEEEVVKQKKNEEEEEEEEVLIKLEEEEEN 252
            E  E    +  EE+E  EE+  +L E E  
Sbjct: 234 IEALEERLAELEEEKERLEELKARLLEIESL 264



 Score = 33.6 bits (77), Expect = 0.092
 Identities = 31/83 (37%), Positives = 48/83 (57%)

Query: 170 GEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVK 229
            E EK+  +  E E  E EEE+ ++K + E  EE E+ E++  E EE   E  E EE +K
Sbjct: 469 EEHEKELLELYELELEELEEELSREKEEAELREEIEELEKELRELEEELIELLELEEALK 528

Query: 230 QKKNEEEEEEEEVLIKLEEEEEN 252
           ++  E+ E+ E +L +LEE +E 
Sbjct: 529 EELEEKLEKLENLLEELEELKEK 551



 Score = 33.6 bits (77), Expect = 0.11
 Identities = 29/81 (35%), Positives = 41/81 (50%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           E  ++ E EIEE E E E      +  EE  E+ +  EE   + EE   + E E E + +
Sbjct: 291 ERLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAE 350

Query: 231 KKNEEEEEEEEVLIKLEEEEE 251
           +KNE  +  EE L +LEE  E
Sbjct: 351 EKNELAKLLEERLKELEERLE 371



 Score = 32.0 bits (73), Expect = 0.28
 Identities = 24/88 (27%), Positives = 45/88 (51%), Gaps = 1/88 (1%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
                EE+ +    +EE   E EE +++ + + E+  E  K+ E+ I++ + E  E    
Sbjct: 345 LEELAEEKNELAKLLEERLKELEERLEELEKELEKALERLKQLEEAIQELKEELAELSAA 404

Query: 226 EV-VKQKKNEEEEEEEEVLIKLEEEEEN 252
              ++++  E E+E EE+  +LEE EE 
Sbjct: 405 LEEIQEELEELEKELEELERELEELEEE 432



 Score = 31.7 bits (72), Expect = 0.39
 Identities = 27/83 (32%), Positives = 47/83 (56%)

Query: 168 IAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEV 227
           + G+E  +E ++   E  E E E  +++   E+EE E ++E +E+EKE  E EEE  E +
Sbjct: 462 VCGQELPEEHEKELLELYELELEELEEELSREKEEAELREEIEELEKELRELEEELIELL 521

Query: 228 VKQKKNEEEEEEEEVLIKLEEEE 250
             ++  +EE EE+   ++   EE
Sbjct: 522 ELEEALKEELEEKLEKLENLLEE 544



 Score = 31.7 bits (72), Expect = 0.42
 Identities = 27/92 (29%), Positives = 46/92 (50%)

Query: 160 EIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGEN 219
           E  +  E I   E++  E E E  E  E EE  +++ +E+ E+ E   EE E  KE+ + 
Sbjct: 495 EEAELREEIEELEKELRELEEELIELLELEEALKEELEEKLEKLENLLEELEELKEKLQL 554

Query: 220 EEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEE 251
           ++ +EE    + + +E +E  E L  L   +E
Sbjct: 555 QQLKEELRQLEDRLQELKELLEELRLLRTRKE 586



 Score = 31.3 bits (71), Expect = 0.49
 Identities = 28/83 (33%), Positives = 43/83 (51%)

Query: 170 GEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVK 229
            E E+ EE+   E+E  E  E  ++  KE  E EEE  E  E+E+   E  EE+ E++  
Sbjct: 481 LELEELEEELSREKEEAELREEIEELEKELRELEEELIELLELEEALKEELEEKLEKLEN 540

Query: 230 QKKNEEEEEEEEVLIKLEEEEEN 252
             +  EE +E+  L +L+EE   
Sbjct: 541 LLEELEELKEKLQLQQLKEELRQ 563



 Score = 31.3 bits (71), Expect = 0.54
 Identities = 27/81 (33%), Positives = 45/81 (55%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           E E ++ +   E E   +  +++ + K EE E E ++E   IE EE   E+ EE E +++
Sbjct: 636 ESELEKLNLQAELEELLQAALEELEEKVEELEAEIRRELQRIENEEQLEEKLEELEQLEE 695

Query: 231 KKNEEEEEEEEVLIKLEEEEE 251
           +  +  EE EE+L KL E E+
Sbjct: 696 ELEQLREELEELLKKLGEIEQ 716



 Score = 31.3 bits (71), Expect = 0.59
 Identities = 22/86 (25%), Positives = 48/86 (55%), Gaps = 1/86 (1%)

Query: 167 NIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEE-DEIEKEEGENEEEEEE 225
               E E+  ++E+EE+  + E  +++ +  +E+ + ++ KEE  ++E    E +E  EE
Sbjct: 518 IELLELEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEE 577

Query: 226 EVVKQKKNEEEEEEEEVLIKLEEEEE 251
             + + + EE EE  E L +L+++ +
Sbjct: 578 LRLLRTRKEELEELRERLKELKKKLK 603



 Score = 30.9 bits (70), Expect = 0.79
 Identities = 26/82 (31%), Positives = 43/82 (52%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           +EE++ E EIE  E    E  ++K+  EE +    + E  E+E  +   EE  E E + +
Sbjct: 225 QEEEELEQEIEALEERLAELEEEKERLEELKARLLEIESLELEALKIREEELRELERLLE 284

Query: 231 KKNEEEEEEEEVLIKLEEEEEN 252
           +  E+ E  EE+  ++EE EE 
Sbjct: 285 ELEEKIERLEELEREIEELEEE 306



 Score = 30.5 bits (69), Expect = 0.91
 Identities = 26/86 (30%), Positives = 41/86 (47%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           + +  E  + E+   E +E  EE  + + + +E EE  E  KE  +  KE  E   + EE
Sbjct: 555 QQLKEELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKLKELEERLSQLEE 614

Query: 226 EVVKQKKNEEEEEEEEVLIKLEEEEE 251
            +   + +E E E EE   +LE E E
Sbjct: 615 LLQSLELSEAENELEEAEEELESELE 640



 Score = 29.7 bits (67), Expect = 1.7
 Identities = 32/82 (39%), Positives = 45/82 (54%), Gaps = 1/82 (1%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEE-DEIEKEEGENEEEEEEEVVKQ 230
           EE+ +E E   EE E+E E   ++ K+ EE  +E KEE  E+     E +EE EE   + 
Sbjct: 360 EERLKELEERLEELEKELEKALERLKQLEEAIQELKEELAELSAALEEIQEELEELEKEL 419

Query: 231 KKNEEEEEEEEVLIKLEEEEEN 252
           ++ E E EE E  IK  EE+ N
Sbjct: 420 EELERELEELEEEIKKLEEQIN 441



 Score = 29.7 bits (67), Expect = 1.7
 Identities = 26/82 (31%), Positives = 41/82 (50%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           EE ++E + +EE   E EEE ++ +  +    E E  E + ++  E E  E E      +
Sbjct: 228 EELEQEIEALEERLAELEEEKERLEELKARLLEIESLELEALKIREEELRELERLLEELE 287

Query: 231 KKNEEEEEEEEVLIKLEEEEEN 252
           +K E  EE E  + +LEEE E 
Sbjct: 288 EKIERLEELEREIEELEEELEG 309



 Score = 29.7 bits (67), Expect = 1.7
 Identities = 28/83 (33%), Positives = 48/83 (57%), Gaps = 1/83 (1%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEE-EEEEEKKEEDEIEKEEGENEEEEEEEVVK 229
           EE ++E + +     E EE +++ K+ EE  E+ EEK E+ E E EE   E+ E  ++++
Sbjct: 301 EELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLE 360

Query: 230 QKKNEEEEEEEEVLIKLEEEEEN 252
           ++  E EE  EE+  +LE+  E 
Sbjct: 361 ERLKELEERLEELEKELEKALER 383



 Score = 29.7 bits (67), Expect = 1.8
 Identities = 22/80 (27%), Positives = 40/80 (50%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQK 231
            E +   E  EE+ E  EE++++  + EEE E  +   +E+E+   + +  EE     ++
Sbjct: 277 RELERLLEELEEKIERLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEE 336

Query: 232 KNEEEEEEEEVLIKLEEEEE 251
           K E+ E E E L + + E  
Sbjct: 337 KLEKLESELEELAEEKNELA 356



 Score = 29.3 bits (66), Expect = 1.9
 Identities = 21/88 (23%), Positives = 45/88 (51%), Gaps = 1/88 (1%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQ-KKNKEEEEEEEEKKEEDEIEKEEGENEEEEE 224
           E +   +EK +  +++EE  + E+ +++ K+  EE      +KEE E  +E  +  +++ 
Sbjct: 543 EELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKL 602

Query: 225 EEVVKQKKNEEEEEEEEVLIKLEEEEEN 252
           +E+ ++    EE  +   L + E E E 
Sbjct: 603 KELEERLSQLEELLQSLELSEAENELEE 630



 Score = 29.3 bits (66), Expect = 2.1
 Identities = 19/87 (21%), Positives = 37/87 (42%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           E +  E E+K E      E  EE + K +  + +EE  + +    E+++   E       
Sbjct: 525 EALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRLLRTR 584

Query: 226 EVVKQKKNEEEEEEEEVLIKLEEEEEN 252
           +   ++  E  +E ++ L +LEE    
Sbjct: 585 KEELEELRERLKELKKKLKELEERLSQ 611



 Score = 29.3 bits (66), Expect = 2.3
 Identities = 24/79 (30%), Positives = 39/79 (49%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
            E +  E E  +   EE  E+++   + EE+ E  ++ E EIE+ E E E         +
Sbjct: 259 LEIESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELEEELEGLRALLEELE 318

Query: 231 KKNEEEEEEEEVLIKLEEE 249
           +  E+ +  EE L KLEE+
Sbjct: 319 ELLEKLKSLEERLEKLEEK 337



 Score = 29.0 bits (65), Expect = 2.7
 Identities = 30/87 (34%), Positives = 44/87 (50%), Gaps = 3/87 (3%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           +  A   E+ EE E E  E EEE     +  +  +EE EEK E+ E   EE    EE +E
Sbjct: 494 KEEAELREEIEELEKELRELEEELIELLELEEALKEELEEKLEKLENLLEE---LEELKE 550

Query: 226 EVVKQKKNEEEEEEEEVLIKLEEEEEN 252
           ++  Q+  EE  + E+ L +L+E  E 
Sbjct: 551 KLQLQQLKEELRQLEDRLQELKELLEE 577



 Score = 29.0 bits (65), Expect = 3.1
 Identities = 25/82 (30%), Positives = 42/82 (51%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           E  ++ +  + E E+ E E +K ++ +  E E   ++ E++IE+ E    E EE E   +
Sbjct: 249 ERLEELKARLLEIESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELEEELE 308

Query: 231 KKNEEEEEEEEVLIKLEEEEEN 252
                 EE EE+L KL+  EE 
Sbjct: 309 GLRALLEELEELLEKLKSLEER 330



 Score = 28.6 bits (64), Expect = 3.8
 Identities = 29/81 (35%), Positives = 41/81 (50%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           EE K    EIE  E E  +  +++  + E   EE +++ + +E+ E E EE EEE    +
Sbjct: 252 EELKARLLEIESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELEEELEGLR 311

Query: 231 KKNEEEEEEEEVLIKLEEEEE 251
              EE EE  E L  LEE  E
Sbjct: 312 ALLEELEELLEKLKSLEERLE 332



 Score = 28.6 bits (64), Expect = 3.9
 Identities = 27/86 (31%), Positives = 41/86 (47%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           E++  E  K  E+E+ E E   EE  ++ +  EE E E E+ EE+         E EE  
Sbjct: 262 ESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELEEELEGLRALLEELEELL 321

Query: 226 EVVKQKKNEEEEEEEEVLIKLEEEEE 251
           E +K  +   E+ EE++     E EE
Sbjct: 322 EKLKSLEERLEKLEEKLEKLESELEE 347



 Score = 28.6 bits (64), Expect = 3.9
 Identities = 30/83 (36%), Positives = 46/83 (55%), Gaps = 1/83 (1%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDE-IEKEEGENEEEEEEEVVK 229
           EEE KE  ++EE + E+EEE  +++ +  EE   E +EE E +E+ +    E E  E+  
Sbjct: 209 EEELKELKKLEEIQEEQEEEELEQEIEALEERLAELEEEKERLEELKARLLEIESLELEA 268

Query: 230 QKKNEEEEEEEEVLIKLEEEEEN 252
            K  EEE  E E L++  EE+  
Sbjct: 269 LKIREEELRELERLLEELEEKIE 291



 Score = 28.2 bits (63), Expect = 5.1
 Identities = 20/75 (26%), Positives = 37/75 (49%)

Query: 167 NIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEE 226
               EE  +E  E   E +   EE++++  + E+E EE ++E +E+E+E  + EE+  + 
Sbjct: 384 LKQLEEAIQELKEELAELSAALEEIQEELEELEKELEELERELEELEEEIKKLEEQINQL 443

Query: 227 VVKQKKNEEEEEEEE 241
             K+    E     E
Sbjct: 444 ESKELMIAELAGAGE 458



 Score = 27.8 bits (62), Expect = 7.0
 Identities = 27/85 (31%), Positives = 44/85 (51%)

Query: 167 NIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEE 226
            +    +  E  E E E  E EEE++ +  K   + E E+  +  +E+ E + EE E E 
Sbjct: 611 QLEELLQSLELSEAENELEEAEEELESELEKLNLQAELEELLQAALEELEEKVEELEAEI 670

Query: 227 VVKQKKNEEEEEEEEVLIKLEEEEE 251
             + ++ E EE+ EE L +LE+ EE
Sbjct: 671 RRELQRIENEEQLEEKLEELEQLEE 695



 Score = 27.4 bits (61), Expect = 9.4
 Identities = 24/87 (27%), Positives = 41/87 (47%)

Query: 165 WENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEE 224
              +  E E+  E     EE  E+ E K +K + E EE  E+K E     EE   E EE 
Sbjct: 310 LRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLEERLKELEER 369

Query: 225 EEVVKQKKNEEEEEEEEVLIKLEEEEE 251
            E ++++  +  E  +++   ++E +E
Sbjct: 370 LEELEKELEKALERLKQLEEAIQELKE 396


>gnl|CDD|220605 pfam10156, Med17, Subunit 17 of Mediator complex.  This Mediator
           complex subunit was formerly known as Srb4 in yeasts or
           Trap80 in Drosophila and human. The Med17 subunit is
           located within the head domain and is essential for cell
           viability to the extent that a mutant strain of
           cerevisiae lacking it shows all RNA polymerase
           II-dependent transcription ceasing at non-permissive
           temperatures.
          Length = 454

 Score = 34.6 bits (80), Expect = 0.045
 Identities = 14/61 (22%), Positives = 31/61 (50%), Gaps = 1/61 (1%)

Query: 173 EKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKK 232
           E+    ++ EE   EE   K+    +  EE +E+++++E + ++ E  ++  EE  K + 
Sbjct: 42  ERGSFRDLTEESLREEIA-KEAAKIDFSEESDEEEDDEEDDNDDSEENKDTVEEFPKARD 100

Query: 233 N 233
            
Sbjct: 101 E 101



 Score = 33.1 bits (76), Expect = 0.13
 Identities = 13/52 (25%), Positives = 28/52 (53%), Gaps = 2/52 (3%)

Query: 185 NEEE--EEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNE 234
            EE   EE+ ++  K +  EE +++E+DE +  +   E ++  E   + ++E
Sbjct: 50  TEESLREEIAKEAAKIDFSEESDEEEDDEEDDNDDSEENKDTVEEFPKARDE 101



 Score = 32.7 bits (75), Expect = 0.16
 Identities = 15/47 (31%), Positives = 24/47 (51%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEI 212
           E IA E  K +  E  +EE ++EE+      + ++  EE  K  DE+
Sbjct: 56  EEIAKEAAKIDFSEESDEEEDDEEDDNDDSEENKDTVEEFPKARDEV 102



 Score = 30.4 bits (69), Expect = 0.86
 Identities = 16/44 (36%), Positives = 23/44 (52%)

Query: 198 EEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEE 241
           EE   EE  KE  +I+  E  +EEE++EE       E ++  EE
Sbjct: 51  EESLREEIAKEAAKIDFSEESDEEEDDEEDDNDDSEENKDTVEE 94



 Score = 28.5 bits (64), Expect = 3.3
 Identities = 11/44 (25%), Positives = 23/44 (52%)

Query: 205 EKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEE 248
           E+   +EI KE  + +  EE +  +  + ++ ++ EE    +EE
Sbjct: 51  EESLREEIAKEAAKIDFSEESDEEEDDEEDDNDDSEENKDTVEE 94


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 34.3 bits (79), Expect = 0.046
 Identities = 20/92 (21%), Positives = 39/92 (42%), Gaps = 2/92 (2%)

Query: 161 IFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENE 220
           I  F+     + + +    I        +E+    +  EEE +       + +KE+  NE
Sbjct: 8   ILSFFSGTTQKSKLQPISYIYSNVLVLSKEILSTFS--EEENKVATTSTKKDKKEDKNNE 65

Query: 221 EEEEEEVVKQKKNEEEEEEEEVLIKLEEEEEN 252
            +++ E  K+KK E++E + E   KL  +   
Sbjct: 66  SKKKSEKKKKKKKEKKEPKSEGETKLGFKTPK 97



 Score = 30.1 bits (68), Expect = 1.2
 Identities = 19/93 (20%), Positives = 44/93 (47%), Gaps = 3/93 (3%)

Query: 158 SPEIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEG 217
              +  F E          + + +E++N E ++ K +K K++++E++E K E E +    
Sbjct: 36  KEILSTFSEEENKVATTSTKKDKKEDKNNESKK-KSEKKKKKKKEKKEPKSEGETKLGFK 94

Query: 218 ENEEEEEEEVV--KQKKNEEEEEEEEVLIKLEE 248
             ++ ++ +    K K NE+ +     + +L E
Sbjct: 95  TPKKSKKTKKKPPKPKPNEDVDNAFNKIAELAE 127



 Score = 29.7 bits (67), Expect = 1.5
 Identities = 17/92 (18%), Positives = 42/92 (45%)

Query: 160 EIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGEN 219
            I   + N+    ++      EEE        K+ K +++  E ++K E+ + +K+E + 
Sbjct: 23  PISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKE 82

Query: 220 EEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEE 251
            + E E  +  K  ++ ++ ++   K +  E+
Sbjct: 83  PKSEGETKLGFKTPKKSKKTKKKPPKPKPNED 114


>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
          Length = 646

 Score = 34.5 bits (80), Expect = 0.047
 Identities = 25/84 (29%), Positives = 39/84 (46%), Gaps = 10/84 (11%)

Query: 173 EKKEEDEIEEE---------ENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKE-EGENEEE 222
            K+  DE+EEE         E EEE    +   + +EE EE +KE + + +E +   ++ 
Sbjct: 186 LKELSDEVEEELKKLGFERLELEEEGTPSELIREIKEELEEIEKERESLLEELKELAKKY 245

Query: 223 EEEEVVKQKKNEEEEEEEEVLIKL 246
            EE +   +  E E E  E L K 
Sbjct: 246 LEELLALYEYLEIELERAEALSKF 269



 Score = 33.0 bits (76), Expect = 0.17
 Identities = 23/70 (32%), Positives = 35/70 (50%), Gaps = 2/70 (2%)

Query: 165 WENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEI--EKEEGENEEE 222
           +E +  EEE    + I E + E EE  K++++  EE +E  KK  +E+    E  E E E
Sbjct: 202 FERLELEEEGTPSELIREIKEELEEIEKERESLLEELKELAKKYLEELLALYEYLEIELE 261

Query: 223 EEEEVVKQKK 232
             E + K  K
Sbjct: 262 RAEALSKFLK 271



 Score = 31.4 bits (72), Expect = 0.50
 Identities = 17/82 (20%), Positives = 34/82 (41%), Gaps = 6/82 (7%)

Query: 172 EEKKEEDEIEEEENEEEE--EVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVK 229
           E+ KEE   E          ++ +  +K      +       + +E+ +   +  EE++K
Sbjct: 34  EDLKEELSNERLRKLRSLLTKLSEALDKLRSYLPKLNP----LREEKKKVSVKSLEELIK 89

Query: 230 QKKNEEEEEEEEVLIKLEEEEE 251
             + E E+ E+E+    EE  E
Sbjct: 90  DVEEELEKIEKEIKELEEEISE 111



 Score = 28.7 bits (65), Expect = 3.3
 Identities = 22/80 (27%), Positives = 34/80 (42%), Gaps = 4/80 (5%)

Query: 173 EKKEEDEIEEEENEEEEEVKQKKNKEE----EEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
           E K E+   E + E  E +   K          +E   + E+E++K   E  E EEE   
Sbjct: 154 EDKLEELKLESDVENVEYISTDKGYVYVVVVVLKELSDEVEEELKKLGFERLELEEEGTP 213

Query: 229 KQKKNEEEEEEEEVLIKLEE 248
            +   E +EE EE+  + E 
Sbjct: 214 SELIREIKEELEEIEKERES 233



 Score = 28.0 bits (63), Expect = 6.3
 Identities = 15/48 (31%), Positives = 26/48 (54%), Gaps = 4/48 (8%)

Query: 170 GEEEKKEEDEIEEEENEE----EEEVKQKKNKEEEEEEEEKKEEDEIE 213
            +   K  +E+ ++  EE    E+E+K+ + +  E E E K+ E EIE
Sbjct: 77  KKVSVKSLEELIKDVEEELEKIEKEIKELEEEISELENEIKELEQEIE 124


>gnl|CDD|215595 PLN03130, PLN03130, ABC transporter C family member; Provisional.
          Length = 1622

 Score = 34.7 bits (80), Expect = 0.048
 Identities = 35/133 (26%), Positives = 55/133 (41%), Gaps = 22/133 (16%)

Query: 104 ELPEKVDIIVSEWMGFYLLHESMIDSVIFARDKFLKPEGVMYPYKCILHSAPSYSPEIFK 163
           EL  K  ++V+  + F     S +D +I   +  +K EG    Y+ + ++ P +     K
Sbjct: 787 ELRGKTRVLVTNQLHFL----SQVDRIILVHEGMIKEEGT---YEELSNNGPLFQ----K 835

Query: 164 FWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGEN---- 219
             EN AG+ E     E  EE  EEE++    K          KK+    +K +       
Sbjct: 836 LMEN-AGKME-----EYVEENGEEEDDQTSSKPVANGNANNLKKDSSSKKKSKEGKSVLI 889

Query: 220 -EEEEEEEVVKQK 231
            +EE E  VV  K
Sbjct: 890 KQEERETGVVSWK 902


>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740).  This
           family consists of several uncharacterized plant
           proteins of unknown function.
          Length = 565

 Score = 34.6 bits (79), Expect = 0.049
 Identities = 26/75 (34%), Positives = 35/75 (46%), Gaps = 6/75 (8%)

Query: 165 WENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEE 224
           W     ++E+     I   E + E       +   EEEEE + EEDE      E  E+E 
Sbjct: 106 WSLFHDDDEENLPSSIAPPEIDPEPRKPIVPDLVLEEEEEVEMEEDE------EYYEKEP 159

Query: 225 EEVVKQKKNEEEEEE 239
            +VV +K  EEEEEE
Sbjct: 160 GKVVDEKSEEEEEEE 174



 Score = 34.2 bits (78), Expect = 0.059
 Identities = 21/44 (47%), Positives = 27/44 (61%), Gaps = 4/44 (9%)

Query: 176 EEDEIEEEENEEEEEVKQKK----NKEEEEEEEEKKEEDEIEKE 215
           EE+E+E EE+EE  E +  K      EEEEEEE K  +D I+ E
Sbjct: 142 EEEEVEMEEDEEYYEKEPGKVVDEKSEEEEEEELKTMKDFIDLE 185



 Score = 30.4 bits (68), Expect = 1.0
 Identities = 19/47 (40%), Positives = 27/47 (57%), Gaps = 7/47 (14%)

Query: 183 EENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVK 229
            E EEE E+       EE+EE  +KE  ++  E+ E EEEEE + +K
Sbjct: 140 LEEEEEVEM-------EEDEEYYEKEPGKVVDEKSEEEEEEELKTMK 179



 Score = 29.2 bits (65), Expect = 2.4
 Identities = 17/41 (41%), Positives = 26/41 (63%), Gaps = 4/41 (9%)

Query: 172 EEKKEEDEIEEEENEEEEE----VKQKKNKEEEEEEEEKKE 208
            E++EE E+EE+E   E+E    V +K  +EEEEE +  K+
Sbjct: 140 LEEEEEVEMEEDEEYYEKEPGKVVDEKSEEEEEEELKTMKD 180



 Score = 28.5 bits (63), Expect = 4.4
 Identities = 18/61 (29%), Positives = 32/61 (52%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQK 231
           +   EE+E  E E +EE   K+     +E+ EEE++EE +  K+  + E + ++  VK  
Sbjct: 137 DLVLEEEEEVEMEEDEEYYEKEPGKVVDEKSEEEEEEELKTMKDFIDLESQTKKPSVKDN 196

Query: 232 K 232
            
Sbjct: 197 G 197


>gnl|CDD|240370 PTZ00342, PTZ00342, acyl-CoA synthetase; Provisional.
          Length = 746

 Score = 34.7 bits (80), Expect = 0.049
 Identities = 13/64 (20%), Positives = 28/64 (43%), Gaps = 1/64 (1%)

Query: 184 ENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKK-NEEEEEEEEV 242
             EE+       N   +  +EE+K  D   + E  +    E +  K +K  + +E+ +++
Sbjct: 219 NKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEYDKEKLEKIKDLKEKAKKL 278

Query: 243 LIKL 246
            I +
Sbjct: 279 GISI 282



 Score = 29.3 bits (66), Expect = 2.2
 Identities = 13/67 (19%), Positives = 29/67 (43%), Gaps = 5/67 (7%)

Query: 191 VKQKKNKEEEEEEEEKKE-----EDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIK 245
           +K K+    +EE+              ++E+  N+   E E +     E ++E+ E +  
Sbjct: 211 IKSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEYDKEKLEKIKD 270

Query: 246 LEEEEEN 252
           L+E+ + 
Sbjct: 271 LKEKAKK 277



 Score = 28.1 bits (63), Expect = 5.0
 Identities = 16/63 (25%), Positives = 30/63 (47%), Gaps = 5/63 (7%)

Query: 174 KKEEDEIEEEENEEEEEVKQKKNKEEEEEEEE-----KKEEDEIEKEEGENEEEEEEEVV 228
           K +E  I +EE      V    NK  +EE++      + E+  +   E + E+ E+ + +
Sbjct: 212 KSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEYDKEKLEKIKDL 271

Query: 229 KQK 231
           K+K
Sbjct: 272 KEK 274



 Score = 28.1 bits (63), Expect = 5.9
 Identities = 14/60 (23%), Positives = 23/60 (38%)

Query: 167 NIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEE 226
           NI  EE+    +         +EE K      E E+      E + EK E   + +E+ +
Sbjct: 217 NINKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEYDKEKLEKIKDLKEKAK 276


>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
          Length = 330

 Score = 34.1 bits (79), Expect = 0.051
 Identities = 18/43 (41%), Positives = 26/43 (60%)

Query: 175 KEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEG 217
            ++D ++EE  E      Q    EEEEEEEE++EE+E  +EE 
Sbjct: 279 ADKDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEEEA 321



 Score = 33.7 bits (78), Expect = 0.072
 Identities = 13/46 (28%), Positives = 24/46 (52%)

Query: 168 IAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIE 213
           +A ++   EE +       +    ++++ +EEEEEEEE  EE+   
Sbjct: 278 LADKDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEEEAAA 323



 Score = 32.5 bits (75), Expect = 0.17
 Identities = 14/41 (34%), Positives = 27/41 (65%)

Query: 183 EENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEE 223
           +++  +EE+K+  + + +    E++EE+E E+EE E  EEE
Sbjct: 280 DKDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEEE 320



 Score = 31.8 bits (73), Expect = 0.27
 Identities = 15/46 (32%), Positives = 23/46 (50%)

Query: 168 IAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIE 213
           +A     K+  + E +E    +       +EEEEEEEE++EE   E
Sbjct: 274 LAAALADKDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEE 319



 Score = 30.6 bits (70), Expect = 0.79
 Identities = 14/42 (33%), Positives = 25/42 (59%), Gaps = 1/42 (2%)

Query: 181 EEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEE 222
           +++  +EE +       +    EEE++EE+E E+EE  +EEE
Sbjct: 280 DKDALDEELKEVLSAQAQAAAAEEEEEEEEE-EEEEEPSEEE 320



 Score = 28.7 bits (65), Expect = 2.6
 Identities = 17/48 (35%), Positives = 19/48 (39%), Gaps = 7/48 (14%)

Query: 194 KKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEE 241
                 +EE +E            E EEEEEEE       EEEE  EE
Sbjct: 279 ADKDALDEELKEVLSAQAQAAAAEEEEEEEEEE-------EEEEPSEE 319


>gnl|CDD|227509 COG5182, CUS1, Splicing factor 3b, subunit 2 [RNA processing and
           modification].
          Length = 429

 Score = 34.2 bits (78), Expect = 0.051
 Identities = 23/107 (21%), Positives = 42/107 (39%), Gaps = 4/107 (3%)

Query: 137 FLKPEGVMYPYKCILHSAPSYSPEIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKN 196
            +  EG +Y Y+      P +            GE  + E +EI  +E +    V + ++
Sbjct: 326 PIPLEGDVYGYQPPGWHEPLFEVGPETAE----GELLQFEAEEIYNDEEQYTRPVYENED 381

Query: 197 KEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVL 243
            EE  ++       E      E   +E+ +V K+ K + E+    VL
Sbjct: 382 FEERVDDVSNGGLAEFAPAAVEVAPKEDAKVTKRGKAKREKLLYSVL 428


>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
           membranes [Cell envelope biogenesis, outer membrane].
          Length = 244

 Score = 34.0 bits (78), Expect = 0.054
 Identities = 12/80 (15%), Positives = 29/80 (36%), Gaps = 1/80 (1%)

Query: 163 KFWENIAGEEEKKEEDEIEEEE-NEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEE 221
            F       E   EE + E E   E+ +   + +   E    + K++    +K +    +
Sbjct: 41  VFLLAAKVLEAPTEEPQPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPK 100

Query: 222 EEEEEVVKQKKNEEEEEEEE 241
            + +   K K   + + ++ 
Sbjct: 101 PKPKPKPKPKVKPQPKPKKP 120



 Score = 29.0 bits (65), Expect = 2.0
 Identities = 16/69 (23%), Positives = 28/69 (40%), Gaps = 4/69 (5%)

Query: 174 KKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKN 233
           K  E   EE + E E   +Q K       E E   E    K + + + E++ +  K K  
Sbjct: 47  KVLEAPTEEPQPEPEPPEEQPK----PPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPK 102

Query: 234 EEEEEEEEV 242
            + + + +V
Sbjct: 103 PKPKPKPKV 111



 Score = 27.8 bits (62), Expect = 4.7
 Identities = 10/72 (13%), Positives = 25/72 (34%), Gaps = 5/72 (6%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           E E  E     E+     E     +    + +E+ K E+   + +     + + +  VK 
Sbjct: 59  EPEPPE-----EQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKP 113

Query: 231 KKNEEEEEEEEV 242
           +   ++   +  
Sbjct: 114 QPKPKKPPSKTA 125


>gnl|CDD|224114 COG1193, COG1193, Mismatch repair ATPase (MutS family) [DNA
           replication, recombination, and repair].
          Length = 753

 Score = 34.2 bits (79), Expect = 0.054
 Identities = 17/74 (22%), Positives = 34/74 (45%), Gaps = 2/74 (2%)

Query: 176 EEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEE 235
             +E + E  EE+E +++   K EE  +E ++E +E+EK    +E E         K   
Sbjct: 491 IIEEAKTEFGEEKELLEELIEKLEEVRKELEEELEEVEK--LLDEVELLTGANSGGKTSL 548

Query: 236 EEEEEEVLIKLEEE 249
            E + ++++     
Sbjct: 549 LELKAQIVVLAHMG 562



 Score = 28.1 bits (63), Expect = 6.1
 Identities = 14/82 (17%), Positives = 30/82 (36%), Gaps = 1/82 (1%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNK-EEEEEEEEKKEEDEIEKEEGENEEEEEEEVVK 229
           EEE +E +++ +E           K    E + +        +     E + E  +EV  
Sbjct: 521 EEELEEVEKLLDEVELLTGANSGGKTSLLELKAQIVVLAHMGLPVPAEEAKVEAVDEVKF 580

Query: 230 QKKNEEEEEEEEVLIKLEEEEE 251
            KK +   +       L++++ 
Sbjct: 581 LKKKKGILDAGAFESTLKDKKN 602


>gnl|CDD|237622 PRK14140, PRK14140, heat shock protein GrpE; Provisional.
          Length = 191

 Score = 33.4 bits (77), Expect = 0.056
 Identities = 25/69 (36%), Positives = 37/69 (53%), Gaps = 4/69 (5%)

Query: 188 EEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEE----EEVL 243
            EE  ++  +E EE E E+  EDE+E+E  E E E E    +Q K  E E +    EE  
Sbjct: 1   MEEKNEQVEEEVEETEVEEAVEDEVEEETVEEESEAELLDEEQAKIAELEAKLDELEERY 60

Query: 244 IKLEEEEEN 252
           ++L+ + EN
Sbjct: 61  LRLQADFEN 69



 Score = 31.9 bits (73), Expect = 0.21
 Identities = 20/56 (35%), Positives = 34/56 (60%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEE 226
           E+ ++ E+E+EE E EE  E + ++   EEE E E  +E++ +  E E + +E EE
Sbjct: 3   EKNEQVEEEVEETEVEEAVEDEVEEETVEEESEAELLDEEQAKIAELEAKLDELEE 58



 Score = 30.4 bits (69), Expect = 0.61
 Identities = 18/54 (33%), Positives = 30/54 (55%), Gaps = 2/54 (3%)

Query: 170 GEEEKKEEDEIEEEENEEE--EEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEE 221
            E+ ++E +E E EE  E+  EE   ++  E E  +EE+ +  E+E +  E EE
Sbjct: 5   NEQVEEEVEETEVEEAVEDEVEEETVEEESEAELLDEEQAKIAELEAKLDELEE 58


>gnl|CDD|217049 pfam02459, Adeno_terminal, Adenoviral DNA terminal protein.  This
           protein is covalently attached to the terminii of
           replicating DNA in vivo.
          Length = 548

 Score = 34.2 bits (79), Expect = 0.059
 Identities = 28/80 (35%), Positives = 40/80 (50%), Gaps = 6/80 (7%)

Query: 176 EEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEE 235
                 EEE EEEEEV ++  +EEEEEEEE+  E+E+     E     EEE+    +  E
Sbjct: 300 PSPPEPEEEEEEEEEVPEE--EEEEEEEEERTFEEEVRATVAEAIRLLEEELTVSARRHE 357

Query: 236 ----EEEEEEVLIKLEEEEE 251
                 +  E+L +LE+   
Sbjct: 358 FFNFAVDFYELLQRLEDLGR 377



 Score = 27.7 bits (62), Expect = 6.6
 Identities = 17/37 (45%), Positives = 19/37 (51%), Gaps = 5/37 (13%)

Query: 154 APSYSPEIFKFWENIAGEEEKKEEDEIEEEENEEEEE 190
            P  SP      E    EEE++E  E EEEE EEEE 
Sbjct: 297 RPPPSPP-----EPEEEEEEEEEVPEEEEEEEEEEER 328


>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6.  In yeast, 15 Apg proteins
           coordinate the formation of autophagosomes. Autophagy is
           a bulk degradation process induced by starvation in
           eukaryotic cells. Apg6/Vps30p has two distinct functions
           in the autophagic process, either associated with the
           membrane or in a retrieval step of the carboxypeptidase
           Y sorting pathway.
          Length = 356

 Score = 34.1 bits (78), Expect = 0.061
 Identities = 25/88 (28%), Positives = 39/88 (44%), Gaps = 3/88 (3%)

Query: 167 NIAGEEEKKEEDEIEEEENEEEE--EVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEE-E 223
           +I  E    E  + E+E +  ++     + +N E    E    E DE++KEE    +E E
Sbjct: 28  DILLERLDSELRDAEKERDTYKQYLSKLESQNVEISNYEALDSELDELKKEEERLLDELE 87

Query: 224 EEEVVKQKKNEEEEEEEEVLIKLEEEEE 251
           E E      + E  E +E   +LE EE 
Sbjct: 88  ELEKEDDDLDGELVELQEEKEQLENEEL 115



 Score = 31.0 bits (70), Expect = 0.59
 Identities = 16/64 (25%), Positives = 32/64 (50%), Gaps = 3/64 (4%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEE---EEKKEEDEIEKEEGENEEEEEEEV 227
           E +  E    E  ++E +E  K+++   +E EE   E+   + E+ + + E E+ E EE+
Sbjct: 56  ESQNVEISNYEALDSELDELKKEEERLLDELEELEKEDDDLDGELVELQEEKEQLENEEL 115

Query: 228 VKQK 231
              +
Sbjct: 116 QYLR 119



 Score = 30.2 bits (68), Expect = 1.1
 Identities = 20/80 (25%), Positives = 42/80 (52%), Gaps = 1/80 (1%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQK 231
           E +  E    E  + E +E+K+++ +  +E EE +KE+D+++ E  E  +EE+E++  ++
Sbjct: 56  ESQNVEISNYEALDSELDELKKEEERLLDELEELEKEDDDLDGELVE-LQEEKEQLENEE 114

Query: 232 KNEEEEEEEEVLIKLEEEEE 251
                E        L+ E+ 
Sbjct: 115 LQYLREYNLFDRNNLQLEDN 134



 Score = 27.1 bits (60), Expect = 9.9
 Identities = 16/55 (29%), Positives = 28/55 (50%), Gaps = 4/55 (7%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENE 220
           + +  EEE+   DE+EE E E++          E +EE+E+ E +E++     N 
Sbjct: 73  DELKKEEERL-LDELEELEKEDD---DLDGELVELQEEKEQLENEELQYLREYNL 123


>gnl|CDD|173534 PTZ00341, PTZ00341, Ring-infected erythrocyte surface antigen;
            Provisional.
          Length = 1136

 Score = 34.4 bits (78), Expect = 0.064
 Identities = 29/86 (33%), Positives = 48/86 (55%)

Query: 166  ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
            EN+    E+  E+ +EE   E  EE  ++  +E  EE  E+  E+ +E+   EN EE +E
Sbjct: 965  ENVEENVEENVEENVEENVEENVEENVEENVEENVEENIEENVEENVEENIEENVEEYDE 1024

Query: 226  EVVKQKKNEEEEEEEEVLIKLEEEEE 251
            E V++ +   EE +EE + ++EE  E
Sbjct: 1025 ENVEEVEENVEEYDEENVEEIEENAE 1050



 Score = 34.0 bits (77), Expect = 0.083
 Identities = 29/80 (36%), Positives = 42/80 (52%)

Query: 171  EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
            EE  +E  E  +EEN EE E   ++N EE  EE  ++  +EIE+   EN EE  EE  ++
Sbjct: 1054 EENIEENIEEYDEENVEEIEENIEENIEENVEENVEENVEEIEENVEENVEENAEENAEE 1113

Query: 231  KKNEEEEEEEEVLIKLEEEE 250
               E  EE ++   +   EE
Sbjct: 1114 NAEENAEEYDDENPEEHNEE 1133



 Score = 33.6 bits (76), Expect = 0.094
 Identities = 29/86 (33%), Positives = 48/86 (55%)

Query: 166  ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
            EN+    E+  E+ +EE   E  EE  ++  +E  EE  E+ +E+ +E+ E   EE +EE
Sbjct: 981  ENVEENVEENVEENVEENVEENIEENVEENVEENIEENVEEYDEENVEEVEENVEEYDEE 1040

Query: 226  EVVKQKKNEEEEEEEEVLIKLEEEEE 251
             V + ++N EE  EE +   +EE +E
Sbjct: 1041 NVEEIEENAEENVEENIEENIEEYDE 1066



 Score = 33.2 bits (75), Expect = 0.12
 Identities = 30/71 (42%), Positives = 40/71 (56%)

Query: 171  EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
            EE  +E DE   EE EE  E   ++N EE  EE +++  +EIE+   EN EE  EE V++
Sbjct: 1031 EENVEEYDEENVEEIEENAEENVEENIEENIEEYDEENVEEIEENIEENIEENVEENVEE 1090

Query: 231  KKNEEEEEEEE 241
               E EE  EE
Sbjct: 1091 NVEEIEENVEE 1101



 Score = 32.8 bits (74), Expect = 0.17
 Identities = 32/87 (36%), Positives = 45/87 (51%)

Query: 166  ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
            EN+    E+  E+ IEE   E  EE  ++  +E +EE  E+ EE+  E +E   EE EE 
Sbjct: 989  ENVEENVEENVEENIEENVEENVEENIEENVEEYDEENVEEVEENVEEYDEENVEEIEEN 1048

Query: 226  EVVKQKKNEEEEEEEEVLIKLEEEEEN 252
                 ++N EE  EE     +EE EEN
Sbjct: 1049 AEENVEENIEENIEEYDEENVEEIEEN 1075



 Score = 32.1 bits (72), Expect = 0.28
 Identities = 29/86 (33%), Positives = 44/86 (51%)

Query: 166  ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
            EN+    E+  E+ +EE   E  EE  ++  +E  EE  E+  E+ +E+ + EN EE EE
Sbjct: 973  ENVEENVEENVEENVEENVEENVEENVEENIEENVEENVEENIEENVEEYDEENVEEVEE 1032

Query: 226  EVVKQKKNEEEEEEEEVLIKLEEEEE 251
             V +  +   EE EE     +EE  E
Sbjct: 1033 NVEEYDEENVEEIEENAEENVEENIE 1058



 Score = 32.1 bits (72), Expect = 0.32
 Identities = 33/82 (40%), Positives = 46/82 (56%), Gaps = 3/82 (3%)

Query: 171  EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
            EE  +E  E  EE  EE +E   ++ +E  EE  E+  E+ IE+ + EN EE EE +   
Sbjct: 1020 EEYDEENVEEVEENVEEYDEENVEEIEENAEENVEENIEENIEEYDEENVEEIEENI--- 1076

Query: 231  KKNEEEEEEEEVLIKLEEEEEN 252
            ++N EE  EE V   +EE EEN
Sbjct: 1077 EENIEENVEENVEENVEEIEEN 1098



 Score = 31.7 bits (71), Expect = 0.42
 Identities = 29/84 (34%), Positives = 44/84 (52%)

Query: 166  ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
            E+     E+  E+ +EE   E  EE  ++  +E  EE  E+  E+ IE+   EN EE  E
Sbjct: 957  EDAEENVEENVEENVEENVEENVEENVEENVEENVEENVEENVEENIEENVEENVEENIE 1016

Query: 226  EVVKQKKNEEEEEEEEVLIKLEEE 249
            E V++   E  EE EE + + +EE
Sbjct: 1017 ENVEEYDEENVEEVEENVEEYDEE 1040



 Score = 30.1 bits (67), Expect = 1.2
 Identities = 34/104 (32%), Positives = 52/104 (50%), Gaps = 2/104 (1%)

Query: 150  ILHSAPSYSPEIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEE 209
            I H     + E+    EN+    ++  E  IEE+  E  EE  ++  +E  EE  E+  E
Sbjct: 917  IAHEINLINKELKNQNENVPEHLKEHAEANIEEDAEENVEEDAEENVEENVEENVEENVE 976

Query: 210  DEIEKEEGENEEEEEEEVVKQ--KKNEEEEEEEEVLIKLEEEEE 251
            + +E+   EN EE  EE V++  ++N EE  EE V   +EE  E
Sbjct: 977  ENVEENVEENVEENVEENVEENVEENIEENVEENVEENIEENVE 1020



 Score = 30.1 bits (67), Expect = 1.3
 Identities = 27/84 (32%), Positives = 43/84 (51%)

Query: 166  ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
            E+     E+  E+ +EE   E  EE  ++  +E  EE  E+  E+ +E+   EN EE  E
Sbjct: 949  EDAEENVEEDAEENVEENVEENVEENVEENVEENVEENVEENVEENVEENVEENIEENVE 1008

Query: 226  EVVKQKKNEEEEEEEEVLIKLEEE 249
            E V++   E  EE +E  ++  EE
Sbjct: 1009 ENVEENIEENVEEYDEENVEEVEE 1032



 Score = 30.1 bits (67), Expect = 1.4
 Identities = 30/89 (33%), Positives = 47/89 (52%), Gaps = 3/89 (3%)

Query: 166  ENIAGEEEKKEEDEIEE---EENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEE 222
            EN+    E+  E+ +EE   E  EE  E   ++N EE EE  E+ +E+ +E+ E   EE 
Sbjct: 993  ENVEENVEENIEENVEENVEENIEENVEEYDEENVEEVEENVEEYDEENVEEIEENAEEN 1052

Query: 223  EEEEVVKQKKNEEEEEEEEVLIKLEEEEE 251
             EE + +  +  +EE  EE+   +EE  E
Sbjct: 1053 VEENIEENIEEYDEENVEEIEENIEENIE 1081



 Score = 30.1 bits (67), Expect = 1.4
 Identities = 28/87 (32%), Positives = 42/87 (48%)

Query: 166  ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
            EN+    E+  E+ +EE   E  EE  ++  +E  EE +E+  E+  E  E  +EE  EE
Sbjct: 985  ENVEENVEENVEENVEENIEENVEENVEENIEENVEEYDEENVEEVEENVEEYDEENVEE 1044

Query: 226  EVVKQKKNEEEEEEEEVLIKLEEEEEN 252
                 ++N EE  EE +    EE  E 
Sbjct: 1045 IEENAEENVEENIEENIEEYDEENVEE 1071



 Score = 29.4 bits (65), Expect = 2.2
 Identities = 32/82 (39%), Positives = 44/82 (53%), Gaps = 2/82 (2%)

Query: 172  EEKKEEDEIEEEEN-EEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
            EE +E  E   EEN EE  E   ++N EE EE  E+  E+ +E+   EN EE EE V + 
Sbjct: 1043 EEIEENAEENVEENIEENIEEYDEENVEEIEENIEENIEENVEENVEENVEEIEENVEEN 1102

Query: 231  -KKNEEEEEEEEVLIKLEEEEE 251
             ++N EE  EE      EE ++
Sbjct: 1103 VEENAEENAEENAEENAEEYDD 1124


>gnl|CDD|221179 pfam11711, Tim54, Inner membrane protein import complex subunit
           Tim54.  Mitochondrial function depends on the import of
           hundreds of different proteins synthesised in the
           cytosol. Protein import is a multi-step pathway which
           includes the binding of precursor proteins to surface
           receptors, translocation of the precursor across one or
           both mitochondrial membranes, and folding and assembly
           of the imported protein inside the mitochondrion. Most
           precursor proteins carry amino-terminal targeting
           signals, called pre-sequences, and are imported into
           mitochondria via import complexes located in both the
           outer and the inner membrane (IM). The IM complex, TIM,
           is made up of at least two proteins which mediate
           translocation of proteins into the matrix by removing
           their signal peptide and another pair of proteins, Tim54
           and Tim22, that insert the polytopic proteins, that
           carry internal targetting information, into the inner
           membrane.
          Length = 377

 Score = 33.9 bits (78), Expect = 0.064
 Identities = 18/64 (28%), Positives = 24/64 (37%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           E   G  +  E  E   +E   E EV+     E   E  E+  E   E+ E   EEE  +
Sbjct: 189 EGWLGPLDPPEPPEPTVDEAAPETEVEATPAAESPAEPAEETAETTPEETEDAPEEENNK 248

Query: 226 EVVK 229
            V  
Sbjct: 249 PVKP 252


>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 156 to 174 amino acids in length. This domain is
           found associated with pfam07780, pfam01728.
          Length = 154

 Score = 33.0 bits (76), Expect = 0.067
 Identities = 18/46 (39%), Positives = 34/46 (73%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEE 216
           E+E++EE+E+E EE +EEE++ +   KE  + + EK+ E+E +++E
Sbjct: 100 EKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKE 145



 Score = 32.3 bits (74), Expect = 0.099
 Identities = 15/45 (33%), Positives = 27/45 (60%)

Query: 197 KEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEE 241
            +++E+EEE++EE E+E+ + E + +E  E    K   E+  E E
Sbjct: 96  LDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENE 140



 Score = 31.9 bits (73), Expect = 0.17
 Identities = 14/64 (21%), Positives = 33/64 (51%)

Query: 161 IFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENE 220
           + K+ + +       ++++ EEEE E E E   ++ + +E  E+E  +    ++ E E +
Sbjct: 83  LLKWRKKVRKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERK 142

Query: 221 EEEE 224
           ++E 
Sbjct: 143 QKEI 146



 Score = 30.7 bits (70), Expect = 0.37
 Identities = 16/47 (34%), Positives = 29/47 (61%)

Query: 170 GEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEE 216
            EEE+ E +E++EEE  +E   K+    + E+  E ++++ EI KE+
Sbjct: 104 EEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKEILKEQ 150



 Score = 29.6 bits (67), Expect = 0.87
 Identities = 16/57 (28%), Positives = 37/57 (64%), Gaps = 4/57 (7%)

Query: 187 EEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVL 243
             ++ ++++ +EEE E EE  EE++I++      E+E  ++ ++K+ E E +++E+L
Sbjct: 95  GLDKKEKEEEEEEEVEVEELDEEEQIDEL----LEKELAKLKREKRRENERKQKEIL 147



 Score = 29.2 bits (66), Expect = 1.2
 Identities = 16/50 (32%), Positives = 32/50 (64%)

Query: 192 KQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEE 241
           K +K    +++E+E++EE+E+E EE + EE+ +E + K+    + E+  E
Sbjct: 89  KVRKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRE 138



 Score = 27.3 bits (61), Expect = 5.2
 Identities = 15/56 (26%), Positives = 33/56 (58%)

Query: 197 KEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEEN 252
           K    +++EK+EE+E E E  E +EEE+ + + +K+  + + E+    + +++E  
Sbjct: 92  KLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKEIL 147


>gnl|CDD|227458 COG5129, MAK16, Nuclear protein with HMG-like acidic region
           [General function prediction only].
          Length = 303

 Score = 33.9 bits (77), Expect = 0.068
 Identities = 22/81 (27%), Positives = 43/81 (53%), Gaps = 2/81 (2%)

Query: 162 FKFWENIAGEEEKK--EEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGEN 219
              W   A E EK+  E++   EEE E + E++   +  E+E+ ++K  E  +  ++   
Sbjct: 183 EHLWNKAATEREKRQDEKERYVEEEEESDTELEAVTDDSEKEKTKKKDLEKWLGSDQSME 242

Query: 220 EEEEEEEVVKQKKNEEEEEEE 240
             E EEE   + +++E+E+E+
Sbjct: 243 TSESEEEESSESESDEDEDED 263


>gnl|CDD|222792 PHA00435, PHA00435, capsid assembly protein.
          Length = 306

 Score = 33.7 bits (77), Expect = 0.068
 Identities = 22/65 (33%), Positives = 33/65 (50%), Gaps = 5/65 (7%)

Query: 179 EIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEE 238
           E+ E E  ++         E++E   E +  ++ E+EE E  EE+EEE     + EEE E
Sbjct: 46  ELAEPETSDDPYGNPDPFGEDDEGRIEVRISEDGEEEEVEEGEEDEEE-----EGEEESE 100

Query: 239 EEEVL 243
           E E L
Sbjct: 101 EFEPL 105



 Score = 33.3 bits (76), Expect = 0.086
 Identities = 21/66 (31%), Positives = 34/66 (51%), Gaps = 2/66 (3%)

Query: 177 EDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEE 236
           +D IE  E E  ++     N +   E++E + E  I ++  E E EE EE  +++  EE 
Sbjct: 42  DDAIELAEPETSDD--PYGNPDPFGEDDEGRIEVRISEDGEEEEVEEGEEDEEEEGEEES 99

Query: 237 EEEEEV 242
           EE E +
Sbjct: 100 EEFEPL 105



 Score = 30.6 bits (69), Expect = 0.69
 Identities = 28/81 (34%), Positives = 42/81 (51%), Gaps = 5/81 (6%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           E E  ++     +   E++E + +    E+ EEEE +E +E E+EEGE E EE E +   
Sbjct: 49  EPETSDDPYGNPDPFGEDDEGRIEVRISEDGEEEEVEEGEEDEEEEGEEESEEFEPL--- 105

Query: 231 KKNEEEEEEEEVLIKLEEEEE 251
              +  EE  E   +LEE EE
Sbjct: 106 --GDTPEELTEASEQLEEHEE 124



 Score = 29.8 bits (67), Expect = 1.2
 Identities = 32/92 (34%), Positives = 45/92 (48%), Gaps = 3/92 (3%)

Query: 164 FWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEE 223
           F E+  G  E +  ++ EEEE EE EE ++++ +EE EE E   +  E   E  E  EE 
Sbjct: 63  FGEDDEGRIEVRISEDGEEEEVEEGEEDEEEEGEEESEEFEPLGDTPEELTEASEQLEEH 122

Query: 224 EE---EVVKQKKNEEEEEEEEVLIKLEEEEEN 252
           EE    +V+Q        E    I+ E EEE 
Sbjct: 123 EEGFQAMVEQAVERGLSAETITRIQAEYEEEG 154


>gnl|CDD|224337 COG1419, FlhF, Flagellar GTP-binding protein [Cell motility and
           secretion].
          Length = 407

 Score = 33.9 bits (78), Expect = 0.073
 Identities = 17/82 (20%), Positives = 35/82 (42%), Gaps = 2/82 (2%)

Query: 167 NIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEE 226
            +  E  K +  E  + ++ E + +K    K+ EE +  KK E           + +E +
Sbjct: 56  AVDSELRKFQTREAAKVKDAEAQLLKDPAEKKREERKAAKKIERSTPSL--IERKTQEVK 113

Query: 227 VVKQKKNEEEEEEEEVLIKLEE 248
              ++  E   +E+  + +LEE
Sbjct: 114 DSGEEIAEMMRDEKVPIRELEE 135



 Score = 27.3 bits (61), Expect = 9.0
 Identities = 15/73 (20%), Positives = 26/73 (35%), Gaps = 3/73 (4%)

Query: 173 EKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKK 232
            KK E         + +EVK    +  E   +EK    E+E+   E     ++E+   + 
Sbjct: 94  AKKIERSTPSLIERKTQEVKDSGEEIAEMMRDEKVPIRELEEIPPEFVALYKQEI---QS 150

Query: 233 NEEEEEEEEVLIK 245
                   E+L  
Sbjct: 151 PTRLNLINELLRA 163


>gnl|CDD|222160 pfam13476, AAA_23, AAA domain. 
          Length = 204

 Score = 33.3 bits (76), Expect = 0.076
 Identities = 22/64 (34%), Positives = 37/64 (57%)

Query: 177 EDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEE 236
             E  EE   + +E K++  + E+E EE + E+D +EK   E E+++E E +K++  E  
Sbjct: 141 GQERLEELKFKRKEKKERLEELEKELEELEDEKDLLEKLLEEKEKKKELEELKEELEELL 200

Query: 237 EEEE 240
           EE E
Sbjct: 201 EELE 204



 Score = 27.9 bits (62), Expect = 4.5
 Identities = 17/61 (27%), Positives = 38/61 (62%)

Query: 192 KQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEE 251
           +++  + + + +E+K+  +E+EKE  E E+E++      ++ E+++E EE+  +LEE  E
Sbjct: 142 QERLEELKFKRKEKKERLEELEKELEELEDEKDLLEKLLEEKEKKKELEELKEELEELLE 201

Query: 252 N 252
            
Sbjct: 202 E 202



 Score = 27.9 bits (62), Expect = 4.5
 Identities = 23/64 (35%), Positives = 39/64 (60%)

Query: 187 EEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKL 246
            +E +++ K K +E++E  ++ E E+E+ E E +  E+    K+KK E EE +EE+   L
Sbjct: 141 GQERLEELKFKRKEKKERLEELEKELEELEDEKDLLEKLLEEKEKKKELEELKEELEELL 200

Query: 247 EEEE 250
           EE E
Sbjct: 201 EELE 204



 Score = 27.9 bits (62), Expect = 4.6
 Identities = 23/55 (41%), Positives = 33/55 (60%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
            + K++++ +EE E E EE   +K   E+  EE+EKK+E E  KEE E   EE E
Sbjct: 150 FKRKEKKERLEELEKELEELEDEKDLLEKLLEEKEKKKELEELKEELEELLEELE 204



 Score = 27.1 bits (60), Expect = 8.4
 Identities = 26/77 (33%), Positives = 46/77 (59%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           + +K+    +     E  EE+K K+ +++E  EE +KE +E+E E+   E+  EE+  K+
Sbjct: 128 KSDKEILPLLLYLGQERLEELKFKRKEKKERLEELEKELEELEDEKDLLEKLLEEKEKKK 187

Query: 231 KKNEEEEEEEEVLIKLE 247
           +  E +EE EE+L +LE
Sbjct: 188 ELEELKEELEELLEELE 204


>gnl|CDD|240413 PTZ00423, PTZ00423, glideosome-associated protein 45; Provisional.
          Length = 193

 Score = 33.2 bits (75), Expect = 0.079
 Identities = 24/82 (29%), Positives = 47/82 (57%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           +N A E ++++ DE+ E E  ++E  +  + K E+  EE + + +E  ++E ENEE++ +
Sbjct: 7   KNKAKEPKRRDIDELAEREKLKKEVEEIPEQKPEDIVEELEDQPEEPPEQEEENEEQKPK 66

Query: 226 EVVKQKKNEEEEEEEEVLIKLE 247
           E +     E +  +E+ L  LE
Sbjct: 67  EEIDYPIQENKSFDEKNLDDLE 88



 Score = 28.2 bits (62), Expect = 3.8
 Identities = 18/60 (30%), Positives = 33/60 (55%)

Query: 190 EVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEE 249
           + K+ K ++ +E  E +K + E+E+   +  E+  EE+  Q +   E+EEE    K +EE
Sbjct: 9   KAKEPKRRDIDELAEREKLKKEVEEIPEQKPEDIVEELEDQPEEPPEQEEENEEQKPKEE 68


>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
          Length = 508

 Score = 33.9 bits (78), Expect = 0.079
 Identities = 27/94 (28%), Positives = 50/94 (53%), Gaps = 3/94 (3%)

Query: 159 PEIFKFWENIAGEEEKKEE--DEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEE 216
            E+ +  E +  +EE+ +   ++++  EN+ EE  K    +E E EE EK+ ++E+ +  
Sbjct: 77  EELQREEERLVQKEEQLDARAEKLDNLENQLEEREKALSARELELEELEKQLDNELYRVA 136

Query: 217 GENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEE 250
           G   E+  + ++K     E EEE+   +K  EEE
Sbjct: 137 GLTPEQARKLLLKLLDA-ELEEEKAQRVKKIEEE 169



 Score = 32.8 bits (75), Expect = 0.15
 Identities = 19/93 (20%), Positives = 42/93 (45%), Gaps = 9/93 (9%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEG---------EN 219
              ++++E     EE   EEE + QK+ + +   E+    E+++E+ E          E 
Sbjct: 64  ERNQQRQEARREREELQREEERLVQKEEQLDARAEKLDNLENQLEEREKALSARELELEE 123

Query: 220 EEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEEN 252
            E++ +  + +      E+  ++L+KL + E  
Sbjct: 124 LEKQLDNELYRVAGLTPEQARKLLLKLLDAELE 156



 Score = 28.1 bits (63), Expect = 5.7
 Identities = 24/87 (27%), Positives = 44/87 (50%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           E     EEK E   +E +E    E  +Q++    E EE +++EE  ++KEE  +   E+ 
Sbjct: 41  EAQKEAEEKLEAALLEAKELLLRERNQQRQEARREREELQREEERLVQKEEQLDARAEKL 100

Query: 226 EVVKQKKNEEEEEEEEVLIKLEEEEEN 252
           + ++ +  E E+      ++LEE E+ 
Sbjct: 101 DNLENQLEEREKALSARELELEELEKQ 127



 Score = 27.8 bits (62), Expect = 7.2
 Identities = 29/103 (28%), Positives = 49/103 (47%), Gaps = 10/103 (9%)

Query: 150 ILHSAPSYSPEIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEE 209
           IL  A   + E  +     A E   +E ++  +E   E EE+++      EEE   +KEE
Sbjct: 38  ILQEAQKEAEEKLEAALLEAKELLLRERNQQRQEARREREELQR------EEERLVQKEE 91

Query: 210 D---EIEK-EEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEE 248
                 EK +  EN+ EE E+ +  ++ E EE E+++  +L  
Sbjct: 92  QLDARAEKLDNLENQLEEREKALSARELELEELEKQLDNELYR 134


>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
           domain.  This domain is found in a number of different
           types of plant proteins including NAM-like proteins.
          Length = 147

 Score = 32.7 bits (75), Expect = 0.084
 Identities = 21/86 (24%), Positives = 44/86 (51%), Gaps = 4/86 (4%)

Query: 170 GEEEKKEEDEIEEEENEEEEEVK----QKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
                   +E E+E++E   E K    +KK KE+   ++ K +++E EKE+ + E   + 
Sbjct: 38  SSPGSTSNEENEDEDDESTAESKRPEGRKKAKEKLRRDKLKAKKEEAEKEKEKEERFMKA 97

Query: 226 EVVKQKKNEEEEEEEEVLIKLEEEEE 251
               +K+  E E+++     ++EE++
Sbjct: 98  LAEAEKERAELEKKKAEAKLMKEEKK 123


>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family.  This family of proteins includes
           MND1 from S. cerevisiae. The mnd1 protein forms a
           complex with hop2 to promote homologous chromosome
           pairing and meiotic double-strand break repair.
          Length = 188

 Score = 33.0 bits (76), Expect = 0.084
 Identities = 17/71 (23%), Positives = 41/71 (57%)

Query: 182 EEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEE 241
           +  N+ +  +++ K + EE ++   + + +IEK +   EE EE   + ++  + E+E ++
Sbjct: 62  QALNKLKTRLEKLKKELEELKQRIAELQAQIEKLKKGREETEERTELLEELKQLEKELKK 121

Query: 242 VLIKLEEEEEN 252
           +  +LE+ E+N
Sbjct: 122 LKAELEKYEKN 132



 Score = 28.4 bits (64), Expect = 2.9
 Identities = 20/81 (24%), Positives = 42/81 (51%), Gaps = 10/81 (12%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE----- 225
           E+ KKE +E+++   E + ++++ K   EE EE  +  E E+++ E E ++ + E     
Sbjct: 72  EKLKKELEELKQRIAELQAQIEKLKKGREETEERTELLE-ELKQLEKELKKLKAELEKYE 130

Query: 226 ----EVVKQKKNEEEEEEEEV 242
               E +++ K E +  +E  
Sbjct: 131 KNDPERIEKLKEETKVAKEAA 151


>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain.  Axonemal
           dynein light chain proteins play a dynamic role in
           flagellar and cilia motility. Eukaryotic cilia and
           flagella are complex organelles consisting of a core
           structure, the axoneme, which is composed of nine
           microtubule doublets forming a cylinder that surrounds a
           pair of central singlet microtubules. This
           ultra-structural arrangement seems to be one of the most
           stable micro-tubular assemblies known and is responsible
           for the flagellar and ciliary movement of a large number
           of organisms ranging from protozoan to mammals. This
           light chain interacts directly with the N-terminal half
           of the heavy chains.
          Length = 189

 Score = 33.0 bits (76), Expect = 0.085
 Identities = 22/68 (32%), Positives = 35/68 (51%), Gaps = 5/68 (7%)

Query: 167 NIAGEEEKKE-EDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
            +  E+ K E E EI++ E E+EE  K+    E + E  EK+EE+E +      E+   +
Sbjct: 115 ALQAEQGKSELEQEIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQ----IEEKRHAD 170

Query: 226 EVVKQKKN 233
           E+   KK 
Sbjct: 171 EIAFLKKQ 178



 Score = 29.1 bits (66), Expect = 1.8
 Identities = 17/59 (28%), Positives = 34/59 (57%), Gaps = 7/59 (11%)

Query: 179 EIEEEENEEEEEVKQ--KKNKEEEEEEEE-KKEEDEIEKEEGENEEEEEEEVVKQKKNE 234
           + E+ ++E E+E+K+  ++ +E E+   E + + + IEK     EEEE +   K+  +E
Sbjct: 117 QAEQGKSELEQEIKKLEEEKEELEKRVAELEAKLEAIEK----REEEERQIEEKRHADE 171



 Score = 27.2 bits (61), Expect = 6.7
 Identities = 13/42 (30%), Positives = 27/42 (64%)

Query: 211 EIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEEN 252
            ++ E+G++E E+E + ++++K E E+   E+  KLE  E+ 
Sbjct: 115 ALQAEQGKSELEQEIKKLEEEKEELEKRVAELEAKLEAIEKR 156


>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
          Length = 259

 Score = 33.2 bits (76), Expect = 0.089
 Identities = 10/50 (20%), Positives = 29/50 (58%)

Query: 183 EENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKK 232
           +  +E+E        E+ +++EE + + + E EE +  +++ ++ +K+K+
Sbjct: 73  KPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKEKQ 122



 Score = 27.8 bits (62), Expect = 5.4
 Identities = 13/41 (31%), Positives = 24/41 (58%)

Query: 175 KEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKE 215
           K EDE E   +  E E  +KK + E ++++E +E   ++K+
Sbjct: 73  KPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKK 113



 Score = 27.4 bits (61), Expect = 6.1
 Identities = 14/57 (24%), Positives = 29/57 (50%)

Query: 191 VKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLE 247
            K +  KE      E ++  + E+ E + ++E EE    QKK ++  +E+++   L+
Sbjct: 72  QKPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKEKQLSSSLQ 128


>gnl|CDD|224112 COG1191, FliA, DNA-directed RNA polymerase specialized sigma
           subunit [Transcription].
          Length = 247

 Score = 33.1 bits (76), Expect = 0.089
 Identities = 21/86 (24%), Positives = 41/86 (47%), Gaps = 10/86 (11%)

Query: 177 EDEIEEEENEEEEEVKQKKNK----------EEEEEEEEKKEEDEIEKEEGENEEEEEEE 226
           ++EI EE   ++EE  +              E+  ++++   +D+IE  +   E+EE  E
Sbjct: 129 DEEIAEELGIDKEEYIEALLAINGSQLLSLDEDVLKDDDDDVDDQIENPDDGVEKEELLE 188

Query: 227 VVKQKKNEEEEEEEEVLIKLEEEEEN 252
           ++K+      E E+ VL+   +EE  
Sbjct: 189 ILKEAIEPLPEREKLVLVLRYKEELT 214


>gnl|CDD|235285 PRK04335, PRK04335, cell division protein ZipA; Provisional.
          Length = 313

 Score = 33.2 bits (76), Expect = 0.089
 Identities = 19/77 (24%), Positives = 37/77 (48%), Gaps = 2/77 (2%)

Query: 172 EEKKEEDEI--EEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVK 229
           + ++EE  I  + +   + E+V+ +  +  +EE  E+ E    +    E + EEE E+  
Sbjct: 104 KFEQEEAPIPVQAQSQPQPEKVEPQVEEPRDEEVLEEPEPVAAKVPMAEVQPEEETEIEV 163

Query: 230 QKKNEEEEEEEEVLIKL 246
            +  E + E E  +I L
Sbjct: 164 DEPEEPKPEPELDVIVL 180



 Score = 31.3 bits (71), Expect = 0.39
 Identities = 23/75 (30%), Positives = 41/75 (54%), Gaps = 1/75 (1%)

Query: 177 EDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEE 236
             E+EEEE++ E+E +     + + + + +K E ++E+   E   EE E V  +    E 
Sbjct: 95  GGELEEEEDKFEQE-EAPIPVQAQSQPQPEKVEPQVEEPRDEEVLEEPEPVAAKVPMAEV 153

Query: 237 EEEEEVLIKLEEEEE 251
           + EEE  I+++E EE
Sbjct: 154 QPEEETEIEVDEPEE 168


>gnl|CDD|235668 PRK06007, fliF, flagellar MS-ring protein; Reviewed.
          Length = 542

 Score = 33.7 bits (78), Expect = 0.091
 Identities = 27/88 (30%), Positives = 45/88 (51%), Gaps = 13/88 (14%)

Query: 167 NIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEE 226
            +  EEE  EE+   EE   EE+E         E   E +  +DE  +EE   EE + E+
Sbjct: 464 PLLPEEELAEEEAAAEEAALEEDE---------EAALEVELSDDEELEEEKAEEELKYED 514

Query: 227 VVKQKKNEEEEEEEEV--LIK--LEEEE 250
           ++K+ +   +E+ EEV  +++  L E+E
Sbjct: 515 LLKRLRELAKEDPEEVAQVLRTWLSEDE 542


>gnl|CDD|236267 PRK08451, PRK08451, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 535

 Score = 33.5 bits (77), Expect = 0.096
 Identities = 16/90 (17%), Positives = 36/90 (40%), Gaps = 9/90 (10%)

Query: 160 EIFKFWENIAGEEEKKEEDEIEEEENEEE-----EEVKQKKNKEEEEEEEEKKEEDEIEK 214
           E+F     I  ++  K  +   +   E +     +++  +   E  EEE +K +E+  E 
Sbjct: 442 EVFGKGAKIKIQKALKSAENPLQSLKEFKPSNEKKKIDTESTAEMLEEEAKKDDEEVQET 501

Query: 215 EEGENEEEEEEEV----VKQKKNEEEEEEE 240
           +  E  E +E  +    + ++  E    + 
Sbjct: 502 QLKEATELQEFMINNEDLIEEAKELFGIKS 531



 Score = 33.1 bits (76), Expect = 0.14
 Identities = 18/81 (22%), Positives = 39/81 (48%), Gaps = 5/81 (6%)

Query: 168 IAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEV 227
           + G+  K +  +  +      + +     KE +   E+KK + E   E  E E ++++E 
Sbjct: 443 VFGKGAKIKIQKALKSAENPLQSL-----KEFKPSNEKKKIDTESTAEMLEEEAKKDDEE 497

Query: 228 VKQKKNEEEEEEEEVLIKLEE 248
           V++ + +E  E +E +I  E+
Sbjct: 498 VQETQLKEATELQEFMINNED 518



 Score = 29.6 bits (67), Expect = 1.6
 Identities = 11/50 (22%), Positives = 21/50 (42%), Gaps = 3/50 (6%)

Query: 181 EEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           EE + ++EE  + +  +  E +E     ED IE+     E    + +   
Sbjct: 489 EEAKKDDEEVQETQLKEATELQEFMINNEDLIEE---AKELFGIKSIELL 535


>gnl|CDD|235302 PRK04456, PRK04456, acetyl-CoA decarbonylase/synthase complex
           subunit beta; Reviewed.
          Length = 463

 Score = 33.5 bits (77), Expect = 0.10
 Identities = 21/51 (41%), Positives = 34/51 (66%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEE 216
           + IA EE+ K+ +E+++   E+E  V ++   EEEEEEEE++EE+E    E
Sbjct: 373 DKIATEEDVKDIEELKKFLKEKEHPVVERWAAEEEEEEEEEEEEEEEPVAE 423


>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
          Length = 1068

 Score = 33.5 bits (77), Expect = 0.10
 Identities = 15/77 (19%), Positives = 27/77 (35%), Gaps = 4/77 (5%)

Query: 166  ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
             ++A  +E  E  E   E  +E  ++++      E  E    E + + +       E   
Sbjct: 934  SDVAVAQEVAEHAEPVVEPQDETADIEE----AAETAEVVVAEPEVVAQPAAPVVAEVAA 989

Query: 226  EVVKQKKNEEEEEEEEV 242
            EV      E E    +V
Sbjct: 990  EVETVTAVEPEVAPAQV 1006



 Score = 31.5 bits (72), Expect = 0.44
 Identities = 17/91 (18%), Positives = 41/91 (45%), Gaps = 11/91 (12%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQK 231
           + +      E  EN EE    +++ +++  E  E ++ +  EK   ++E+++     +Q+
Sbjct: 622 DTRDNRTRREGRENREENRRNRRQAQQQTAETRESQQAEVTEKARTQDEQQQAPRRERQR 681

Query: 232 KNEEE-----------EEEEEVLIKLEEEEE 251
           +  +E             EE+ + + E+EE 
Sbjct: 682 RRNDEKRQAQQEAKALNVEEQSVQETEQEER 712



 Score = 30.0 bits (68), Expect = 1.3
 Identities = 20/93 (21%), Positives = 42/93 (45%), Gaps = 13/93 (13%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEE----EEEKKEEDEIEKEEGENEEEEEEE 226
           E ++ E  E    ++E+++  ++++ +   +E    ++E K  +  E+   E E+EE  +
Sbjct: 655 ESQQAEVTEKARTQDEQQQAPRRERQRRRNDEKRQAQQEAKALNVEEQSVQETEQEERVQ 714

Query: 227 VV---------KQKKNEEEEEEEEVLIKLEEEE 250
            V          QK   E+   EE +  + EE 
Sbjct: 715 QVQPRRKQRQLNQKVRIEQSVAEEAVAPVVEET 747



 Score = 28.9 bits (65), Expect = 3.2
 Identities = 14/90 (15%), Positives = 34/90 (37%), Gaps = 8/90 (8%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKK-------NKEEEEEEEEKKEEDEIEKEEGENEE 221
           +G EE K +++    + E + E +Q +        ++  E  + +      E  E   E 
Sbjct: 581 SGGEETKPQEQ-PAPKAEAKPERQQDRRKPRQNNRRDRNERRDTRDNRTRREGRENREEN 639

Query: 222 EEEEEVVKQKKNEEEEEEEEVLIKLEEEEE 251
                  +Q+  E  E ++  + +    ++
Sbjct: 640 RRNRRQAQQQTAETRESQQAEVTEKARTQD 669



 Score = 27.7 bits (62), Expect = 8.3
 Identities = 13/72 (18%), Positives = 35/72 (48%), Gaps = 5/72 (6%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           +++  E  E ++ E  E+   + ++ +    E + ++ +++ + ++       EE+ V  
Sbjct: 647 QQQTAETRESQQAEVTEKARTQDEQQQAPRRERQRRRNDEKRQAQQEAKALNVEEQSV-- 704

Query: 231 KKNEEEEEEEEV 242
              +E E+EE V
Sbjct: 705 ---QETEQEERV 713


>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812).  This
           family consists of several eukaryotic proteins of
           unknown function.
          Length = 536

 Score = 33.6 bits (77), Expect = 0.10
 Identities = 19/77 (24%), Positives = 42/77 (54%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           +E + E + ++E+ +E E  +++  ++ +    + K+  +E+     ENEE EEE  +K+
Sbjct: 266 KEREAELEALQEQIDELESSIEEVLSEIKALASKIKQVNEELTTVRQENEELEEEYKIKK 325

Query: 231 KKNEEEEEEEEVLIKLE 247
           +  E   + E  + KL+
Sbjct: 326 RTVELLPDAENNVAKLQ 342



 Score = 27.8 bits (62), Expect = 6.6
 Identities = 24/99 (24%), Positives = 43/99 (43%), Gaps = 4/99 (4%)

Query: 158 SPEIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEG 217
             E  +     A E    E D  E+   E E E++  + + +E E   ++   EI+    
Sbjct: 239 RTEKVEQAATAAAEVPPAEMDTEEDRTKEREAELEALQEQIDELESSIEEVLSEIKALAS 298

Query: 218 ENEEEEEEEVVKQKKNEEEEEE----EEVLIKLEEEEEN 252
           + ++  EE    +++NEE EEE    +  +  L + E N
Sbjct: 299 KIKQVNEELTTVRQENEELEEEYKIKKRTVELLPDAENN 337


>gnl|CDD|222259 pfam13608, Potyvirid-P3, Protein P3 of Potyviral polyprotein.  This
           is the P3 protein section of the Potyviridae
           polyproteins. The function is not known except that the
           protein is essential to viral survival.
          Length = 445

 Score = 33.3 bits (77), Expect = 0.11
 Identities = 26/64 (40%), Positives = 34/64 (53%), Gaps = 8/64 (12%)

Query: 192 KQKKNKEEEEEEEEKKEEDEIEK-------EEGENEEEEEEEVVKQKKNEEEEEEEEVLI 244
           + +K K E  EE+E+KEE EI K       E     EEE  E ++ +KN E  E  E LI
Sbjct: 271 EHRKLKREAAEEKEEKEEKEIRKLYLYSKLEGKLPTEEEFLEYIE-EKNPELLETAEELI 329

Query: 245 KLEE 248
           + EE
Sbjct: 330 EEEE 333



 Score = 31.0 bits (71), Expect = 0.67
 Identities = 26/69 (37%), Positives = 35/69 (50%), Gaps = 4/69 (5%)

Query: 168 IAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEV 227
            A E+E+KEE EI +     + E K    +EE  E  E+K  + +E  E   E  EEEEV
Sbjct: 279 AAEEKEEKEEKEIRKLYLYSKLEGKLPT-EEEFLEYIEEKNPELLETAE---ELIEEEEV 334

Query: 228 VKQKKNEEE 236
             Q K  +E
Sbjct: 335 EHQAKTADE 343


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 33.4 bits (77), Expect = 0.11
 Identities = 11/40 (27%), Positives = 20/40 (50%), Gaps = 1/40 (2%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDE 211
            EKK E E E E+   +++ +    K +  + E KK + +
Sbjct: 421 AEKKAEKE-EAEKAAAKKKAEAAAKKAKGPDGETKKVDPD 459



 Score = 33.0 bits (76), Expect = 0.13
 Identities = 14/62 (22%), Positives = 30/62 (48%), Gaps = 2/62 (3%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEE--EEEEEEKKEEDEIEKEEGENEEEEEEEVVK 229
           E ++EE E       E +++++K+ K E   E+EE +K   + + E    + +  +   K
Sbjct: 395 EGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETK 454

Query: 230 QK 231
           + 
Sbjct: 455 KV 456



 Score = 31.8 bits (73), Expect = 0.31
 Identities = 21/76 (27%), Positives = 39/76 (51%), Gaps = 1/76 (1%)

Query: 177 EDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEE 236
           E E EE EN      ++KK ++++ + E+K E++E EK   + + E   +  K    E +
Sbjct: 395 EGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETK 454

Query: 237 EEEEEVL-IKLEEEEE 251
           + + + L  KL   E+
Sbjct: 455 KVDPDPLGEKLARTED 470



 Score = 29.9 bits (68), Expect = 1.5
 Identities = 16/67 (23%), Positives = 30/67 (44%), Gaps = 5/67 (7%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQK 231
              + E+E  E  N    E K+ + K+ + E++ +KEE      E    +++ E   K+ 
Sbjct: 392 LLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEE-----AEKAAAKKKAEAAAKKA 446

Query: 232 KNEEEEE 238
           K  + E 
Sbjct: 447 KGPDGET 453



 Score = 27.2 bits (61), Expect = 9.5
 Identities = 15/73 (20%), Positives = 31/73 (42%), Gaps = 2/73 (2%)

Query: 151 LHSAP--SYSPEIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKE 208
           LH  P  +   E      N++  E KK   +  + E + E+E  +K   +++ E   KK 
Sbjct: 387 LHDKPLLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKA 446

Query: 209 EDEIEKEEGENEE 221
           +    + +  + +
Sbjct: 447 KGPDGETKKVDPD 459


>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex.  This
           entry is characterized by proteins with alternating
           conserved and low-complexity regions. Bud13 together
           with Snu17p and a newly identified factor,
           Pml1p/Ylr016c, form a novel trimeric complex. called The
           RES complex, pre-mRNA retention and splicing complex.
           Subunits of this complex are not essential for viability
           of yeasts but they are required for efficient splicing
           in vitro and in vivo. Furthermore, inactivation of this
           complex causes pre-mRNA leakage from the nucleus. Bud13
           contains a unique, phylogenetically conserved C-terminal
           region of unknown function.
          Length = 141

 Score = 31.9 bits (73), Expect = 0.12
 Identities = 23/69 (33%), Positives = 41/69 (59%), Gaps = 2/69 (2%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNK--EEEEEEEEKKEEDEIEKEEGENEEEEEEEVVK 229
           EEK+EE E E+EE E +EE +++  K   ++EE E++ EE E  K +      ++E+  +
Sbjct: 15  EEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPLARYADDEDYDE 74

Query: 230 QKKNEEEEE 238
           + K +E  +
Sbjct: 75  ELKEQERWD 83



 Score = 31.1 bits (71), Expect = 0.27
 Identities = 23/69 (33%), Positives = 40/69 (57%), Gaps = 3/69 (4%)

Query: 182 EEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEE 241
           EE+ EE+E  K++K ++EE+E+E  K   + E+ E   +  EE E  K K      ++E+
Sbjct: 15  EEKREEKEREKEEKERKEEKEKEWGKGLVQKEERE---KRLEELEKAKNKPLARYADDED 71

Query: 242 VLIKLEEEE 250
              +L+E+E
Sbjct: 72  YDEELKEQE 80



 Score = 26.9 bits (60), Expect = 5.8
 Identities = 15/56 (26%), Positives = 33/56 (58%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEE 226
           +EEK+ ++E E+E  +   + ++++ + EE E+ + K       +E  +EE +E+E
Sbjct: 25  KEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPLARYADDEDYDEELKEQE 80


>gnl|CDD|221389 pfam12037, DUF3523, Domain of unknown function (DUF3523).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is typically
           between 257 to 277 amino acids in length. This domain is
           found associated with pfam00004. This domain has a
           conserved LER sequence motif.
          Length = 276

 Score = 32.8 bits (75), Expect = 0.12
 Identities = 22/81 (27%), Positives = 41/81 (50%), Gaps = 3/81 (3%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQK 231
             K+ + E+E++  + EE +K ++     +E   +  E+EI +   E  EEE E    ++
Sbjct: 116 ARKRYQKELEQQRRQNEELLKMQEESVLRQEAMRRATEEEILEMRRETIEEEAE---LER 172

Query: 232 KNEEEEEEEEVLIKLEEEEEN 252
           +N   + E E   + +EE EN
Sbjct: 173 ENIRAKIEAEARGRAKEEREN 193



 Score = 29.0 bits (65), Expect = 2.5
 Identities = 22/89 (24%), Positives = 46/89 (51%), Gaps = 5/89 (5%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEE---E 222
                E  + E +E  +   E+ ++ +Q+   + ++E   K+ + E+E++  +NEE    
Sbjct: 80  AQAKLERARVEAEERRKTLQEQTQQEQQR--AQYQDELARKRYQKELEQQRRQNEELLKM 137

Query: 223 EEEEVVKQKKNEEEEEEEEVLIKLEEEEE 251
           +EE V++Q+      EEE + ++ E  EE
Sbjct: 138 QEESVLRQEAMRRATEEEILEMRRETIEE 166



 Score = 27.4 bits (61), Expect = 6.4
 Identities = 21/85 (24%), Positives = 45/85 (52%), Gaps = 5/85 (5%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEE--KKEEDEIEKEEGENEEEEEEEVVK 229
           E+ ++E +  + ++E   +  QK+ +++  + EE  K +E+ + ++E      EEE +  
Sbjct: 100 EQTQQEQQRAQYQDELARKRYQKELEQQRRQNEELLKMQEESVLRQEAMRRATEEEILEM 159

Query: 230 QKKNEEEE---EEEEVLIKLEEEEE 251
           +++  EEE   E E +  K+E E  
Sbjct: 160 RRETIEEEAELERENIRAKIEAEAR 184


>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 142

 Score = 31.9 bits (73), Expect = 0.12
 Identities = 22/82 (26%), Positives = 48/82 (58%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           +E + EE + + EE + ++E K  K + + +++++KK++ +  K+  + EE+E  +  ++
Sbjct: 61  KETEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSEE 120

Query: 231 KKNEEEEEEEEVLIKLEEEEEN 252
             +EEEE EE+   +  E  E 
Sbjct: 121 SSDEEEEGEEDKQEEPVEIMEK 142



 Score = 27.7 bits (62), Expect = 3.9
 Identities = 22/91 (24%), Positives = 49/91 (53%), Gaps = 9/91 (9%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEE---------EKKEEDEIEKEEGENEE 221
           E E+ E  + + ++  E+EE +QK+ +++ ++EE         +KK++ + +K++ +   
Sbjct: 48  EYERLELMDEKWKKETEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGN 107

Query: 222 EEEEEVVKQKKNEEEEEEEEVLIKLEEEEEN 252
           ++EE+   +   E  +EEEE     +EE   
Sbjct: 108 KKEEKEGSKSSEESSDEEEEGEEDKQEEPVE 138


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 33.5 bits (76), Expect = 0.12
 Identities = 14/42 (33%), Positives = 31/42 (73%), Gaps = 1/42 (2%)

Query: 174 KKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKE-EDEIEK 214
           KK E+ +E+ + E E++ ++++ +E+E+E+E ++E E E E+
Sbjct: 579 KKREEAVEKAKREAEQKAREEREREKEKEKEREREREREAER 620



 Score = 32.7 bits (74), Expect = 0.17
 Identities = 15/40 (37%), Positives = 27/40 (67%)

Query: 197 KEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEE 236
           K EE  E+ K+E ++  +EE E E+E+E+E  ++++ E E
Sbjct: 580 KREEAVEKAKREAEQKAREEREREKEKEKEREREREREAE 619



 Score = 31.6 bits (71), Expect = 0.45
 Identities = 17/49 (34%), Positives = 25/49 (51%)

Query: 188 EEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEE 236
           EE V++ K + E++  EE++ E E EKE     E E E   K   +  E
Sbjct: 582 EEAVEKAKREAEQKAREEREREKEKEKEREREREREAERAAKASSSSHE 630



 Score = 31.2 bits (70), Expect = 0.62
 Identities = 14/35 (40%), Positives = 23/35 (65%)

Query: 206 KKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEE 240
           KK E+ +EK + E E++  EE  ++K+ E+E E E
Sbjct: 579 KKREEAVEKAKREAEQKAREEREREKEKEKERERE 613



 Score = 30.4 bits (68), Expect = 1.0
 Identities = 13/36 (36%), Positives = 23/36 (63%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEE 204
           A E+ K+E ++   EE E E+E ++++ +E E E E
Sbjct: 584 AVEKAKREAEQKAREEREREKEKEKEREREREREAE 619



 Score = 30.0 bits (67), Expect = 1.3
 Identities = 17/45 (37%), Positives = 26/45 (57%), Gaps = 1/45 (2%)

Query: 194 KKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEE 238
           K  K+ EE  E+ K E E +K   E E E+E+E  ++++ E E E
Sbjct: 576 KLAKKREEAVEKAKREAE-QKAREEREREKEKEKEREREREREAE 619



 Score = 29.3 bits (65), Expect = 2.3
 Identities = 13/38 (34%), Positives = 23/38 (60%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEE 209
           EE  E+ + E E+   EE  ++K+ ++E E E E++ E
Sbjct: 582 EEAVEKAKREAEQKAREEREREKEKEKEREREREREAE 619



 Score = 28.1 bits (62), Expect = 5.3
 Identities = 16/51 (31%), Positives = 27/51 (52%), Gaps = 3/51 (5%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEE 221
           E  +K + E E++  EE E   ++K KE+E E E ++E +   K    + E
Sbjct: 583 EAVEKAKREAEQKAREERE---REKEKEKEREREREREAERAAKASSSSHE 630



 Score = 28.1 bits (62), Expect = 5.5
 Identities = 13/38 (34%), Positives = 23/38 (60%)

Query: 187 EEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEE 224
           EE   K K+  E++  EE ++E+++ ++ E E E E E
Sbjct: 582 EEAVEKAKREAEQKAREEREREKEKEKEREREREREAE 619



 Score = 27.7 bits (61), Expect = 8.2
 Identities = 13/38 (34%), Positives = 21/38 (55%)

Query: 204 EEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEE 241
           EE  E+ + E E+   EE E E+  ++++  E E E E
Sbjct: 582 EEAVEKAKREAEQKAREEREREKEKEKEREREREREAE 619


>gnl|CDD|223266 COG0188, GyrA, Type IIA topoisomerase (DNA gyrase/topo II,
           topoisomerase IV), A subunit [DNA replication,
           recombination, and repair].
          Length = 804

 Score = 33.4 bits (77), Expect = 0.13
 Identities = 26/78 (33%), Positives = 43/78 (55%), Gaps = 4/78 (5%)

Query: 176 EEDEIEEEENEEEEEVKQKK---NKEEEEEEEEKKEEDEIEKEEG-ENEEEEEEEVVKQK 231
           EE++IE+E  E E+E+   +     EE   +  KKE  EI+K+ G E   E  EE   + 
Sbjct: 435 EEEKIEKELKELEKEIADLEKILASEERLLDIIKKELLEIKKKFGDERRTEIVEEEEDEI 494

Query: 232 KNEEEEEEEEVLIKLEEE 249
           ++E+   EE+V++ L  +
Sbjct: 495 EDEDLIAEEDVVVTLSHK 512


>gnl|CDD|114629 pfam05917, DUF874, Helicobacter pylori protein of unknown function
           (DUF874).  This family consists of several hypothetical
           proteins specific to Helicobacter pylori. The function
           of this family is unknown.
          Length = 417

 Score = 32.9 bits (74), Expect = 0.13
 Identities = 21/75 (28%), Positives = 43/75 (57%)

Query: 167 NIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEE 226
           N +G E ++EE + E+E+ + E+E  +  N + + E+E++K E E +K E E ++     
Sbjct: 145 NKSGIELEQEEQKTEQEKQKTEKEGIELANSQIKAEQEKQKTEQEKQKTEQEKQKTSNIA 204

Query: 227 VVKQKKNEEEEEEEE 241
                + E+E+++ E
Sbjct: 205 NKNAIELEQEKQKTE 219



 Score = 31.8 bits (71), Expect = 0.32
 Identities = 22/86 (25%), Positives = 50/86 (58%), Gaps = 5/86 (5%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDE-----IEKEEGENEEEEEE 225
            + KKE +   +  N+   E++Q++ K E+E+++ +KE  E     I+ E+ + + E+E+
Sbjct: 131 AQAKKEAENARDRANKSGIELEQEEQKTEQEKQKTEKEGIELANSQIKAEQEKQKTEQEK 190

Query: 226 EVVKQKKNEEEEEEEEVLIKLEEEEE 251
           +  +Q+K +      +  I+LE+E++
Sbjct: 191 QKTEQEKQKTSNIANKNAIELEQEKQ 216


>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
          Length = 1437

 Score = 33.3 bits (77), Expect = 0.13
 Identities = 15/82 (18%), Positives = 40/82 (48%), Gaps = 2/82 (2%)

Query: 173 EKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKK 232
              + D   ++  EE E+ + +K +E+E+  +E  E  +  + E + + +  +      +
Sbjct: 157 GILKIDFEIDDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPVQ 216

Query: 233 NEEEEEEEEV--LIKLEEEEEN 252
             ++ ++EE+  + ++ EEE  
Sbjct: 217 IGKKIDKEEITPMKEINEEERR 238



 Score = 33.3 bits (77), Expect = 0.14
 Identities = 17/72 (23%), Positives = 38/72 (52%), Gaps = 3/72 (4%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGE---NEEEEEEEV 227
           + +++ E    ++E E+E+  K+     ++ E E+KK+    + +EG     ++ ++EE+
Sbjct: 167 DSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPVQIGKKIDKEEI 226

Query: 228 VKQKKNEEEEEE 239
              K+  EEE  
Sbjct: 227 TPMKEINEEERR 238



 Score = 28.3 bits (64), Expect = 5.5
 Identities = 12/70 (17%), Positives = 32/70 (45%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQK 231
             + E D ++++   +  +  +K      + + E  +  E  ++    +EEE+E++ K+ 
Sbjct: 131 NNEIERDHLKKKHLPKLIKQYEKFGFGILKIDFEIDDSKEELEKFEAQKEEEDEKLAKEA 190

Query: 232 KNEEEEEEEE 241
               ++ E E
Sbjct: 191 LEAMKKLEAE 200


>gnl|CDD|233044 TIGR00600, rad2, DNA excision repair protein (rad2).  All proteins
           in this family for which functions are known are flap
           endonucleases that generate the 3' incision next to DNA
           damage as part of nucleotide excision repair. This
           family is related to many other flap endonuclease
           families including the fen1 family. This family is based
           on the phylogenomic analysis of JA Eisen (1999, Ph.D.
           Thesis, Stanford University) [DNA metabolism, DNA
           replication, recombination, and repair].
          Length = 1034

 Score = 33.3 bits (76), Expect = 0.13
 Identities = 19/80 (23%), Positives = 38/80 (47%), Gaps = 3/80 (3%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEE---E 222
             +A  E +  ++  +   +E EE+      +EE++ ++ K E  +I  EE E  E    
Sbjct: 690 NKVASIEGEHRKEIEDLLFDESEEDNIVGMIEEEKDADDFKNEWQDISLEELEALEANLL 749

Query: 223 EEEEVVKQKKNEEEEEEEEV 242
            E+  +K +K +++    EV
Sbjct: 750 AEQNSLKAQKQQQKRIAAEV 769



 Score = 32.6 bits (74), Expect = 0.20
 Identities = 16/79 (20%), Positives = 33/79 (41%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           EE  + +    E E+ +E E       EE+      +EE + +  + E ++   EE+   
Sbjct: 685 EENAENKVASIEGEHRKEIEDLLFDESEEDNIVGMIEEEKDADDFKNEWQDISLEELEAL 744

Query: 231 KKNEEEEEEEEVLIKLEEE 249
           + N   E+      K +++
Sbjct: 745 EANLLAEQNSLKAQKQQQK 763



 Score = 32.2 bits (73), Expect = 0.25
 Identities = 21/85 (24%), Positives = 33/85 (38%), Gaps = 5/85 (5%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVK- 229
            E +        +E+EE  E K    + E  +E E    DE E++      EEE++    
Sbjct: 670 LELQVPSKSQPTDESEENAENKVASIEGEHRKEIEDLLFDESEEDNIVGMIEEEKDADDF 729

Query: 230 ----QKKNEEEEEEEEVLIKLEEEE 250
               Q  + EE E  E  +  E+  
Sbjct: 730 KNEWQDISLEELEALEANLLAEQNS 754



 Score = 29.1 bits (65), Expect = 2.7
 Identities = 19/80 (23%), Positives = 40/80 (50%), Gaps = 6/80 (7%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQK 231
           EE + +    E ++       Q  +K +  +E E+  E+++   EGE+ +E E+      
Sbjct: 652 EESESDGSFIEVDSVSSTLELQVPSKSQPTDESEENAENKVASIEGEHRKEIED------ 705

Query: 232 KNEEEEEEEEVLIKLEEEEE 251
              +E EE+ ++  +EEE++
Sbjct: 706 LLFDESEEDNIVGMIEEEKD 725


>gnl|CDD|215590 PLN03123, PLN03123, poly [ADP-ribose] polymerase; Provisional.
          Length = 981

 Score = 33.2 bits (76), Expect = 0.15
 Identities = 19/97 (19%), Positives = 46/97 (47%), Gaps = 2/97 (2%)

Query: 148 KCILHSAPSYSPEIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKK 207
           KC L  +PS   E    W+ ++  +++     +++  +E +EE  +++ +E ++  + KK
Sbjct: 151 KCFLEMSPSTPVEKLSGWDTLSDSDQEAVLPLVKKSPSEAKEEKAEERKQESKKGAKRKK 210

Query: 208 EEDEIEKEEGENEEEEEEE--VVKQKKNEEEEEEEEV 242
           +    +K +    + +        QKK+ + E + E 
Sbjct: 211 DASGDDKSKKAKTDRDVSTSTAASQKKSSDLESKLEA 247


>gnl|CDD|237007 PRK11873, arsM, arsenite S-adenosylmethyltransferase; Reviewed.
          Length = 272

 Score = 32.6 bits (75), Expect = 0.15
 Identities = 26/91 (28%), Positives = 37/91 (40%), Gaps = 22/91 (24%)

Query: 42  CDNPNIFA----GKTVLDVGTGTGKSILLQGHGVDHHSFCTKVHPLDHYAPQYLIL---- 93
           C NP   A    G+TVLD+G+G G    L    V       KV  +D   P+ L      
Sbjct: 66  CGNPTALAELKPGETVLDLGSGGGFDCFLAARRVGPTG---KVIGVD-MTPEMLAKARAN 121

Query: 94  ---------EVIQNKIENVELPEK-VDIIVS 114
                    E    +IE + + +  VD+I+S
Sbjct: 122 ARKAGYTNVEFRLGEIEALPVADNSVDVIIS 152


>gnl|CDD|222171 pfam13489, Methyltransf_23, Methyltransferase domain.  This family
           appears to be a methyltransferase domain.
          Length = 154

 Score = 32.1 bits (73), Expect = 0.15
 Identities = 28/124 (22%), Positives = 43/124 (34%), Gaps = 20/124 (16%)

Query: 45  PNIFAGKTVLDVGTGTGK-SILLQGHGVDHHSFCTKVHPLDHYAPQYLILEVIQNKIENV 103
           P +  G  VLD+G GTG    LL+  G D          +  ++        +       
Sbjct: 18  PRLKPGGRVLDIGCGTGILLRLLRERGFDVTGVDPSPAAVLIFSLFDAPDPAVLAG---- 73

Query: 104 ELPEKVDIIVSEWMGFYLLHESMIDSVIFARD--KFLKPEGVMYPYKCILHSAPSYSPEI 161
               K D+I +    F +L E + D     +   + LKP GV+     I          +
Sbjct: 74  ----KYDLITA----FEVL-EHLPDPPALLQQLRELLKPGGVLL----ISTPLADDDARL 120

Query: 162 FKFW 165
           F  W
Sbjct: 121 FANW 124


>gnl|CDD|177447 PHA02664, PHA02664, hypothetical protein; Provisional.
          Length = 534

 Score = 33.1 bits (75), Expect = 0.15
 Identities = 20/73 (27%), Positives = 35/73 (47%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           +E+  +E E  +E  + E++     +      E+E    D+   +E ++  E ++  V Q
Sbjct: 453 DEDDMDEQESGDERADGEDDSDSSYSYSTTSSEDESDSADDSWGDESDSGIEHDDGGVGQ 512

Query: 231 KKNEEEEEEEEVL 243
              EEEEEE  VL
Sbjct: 513 AIEEEEEEERAVL 525


>gnl|CDD|235033 PRK02363, PRK02363, DNA-directed RNA polymerase subunit delta;
           Reviewed.
          Length = 129

 Score = 31.5 bits (72), Expect = 0.15
 Identities = 18/63 (28%), Positives = 35/63 (55%), Gaps = 7/63 (11%)

Query: 178 DEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEE 237
           DEI+EE    EE+  +KK K  + +++   ++   + +  E + +EE++       E+EE
Sbjct: 74  DEIDEEIIPLEEKFDKKKKKFMDGDDDIIDDDILPDDDFDEEDLDEEDD-------EDEE 126

Query: 238 EEE 240
           +EE
Sbjct: 127 DEE 129


>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
           partitioning / Cytoskeleton].
          Length = 373

 Score = 32.7 bits (75), Expect = 0.16
 Identities = 20/87 (22%), Positives = 41/87 (47%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           EN+  E  + E+    +   E   +   +    EEE E +KK  ++I ++E   EE E+ 
Sbjct: 280 ENLLYENYRTEKLSGLKNSGEPSLKEIHEARLNEEERELKKKFTEKIREKEKRLEELEQN 339

Query: 226 EVVKQKKNEEEEEEEEVLIKLEEEEEN 252
            + ++K+   + EE +  ++  E+   
Sbjct: 340 LIEERKELNSKLEEIQKKLEDLEKRLE 366



 Score = 30.8 bits (70), Expect = 0.65
 Identities = 23/80 (28%), Positives = 41/80 (51%), Gaps = 2/80 (2%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
              +GE   KE  E     NEEE E+K+K  ++  E+E+  +E ++   EE +    + E
Sbjct: 295 LKNSGEPSLKEIHE--ARLNEEERELKKKFTEKIREKEKRLEELEQNLIEERKELNSKLE 352

Query: 226 EVVKQKKNEEEEEEEEVLIK 245
           E+ K+ ++ E+  E+    K
Sbjct: 353 EIQKKLEDLEKRLEKLKSNK 372



 Score = 28.4 bits (64), Expect = 4.1
 Identities = 14/50 (28%), Positives = 27/50 (54%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENE 220
           E+ +++E  +EE E    EE K+  +K EE +++ +  E  +EK +    
Sbjct: 324 EKIREKEKRLEELEQNLIEERKELNSKLEEIQKKLEDLEKRLEKLKSNKS 373


>gnl|CDD|218517 pfam05236, TAF4, Transcription initiation factor TFIID component
           TAF4 family.  This region of similarity is found in
           Transcription initiation factor TFIID component TAF4.
          Length = 255

 Score = 32.4 bits (74), Expect = 0.16
 Identities = 16/53 (30%), Positives = 28/53 (52%), Gaps = 5/53 (9%)

Query: 192 KQKKNKEE-EEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVL 243
           +Q +   + ++EEEE++ E   E       E+ E+  +KQK  EE++ E E  
Sbjct: 113 RQLRFLAQKQKEEEERRVERRREL----GLEDPEQLRLKQKAKEEQKAESEET 161



 Score = 32.0 bits (73), Expect = 0.25
 Identities = 14/36 (38%), Positives = 22/36 (61%), Gaps = 1/36 (2%)

Query: 171 EEEKKEEDEIEEE-ENEEEEEVKQKKNKEEEEEEEE 205
           EEE++ E   E   E+ E+  +KQK  +E++ E EE
Sbjct: 125 EEERRVERRRELGLEDPEQLRLKQKAKEEQKAESEE 160



 Score = 27.7 bits (62), Expect = 5.4
 Identities = 11/44 (25%), Positives = 23/44 (52%), Gaps = 3/44 (6%)

Query: 183 EENEEEEEVKQKKNKEEEEEEEEKKEEDEIE-KEEGENEEEEEE 225
            + ++EEE  +++ +   E   E  E+  ++ K + E + E EE
Sbjct: 119 AQKQKEEE--ERRVERRRELGLEDPEQLRLKQKAKEEQKAESEE 160


>gnl|CDD|215656 pfam00012, HSP70, Hsp70 protein.  Hsp70 chaperones help to fold
           many proteins. Hsp70 assisted folding involves repeated
           cycles of substrate binding and release. Hsp70 activity
           is ATP dependent. Hsp70 proteins are made up of two
           regions: the amino terminus is the ATPase domain and the
           carboxyl terminus is the substrate binding region.
          Length = 598

 Score = 32.6 bits (75), Expect = 0.17
 Identities = 23/80 (28%), Positives = 43/80 (53%), Gaps = 2/80 (2%)

Query: 171 EEEK--KEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
           E E+  K+ +E   E+ + +E ++ K   EE     EK  ++E +K    ++++ EE + 
Sbjct: 505 EIERMVKDAEEYAAEDKKRKERIEAKNEAEEYVYSLEKSLKEEGDKLPEADKKKVEEAIE 564

Query: 229 KQKKNEEEEEEEEVLIKLEE 248
             K+  E E++EE+  K EE
Sbjct: 565 WLKEELEGEDKEEIEAKTEE 584



 Score = 31.4 bits (72), Expect = 0.42
 Identities = 21/76 (27%), Positives = 32/76 (42%), Gaps = 4/76 (5%)

Query: 180 IEEEENEEEEEVKQKKNKEE-EEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEE 238
            E++              E   ++ EE   ED+  KE  E + E EE V   +K+ +EE 
Sbjct: 489 KEQKITITASSGLSDDEIERMVKDAEEYAAEDKKRKERIEAKNEAEEYVYSLEKSLKEEG 548

Query: 239 E---EEVLIKLEEEEE 251
           +   E    K+EE  E
Sbjct: 549 DKLPEADKKKVEEAIE 564



 Score = 30.3 bits (69), Expect = 1.1
 Identities = 17/78 (21%), Positives = 39/78 (50%), Gaps = 3/78 (3%)

Query: 174 KKEEDEIEEEE--NEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQK 231
           K+++  I      +++E E   K  +E   E++++KE  E  K E E      E+ +K++
Sbjct: 489 KEQKITITASSGLSDDEIERMVKDAEEYAAEDKKRKERIEA-KNEAEEYVYSLEKSLKEE 547

Query: 232 KNEEEEEEEEVLIKLEEE 249
            ++  E +++ + +  E 
Sbjct: 548 GDKLPEADKKKVEEAIEW 565



 Score = 29.5 bits (67), Expect = 1.6
 Identities = 10/55 (18%), Positives = 27/55 (49%)

Query: 198 EEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEEN 252
             +++   K+++  I    G +++E E  V   ++   E+++ +  I+ + E E 
Sbjct: 481 SAKDKGTGKEQKITITASSGLSDDEIERMVKDAEEYAAEDKKRKERIEAKNEAEE 535


>gnl|CDD|222613 pfam14235, DUF4337, Domain of unknown function (DUF4337).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 187 and 201 amino
           acids in length. There is a single completely conserved
           residue Q that may be functionally important.
          Length = 158

 Score = 31.8 bits (73), Expect = 0.19
 Identities = 12/50 (24%), Positives = 23/50 (46%)

Query: 176 EEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
             D +E E      E++ K  + ++E+   + E  E+E +  E E E + 
Sbjct: 57  AADLLELEAAAPRAELQAKIARYKKEKARYRSEAKELEAKAKEAEAESDH 106


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 32.7 bits (75), Expect = 0.19
 Identities = 20/83 (24%), Positives = 45/83 (54%), Gaps = 1/83 (1%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE-EVVK 229
           EE  KE  E  +E  +  E++++ K +  E + E  + ++E+++   E  +       ++
Sbjct: 374 EEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIE 433

Query: 230 QKKNEEEEEEEEVLIKLEEEEEN 252
            K NE EEE+E+  ++++++E  
Sbjct: 434 AKINELEEEKEDKALEIKKQEWK 456



 Score = 32.3 bits (74), Expect = 0.25
 Identities = 23/93 (24%), Positives = 44/93 (47%), Gaps = 4/93 (4%)

Query: 160 EIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGEN 219
           E     E    E E+K      E+E   E+E+++ + +  + +E+ K  E EI  E    
Sbjct: 805 EEVSRIEARLREIEQKLNRLTLEKE-YLEKEIQELQEQRIDLKEQIKSIEKEI--ENLNG 861

Query: 220 EEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEEN 252
           ++EE EE +++ +    + E   L  L++E + 
Sbjct: 862 KKEELEEELEELEAALRDLESR-LGDLKKERDE 893



 Score = 32.0 bits (73), Expect = 0.33
 Identities = 20/69 (28%), Positives = 36/69 (52%)

Query: 170 GEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVK 229
             E +++ +E+E +  ++ + + + K K E  EEE  + ED   ++E   EEE   E V+
Sbjct: 898 LRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIPEEELSLEDVQ 957

Query: 230 QKKNEEEEE 238
            +    EEE
Sbjct: 958 AELQRVEEE 966



 Score = 31.2 bits (71), Expect = 0.52
 Identities = 21/88 (23%), Positives = 48/88 (54%), Gaps = 3/88 (3%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKK-EEDEIEKEEGENEEEEE 224
           + I   +E++ + + + +  E+E E    K +E EEE EE +    ++E   G+ ++E +
Sbjct: 833 KEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERD 892

Query: 225 EEVVKQKKNEEEEEEEEVLIKLEEEEEN 252
           E  ++ +  E E + EE+  ++E++ + 
Sbjct: 893 E--LEAQLRELERKIEELEAQIEKKRKR 918



 Score = 30.8 bits (70), Expect = 0.71
 Identities = 27/88 (30%), Positives = 45/88 (51%), Gaps = 3/88 (3%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEE-KKEEDEIEKEEGENEEEEE 224
           E I   E++ E    ++EE EEE E  +   ++ E    + KKE DE+E +  E E + E
Sbjct: 847 EQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIE 906

Query: 225 EEVVKQKKNEEEEEEEEVLIKLEEEEEN 252
           E  ++ +  ++ +   E+  KLE  EE 
Sbjct: 907 E--LEAQIEKKRKRLSELKAKLEALEEE 932



 Score = 30.0 bits (68), Expect = 1.6
 Identities = 25/78 (32%), Positives = 45/78 (57%), Gaps = 2/78 (2%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKN-KEEEEEEEEKKEEDEIEKEEGENEEEEE 224
           EN+ G++E+ EE E+EE E    +   +  + K+E +E E +  E E + EE E + E++
Sbjct: 857 ENLNGKKEELEE-ELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKK 915

Query: 225 EEVVKQKKNEEEEEEEEV 242
            + + + K + E  EEE+
Sbjct: 916 RKRLSELKAKLEALEEEL 933



 Score = 29.7 bits (67), Expect = 2.0
 Identities = 18/80 (22%), Positives = 40/80 (50%), Gaps = 6/80 (7%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
            E +KE +++E+EE + +E +++ +      E+E +  + E+++ E   EE EE      
Sbjct: 719 GEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEE------ 772

Query: 231 KKNEEEEEEEEVLIKLEEEE 250
             ++ EE   ++  +L    
Sbjct: 773 DLHKLEEALNDLEARLSHSR 792



 Score = 29.7 bits (67), Expect = 2.1
 Identities = 28/78 (35%), Positives = 42/78 (53%), Gaps = 1/78 (1%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEE-DEIEKEEGENEEEEEEEVVKQ 230
           E +  E E + EE E + E K+K+  E + + E  +EE  EIE  +GE+EE  EEE+  +
Sbjct: 895 EAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIPEEELSLE 954

Query: 231 KKNEEEEEEEEVLIKLEE 248
               E +  EE +  LE 
Sbjct: 955 DVQAELQRVEEEIRALEP 972



 Score = 29.3 bits (66), Expect = 2.5
 Identities = 16/83 (19%), Positives = 37/83 (44%), Gaps = 2/83 (2%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEI--EKEEGENEEEEEEEVV 228
           E+  K E+ + + E         +   E  + EEE    +    E E+  N    E+E +
Sbjct: 772 EDLHKLEEALNDLEARLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYL 831

Query: 229 KQKKNEEEEEEEEVLIKLEEEEE 251
           +++  E +E+  ++  +++  E+
Sbjct: 832 EKEIQELQEQRIDLKEQIKSIEK 854



 Score = 28.9 bits (65), Expect = 2.8
 Identities = 22/71 (30%), Positives = 36/71 (50%), Gaps = 1/71 (1%)

Query: 172 EEKKEEDEIEEEENEEEE-EVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           E++  E EI+E + +  + + + K  ++E E    KKEE E E EE E    + E  +  
Sbjct: 827 EKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGD 886

Query: 231 KKNEEEEEEEE 241
            K E +E E +
Sbjct: 887 LKKERDELEAQ 897



 Score = 28.5 bits (64), Expect = 4.6
 Identities = 20/83 (24%), Positives = 33/83 (39%), Gaps = 4/83 (4%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIE----KEEGENEEEEEEE 226
           E  K+E   ++ E    E  + +   +  +   +  + E EIE    +EE   E  EE E
Sbjct: 684 EGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELE 743

Query: 227 VVKQKKNEEEEEEEEVLIKLEEE 249
                  +E E  +  L +LE  
Sbjct: 744 EDLSSLEQEIENVKSELKELEAR 766



 Score = 28.5 bits (64), Expect = 4.7
 Identities = 27/103 (26%), Positives = 45/103 (43%), Gaps = 11/103 (10%)

Query: 160 EIFKFWENIAGEEEKKEE---------DEIEEEENEEEEEVKQKKNKEEEEEEEEKKEED 210
           EI K  E +  EEEK +E           +E+E    + E+K+ + + EE EE+  K E+
Sbjct: 720 EIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEE 779

Query: 211 EIEKEEGENEEEEEEEVVKQKKNEEEE--EEEEVLIKLEEEEE 251
            +   E         E+  +    EEE    E  L ++E++  
Sbjct: 780 ALNDLEARLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLN 822



 Score = 28.1 bits (63), Expect = 5.6
 Identities = 21/86 (24%), Positives = 44/86 (51%), Gaps = 4/86 (4%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGE----NEEEEEEE 226
              +   DE+ +E ++   ++ + + + E+ E+EE+K ++ +E+ E +     +E E  +
Sbjct: 698 RRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVK 757

Query: 227 VVKQKKNEEEEEEEEVLIKLEEEEEN 252
              ++     EE EE L KLEE   +
Sbjct: 758 SELKELEARIEELEEDLHKLEEALND 783


>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3.  This family of
           proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). SH3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of SH3, AAPs are retained in the ER.
          Length = 196

 Score = 31.9 bits (73), Expect = 0.19
 Identities = 14/33 (42%), Positives = 15/33 (45%)

Query: 177 EDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEE 209
           +D    EE   EE       KEEE  E EKKE 
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKES 192



 Score = 28.5 bits (64), Expect = 2.4
 Identities = 14/39 (35%), Positives = 19/39 (48%), Gaps = 2/39 (5%)

Query: 173 EKKEEDEIEEEENEEEEEVKQKKNKEEEEEE--EEKKEE 209
           E K+   +EE   EE E     K +E  E E  E KK++
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196



 Score = 28.5 bits (64), Expect = 3.0
 Identities = 11/37 (29%), Positives = 22/37 (59%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKE 208
           ++ K  +E   EE E     K++++ E E++E +KK+
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196



 Score = 26.9 bits (60), Expect = 8.6
 Identities = 15/43 (34%), Positives = 20/43 (46%), Gaps = 4/43 (9%)

Query: 190 EVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKK 232
           E K  K  EE   EE +      E+E  E E++E     K+KK
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKES----KKKK 196


>gnl|CDD|191675 pfam07047, OPA3, Optic atrophy 3 protein (OPA3).  This family
           consists of several optic atrophy 3 (OPA3) proteins.
           OPA3 deficiency causes type III 3-methylglutaconic
           aciduria (MGA) in humans. This disease manifests with
           early bilateral optic atrophy, spasticity,
           extrapyramidal dysfunction, ataxia, and cognitive
           deficits, but normal longevity.
          Length = 134

 Score = 31.5 bits (72), Expect = 0.19
 Identities = 14/38 (36%), Positives = 24/38 (63%)

Query: 190 EVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEV 227
           E +++  KE ++EEE ++E +E+E   GE E E E + 
Sbjct: 95  EYQRQSRKEAKKEEELQQELEELEARVGELELEIERQK 132



 Score = 27.2 bits (61), Expect = 5.0
 Identities = 12/48 (25%), Positives = 23/48 (47%), Gaps = 6/48 (12%)

Query: 161 IFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKE 208
           ++++      E +K      EEE  +E EE++ +  + E E E +K  
Sbjct: 93  VYEYQRQSRKEAKK------EEELQQELEELEARVGELELEIERQKAR 134


>gnl|CDD|236335 PRK08724, fliD, flagellar capping protein; Validated.
          Length = 673

 Score = 32.5 bits (74), Expect = 0.19
 Identities = 13/51 (25%), Positives = 27/51 (52%)

Query: 198 EEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEE 248
           E E++  ++K + +   E+GE   E+ +++ + K   EE E  E + K + 
Sbjct: 367 EIEQKLAQEKAQLDAAVEKGELTPEQAKQIARAKLEPEERERLEKIDKAQA 417



 Score = 29.1 bits (65), Expect = 2.4
 Identities = 18/73 (24%), Positives = 32/73 (43%), Gaps = 11/73 (15%)

Query: 165 WENIAG-------EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEG 217
           W N A           K+ + EIE++  +E+ ++     K E   E+ K    +I + + 
Sbjct: 346 WSNTASGTLTDSYVTPKEAQAEIEQKLAQEKAQLDAAVEKGELTPEQAK----QIARAKL 401

Query: 218 ENEEEEEEEVVKQ 230
           E EE E  E + +
Sbjct: 402 EPEERERLEKIDK 414


>gnl|CDD|225962 COG3428, COG3428, Predicted membrane protein [Function unknown].
          Length = 494

 Score = 32.4 bits (74), Expect = 0.20
 Identities = 16/66 (24%), Positives = 21/66 (31%), Gaps = 4/66 (6%)

Query: 169 AGEEEKKE----EDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEE 224
           AG   + E    E EI  E      E + KK    E  E +  EE+  E           
Sbjct: 119 AGGGGEAEAELVEREIAFELAALVREARVKKLDALELAEADTPEEEVAEVLARSQSSVLR 178

Query: 225 EEVVKQ 230
             + K 
Sbjct: 179 YPMNKG 184


>gnl|CDD|191249 pfam05279, Asp-B-Hydro_N, Aspartyl beta-hydroxylase N-terminal
           region.  This family includes the N-terminal regions of
           the junctin, junctate and aspartyl beta-hydroxylase
           proteins. Junctate is an integral ER/SR membrane calcium
           binding protein, which comes from an alternatively
           spliced form of the same gene that generates aspartyl
           beta-hydroxylase and junctin. Aspartyl beta-hydroxylase
           catalyzes the post-translational hydroxylation of
           aspartic acid or asparagine residues contained within
           epidermal growth factor (EGF) domains of proteins.
          Length = 240

 Score = 32.2 bits (73), Expect = 0.21
 Identities = 16/72 (22%), Positives = 36/72 (50%)

Query: 170 GEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVK 229
            + ++K E+E++E+     E++   K +E+   +E + +ED+    E  ++ +E  E  K
Sbjct: 95  RKTKQKVEEEVKEQLQSLLEKIVVSKQEEDGPGKEPQLDEDKFLLAEDSDDRQETLEAGK 154

Query: 230 QKKNEEEEEEEE 241
             +  E+    E
Sbjct: 155 VHEETEDSYHVE 166



 Score = 30.7 bits (69), Expect = 0.54
 Identities = 20/78 (25%), Positives = 39/78 (50%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
            E  K  +E E+  + EE   +Q K   +E+  E++ E+ +   E+ E  + E ++V ++
Sbjct: 150 LEAGKVHEETEDSYHVEETASEQYKQDMKEKASEQENEDSKEPVEKAERTKAETDDVTEE 209

Query: 231 KKNEEEEEEEEVLIKLEE 248
             +EE+   E+     EE
Sbjct: 210 DYDEEDNPVEDSKAIKEE 227



 Score = 30.3 bits (68), Expect = 0.81
 Identities = 16/78 (20%), Positives = 43/78 (55%)

Query: 160 EIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGEN 219
           + +   E  + + ++  +++  E+ENE+ +E  +K  + + E ++  +E+ + E    E+
Sbjct: 161 DSYHVEETASEQYKQDMKEKASEQENEDSKEPVEKAERTKAETDDVTEEDYDEEDNPVED 220

Query: 220 EEEEEEEVVKQKKNEEEE 237
            +  +EE+ K+   E++E
Sbjct: 221 SKAIKEELAKEPVEEQQE 238



 Score = 29.1 bits (65), Expect = 2.0
 Identities = 21/79 (26%), Positives = 38/79 (48%), Gaps = 3/79 (3%)

Query: 176 EEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEI---EKEEGENEEEEEEEVVKQKK 232
           EE E   EE  +    K K+  EEE +E+ +   ++I   ++EE    +E + +  K   
Sbjct: 80  EEAEPHAEEEGQLAVRKTKQKVEEEVKEQLQSLLEKIVVSKQEEDGPGKEPQLDEDKFLL 139

Query: 233 NEEEEEEEEVLIKLEEEEE 251
            E+ ++ +E L   +  EE
Sbjct: 140 AEDSDDRQETLEAGKVHEE 158



 Score = 28.7 bits (64), Expect = 2.9
 Identities = 18/83 (21%), Positives = 41/83 (49%), Gaps = 3/83 (3%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
             E+    E+   E+  ++ +E   K +++E E+ +E  E+ E  K E ++  EE+ +  
Sbjct: 158 ETEDSYHVEETASEQYKQDMKE---KASEQENEDSKEPVEKAERTKAETDDVTEEDYDEE 214

Query: 229 KQKKNEEEEEEEEVLIKLEEEEE 251
                + +  +EE+  +  EE++
Sbjct: 215 DNPVEDSKAIKEELAKEPVEEQQ 237



 Score = 26.8 bits (59), Expect = 9.4
 Identities = 16/77 (20%), Positives = 33/77 (42%), Gaps = 6/77 (7%)

Query: 171 EEEKKEEDEIEEEENE------EEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEE 224
           E+ +   ++I   + E      E +  + K    E+ ++ ++  E     EE E+    E
Sbjct: 107 EQLQSLLEKIVVSKQEEDGPGKEPQLDEDKFLLAEDSDDRQETLEAGKVHEETEDSYHVE 166

Query: 225 EEVVKQKKNEEEEEEEE 241
           E   +Q K + +E+  E
Sbjct: 167 ETASEQYKQDMKEKASE 183



 Score = 26.8 bits (59), Expect = 9.5
 Identities = 18/85 (21%), Positives = 43/85 (50%), Gaps = 3/85 (3%)

Query: 171 EEEKKEEDEIEEEE---NEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEV 227
           EEE KE+ +   E+   +++EE+   K+ + +E++    ++ D+ ++     +  EE E 
Sbjct: 102 EEEVKEQLQSLLEKIVVSKQEEDGPGKEPQLDEDKFLLAEDSDDRQETLEAGKVHEETED 161

Query: 228 VKQKKNEEEEEEEEVLIKLEEEEEN 252
               +    E+ ++ + +   E+EN
Sbjct: 162 SYHVEETASEQYKQDMKEKASEQEN 186


>gnl|CDD|218954 pfam06246, Isy1, Isy1-like splicing family.  Isy1 protein is
           important in the optimisation of splicing.
          Length = 253

 Score = 31.9 bits (73), Expect = 0.21
 Identities = 18/70 (25%), Positives = 36/70 (51%), Gaps = 10/70 (14%)

Query: 175 KEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGEN----------EEEEE 224
           + E E E+E  E+  E  ++   E+ + EE+++EED     + +           +EE E
Sbjct: 179 EYEAEREKELIEKLVEEWKEIQDEDLQAEEDEREEDIYIPLDDDAEFVAHVPVPTQEEVE 238

Query: 225 EEVVKQKKNE 234
           E +++++K E
Sbjct: 239 EWLLEKRKRE 248



 Score = 29.2 bits (66), Expect = 1.8
 Identities = 23/85 (27%), Positives = 39/85 (45%), Gaps = 13/85 (15%)

Query: 179 EIEEEENEEEEEVKQKKNKE-------------EEEEEEEKKEEDEIEKEEGENEEEEEE 225
           E+ E+E E+ +E  +K   E             +EE+    + E E EKE  E   EE +
Sbjct: 138 ELFEKEPEDRKEKPRKSRAELYKRIDADYYGYRDEEDGVLLEYEAEREKELIEKLVEEWK 197

Query: 226 EVVKQKKNEEEEEEEEVLIKLEEEE 250
           E+  +    EE+E EE +    +++
Sbjct: 198 EIQDEDLQAEEDEREEDIYIPLDDD 222


>gnl|CDD|237378 PRK13406, bchD, magnesium chelatase subunit D; Provisional.
          Length = 584

 Score = 32.3 bits (74), Expect = 0.22
 Identities = 11/48 (22%), Positives = 19/48 (39%), Gaps = 6/48 (12%)

Query: 181 EEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
              +  EEE              E+  +  E E+E+ + E+   EE+V
Sbjct: 260 APPQPPEEEP------PPPPPPPEDDDDPPEDEEEQDDAEDRALEEIV 301



 Score = 30.4 bits (69), Expect = 1.1
 Identities = 8/41 (19%), Positives = 16/41 (39%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEI 212
              +  +E         E+       EEE+++ E +  +EI
Sbjct: 260 APPQPPEEEPPPPPPPPEDDDDPPEDEEEQDDAEDRALEEI 300


>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein.  The proteins in this family
           are designated YL1. These proteins have been shown to be
           DNA-binding and may be a transcription factor.
          Length = 238

 Score = 32.0 bits (73), Expect = 0.22
 Identities = 18/68 (26%), Positives = 40/68 (58%), Gaps = 1/68 (1%)

Query: 168 IAGEEEKKEEDEIEEEENEEEEEVK-QKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEE 226
           +  E E+ E         EEE++ + + + +EEEEE +   ++ E ++ E ++EEE E+E
Sbjct: 18  LEEELEEDEFFWTYLLFEEEEDDEEFEIEEEEEEEEVDSDFDDSEDDEPESDDEEEGEKE 77

Query: 227 VVKQKKNE 234
           + ++++ +
Sbjct: 78  LQREERLK 85



 Score = 28.9 bits (65), Expect = 2.2
 Identities = 27/70 (38%), Positives = 38/70 (54%), Gaps = 2/70 (2%)

Query: 163 KFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEE 222
            FW  +  EEE  E+DE  E E EEEEE       + E++E E  +E+E EKE    E  
Sbjct: 27  FFWTYLLFEEE--EDDEEFEIEEEEEEEEVDSDFDDSEDDEPESDDEEEGEKELQREERL 84

Query: 223 EEEEVVKQKK 232
           ++++ VK K 
Sbjct: 85  KKKKRVKTKA 94


>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein.  This entry is a highly
           conserved protein present in eukaryotes.
          Length = 680

 Score = 32.6 bits (74), Expect = 0.22
 Identities = 23/92 (25%), Positives = 46/92 (50%), Gaps = 3/92 (3%)

Query: 163 KFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKE---EGEN 219
           K    ++ +++ K+  +  E+  + E + +    K+  EE++ KKEE+E       +   
Sbjct: 463 KLNSMVSAKQKDKQSMQSMEKRLKSEADSRVNAEKQLAEEKKRKKEEEETAARAAAQAAA 522

Query: 220 EEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEE 251
             EE  E +KQ K + E E +++   L+ +EE
Sbjct: 523 SREECAESLKQAKQDLEMEIKKLEHDLKLKEE 554



 Score = 29.5 bits (66), Expect = 1.9
 Identities = 22/69 (31%), Positives = 35/69 (50%), Gaps = 1/69 (1%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENE-EEEEEEVVK 229
           EEE       +   + EE     K+ K++ E E +K E D   KEE     E+E +E+ K
Sbjct: 509 EEETAARAAAQAAASREECAESLKQAKQDLEMEIKKLEHDLKLKEEECRMLEKEAQELRK 568

Query: 230 QKKNEEEEE 238
            +++E+E E
Sbjct: 569 YQESEKETE 577


>gnl|CDD|221408 pfam12072, DUF3552, Domain of unknown function (DUF3552).  This
           presumed domain is functionally uncharacterized. This
           domain is found in bacteria, archaea and eukaryotes.
           This domain is about 200 amino acids in length. This
           domain is found associated with pfam00013, pfam01966.
           This domain has a single completely conserved residue A
           that may be functionally important.
          Length = 201

 Score = 31.7 bits (73), Expect = 0.22
 Identities = 28/82 (34%), Positives = 52/82 (63%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
           A EE  K   E E E  E   E+++++ +  ++EE   ++++ +EK+E   EE+E+E   
Sbjct: 58  AKEEIHKLRAEAERELKERRNELQRQEKRLLQKEETLDRKDESLEKKEESLEEKEKELAA 117

Query: 229 KQKKNEEEEEEEEVLIKLEEEE 250
           +Q++ EE+EEE E LI+ +++E
Sbjct: 118 RQQQLEEKEEELEELIEEQQQE 139



 Score = 27.1 bits (61), Expect = 6.6
 Identities = 26/77 (33%), Positives = 47/77 (61%), Gaps = 1/77 (1%)

Query: 174 KKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKN 233
           +K+E+ +EE+E E     +Q + KEEE EE  ++++ E+E+  G   EE +E ++++ + 
Sbjct: 102 EKKEESLEEKEKELAARQQQLEEKEEELEELIEEQQQELERISGLTAEEAKEILLEEVE- 160

Query: 234 EEEEEEEEVLIKLEEEE 250
            E + E   +IK  EEE
Sbjct: 161 AELKHEAAKMIKEIEEE 177


>gnl|CDD|233692 TIGR02031, BchD-ChlD, magnesium chelatase ATPase subunit D.  This
           model represents one of two ATPase subunits of the
           trimeric magnesium chelatase responsible for insertion
           of magnesium ion into protoporphyrin IX. This is an
           essential step in the biosynthesis of both chlorophyll
           and bacteriochlorophyll. This subunit is found in green
           plants, photosynthetic algae, cyanobacteria and other
           photosynthetic bacteria. Unlike subunit I (TIGR02030),
           this subunit is not found in archaea [Biosynthesis of
           cofactors, prosthetic groups, and carriers, Chlorophyll
           and bacteriochlorphyll].
          Length = 589

 Score = 32.5 bits (74), Expect = 0.23
 Identities = 14/76 (18%), Positives = 28/76 (36%)

Query: 173 EKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKK 232
             + E   E+ +   E  +  +  +  E E +        E  E E E +E ++      
Sbjct: 241 HGRTEVTEEDLKLAVELVLLPRATRLPEPEPQPPPPPPPPEPPEPEEEPDEPDQTDPDDG 300

Query: 233 NEEEEEEEEVLIKLEE 248
            E ++  EE++    E
Sbjct: 301 EETDQIPEELMFDAVE 316


>gnl|CDD|237047 PRK12298, obgE, GTPase CgtA; Reviewed.
          Length = 390

 Score = 32.1 bits (74), Expect = 0.23
 Identities = 16/54 (29%), Positives = 30/54 (55%)

Query: 175 KEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
           +E    E EE E  E+V+   +    E+ EE +EED+ + ++  +E+++E   V
Sbjct: 333 EENPREEAEEAEAPEKVEFMWDDYHREQLEEVEEEDDDDWDDDWDEDDDEGVEV 386



 Score = 31.0 bits (71), Expect = 0.58
 Identities = 15/65 (23%), Positives = 35/65 (53%), Gaps = 8/65 (12%)

Query: 178 DEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEE 237
             IEE   EE EE +  +  E   ++  +++ +E+E+E+ ++ +++         +E+++
Sbjct: 330 TFIEENPREEAEEAEAPEKVEFMWDDYHREQLEEVEEEDDDDWDDDW--------DEDDD 381

Query: 238 EEEEV 242
           E  EV
Sbjct: 382 EGVEV 386


>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
           Transcription of the anti-viral guanylate-binding
           protein (GBP) is induced by interferon-gamma during
           macrophage induction. This family contains GBP1 and
           GPB2, both GTPases capable of binding GTP, GDP and GMP.
          Length = 297

 Score = 31.9 bits (73), Expect = 0.24
 Identities = 15/52 (28%), Positives = 31/52 (59%)

Query: 201 EEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEEN 252
           E E  K E  E E+E    +++EEE++++ ++   +E  ++++ K+E E E 
Sbjct: 207 EAERAKAEAAEAEQELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEAEREK 258



 Score = 29.9 bits (68), Expect = 1.0
 Identities = 22/71 (30%), Positives = 35/71 (49%), Gaps = 1/71 (1%)

Query: 168 IAGEEEKKEEDEIE-EEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEE 226
           I  E  K E  E E E   E+++E +Q    +E   +E  K+  E  + E E    E+E 
Sbjct: 206 IEAERAKAEAAEAEQELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEAEREKLLAEQER 265

Query: 227 VVKQKKNEEEE 237
           +++ K  E+EE
Sbjct: 266 MLEHKLQEQEE 276



 Score = 29.2 bits (66), Expect = 1.9
 Identities = 22/76 (28%), Positives = 42/76 (55%), Gaps = 3/76 (3%)

Query: 177 EDEIEEE-ENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEE 235
           E  IE E    E  E +Q+  +E+++EEE+  E  E   +  E+ ++  E++  +++   
Sbjct: 203 EKAIEAERAKAEAAEAEQELLREKQKEEEQMMEAQERSYQ--EHVKQLIEKMEAEREKLL 260

Query: 236 EEEEEEVLIKLEEEEE 251
            E+E  +  KL+E+EE
Sbjct: 261 AEQERMLEHKLQEQEE 276



 Score = 28.4 bits (64), Expect = 3.3
 Identities = 23/75 (30%), Positives = 43/75 (57%), Gaps = 5/75 (6%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           E++K+EE  +E +E   +E VKQ   K E E E+   E++ + +     + +E+EE++K+
Sbjct: 225 EKQKEEEQMMEAQERSYQEHVKQLIEKMEAEREKLLAEQERMLEH----KLQEQEELLKE 280

Query: 231 K-KNEEEEEEEEVLI 244
             K E E  ++E+  
Sbjct: 281 GFKTEAESLQKEIQD 295



 Score = 27.6 bits (62), Expect = 6.1
 Identities = 18/76 (23%), Positives = 36/76 (47%), Gaps = 6/76 (7%)

Query: 183 EENEEEEEVKQKKNKEEE---EEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEE-- 237
              E+  E ++ K +  E   E   EK++E+E   E  E   +E  + + +K   E E  
Sbjct: 200 TAKEKAIEAERAKAEAAEAEQELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEAEREKL 259

Query: 238 -EEEEVLIKLEEEEEN 252
             E+E +++ + +E+ 
Sbjct: 260 LAEQERMLEHKLQEQE 275


>gnl|CDD|215521 PLN02967, PLN02967, kinase.
          Length = 581

 Score = 32.3 bits (73), Expect = 0.25
 Identities = 17/55 (30%), Positives = 28/55 (50%)

Query: 186 EEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEE 240
            + EE K +K   +  + ++  E+ E +  E E  + EE E V   +NE EEE +
Sbjct: 126 SDVEEEKTEKKVRKRRKVKKMDEDVEDQGSESEVSDVEESEFVTSLENESEEELD 180



 Score = 31.2 bits (70), Expect = 0.60
 Identities = 16/62 (25%), Positives = 35/62 (56%)

Query: 179 EIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEE 238
           ++EEE+ E++   ++K  K +E+ E++  E +  + EE E     E E  ++   E+++ 
Sbjct: 127 DVEEEKTEKKVRKRRKVKKMDEDVEDQGSESEVSDVEESEFVTSLENESEEELDLEKDDG 186

Query: 239 EE 240
           E+
Sbjct: 187 ED 188



 Score = 30.8 bits (69), Expect = 0.85
 Identities = 21/81 (25%), Positives = 36/81 (44%), Gaps = 7/81 (8%)

Query: 170 GEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVK 229
               +K      + E E+ E+  +K+ K        KK ++++E +  E+E  + EE   
Sbjct: 115 RRTRRKAAAASSDVEEEKTEKKVRKRRKV-------KKMDEDVEDQGSESEVSDVEESEF 167

Query: 230 QKKNEEEEEEEEVLIKLEEEE 250
               E E EEE  L K + E+
Sbjct: 168 VTSLENESEEELDLEKDDGED 188



 Score = 27.7 bits (61), Expect = 6.4
 Identities = 16/68 (23%), Positives = 27/68 (39%)

Query: 165 WENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEE 224
           +   AG  +K E     +EE +E   V +KK     +   +K   +  E  E  +E    
Sbjct: 42  FLLAAGSRKKIESALAVDEEPDENGAVSKKKPTRSVKRATKKTVVEISEPLEEGSELVVN 101

Query: 225 EEVVKQKK 232
           E+    K+
Sbjct: 102 EDAALDKE 109


>gnl|CDD|184539 PRK14150, PRK14150, heat shock protein GrpE; Provisional.
          Length = 193

 Score = 31.4 bits (72), Expect = 0.26
 Identities = 18/69 (26%), Positives = 37/69 (53%), Gaps = 3/69 (4%)

Query: 187 EEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV---KQKKNEEEEEEEEVL 243
             EE K  + +  EE E E ++++E ++ E E+E +E +  +   + +  E + EE + +
Sbjct: 2   SNEEQKTPQEQVSEEIEAEVEQQEEADEAELEDELDEADARIAELEAQLAEAQAEERDSV 61

Query: 244 IKLEEEEEN 252
           ++   E EN
Sbjct: 62  LRARAEVEN 70


>gnl|CDD|234045 TIGR02876, spore_yqfD, sporulation protein YqfD.  YqfD is part of
           the sigma-E regulon in the sporulation program of
           endospore-forming Gram-positive bacteria. Mutation
           results in a sporulation defect in Bacillus subtilis.
           Members are found in all currently known
           endospore-forming bacteria, including the genera
           Bacillus, Symbiobacterium, Carboxydothermus,
           Clostridium, and Thermoanaerobacter [Cellular processes,
           Sporulation and germination].
          Length = 382

 Score = 31.9 bits (73), Expect = 0.26
 Identities = 21/68 (30%), Positives = 34/68 (50%), Gaps = 5/68 (7%)

Query: 179 EIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEE 238
           EI +E   E +E  +K  KEE  E+ ++K E E++KE   N      ++V  K   E  E
Sbjct: 310 EIVKETYYEVKEKVEKVTKEEAIEKAKEKAEKELKKELDPN-----AKIVSDKILSERVE 364

Query: 239 EEEVLIKL 246
             +V + +
Sbjct: 365 GGKVKVTV 372


>gnl|CDD|217783 pfam03896, TRAP_alpha, Translocon-associated protein (TRAP), alpha
           subunit.  The alpha-subunit of the TRAP complex (TRAP
           alpha) is a single-spanning membrane protein of the
           endoplasmic reticulum (ER) which is found in proximity
           of nascent polypeptide chains translocating across the
           membrane.
          Length = 281

 Score = 31.7 bits (72), Expect = 0.27
 Identities = 22/54 (40%), Positives = 30/54 (55%), Gaps = 10/54 (18%)

Query: 176 EEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVK 229
            +D  E+EE E++         +E+EE+E   EEDE E  E   EEE+EE  VK
Sbjct: 31  AQDLTEDEEAEDDVV-------DEDEEDEAVVEEDENELTE---EEEDEEGEVK 74



 Score = 30.9 bits (70), Expect = 0.52
 Identities = 16/40 (40%), Positives = 29/40 (72%), Gaps = 1/40 (2%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKE 208
             E+E+ E+D ++E+E E+E  V++ +N+  EEEE+E+ E
Sbjct: 34  LTEDEEAEDDVVDEDE-EDEAVVEEDENELTEEEEDEEGE 72



 Score = 30.9 bits (70), Expect = 0.56
 Identities = 14/43 (32%), Positives = 26/43 (60%)

Query: 162 FKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEE 204
           F   +++  +EE +++   E+EE+E   E  + +  EEEE+EE
Sbjct: 28  FASAQDLTEDEEAEDDVVDEDEEDEAVVEEDENELTEEEEDEE 70



 Score = 30.5 bits (69), Expect = 0.72
 Identities = 15/45 (33%), Positives = 28/45 (62%), Gaps = 2/45 (4%)

Query: 198 EEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEV 242
            ++  E+E+ E+D ++++E +    EE+E   +   EEE+EE EV
Sbjct: 31  AQDLTEDEEAEDDVVDEDEEDEAVVEEDE--NELTEEEEDEEGEV 73



 Score = 27.8 bits (62), Expect = 5.9
 Identities = 17/50 (34%), Positives = 28/50 (56%), Gaps = 6/50 (12%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGE 218
           A  ++  E++E E++  +E+EE       E   EE+E +  +E E EEGE
Sbjct: 29  ASAQDLTEDEEAEDDVVDEDEE------DEAVVEEDENELTEEEEDEEGE 72



 Score = 27.0 bits (60), Expect = 9.2
 Identities = 15/44 (34%), Positives = 24/44 (54%), Gaps = 4/44 (9%)

Query: 208 EEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEE 251
            +D  E EE E++  +E+E    +     EE+E  L + EE+EE
Sbjct: 31  AQDLTEDEEAEDDVVDEDE----EDEAVVEEDENELTEEEEDEE 70


>gnl|CDD|237867 PRK14953, PRK14953, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 486

 Score = 32.1 bits (73), Expect = 0.27
 Identities = 26/124 (20%), Positives = 46/124 (37%), Gaps = 2/124 (1%)

Query: 120 YLLHESMIDSVIFARDKFLKPEGVMYPYKCILHSAPSYSPEIFKFWENIAGEEEKKEEDE 179
            +L    +  ++   +   +        +           +I K    I   E K     
Sbjct: 347 AILKLLYVKDIVPISELLKEGSKQETKEQPEKKEEEKEELDIDKIILQIIKNEGKIISAI 406

Query: 180 IEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEE 239
           ++  E +EEE   +   K E+ EE+    E +  K+     E EE +  K+K+ E  E  
Sbjct: 407 LKNAEIKEEEG--KITIKVEKSEEDTLDLEIKSIKKYFPFIEFEEVKKEKEKEKERPEFV 464

Query: 240 EEVL 243
           E+VL
Sbjct: 465 EKVL 468



 Score = 30.9 bits (70), Expect = 0.55
 Identities = 22/64 (34%), Positives = 40/64 (62%), Gaps = 1/64 (1%)

Query: 189 EEVKQKKNKEEEEEEEEKKEEDEIEKEEGE-NEEEEEEEVVKQKKNEEEEEEEEVLIKLE 247
           +E  +++ KE+ E++EE+KEE +I+K   +  + E +      K  E +EEE ++ IK+E
Sbjct: 365 KEGSKQETKEQPEKKEEEKEELDIDKIILQIIKNEGKIISAILKNAEIKEEEGKITIKVE 424

Query: 248 EEEE 251
           + EE
Sbjct: 425 KSEE 428


>gnl|CDD|225136 COG2226, UbiE, Methylase involved in ubiquinone/menaquinone
          biosynthesis [Coenzyme metabolism].
          Length = 238

 Score = 31.5 bits (72), Expect = 0.27
 Identities = 13/31 (41%), Positives = 15/31 (48%)

Query: 36 AYRAAICDNPNIFAGKTVLDVGTGTGKSILL 66
           +R A+     I  G  VLDV  GTG   LL
Sbjct: 38 LWRRALISLLGIKPGDKVLDVACGTGDMALL 68


>gnl|CDD|218896 pfam06098, Radial_spoke_3, Radial spoke protein 3.  This family
           consists of several radial spoke protein 3 (RSP3)
           sequences. Eukaryotic cilia and flagella present in
           diverse types of cells perform motile, sensory, and
           developmental functions in organisms from protists to
           humans. They are centred by precisely organised,
           microtubule-based structures, the axonemes. The axoneme
           consists of two central singlet microtubules, called the
           central pair, and nine outer doublet microtubules. These
           structures are well-conserved during evolution. The
           outer doublet microtubules, each composed of A and B
           sub-fibres, are connected to each other by nexin links,
           while the central pair is held at the centre of the
           axoneme by radial spokes. The radial spokes are T-shaped
           structures extending from the A-tubule of each outer
           doublet microtubule to the centre of the axoneme. Radial
           spoke protein 3 (RSP3), is present at the proximal end
           of the spoke stalk and helps in anchoring the radial
           spoke to the outer doublet. It is thought that radial
           spokes regulate the activity of inner arm dynein through
           protein phosphorylation and dephosphorylation.
          Length = 288

 Score = 31.9 bits (73), Expect = 0.28
 Identities = 17/60 (28%), Positives = 32/60 (53%)

Query: 180 IEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEE 239
           +E  E EE  E++Q++ + E+    E  E   +E+ E    EE+E    + K+ ++ E+E
Sbjct: 146 LEVLEEEELAELRQQQRQFEQRRNAELAETQRLEEAERRRREEKERRKKQDKERKQREKE 205


>gnl|CDD|219405 pfam07418, PCEMA1, Acidic phosphoprotein precursor PCEMA1.  This
           family consists of several acidic phosphoprotein
           precursor PCEMA1 sequences which appear to be found
           exclusively in Plasmodium chabaudi. PCEMA1 is an antigen
           that is associated with the membrane of the infected
           erythrocyte throughout the entire intraerythrocytic
           cycle. The exact function of this family is unclear.
          Length = 286

 Score = 31.8 bits (72), Expect = 0.28
 Identities = 19/69 (27%), Positives = 32/69 (46%)

Query: 173 EKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKK 232
               + E EE ++E E EV       E + E ++K   E  + +G+     E E+  + +
Sbjct: 208 SYFNDGENEENDDELEAEVISYLKDGENDNEVKEKIRREYREWKGDKANTNETEIEDESE 267

Query: 233 NEEEEEEEE 241
           +E EEE  E
Sbjct: 268 DEYEEEAGE 276



 Score = 30.3 bits (68), Expect = 0.90
 Identities = 21/73 (28%), Positives = 37/73 (50%), Gaps = 3/73 (4%)

Query: 172 EEKKEEDEIEEE---ENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
           E ++ +DE+E E     ++ E   + K K   E  E K ++    + E E+E E+E E  
Sbjct: 214 ENEENDDELEAEVISYLKDGENDNEVKEKIRREYREWKGDKANTNETEIEDESEDEYEEE 273

Query: 229 KQKKNEEEEEEEE 241
             ++ E E++ EE
Sbjct: 274 AGEEQENEDKGEE 286



 Score = 28.3 bits (63), Expect = 3.7
 Identities = 21/85 (24%), Positives = 39/85 (45%), Gaps = 2/85 (2%)

Query: 157 YSPEIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEE 216
              E+  ++ +   EE   E +       ++ E   + K K   E  E K ++    + E
Sbjct: 202 LDDEVTSYFNDGENEENDDELEAEVISYLKDGENDNEVKEKIRREYREWKGDKANTNETE 261

Query: 217 GENEE--EEEEEVVKQKKNEEEEEE 239
            E+E   E EEE  ++++NE++ EE
Sbjct: 262 IEDESEDEYEEEAGEEQENEDKGEE 286



 Score = 27.6 bits (61), Expect = 5.8
 Identities = 19/70 (27%), Positives = 36/70 (51%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           +E     ++ E EEN++E E +     ++ E + E KE+   E  E + ++    E   +
Sbjct: 204 DEVTSYFNDGENEENDDELEAEVISYLKDGENDNEVKEKIRREYREWKGDKANTNETEIE 263

Query: 231 KKNEEEEEEE 240
            ++E+E EEE
Sbjct: 264 DESEDEYEEE 273


>gnl|CDD|237799 PRK14715, PRK14715, DNA polymerase II large subunit; Provisional.
          Length = 1627

 Score = 32.1 bits (73), Expect = 0.30
 Identities = 17/49 (34%), Positives = 28/49 (57%)

Query: 165 WENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIE 213
           W+ +   +EKKEE + E+ E  + EEV ++  +EE+    E  E+  IE
Sbjct: 273 WDWLKELKEKKEEKDEEKSEEVKTEEVDEEFEEEEKGFYYELYEKVNIE 321



 Score = 29.0 bits (65), Expect = 3.0
 Identities = 13/34 (38%), Positives = 23/34 (67%)

Query: 189 EEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEE 222
           +E+K+KK +++EE+ EE K E+  E+ E E +  
Sbjct: 277 KELKEKKEEKDEEKSEEVKTEEVDEEFEEEEKGF 310



 Score = 28.7 bits (64), Expect = 4.1
 Identities = 16/36 (44%), Positives = 25/36 (69%), Gaps = 1/36 (2%)

Query: 191 VKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEE 226
           +K+ K K+EE ++EEK EE + E+ + E EEEE+  
Sbjct: 276 LKELKEKKEE-KDEEKSEEVKTEEVDEEFEEEEKGF 310



 Score = 27.5 bits (61), Expect = 7.8
 Identities = 15/42 (35%), Positives = 25/42 (59%)

Query: 194 KKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEE 235
           K+ KEE++EE+ ++ + E   EE E EE+     + +K N E
Sbjct: 280 KEKKEEKDEEKSEEVKTEEVDEEFEEEEKGFYYELYEKVNIE 321



 Score = 27.5 bits (61), Expect = 8.1
 Identities = 14/28 (50%), Positives = 21/28 (75%)

Query: 214 KEEGENEEEEEEEVVKQKKNEEEEEEEE 241
           KE+ E ++EE+ E VK ++ +EE EEEE
Sbjct: 280 KEKKEEKDEEKSEEVKTEEVDEEFEEEE 307


>gnl|CDD|215212 PLN02372, PLN02372, violaxanthin de-epoxidase.
          Length = 455

 Score = 31.7 bits (72), Expect = 0.30
 Identities = 25/88 (28%), Positives = 46/88 (52%), Gaps = 5/88 (5%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENE-EEEE 224
           E +  + E+ E+  ++E    EEE  K+     E+  +EE+     +  EEG  E E++E
Sbjct: 364 ERLEKDVEEGEKTIVKEARQIEEELEKE----VEKLGKEEESLFKRVALEEGLKELEQDE 419

Query: 225 EEVVKQKKNEEEEEEEEVLIKLEEEEEN 252
           E  +K+   EE+E  E++ ++  E E+ 
Sbjct: 420 ENFLKELSKEEKELLEKLKMEASEVEKL 447



 Score = 27.9 bits (62), Expect = 6.7
 Identities = 25/75 (33%), Positives = 39/75 (52%), Gaps = 5/75 (6%)

Query: 148 KCILHSAPSYSPEIFKFWENIAGEEE---KKE--EDEIEEEENEEEEEVKQKKNKEEEEE 202
           K I+  A     E+ K  E +  EEE   K+   E+ ++E E +EE  +K+   +E+E  
Sbjct: 375 KTIVKEARQIEEELEKEVEKLGKEEESLFKRVALEEGLKELEQDEENFLKELSKEEKELL 434

Query: 203 EEEKKEEDEIEKEEG 217
           E+ K E  E+EK  G
Sbjct: 435 EKLKMEASEVEKLFG 449


>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
          Length = 1036

 Score = 32.1 bits (73), Expect = 0.31
 Identities = 17/66 (25%), Positives = 36/66 (54%), Gaps = 6/66 (9%)

Query: 165 WENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEE 224
           +E+   EE+++E +++ +EE E E      +  EE+   EE+K   E ++ + + E E+ 
Sbjct: 246 FEDFLLEEKRRELEKLAKEEAERE------RQAEEQRRREEEKAAMEADRAQAKAEVEKR 299

Query: 225 EEVVKQ 230
            E ++ 
Sbjct: 300 REKLQN 305



 Score = 29.1 bits (65), Expect = 2.7
 Identities = 16/84 (19%), Positives = 37/84 (44%), Gaps = 1/84 (1%)

Query: 170 GEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKE-EDEIEKEEGENEEEEEEEVV 228
            +    +ED      +     V++  N +EE + + +   E  +E+ E E+E E E+   
Sbjct: 28  QQNNGDKEDSSTSTSSLSVSAVEKTSNAKEEIQVDFQHNSESAVEEVEAEDEIEVEQNQS 87

Query: 229 KQKKNEEEEEEEEVLIKLEEEEEN 252
              K+    +EE +   ++  +++
Sbjct: 88  DVLKSSSIVKEESISTDMDGIDDD 111



 Score = 27.5 bits (61), Expect = 8.8
 Identities = 15/39 (38%), Positives = 25/39 (64%), Gaps = 4/39 (10%)

Query: 203 EEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEE 241
           EE+++E +++ KEE E E + EE    Q++ EEE+   E
Sbjct: 252 EEKRRELEKLAKEEAERERQAEE----QRRREEEKAAME 286


>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32.  This family
           consists of several mammalian specific proacrosin
           binding protein sp32 sequences. sp32 is a sperm specific
           protein which is known to bind with with 55- and 53-kDa
           proacrosins and the 49-kDa acrosin intermediate. The
           exact function of sp32 is unclear, it is thought however
           that the binding of sp32 to proacrosin may be involved
           in packaging the acrosin zymogen into the acrosomal
           matrix.
          Length = 243

 Score = 31.6 bits (71), Expect = 0.33
 Identities = 21/88 (23%), Positives = 40/88 (45%), Gaps = 6/88 (6%)

Query: 160 EIFKFWENIAGEEEKKEEDEIEEE------ENEEEEEVKQKKNKEEEEEEEEKKEEDEIE 213
           E     EN + +   +      EE            +VK  K K+E+   + ++   E +
Sbjct: 155 EHPTITENQSFQPWPERLHNNVEELLQSSLSLGGSVQVKAPKPKQEQLLSKLQEYLQEHK 214

Query: 214 KEEGENEEEEEEEVVKQKKNEEEEEEEE 241
            EE + +EE+EEE V+++  +EE +  +
Sbjct: 215 TEEKQPQEEQEEEEVEEEAKQEEGQGTD 242



 Score = 31.6 bits (71), Expect = 0.35
 Identities = 14/46 (30%), Positives = 28/46 (60%)

Query: 170 GEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKE 215
            +E+   + +   +E++ EE+  Q++ +EEE EEE K+EE +   +
Sbjct: 198 KQEQLLSKLQEYLQEHKTEEKQPQEEQEEEEVEEEAKQEEGQGTDD 243



 Score = 30.4 bits (68), Expect = 0.75
 Identities = 22/81 (27%), Positives = 37/81 (45%), Gaps = 4/81 (4%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQ--KKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVK 229
           E +  +   E   N  EE ++         + +  + K+E  + K +   +E + EE  K
Sbjct: 161 ENQSFQPWPERLHNNVEELLQSSLSLGGSVQVKAPKPKQEQLLSKLQEYLQEHKTEE--K 218

Query: 230 QKKNEEEEEEEEVLIKLEEEE 250
           Q + E+EEEE E   K EE +
Sbjct: 219 QPQEEQEEEEVEEEAKQEEGQ 239


>gnl|CDD|224530 COG1614, CdhC, CO dehydrogenase/acetyl-CoA synthase beta subunit
           [Energy production and conversion].
          Length = 470

 Score = 31.7 bits (72), Expect = 0.34
 Identities = 20/53 (37%), Positives = 30/53 (56%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGE 218
           + IA EE+    DE+ E   E+   V ++  +EEEEEEEE++EE    +   E
Sbjct: 372 DKIATEEDATTIDELREFLKEKGHPVVERWAEEEEEEEEEEEEEAAEAEAPME 424



 Score = 27.1 bits (60), Expect = 9.6
 Identities = 17/60 (28%), Positives = 27/60 (45%), Gaps = 6/60 (10%)

Query: 189 EEVKQKKNKEEEEEEEE------KKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEV 242
           E++  K   EE+    +      K++   + +   E EEEEEEE  ++    E   EE V
Sbjct: 368 EDLYDKIATEEDATTIDELREFLKEKGHPVVERWAEEEEEEEEEEEEEAAEAEAPMEEPV 427


>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein. 
          Length = 198

 Score = 31.3 bits (71), Expect = 0.35
 Identities = 16/53 (30%), Positives = 29/53 (54%)

Query: 189 EEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEE 241
            +  +KK+KE+  EEE+K E D+ + EE EN+ +  EE    +++       +
Sbjct: 65  TKKSKKKDKEKLTEEEKKPESDDDKTEENENDPDNNEESGDSQESASANSLSD 117



 Score = 27.4 bits (61), Expect = 5.2
 Identities = 12/52 (23%), Positives = 25/52 (48%), Gaps = 6/52 (11%)

Query: 191 VKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEV 242
            K+ K K++E+  EE+K+ +       ++++ EE E       E  + +E  
Sbjct: 65  TKKSKKKDKEKLTEEEKKPES------DDDKTEENENDPDNNEESGDSQESA 110


>gnl|CDD|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region.  The domain
           is found in the primary vegetative sigma factor. The
           function of this domain is unclear and can be removed
           without loss of function.
          Length = 211

 Score = 31.4 bits (72), Expect = 0.35
 Identities = 10/47 (21%), Positives = 25/47 (53%)

Query: 176 EEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEE 222
                       E E+ ++  +++++++E++ E+DE E + G + EE
Sbjct: 33  NAAAAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEADLGPDPEE 79



 Score = 27.9 bits (63), Expect = 4.8
 Identities = 12/58 (20%), Positives = 28/58 (48%), Gaps = 4/58 (6%)

Query: 168 IAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           I+G  +             E E  +    ++ E+++++ ++EDE ++EE +   + EE
Sbjct: 26  ISGFIDPNAAAAAATAAAIESELDE----EDLEDDDDDDEDEDEDDEEEADLGPDPEE 79



 Score = 27.5 bits (62), Expect = 6.4
 Identities = 14/51 (27%), Positives = 25/51 (49%), Gaps = 7/51 (13%)

Query: 198 EEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEE 248
             E E +E+  ED+ + +E E+E++EEE        +   + EE   +  E
Sbjct: 43  AIESELDEEDLEDDDDDDEDEDEDDEEEA-------DLGPDPEEARERFAE 86



 Score = 26.8 bits (60), Expect = 9.6
 Identities = 12/54 (22%), Positives = 25/54 (46%), Gaps = 7/54 (12%)

Query: 196 NKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEE 249
           N           E +  E++  ++++++E+E       +E++EEE  L    EE
Sbjct: 33  NAAAAAATAAAIESELDEEDLEDDDDDDEDE-------DEDDEEEADLGPDPEE 79


>gnl|CDD|225887 COG3351, FlaD, Putative archaeal flagellar protein D/E [Cell
           motility and secretion].
          Length = 214

 Score = 31.0 bits (70), Expect = 0.38
 Identities = 22/75 (29%), Positives = 40/75 (53%)

Query: 177 EDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEE 236
           E++IE+ E      +++ K K EE   + KK +DE+ +E  E  E+ +E  + +K +  E
Sbjct: 6   EEKIEKAEKVTAFALEELKEKIEELPIQAKKSDDELVEELPERYEQTKENSLIEKVDSIE 65

Query: 237 EEEEEVLIKLEEEEE 251
           EE  E    + E+ +
Sbjct: 66  EEISEKEKVMSEKLK 80



 Score = 28.3 bits (63), Expect = 3.7
 Identities = 23/77 (29%), Positives = 42/77 (54%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQK 231
           +EK EE  I+ +++++E   +  +  E+ +E    ++ D IE+E  E E+   E++ +  
Sbjct: 24  KEKIEELPIQAKKSDDELVEELPERYEQTKENSLIEKVDSIEEEISEKEKVMSEKLKEPA 83

Query: 232 KNEEEEEEEEVLIKLEE 248
           +     EEEE   KLEE
Sbjct: 84  QMSSTSEEEEKKAKLEE 100


>gnl|CDD|200340 TIGR03927, T7SS_EssA_Firm, type VII secretion protein EssA.
           Members of this family are associated with type VII
           secretion of WXG100 family targets in the Firmicutes,
           but not in the Actinobacteria. This highly divergent
           protein family consists largely of a central region of
           highly polar low-complexity sequence containing
           occasional LF motifs in weak repeats about 17 residues
           in length, flanked by hydrophobic N- and C-terminal
           regions [Protein fate, Protein and peptide secretion and
           trafficking].
          Length = 150

 Score = 30.4 bits (69), Expect = 0.43
 Identities = 21/91 (23%), Positives = 38/91 (41%), Gaps = 5/91 (5%)

Query: 167 NIAGEEEKKEEDEIEEEEN-EEEEEVKQKKNKEEEEE--EEEKKEEDEIEKEEGE--NEE 221
            +     +K++ EI  +   EE E  K+    EE+++   ++ KE+ E E+ + +  +E 
Sbjct: 25  QVKPNRYEKKDIEINTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEELKNQLFSEN 84

Query: 222 EEEEEVVKQKKNEEEEEEEEVLIKLEEEEEN 252
             E   VK  K +    E E      E    
Sbjct: 85  ATENNTVKATKKQLFSSEYEQTSSSSESTSE 115



 Score = 28.1 bits (63), Expect = 3.1
 Identities = 16/74 (21%), Positives = 30/74 (40%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
           A  + K +      E+ + E      + + E ++E    EE +    +   E+ E+EE+ 
Sbjct: 18  AENDGKLQVKPNRYEKKDIEINTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEELK 77

Query: 229 KQKKNEEEEEEEEV 242
            Q  +E   E   V
Sbjct: 78  NQLFSENATENNTV 91


>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
          Length = 158

 Score = 30.5 bits (69), Expect = 0.43
 Identities = 16/65 (24%), Positives = 36/65 (55%)

Query: 177 EDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEE 236
            +E E+E+ E  +E ++K+  + ++E++EKKE+   EK   +   +  +   K+   +  
Sbjct: 92  PEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTT 151

Query: 237 EEEEE 241
            ++EE
Sbjct: 152 TKKEE 156



 Score = 30.5 bits (69), Expect = 0.48
 Identities = 20/74 (27%), Positives = 43/74 (58%), Gaps = 4/74 (5%)

Query: 167 NIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEE 226
           N+  EE +KE+    +E ++E EE +  K K+E++E++EKK  +++ K++     +   +
Sbjct: 89  NLFPEESEKEQ----KEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTK 144

Query: 227 VVKQKKNEEEEEEE 240
              +K   ++EE +
Sbjct: 145 KATKKTTTKKEEGK 158



 Score = 27.0 bits (60), Expect = 6.1
 Identities = 13/39 (33%), Positives = 28/39 (71%)

Query: 208 EEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKL 246
            E+  ++++  ++E EE+E +K KK ++E++E++V  KL
Sbjct: 92  PEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKL 130



 Score = 27.0 bits (60), Expect = 6.9
 Identities = 18/67 (26%), Positives = 36/67 (53%)

Query: 184 ENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVL 243
             E E+E K+   + EE+E  + K+E + +KE+   E+  +++  K  KN  ++  ++  
Sbjct: 92  PEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTT 151

Query: 244 IKLEEEE 250
            K EE +
Sbjct: 152 TKKEEGK 158


>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987).  A
           family of uncharacterized proteins found by clustering
           human gut metagenomic sequences.
          Length = 379

 Score = 31.5 bits (72), Expect = 0.44
 Identities = 22/63 (34%), Positives = 35/63 (55%), Gaps = 1/63 (1%)

Query: 178 DEIEEEENEE-EEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEE 236
           +EIEEE  EE EEE+K+ + ++E  E E+K  E + +K   + ++EE       +   EE
Sbjct: 64  EEIEEELREEYEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEELLELEAEE 123

Query: 237 EEE 239
            E 
Sbjct: 124 PEP 126



 Score = 31.1 bits (71), Expect = 0.57
 Identities = 17/63 (26%), Positives = 34/63 (53%)

Query: 180 IEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEE 239
           +EE E E  EE +++  + E E+E  + E+  +EK+  +  ++ ++E    ++  E E E
Sbjct: 63  LEEIEEELREEYEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEELLELEAE 122

Query: 240 EEV 242
           E  
Sbjct: 123 EPE 125



 Score = 27.2 bits (61), Expect = 9.1
 Identities = 24/57 (42%), Positives = 34/57 (59%), Gaps = 4/57 (7%)

Query: 198 EEEEEEEEKKEED--EIEKEEGENEEEEEEEVVKQ--KKNEEEEEEEEVLIKLEEEE 250
           EEE  EE ++E    E EKE  E E++  E+  K+  KK ++EE   E L++LE EE
Sbjct: 67  EEELREEYEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEELLELEAEE 123


>gnl|CDD|221429 pfam12118, SprA-related, SprA-related family.  This protein is
           found in bacteria. Proteins in this family are typically
           between 234 to 465 amino acids in length. There is a
           conserved GEV sequence motif.Most members are annotated
           as being SprA-related.
          Length = 261

 Score = 31.3 bits (71), Expect = 0.44
 Identities = 12/59 (20%), Positives = 32/59 (54%)

Query: 179 EIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEE 237
              +E  ++  E   ++   EEE E+++ E  +   ++ + ++++EE+ V++ K  + E
Sbjct: 64  SASQELQKQASESNDEEVVGEEEPEKDQAEAGKQLSQKQQEQQQQEEKQVRELKARDRE 122



 Score = 29.4 bits (66), Expect = 1.5
 Identities = 12/59 (20%), Positives = 33/59 (55%)

Query: 178 DEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEE 236
              +E + +  E   ++   EEE E+++ +   ++ +++ E +++EE++V + K  + E
Sbjct: 64  SASQELQKQASESNDEEVVGEEEPEKDQAEAGKQLSQKQQEQQQQEEKQVRELKARDRE 122



 Score = 27.8 bits (62), Expect = 5.0
 Identities = 9/49 (18%), Positives = 28/49 (57%)

Query: 197 KEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIK 245
            +E +++  +  ++E+  EE   +++ E      +K +E++++EE  ++
Sbjct: 66  SQELQKQASESNDEEVVGEEEPEKDQAEAGKQLSQKQQEQQQQEEKQVR 114


>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1.  All
           proteins in this family for which functions are known
           are cyclin dependent protein kinases that are components
           of TFIIH, a complex that is involved in nucleotide
           excision repair and transcription initiation. Also known
           as MAT1 (menage a trois 1). This family is based on the
           phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
           Stanford University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 309

 Score = 30.9 bits (70), Expect = 0.51
 Identities = 19/77 (24%), Positives = 41/77 (53%), Gaps = 2/77 (2%)

Query: 160 EIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEE--EEEEEEKKEEDEIEKEEG 217
           E  +  E +  E+E++E+  +  ++ EEE+++ ++KNK+   +E E       E+  +  
Sbjct: 143 EQEELEEALEFEKEEEEQRRLLLQKEEEEQQMNKRKNKQALLDELETSTLPAAELIAQHK 202

Query: 218 ENEEEEEEEVVKQKKNE 234
           +N  + E +V K K  +
Sbjct: 203 KNSVKLEMQVEKPKPEK 219



 Score = 30.5 bits (69), Expect = 0.75
 Identities = 17/81 (20%), Positives = 41/81 (50%), Gaps = 11/81 (13%)

Query: 161 IFKFWENIAGEEEKKEEDEIEEEENE-----------EEEEVKQKKNKEEEEEEEEKKEE 209
           ++    NI  E  KK+ +  ++E  +           E+EE+++    E+EEEE+ +   
Sbjct: 106 VYNLTNNIDLENTKKKIETYQKENKDVIQKNKEKSTREQEELEEALEFEKEEEEQRRLLL 165

Query: 210 DEIEKEEGENEEEEEEEVVKQ 230
            + E+E+  N+ + ++ ++ +
Sbjct: 166 QKEEEEQQMNKRKNKQALLDE 186



 Score = 30.2 bits (68), Expect = 1.1
 Identities = 22/72 (30%), Positives = 44/72 (61%), Gaps = 2/72 (2%)

Query: 180 IEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEE 239
           I+ E  +++ E  QK+NK+  ++ +EK   ++ E EE    E+EEEE  +++   ++EEE
Sbjct: 113 IDLENTKKKIETYQKENKDVIQKNKEKSTREQEELEEALEFEKEEEE--QRRLLLQKEEE 170

Query: 240 EEVLIKLEEEEE 251
           E+ + K + ++ 
Sbjct: 171 EQQMNKRKNKQA 182


>gnl|CDD|220172 pfam09309, FCP1_C, FCP1, C-terminal.  The C-terminal domain of
           FCP-1 is required for interaction with the carboxy
           terminal domain of RAP74. Interaction relies extensively
           on van der Waals contacts between hydrophobic residues
           situated within alpha-helices in both domains.
          Length = 263

 Score = 31.1 bits (70), Expect = 0.51
 Identities = 26/95 (27%), Positives = 41/95 (43%), Gaps = 13/95 (13%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEE----KKEEDEIEKEEGENEEEEEEE 226
           E   KE D+I  E +++ +  K+K    EEE+E+     K++     +E  E      E 
Sbjct: 146 ESMDKEVDDILGEGSDDSDSEKKKPPGNEEEQEQAPQPRKQQPPGPRREPDEGTPSSSER 205

Query: 227 VV---------KQKKNEEEEEEEEVLIKLEEEEEN 252
                      K+K NEE+EE+ E     E   E+
Sbjct: 206 SNIGGRGPRGHKRKLNEEDEEDAESESSFESSNED 240


>gnl|CDD|184724 PRK14520, rpsP, 30S ribosomal protein S16; Provisional.
          Length = 155

 Score = 30.0 bits (68), Expect = 0.55
 Identities = 10/50 (20%), Positives = 14/50 (28%), Gaps = 1/50 (2%)

Query: 177 EDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEE 226
              + E +     E    K K++   E    E      E       EEE 
Sbjct: 107 NAALAEADGGPTAEATTPK-KKKAAAEAAAAEAAAPAAEAAAAAAAEEEA 155


>gnl|CDD|237496 PRK13766, PRK13766, Hef nuclease; Provisional.
          Length = 773

 Score = 31.4 bits (72), Expect = 0.56
 Identities = 26/74 (35%), Positives = 46/74 (62%), Gaps = 2/74 (2%)

Query: 173 EKKEEDEIE--EEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           EKK ++E++  +    ++ +   ++ K EEEE++E+   D+  K +G+ EEEEEE+  K 
Sbjct: 493 EKKMKEELKNLKGILNKKLQELDEEQKGEEEEKDEQLSLDDFVKSKGKEEEEEEEKEEKD 552

Query: 231 KKNEEEEEEEEVLI 244
           K+ EE+E E   +I
Sbjct: 553 KETEEDEPEGPKII 566



 Score = 27.9 bits (63), Expect = 5.8
 Identities = 19/65 (29%), Positives = 37/65 (56%), Gaps = 4/65 (6%)

Query: 192 KQKKNKEE----EEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLE 247
           K+KK KEE    +    +K +E + E++  E E++E+  +    K++ +EEEEE   + +
Sbjct: 492 KEKKMKEELKNLKGILNKKLQELDEEQKGEEEEKDEQLSLDDFVKSKGKEEEEEEEKEEK 551

Query: 248 EEEEN 252
           ++E  
Sbjct: 552 DKETE 556


>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa).  Members of this
           family of proteins are part of the yeast nuclear pore
           complex-associated pre-60S ribosomal subunit. The family
           functions as a highly conserved exonuclease that is
           required for the 5'-end maturation of 5.8S and 25S
           rRNAs, demonstrating that 5'-end processing also has a
           redundant pathway. Nop25 binds late pre-60S ribosomes,
           accompanying them from the nucleolus to the nuclear
           periphery; and there is evidence for both physical and
           functional links between late 60S subunit processing and
           export.
          Length = 134

 Score = 30.0 bits (68), Expect = 0.57
 Identities = 21/80 (26%), Positives = 40/80 (50%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           EE K+  +E ++E  ++ +E K+     EEE ++E+  E E  ++  ++E E   E    
Sbjct: 52  EERKRIREERKQELEKQLKERKEALKLLEEENDDEEDAETEDTEDVEDDEWEGFPEPTVT 111

Query: 231 KKNEEEEEEEEVLIKLEEEE 250
              EE  +E++    +  EE
Sbjct: 112 DYEEEYIDEDKYKTTVTVEE 131


>gnl|CDD|233668 TIGR01983, UbiG, ubiquinone biosynthesis O-methyltransferase.  This
           model represents an O-methyltransferase believed to act
           at two points in the ubiquinone biosynthetic pathway in
           bacteria (UbiG) and fungi (COQ3). A separate methylase
           (MenG/UbiE) catalyzes the single C-methylation step. The
           most commonly used names for genes in this family do not
           indicate whether this gene is an O-methyl, or C-methyl
           transferase [Biosynthesis of cofactors, prosthetic
           groups, and carriers, Menaquinone and ubiquinone].
          Length = 224

 Score = 30.7 bits (70), Expect = 0.58
 Identities = 29/122 (23%), Positives = 49/122 (40%), Gaps = 11/122 (9%)

Query: 32  VRNEAYRAAICDNPNIFAGKTVLDVGTGTG---KSILLQG---HGVDHHSFCTKVHPLDH 85
           +R +  R  I  N     G  VLDVG G G   + +   G    G+D      +V  L  
Sbjct: 28  LRLDYIRDTIRKNKKPLFGLRVLDVGCGGGLLSEPLARLGANVTGIDASEENIEVAKLHA 87

Query: 86  YAPQYLILEVIQNKIENV--ELPEKVDIIVSEWMGFYLLHESMIDSVIFARDKFLKPEGV 143
                L +E     +E++  +  +  D++    +   L H     + I A  + LKP G+
Sbjct: 88  KKDPLLKIEYRCTSVEDLAEKGAKSFDVVTCMEV---LEHVPDPQAFIRACAQLLKPGGI 144

Query: 144 MY 145
           ++
Sbjct: 145 LF 146


>gnl|CDD|184536 PRK14145, PRK14145, heat shock protein GrpE; Provisional.
          Length = 196

 Score = 30.3 bits (68), Expect = 0.61
 Identities = 16/55 (29%), Positives = 30/55 (54%)

Query: 197 KEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEE 251
           ++E  +EEEK   +    E+ E   E+E+    Q + +  +E EE+  KL+++E 
Sbjct: 5   EKEINKEEEKDVNNLSSNEQMEGPPEDEQAQQNQPQQQTVDEIEELKQKLQQKEV 59



 Score = 30.3 bits (68), Expect = 0.62
 Identities = 21/66 (31%), Positives = 39/66 (59%), Gaps = 2/66 (3%)

Query: 178 DEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEE 237
           +E+E+E N+EEE  K   N    E+ E   E+++ ++ + + +  +E E +KQK  ++E 
Sbjct: 2   EEVEKEINKEEE--KDVNNLSSNEQMEGPPEDEQAQQNQPQQQTVDEIEELKQKLQQKEV 59

Query: 238 EEEEVL 243
           E +E L
Sbjct: 60  EAQEYL 65


>gnl|CDD|219358 pfam07271, Cytadhesin_P30, Cytadhesin P30/P32.  This family
           consists of several Mycoplasma species specific
           Cytadhesin P32 and P30 proteins. P30 has been found to
           be membrane associated and localised on the tip
           organelle. It is thought that it is important in
           cytadherence and virulence.
          Length = 279

 Score = 30.8 bits (69), Expect = 0.62
 Identities = 9/46 (19%), Positives = 25/46 (54%)

Query: 197 KEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEV 242
           ++E+   EEK+ ++++ ++     E+ E++ ++    EE   +E  
Sbjct: 97  RKEKRLLEEKERQEQMAEQLQRISEQNEQQAIEIDPTEEVNTQEPT 142


>gnl|CDD|225177 COG2268, COG2268, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 548

 Score = 31.0 bits (70), Expect = 0.62
 Identities = 16/83 (19%), Positives = 33/83 (39%), Gaps = 2/83 (2%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
           A    + E+ EI  E+  +EE+ + +  +E +  +  +  E  +   E + E E E +  
Sbjct: 283 AETRREAEQAEILAEQAIQEEKAQAE--QEVQHAKALEAREMRVGLIERQKETELEPQER 340

Query: 229 KQKKNEEEEEEEEVLIKLEEEEE 251
               N  + + +E         E
Sbjct: 341 SYFINAAQRQAQEEAKAAANIAE 363



 Score = 29.8 bits (67), Expect = 1.5
 Identities = 16/82 (19%), Positives = 30/82 (36%)

Query: 170 GEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVK 229
           G  E+++E E+E +E        Q++ +EE +      E    + E       E EE  +
Sbjct: 325 GLIERQKETELEPQERSYFINAAQRQAQEEAKAAANIAEAIGAQAEAAVETARETEEAER 384

Query: 230 QKKNEEEEEEEEVLIKLEEEEE 251
            ++       E    +  E   
Sbjct: 385 AEQAALVAAAEAAEQEQVEIAV 406



 Score = 29.0 bits (65), Expect = 2.7
 Identities = 16/82 (19%), Positives = 35/82 (42%), Gaps = 1/82 (1%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEK-KEEDEIEKEEGENEEEEEEEVVKQ 230
              + E E+   + E   E +Q +   E+  +EEK + E E++  +     E    ++++
Sbjct: 270 ILAETEAEVAAWKAETRREAEQAEILAEQAIQEEKAQAEQEVQHAKALEAREMRVGLIER 329

Query: 231 KKNEEEEEEEEVLIKLEEEEEN 252
           +K  E E +E        + + 
Sbjct: 330 QKETELEPQERSYFINAAQRQA 351



 Score = 28.3 bits (63), Expect = 4.4
 Identities = 17/85 (20%), Positives = 38/85 (44%)

Query: 167 NIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEE 226
            IA  E +KE +    E N + + V+ +  ++   +  E+  E +I   E E E    + 
Sbjct: 224 EIAENEAEKETEIAIAEANRDAKLVELEVEQQPAGKTAEQTREVKIILAETEAEVAAWKA 283

Query: 227 VVKQKKNEEEEEEEEVLIKLEEEEE 251
             +++  + E   E+ + + + + E
Sbjct: 284 ETRREAEQAEILAEQAIQEEKAQAE 308



 Score = 27.9 bits (62), Expect = 5.5
 Identities = 14/76 (18%), Positives = 28/76 (36%)

Query: 167 NIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEE 226
            +  E++   +   +  E +      + +    + E   + E+ EI  E+   EE+ + E
Sbjct: 249 ELEVEQQPAGKTAEQTREVKIILAETEAEVAAWKAETRREAEQAEILAEQAIQEEKAQAE 308

Query: 227 VVKQKKNEEEEEEEEV 242
              Q     E  E  V
Sbjct: 309 QEVQHAKALEAREMRV 324


>gnl|CDD|227701 COG5414, COG5414, TATA-binding protein-associated factor
           [Transcription].
          Length = 392

 Score = 30.8 bits (69), Expect = 0.63
 Identities = 28/91 (30%), Positives = 41/91 (45%)

Query: 162 FKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEE 221
            +  E ++    KKE+   EEE  E   E        E E +E  + + E ++EE EN E
Sbjct: 246 EEPGEPLSRPALKKEKQGAEEEGEEGMSEEDLDVGAAEIENKEVSEGDKEQQQEEVENAE 305

Query: 222 EEEEEVVKQKKNEEEEEEEEVLIKLEEEEEN 252
             +EEV   + +E  EE+EE     E E   
Sbjct: 306 AHKEEVQSDRPDEIGEEKEEDDENEENERHT 336



 Score = 28.1 bits (62), Expect = 4.9
 Identities = 27/81 (33%), Positives = 44/81 (54%), Gaps = 3/81 (3%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEE-EEEEEV 227
           A E E KE  E ++E+ +EE E  +    +EE + +   E  E ++E+ ENEE E   E+
Sbjct: 281 AAEIENKEVSEGDKEQQQEEVENAE--AHKEEVQSDRPDEIGEEKEEDDENEENERHTEL 338

Query: 228 VKQKKNEEEEEEEEVLIKLEE 248
           +  + NE E+  EE   ++E 
Sbjct: 339 LADELNELEKGIEEKRRQMES 359



 Score = 27.4 bits (60), Expect = 8.2
 Identities = 20/74 (27%), Positives = 43/74 (58%)

Query: 170 GEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVK 229
           G++E+++E+    E ++EE +  +     EE+EE+++ EE+E   E   +E  E E+ ++
Sbjct: 292 GDKEQQQEEVENAEAHKEEVQSDRPDEIGEEKEEDDENEENERHTELLADELNELEKGIE 351

Query: 230 QKKNEEEEEEEEVL 243
           +K+ + E     +L
Sbjct: 352 EKRRQMESATNPIL 365



 Score = 27.4 bits (60), Expect = 8.4
 Identities = 16/60 (26%), Positives = 24/60 (40%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           E    E  K+E      +E  EE+E   +  + E   E    E +E+EK   E   + E 
Sbjct: 300 EVENAEAHKEEVQSDRPDEIGEEKEEDDENEENERHTELLADELNELEKGIEEKRRQMES 359


>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
           eukaryotic snRNP [Transcription].
          Length = 564

 Score = 30.8 bits (70), Expect = 0.64
 Identities = 23/86 (26%), Positives = 49/86 (56%), Gaps = 9/86 (10%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEE------- 224
           +EK E D+I++  +E E++++++  K E + E+++ E +E+EK   E  ++ E       
Sbjct: 272 DEKFERDKIKQLASELEKKLEKELKKLENKLEKQEDELEELEKAAEELRQKGELLYANLQ 331

Query: 225 --EEVVKQKKNEEEEEEEEVLIKLEE 248
             EE +K  +  +    EE+ I+L++
Sbjct: 332 LIEEGLKSVRLADFYGNEEIKIELDK 357



 Score = 29.7 bits (67), Expect = 1.5
 Identities = 17/73 (23%), Positives = 31/73 (42%), Gaps = 3/73 (4%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQK 231
           +  K   + +  E +E     +   K   E+ E KK  +EI +E    EE   +   K++
Sbjct: 376 KGAKVNLDRQLSELKEAIAYYESA-KTALEKAEGKKAIEEIREELI--EEGLLKSKKKKR 432

Query: 232 KNEEEEEEEEVLI 244
           K +E  E+    +
Sbjct: 433 KKKEWFEKFRWFV 445


>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207).  This
           family is found in eukaryotes; it has several conserved
           tryptophan residues. The function is not known.
          Length = 261

 Score = 30.5 bits (69), Expect = 0.66
 Identities = 22/89 (24%), Positives = 48/89 (53%), Gaps = 8/89 (8%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKE---EEEEEEEKKEEDEIEKEEGENEEEEEEEV 227
           EE++K+E E E EE E  + + ++K +E   ++ ++  K+   + +KE  E+        
Sbjct: 109 EEKQKQEREKEREEAELRQRLAKEKYEEWCRQKAQQAAKQRTPKHKKEAAESASSSLSGS 168

Query: 228 VKQKKNEEEEEEEEVL-----IKLEEEEE 251
            K ++N  +EE ++ L      KL+++++
Sbjct: 169 AKPERNVSQEEAKKRLQEWELKKLKQQQQ 197



 Score = 30.5 bits (69), Expect = 0.79
 Identities = 15/64 (23%), Positives = 30/64 (46%)

Query: 175 KEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNE 234
           K+    ++ +   EE+ KQ++ KE EE E  ++   E  +E    + ++  +    K  +
Sbjct: 96  KQAQRQKKLQKLLEEKQKQEREKEREEAELRQRLAKEKYEEWCRQKAQQAAKQRTPKHKK 155

Query: 235 EEEE 238
           E  E
Sbjct: 156 EAAE 159



 Score = 29.7 bits (67), Expect = 1.1
 Identities = 22/83 (26%), Positives = 41/83 (49%), Gaps = 3/83 (3%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           +E K E    E  EN    +  Q++ K ++  EE++K+E E E+EE E  +   +E  ++
Sbjct: 77  KEVKLERQAQEAYENWLSAKQAQRQKKLQKLLEEKQKQEREKEREEAELRQRLAKEKYEE 136

Query: 231 ---KKNEEEEEEEEVLIKLEEEE 250
              +K ++  ++     K E  E
Sbjct: 137 WCRQKAQQAAKQRTPKHKKEAAE 159



 Score = 28.5 bits (64), Expect = 2.6
 Identities = 19/84 (22%), Positives = 41/84 (48%), Gaps = 3/84 (3%)

Query: 160 EIFKFW-ENIAGEEEKKEEDEI--EEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEE 216
           E ++ W    A +  K+   +   E  E+         K +    +EE KK   E E ++
Sbjct: 132 EKYEEWCRQKAQQAAKQRTPKHKKEAAESASSSLSGSAKPERNVSQEEAKKRLQEWELKK 191

Query: 217 GENEEEEEEEVVKQKKNEEEEEEE 240
            + ++++ EE  ++++ +++EEEE
Sbjct: 192 LKQQQQKREEERRKQRKKQQEEEE 215



 Score = 28.5 bits (64), Expect = 3.4
 Identities = 20/108 (18%), Positives = 47/108 (43%), Gaps = 19/108 (17%)

Query: 163 KFWENIAGEEEKKEEDEIEEEENEEEEEVKQ---KKNKEEEEEEEEKKEEDEIEKEEGEN 219
           K  E    +E +KE +E E  +   +E+ ++   +K ++  ++   K +++  E      
Sbjct: 106 KLLEEKQKQEREKEREEAELRQRLAKEKYEEWCRQKAQQAAKQRTPKHKKEAAESASSSL 165

Query: 220 ----------------EEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEE 251
                           +  +E E+ K K+ +++ EEE    + +++EE
Sbjct: 166 SGSAKPERNVSQEEAKKRLQEWELKKLKQQQQKREEERRKQRKKQQEE 213


>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34.  This family represents
           herpes virus protein U79 and cytomegalovirus early
           phosphoprotein P34 (UL112).
          Length = 238

 Score = 30.6 bits (69), Expect = 0.67
 Identities = 14/81 (17%), Positives = 36/81 (44%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQK 231
             KK +DE  +   +++E+ + + +++ +E+  +K+EE     E+            +  
Sbjct: 151 SRKKSDDEHRKRSGKQKEKRRVEDSQKHKEDRRKKQEEKRRNDEDKRPGGGGGSSGGQSG 210

Query: 232 KNEEEEEEEEVLIKLEEEEEN 252
            + ++E  +E   K  + E  
Sbjct: 211 LSTKDEPPKEKRQKHHDPERR 231


>gnl|CDD|224236 COG1317, FliH, Flagellar biosynthesis/type III secretory pathway
           protein [Cell motility and secretion / Intracellular
           trafficking and secretion].
          Length = 234

 Score = 30.4 bits (69), Expect = 0.67
 Identities = 23/107 (21%), Positives = 42/107 (39%)

Query: 136 KFLKPEGVMYPYKCILHSAPSYSPEIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKK 195
             +  E         +      +    K     A  EE++ E  +E +E E E   ++ +
Sbjct: 2   SNIINEDRDDALPFQVIEFKDLANAGPKVAPEKAVLEEEELEQALEAKEEELESAAQELQ 61

Query: 196 NKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEV 242
              EE   E  +E  ++  EEG  E +EE  V+++      E + E+
Sbjct: 62  EGIEEGAREGYEEGFQLGYEEGFEEGQEEGRVLERLAKLIAEFQAEL 108


>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein.  This family includes proteins
           related to Mpp10 (M phase phosphoprotein 10). The U3
           small nucleolar ribonucleoprotein (snoRNP) is required
           for three cleavage events that generate the mature 18S
           rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
           depletion of Mpp10, a U3 snoRNP-specific protein, halts
           18S rRNA production and impairs cleavage at the three U3
           snoRNP-dependent sites.
          Length = 613

 Score = 30.7 bits (69), Expect = 0.67
 Identities = 18/76 (23%), Positives = 37/76 (48%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           +    + E   +DE EEEE+E  E+       E +   E +   +++  +E E++EE++ 
Sbjct: 105 DGSDMDSEDSADDEEEEEEDESLEDEMIDDEDEADLFNESESSLEDLSDDETEDDEEKKM 164

Query: 226 EVVKQKKNEEEEEEEE 241
           E  +  + +E  E+  
Sbjct: 165 EEEEAGEEKESVEQAT 180



 Score = 30.7 bits (69), Expect = 0.74
 Identities = 18/81 (22%), Positives = 41/81 (50%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           ++ +  ED  +  + + E+    ++ +EE+E  E++  +DE E +     E   E++   
Sbjct: 95  QDSESHEDGSDGSDMDSEDSADDEEEEEEDESLEDEMIDDEDEADLFNESESSLEDLSDD 154

Query: 231 KKNEEEEEEEEVLIKLEEEEE 251
           +  ++EE++ E     EE+E 
Sbjct: 155 ETEDDEEKKMEEEEAGEEKES 175



 Score = 29.6 bits (66), Expect = 1.9
 Identities = 16/70 (22%), Positives = 40/70 (57%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
           AG++ + E+DE ++E  ++E + K ++  EE++E+E+ ++E+E  +   +  + +E  + 
Sbjct: 269 AGDDAELEDDEPDKEAVKKEADSKPEEEDEEDDEQEDDQDEEEPPEAAMDKVKLDEPVLE 328

Query: 229 KQKKNEEEEE 238
                  +E 
Sbjct: 329 GVDLESPKEL 338



 Score = 29.2 bits (65), Expect = 2.3
 Identities = 22/81 (27%), Positives = 49/81 (60%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           EE+ +E +   E+  + +E+ K+K   ++ E E+++ +++ ++KE     EEE+EE  +Q
Sbjct: 244 EEDDEEGNIEYEDFFDPKEKDKKKDAGDDAELEDDEPDKEAVKKEADSKPEEEDEEDDEQ 303

Query: 231 KKNEEEEEEEEVLIKLEEEEE 251
           + +++EEE  E  +   + +E
Sbjct: 304 EDDQDEEEPPEAAMDKVKLDE 324



 Score = 29.2 bits (65), Expect = 2.5
 Identities = 17/76 (22%), Positives = 39/76 (51%)

Query: 175 KEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNE 234
           K +D    E+  +  ++  + + ++EEEEEE +  ++   ++ +  +   E     +   
Sbjct: 93  KLQDSESHEDGSDGSDMDSEDSADDEEEEEEDESLEDEMIDDEDEADLFNESESSLEDLS 152

Query: 235 EEEEEEEVLIKLEEEE 250
           ++E E++   K+EEEE
Sbjct: 153 DDETEDDEEKKMEEEE 168



 Score = 28.4 bits (63), Expect = 3.7
 Identities = 16/70 (22%), Positives = 40/70 (57%)

Query: 173 EKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKK 232
           ++K++ +   ++ E E++   K+  ++E + + ++E++E +++E + +EEE  E    K 
Sbjct: 261 KEKDKKKDAGDDAELEDDEPDKEAVKKEADSKPEEEDEEDDEQEDDQDEEEPPEAAMDKV 320

Query: 233 NEEEEEEEEV 242
             +E   E V
Sbjct: 321 KLDEPVLEGV 330



 Score = 28.4 bits (63), Expect = 3.9
 Identities = 13/71 (18%), Positives = 38/71 (53%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQK 231
           + K EE++ E++E E++++ ++      ++ + ++   + ++ E  +     E+   K K
Sbjct: 290 DSKPEEEDEEDDEQEDDQDEEEPPEAAMDKVKLDEPVLEGVDLESPKELSSFEKRQAKLK 349

Query: 232 KNEEEEEEEEV 242
           +  E+ E+E +
Sbjct: 350 QQIEQLEKENL 360



 Score = 28.0 bits (62), Expect = 5.5
 Identities = 18/77 (23%), Positives = 45/77 (58%)

Query: 165 WENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEE 224
           +E+    +EK ++ +  ++   E++E  ++  K+E + + E+++E++ E+E+ ++EEE  
Sbjct: 254 YEDFFDPKEKDKKKDAGDDAELEDDEPDKEAVKKEADSKPEEEDEEDDEQEDDQDEEEPP 313

Query: 225 EEVVKQKKNEEEEEEEE 241
           E  + + K +E   E  
Sbjct: 314 EAAMDKVKLDEPVLEGV 330



 Score = 27.6 bits (61), Expect = 6.8
 Identities = 18/86 (20%), Positives = 47/86 (54%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           E +  +E  +   ++++ E+ E+       + E+  ++EE++EEDE  ++E  ++E+E +
Sbjct: 80  EFLINKEHIRVLAKLQDSESHEDGSDGSDMDSEDSADDEEEEEEDESLEDEMIDDEDEAD 139

Query: 226 EVVKQKKNEEEEEEEEVLIKLEEEEE 251
              + + + E+  ++E     E++ E
Sbjct: 140 LFNESESSLEDLSDDETEDDEEKKME 165


>gnl|CDD|234492 TIGR04188, methyltr_grsp, methyltransferase, ATP-grasp peptide
           maturase system.  Members of this protein family are
           predicted SAM-dependent methyltransferases that
           regularly occur in the context of a putative peptide
           modification ATP-grasp enzyme (TIGR04187, related to
           enzymes of microviridin maturation) and a putative
           ribosomal peptide modification target (TIGR04186).
          Length = 363

 Score = 30.8 bits (70), Expect = 0.68
 Identities = 11/21 (52%), Positives = 13/21 (61%)

Query: 49  AGKTVLDVGTGTGKSILLQGH 69
            G  VL++GTGTG S  L  H
Sbjct: 107 DGHRVLEIGTGTGYSAALLCH 127


>gnl|CDD|113514 pfam04747, DUF612, Protein of unknown function, DUF612.  This
           family includes several uncharacterized proteins from
           Caenorhabditis elegans.
          Length = 517

 Score = 30.8 bits (68), Expect = 0.72
 Identities = 22/81 (27%), Positives = 50/81 (61%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           E+ KK E +  +++  ++ E +QK N ++  E+E ++ E E +K   + EE ++ +  ++
Sbjct: 68  EKVKKSEKKKAQKQIAKDHEAEQKVNAKKAAEKEARRAEAEAKKRAAQEEEHKQWKAEQE 127

Query: 231 KKNEEEEEEEEVLIKLEEEEE 251
           +  +E+E++E  L KL+ E++
Sbjct: 128 RIQKEQEKKEADLKKLQAEKK 148


>gnl|CDD|227492 COG5163, NOP7, Protein required for biogenesis of the 60S ribosomal
           subunit [Translation, ribosomal structure and
           biogenesis].
          Length = 591

 Score = 30.8 bits (69), Expect = 0.73
 Identities = 18/75 (24%), Positives = 38/75 (50%), Gaps = 4/75 (5%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEE-EEEEEEKKEEDEIEKEEG---ENEEEEEEE 226
             E+ +    E+  N  E+   +    EE ++++EE + + E+E E      +E  E ++
Sbjct: 464 TMEETQRHSEEDLVNRFEDVRYEHVAGEEDDDDDEELQAQKELELEAQGIKYSETSEADK 523

Query: 227 VVKQKKNEEEEEEEE 241
            V + KN++ + +EE
Sbjct: 524 DVNKSKNKKRKVDEE 538



 Score = 28.5 bits (63), Expect = 4.0
 Identities = 21/83 (25%), Positives = 31/83 (37%), Gaps = 17/83 (20%)

Query: 176 EEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEV-------- 227
               +  EE +   E       E+   E    EED+ + EE + ++E E E         
Sbjct: 459 RASLMTMEETQRHSEEDLVNRFEDVRYEHVAGEEDDDDDEELQAQKELELEAQGIKYSET 518

Query: 228 ---------VKQKKNEEEEEEEE 241
                     K KK + +EEEEE
Sbjct: 519 SEADKDVNKSKNKKRKVDEEEEE 541


>gnl|CDD|222290 pfam13654, AAA_32, AAA domain.  This family includes a wide variety
           of AAA domains including some that have lost essential
           nucleotide binding residues in the P-loop.
          Length = 509

 Score = 30.5 bits (70), Expect = 0.75
 Identities = 32/95 (33%), Positives = 48/95 (50%), Gaps = 19/95 (20%)

Query: 176 EEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEED-------------------EIEKEE 216
           E +E E  + E EEE ++K+ +  EE EEE KE+                     + +EE
Sbjct: 109 ESEEYEARKEEIEEEFQEKREEAFEELEEEAKEKGFALVRTPGGFVFAPLKDGEPLTEEE 168

Query: 217 GENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEE 251
            E   EEE E +++K +E EEE +E+L +L E E 
Sbjct: 169 FEALPEEEREELEEKIDELEEELQEILRQLRELER 203


>gnl|CDD|211344 cd02867, PseudoU_synth_TruB_4, Pseudouridine synthase homolog 4.
           This group consists of Eukaryotic TruB proteins similar
           to Saccharomyces cerevisiae Pus4. S. cerevisiae Pus4,
           makes psi55 in the T loop of both cytoplasmic and
           mitochondrial tRNAs. Psi55 is almost universally
           conserved.  Pseudouridine synthases catalyze the
           isomerization of specific uridines in an RNA molecule to
           pseudouridines (5-ribosyluracil, psi).
          Length = 312

 Score = 30.5 bits (69), Expect = 0.76
 Identities = 19/89 (21%), Positives = 32/89 (35%), Gaps = 5/89 (5%)

Query: 157 YSPEIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEE 216
            S  + K W        +  E+E ++ E    + + ++     E  E     E +  +E 
Sbjct: 175 VSELLVKDWIEPGPLFTRTVEEEGKQYERSVVKMLGKELKTFAEVTELTATAEGDPVEEV 234

Query: 217 GENEEEEEEEVVKQKKNEEEEEEEEVLIK 245
               EE      K+K   EEE  E+ L  
Sbjct: 235 EATHEES-----KRKSEVEEEANEKSLGP 258


>gnl|CDD|216666 pfam01728, FtsJ, FtsJ-like methyltransferase.  This family consists
           of FtsJ from various bacterial and archaeal sources FtsJ
           is a methyltransferase, but actually has no effect on
           cell division. FtsJ's substrate is the 23S rRNA. The 1.5
           A crystal structure of FtsJ in complex with its cofactor
           S-adenosylmethionine revealed that FtsJ has a
           methyltransferase fold. This family also includes the N
           terminus of flaviviral NS5 protein. It has been
           hypothesised that the N-terminal domain of NS5 is a
           methyltransferase involved in viral RNA capping.
          Length = 177

 Score = 29.9 bits (68), Expect = 0.78
 Identities = 28/121 (23%), Positives = 44/121 (36%), Gaps = 39/121 (32%)

Query: 50  GKTVLDVGTGTG--KSILLQ------GHGVDHHSFCTKVHPLDHYAPQYLILEVIQNKIE 101
           GKTVLD+G   G    +LL+         VD          L    P   +   ++  I 
Sbjct: 23  GKTVLDLGAAPGGFSQVLLERGGAGKVVAVD----------LGPMEPIQGV-YFLRGDIT 71

Query: 102 NVE--------LPEKVDIIVSEWMGFYLLHESMIDS----------VIFARDKFLKPEGV 143
           + E        LP KVD+++S+     +      DS          ++ A +  L+P G 
Sbjct: 72  DPETLEKLRELLPGKVDLVLSD-GAPNVSGIENTDSFISLQLVLAALLLALE-VLRPGGN 129

Query: 144 M 144
            
Sbjct: 130 F 130


>gnl|CDD|235850 PRK06669, fliH, flagellar assembly protein H; Validated.
          Length = 281

 Score = 30.4 bits (69), Expect = 0.79
 Identities = 18/54 (33%), Positives = 30/54 (55%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEE 224
           EE K+E  +  +E +   E+++ +  +E+EE EEE +   E  K EG  E  E+
Sbjct: 77  EEAKEELLKKTDEASSIIEKLQMQIEREQEEWEEELERLIEEAKAEGYEEGYEK 130



 Score = 29.6 bits (67), Expect = 1.4
 Identities = 25/81 (30%), Positives = 44/81 (54%), Gaps = 2/81 (2%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           +    +E E   EE EE+ E  +++  +E +E  E+ EED  E  E   EE +EE  + +
Sbjct: 28  KVLSIKEKERLREEEEEQVEQLREEANDEAKEIIEEAEEDAFEIVEAAEEEAKEE--LLK 85

Query: 231 KKNEEEEEEEEVLIKLEEEEE 251
           K +E     E++ +++E E+E
Sbjct: 86  KTDEASSIIEKLQMQIEREQE 106



 Score = 28.4 bits (64), Expect = 3.4
 Identities = 20/73 (27%), Positives = 40/73 (54%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           E+ ++  +E  +E  E  EE ++   +  E  EEE KEE   + +E  +  E+ +  +++
Sbjct: 44  EQVEQLREEANDEAKEIIEEAEEDAFEIVEAAEEEAKEELLKKTDEASSIIEKLQMQIER 103

Query: 231 KKNEEEEEEEEVL 243
           ++ E EEE E ++
Sbjct: 104 EQEEWEEELERLI 116



 Score = 27.7 bits (62), Expect = 5.9
 Identities = 22/87 (25%), Positives = 45/87 (51%), Gaps = 1/87 (1%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           +    ++ + +   I+E+E   EEE +Q +   EE  +E K+  +E E++  E  E  EE
Sbjct: 18  KTHEIQKYRFKVLSIKEKERLREEEEEQVEQLREEANDEAKEIIEEAEEDAFEIVEAAEE 77

Query: 226 EVVKQKKNEEEEEEEEVLIKLEEEEEN 252
           E  ++   ++ +E   ++ KL+ + E 
Sbjct: 78  EAKEELL-KKTDEASSIIEKLQMQIER 103


>gnl|CDD|225880 COG3343, RpoE, DNA-directed RNA polymerase, delta subunit
           [Transcription].
          Length = 175

 Score = 30.1 bits (68), Expect = 0.79
 Identities = 25/86 (29%), Positives = 50/86 (58%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           E I    EKK+    ++E +  EE  + + + +E++EEEE  E D ++ E  + +E+++E
Sbjct: 89  EEIQAMTEKKDIKAKDKEVDAFEEGDEDELDYDEDKEEEEDDEVDSLDDENDDEDEDDDE 148

Query: 226 EVVKQKKNEEEEEEEEVLIKLEEEEE 251
            V    +++E +E+E+     E+EE+
Sbjct: 149 IVEILIEDDEVDEDEDDDEDEEDEED 174



 Score = 27.0 bits (60), Expect = 6.6
 Identities = 22/74 (29%), Positives = 44/74 (59%)

Query: 178 DEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEE 237
           DEI+EE     E+   K   +E +  EE  E++    E+ E EE++E + +  + ++E+E
Sbjct: 85  DEIDEEIQAMTEKKDIKAKDKEVDAFEEGDEDELDYDEDKEEEEDDEVDSLDDENDDEDE 144

Query: 238 EEEEVLIKLEEEEE 251
           +++E++  L E++E
Sbjct: 145 DDDEIVEILIEDDE 158


>gnl|CDD|237875 PRK14974, PRK14974, cell division protein FtsY; Provisional.
          Length = 336

 Score = 30.3 bits (69), Expect = 0.80
 Identities = 25/61 (40%), Positives = 39/61 (63%), Gaps = 2/61 (3%)

Query: 191 VKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEE--EEEEEEVLIKLEE 248
           V++ + K EEEEEEE  E +E E+EE E E++E+     + K  E  E++ E++L +LE 
Sbjct: 13  VEKVEEKIEEEEEEEAPEAEEEEEEEDEEEKKEKPGFFDKAKITEIKEKDIEDLLEELEL 72

Query: 249 E 249
           E
Sbjct: 73  E 73



 Score = 29.9 bits (68), Expect = 1.0
 Identities = 19/52 (36%), Positives = 34/52 (65%)

Query: 174 KKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           +K E++IEEEE EE  E ++++ +E+EEE++EK    +  K     E++ E+
Sbjct: 14  EKVEEKIEEEEEEEAPEAEEEEEEEDEEEKKEKPGFFDKAKITEIKEKDIED 65



 Score = 27.2 bits (61), Expect = 9.0
 Identities = 21/51 (41%), Positives = 31/51 (60%)

Query: 200 EEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEE 250
           EEEEEE+  E E E+EE + EE++E+     K    E +E+++   LEE E
Sbjct: 21  EEEEEEEAPEAEEEEEEEDEEEKKEKPGFFDKAKITEIKEKDIEDLLEELE 71


>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6.  The surfeit locus
           protein SURF-6 is shown to be a component of the
           nucleolar matrix and has a strong binding capacity for
           nucleic acids.
          Length = 206

 Score = 30.0 bits (68), Expect = 0.80
 Identities = 16/66 (24%), Positives = 30/66 (45%), Gaps = 3/66 (4%)

Query: 174 KKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKN 233
               + + E+   + E+ K +K   +++ +E KK+ED  + E  E + EE +   K    
Sbjct: 1   PSSREALLEQRRRKREQRKARK---KQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPI 57

Query: 234 EEEEEE 239
           E  E  
Sbjct: 58  ENAEGN 63


>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin.  Nucleoplasmins are also
           known as chromatin decondensation proteins. They bind to
           core histones and transfer DNA to them in a reaction
           that requires ATP. This is thought to play a role in the
           assembly of regular nucleosomal arrays.
          Length = 146

 Score = 29.6 bits (67), Expect = 0.82
 Identities = 14/35 (40%), Positives = 26/35 (74%)

Query: 198 EEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKK 232
           EE+E ++++++E+E + EE ++E+E EEE    KK
Sbjct: 110 EEDESDDDEEDEEEEDDEEDDDEDESEEEESPVKK 144



 Score = 28.1 bits (63), Expect = 3.1
 Identities = 15/37 (40%), Positives = 28/37 (75%), Gaps = 1/37 (2%)

Query: 197 KEEEEEEEEKKEEDEIEKEEGENEEEEEEEV-VKQKK 232
           +E+E +++E+ EE+E ++E+ + +E EEEE  VK+ K
Sbjct: 110 EEDESDDDEEDEEEEDDEEDDDEDESEEEESPVKKVK 146


>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
          Length = 651

 Score = 30.8 bits (70), Expect = 0.82
 Identities = 13/45 (28%), Positives = 30/45 (66%), Gaps = 2/45 (4%)

Query: 165 WENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEE 209
           W+    EE ++E+   E+E  +E++ +++ K +EE++++E +K E
Sbjct: 545 WKLDDKEELQREK--EEKEALKEQKRLRKLKKQEEKKKKELEKLE 587



 Score = 30.0 bits (68), Expect = 1.3
 Identities = 14/33 (42%), Positives = 24/33 (72%), Gaps = 1/33 (3%)

Query: 184 ENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEE 216
           E EE+E +K++K   + +++EEKK+  E+EK E
Sbjct: 556 EKEEKEALKEQKRLRKLKKQEEKKK-KELEKLE 587



 Score = 29.6 bits (67), Expect = 2.0
 Identities = 14/41 (34%), Positives = 26/41 (63%)

Query: 212 IEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEEN 252
            +KEE + E+EE+E + +QK+  + +++EE   K  E+ E 
Sbjct: 548 DDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588



 Score = 28.5 bits (64), Expect = 3.6
 Identities = 10/48 (20%), Positives = 29/48 (60%)

Query: 189 EEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEE 236
           ++ ++ + ++EE+E  ++++     K++ E +++E E++ K K    E
Sbjct: 548 DDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAKIPPAE 595



 Score = 28.5 bits (64), Expect = 4.0
 Identities = 11/41 (26%), Positives = 28/41 (68%), Gaps = 2/41 (4%)

Query: 186 EEEEEVKQKKNKEEEEEE--EEKKEEDEIEKEEGENEEEEE 224
           +++EE++++K ++E  +E    +K + + EK++ E E+ E+
Sbjct: 548 DDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588



 Score = 28.5 bits (64), Expect = 4.6
 Identities = 20/69 (28%), Positives = 36/69 (52%), Gaps = 6/69 (8%)

Query: 180 IEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEE 239
           I  E+  +   V +  +KEE + E+E+KE  + +K   + +++EE      KK +E E+ 
Sbjct: 533 IRIEDKPDGPSVWKLDDKEELQREKEEKEALKEQKRLRKLKKQEE------KKKKELEKL 586

Query: 240 EEVLIKLEE 248
           E+  I   E
Sbjct: 587 EKAKIPPAE 595



 Score = 28.1 bits (63), Expect = 4.8
 Identities = 16/45 (35%), Positives = 27/45 (60%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEE 216
           EE + E E +E   E++   K KK +E++++E EK E+ +I   E
Sbjct: 551 EELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAKIPPAE 595


>gnl|CDD|234786 PRK00517, prmA, ribosomal protein L11 methyltransferase; Reviewed.
          Length = 250

 Score = 30.1 bits (69), Expect = 0.82
 Identities = 12/20 (60%), Positives = 14/20 (70%), Gaps = 2/20 (10%)

Query: 46  NIFAGKTVLDVGTGTGKSIL 65
            +  GKTVLDVG G+G  IL
Sbjct: 116 LVLPGKTVLDVGCGSG--IL 133


>gnl|CDD|217502 pfam03343, SART-1, SART-1 family.  SART-1 is a protein involved in
           cell cycle arrest and pre-mRNA splicing. It has been
           shown to be a component of U4/U6 x U5 tri-snRNP complex
           in human, Schizosaccharomyces pombe and Saccharomyces
           cerevisiae. SART-1 is a known tumour antigen in a range
           of cancers recognised by T cells.
          Length = 603

 Score = 30.5 bits (69), Expect = 0.85
 Identities = 18/68 (26%), Positives = 39/68 (57%)

Query: 176 EEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEE 235
           +++ +EE+   ++E V++  + EE++E+EE ++ D   +    + +EE+EE  K+     
Sbjct: 404 QKEPLEEKPENKDESVEEISDAEEDDEDEEDEDGDGDVEMSAVDNDEEKEEEDKEAIPST 463

Query: 236 EEEEEEVL 243
             EEE  +
Sbjct: 464 ILEEEPTV 471



 Score = 29.0 bits (65), Expect = 2.8
 Identities = 16/65 (24%), Positives = 31/65 (47%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           E     +++  E+  + EE++E+EE +      E    +  +E++E +KE   +   EEE
Sbjct: 409 EEKPENKDESVEEISDAEEDDEDEEDEDGDGDVEMSAVDNDEEKEEEDKEAIPSTILEEE 468

Query: 226 EVVKQ 230
             V  
Sbjct: 469 PTVGG 473


>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
          Length = 880

 Score = 30.8 bits (70), Expect = 0.85
 Identities = 25/81 (30%), Positives = 37/81 (45%), Gaps = 2/81 (2%)

Query: 170 GEEEKKEEDEIEEEENEEEEEVKQKKNKEEE-EEEEEKKEEDEIEKEEGENEEEEEEEVV 228
            E+  + ED IE  E   E+  +    + E  EE+ E+ EE      E E E EE+ E  
Sbjct: 501 AEDLVEAEDRIERLEERREDLEELIAERRETIEEKRERAEELRERAAELEAEAEEKREAA 560

Query: 229 KQKKNEEEEEEEEVLIKLEEE 249
             +  EE EE  E + +L  +
Sbjct: 561 -AEAEEEAEEAREEVAELNSK 580



 Score = 28.5 bits (64), Expect = 4.7
 Identities = 28/85 (32%), Positives = 47/85 (55%), Gaps = 4/85 (4%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEE--EEEEEKKEEDEIEKEEGENEEEEEEEVV 228
           E E  E DE  E   E+ E+ ++ +++ +E  EE EE++EE E  + E E+  E   E  
Sbjct: 212 ESELAELDEEIERYEEQREQARETRDEADEVLEEHEERREELETLEAEIEDLRETIAETE 271

Query: 229 KQKKN--EEEEEEEEVLIKLEEEEE 251
           ++++   EE  +  E L +LEEE +
Sbjct: 272 REREELAEEVRDLRERLEELEEERD 296



 Score = 27.7 bits (62), Expect = 8.2
 Identities = 20/76 (26%), Positives = 33/76 (43%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
             I    E   E E E EE  EE    +++ +E EEE ++   E  ++  + E  E   E
Sbjct: 258 AEIEDLRETIAETEREREELAEEVRDLRERLEELEEERDDLLAEAGLDDADAEAVEARRE 317

Query: 226 EVVKQKKNEEEEEEEE 241
           E+  + +   +  EE 
Sbjct: 318 ELEDRDEELRDRLEEC 333


>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
          Length = 1832

 Score = 30.8 bits (70), Expect = 0.87
 Identities = 17/71 (23%), Positives = 35/71 (49%), Gaps = 5/71 (7%)

Query: 172  EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEE-----KKEEDEIEKEEGENEEEEEEE 226
              +K+  EIE    EE+E   Q + + ++E++         +E  IE++  E++ ++ + 
Sbjct: 1512 SNEKKSIEIENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAESDIKKRKN 1571

Query: 227  VVKQKKNEEEE 237
              + K N E E
Sbjct: 1572 KKQYKSNTEAE 1582



 Score = 30.8 bits (70), Expect = 0.89
 Identities = 17/56 (30%), Positives = 31/56 (55%), Gaps = 8/56 (14%)

Query: 161 IFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEE 216
           IF+ W     + E K  D +EE+  ++      KK ++++EEE +++E+  IE  E
Sbjct: 724 IFRNW--NGKDAEFKISDSVEEKTKKK------KKKEKKKEEEYKREEKARIEIAE 771



 Score = 30.5 bits (69), Expect = 0.97
 Identities = 22/79 (27%), Positives = 38/79 (48%), Gaps = 7/79 (8%)

Query: 162  FKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEE 221
                EN+  +  +K++  I   E +  E     +N+ +EE+E   + E E +KE+  N E
Sbjct: 1492 LNGNENVNKKINQKKKGFIPSNEKKSIE----IENRNQEEKEPAGQGELESDKEKKGNLE 1547

Query: 222  EEEEEVVKQKKNEEEEEEE 240
                 +  Q+KN EE+  E
Sbjct: 1548 SV---LSNQEKNIEEDYAE 1563


>gnl|CDD|218258 pfam04774, HABP4_PAI-RBP1, Hyaluronan / mRNA binding family.  This
           family includes the HABP4 family of hyaluronan-binding
           proteins, and the PAI-1 mRNA-binding protein, PAI-RBP1.
           HABP4 has been observed to bind hyaluronan (a
           glucosaminoglycan), but it is not known whether this is
           its primary role in vivo. It has also been observed to
           bind RNA, but with a lower affinity than that for
           hyaluronan. PAI-1 mRNA-binding protein specifically
           binds the mRNA of type-1 plasminogen activator inhibitor
           (PAI-1), and is thought to be involved in regulation of
           mRNA stability. However, in both cases, the sequence
           motifs predicted to be important for ligand binding are
           not conserved throughout the family, so it is not known
           whether members of this family share a common function.
          Length = 106

 Score = 28.9 bits (65), Expect = 0.88
 Identities = 21/59 (35%), Positives = 33/59 (55%)

Query: 173 EKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQK 231
            K E  E+ EE+ EEEE+ + ++ +  EEE  ++   +E E EE E++E   EE  K  
Sbjct: 30  VKDEIAELTEEQGEEEEKNEVEEKQAVEEEANKEGVVEEEEVEEEEDKEMTLEEYEKAL 88



 Score = 28.1 bits (63), Expect = 1.9
 Identities = 23/77 (29%), Positives = 46/77 (59%)

Query: 175 KEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNE 234
             +DEI E   E+ EE ++ + +E++  EEE  +E  +E+EE E EE++E  + + +K  
Sbjct: 29  SVKDEIAELTEEQGEEEEKNEVEEKQAVEEEANKEGVVEEEEVEEEEDKEMTLEEYEKAL 88

Query: 235 EEEEEEEVLIKLEEEEE 251
           +E+ +    +K+ + +E
Sbjct: 89  QEKRKALPALKVRKAKE 105


>gnl|CDD|227657 COG5354, COG5354, Uncharacterized protein, contains Trp-Asp (WD)
           repeat [General function prediction only].
          Length = 561

 Score = 30.6 bits (69), Expect = 0.88
 Identities = 20/82 (24%), Positives = 34/82 (41%), Gaps = 7/82 (8%)

Query: 175 KEEDEIEEEENEEE------EEVKQKKNKEEEEEEEEK-KEEDEIEKEEGENEEEEEEEV 227
           ++   +E +           E V+Q     +E++     K+   IE  +      EE EV
Sbjct: 480 EKAVIMEADSANRSSAPRKKELVEQWPEYSDEDKIRSLLKKLRAIEALKERMRSGEELEV 539

Query: 228 VKQKKNEEEEEEEEVLIKLEEE 249
           ++  K E EEE    L +L E+
Sbjct: 540 IQVNKIETEEEVLSELKELVED 561


>gnl|CDD|240341 PTZ00272, PTZ00272, heat shock protein 83 kDa (Hsp83); Provisional.
          Length = 701

 Score = 30.4 bits (68), Expect = 0.88
 Identities = 25/75 (33%), Positives = 42/75 (56%), Gaps = 1/75 (1%)

Query: 161 IFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEE-EEKKEEDEIEKEEGEN 219
           I K  E I  + E   E   E+E  +E+EE  +K +++ EE + EE KE DE +K++ + 
Sbjct: 192 IKKHSEFIGYDIELMVEKTTEKEVTDEDEEDTKKADEDGEEPKVEEVKEGDEGKKKKTKK 251

Query: 220 EEEEEEEVVKQKKNE 234
            +E  +E   Q K++
Sbjct: 252 VKEVTKEYEVQNKHK 266



 Score = 28.5 bits (63), Expect = 4.6
 Identities = 19/54 (35%), Positives = 33/54 (61%)

Query: 189 EEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEV 242
           E+  +K+  +E+EE+ +K +ED  E +  E +E +E +  K KK +E  +E EV
Sbjct: 208 EKTTEKEVTDEDEEDTKKADEDGEEPKVEEVKEGDEGKKKKTKKVKEVTKEYEV 261


>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3.  This family
           of proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). Shr3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of Shr3, AAPs are retained in the ER.
          Length = 196

 Score = 30.0 bits (68), Expect = 0.88
 Identities = 10/37 (27%), Positives = 21/37 (56%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKE 208
           ++ K+++E   EE +E      KK+   ++ E +KK+
Sbjct: 160 KDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196



 Score = 29.7 bits (67), Expect = 1.1
 Identities = 10/34 (29%), Positives = 16/34 (47%)

Query: 209 EDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEV 242
           +D  +KEE   EE +E      KK+   ++ E  
Sbjct: 160 KDAKQKEEFAAEERKEALAAAAKKSATPQKVETK 193



 Score = 29.3 bits (66), Expect = 1.3
 Identities = 7/33 (21%), Positives = 14/33 (42%)

Query: 177 EDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEE 209
           +D  ++EE   EE  +      ++    +K E 
Sbjct: 160 KDAKQKEEFAAEERKEALAAAAKKSATPQKVET 192



 Score = 27.0 bits (60), Expect = 7.5
 Identities = 10/39 (25%), Positives = 18/39 (46%), Gaps = 2/39 (5%)

Query: 173 EKKEEDEIEE--EENEEEEEVKQKKNKEEEEEEEEKKEE 209
           E+K+  + EE   E  +E      K     ++ E KK++
Sbjct: 158 ERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196


>gnl|CDD|185618 PTZ00438, PTZ00438, gamete antigen 27/25-like protein; Provisional.
          Length = 374

 Score = 30.4 bits (68), Expect = 0.89
 Identities = 23/77 (29%), Positives = 45/77 (58%)

Query: 167 NIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEE 226
           NI  + E++   + EEEE+E+ EE+++ +  E  EEE +  E+ E + E+  + E +E E
Sbjct: 94  NIIVKNEEERGTQKEEEEDEDVEEIEEVEEVEVVEEEYDDDEDSEKDDEKESDAEGDENE 153

Query: 227 VVKQKKNEEEEEEEEVL 243
           +  +   EE +++  +L
Sbjct: 154 LAGEYIIEEVDDDVAIL 170



 Score = 29.3 bits (65), Expect = 2.1
 Identities = 23/72 (31%), Positives = 42/72 (58%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           + E   E E      + EEE   +K +EE+E+ EE +E +E+E  E E +++E+ E   +
Sbjct: 83  DNENDVELEGLNIIVKNEEERGTQKEEEEDEDVEEIEEVEEVEVVEEEYDDDEDSEKDDE 142

Query: 231 KKNEEEEEEEEV 242
           K+++ E +E E+
Sbjct: 143 KESDAEGDENEL 154



 Score = 28.9 bits (64), Expect = 3.0
 Identities = 23/87 (26%), Positives = 40/87 (45%)

Query: 155 PSYSPEIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEK 214
            S + +     E  A        ++  + EN+ E E      K EEE   +K+EE++ + 
Sbjct: 56  LSEACDYIAIEEYSACGNVASIANDKSDNENDVELEGLNIIVKNEEERGTQKEEEEDEDV 115

Query: 215 EEGENEEEEEEEVVKQKKNEEEEEEEE 241
           EE E  EE E    +   +E+ E+++E
Sbjct: 116 EEIEEVEEVEVVEEEYDDDEDSEKDDE 142


>gnl|CDD|234352 TIGR03779, Bac_Flav_CT_M, Bacteroides conjugative transposon TraM
           protein.  Members of this protein family are designated
           TraM and are found in a proposed transfer region of a
           class of conjugative transposon found in the Bacteroides
           lineage [Cellular processes, DNA transformation].
          Length = 410

 Score = 30.5 bits (69), Expect = 0.95
 Identities = 15/51 (29%), Positives = 23/51 (45%)

Query: 156 SYSPEIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEK 206
             + E+  F+E    +EEK+   E+EE E+    E       EE+    EK
Sbjct: 128 DINRELGSFYEYPKTDEEKELLREVEELESRLATEPSPAPELEEQLALMEK 178



 Score = 29.7 bits (67), Expect = 1.5
 Identities = 11/41 (26%), Positives = 15/41 (36%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDE 211
            E  K ++E E     EE E +         E EE+    E
Sbjct: 137 YEYPKTDEEKELLREVEELESRLATEPSPAPELEEQLALME 177



 Score = 27.8 bits (62), Expect = 6.2
 Identities = 17/87 (19%), Positives = 32/87 (36%), Gaps = 19/87 (21%)

Query: 172 EEKKEEDEIEEEENEEEEEVK----------------QKKNKEEEEEEEEKKEEDEIEKE 215
           +E ++    +   +EE +E                  +  N+E     E  K ++E E  
Sbjct: 90  KESQDYSSSDSGFDEEPDEPAETAGSLRPIRSSAAAYRDINRELGSFYEYPKTDEEKELL 149

Query: 216 EGENEEEEEEEVVKQKKNEEEEEEEEV 242
               E EE E  +  + +   E EE++
Sbjct: 150 R---EVEELESRLATEPSPAPELEEQL 173



 Score = 27.4 bits (61), Expect = 7.5
 Identities = 18/85 (21%), Positives = 39/85 (45%), Gaps = 7/85 (8%)

Query: 165 WENIAGEEEKKEEDEIEEEENEE-----EEEVKQKKNKEEEEEEEEKKEEDEIE--KEEG 217
           W   A  EE+   +E  +  N +     +E +   K K  E+E  E+K++D+    ++  
Sbjct: 23  WLIFAPSEEEDAAEEETKGFNADIPQPKDEGIIGDKMKAYEQEALEQKQKDKKRTLQDLA 82

Query: 218 ENEEEEEEEVVKQKKNEEEEEEEEV 242
           ++ E+ ++E      ++   +EE  
Sbjct: 83  DSFEKSDKESQDYSSSDSGFDEEPD 107


>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
           fragment found in eukaryotic DNA topoisomerase (topo) IB
           proteins similar to the monomeric yeast and human topo I
           and heterodimeric topo I from Leishmania donvanni. Topo
           I enzymes are divided into:  topo type IA (bacterial)
           and type IB (eukaryotic). Topo I relaxes superhelical
           tension in duplex DNA by creating a single-strand nick,
           the broken strand can then rotate around the unbroken
           strand to remove DNA supercoils and, the nick is
           religated, liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit re-ligation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  In addition to differences in
           structure and some biochemical properties,
           Trypanosomatid parasite topo I differ from human topo I
           in their sensitivity to CPTs and other classical topo I
           inhibitors. Trypanosomatid topos I play putative roles
           in organizing the kinetoplast DNA network unique to
           these parasites.  This family may represent more than
           one structural domain.
          Length = 215

 Score = 29.9 bits (68), Expect = 0.96
 Identities = 10/28 (35%), Positives = 21/28 (75%)

Query: 188 EEEVKQKKNKEEEEEEEEKKEEDEIEKE 215
           EEE ++KK   +EE++  K+E++++E+ 
Sbjct: 95  EEEKEKKKAMSKEEKKAIKEEKEKLEEP 122


>gnl|CDD|225871 COG3334, COG3334, Uncharacterized conserved protein [Function
           unknown].
          Length = 192

 Score = 29.7 bits (67), Expect = 0.98
 Identities = 16/77 (20%), Positives = 36/77 (46%), Gaps = 9/77 (11%)

Query: 170 GEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEI---------EKEEGENE 220
            E++   + EIE+      +    +    ++E  E+ K+  E+         +K E ++ 
Sbjct: 47  AEKKAAAQSEIEKFCANIADAAADQLYALQKELLEKLKDLAEVNERLKALEKKKAELKDL 106

Query: 221 EEEEEEVVKQKKNEEEE 237
           EEE E +++ K+ E+ +
Sbjct: 107 EEEREGILRSKQAEDGK 123


>gnl|CDD|217830 pfam03986, Autophagy_N, Autophagocytosis associated protein (Atg3),
           N-terminal domain.  Autophagocytosis is a
           starvation-induced process responsible for transport of
           cytoplasmic proteins to the lysosome/vacuole. Atg3 is a
           ubiquitin like modifier that is topologically similar to
           the canonical E2 enzyme. It catalyzes the conjugation of
           Atg8 and phosphatidylethanolamine.
          Length = 146

 Score = 29.2 bits (66), Expect = 1.00
 Identities = 10/59 (16%), Positives = 30/59 (50%), Gaps = 3/59 (5%)

Query: 171 EEEKKEEDEIEEEENEE---EEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEE 226
           E     E+ +E+E+ ++        + K K++  +EE+  E  + + +  ++ + +E++
Sbjct: 83  EYGDGAEEIVEDEDEDDGWVTTHGNRDKQKDDIADEEDIPEIGDDDDDVVDSSDADEDD 141


>gnl|CDD|224217 COG1298, FlhA, Flagellar biosynthesis pathway, component FlhA [Cell
           motility and secretion / Intracellular trafficking and
           secretion].
          Length = 696

 Score = 30.3 bits (69), Expect = 1.0
 Identities = 15/53 (28%), Positives = 26/53 (49%)

Query: 168 IAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENE 220
                  K +++  EE+ +  EE K+++ +EEEE  ++    D IE E G   
Sbjct: 318 FLAYLLSKRKEQQAEEKEKPAEEAKKEEEEEEEESVDDVLLIDPIELELGYGL 370



 Score = 28.4 bits (64), Expect = 4.6
 Identities = 19/55 (34%), Positives = 32/55 (58%), Gaps = 2/55 (3%)

Query: 198 EEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEEN 252
           +E++ EE++K  +E +KEE E EEE  ++V     +  E E    LI L +E++ 
Sbjct: 327 KEQQAEEKEKPAEEAKKEEEEEEEESVDDV--LLIDPIELELGYGLIPLVDEQQG 379


>gnl|CDD|215319 PLN02585, PLN02585, magnesium protoporphyrin IX methyltransferase.
          Length = 315

 Score = 30.2 bits (68), Expect = 1.1
 Identities = 24/72 (33%), Positives = 36/72 (50%), Gaps = 16/72 (22%)

Query: 2   DMEVVKDHDEEEYFESY------TDLEVHKLMLDDTVRN------EAYRAAICDNPNIFA 49
           D EVV+++     FE +      TD EV+K+ LD  +R       E     + ++ ++ A
Sbjct: 89  DKEVVREYFNTTGFERWRKIYGETD-EVNKVQLD--IRLGHAQTVEKVLLWLAEDGSL-A 144

Query: 50  GKTVLDVGTGTG 61
           G TV D G GTG
Sbjct: 145 GVTVCDAGCGTG 156


>gnl|CDD|227563 COG5238, RNA1, Ran GTPase-activating protein (RanGAP) involved in
           mRNA processing and transport [Signal transduction
           mechanisms / RNA processing and modification].
          Length = 388

 Score = 30.3 bits (68), Expect = 1.1
 Identities = 17/64 (26%), Positives = 33/64 (51%)

Query: 157 YSPEIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEE 216
           Y  +IF+  E +  +E     +E  +EE+E+E E+ +   +E  E E  + + D++ +  
Sbjct: 324 YYEDIFEVVEVVEKQEGDVVTEESTDEESEDEVEIDESVIEEVAEMELLEVQVDDLAERL 383

Query: 217 GENE 220
            E E
Sbjct: 384 AETE 387



 Score = 29.1 bits (65), Expect = 2.1
 Identities = 15/65 (23%), Positives = 30/65 (46%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
               G+  +   + +E  E +E + V ++   EE E+E E  E    E  E E  E + +
Sbjct: 318 LADFGDYYEDIFEVVEVVEKQEGDVVTEESTDEESEDEVEIDESVIEEVAEMELLEVQVD 377

Query: 226 EVVKQ 230
           ++ ++
Sbjct: 378 DLAER 382



 Score = 27.2 bits (60), Expect = 9.1
 Identities = 13/43 (30%), Positives = 22/43 (51%)

Query: 198 EEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEE 240
           E  E++E     +E   EE E+E E +E V+++    E  E +
Sbjct: 333 EVVEKQEGDVVTEESTDEESEDEVEIDESVIEEVAEMELLEVQ 375


>gnl|CDD|220785 pfam10498, IFT57, Intra-flagellar transport protein 57.  Eukaryotic
           cilia and flagella are specialised organelles found at
           the periphery of cells of diverse organisms.
           Intra-flagellar transport (IFT) is required for the
           assembly and maintenance of eukaryotic cilia and
           flagella, and consists of the bidirectional movement of
           large protein particles between the base and the distal
           tip of the organelle. IFT particles contain multiple
           copies of two distinct protein complexes, A and B, which
           contain at least 6 and 11 protein subunits. IFT57 is
           part of complex B but is not, however, required for the
           core subunits to stay associated. This protein is known
           as Huntington-interacting protein-1 in humans.
          Length = 355

 Score = 30.1 bits (68), Expect = 1.1
 Identities = 17/84 (20%), Positives = 42/84 (50%), Gaps = 3/84 (3%)

Query: 165 WENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEE 224
           ++      E+ EE+ ++E++ E   E + ++  E EE ++++  ++   K    +   + 
Sbjct: 121 FKRPKYPNEEDEEENVDEDDAEIILE-EVEEEVEIEEVDDDEGTQETKYKRGDTSLTPQA 179

Query: 225 EEVVKQKKNEEE--EEEEEVLIKL 246
           ++V++   +  E   E E VL +L
Sbjct: 180 KDVLESLIDAAEWKLEVERVLPQL 203



 Score = 29.3 bits (66), Expect = 1.8
 Identities = 18/70 (25%), Positives = 35/70 (50%), Gaps = 5/70 (7%)

Query: 182 EEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEE 241
             E +EEE V +     ++ E   ++ E+E+E EE +++E  +E   K+       + ++
Sbjct: 127 PNEEDEEENVDE-----DDAEIILEEVEEEVEIEEVDDDEGTQETKYKRGDTSLTPQAKD 181

Query: 242 VLIKLEEEEE 251
           VL  L +  E
Sbjct: 182 VLESLIDAAE 191


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 30.4 bits (69), Expect = 1.1
 Identities = 23/81 (28%), Positives = 43/81 (53%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           EE+ KE  E++E+  E  +  +  +   +E  E EK+     E+  G  E  +E E  ++
Sbjct: 279 EEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEE 338

Query: 231 KKNEEEEEEEEVLIKLEEEEE 251
           +  E +++ +E+  +LEE EE
Sbjct: 339 RLEELKKKLKELEKRLEELEE 359



 Score = 28.5 bits (64), Expect = 4.7
 Identities = 19/71 (26%), Positives = 43/71 (60%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQK 231
           EE+ EE + + +E E+  E  +++++  EE + +K+E + ++K       E+ E+ +++ 
Sbjct: 337 EERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEEL 396

Query: 232 KNEEEEEEEEV 242
           +  +EE EEE+
Sbjct: 397 EKAKEEIEEEI 407



 Score = 28.1 bits (63), Expect = 4.8
 Identities = 24/82 (29%), Positives = 49/82 (59%), Gaps = 1/82 (1%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKE-EGENEEEEEEEVVK 229
           E  ++   E+ EE   E + ++++  + EE+E + +KE  E+EK  + E+E  + +E+ +
Sbjct: 444 ELTEEHRKELLEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIKLKELAE 503

Query: 230 QKKNEEEEEEEEVLIKLEEEEE 251
           Q K  EE+ ++  L +LE++ E
Sbjct: 504 QLKELEEKLKKYNLEELEKKAE 525



 Score = 28.1 bits (63), Expect = 5.5
 Identities = 27/81 (33%), Positives = 44/81 (54%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           EE    E+ I+E E +EE   + KK  +E E+  E+ EE     EE + ++EE E + K+
Sbjct: 321 EEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKR 380

Query: 231 KKNEEEEEEEEVLIKLEEEEE 251
                 E+ E+ L +LE+ +E
Sbjct: 381 LTGLTPEKLEKELEELEKAKE 401



 Score = 27.3 bits (61), Expect = 8.8
 Identities = 26/79 (32%), Positives = 40/79 (50%)

Query: 174 KKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKN 233
           +K    +EEE N  EE +K+ + KEE  EE +KK ++  ++ E   E  E  E  K KK 
Sbjct: 313 EKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEERHELYEEAKAKKE 372

Query: 234 EEEEEEEEVLIKLEEEEEN 252
           E E  ++ +     E+ E 
Sbjct: 373 ELERLKKRLTGLTPEKLEK 391


>gnl|CDD|234229 TIGR03490, Mycoplas_LppA, mycoides cluster lipoprotein, LppA/P72
           family.  Members of this protein family occur in
           Mycoplasma mycoides, Mycoplasma hyopneumoniae, and
           related Mycoplasmas in small paralogous families that
           may also include truncated forms and/or pseudogenes.
           Members are predicted lipoproteins with a conserved
           signal peptidase II processing and lipid attachment
           site. Note that the name for certain characterized
           members, p72, reflects an anomalous apparent molecular
           weight, given a theoretical MW of about 61 kDa.
          Length = 541

 Score = 30.2 bits (68), Expect = 1.2
 Identities = 15/70 (21%), Positives = 32/70 (45%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQK 231
            E   +   + +  E  E    K N E ++EE       E + +  +N+EE E+   + K
Sbjct: 43  NENTPKIPKKPDNKEPSENNNNKSNNENKDEENPSSTNPEKKPDPSKNKEEIEKPKDEPK 102

Query: 232 KNEEEEEEEE 241
           K +++ + ++
Sbjct: 103 KPDKKPQADQ 112


>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355).  This
           family of proteins is found in bacteria and viruses.
           Proteins in this family are typically between 180 and
           214 amino acids in length.
          Length = 125

 Score = 28.8 bits (65), Expect = 1.2
 Identities = 20/64 (31%), Positives = 34/64 (53%), Gaps = 2/64 (3%)

Query: 168 IAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEV 227
              +E+ K E + EE+++E E+    K + EE+ E E +K E E+E+ E E    E +  
Sbjct: 16  AIAKEKAKWEKKQEEKKSEAEK--LAKMSAEEKAEYELEKLEKELEELEAELARRELKAE 73

Query: 228 VKQK 231
            K+ 
Sbjct: 74  AKKM 77



 Score = 26.5 bits (59), Expect = 8.1
 Identities = 16/57 (28%), Positives = 35/57 (61%)

Query: 180 IEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEE 236
           I +E+ + E++ ++KK++ E+  +   +E+ E E E+ E E EE E  + +++ + E
Sbjct: 17  IAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEELEAELARRELKAE 73


>gnl|CDD|215364 PLN02678, PLN02678, seryl-tRNA synthetase.
          Length = 448

 Score = 30.1 bits (68), Expect = 1.3
 Identities = 17/50 (34%), Positives = 27/50 (54%), Gaps = 1/50 (2%)

Query: 176 EEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           E D + +E N+  +EV + K   +E+  E   E  E++KE  E E E +E
Sbjct: 48  ELDSLRKEFNKLNKEVAKLK-IAKEDATELIAETKELKKEITEKEAEVQE 96


>gnl|CDD|216194 pfam00922, Phosphoprotein, Vesiculovirus phosphoprotein. 
          Length = 283

 Score = 29.5 bits (66), Expect = 1.3
 Identities = 16/80 (20%), Positives = 34/80 (42%), Gaps = 4/80 (5%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQK 231
           EE++ +     +E+  EE         EE  + E   E  IE ++G   +    E V+  
Sbjct: 24  EEQRADKSSTFQEDSVEEHTSPSYYLAEELSDSE--TEPSIEDDQGLYTQLPPAEQVEGF 81

Query: 232 KNEEEEE--EEEVLIKLEEE 249
                ++  ++++ +  EE+
Sbjct: 82  IQGPLDDIADDDIDVVFEED 101


>gnl|CDD|218598 pfam05470, eIF-3c_N, Eukaryotic translation initiation factor 3
           subunit 8 N-terminus.  The largest of the mammalian
           translation initiation factors, eIF3, consists of at
           least eight subunits ranging in mass from 35 to 170 kDa.
           eIF3 binds to the 40 S ribosome in an early step of
           translation initiation and promotes the binding of
           methionyl-tRNAi and mRNA.
          Length = 593

 Score = 29.8 bits (67), Expect = 1.4
 Identities = 18/58 (31%), Positives = 35/58 (60%)

Query: 194 KKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEE 251
            + +E+ E E+E++EEDE + ++G ++E+E+E+ V   +      E  V    E++EE
Sbjct: 129 TRYREDPESEDEEEEEDEDDDDDGSDDEDEDEDGVGATEEVAASSESGVDRVKEDDEE 186


>gnl|CDD|216447 pfam01346, FKBP_N, Domain amino terminal to FKBP-type
           peptidyl-prolyl isomerase.  This family is only found at
           the amino terminus of pfam00254. This domain is of
           unknown function.
          Length = 124

 Score = 28.7 bits (65), Expect = 1.4
 Identities = 12/44 (27%), Positives = 27/44 (61%)

Query: 168 IAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDE 211
           +AG+E K  ++EI++     +++++ K+  + E++ EE K   E
Sbjct: 58  LAGKELKLTDEEIQKALQAFQKKLQAKQQAKAEKKAEENKAAGE 101


>gnl|CDD|184860 PRK14858, tatA, twin arginine translocase protein A; Provisional.
          Length = 108

 Score = 28.3 bits (63), Expect = 1.4
 Identities = 13/63 (20%), Positives = 26/63 (41%)

Query: 164 FWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEE 223
           F +++  E    EE E  E+  E ++E +  + K EE++  + K   E          + 
Sbjct: 46  FKQSMQEESRTAEEKEKAEKLAETKKEAEAPEAKAEEDQAPKPKGAGEPPATVASKAGDG 105

Query: 224 EEE 226
            + 
Sbjct: 106 AKA 108


>gnl|CDD|148491 pfam06898, YqfD, Putative stage IV sporulation protein YqfD.  This
           family consists of several putative bacterial stage IV
           sporulation (SpoIV) proteins. YqfD of Bacillus subtilis
           is known to be essential for efficient sporulation
           although its exact function is unknown.
          Length = 383

 Score = 29.6 bits (67), Expect = 1.4
 Identities = 14/51 (27%), Positives = 29/51 (56%)

Query: 180 IEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           I  E   E ++   K  KEE  E+ +K  + ++EKE  ++ + ++E+V+ +
Sbjct: 313 IVRETYYEVQDKVVKLTKEEAVEKGKKLAKKKLEKEIDKDAKIKDEKVLHE 363


>gnl|CDD|226138 COG3611, DnaB, Replication initiation/membrane attachment protein
           [DNA replication, recombination, and repair].
          Length = 417

 Score = 29.7 bits (67), Expect = 1.4
 Identities = 17/82 (20%), Positives = 37/82 (45%), Gaps = 2/82 (2%)

Query: 161 IFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENE 220
           I K   + A ++ K   + I+E    +E  + +K++   + +E   K  +   K+    E
Sbjct: 335 ILKIANDWARKKVKTAYEAIKENRKRKELALAKKQSSTTKRKENLPKWSNPEYKKTTSQE 394

Query: 221 EEEEEEVVKQ--KKNEEEEEEE 240
           E+ + E +K    K  E  +++
Sbjct: 395 EQAKLEFIKLEELKRLENGKQK 416


>gnl|CDD|220184 pfam09332, Mcm10, Mcm10 replication factor.  Mcm10 is a eukaryotic
           DNA replication factor that regulates the stability and
           chromatin association of DNA polymerase alpha.
          Length = 346

 Score = 29.9 bits (67), Expect = 1.4
 Identities = 20/88 (22%), Positives = 37/88 (42%), Gaps = 3/88 (3%)

Query: 167 NIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGE---NEEEE 223
           N    +  +       +E  E+     K+ K  E+EE   K+   +E EE +     +  
Sbjct: 155 NSIKRKRSESGKRRAIDERVEKNASSAKRQKSPEDEEPAMKKRRYLESEEFQKILAAKSS 214

Query: 224 EEEVVKQKKNEEEEEEEEVLIKLEEEEE 251
              ++++ + E +EE    L K E+ EE
Sbjct: 215 HTGLLEEAEREAQEEYFNPLEKKEQMEE 242


>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
           and conversion].
          Length = 161

 Score = 29.2 bits (66), Expect = 1.5
 Identities = 23/61 (37%), Positives = 34/61 (55%), Gaps = 2/61 (3%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKE--EDEIEKEEGENEEEEEEEVV 228
           EE +++  EI E+  +E E++ ++   E EEE E  KE  E EIE E+    EE   EV 
Sbjct: 68  EEAREQASEIIEQAKKEAEQIAEEIKAEAEEELERIKEAAEAEIEAEKERALEELRAEVA 127

Query: 229 K 229
           +
Sbjct: 128 E 128


>gnl|CDD|233137 TIGR00811, sit, silicon transporter.  Marine diatoms such as
           Cylindrotheca fusiformis encode at least six silicon
           transport protein homologues which exhibit similar size
           and topology. One characterized member of the family
           (Sit1) functions in the energy-dependent uptake of
           either Silicic acid [Si(OH)4] or Silicate [Si(OH)3O-] by
           a Na+ symport mechanism. The system is found in marine
           diatoms which make their "glass houses" out of silicon
           [Transport and binding proteins, Other].
          Length = 545

 Score = 29.9 bits (67), Expect = 1.5
 Identities = 22/71 (30%), Positives = 30/71 (42%), Gaps = 6/71 (8%)

Query: 151 LHSAPSYSPEIFKFWENIAGEEEKKEEDEIEEEENEEEE----EVK--QKKNKEEEEEEE 204
           L +  ++S    +  +NI    E       +EE+   E     EVK   K N EEEEE  
Sbjct: 474 LSALETFSVRRERMLKNIRELREMINNAISDEEKTTFEAALAIEVKALDKLNAEEEEEAT 533

Query: 205 EKKEEDEIEKE 215
            KKE  + E  
Sbjct: 534 NKKEGKDTEAS 544


>gnl|CDD|172948 PRK14473, PRK14473, F0F1 ATP synthase subunit B; Provisional.
          Length = 164

 Score = 29.1 bits (65), Expect = 1.5
 Identities = 19/72 (26%), Positives = 37/72 (51%), Gaps = 4/72 (5%)

Query: 182 EEENEEEEEVKQK----KNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEE 237
           EE   + E+V+++    K   E E  + ++E  +I  +  E    +E E++ Q + E E+
Sbjct: 45  EESLRDAEKVREQLANAKRDYEAELAKARQEAAKIVAQAQERARAQEAEIIAQARREAEK 104

Query: 238 EEEEVLIKLEEE 249
            +EE   + E+E
Sbjct: 105 IKEEARAQAEQE 116


>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
           [General function prediction only].
          Length = 239

 Score = 29.3 bits (66), Expect = 1.5
 Identities = 22/83 (26%), Positives = 45/83 (54%), Gaps = 2/83 (2%)

Query: 167 NIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEE 226
           NI  +  K+  + +E+E  E  EE+++ + + E+ +E  ++ E  + + E   EEE  E 
Sbjct: 95  NIEIQIAKERINSLEDELAELMEEIEKLEKEIEDLKERLERLEKNLAEAEARLEEEVAE- 153

Query: 227 VVKQKKNEEEEEEEEVLIKLEEE 249
            ++++  E   + EE+  KL+ E
Sbjct: 154 -IREEGQELSSKREELKEKLDPE 175


>gnl|CDD|221775 pfam12794, MscS_TM, Mechanosensitive ion channel inner membrane
           domain 1.  The small mechanosensitive channel, MscS, is
           a part of the turgor-driven solute efflux system that
           protects bacteria from lysis in the event of osmotic
           shock. The MscS protein alone is sufficient to form a
           functional mechanosensitive channel gated directly by
           tension in the lipid bilayer. The MscS proteins are
           heptamers of three transmembrane subunits with seven
           converging M3 domains, and this domain is one of the
           inner membrane domains.
          Length = 339

 Score = 29.5 bits (67), Expect = 1.5
 Identities = 11/31 (35%), Positives = 16/31 (51%)

Query: 197 KEEEEEEEEKKEEDEIEKEEGENEEEEEEEV 227
           K +  E   ++ E+E E  EG  E  EE E+
Sbjct: 264 KAKRAEILAQRAEEEEESSEGAAETIEEPEL 294


>gnl|CDD|129416 TIGR00316, cdhC, CO dehydrogenase/CO-methylating acetyl-CoA
           synthase complex, beta subunit.  Nomenclature follows
           the description for Methanosarcina thermophila. The
           CO-methylating acetyl-CoA synthase is considered the
           defining enzyme of the Wood-Ljungdahl pathway, used for
           acetate catabolism by sulfate reducing bacteria but for
           acetate biosynthesis by acetogenic bacteria such as
           oorella thermoacetica (f. Clostridium thermoaceticum)
           [Energy metabolism, Chemoautotrophy].
          Length = 458

 Score = 29.8 bits (67), Expect = 1.5
 Identities = 28/91 (30%), Positives = 46/91 (50%), Gaps = 7/91 (7%)

Query: 136 KFLKPEG----VMYPYKCILHSAPSYSPEIFKFWENIAGEEEKKEEDEIEEEENEEEEEV 191
           KFL+ +G    V++  K +        PE     + IA EE+ K  DE+ +   E+   V
Sbjct: 338 KFLQADGGWERVVWMPKELKERVKDAIPE--DLRDKIATEEDAKTTDELRKFLKEKGHPV 395

Query: 192 KQKKNKEEEEEE-EEKKEEDEIEKEEGENEE 221
            ++  +E +EEE EE++E  + E+ E E  E
Sbjct: 396 VKRVVREVDEEEIEEEEEAMQPEEMEMEGFE 426


>gnl|CDD|130214 TIGR01144, ATP_synt_b, ATP synthase, F0 subunit b.  This model
           describes the F1/F0 ATP synthase b subunit in bacteria
           only. Scoring just below the trusted cutoff are the
           N-terminal domains of Mycobacterial b/delta fusion
           proteins and a subunit from an archaeon, Methanosarcina
           barkeri, in which the ATP synthase homolog differs in
           architecture and is not experimentally confirmed. This
           model helps resolve b from the related b' subunit.
           Within the family is an example from a
           sodium-translocating rather than proton-translocating
           ATP synthase [Energy metabolism, ATP-proton motive force
           interconversion].
          Length = 147

 Score = 28.9 bits (65), Expect = 1.5
 Identities = 21/86 (24%), Positives = 40/86 (46%), Gaps = 2/86 (2%)

Query: 166 ENIAGEEEKKEEDEIEEE--ENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEE 223
           + IA      E  + E    + + +  +K+ K++ +E  E   K   EI +E      EE
Sbjct: 29  KKIADGLASAERAKKEAALAQKKAQVILKEAKDEAQEIIENANKRGSEILEEAKAEAREE 88

Query: 224 EEEVVKQKKNEEEEEEEEVLIKLEEE 249
            E++  Q + E E E+E+   +L ++
Sbjct: 89  REKIKAQARAEIEAEKEQAREELRKQ 114


>gnl|CDD|216652 pfam01698, FLO_LFY, Floricaula / Leafy protein.  This family
           consists of various plant development proteins which are
           homologues of floricaula (FLO) and Leafy (LFY) proteins
           which are floral meristem identity proteins. Mutations
           in the sequences of these proteins affect flower and
           leaf development.
          Length = 382

 Score = 29.6 bits (67), Expect = 1.6
 Identities = 18/81 (22%), Positives = 34/81 (41%), Gaps = 19/81 (23%)

Query: 165 WENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEE 224
           W  +       + ++       +++  KQ++ + +E  E++  +EDE +  EG  E  EE
Sbjct: 169 WGLVGVPGHSSDSEK-------KKQRKKQRRKRSKELREDDDDDEDEDDDGEGGGEGGEE 221

Query: 225 E------------EVVKQKKN 233
                        EV + KKN
Sbjct: 222 RQREHPFIVTEPGEVARGKKN 242


>gnl|CDD|221283 pfam11875, DUF3395, Domain of unknown function (DUF3395).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 147 to 176 amino acids in length. This domain is
           found associated with pfam00226.
          Length = 144

 Score = 28.8 bits (65), Expect = 1.6
 Identities = 16/48 (33%), Positives = 28/48 (58%), Gaps = 2/48 (4%)

Query: 192 KQKKNKEEEEEEEEKKEEDEI--EKEEGENEEEEEEEVVKQKKNEEEE 237
             K  K++E+E + +K + EI  +K+E E+      +VV++K   EEE
Sbjct: 2   FNKAQKKKEQERQREKNKTEIAKKKQEAESAVSLMGDVVERKLTREEE 49


>gnl|CDD|235396 PRK05299, rpsB, 30S ribosomal protein S2; Provisional.
          Length = 258

 Score = 29.4 bits (67), Expect = 1.7
 Identities = 14/37 (37%), Positives = 22/37 (59%)

Query: 168 IAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEE 204
           + G + +  E   EEEE  EEEE ++++ + EE E E
Sbjct: 222 LEGRQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258



 Score = 28.2 bits (64), Expect = 4.1
 Identities = 14/36 (38%), Positives = 20/36 (55%)

Query: 205 EKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEE 240
           E ++    E  E E EE EEEE  ++++  EE E E
Sbjct: 223 EGRQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258



 Score = 28.2 bits (64), Expect = 4.2
 Identities = 13/37 (35%), Positives = 23/37 (62%)

Query: 180 IEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEE 216
           +E  +    E  ++++ + EEEEEEE++EE E  + E
Sbjct: 222 LEGRQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258



 Score = 27.8 bits (63), Expect = 5.2
 Identities = 15/33 (45%), Positives = 21/33 (63%)

Query: 192 KQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEE 224
           + +  +  EEEEEE +EE+E E+EE   E E E
Sbjct: 226 QGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258



 Score = 27.8 bits (63), Expect = 5.4
 Identities = 14/36 (38%), Positives = 20/36 (55%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEE 201
           E   G   +  E+E EE E EEEEE +++  + E E
Sbjct: 223 EGRQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258



 Score = 27.4 bits (62), Expect = 5.9
 Identities = 15/39 (38%), Positives = 22/39 (56%)

Query: 178 DEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEE 216
           D I E       E  +++ +E EEEEEE++EE+  E E 
Sbjct: 219 DAILEGRQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEA 257



 Score = 27.1 bits (61), Expect = 10.0
 Identities = 14/36 (38%), Positives = 22/36 (61%)

Query: 183 EENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGE 218
           E  +       ++ +EE EEEEE++EE+E E+ E E
Sbjct: 223 EGRQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258


>gnl|CDD|181442 PRK08476, PRK08476, F0F1 ATP synthase subunit B'; Validated.
          Length = 141

 Score = 28.5 bits (64), Expect = 1.7
 Identities = 15/77 (19%), Positives = 34/77 (44%), Gaps = 3/77 (3%)

Query: 178 DEIEEEENEEEEEVKQKKNKEEEEEE---EEKKEEDEIEKEEGENEEEEEEEVVKQKKNE 234
             I+ +  + +         E E E      ++E ++I ++     +EE E+ ++ KK E
Sbjct: 41  ASIKNDLEKVKTNSSDVSEIEHEIETILKNAREEANKIRQKAIAKAKEEAEKKIEAKKAE 100

Query: 235 EEEEEEEVLIKLEEEEE 251
            E + E    +L  +++
Sbjct: 101 LESKYEAFAKQLANQKQ 117


>gnl|CDD|226413 COG3897, COG3897, Predicted methyltransferase [General function
          prediction only].
          Length = 218

 Score = 29.3 bits (66), Expect = 1.7
 Identities = 11/21 (52%), Positives = 14/21 (66%)

Query: 41 ICDNPNIFAGKTVLDVGTGTG 61
          I D+P    GK VLD+G G+G
Sbjct: 71 IDDHPETVRGKRVLDLGAGSG 91


>gnl|CDD|237028 PRK12267, PRK12267, methionyl-tRNA synthetase; Reviewed.
          Length = 648

 Score = 29.8 bits (68), Expect = 1.7
 Identities = 12/33 (36%), Positives = 18/33 (54%)

Query: 181 EEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIE 213
            EEE    +E  +    +E EE+E+K E+ EI 
Sbjct: 512 VEEEIAYIKEQMEGSAPKEPEEKEKKPEKPEIT 544



 Score = 29.4 bits (67), Expect = 1.8
 Identities = 10/36 (27%), Positives = 18/36 (50%)

Query: 178 DEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIE 213
           D  EE    +E+       + EE+E++ +K E  I+
Sbjct: 511 DVEEEIAYIKEQMEGSAPKEPEEKEKKPEKPEITID 546



 Score = 29.0 bits (66), Expect = 2.4
 Identities = 11/63 (17%), Positives = 26/63 (41%), Gaps = 5/63 (7%)

Query: 159 PEIFKFWENIAGEEEKKEEDEIEEEEN-----EEEEEVKQKKNKEEEEEEEEKKEEDEIE 213
            E    WE++          ++ + E      + EEE+   K + E    +E +E+++  
Sbjct: 479 EEELTSWESLLEWGGLPAGTKVAKGEPLFPRIDVEEEIAYIKEQMEGSAPKEPEEKEKKP 538

Query: 214 KEE 216
           ++ 
Sbjct: 539 EKP 541



 Score = 27.8 bits (63), Expect = 6.7
 Identities = 7/32 (21%), Positives = 17/32 (53%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEE 203
           + ++E   I+E+      +  ++K K+ E+ E
Sbjct: 511 DVEEEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542


>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 438

 Score = 29.4 bits (66), Expect = 1.7
 Identities = 20/79 (25%), Positives = 36/79 (45%)

Query: 174 KKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKN 233
           +  E    E++ E E   K+     +E +++ +   + IEK+   N+ EE  +   +KK 
Sbjct: 103 EALELLNLEKDKELELLEKELDELSKELQKQLQNTAEIIEKKRENNKNEERLKFENEKKL 162

Query: 234 EEEEEEEEVLIKLEEEEEN 252
           EE  E E    + +  E N
Sbjct: 163 EESLELEREKFEEQLHEAN 181



 Score = 28.2 bits (63), Expect = 4.1
 Identities = 19/84 (22%), Positives = 43/84 (51%)

Query: 167 NIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEE 226
                E+ K+ +   ++E  + E ++    ++++E E  +KE DE+ KE  +  +   E 
Sbjct: 81  KNLFNEQIKQFELALQDEIAKLEALELLNLEKDKELELLEKELDELSKELQKQLQNTAEI 140

Query: 227 VVKQKKNEEEEEEEEVLIKLEEEE 250
           + K+++N + EE  +   + + EE
Sbjct: 141 IEKKRENNKNEERLKFENEKKLEE 164



 Score = 28.2 bits (63), Expect = 4.8
 Identities = 21/81 (25%), Positives = 41/81 (50%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           E  K+ + +++      E++ +  KN+E  + E EKK E+ +E E  + EE+  E  +  
Sbjct: 125 ELSKELQKQLQNTAEIIEKKRENNKNEERLKFENEKKLEESLELEREKFEEQLHEANLDL 184

Query: 231 KKNEEEEEEEEVLIKLEEEEE 251
           +  E EE+ E     L++ + 
Sbjct: 185 EFKENEEQRESKWAILKKLKR 205



 Score = 27.9 bits (62), Expect = 6.2
 Identities = 31/95 (32%), Positives = 47/95 (49%), Gaps = 3/95 (3%)

Query: 160 EIFKFWENIAGEEEKKEEDEIEEEENEE--EEEVKQKKNK-EEEEEEEEKKEEDEIEKEE 216
           EI K         EK +E E+ E+E +E  +E  KQ +N  E  E++ E  + +E  K E
Sbjct: 98  EIAKLEALELLNLEKDKELELLEKELDELSKELQKQLQNTAEIIEKKRENNKNEERLKFE 157

Query: 217 GENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEE 251
            E + EE  E+ ++K  E+  E    L   E EE+
Sbjct: 158 NEKKLEESLELEREKFEEQLHEANLDLEFKENEEQ 192


>gnl|CDD|220112 pfam09110, HAND, HAND.  The HAND domain adopts a secondary
           structure consisting of four alpha helices, three of
           which (H2, H3, H4) form an L-like configuration. Helix
           H2 runs antiparallel to helices H3 and H4, packing
           closely against helix H4, whilst helix H1 reposes in the
           concave surface formed by these three helices and runs
           perpendicular to them. The domain confers DNA and
           nucleosome binding properties to the protein.
          Length = 109

 Score = 28.0 bits (63), Expect = 1.7
 Identities = 16/45 (35%), Positives = 26/45 (57%)

Query: 176 EEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENE 220
           ++    +EE+EEE E K+K  +E+ +  E   EE+E EK+    E
Sbjct: 64  DDVGDGDEEDEEEREAKRKLEQEKIDNAEPLTEEEEEEKQRLLEE 108



 Score = 27.6 bits (62), Expect = 2.6
 Identities = 17/40 (42%), Positives = 28/40 (70%), Gaps = 2/40 (5%)

Query: 198 EEEEEEEEKKEEDEIEKEEGENEEE--EEEEVVKQKKNEE 235
            +EE+EEE++ + ++E+E+ +N E   EEEE  KQ+  EE
Sbjct: 69  GDEEDEEEREAKRKLEQEKIDNAEPLTEEEEEEKQRLLEE 108


>gnl|CDD|115196 pfam06524, NOA36, NOA36 protein.  This family consists of several
           NOA36 proteins which contain 29 highly conserved
           cysteine residues. The function of this protein is
           unknown.
          Length = 314

 Score = 29.5 bits (66), Expect = 1.7
 Identities = 7/64 (10%), Positives = 27/64 (42%), Gaps = 6/64 (9%)

Query: 163 KFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEE 222
           ++ +N++  +    +      E++++E           + +EE+  +D+  ++  +    
Sbjct: 251 RYKQNLSSNKYGDFKGVNYGYESDDDEG------SSSNDYDEEEDGDDDDNEDNDDTNTN 304

Query: 223 EEEE 226
             +E
Sbjct: 305 HGDE 308


>gnl|CDD|206063 pfam13892, DBINO, DNA-binding domain.  DBINO is a DNA-binding
           domain found on global transcription activator SNF2L1
           proteins and chromatin re-modelling proteins.
          Length = 140

 Score = 28.8 bits (65), Expect = 1.7
 Identities = 12/27 (44%), Positives = 18/27 (66%)

Query: 187 EEEEVKQKKNKEEEEEEEEKKEEDEIE 213
           E+EE + +K  E+E  E+ KKEE+  E
Sbjct: 82  EKEERELRKRAEKEALEQAKKEEELRE 108



 Score = 27.2 bits (61), Expect = 4.4
 Identities = 13/30 (43%), Positives = 18/30 (60%)

Query: 212 IEKEEGENEEEEEEEVVKQKKNEEEEEEEE 241
            EKEE E  +  E+E ++Q K EEE  E +
Sbjct: 81  NEKEERELRKRAEKEALEQAKKEEELREAK 110



 Score = 26.5 bits (59), Expect = 8.3
 Identities = 16/46 (34%), Positives = 25/46 (54%), Gaps = 8/46 (17%)

Query: 160 EIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEE 205
           E+  FW+       K E++E E  +  E+E ++Q K KEEE  E +
Sbjct: 73  EMLLFWK-------KNEKEERELRKRAEKEALEQAK-KEEELREAK 110


>gnl|CDD|234141 TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD.  This
           model describes the DndB protein encoded by an operon
           associated with a sulfur-containing modification to DNA.
           The operon is sporadically distributed in bacteria, much
           like some restriction enzyme operons. DndD is described
           as a putative ATPase. The small number of examples known
           so far include species from among the Firmicutes,
           Actinomycetes, Proteobacteria, and Cyanobacteria [DNA
           metabolism, Restriction/modification].
          Length = 650

 Score = 29.7 bits (67), Expect = 1.8
 Identities = 19/86 (22%), Positives = 32/86 (37%), Gaps = 8/86 (9%)

Query: 171 EEEKKEEDEIEEEENEEEEEV--------KQKKNKEEEEEEEEKKEEDEIEKEEGENEEE 222
           E       EIE  E E +E+         +    + E EE +   E  E +      +  
Sbjct: 202 ELPSSILSEIEALEAELKEQSEKYEDLAQEIAHLRNELEEAQRSLESLEKKFRSEGGDLF 261

Query: 223 EEEEVVKQKKNEEEEEEEEVLIKLEE 248
           EE E ++++  E E   +    +L E
Sbjct: 262 EEREQLERQLKEIEAARKANRAQLRE 287


>gnl|CDD|184885 PRK14891, PRK14891, 50S ribosomal protein L24e/unknown domain
           fusion protein; Provisional.
          Length = 131

 Score = 28.4 bits (63), Expect = 1.9
 Identities = 17/73 (23%), Positives = 31/73 (42%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
           A +          E   E E     +   E  E +   + ++E E +E  +E  +E +  
Sbjct: 58  AEKGPAAAATAAAEAAEEAEAADADEDADEAAEADAADEADEEEETDEAVDETADEADAE 117

Query: 229 KQKKNEEEEEEEE 241
            ++ +EEE+EE E
Sbjct: 118 AEEADEEEDEEAE 130



 Score = 26.5 bits (58), Expect = 7.9
 Identities = 17/74 (22%), Positives = 31/74 (41%)

Query: 178 DEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEE 237
           +    E+          +  EE E  +  ++ DE  + +  +E +EEEE  +      +E
Sbjct: 54  EAGRAEKGPAAAATAAAEAAEEAEAADADEDADEAAEADAADEADEEEETDEAVDETADE 113

Query: 238 EEEEVLIKLEEEEE 251
            + E     EEE+E
Sbjct: 114 ADAEAEEADEEEDE 127


>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
           component YidC; Validated.
          Length = 429

 Score = 29.4 bits (66), Expect = 2.0
 Identities = 13/72 (18%), Positives = 24/72 (33%), Gaps = 1/72 (1%)

Query: 170 GEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVK 229
             E KK     + E    ++E+ QK+   E E   E ++E                +  K
Sbjct: 329 NAEIKKTRTAEKNEAKARKKEIAQKRRAAEREINREARQE-RAAAMARARARRAAVKAKK 387

Query: 230 QKKNEEEEEEEE 241
           +   +    E+ 
Sbjct: 388 KGLIDASPNEDT 399



 Score = 27.1 bits (60), Expect = 9.6
 Identities = 9/52 (17%), Positives = 20/52 (38%)

Query: 190 EVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEE 241
           E+  +  + ++    EK E    +KE  +     E E+ ++ + E       
Sbjct: 324 ELHAENAEIKKTRTAEKNEAKARKKEIAQKRRAAEREINREARQERAAAMAR 375


>gnl|CDD|221952 pfam13166, AAA_13, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           conjugative transfer proteins. This family includes the
           PrrC protein that is thought to be the active component
           of the anticodon nuclease.
          Length = 713

 Score = 29.6 bits (67), Expect = 2.0
 Identities = 15/63 (23%), Positives = 36/63 (57%), Gaps = 3/63 (4%)

Query: 170 GEEEKKEEDEIEEEE---NEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEE 226
           GEE  + E +IEE +    + EE+++Q + + E++EEE +K +++   +  +   ++ + 
Sbjct: 87  GEENIEIEAQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKYDS 146

Query: 227 VVK 229
            + 
Sbjct: 147 NLS 149


>gnl|CDD|220377 pfam09747, DUF2052, Coiled-coil domain containing protein
           (DUF2052).  This entry is of sequences of two conserved
           domains separated by a region of low complexity,
           spanning some 200 residues. The function is unknown.
          Length = 178

 Score = 28.5 bits (64), Expect = 2.0
 Identities = 28/67 (41%), Positives = 38/67 (56%), Gaps = 4/67 (5%)

Query: 175 KEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNE 234
           +EE  +E  E +EEEE  +    EEEE+ EE+ EE+E E + G   EEEEE+     +  
Sbjct: 66  REEARLELLEQQEEEESAE----EEEEDPEEENEEEEEEYQRGPFGEEEEEDGDSYDELP 121

Query: 235 EEEEEEE 241
             EE EE
Sbjct: 122 TPEEREE 128



 Score = 27.8 bits (62), Expect = 3.7
 Identities = 18/71 (25%), Positives = 33/71 (46%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           EEE++     +         ++   ++EE   E  +++E+E   EE E + EEE E  ++
Sbjct: 40  EEEREARGRAKRWSEGLSGVLESSLDREEARLELLEQQEEEESAEEEEEDPEEENEEEEE 99

Query: 231 KKNEEEEEEEE 241
           +       EEE
Sbjct: 100 EYQRGPFGEEE 110



 Score = 27.8 bits (62), Expect = 4.7
 Identities = 26/61 (42%), Positives = 34/61 (55%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           EE  +EE+E  EEENEEEEE  Q+    EEEEE+    ++    EE E   EE    + Q
Sbjct: 80  EESAEEEEEDPEEENEEEEEEYQRGPFGEEEEEDGDSYDELPTPEEREELREEFISEMHQ 139

Query: 231 K 231
           +
Sbjct: 140 R 140


>gnl|CDD|113290 pfam04514, BTV_NS2, Bluetongue virus non-structural protein NS2.
           This family includes NS2 proteins from other members of
           the Orbivirus genus. NS2 is a non-specific
           single-stranded RNA-binding protein that forms large
           homomultimers and accumulates in viral inclusion bodies
           of infected cells. Three RNA binding regions have been
           identified in Bluetongue virus serotype 17 at residues
           2-11, 153-166 and 274-286. NS2 multimers also possess
           nucleotidyl phosphatase activity. The precise function
           of NS2 is not known, but it may be involved in the
           transport and condensation of viral mRNAs.
          Length = 363

 Score = 29.1 bits (65), Expect = 2.1
 Identities = 22/77 (28%), Positives = 33/77 (42%), Gaps = 8/77 (10%)

Query: 154 APSYSPEIFKFWENI----AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEE 209
           AP  +PE  K  +      A E    E  E+     EE     +K+ + EE  E +  +E
Sbjct: 200 APDMTPETSKQDQKEERRAAVERRLAELVEMINWNLEERRRDLRKEQELEENVERDSDDE 259

Query: 210 DEIEKEEGENEEEEEEE 226
           DE     GE+ E+ E +
Sbjct: 260 DE----HGEDSEDGETK 272



 Score = 27.9 bits (62), Expect = 5.4
 Identities = 13/55 (23%), Positives = 25/55 (45%)

Query: 186 EEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEE 240
           +++EE +    +   E  E      E  + +   E+E EE V +   +E+E  E+
Sbjct: 211 DQKEERRAAVERRLAELVEMINWNLEERRRDLRKEQELEENVERDSDDEDEHGED 265


>gnl|CDD|237631 PRK14162, PRK14162, heat shock protein GrpE; Provisional.
          Length = 194

 Score = 28.6 bits (64), Expect = 2.1
 Identities = 14/45 (31%), Positives = 28/45 (62%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKE 215
           +EE   E ++ +EE  +E   K+ K   +EE++E++   +++EKE
Sbjct: 3   KEEFPSEKDLPQEETTDEAPKKEAKEAPKEEDQEKQNPVEDLEKE 47


>gnl|CDD|218899 pfam06102, DUF947, Domain of unknown function (DUF947).  Family of
           eukaryotic proteins with unknown function.
          Length = 168

 Score = 28.4 bits (64), Expect = 2.1
 Identities = 17/60 (28%), Positives = 34/60 (56%)

Query: 178 DEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEE 237
           D+  E+E EE E+  +K    EE+EE ++  +    + +    ++ E E++K+ K +E+E
Sbjct: 58  DDYREKEIEELEKALKKTKDSEEKEELKRTLQSMKSRLKTLKNKDREREILKEHKKQEKE 117


>gnl|CDD|223599 COG0525, ValS, Valyl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 877

 Score = 29.1 bits (66), Expect = 2.2
 Identities = 23/99 (23%), Positives = 37/99 (37%), Gaps = 13/99 (13%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENE----- 220
           E +       EE  +E        E+        +   E  + E E+EK E E +     
Sbjct: 777 EELEILSPDPEEPPVEVTAVVGGAELFLPLAGLIDLAAELARLEKELEKLEKEIDRIEKK 836

Query: 221 -------EEEEEEVVKQKKNEEEEEEEEVLIKLEEEEEN 252
                   +  EEVV+++K +  E + + L KLEE    
Sbjct: 837 LSNEGFVAKAPEEVVEKEKEKLAEYQVK-LAKLEERLAV 874


>gnl|CDD|216269 pfam01056, Myc_N, Myc amino-terminal region.  The myc family
           belongs to the basic helix-loop-helix leucine zipper
           class of transcription factors, see pfam00010. Myc forms
           a heterodimer with Max, and this complex regulates cell
           growth through direct activation of genes involved in
           cell replication. Mutations in the C-terminal 20
           residues of this domain cause unique changes in the
           induction of apoptosis, transformation, and G2 arrest.
          Length = 329

 Score = 29.1 bits (65), Expect = 2.3
 Identities = 14/39 (35%), Positives = 19/39 (48%)

Query: 204 EEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEV 242
               E +E E+EE E EEEEE +VV  +K       +  
Sbjct: 223 GSDSESEEDEEEEEEEEEEEEIDVVTVEKRRSSSNRKAS 261



 Score = 29.1 bits (65), Expect = 2.3
 Identities = 14/22 (63%), Positives = 17/22 (77%)

Query: 170 GEEEKKEEDEIEEEENEEEEEV 191
           G + + EEDE EEEE EEEEE+
Sbjct: 223 GSDSESEEDEEEEEEEEEEEEI 244


>gnl|CDD|240331 PTZ00254, PTZ00254, 40S ribosomal protein SA; Provisional.
          Length = 249

 Score = 28.9 bits (65), Expect = 2.3
 Identities = 11/36 (30%), Positives = 16/36 (44%)

Query: 207 KEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEV 242
           ++ +E E++E    E    E       E EE EE V
Sbjct: 205 RDPEEAEEKEEAAAETAGVEDAAAAAAEAEEGEEWV 240



 Score = 28.9 bits (65), Expect = 2.3
 Identities = 12/45 (26%), Positives = 18/45 (40%)

Query: 175 KEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGEN 219
           ++ +E EE+E    E    +       E EE +E       E EN
Sbjct: 205 RDPEEAEEKEEAAAETAGVEDAAAAAAEAEEGEEWVAAANNEWEN 249


>gnl|CDD|227382 COG5049, XRN1, 5'-3' exonuclease [DNA replication, recombination,
           and repair / Cell division and chromosome partitioning /
           Translation].
          Length = 953

 Score = 29.1 bits (65), Expect = 2.4
 Identities = 15/56 (26%), Positives = 26/56 (46%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEE 221
            +IA +   ++E+E E E +   +   +  N   EEEEE + E+    +  G  E 
Sbjct: 486 MDIASDSPDEDEEEFESEVDSIRKIPDKYVNIIVEEEEENETEKTVNLRFPGWKER 541


>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM).  This
           family consists of several Plasmodium falciparum SPAM
           (secreted polymorphic antigen associated with
           merozoites) proteins. Variation among SPAM alleles is
           the result of deletions and amino acid substitutions in
           non-repetitive sequences within and flanking the alanine
           heptad-repeat domain. Heptad repeats in which the a and
           d position contain hydrophobic residues generate
           amphipathic alpha-helices which give rise to helical
           bundles or coiled-coil structures in proteins. SPAM is
           an example of a P. falciparum antigen in which a
           repetitive sequence has features characteristic of a
           well-defined structural element.
          Length = 164

 Score = 28.3 bits (63), Expect = 2.4
 Identities = 21/68 (30%), Positives = 46/68 (67%)

Query: 177 EDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEE 236
           +++ EEEE E+EEE+++ ++ E+EEE  E +EE+E ++E+  + ++ E++ +    N  +
Sbjct: 48  QEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDLKDIEKKNINDIFNSTQ 107

Query: 237 EEEEEVLI 244
           ++  + LI
Sbjct: 108 DDNAQNLI 115



 Score = 28.3 bits (63), Expect = 2.8
 Identities = 27/69 (39%), Positives = 50/69 (72%), Gaps = 3/69 (4%)

Query: 182 EEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEE 241
           ++E+  +E    K  K+E++EEEE+++E+EIE+ E   + E+EEE+V+ ++ EEE+EE+ 
Sbjct: 32  DKEDIIKENEDVKDEKQEDDEEEEEEDEEEIEEPE---DIEDEEEIVEDEEEEEEDEEDN 88

Query: 242 VLIKLEEEE 250
           V +K  E++
Sbjct: 89  VDLKDIEKK 97


>gnl|CDD|184899 PRK14906, PRK14906, DNA-directed RNA polymerase subunit beta'/alpha
           domain fusion protein; Provisional.
          Length = 1460

 Score = 29.5 bits (66), Expect = 2.4
 Identities = 19/83 (22%), Positives = 34/83 (40%), Gaps = 13/83 (15%)

Query: 171 EEEKKEEDE----------IEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENE 220
           ++E +EED           +EE + E +  ++  +    +   E+ +  D+I  +E    
Sbjct: 142 DKEAREEDADDLRDELAADLEELDAERDRLIEATRRLSVDYVPEDDEFVDDIGDDERLTA 201

Query: 221 EEEEEEVVKQKKNEEEEEEEEVL 243
           EE   EV       EE  E + L
Sbjct: 202 EEVRAEVADI---YEEYNERKAL 221


>gnl|CDD|216289 pfam01080, Presenilin, Presenilin.  Mutations in presenilin-1 are a
           major cause of early onset Alzheimer's disease. It has
           been found that presenilin-1 binds to beta-catenin
           in-vivo. This family also contains SPE proteins from
           C.elegans.
          Length = 403

 Score = 29.0 bits (65), Expect = 2.5
 Identities = 28/139 (20%), Positives = 44/139 (31%), Gaps = 33/139 (23%)

Query: 138 LKPEGVMYPYKCILHSAPSYSPEIF---------KFWENIAGEEEKKEEDEIEE----EE 184
           L P+G   P + ++ +A   +  IF               + +EE  E          + 
Sbjct: 193 LTPKG---PLRMLVETAQERNEPIFPALIYSSTVVVLTVGSNQEETNEGTPSTIRRTSKS 249

Query: 185 NEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEE--------EEEVVKQ------ 230
                            E  EK    E+  +E ++ E E         EE   +      
Sbjct: 250 TRSAANPDSAPTSHSTLELPEKSSTPELSDDESDSSETESQSDSSLAPEEDAAEQPEVQS 309

Query: 231 ---KKNEEEEEEEEVLIKL 246
                NE+ EEEEE  +KL
Sbjct: 310 NSLPSNEKREEEEERGVKL 328


>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342).  This
           family of bacterial proteins has no known function. The
           proteins are in the region of 500-600 amino acid
           residues in length.
          Length = 450

 Score = 29.1 bits (66), Expect = 2.5
 Identities = 26/73 (35%), Positives = 38/73 (52%), Gaps = 7/73 (9%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKN-------KEEEEEEEEKKEEDEIEKEEGENEEEEE 224
           E K+E  E+EEE  E EEE+++ K        K   +   EK+E+ E   E  E   EE 
Sbjct: 331 ELKEELKELEEELKELEEELEKIKKLLKKLPKKARGQLPPEKREQLEKLLETKEKLSEEL 390

Query: 225 EEVVKQKKNEEEE 237
           EE+ ++ K  +EE
Sbjct: 391 EELEEELKELKEE 403


>gnl|CDD|203444 pfam06424, PRP1_N, PRP1 splicing factor, N-terminal.  This domain
           is specific to the N-terminal part of the prp1 splicing
           factor, which is involved in mRNA splicing (and possibly
           also poly(A)+ RNA nuclear export and cell cycle
           progression). This domain is specific to the N terminus
           of the RNA splicing factor encoded by prp1. It is
           involved in mRNA splicing and possibly also poly(A)and
           RNA nuclear export and cell cycle progression.
          Length = 131

 Score = 28.0 bits (63), Expect = 2.5
 Identities = 16/69 (23%), Positives = 33/69 (47%)

Query: 170 GEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVK 229
           G +   EEDE  +   + + E      K ++E+EE  +  + I++   E  ++  E+  K
Sbjct: 28  GVDIDDEEDEDPKRYQDGDNEGLFSDGKYDDEDEEADRIYESIDERMDERRKKRREQKEK 87

Query: 230 QKKNEEEEE 238
           ++  +  EE
Sbjct: 88  EEIEKYREE 96


>gnl|CDD|217509 pfam03353, Lin-8, Ras-mediated vulval-induction antagonist.  LIN-8
           is a nuclear protein, present at the sites of
           transcriptional repressor complexes, which interacts
           with LIN-35 Rb.Lin35 Rb is a product of the class B
           synMuv gene lin-35 which silences genes required for
           vulval specification through chromatin modification and
           remodelling. The biological role of the interaction has
           not yet been determined however predictions have been
           made. The interaction shows that class A synMuv genes
           control vulval induction through the transcriptional
           regulation of gene expression. LIN-8 normally functions
           as part of a protein complex however when the complex is
           absent, other family members can partially replace LIN-8
           activity.
          Length = 316

 Score = 28.9 bits (65), Expect = 2.6
 Identities = 16/78 (20%), Positives = 37/78 (47%), Gaps = 5/78 (6%)

Query: 174 KKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKN 233
           +KE    E++E ++E+++       +   E+  +E  ++E+ +  +E E EE       N
Sbjct: 127 RKEPWTGEDQEADDEDDIIY-----DGIFEDRTRESQDMEENDEVDEVEVEEVPDDYGAN 181

Query: 234 EEEEEEEEVLIKLEEEEE 251
              +E E+  +   +E +
Sbjct: 182 LAVDEPEQSTMSRPQEVK 199


>gnl|CDD|179699 PRK03992, PRK03992, proteasome-activating nucleotidase;
           Provisional.
          Length = 389

 Score = 29.0 bits (66), Expect = 2.6
 Identities = 15/46 (32%), Positives = 28/46 (60%)

Query: 175 KEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENE 220
           +  + +EE  +E EE+++Q + K  + E E +K E E+E+ + E E
Sbjct: 1   ERLEALEERNSELEEQIRQLELKLRDLEAENEKLERELERLKSELE 46


>gnl|CDD|234776 PRK00478, scpA, segregation and condensation protein A/unknown
           domain fusion protein; Provisional.
          Length = 505

 Score = 29.1 bits (65), Expect = 2.7
 Identities = 22/80 (27%), Positives = 46/80 (57%), Gaps = 5/80 (6%)

Query: 178 DEIEEEENEEEEEVKQ--KKNKEEEEEEEEKKEEDEIEKEEGEN---EEEEEEEVVKQKK 232
            E+ + + E+E+ +K+  K+ +EE      +K E+ ++K+ GE     E+ ++   ++K 
Sbjct: 426 QEMVQRQQEQEQALKESIKQIQEERAISNFQKREEYLKKKYGEYYLSREQYQKLTPEEKI 485

Query: 233 NEEEEEEEEVLIKLEEEEEN 252
           N    + +  LIKL++E+EN
Sbjct: 486 NIRINQRKIDLIKLDDEQEN 505


>gnl|CDD|221581 pfam12446, DUF3682, Protein of unknown function (DUF3682).  This
           domain family is found in eukaryotes, and is typically
           between 125 and 136 amino acids in length.
          Length = 133

 Score = 27.9 bits (62), Expect = 2.7
 Identities = 15/41 (36%), Positives = 22/41 (53%), Gaps = 5/41 (12%)

Query: 170 GEEEKKEEDEIEEEENE-----EEEEVKQKKNKEEEEEEEE 205
           G    ++E+E EEEENE     +E +V+Q +  E   E  E
Sbjct: 90  GTGHTRQEEEEEEEENEKQQQSDEAQVQQHQQHEHPAENGE 130



 Score = 27.9 bits (62), Expect = 3.1
 Identities = 15/46 (32%), Positives = 24/46 (52%), Gaps = 6/46 (13%)

Query: 177 EDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEE 222
               +EEE EEEE  KQ+++ E + ++ ++ E         EN EE
Sbjct: 92  GHTRQEEEEEEEENEKQQQSDEAQVQQHQQHE------HPAENGEE 131



 Score = 26.3 bits (58), Expect = 8.9
 Identities = 11/36 (30%), Positives = 21/36 (58%)

Query: 204 EEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEE 239
              ++E+E E+EE E +++ +E  V+Q +  E   E
Sbjct: 92  GHTRQEEEEEEEENEKQQQSDEAQVQQHQQHEHPAE 127


>gnl|CDD|215094 PLN00188, PLN00188, enhanced disease resistance protein (EDR2);
           Provisional.
          Length = 719

 Score = 29.0 bits (65), Expect = 2.8
 Identities = 11/64 (17%), Positives = 28/64 (43%)

Query: 177 EDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEE 236
           +   + +E+    +     ++     +E+  +++E +  E E E E  +   K    EEE
Sbjct: 416 KKNQKPQESSPSLDQTNAASRNSVMMDEDSDDDEEFQIPESEQEPETTKNETKDTAMEEE 475

Query: 237 EEEE 240
            +++
Sbjct: 476 PQDK 479


>gnl|CDD|181382 PRK08317, PRK08317, hypothetical protein; Provisional.
          Length = 241

 Score = 28.7 bits (65), Expect = 2.8
 Identities = 11/27 (40%), Positives = 13/27 (48%)

Query: 35 EAYRAAICDNPNIFAGKTVLDVGTGTG 61
            YRA   +   +  G  VLDVG G G
Sbjct: 5  RRYRARTFELLAVQPGDRVLDVGCGPG 31


>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain.  This
           family represents the C-terminus (approximately 300
           residues) of proteins that are involved as binding
           partners for Prp19 as part of the nuclear pore complex.
           The family in Drosophila is necessary for pre-mRNA
           splicing, and the human protein has been found in
           purifications of the spliceosome. In the past this
           family was thought, erroneously, to be associated with
           microfibrillin.
          Length = 277

 Score = 28.7 bits (64), Expect = 2.9
 Identities = 25/90 (27%), Positives = 45/90 (50%), Gaps = 10/90 (11%)

Query: 173 EKKEEDEIEEEENEEEEEVKQKKNKEEEE----------EEEEKKEEDEIEKEEGENEEE 222
           E +EEDE  EEE EE EE ++  ++++ E          ++    +E E E  + +  EE
Sbjct: 6   ELEEEDESGEEEEEESEEEEETDSEDDMEPRLKPVFTRKKDRITIQEREREAAKEKALEE 65

Query: 223 EEEEVVKQKKNEEEEEEEEVLIKLEEEEEN 252
           E +   +++K E  +  EE + K  E ++ 
Sbjct: 66  EAKRKAEERKRETLKIVEEEVKKELELKKR 95


>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 926

 Score = 29.1 bits (65), Expect = 2.9
 Identities = 18/115 (15%), Positives = 39/115 (33%), Gaps = 22/115 (19%)

Query: 132 FARDKFLKPEGVMYPYKCILHSAPSYSPEIFKFWENIAGEEEKKEEDEIEEEENEEEEEV 191
           F  ++ L+ E   YP                  +E  +G+  +K    +     +     
Sbjct: 754 FCFNEDLEEEEFEYPRM----------------YEEASGKSVRKVNTAVLSTTIKAAARA 797

Query: 192 KQKKNKEEEEEEEEKKEEDEIEKEEGEN------EEEEEEEVVKQKKNEEEEEEE 240
           KQK  ++   ++E K E   +E E          E+  +   +   K ++  + +
Sbjct: 798 KQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAPAILNVKKKKPYKVD 852



 Score = 28.0 bits (62), Expect = 5.7
 Identities = 10/58 (17%), Positives = 28/58 (48%)

Query: 193 QKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEE 250
           +   + +++ +E+   + EI+ E    E E E   +KQ++ +  +    + +K ++  
Sbjct: 792 KAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAPAILNVKKKKPY 849


>gnl|CDD|222918 PHA02687, PHA02687, ORF061 late transcription factor VLTF-4;
           Provisional.
          Length = 231

 Score = 28.4 bits (63), Expect = 2.9
 Identities = 12/47 (25%), Positives = 27/47 (57%)

Query: 174 KKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENE 220
           KK + +  E+  +  E++    +++++ EE+E  EE +  +E G+ E
Sbjct: 92  KKAKADKPEKSPKAVEKLCPPDDRDDKNEEKEPTEEAQRNEESGDAE 138


>gnl|CDD|237256 PRK12901, secA, preprotein translocase subunit SecA; Reviewed.
          Length = 1112

 Score = 28.8 bits (65), Expect = 3.0
 Identities = 21/104 (20%), Positives = 51/104 (49%), Gaps = 15/104 (14%)

Query: 161 IFKFWENIAGEEEKKEEDEIE---EEENEEEEEVKQKKNKE------------EEEEEEE 205
           + +F + + G++ +++  EI+   E+   E  E++   N E            +E   + 
Sbjct: 1   MNEFLKKLFGDKSERDLKEIQPIVEKIKAEYPELEALSNDELRAKTDEFKQYIKEAVADI 60

Query: 206 KKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEE 249
             + +E++ E  E+ + +E E +  + ++ E+E  E+L K+ +E
Sbjct: 61  DAKIEELKAEAIESLDIDEREDIYAQIDKLEKEAYEILEKVLDE 104


>gnl|CDD|225173 COG2264, PrmA, Ribosomal protein L11 methylase [Translation,
           ribosomal structure and biogenesis].
          Length = 300

 Score = 28.4 bits (64), Expect = 3.1
 Identities = 26/81 (32%), Positives = 32/81 (39%), Gaps = 26/81 (32%)

Query: 50  GKTVLDVGTGTGK-SI--LLQGH----GVDHHSF----------CTKVHPLDHYAPQYLI 92
           GKTVLDVG G+G  +I     G     GVD                 V  L   A  +L+
Sbjct: 163 GKTVLDVGCGSGILAIAAAKLGAKKVVGVDIDPQAVEAARENARLNGVELLVQ-AKGFLL 221

Query: 93  LEVIQNKIENVELPEKVDIIV 113
           LEV +N           D+IV
Sbjct: 222 LEVPEN--------GPFDVIV 234


>gnl|CDD|224143 COG1222, RPT1, ATP-dependent 26S proteasome regulatory subunit
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 406

 Score = 28.8 bits (65), Expect = 3.1
 Identities = 13/57 (22%), Positives = 26/57 (45%), Gaps = 1/57 (1%)

Query: 170 GEEEKKEEDEIEEEENEEEE-EVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
                 E  E +E  N+ E+ ++K  + ++     EE++ E E  + + E +   EE
Sbjct: 8   EILGDLESYEPQEYLNKLEDTKLKLLEKEKRLLLLEEQRLEAEGLRLKREVDRLREE 64


>gnl|CDD|237147 PRK12586, PRK12586, putative monovalent cation/H+ antiporter
           subunit G; Reviewed.
          Length = 145

 Score = 27.8 bits (62), Expect = 3.1
 Identities = 6/44 (13%), Positives = 16/44 (36%), Gaps = 7/44 (15%)

Query: 173 EKKEEDE-----IEEEENEEEEEVKQKKNKEEEEEEEEKKEEDE 211
            K +        +   E    E ++     ++ EE  +K  +++
Sbjct: 104 RKNDAHTHASILLSSNEQNSTEALQL--RAKKREEHRKKWYQND 145


>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
          Length = 191

 Score = 28.2 bits (63), Expect = 3.1
 Identities = 12/52 (23%), Positives = 26/52 (50%)

Query: 174 KKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           KK + + +E+E   E   ++  + EE EE ++K + +  + +    E   E+
Sbjct: 65  KKTKKKKKEKEEVPELAAEELSDSEENEENDKKVDYELPKVQNTAAEVNHED 116



 Score = 27.5 bits (61), Expect = 5.6
 Identities = 19/88 (21%), Positives = 39/88 (44%), Gaps = 7/88 (7%)

Query: 171 EEEKKEEDEIEEEENEE-------EEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEE 223
           E   K  D+I  E+ E        +++   +K K  ++ +++KKE++E+ +   E   + 
Sbjct: 29  ESVDKVNDDIFPEDVEIPSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDS 88

Query: 224 EEEVVKQKKNEEEEEEEEVLIKLEEEEE 251
           EE     KK + E  + +        E+
Sbjct: 89  EENEENDKKVDYELPKVQNTAAEVNHED 116


>gnl|CDD|218328 pfam04921, XAP5, XAP5, circadian clock regulator.  This protein is
           found in a wide range of eukaryotes. It is a nuclear
           protein and is suggested to be DNA binding. In plants,
           this family is essential for correct circadian clock
           functioning by acting as a light-quality regulator
           coordinating the activities of blue and red light
           signalling pathways during plant growth - inhibiting
           growth in red light but promoting growth in blue light.
          Length = 233

 Score = 28.5 bits (64), Expect = 3.1
 Identities = 15/48 (31%), Positives = 29/48 (60%), Gaps = 3/48 (6%)

Query: 189 EEVKQKKNK---EEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKN 233
           ++ K+KK+K    +++EEE++ E ++ +K   E+ E +E  V   KK 
Sbjct: 2   KKKKKKKSKLSFGDDDEEEDEDEGEDEKKVPKESSEPDEANVNPNKKK 49



 Score = 27.7 bits (62), Expect = 5.2
 Identities = 25/87 (28%), Positives = 35/87 (40%), Gaps = 21/87 (24%)

Query: 176 EEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEE-----------------DEIEKEEGE 218
            +D+ EE+E+E E+E K  K   E +E      +                    EKE   
Sbjct: 14  GDDDEEEDEDEGEDEKKVPKESSEPDEANVNPNKKKIGKNPSVDTSFLPDKAREEKEAEL 73

Query: 219 NEEEEEEEVVKQKKNEEEEEEEEVLIK 245
            EE  EE + KQ    E  +EEE+ I 
Sbjct: 74  REELREEFLKKQ----EAVKEEEIEIT 96


>gnl|CDD|132364 TIGR03321, alt_F1F0_F0_B, alternate F1F0 ATPase, F0 subunit B.  A
           small number of taxonomically diverse prokaryotic
           species, including Methanosarcina barkeri, have what
           appears to be a second ATP synthase, in addition to the
           normal F1F0 ATPase in bacteria and A1A0 ATPase in
           archaea. These enzymes use ion gradients to synthesize
           ATP, CC and in principle may run in either direction.
           This model represents the F0 subunit B of this apparent
           second ATP synthase.
          Length = 246

 Score = 28.5 bits (64), Expect = 3.1
 Identities = 14/52 (26%), Positives = 28/52 (53%), Gaps = 1/52 (1%)

Query: 202 EEEEKKEEDEI-EKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEEEN 252
           +  EKK   E+ + +  + E E+E    ++K  E +++ E +L K +EE + 
Sbjct: 35  DAREKKIAGELADADTKKREAEQERREYEEKNEELDQQREVLLTKAKEEAQA 86



 Score = 27.7 bits (62), Expect = 5.0
 Identities = 21/82 (25%), Positives = 40/82 (48%), Gaps = 9/82 (10%)

Query: 171 EEEKKEEDEIEEEEN-----EEEEEVKQKKNKEEEEEEEEK----KEEDEIEKEEGENEE 221
             EKK   E+ + +      E+E    ++KN+E +++ E      KEE + E++   +E 
Sbjct: 36  AREKKIAGELADADTKKREAEQERREYEEKNEELDQQREVLLTKAKEEAQAERQRLLDEA 95

Query: 222 EEEEEVVKQKKNEEEEEEEEVL 243
            EE + +++K  E    E+  L
Sbjct: 96  REEADEIREKWQEALRREQAAL 117


>gnl|CDD|180745 PRK06915, PRK06915, acetylornithine deacetylase; Validated.
          Length = 422

 Score = 28.5 bits (64), Expect = 3.2
 Identities = 14/32 (43%), Positives = 19/32 (59%), Gaps = 4/32 (12%)

Query: 38  RAAICDNPNIFAGKTVLDVGTGTGKSILLQGH 69
           R +  D+PNI A       G+G GKS++L GH
Sbjct: 74  RTSFSDSPNIVATLK----GSGGGKSMILNGH 101


>gnl|CDD|218684 pfam05672, MAP7, MAP7 (E-MAP-115) family.  The organisation of
           microtubules varies with the cell type and is presumably
           controlled by tissue-specific microtubule-associated
           proteins (MAPs). The 115-kDa epithelial MAP
           (E-MAP-115/MAP7) has been identified as a
           microtubule-stabilising protein predominantly expressed
           in cell lines of epithelial origin. The binding of this
           microtubule associated protein is nucleotide
           independent.
          Length = 171

 Score = 28.1 bits (62), Expect = 3.2
 Identities = 24/81 (29%), Positives = 48/81 (59%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           E+++ E +E++    EE    +++  ++EEE   EK+E+ + + EE E +E+EE+E +++
Sbjct: 51  EQDRLEREELKRRAAEERLRREEEARRQEEERAREKEEKAKRKAEEEEKQEQEEQERIQK 110

Query: 231 KKNEEEEEEEEVLIKLEEEEE 251
           +K E E    E   ++  E E
Sbjct: 111 QKEEAEARAREEAERMRLERE 131



 Score = 26.6 bits (58), Expect = 9.6
 Identities = 25/78 (32%), Positives = 51/78 (65%)

Query: 173 EKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKK 232
           E++E++E E  E EE++ +++++ K    EE  ++EE+   +EE    E+EE+   K ++
Sbjct: 37  EQREQEEQERREQEEQDRLEREELKRRAAEERLRREEEARRQEEERAREKEEKAKRKAEE 96

Query: 233 NEEEEEEEEVLIKLEEEE 250
            E++E+EE+  I+ ++EE
Sbjct: 97  EEKQEQEEQERIQKQKEE 114


>gnl|CDD|222287 pfam13649, Methyltransf_25, Methyltransferase domain.  This family
           appears to be a methyltransferase domain.
          Length = 97

 Score = 27.3 bits (61), Expect = 3.2
 Identities = 16/104 (15%), Positives = 35/104 (33%), Gaps = 21/104 (20%)

Query: 53  VLDVGTGTGKSILLQGHGVDHHSFCTKVHPLD-----------HYAPQYLILEVIQNKIE 101
           +LD+G GTG+ +               V  +D               +   +  +     
Sbjct: 1   ILDLGCGTGRVLRALARAGPSS-----VTGVDISKEALELAKERLRDKGPKVRFVVADAR 55

Query: 102 NVELPE-KVDIIVSEWMGFYLLHESMIDSVI--FARDKFLKPEG 142
           ++   E   D+++   +    L    + +++   AR   L+P G
Sbjct: 56  DLPFEEGSFDLVICAGLSLDYLSPKQLRALLREAAR--LLRPGG 97


>gnl|CDD|218744 pfam05781, MRVI1, MRVI1 protein.  This family consists of mammalian
           MRVI1 proteins which are related to the
           lymphoid-restricted membrane protein (JAW1) and the IP3
           receptor associated cGMP kinase substrates A and B
           (IRAGA and IRAGB). The function of MRVI1 is unknown
           although mutations in the Mrvi1 gene induces myeloid
           leukaemia by altering the expression of a gene important
           for myeloid cell growth and/or differentiation so it has
           been speculated that Mrvi1 is a tumour suppressor gene.
           IRAG is very similar in sequence to MRVI1 and is an
           essential NO/cGKI-dependent regulator of IP3-induced
           calcium release. Activation of cGKI decreases
           IP3-stimulated elevations in intracellular calcium,
           induces smooth muscle relaxation and contributes to the
           antiproliferative and pro-apoptotic effects of NO/cGMP.
           Jaw1 is a member of a class of proteins with
           COOH-terminal hydrophobic membrane anchors and is
           structurally similar to proteins involved in vesicle
           targeting and fusion. This suggests that the function
           and/or the structure of the ER in lymphocytes may be
           modified by lymphoid-restricted resident ER proteins.
          Length = 538

 Score = 28.8 bits (64), Expect = 3.3
 Identities = 13/46 (28%), Positives = 21/46 (45%)

Query: 186 EEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQK 231
           EE++  K +  +E E EE  ++       EE    EE ++E    K
Sbjct: 416 EEKKTKKLQDLREPEGEEAVERTRKPSLSEEVAETEEWDKEQRSSK 461



 Score = 28.1 bits (62), Expect = 5.3
 Identities = 13/45 (28%), Positives = 23/45 (51%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIE 213
           A  EE+K+  ++++    E EE  ++  K    EE  + EE + E
Sbjct: 412 ADAEEEKKTKKLQDLREPEGEEAVERTRKPSLSEEVAETEEWDKE 456



 Score = 27.7 bits (61), Expect = 8.0
 Identities = 14/42 (33%), Positives = 23/42 (54%)

Query: 200 EEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEE 241
           EEE++ KK +D  E E  E  E   +  + ++  E EE ++E
Sbjct: 415 EEEKKTKKLQDLREPEGEEAVERTRKPSLSEEVAETEEWDKE 456


>gnl|CDD|235229 PRK04156, gltX, glutamyl-tRNA synthetase; Provisional.
          Length = 567

 Score = 28.7 bits (65), Expect = 3.3
 Identities = 20/61 (32%), Positives = 29/61 (47%), Gaps = 4/61 (6%)

Query: 178 DEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEE 237
            EI     E  EEV    N    EE+ E+ EE   E  E E E++EE++ +    N E+ 
Sbjct: 45  KEIIPIVKEVVEEV----NSLSLEEQRERLEELAPELLEEEEEKKEEKKGLPPLPNAEKG 100

Query: 238 E 238
           +
Sbjct: 101 K 101


>gnl|CDD|220818 pfam10595, UPF0564, Uncharacterized protein family UPF0564.  This
           family of proteins has no known function. However, one
           of the members is annotated as an EF-hand family
           protein.
          Length = 349

 Score = 28.6 bits (64), Expect = 3.3
 Identities = 21/82 (25%), Positives = 47/82 (57%), Gaps = 2/82 (2%)

Query: 159 PEIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGE 218
           P   K WE++      + +++  +E+ E++E  ++KK K+E   + +++ E     +  +
Sbjct: 265 PPSTKKWESLVKFLRTERKEKEAKEQQEKKELEQRKKKKKEMAPKVKQRFEANDPAQ--K 322

Query: 219 NEEEEEEEVVKQKKNEEEEEEE 240
            +EE +E++ K +K E+E E+E
Sbjct: 323 LQEERKEQLAKLRKEEKEREKE 344



 Score = 27.4 bits (61), Expect = 6.8
 Identities = 22/78 (28%), Positives = 35/78 (44%), Gaps = 2/78 (2%)

Query: 159 PEIFKFWE--NIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEE 216
           PE F F E    +      +E+    EEN +    K   + ++ E   +    +  EKE 
Sbjct: 228 PEPFNFQESHKSSSRTYLDQENISAGEENLKPTRRKLPPSTKKWESLVKFLRTERKEKEA 287

Query: 217 GENEEEEEEEVVKQKKNE 234
            E +E++E E  K+KK E
Sbjct: 288 KEQQEKKELEQRKKKKKE 305


>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
           binding fragment.  Topoisomerase I promotes the
           relaxation of DNA superhelical tension by introducing a
           transient single-stranded break in duplex DNA and are
           vital for the processes of replication, transcription,
           and recombination. This family may be more than one
           structural domain.
          Length = 215

 Score = 28.3 bits (64), Expect = 3.4
 Identities = 10/28 (35%), Positives = 20/28 (71%)

Query: 188 EEEVKQKKNKEEEEEEEEKKEEDEIEKE 215
           E E ++KK   +EE++  K+E+D++E+ 
Sbjct: 96  EAEKEKKKAMSKEEKKAIKEEKDKLEEP 123


>gnl|CDD|165442 PHA03171, PHA03171, UL37 tegument protein; Provisional.
          Length = 499

 Score = 28.5 bits (63), Expect = 3.4
 Identities = 12/27 (44%), Positives = 16/27 (59%)

Query: 200 EEEEEEKKEEDEIEKEEGENEEEEEEE 226
           EE EEE  + +  + E  E +EEEE E
Sbjct: 84  EEAEEEDNDRECPDTEAEEEDEEEEIE 110



 Score = 28.5 bits (63), Expect = 3.9
 Identities = 17/47 (36%), Positives = 26/47 (55%), Gaps = 6/47 (12%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKE 215
           AGEE ++E+++ E  + E EEE       EEEE E    E + ++ E
Sbjct: 82  AGEEAEEEDNDRECPDTEAEEE------DEEEEIEAPDPEVNPLDAE 122


>gnl|CDD|235520 PRK05589, PRK05589, peptide chain release factor 2; Provisional.
          Length = 325

 Score = 28.6 bits (64), Expect = 3.4
 Identities = 15/61 (24%), Positives = 33/61 (54%), Gaps = 4/61 (6%)

Query: 196 NKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIK----LEEEEE 251
            + +E   EEK  +D+++K        E+ EV+ +  +EE++E ++ +I     ++EE +
Sbjct: 11  KEAQEITSEEKYLKDKLDKYNHLRNRIEDIEVLCEMMSEEDDEMKKEIISEVKNIKEEID 70

Query: 252 N 252
            
Sbjct: 71  R 71


>gnl|CDD|221756 pfam12757, DUF3812, Protein of unknown function (DUF3812).  This is
           a family of fungal proteins whose function is not known.
          Length = 126

 Score = 27.6 bits (62), Expect = 3.5
 Identities = 11/32 (34%), Positives = 20/32 (62%)

Query: 176 EEDEIEEEENEEEEEVKQKKNKEEEEEEEEKK 207
            +   +EE+  +EEE K++  + +E E E+KK
Sbjct: 95  AQRARDEEKKLDEEEAKRQHEEAKEREREKKK 126


>gnl|CDD|235505 PRK05563, PRK05563, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 559

 Score = 28.7 bits (65), Expect = 3.6
 Identities = 16/77 (20%), Positives = 35/77 (45%), Gaps = 4/77 (5%)

Query: 161 IFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKN----KEEEEEEEEKKEEDEIEKEE 216
           +  F   I  E+   +++  +  EN   + + +        E+E ++  ++   + + EE
Sbjct: 472 VLAFEYEIHFEKAMLDKELNDTIENILSKLLGKSVEIVAVPEDEWQKIREEFLQKHKNEE 531

Query: 217 GENEEEEEEEVVKQKKN 233
            E EEEEE  ++ + K 
Sbjct: 532 EEEEEEEELPLIPEAKE 548


>gnl|CDD|219882 pfam08524, rRNA_processing, rRNA processing.  This is a family of
           proteins that are involved in rRNA processing. In a
           localisation study they were found to localise to the
           nucleus and nucleolus. The family also includes other
           metazoa members from plants to mammals where the protein
           has been named BR22 and is associated with TTF-1,
           thyroid transcription factor 1. In the lungs, the family
           binds TTF-1 to form a complex which influences the
           expression of the key lung surfactant protein-B (SP-B)
           and -C (SP-C), the small hydrophobic surfactant proteins
           that maintain surface tension in alveoli.
          Length = 150

 Score = 27.9 bits (62), Expect = 3.6
 Identities = 17/64 (26%), Positives = 36/64 (56%)

Query: 173 EKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKK 232
           EK+  ++  +   E+ +  K+KK  E++E  +++K E   ++     +E E+ E+ K+K+
Sbjct: 51  EKESAEKQVKSSKEDRKFEKKKKLDEKKEIAKQRKREQREKELAKRQKELEKIELSKKKQ 110

Query: 233 NEEE 236
            E E
Sbjct: 111 KERE 114


>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like).  This family
           includes outer membrane proteins such as OmpH among
           others. Skp (OmpH) has been characterized as a molecular
           chaperone that interacts with unfolded proteins as they
           emerge in the periplasm from the Sec translocation
           machinery.
          Length = 157

 Score = 28.0 bits (63), Expect = 3.6
 Identities = 13/81 (16%), Positives = 52/81 (64%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
           A ++ +KE  +++ E  ++E+E+++++ K +++     +E  + +++E + +++E ++  
Sbjct: 36  AQKQLEKEFKKLQAELQKKEKELQKEEQKLQKQAATLSEEARKAKQQELQQKQQELQQKQ 95

Query: 229 KQKKNEEEEEEEEVLIKLEEE 249
           +  + E +++++E+L  + ++
Sbjct: 96  QAAQQELQQKQQELLQPIYDK 116


>gnl|CDD|234055 TIGR02907, spore_VI_D, stage VI sporulation protein D.  SpoVID, the
           stage VI sporulation protein D, is restricted to
           endospore-forming members of the bacteria, all of which
           are found among the Firmicutes. It is widely distributed
           but not quite universal in this group. Between
           well-conserved N-terminal and C-terminal domains is a
           poorly conserved, low-complexity region of variable
           length, rich enough in glutamic acid to cause spurious
           BLAST search results unless a filter is used. The seed
           alignment for this model was trimmed, in effect, by
           choosing member sequences in which these regions are
           relatively short. SpoVID is involved in spore coat
           assembly by the mother cell compartment late in the
           process of sporulation [Cellular processes, Sporulation
           and germination].
          Length = 338

 Score = 28.3 bits (63), Expect = 3.7
 Identities = 25/85 (29%), Positives = 42/85 (49%), Gaps = 4/85 (4%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           E E  E+ E   +++ +E ++   +  + E E E   EE+  E+ E E E E E+E    
Sbjct: 182 EHEAHEQHEQPADDDPDEWKISASEPFQLESEVEASPEEENYEEYEDETELEVEDEEKAL 241

Query: 231 KKNEEEEEEEEVLI----KLEEEEE 251
            +  E+ ++E+ L     K  EEEE
Sbjct: 242 DEQTEDPQQEDALAGDAKKALEEEE 266


>gnl|CDD|226096 COG3566, COG3566, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 379

 Score = 28.4 bits (63), Expect = 3.8
 Identities = 10/75 (13%), Positives = 26/75 (34%), Gaps = 1/75 (1%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           +    E          E+ +  E+++   + K +  +   K+E D  +K E        E
Sbjct: 203 KASEVEHLSASLKTATEKVDALEKDLHAAQAKLDSGQALTKEELD-AKKAELSKALAALE 261

Query: 226 EVVKQKKNEEEEEEE 240
                 ++ ++ +  
Sbjct: 262 AANAADEDPQDRDAA 276



 Score = 27.6 bits (61), Expect = 7.5
 Identities = 13/72 (18%), Positives = 25/72 (34%)

Query: 180 IEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEE 239
            E + +E E      K   E+ +  EK       K +      +EE   K+ +  +    
Sbjct: 200 EETKASEVEHLSASLKTATEKVDALEKDLHAAQAKLDSGQALTKEELDAKKAELSKALAA 259

Query: 240 EEVLIKLEEEEE 251
            E     +E+ +
Sbjct: 260 LEAANAADEDPQ 271


>gnl|CDD|224237 COG1318, COG1318, Predicted transcriptional regulators
           [Transcription].
          Length = 182

 Score = 27.9 bits (62), Expect = 4.0
 Identities = 21/84 (25%), Positives = 41/84 (48%), Gaps = 1/84 (1%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
           AG+  ++  ++++E   +  E   +K  KE  +     +   +I+ E    ++E  EE +
Sbjct: 88  AGQLVRETYEKLKEGGLDAVEVEIEKLEKEGLKIRWAVEVLKKIKGEHFPMDKELLEEKL 147

Query: 229 KQKKNEEEEEEEEVLIKLEEEEEN 252
           K +   + E  EEV+ KLE   E+
Sbjct: 148 KGEV-IKGEPLEEVIDKLEYPVES 170


>gnl|CDD|223079 PHA03419, PHA03419, E4 protein; Provisional.
          Length = 200

 Score = 28.0 bits (62), Expect = 4.0
 Identities = 16/53 (30%), Positives = 26/53 (49%)

Query: 170 GEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEE 222
            +E+KK+E E   +  E+ ++  + K + E  E E+   ED      GE E E
Sbjct: 94  KKEKKKKETEKPAQGGEKPDQGPEAKGEGEGHEPEDPPPEDTPPPPGGEGEVE 146


>gnl|CDD|233830 TIGR02350, prok_dnaK, chaperone protein DnaK.  Members of this
           family are the chaperone DnaK, of the DnaK-DnaJ-GrpE
           chaperone system. All members of the seed alignment were
           taken from completely sequenced bacterial or archaeal
           genomes and (except for Mycoplasma sequence) found
           clustered with other genes of this systems. This model
           excludes DnaK homologs that are not DnaK itself, such as
           the heat shock cognate protein HscA (TIGR01991).
           However, it is not designed to distinguish among DnaK
           paralogs in eukaryotes. Note that a number of dnaK genes
           have shadow ORFs in the same reverse (relative to dnaK)
           reading frame, a few of which have been assigned
           glutamate dehydrogenase activity. The significance of
           this observation is unclear; lengths of such shadow ORFs
           are highly variable as if the presumptive protein
           product is not conserved [Protein fate, Protein folding
           and stabilization].
          Length = 595

 Score = 28.4 bits (64), Expect = 4.1
 Identities = 20/80 (25%), Positives = 43/80 (53%), Gaps = 2/80 (2%)

Query: 174 KKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIE-KEEGENEEEEEEEVVKQKK 232
           K++   I       EEE+++   + E   EE+KK ++EIE +   ++   + E+ +K+  
Sbjct: 486 KEQSITITASSGLSEEEIERMVKEAEANAEEDKKRKEEIEARNNADSLAYQAEKTLKEAG 545

Query: 233 NE-EEEEEEEVLIKLEEEEE 251
           ++   EE+E++   + E +E
Sbjct: 546 DKLPAEEKEKIEKAVAELKE 565


>gnl|CDD|241384 cd13230, PH1_SSRP1-like, Structure Specific Recognition protein 1
           (SSRP1) Pleckstrin homology (PH) domain, repeat 1.
           SSRP1 is a component of FACT (facilitator of chromatin
           transcription), an essential chromatin reorganizing
           factor. In yeast FACT (yFACT) is composed of three
           proteins: Spt16/Cdc68, Pob3, and Nhp6. In metazoans the
           Pob3 and Nhp6 orthologs are fused to form SSRP1/T160 in
           human and mouse, respectively. The middle domain of the
           Pob3 subunit (Pob3-M) has an unusual double pleckstrin
           homology (PH) architecture. yFACT interacts in a
           physiologically important way with the central
           single-strand DNA binding factor RPA to promote a step
           in DNA Replication. Coordinated function by yFACT and
           RPA is important during nucleosome deposition. These
           results support the model that the FACT family has an
           essential role in constructing nucleosomes during DNA
           replication, and suggest that RPA contributes to this
           process. Members of this cd are composed of the first
           PH-like repeat. PH domains have diverse functions, but
           in general are involved in targeting proteins to the
           appropriate cellular location or in the interaction with
           a binding partner. They share little sequence
           conservation, but all have a common fold, which is
           electrostatically polarized. Less than 10% of PH domains
           bind phosphoinositide phosphates (PIPs) with high
           affinity and specificity. PH domains are distinguished
           from other PIP-binding domains by their specific
           high-affinity binding to PIPs with two vicinal phosphate
           groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3
           which results in targeting some PH domain proteins to
           the plasma membrane. A few display strong specificity in
           lipid binding. Any specificity is usually determined by
           loop regions or insertions in the N-terminus of the
           domain, which are not conserved across all PH domains.
           PH domains are found in cellular signaling proteins such
           as serine/threonine kinase, tyrosine kinases, regulators
           of G-proteins, endocytotic GTPases, adaptors, as well as
           cytoskeletal associated molecules and in lipid
           associated enzymes.
          Length = 137

 Score = 27.5 bits (62), Expect = 4.1
 Identities = 12/30 (40%), Positives = 19/30 (63%)

Query: 177 EDEIEEEENEEEEEVKQKKNKEEEEEEEEK 206
           ++EIE E N  EEE+++K   + E+E E  
Sbjct: 87  DEEIELELNLTEEELEEKYKGKLEKEYEGP 116


>gnl|CDD|214360 CHL00094, dnaK, heat shock protein 70.
          Length = 621

 Score = 28.5 bits (64), Expect = 4.2
 Identities = 18/86 (20%), Positives = 41/86 (47%), Gaps = 9/86 (10%)

Query: 176 EEDEIEEEENEEEEEVKQKKNKEEEEE---------EEEKKEEDEIEKEEGENEEEEEEE 226
            +DE+E    E E+   + K K E+ +          + +K+  E++ +  E ++E+ E 
Sbjct: 503 PKDEVERMVKEAEKNAAEDKEKREKIDLKNQAESLCYQAEKQLKELKDKISEEKKEKIEN 562

Query: 227 VVKQKKNEEEEEEEEVLIKLEEEEEN 252
           ++K+ +   + +  E +  L EE + 
Sbjct: 563 LIKKLRQALQNDNYESIKSLLEELQK 588


>gnl|CDD|173502 PTZ00266, PTZ00266, NIMA-related protein kinase; Provisional.
          Length = 1021

 Score = 28.5 bits (63), Expect = 4.2
 Identities = 18/70 (25%), Positives = 36/70 (51%), Gaps = 3/70 (4%)

Query: 176 EEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEE 235
           E+  IE  E EE E +++++    E  E E+ E + +E+E  E +  E + + + ++   
Sbjct: 456 EKKRIERLEREERERLERER---MERIERERLERERLERERLERDRLERDRLDRLERERV 512

Query: 236 EEEEEEVLIK 245
           +  E + L K
Sbjct: 513 DRLERDRLEK 522



 Score = 28.2 bits (62), Expect = 5.3
 Identities = 19/75 (25%), Positives = 38/75 (50%), Gaps = 1/75 (1%)

Query: 173 EKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKK 232
           E+ E +E E  E E  E +++++  E E  E E+ E D +E++  +  E E  + +++ +
Sbjct: 461 ERLEREERERLERERMERIERER-LERERLERERLERDRLERDRLDRLERERVDRLERDR 519

Query: 233 NEEEEEEEEVLIKLE 247
            E+       L  +E
Sbjct: 520 LEKARRNSYFLKGME 534


>gnl|CDD|227880 COG5593, COG5593, Nucleic-acid-binding protein possibly involved in
           ribosomal biogenesis [Translation, ribosomal structure
           and biogenesis].
          Length = 821

 Score = 28.5 bits (63), Expect = 4.2
 Identities = 16/81 (19%), Positives = 40/81 (49%), Gaps = 6/81 (7%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKN------KEEEEEEEEKKEEDEIEKEEGENEEEEE 224
           +E K +  + E+ ++E  +E  Q++         + E++      +  E++E   EE+EE
Sbjct: 724 DEPKLDAIDDEDAKSEGSQESDQEEGLDEIFYSFDGEQDNSDSFAESSEEDESSEEEKEE 783

Query: 225 EEVVKQKKNEEEEEEEEVLIK 245
           EE  +      ++++ + ++K
Sbjct: 784 EENKEVSAKRAKKKQRKNMLK 804



 Score = 27.3 bits (60), Expect = 9.3
 Identities = 18/64 (28%), Positives = 30/64 (46%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
           A  E  +E D+ E  +        ++ N +   E  E+ E  E EKEE EN+E   +   
Sbjct: 736 AKSEGSQESDQEEGLDEIFYSFDGEQDNSDSFAESSEEDESSEEEKEEEENKEVSAKRAK 795

Query: 229 KQKK 232
           K+++
Sbjct: 796 KKQR 799


>gnl|CDD|233870 TIGR02442, Cob-chelat-sub, cobaltochelatase subunit.
           Cobaltochelatase is responsible for the insertion of
           cobalt into the corrin ring of coenzyme B12 during its
           biosynthesis. Two versions have been well described.
           CbiK/CbiX is a monomeric, anaerobic version which acts
           early in the biosynthesis (pfam06180). CobNST is a
           trimeric, ATP-dependent, aerobic version which acts late
           in the biosynthesis (TIGR02257/TIGR01650/TIGR01651). A
           number of genomes (actinobacteria, cyanobacteria,
           betaproteobacteria and pseudomonads) which apparently
           biosynthesize B12, encode a cobN gene but are
           demonstrably lacking cobS and cobT. These genomes do,
           however contain a homolog (modelled here) of the
           magnesium chelatase subunits BchI/BchD family. Aside
           from the cyanobacteria (which have a separate magnesium
           chelatase trimer), these species do not make chlorins,
           so do not have any use for a magnesium chelatase.
           Furthermore, in nearly all cases the members of this
           family are proximal to either CobN itself or other genes
           involved in cobalt transport or B12 biosynthesis.
          Length = 633

 Score = 28.5 bits (64), Expect = 4.2
 Identities = 12/55 (21%), Positives = 25/55 (45%), Gaps = 8/55 (14%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEE 226
           +EK  E++ EE   + E        K +E E++ ++         G ++E+E + 
Sbjct: 321 DEKDLEEKPEEPGPDPE--------KPDEGEDDAEQSGPRGHPTPGNDDEKEPDP 367


>gnl|CDD|232965 TIGR00414, serS, seryl-tRNA synthetase.  This model represents the
           seryl-tRNA synthetase found in most organisms. This
           protein is a class II tRNA synthetase, and is recognized
           by the pfam model tRNA-synt_2b. The seryl-tRNA
           synthetases of two archaeal species, Methanococcus
           jannaschii and Methanobacterium thermoautotrophicum,
           differ considerably and are included in a different
           model [Protein synthesis, tRNA aminoacylation].
          Length = 418

 Score = 28.1 bits (63), Expect = 4.3
 Identities = 20/66 (30%), Positives = 35/66 (53%), Gaps = 5/66 (7%)

Query: 171 EEEKKEEDEIEE---EENEEEEEVKQKKNKEEEEEEEEKKE--EDEIEKEEGENEEEEEE 225
           +E KK   EIEE   + NE  +++ + K +++++ EE KKE  E + E  E     +  E
Sbjct: 37  DERKKLLSEIEELQAKRNELSKQIGKAKGQKKDKIEEIKKELKELKEELTELSAALKALE 96

Query: 226 EVVKQK 231
             ++ K
Sbjct: 97  AELQDK 102


>gnl|CDD|221324 pfam11933, DUF3451, Domain of unknown function (DUF3451).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is typically
           between 199 to 238 amino acids in length. This domain is
           found associated with pfam06512, pfam00520. This domain
           has a conserved ADD sequence motif.
          Length = 222

 Score = 27.8 bits (62), Expect = 4.3
 Identities = 14/35 (40%), Positives = 18/35 (51%), Gaps = 1/35 (2%)

Query: 158 SPEIFKFWENIAGEEEKKEEDEIEEEENEEEEEVK 192
           S E  K   N   ++ KKEE  I  EE + E+E K
Sbjct: 7   SSESAKERRNRN-DKNKKEEHSIGSEEGDSEKEPK 40



 Score = 27.1 bits (60), Expect = 9.0
 Identities = 8/34 (23%), Positives = 14/34 (41%)

Query: 186 EEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGEN 219
           +E      K  KEE     E+ + ++  K E  +
Sbjct: 12  KERRNRNDKNKKEEHSIGSEEGDSEKEPKSESAD 45


>gnl|CDD|193515 cd03895, M20_ArgE_DapE_like2, M20 Peptidases with similarity to
          acetylornithine deacetylases and
          succinyl-diaminopimelate desuccinylases.  Peptidase M20
          family, Uncharacterized protein subfamily with
          similarity to acetylornithine
          deacetylase/succinyl-diaminopimelate desuccinylase
          (ArgE/DapE) subfamily. ArgE/DapE enzymes catalyze
          analogous reactions and share a common activator, the
          metal ion (usually Co2+ or Zn2+). ArgE catalyzes a
          broad range of substrates, including N-acetylornithine,
          alpha-N-acetylmethionine and alpha-N-formylmethionine,
          while DapE catalyzes the hydrolysis of
          N-succinyl-L,L-diaminopimelate (L,L-SDAP) to
          L,L-diaminopimelate and succinate. Proteins in this
          subfamily are mostly bacterial, and have been inferred
          by homology as being related to both ArgE and DapE.
          Length = 399

 Score = 28.3 bits (64), Expect = 4.5
 Identities = 8/16 (50%), Positives = 12/16 (75%), Gaps = 1/16 (6%)

Query: 57 GTGTGKSILLQGHGVD 72
          G G G+S++L GH +D
Sbjct: 69 GGGGGRSLILNGH-ID 83


>gnl|CDD|220600 pfam10147, CR6_interact, Growth arrest and DNA-damage-inducible
           proteins-interacting protein 1.  Members of this family
           of proteins act as negative regulators of G1 to S cell
           cycle phase progression by inhibiting cyclin-dependent
           kinases. Inhibitory effects are additive with GADD45
           proteins but occur also in the absence of GADD45
           proteins. Furthermore, they act as a repressor of the
           orphan nuclear receptor NR4A1 by inhibiting AB
           domain-mediated transcriptional activity.
          Length = 217

 Score = 27.9 bits (62), Expect = 4.5
 Identities = 21/80 (26%), Positives = 39/80 (48%), Gaps = 3/80 (3%)

Query: 173 EKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKE--EDEIEKE-EGENEEEEEEEVVK 229
           EK EE E EE E         ++N+E+++E+E +++  E EI K      +   +    K
Sbjct: 85  EKLEELEEEEREWYPSLNQMLEENREQQKEKEARRQAREAEIAKNMAKMPQMIADWRAQK 144

Query: 230 QKKNEEEEEEEEVLIKLEEE 249
           +K+ ++    +E   +L  E
Sbjct: 145 RKREQKARAAKERKERLVAE 164


>gnl|CDD|240419 PTZ00440, PTZ00440, reticulocyte binding protein 2-like protein;
            Provisional.
          Length = 2722

 Score = 28.6 bits (64), Expect = 4.5
 Identities = 19/70 (27%), Positives = 31/70 (44%), Gaps = 3/70 (4%)

Query: 160  EIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEE---EEEEKKEEDEIEKEE 216
            E  K  E    EE +KE + IE E  E+  + +Q   +++E    EEE+K    E    E
Sbjct: 2564 EKLKLLEKAKIEESRKERERIESETQEDNTDEEQINRQQQERLQKEEEQKAYSQERLNRE 2623

Query: 217  GENEEEEEEE 226
                ++  + 
Sbjct: 2624 VSGTDDTNKN 2633


>gnl|CDD|219111 pfam06625, DUF1151, Protein of unknown function (DUF1151).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 122

 Score = 27.1 bits (60), Expect = 4.6
 Identities = 21/62 (33%), Positives = 38/62 (61%), Gaps = 1/62 (1%)

Query: 173 EKKEEDEIEEEENEEEEEVKQKKNKEEE-EEEEEKKEEDEIEKEEGENEEEEEEEVVKQK 231
           EK++ D++ +E+ EEEE  + +   E E  +  +K E+ E+EK + + E+E+  E +K K
Sbjct: 58  EKRKRDQVLKEQKEEEEAKRLQSELERELMKRAQKLEQLELEKAKEDEEQEQIPEFLKVK 117

Query: 232 KN 233
            N
Sbjct: 118 AN 119


>gnl|CDD|233941 TIGR02585, cas_Cst2_DevR, CRISPR-associated protein Cas7/Cst2/DevR,
           subtype I-B/TNEAP.  CRISPR is a term for Clustered
           Regularly Interspaced Short Palidromic Repeats. A number
           of protein families appear only in association with
           these repeats and are designated Cas (CRISPR associated)
           proteins. This clade is a member of the DevR family
           (TIGR01875) and includes the DevR protein of Myxococcus
           xanthus, a protein whose expression appears to be
           regulated through a number of means, including both
           location and autorepression; DevR mutants are incapable
           of fruiting body development [Mobile and
           extrachromosomal element functions, Other].
          Length = 310

 Score = 28.2 bits (63), Expect = 4.7
 Identities = 16/78 (20%), Positives = 30/78 (38%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
               K E+E  E E   + E+     ++  E+ E++     ++    ENEEE  E     
Sbjct: 232 LNVLKVEEEYLEGEVIIDVELIVTALEDFSEKVEDEVHIGLLKGFFEENEEEAIELKPDG 291

Query: 231 KKNEEEEEEEEVLIKLEE 248
                    ++V   +E+
Sbjct: 292 GVESIIVIFDKVKKAVEK 309


>gnl|CDD|218003 pfam04281, Tom22, Mitochondrial import receptor subunit Tom22.  The
           mitochondrial protein translocase family, which is
           responsible for movement of nuclear encoded pre-proteins
           into mitochondria, is very complex with at least 19
           components. These proteins include several chaperone
           proteins, four proteins of the outer membrane
           translocase (Tom) import receptor, five proteins of the
           Tom channel complex, five proteins of the inner membrane
           translocase (Tim) and three "motor" proteins. This
           family represents the Tom22 proteins. The N terminal
           region of Tom22 has been shown to have chaperone-like
           activity, and the C terminal region faces the
           intermembrane face.
          Length = 136

 Score = 27.2 bits (61), Expect = 4.7
 Identities = 9/45 (20%), Positives = 24/45 (53%)

Query: 179 EIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEE 223
           E+E+E  +E+    +   +EE ++++E   + + +  +  + E E
Sbjct: 6   EVEDETFQEKPAAPKNLAQEESDDDDEDDTDTDSDISDDSDFENE 50


>gnl|CDD|213783 TIGR03181, PDH_E1_alph_x, pyruvate dehydrogenase E1 component,
           alpha subunit.  Members of this protein family are the
           alpha subunit of the E1 component of pyruvate
           dehydrogenase (PDH). This model represents one branch of
           a larger family that E1-alpha proteins from
           2-oxoisovalerate dehydrogenase, acetoin dehydrogenase,
           another PDH clade, etc [Energy metabolism, Pyruvate
           dehydrogenase].
          Length = 341

 Score = 27.9 bits (63), Expect = 4.7
 Identities = 16/50 (32%), Positives = 28/50 (56%), Gaps = 6/50 (12%)

Query: 200 EEEEEEKKEEDEIEK------EEGENEEEEEEEVVKQKKNEEEEEEEEVL 243
           +EEEEE +++D I +       +G  +EE+EE + ++ + E  E   E L
Sbjct: 263 KEEEEEWRKKDPILRLRKYLERKGLWDEEQEEALEEEAEAEVAEAVAEAL 312


>gnl|CDD|224037 COG1112, COG1112, Superfamily I DNA and RNA helicases and helicase
           subunits [DNA replication, recombination, and repair].
          Length = 767

 Score = 28.4 bits (63), Expect = 4.7
 Identities = 18/79 (22%), Positives = 27/79 (34%), Gaps = 1/79 (1%)

Query: 175 KEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNE 234
                 E +E  +    K  K  E E+E E K +E EI+K        +   ++      
Sbjct: 80  DGTLNRELKELIDSLSKKLGKLVEIEQEAEIKTKELEIKKLRLAKRSFKGRAILGLLAKV 139

Query: 235 EEEEEE-EVLIKLEEEEEN 252
             E    E L+K    E  
Sbjct: 140 LGENLGSEALVKYGRLESY 158


>gnl|CDD|215544 PLN03029, PLN03029, type-a response regulator protein; Provisional.
          Length = 222

 Score = 27.7 bits (61), Expect = 4.7
 Identities = 13/34 (38%), Positives = 22/34 (64%)

Query: 190 EVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEE 223
           + K K  K+E +E++EK EE EI+ E+ E   ++
Sbjct: 147 KTKSKNQKQENQEKQEKLEESEIQSEKQEQPSQQ 180


>gnl|CDD|221733 pfam12720, DUF3807, Protein of unknown function (DUF3807).  This is
           a family of conserved fungal proteins of unknown
           function.
          Length = 169

 Score = 27.4 bits (61), Expect = 4.8
 Identities = 18/82 (21%), Positives = 35/82 (42%), Gaps = 2/82 (2%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           E E KEE E EEE   E +          E   + K+ E +   ++ ++  ++E +  K 
Sbjct: 68  ERELKEEAEAEEE--GEVDASPDAGAVAGESSADRKEAEQQGAAQKRKSCRDKERKSAKD 125

Query: 231 KKNEEEEEEEEVLIKLEEEEEN 252
            +   ++  ++    L+  EE 
Sbjct: 126 PRGGTQDVVDKSQASLDYGEEE 147


>gnl|CDD|227448 COG5118, BDP1, Transcription initiation factor TFIIIB, Bdp1 subunit
           [Transcription].
          Length = 507

 Score = 28.1 bits (62), Expect = 4.8
 Identities = 17/88 (19%), Positives = 30/88 (34%)

Query: 153 SAPSYSPEIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEI 212
            A        KF E     E  +      +  N  ++   +KK      +  ++  E+ +
Sbjct: 254 KAKLEKRRHVKFLEGSNTHEMDQLLKHFLDNSNFRQDRRSRKKKASASRDISDQNAEEIL 313

Query: 213 EKEEGENEEEEEEEVVKQKKNEEEEEEE 240
             + G    +E    V + KN   E EE
Sbjct: 314 MIKNGHIVVDEANMYVDRHKNASIEVEE 341


>gnl|CDD|148679 pfam07218, RAP1, Rhoptry-associated protein 1 (RAP-1).  This family
           consists of several rhoptry-associated protein 1 (RAP-1)
           sequences which appear to be specific to Plasmodium
           falciparum.
          Length = 790

 Score = 28.1 bits (62), Expect = 4.9
 Identities = 18/58 (31%), Positives = 34/58 (58%)

Query: 185 NEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEV 242
            + EEE+   KN  ++  EEE+ E DE  +E  E+++E  E++  +   E+ + +EE+
Sbjct: 224 EKLEEEIDILKNDGDKVAEEEEFELDEEHEEAEEDKKEALEKIGAEGDEEKFKFDEEI 281


>gnl|CDD|240412 PTZ00420, PTZ00420, coronin; Provisional.
          Length = 568

 Score = 28.0 bits (62), Expect = 5.0
 Identities = 18/67 (26%), Positives = 28/67 (41%)

Query: 175 KEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNE 234
           KE   I  +  +E+E + +K   +       ++  DE         E  EEEV+K  +NE
Sbjct: 451 KESVFIYPKSFKEKELLTEKGGAQFSSANSLERGADEDYLIVNGTNEPYEEEVIKTNENE 510

Query: 235 EEEEEEE 241
               E E
Sbjct: 511 NFPLENE 517


>gnl|CDD|129705 TIGR00618, sbcc, exonuclease SbcC.  All proteins in this family for
           which functions are known are part of an exonuclease
           complex with sbcD homologs. This complex is involved in
           the initiation of recombination to regulate the levels
           of palindromic sequences in DNA. This family is based on
           the phylogenomic analysis of JA Eisen (1999, Ph.D.
           Thesis, Stanford University) [DNA metabolism, DNA
           replication, recombination, and repair].
          Length = 1042

 Score = 28.4 bits (63), Expect = 5.0
 Identities = 8/85 (9%), Positives = 30/85 (35%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           +    E+    E      + + E++++  +   ++  +E   +   +   +    +E   
Sbjct: 602 KLSEAEDMLACEQHALLRKLQPEQDLQDVRLHLQQCSQELALKLTALHALQLTLTQERVR 661

Query: 226 EVVKQKKNEEEEEEEEVLIKLEEEE 250
           E     +   +E      + L++ +
Sbjct: 662 EHALSIRVLPKELLASRQLALQKMQ 686


>gnl|CDD|223957 COG1026, COG1026, Predicted Zn-dependent peptidases,
           insulinase-like [General function prediction only].
          Length = 978

 Score = 28.1 bits (63), Expect = 5.0
 Identities = 16/47 (34%), Positives = 31/47 (65%)

Query: 179 EIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
           E+EE+  +EE E+ QK++ E  +E+ EK  +D  + +E +++ + EE
Sbjct: 460 ELEEKLEKEERELLQKRSSELTDEDLEKIIKDSKKLKERQDQPDSEE 506


>gnl|CDD|223683 COG0610, COG0610, Type I site-specific restriction-modification
           system, R (restriction) subunit and related helicases
           [Defense mechanisms].
          Length = 962

 Score = 28.1 bits (63), Expect = 5.2
 Identities = 17/66 (25%), Positives = 31/66 (46%), Gaps = 1/66 (1%)

Query: 177 EDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEE 236
           ++ IE    E  E+++QKK K  E   E   +    + +  E  EE      K+++ ++ 
Sbjct: 836 KELIERIIKEWIEDLRQKK-KLIERLIEAINQYRAKKLDTAEKLEELYILAKKEEEFKQF 894

Query: 237 EEEEEV 242
            EEE +
Sbjct: 895 AEEEGL 900


>gnl|CDD|153125 cd02434, Nodulin-21_like_3, Nodulin-21 and CCC1-related protein
           family.  Nodulin-21_like_3: This is a family of proteins
           closely related to nodulin-21, a plant nodule-specific
           protein that may be involved in symbiotic nitrogen
           fixation. This family is also related to CCC1, a yeast
           vacuole transmembrane protein that functions as an iron
           and manganese transporter. .
          Length = 225

 Score = 27.7 bits (62), Expect = 5.3
 Identities = 14/33 (42%), Positives = 17/33 (51%), Gaps = 3/33 (9%)

Query: 172 EEKKEEDEIE---EEENEEEEEVKQKKNKEEEE 201
           E+K+EE EIE   E E  E  E+   K   EE 
Sbjct: 67  EKKREEWEIENYPEGEKSEMVEIYSLKGLSEEV 99


>gnl|CDD|114603 pfam05887, Trypan_PARP, Procyclic acidic repetitive protein (PARP).
            This family consists of several Trypanosoma brucei
           procyclic acidic repetitive protein (PARP) like
           sequences. The procyclic acidic repetitive protein
           (parp) genes of Trypanosoma brucei encode a small family
           of abundant surface proteins whose expression is
           restricted to the procyclic form of the parasite. They
           are found at two unlinked loci, parpA and parpB;
           transcription of both loci is developmentally regulated.
          Length = 145

 Score = 27.2 bits (59), Expect = 5.3
 Identities = 25/62 (40%), Positives = 35/62 (56%)

Query: 175 KEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNE 234
            +++  EEEE E EEE +++   EEE EEE + EE   E+ E E E E E E   + + E
Sbjct: 60  PDDEPEEEEEPEPEEEGEEEPEPEEEGEEEPEPEETGEEEPEPEPEPEPEPEPEPEPEPE 119

Query: 235 EE 236
            E
Sbjct: 120 PE 121



 Score = 26.8 bits (58), Expect = 6.7
 Identities = 25/59 (42%), Positives = 35/59 (59%)

Query: 182 EEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEE 240
           ++E EEEEE + ++  EEE E EE+ EE+   +E GE E E E E   + + E E E E
Sbjct: 61  DDEPEEEEEPEPEEEGEEEPEPEEEGEEEPEPEETGEEEPEPEPEPEPEPEPEPEPEPE 119



 Score = 26.5 bits (57), Expect = 8.9
 Identities = 27/77 (35%), Positives = 39/77 (50%)

Query: 169 AGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVV 228
           A +    + D+  EEE E E E + ++  E EEE EE+ E +E  +EE E E E E E  
Sbjct: 52  ADDTNGTDPDDEPEEEEEPEPEEEGEEEPEPEEEGEEEPEPEETGEEEPEPEPEPEPEPE 111

Query: 229 KQKKNEEEEEEEEVLIK 245
            + + E E E     +K
Sbjct: 112 PEPEPEPEPEPGAATLK 128


>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200).  This
           family is found in eukaryotes. It is a coiled-coil
           domain of unknwon function.
          Length = 126

 Score = 26.8 bits (60), Expect = 5.4
 Identities = 22/79 (27%), Positives = 53/79 (67%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQK 231
           E+++E +E++   + + EE ++++   ++ EEE +K+E+E+++   + ++  +E   K++
Sbjct: 4   EKRREMEEVQLALDAKREEFERREELLKQREEELEKKEEELQESLIKFDKFLKENEAKRR 63

Query: 232 KNEEEEEEEEVLIKLEEEE 250
           + E++ EEE+ L K +EEE
Sbjct: 64  RAEKKAEEEKKLRKEKEEE 82


>gnl|CDD|216546 pfam01516, Orbi_VP6, Orbivirus helicase VP6.  The VP6 protein a
           minor protein in the core of the virion is probably the
           viral helicase.
          Length = 322

 Score = 27.7 bits (61), Expect = 5.4
 Identities = 17/40 (42%), Positives = 24/40 (60%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEED 210
           E+E K+ED  E+  ++ E E KQK  KEE ++E E    D
Sbjct: 35  EKEPKDEDGQEQRISDGEGEQKQKGGKEESDKETEDASVD 74


>gnl|CDD|226246 COG3723, RecT, Recombinational DNA repair protein (RecE pathway)
           [DNA replication, recombination, and repair].
          Length = 276

 Score = 27.9 bits (62), Expect = 5.4
 Identities = 16/61 (26%), Positives = 29/61 (47%)

Query: 178 DEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEE 237
            E++    E+E E ++  +   EE+ E+   E   E E  E +EE E+ +   ++ E   
Sbjct: 215 IEMQTAVTEDEAERREPLDVTPEEDSEQLDSEPSGEPEAQETDEENEKPIDVSEEKEPIT 274

Query: 238 E 238
           E
Sbjct: 275 E 275


>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2.  It is vital for
           effective cell-replication that replication is not
           stalled at any point by, for instance, damaged bases.
           Rtf2 stabilizes the replication fork stalled at the
           site-specific replication barrier RTS1 by preventing
           replication restart until completion of DNA synthesis by
           a converging replication fork initiated at a flanking
           origin. The RTS1 element terminates replication forks
           that are moving in the cen2-distal direction while
           allowing forks moving in the cen2-proximal direction to
           pass through the region. Rtf2 contains a C2HC2 motif
           related to the C3HC4 RING-finger motif, and would appear
           to fold up, creating a RING finger-like structure but
           forming only one functional Zn2+ ion-binding site.
          Length = 254

 Score = 27.7 bits (62), Expect = 5.5
 Identities = 15/53 (28%), Positives = 23/53 (43%), Gaps = 1/53 (1%)

Query: 185 NEEEEEVKQKKNKEEEEEEEEKKE-EDEIEKEEGENEEEEEEEVVKQKKNEEE 236
           N  EEEV+  K + EEE  ++KK+ + +  K+        E  V      E  
Sbjct: 163 NPTEEEVELLKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELS 215


>gnl|CDD|227543 COG5218, YCG1, Chromosome condensation complex Condensin, subunit G
           [Chromatin structure and dynamics / Cell division and
           chromosome partitioning].
          Length = 885

 Score = 28.0 bits (62), Expect = 5.5
 Identities = 10/61 (16%), Positives = 24/61 (39%)

Query: 190 EVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEE 249
           E      + +EE  + KK+ + +  ++     +    V+  +K++    +EE        
Sbjct: 825 EQDDGDPRSQEELCDVKKDLENMSIQDTSTVAKNVSFVLPDEKSDAMSIDEEDKDSESFS 884

Query: 250 E 250
           E
Sbjct: 885 E 885


>gnl|CDD|219759 pfam08241, Methyltransf_11, Methyltransferase domain.  Members of
           this family are SAM dependent methyltransferases.
          Length = 92

 Score = 26.5 bits (59), Expect = 5.6
 Identities = 24/102 (23%), Positives = 40/102 (39%), Gaps = 21/102 (20%)

Query: 54  LDVGTGTGK-SILLQGHGVDHHSFCTKVHPLD------HYAPQYLILEVIQNKIENVELP 106
           LDVG GTG  +  L            +V  +D        A +    + +    E++  P
Sbjct: 1   LDVGCGTGLLAEAL------ARRGGARVTGVDLSPEMLALARKRAPRKFVVGDAEDLPFP 54

Query: 107 EK-VDIIVSEWMGFYLLHESMIDSVI--FARDKFLKPEGVMY 145
           ++  D++VS  +   L H    +  +   AR   LKP G + 
Sbjct: 55  DESFDVVVSSLV---LHHLPDPERALREIAR--VLKPGGKLV 91


>gnl|CDD|145361 pfam02161, Prog_receptor, Progesterone receptor. 
          Length = 456

 Score = 28.1 bits (62), Expect = 5.6
 Identities = 8/28 (28%), Positives = 15/28 (53%)

Query: 199 EEEEEEEKKEEDEIEKEEGENEEEEEEE 226
           + +EE ++K +D+    + E  E E E 
Sbjct: 54  QGQEEPDEKTQDQQSLSDVERAEPEVEA 81


>gnl|CDD|218311 pfam04888, SseC, Secretion system effector C (SseC) like family.
           SseC is a secreted protein that forms a complex together
           with SecB and SecD on the surface of Salmonella. All
           these proteins are secreted by the type III secretion
           system. Many mucosal pathogens use type III secretion
           systems for the injection of effector proteins into
           target cells. SecB, SseC and SecD are inserted into the
           target cell membrane. where they form a small pore or
           translocon. In addition to SseC, this family includes
           the bacterial secreted proteins PopB, PepB, YopB and
           EspD which are thought to be directly involved in pore
           formation, and type III secretion system translocon.
          Length = 303

 Score = 27.8 bits (62), Expect = 5.7
 Identities = 8/34 (23%), Positives = 20/34 (58%)

Query: 174 KKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKK 207
            ++ +   +++ ++ EE +++  K  E+ EE KK
Sbjct: 21  LQQLERARDKQEKKAEEYQEQIKKAIEKAEEAKK 54


>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
           Provisional.
          Length = 695

 Score = 28.0 bits (63), Expect = 5.9
 Identities = 18/86 (20%), Positives = 33/86 (38%), Gaps = 9/86 (10%)

Query: 173 EKKEEDEIEEEENEEE------EEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEE 226
            K E   IE+E+ + E      E  + +  +E+   E   K+  E    + ++       
Sbjct: 434 AKAEIRAIEQEKKKAEEAKARFEARQARLEREKAAREARHKKAAEARAAKDKDAVAAALA 493

Query: 227 VVKQKKNEEEEEEEEVLIKLEEEEEN 252
            VK KK       + ++IK     +N
Sbjct: 494 RVKAKK---AAATQPIVIKAGARPDN 516


>gnl|CDD|184416 PRK13955, mscL, large-conductance mechanosensitive channel;
           Provisional.
          Length = 130

 Score = 27.1 bits (60), Expect = 5.9
 Identities = 10/25 (40%), Positives = 16/25 (64%)

Query: 191 VKQKKNKEEEEEEEEKKEEDEIEKE 215
           +  KK +E+EEE  E  +E+E+  E
Sbjct: 93  LTSKKEEEKEEEIPEPTKEEELLGE 117


>gnl|CDD|234689 PRK00216, ubiE, ubiquinone/menaquinone biosynthesis
          methyltransferase; Reviewed.
          Length = 239

 Score = 27.4 bits (62), Expect = 6.0
 Identities = 7/21 (33%), Positives = 10/21 (47%)

Query: 46 NIFAGKTVLDVGTGTGKSILL 66
           +  G  VLD+  GTG   + 
Sbjct: 48 GVRPGDKVLDLACGTGDLAIA 68


>gnl|CDD|220838 pfam10659, Trypan_glycop_C, Trypanosome variant surface
           glycoprotein C-terminal domain.  The trypanosome
           parasite expresses these proteins to evade the immune
           response.
          Length = 98

 Score = 26.6 bits (59), Expect = 6.1
 Identities = 10/72 (13%), Positives = 32/72 (44%), Gaps = 5/72 (6%)

Query: 171 EEEKKEEDEIEEEE---NEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGEN--EEEEEE 225
            ++ K + E +E+    +++E++ K K  + + ++      +    +   E    +++++
Sbjct: 2   NKKNKTKTECKEKGCKWDKKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKK 61

Query: 226 EVVKQKKNEEEE 237
           +  K  K E   
Sbjct: 62  DCKKGCKWEGNT 73


>gnl|CDD|235370 PRK05244, PRK05244, Der GTPase activator; Provisional.
          Length = 177

 Score = 27.2 bits (61), Expect = 6.1
 Identities = 24/78 (30%), Positives = 42/78 (53%), Gaps = 6/78 (7%)

Query: 180 IEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEI------EKEEGENEEEEEEEVVKQKKN 233
           +EE+   ++++ K KK K   E+E EK E DE         E GE    E+++ V +K +
Sbjct: 76  VEEKVKPKKKKPKSKKPKLSPEQELEKLENDERLNALLDRLEAGETLSAEDQKWVDEKLD 135

Query: 234 EEEEEEEEVLIKLEEEEE 251
             +E  E++ I  +++EE
Sbjct: 136 RIDELMEKLGISDDDDEE 153


>gnl|CDD|225281 COG2425, COG2425, Uncharacterized protein containing a von
           Willebrand factor type A (vWA) domain [General function
           prediction only].
          Length = 437

 Score = 27.7 bits (62), Expect = 6.3
 Identities = 22/110 (20%), Positives = 43/110 (39%), Gaps = 12/110 (10%)

Query: 155 PSYSPEIFKFWENIAGEEEKKEEDEIEEE-ENEEEEEVKQKKNKEEEEEEE------EKK 207
           P    E+F++ E+ +  E   +   I +   ++    VK+     E+          E+ 
Sbjct: 53  PHLLEEVFRYQESNSEPEFLVQLPFILKLLSSDVYRRVKEATRLNEKHSLSAPESFLERW 112

Query: 208 EEDEIEKEEGENEEEEEEE-----VVKQKKNEEEEEEEEVLIKLEEEEEN 252
           EE   + +   +E+  E E       KQ++     E E +L ++EE    
Sbjct: 113 EELLQDLQREGSEDFLEGEREGLLSEKQEEISLSGEMEGILEEVEELISG 162


>gnl|CDD|227441 COG5110, RPN1, 26S proteasome regulatory complex component
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 881

 Score = 28.0 bits (62), Expect = 6.8
 Identities = 17/48 (35%), Positives = 30/48 (62%), Gaps = 1/48 (2%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGE 218
           E +KK++   E+ +   E++   KK+K++EEEE+   EED + K + E
Sbjct: 4   ESDKKQQTIDEQSQISPEKQTPNKKDKKKEEEEQ-LSEEDAMLKGDLE 50


>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family.  This family of
           proteins contain a band 4.1 domain (pfam00373), at their
           amino terminus. This family represents the rest of these
           proteins.
          Length = 244

 Score = 27.4 bits (61), Expect = 6.8
 Identities = 25/80 (31%), Positives = 43/80 (53%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           EE+ +   +  EE  E   E+++K  +EEEE +  +K+ DE+E+E    EEE      ++
Sbjct: 18  EEDMRRAQKELEEYEETALELEEKLKQEEEEAQLLEKKADELEEENRRLEEEAAASEEER 77

Query: 231 KKNEEEEEEEEVLIKLEEEE 250
           ++ E E +E    +   EEE
Sbjct: 78  ERLEAEVDEATAEVAKLEEE 97


>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
          Length = 106

 Score = 26.5 bits (59), Expect = 6.8
 Identities = 13/28 (46%), Positives = 17/28 (60%)

Query: 198 EEEEEEEEKKEEDEIEKEEGENEEEEEE 225
                 EEKKEE+E E+E+ E+EEE   
Sbjct: 72  AAAAAAEEKKEEEEEEEEKEESEEEAAA 99



 Score = 26.1 bits (58), Expect = 8.2
 Identities = 15/22 (68%), Positives = 17/22 (77%)

Query: 169 AGEEEKKEEDEIEEEENEEEEE 190
           A  EEKKEE+E EEE+ E EEE
Sbjct: 75  AAAEEKKEEEEEEEEKEESEEE 96


>gnl|CDD|223542 COG0466, Lon, ATP-dependent Lon protease, bacterial type
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 782

 Score = 27.9 bits (63), Expect = 6.8
 Identities = 11/40 (27%), Positives = 27/40 (67%)

Query: 175 KEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEK 214
           ++E   ++++ +E EE+++K  K +  +E ++K E E++K
Sbjct: 239 QKELGEDDDDKDEVEELREKIEKLKLPKEAKEKAEKELKK 278


>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
           with the myosin-like domain [Function unknown].
          Length = 499

 Score = 27.7 bits (61), Expect = 6.9
 Identities = 9/69 (13%), Positives = 27/69 (39%), Gaps = 3/69 (4%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKK---EEDEIEKEEGENEEEEEEEVV 228
              + E    + E    E  ++    E ++  +E++   +E    ++     ++E   + 
Sbjct: 84  RALRTELGTAQGEKRAAETEREAARSELQKARQEREAVRQELAAARQNLAKAQQELARLT 143

Query: 229 KQKKNEEEE 237
           KQ ++ +  
Sbjct: 144 KQAQDLQTR 152



 Score = 27.3 bits (60), Expect = 8.3
 Identities = 13/77 (16%), Positives = 36/77 (46%), Gaps = 3/77 (3%)

Query: 176 EEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEE 235
           + D+I  +      E+   + ++   E E +    E++K   +  E   +E+   ++N  
Sbjct: 75  QLDDIRPQLRALRTELGTAQGEKRAAETEREAARSELQK-ARQEREAVRQELAAARQNLA 133

Query: 236 EEEEEEVLIKLEEEEEN 252
           + ++E  L +L ++ ++
Sbjct: 134 KAQQE--LARLTKQAQD 148


>gnl|CDD|178439 PLN02847, PLN02847, triacylglycerol lipase.
          Length = 633

 Score = 27.9 bits (62), Expect = 6.9
 Identities = 25/96 (26%), Positives = 39/96 (40%), Gaps = 17/96 (17%)

Query: 171 EEEKKEEDEIEEEENEEEEEVKQKKNK----------------EEEEEEEEKKEEDEIEK 214
           +E+  E   +    N E    + K  K                ++EEEEE    ED +  
Sbjct: 428 KEDLPEATHVTSSVNSESLVTEVKTTKSVEHKSESSSSDGSGHDDEEEEEPLLSEDRVIT 487

Query: 215 EEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEE 250
              E EE  E E+  + + E + +E EV  + +EEE
Sbjct: 488 SSVE-EEVTEGELWYELEKELQRQETEVDAQAQEEE 522


>gnl|CDD|237869 PRK14962, PRK14962, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 472

 Score = 27.8 bits (62), Expect = 7.0
 Identities = 11/44 (25%), Positives = 27/44 (61%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEE 209
                + E+K ++   +++ +++EE K K+ K+E+ E E++ +E
Sbjct: 338 NVQENDVEEKNDNSNVQQKEKKKEESKAKEEKQEDIEFEKRFKE 381



 Score = 27.4 bits (61), Expect = 7.2
 Identities = 21/58 (36%), Positives = 32/58 (55%), Gaps = 1/58 (1%)

Query: 167 NIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEE 224
           +IA         E + EE  +   V+QK+ K+EE + +E+K+ED IE E+   E  EE
Sbjct: 329 SIATRFSSPNVQENDVEEKNDNSNVQQKEKKKEESKAKEEKQED-IEFEKRFKELMEE 385


>gnl|CDD|225139 COG2230, Cfa, Cyclopropane fatty acid synthase and related
           methyltransferases [Cell envelope biogenesis, outer
           membrane].
          Length = 283

 Score = 27.6 bits (62), Expect = 7.1
 Identities = 36/144 (25%), Positives = 50/144 (34%), Gaps = 33/144 (22%)

Query: 35  EAYRAA---ICDNPNIFAGKTVLDVGTGTGK-SILLQGHGVDHHSFCTKVHPLDHYAPQY 90
           EA RA    I +   +  G T+LD+G G G  +I           +   V  +     Q 
Sbjct: 55  EAQRAKLDLILEKLGLKPGMTLLDIGCGWGGLAIYAAEE------YGVTVVGVTLSEEQL 108

Query: 91  LILEVIQNKI------ENVE--------LPEKVDIIVSEWMGFYLLHESMIDSVIFARDK 136
                 + +I      +NVE          E  D IVS  M  ++  E+  D        
Sbjct: 109 ---AYAEKRIAARGLEDNVEVRLQDYRDFEEPFDRIVSVGMFEHVGKEN-YDDFFKKVYA 164

Query: 137 FLKPEGVMYPYKCILHSAPSYSPE 160
            LKP G M     +LHS      E
Sbjct: 165 LLKPGGRM-----LLHSITGPDQE 183


>gnl|CDD|221643 pfam12572, DUF3752, Protein of unknown function (DUF3752).  This
           domain family is found in eukaryotes, and is typically
           between 140 and 163 amino acids in length.
          Length = 148

 Score = 27.0 bits (60), Expect = 7.1
 Identities = 19/77 (24%), Positives = 39/77 (50%), Gaps = 2/77 (2%)

Query: 165 WENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEE 224
           W     E+ K+ +DE+   E              + ++E  +K ++  EK+ G++  E+ 
Sbjct: 35  WTETPEEKAKRLQDEVLGVEAGASAPAAASAKASKRDKEMARKVKEYNEKKRGKSLVEQH 94

Query: 225 EEVVKQKKNEEEEEEEE 241
           ++  KQKK ++EEE ++
Sbjct: 95  QK--KQKKKKKEEENDD 109


>gnl|CDD|235604 PRK05778, PRK05778, 2-oxoglutarate ferredoxin oxidoreductase
           subunit beta; Validated.
          Length = 301

 Score = 27.5 bits (62), Expect = 7.1
 Identities = 13/62 (20%), Positives = 21/62 (33%), Gaps = 1/62 (1%)

Query: 180 IEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEE 239
           ++ EE+ +  +  +   K  EEE   K       K E     EE  E + +   E     
Sbjct: 231 LKLEEDYDPTDRDKAAEKMLEEELGGKIPIGVFYKNERP-TFEERLEKLIEPLLELPPAA 289

Query: 240 EE 241
             
Sbjct: 290 LR 291


>gnl|CDD|219913 pfam08576, DUF1764, Eukaryotic protein of unknown function
           (DUF1764).  This is a family of eukaryotic proteins of
           unknown function. This family contains many hypothetical
           proteins.
          Length = 98

 Score = 26.3 bits (58), Expect = 7.2
 Identities = 15/78 (19%), Positives = 40/78 (51%), Gaps = 3/78 (3%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQK 231
           EEKK E   + + ++    +K++K K++   +  +    +  K+  + +++++E     +
Sbjct: 2   EEKKNEKTDKRDIDDIFSNIKKRKKKKKRTAKTAR---PKATKKGQKKDKKKDEFPEFPE 58

Query: 232 KNEEEEEEEEVLIKLEEE 249
           +++    E+ + I  EEE
Sbjct: 59  ESKRRRTEDGLPIYTEEE 76


>gnl|CDD|237872 PRK14968, PRK14968, putative methyltransferase; Provisional.
          Length = 188

 Score = 27.2 bits (61), Expect = 7.3
 Identities = 8/13 (61%), Positives = 10/13 (76%)

Query: 49 AGKTVLDVGTGTG 61
           G  VL+VGTG+G
Sbjct: 23 KGDRVLEVGTGSG 35


>gnl|CDD|198139 smart01071, CDC37_N, Cdc37 N terminal kinase binding.  Cdc37 is a
           molecular chaperone required for the activity of
           numerous eukaryotic protein kinases. This domain
           corresponds to the N terminal domain which binds
           predominantly to protein kinases.and is found N terminal
           to the Hsp (Heat shocked protein) 90-binding domain.
           Expression of a construct consisting of only the
           N-terminal domain of Saccharomyces pombe Cdc37 results
           in cellular viability. This indicates that interactions
           with the cochaperone Hsp90 may not be essential for
           Cdc37 function.
          Length = 154

 Score = 27.0 bits (60), Expect = 7.3
 Identities = 17/43 (39%), Positives = 30/43 (69%)

Query: 174 KKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEE 216
           KKE +E   +     EE+K+ ++K ++E++E +K+ DE+EKEE
Sbjct: 96  KKELEEANGDSEGLLEELKKHRDKLKKEQKELRKKLDELEKEE 138



 Score = 26.6 bits (59), Expect = 9.5
 Identities = 13/45 (28%), Positives = 27/45 (60%)

Query: 188 EEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKK 232
           ++E+++     E   EE KK  D+++KE+ E  ++ +E   ++KK
Sbjct: 96  KKELEEANGDSEGLLEELKKHRDKLKKEQKELRKKLDELEKEEKK 140


>gnl|CDD|224106 COG1185, Pnp, Polyribonucleotide nucleotidyltransferase
           (polynucleotide phosphorylase) [Translation, ribosomal
           structure and biogenesis].
          Length = 692

 Score = 27.5 bits (62), Expect = 7.4
 Identities = 22/92 (23%), Positives = 43/92 (46%), Gaps = 3/92 (3%)

Query: 160 EIFKFWENIAGEEEKKEEDEIEEEENEE-EEEVKQKKNKEEEEEEEEKKEEDEIEKEEGE 218
            +    E +A E  KK+ +      +EE E +V+     +E +E    +E+ E       
Sbjct: 213 SVINAQEELALEVGKKKWELEPPSLDEELEAKVRDLAE-DELKEAVGIREKQERSAALDA 271

Query: 219 NEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEE 250
            +E+ EEE+  + +    +E + +L KLE++ 
Sbjct: 272 IKEKIEEELSGE-EESSLKEIKAILEKLEKKP 302


>gnl|CDD|218010 pfam04293, SpoVR, SpoVR like protein.  Bacillus subtilis stage V
           sporulation protein R is involved in spore cortex
           formation. Little is known about cortex biosynthesis,
           except that it depends on several sigma E controlled
           genes, including spoVR.
          Length = 427

 Score = 27.5 bits (62), Expect = 7.6
 Identities = 13/48 (27%), Positives = 29/48 (60%)

Query: 192 KQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEE 239
           +++K ++EE EE  + + +++ +      E+ EEE  ++K+  EE +E
Sbjct: 178 EEEKARQEEREEYLQSQVNDLWRTIPPQREKLEEERARRKRFPEEPQE 225


>gnl|CDD|220839 pfam10661, EssA, WXG100 protein secretion system (Wss), protein
           EssA.  The WXG100 protein secretion system (Wss) is
           responsible for the secretion of WXG100 proteins
           (pfam06013) such as ESAT-6 and CFP-10 in Mycobacterium
           tuberculosis or EsxA and EsxB in Staphylococcus aureus.
           In S. aureus, the Wss seems to be encoded by a locus of
           eight CDS, called ess (eSAT-6 secretion system). This
           locus encodes, amongst several other proteins, EssA, a
           protein predicted to possess one transmembrane domain.
           Due to its predicted membrane location and its absolute
           requirement for WXG100 protein secretion, it has been
           speculated that EssA could form a secretion apparatus in
           conjunction with the polytopic membrane protein EsaA,
           YukC (pfam10140) and YukAB, which is a membrane-bound
           ATPase containing Ftsk/SpoIIIE domains (pfam01580)
           called EssC in S. aureus and Snm1/Snm2 in Mycobacterium
           tuberculosis. Proteins homologous to EssA, YukC, EsaA
           and YukD seem absent from mycobacteria.
          Length = 145

 Score = 26.8 bits (59), Expect = 7.6
 Identities = 19/54 (35%), Positives = 32/54 (59%)

Query: 197 KEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKLEEEE 250
           K ++E+ E+K  E E++K   E    E EE + +KKN E++E E++   L  E+
Sbjct: 24  KTDQEKNEKKLRETELDKLGIELFTTETEEEINKKKNAEQKEMEDIENSLFSED 77


>gnl|CDD|218806 pfam05911, DUF869, Plant protein of unknown function (DUF869).
           This family consists of a number of sequences found in
           Arabidopsis thaliana, Oryza sativa and Lycopersicon
           esculentum (Tomato). The function of this family is
           unknown.
          Length = 767

 Score = 27.6 bits (61), Expect = 7.6
 Identities = 24/80 (30%), Positives = 43/80 (53%), Gaps = 1/80 (1%)

Query: 170 GEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVK 229
              E  E  E+  ++ E E +  Q+K    E+E E++++  + E      E EE+ E  +
Sbjct: 654 CMVESYESLELRAKDLEAEHKSLQEKISSLEDELEKERQNHQ-ELIAKCRELEEKIERAE 712

Query: 230 QKKNEEEEEEEEVLIKLEEE 249
           Q++N ++ +EEE  IK E+E
Sbjct: 713 QEENMQKLDEEEQKIKQEKE 732


>gnl|CDD|146486 pfam03879, Cgr1, Cgr1 family.  Members of this family are
           coiled-coil proteins that are involved in pre-rRNA
           processing.
          Length = 105

 Score = 26.2 bits (58), Expect = 7.6
 Identities = 12/46 (26%), Positives = 27/46 (58%)

Query: 179 EIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEE 224
           E   E+  E++ +K ++ + ++E+E E++   +  KE    +EE+E
Sbjct: 31  EKRMEKRLEQQAIKAREKELKDEKEAERQRRIQAIKERRAAKEEKE 76


>gnl|CDD|236154 PRK08119, PRK08119, flagellar motor switch protein; Validated.
          Length = 382

 Score = 27.5 bits (62), Expect = 7.7
 Identities = 13/54 (24%), Positives = 24/54 (44%), Gaps = 2/54 (3%)

Query: 189 EEVKQ--KKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEE 240
           E V     + +EEEEE EE++ +     E    +     +  +Q+   + +E E
Sbjct: 218 ELVAILLGEEEEEEEEVEEEEAQASPAAEPATAQAAPAPKQEQQQAPPQRQEPE 271


>gnl|CDD|239570 cd03488, Topoisomer_IB_N_htopoI_like, Topoisomer_IB_N_htopoI_like :
           N-terminal DNA binding fragment found in eukaryotic DNA
           topoisomerase (topo) IB proteins similar to the
           monomeric yeast and human topo I.  Topo I enzymes are
           divided into:  topo type IA (bacterial) and type IB
           (eukaryotic). Topo I relaxes superhelical tension in
           duplex DNA by creating a single-strand nick, the broken
           strand can then rotate around the unbroken strand to
           remove DNA supercoils and, the nick is religated,
           liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit religation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  This family may represent more than
           one structural domain.
          Length = 215

 Score = 27.3 bits (61), Expect = 7.7
 Identities = 9/30 (30%), Positives = 21/30 (70%)

Query: 188 EEEVKQKKNKEEEEEEEEKKEEDEIEKEEG 217
           + + ++KK   +EE++  K E++++E+E G
Sbjct: 95  KAQKEEKKAMSKEEKKAIKAEKEKLEEEYG 124


>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin.  Trichoplein
           or mitostatin, was first defined as a meiosis-specific
           nuclear structural protein. It has since been linked
           with mitochondrial movement. It is associated with the
           mitochondrial outer membrane, and over-expression leads
           to reduction in mitochondrial motility whereas lack of
           it enhances mitochondrial movement. The activity appears
           to be mediated through binding the mitochondria to the
           actin intermediate filaments (IFs).
          Length = 349

 Score = 27.6 bits (62), Expect = 7.7
 Identities = 26/73 (35%), Positives = 45/73 (61%)

Query: 179 EIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEE 238
            I+EE+  E +E ++K+ K  EE +E  +E  E ++EE E E EEE ++++ ++ + E E
Sbjct: 107 RIQEEDEAEAQEKREKQKKLREEIDEFNEERIERKEEEKEREREEELKILEYQREKAERE 166

Query: 239 EEEVLIKLEEEEE 251
           EE    + E +EE
Sbjct: 167 EEREAERRERKEE 179


>gnl|CDD|224796 COG1884, Sbm, Methylmalonyl-CoA mutase, N-terminal domain/subunit
           [Lipid metabolism].
          Length = 548

 Score = 27.7 bits (62), Expect = 7.8
 Identities = 18/80 (22%), Positives = 32/80 (40%), Gaps = 11/80 (13%)

Query: 178 DEIEEEENEEEEEVKQKK------NKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQK 231
            EI E     +  +++ +      N+  EE  EE  E  +++      E   E ++ + K
Sbjct: 434 REIAESAARRQARIEEGERVIVGVNEYPEEGPEEPVEILKVD-----PEAVRERQIARLK 488

Query: 232 KNEEEEEEEEVLIKLEEEEE 251
           K   E +EE V   L    +
Sbjct: 489 KVRAERDEEAVEAALAALRK 508


>gnl|CDD|184696 PRK14474, PRK14474, F0F1 ATP synthase subunit B; Provisional.
          Length = 250

 Score = 27.1 bits (60), Expect = 7.9
 Identities = 12/80 (15%), Positives = 38/80 (47%), Gaps = 3/80 (3%)

Query: 174 KKEEDEIEE---EENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           KK +  I     +  + ++E  Q+  +  ++++  +++      +  E  +E+ + ++ +
Sbjct: 35  KKRQQRIANRWQDAEQRQQEAGQEAERYRQKQQSLEQQRASFMAQAQEAADEQRQHLLNE 94

Query: 231 KKNEEEEEEEEVLIKLEEEE 250
            + +     +E L +LE E+
Sbjct: 95  AREDVATARDEWLEQLEREK 114


>gnl|CDD|234715 PRK00290, dnaK, molecular chaperone DnaK; Provisional.
          Length = 627

 Score = 27.4 bits (62), Expect = 8.0
 Identities = 19/82 (23%), Positives = 36/82 (43%), Gaps = 21/82 (25%)

Query: 188 EEEVKQKKNKEEEEEEEEKKEEDEIE----------------KEEGENEEEEEEEVVKQK 231
           +EE+++     E   EE+KK ++ +E                KE G+    +E+E ++  
Sbjct: 502 DEEIERMVKDAEANAEEDKKRKELVEARNQADSLIYQTEKTLKELGDKVPADEKEKIEAA 561

Query: 232 KNE-----EEEEEEEVLIKLEE 248
             E     + E++E +  K EE
Sbjct: 562 IKELKEALKGEDKEAIKAKTEE 583


>gnl|CDD|219901 pfam08555, DUF1754, Eukaryotic family of unknown function
           (DUF1754).  This is a eukaryotic protein family of
           unknown function.
          Length = 90

 Score = 25.8 bits (57), Expect = 8.0
 Identities = 18/49 (36%), Positives = 30/49 (61%)

Query: 192 KQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEE 240
           K+KK K+ + +EE   E++E EK   E++ +E EE    +K E+EE+  
Sbjct: 23  KKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEEDEDNEKIEQEEDGM 71


>gnl|CDD|237898 PRK15068, PRK15068, tRNA mo(5)U34 methyltransferase; Provisional.
          Length = 322

 Score = 27.1 bits (61), Expect = 8.1
 Identities = 19/83 (22%), Positives = 30/83 (36%), Gaps = 20/83 (24%)

Query: 49  AGKTVLDVGTGTG-----------KSILLQGHGVD--HHSFCTKVHPLDHYAPQYLILEV 95
            G+TVLDVG G G           K ++    G+D      C +   +           +
Sbjct: 122 KGRTVLDVGCGNGYHMWRMLGAGAKLVV----GIDPSQLFLC-QFEAVRKLLGNDQRAHL 176

Query: 96  IQNKIENVELPEKVDIIVSEWMG 118
           +   IE +   +  D + S  MG
Sbjct: 177 LPLGIEQLPALKAFDTVFS--MG 197


>gnl|CDD|116948 pfam08367, M16C_assoc, Peptidase M16C associated.  This domain
           appears in eukaryotes as well as bacteria and tends to
           be found near the C-terminus of the metalloprotease M16C
           (pfam05193).
          Length = 248

 Score = 27.2 bits (61), Expect = 8.1
 Identities = 18/44 (40%), Positives = 26/44 (59%)

Query: 179 EIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEE 222
            +EE+ NEEE E  +KK     EE++EK  E  +E EE ++  E
Sbjct: 6   TLEEKLNEEEREKLEKKKSSLSEEDKEKIIERGLELEELQSTPE 49


>gnl|CDD|227499 COG5171, YRB1, Ran GTPase-activating protein (Ran-binding protein)
           [Intracellular trafficking and secretion].
          Length = 211

 Score = 26.9 bits (59), Expect = 8.2
 Identities = 15/58 (25%), Positives = 25/58 (43%)

Query: 189 EEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIKL 246
              ++KK ++    E    E DE +  E  N   E    +++   +  EE+E VL K 
Sbjct: 38  AGGEEKKVQQSPFLENAVPEGDEGKGPESPNIHFEPVVELQRVHLKTNEEDETVLFKA 95


>gnl|CDD|218597 pfam05466, BASP1, Brain acid soluble protein 1 (BASP1 protein).
           This family consists of several brain acid soluble
           protein 1 (BASP1) or neuronal axonal membrane protein
           NAP-22. The BASP1 is a neuron enriched Ca(2+)-dependent
           calmodulin-binding protein of unknown function.
          Length = 233

 Score = 27.1 bits (59), Expect = 8.3
 Identities = 19/68 (27%), Positives = 33/68 (48%)

Query: 173 EKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKK 232
           E  E  E +EE+ +++ +    K +E+E E+E    ++E  K E E  E   E   +  K
Sbjct: 47  ETTEVKEAKEEKPDKDAQDTANKTEEKEGEKEAAAAKEEAPKAEPEKTEGAAEAKAEPPK 106

Query: 233 NEEEEEEE 240
             + E+E 
Sbjct: 107 ASDPEQEP 114


>gnl|CDD|218637 pfam05558, DREPP, DREPP plasma membrane polypeptide.  This family
           contains several plant plasma membrane proteins termed
           DREPPs as they are developmentally regulated plasma
           membrane polypeptides.
          Length = 201

 Score = 26.9 bits (59), Expect = 8.7
 Identities = 20/58 (34%), Positives = 29/58 (50%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVK 229
           EEKK E+E    E E++  VK+K+   EEE++EE+             E + EE   K
Sbjct: 132 EEKKPEEEAPAAETEKKPAVKEKEIVVEEEKKEEEAVPVAAAPSTKAPETKVEETPAK 189


>gnl|CDD|184064 PRK13461, PRK13461, F0F1 ATP synthase subunit B; Provisional.
          Length = 159

 Score = 26.6 bits (59), Expect = 8.7
 Identities = 23/91 (25%), Positives = 46/91 (50%), Gaps = 2/91 (2%)

Query: 164 FWENIAG--EEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEE 221
           F++ I    +  + E D   E+ +E++++ ++ K K E E +  K+E  +I +E     E
Sbjct: 26  FFDKIKAVIDSRQSEIDNKIEKADEDQKKARELKLKNERELKNAKEEGKKIVEEYKSKAE 85

Query: 222 EEEEEVVKQKKNEEEEEEEEVLIKLEEEEEN 252
              EE+VK+   E +   E   ++ + E+E 
Sbjct: 86  NVYEEIVKEAHEEADLIIERAKLEAQREKEK 116



 Score = 26.6 bits (59), Expect = 9.3
 Identities = 21/79 (26%), Positives = 43/79 (54%), Gaps = 2/79 (2%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEE--EEEKKEEDEIEKEEGENEEEE 223
             I  + EK +ED+ +  E + + E + K  KEE ++  EE K + + + +E  +   EE
Sbjct: 39  SEIDNKIEKADEDQKKARELKLKNERELKNAKEEGKKIVEEYKSKAENVYEEIVKEAHEE 98

Query: 224 EEEVVKQKKNEEEEEEEEV 242
            + ++++ K E + E+E+ 
Sbjct: 99  ADLIIERAKLEAQREKEKA 117


>gnl|CDD|109486 pfam00430, ATP-synt_B, ATP synthase B/B' CF(0).  Part of the CF(0)
           (base unit) of the ATP synthase. The base unit is
           thought to translocate protons through membrane (inner
           membrane in mitochondria, thylakoid membrane in plants,
           cytoplasmic membrane in bacteria). The B subunits are
           thought to interact with the stalk of the CF(1)
           subunits. This domain should not be confused with the ab
           CF(1) proteins (in the head of the ATP synthase) which
           are found in pfam00006.
          Length = 132

 Score = 26.5 bits (59), Expect = 8.7
 Identities = 21/85 (24%), Positives = 39/85 (45%), Gaps = 4/85 (4%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEE 225
            NI   EE+ ++         E E+   +   E  E     K+E +  KEE   E +++ 
Sbjct: 37  NNIKEAEERLKQ---AAALLAEAEQQLAQARAEASEIINNAKKEAQKLKEEILAEAQKDA 93

Query: 226 EVVKQKKNEE-EEEEEEVLIKLEEE 249
           E + +    E E+E+E+ L +L ++
Sbjct: 94  ERLLESARAEIEQEKEQALAELRQQ 118


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 27.3 bits (61), Expect = 8.7
 Identities = 25/100 (25%), Positives = 47/100 (47%), Gaps = 17/100 (17%)

Query: 163  KFWENIAGEEEKKEEDEIEEEENEEEEEVKQ-----------------KKNKEEEEEEEE 205
            K  +  A EEE  EED+  ++E++EEE                     K  K   E E++
Sbjct: 1055 KSEKITAEEEEGAEEDDEADDEDDEEELGAAVSYDYLLSMPIWSLTKEKVEKLNAELEKK 1114

Query: 206  KKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEEVLIK 245
            +KE ++++    ++   E+ +  ++   E+EE EE+ + K
Sbjct: 1115 EKELEKLKNTTPKDMWLEDLDKFEEALEEQEEVEEKEIAK 1154


>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein.  Members of this
           protein family have a signal peptide, a strongly
           conserved SH3 domain, a variable region, and then a
           C-terminal hydrophobic transmembrane alpha helix region.
          Length = 198

 Score = 26.9 bits (60), Expect = 8.8
 Identities = 26/83 (31%), Positives = 41/83 (49%), Gaps = 10/83 (12%)

Query: 171 EEEKKEEDEIEEEENE---EEEEVKQKKNKEEEEEEEEKKEEDEIEKEEG-------ENE 220
            E ++E  E++E+  E   E +E+KQ+ +  E E E  +KE   I++          EN 
Sbjct: 76  AELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAIELDEENR 135

Query: 221 EEEEEEVVKQKKNEEEEEEEEVL 243
           E  EE    +++NE  E E E L
Sbjct: 136 ELREELAELKQENEALEAENERL 158


>gnl|CDD|181354 PRK08287, PRK08287, cobalt-precorrin-6Y C(15)-methyltransferase;
          Validated.
          Length = 187

 Score = 26.9 bits (60), Expect = 9.1
 Identities = 11/33 (33%), Positives = 17/33 (51%), Gaps = 1/33 (3%)

Query: 35 EAYRAAICDNPNIFAGKTVLDVGTGTGKSILLQ 67
          E  RA       +   K ++DVG GTG S+ ++
Sbjct: 17 EEVRALALSKLELHRAKHLIDVGAGTG-SVSIE 48


>gnl|CDD|184859 PRK14857, tatA, twin arginine translocase protein A; Provisional.
          Length = 90

 Score = 25.8 bits (57), Expect = 9.2
 Identities = 14/43 (32%), Positives = 22/43 (51%)

Query: 174 KKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEE 216
           K+ E+EI+ E  E E+EVK     E  +  E  +E+ + E   
Sbjct: 46  KEFENEIKREMAEPEQEVKAPAELESNQTNEANQEQVDAENSP 88


>gnl|CDD|163064 TIGR02894, DNA_bind_RsfA, transcription factor, RsfA family.  In a
           subset of endospore-forming members of the Firmcutes,
           members of this protein family are found, several to a
           genome. Two very strongly conserved sequences regions
           are separated by a highly variable linker region. Much
           of the linker region was excised from the seed alignment
           for this model. A characterized member is the
           prespore-specific transcription RsfA from Bacillus
           subtilis, previously called YwfN, which is controlled by
           sigma factor F and seems to fine-tune expression of some
           genes in the sigma-F regulon. A paralog in Bacillus
           subtilis is designated YlbO [Regulatory functions, DNA
           interactions, Cellular processes, Sporulation and
           germination].
          Length = 161

 Score = 26.7 bits (59), Expect = 9.2
 Identities = 13/58 (22%), Positives = 29/58 (50%)

Query: 159 PEIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEE 216
            ++  F +N+       +  + E E  + + E  QK+N+E E+E E+ ++     +E+
Sbjct: 83  QDVISFLQNLKTTNPSDQALQKENERLKNQNESLQKRNEELEKELEKLRQRLSTIEED 140


>gnl|CDD|225664 COG3122, COG3122, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 215

 Score = 26.8 bits (59), Expect = 9.4
 Identities = 9/30 (30%), Positives = 18/30 (60%)

Query: 168 IAGEEEKKEEDEIEEEENEEEEEVKQKKNK 197
           IA EE K E+ E +++ +E++++    K  
Sbjct: 80  IAAEENKAEQLERDKQLSEQQKQAALAKEY 109


>gnl|CDD|223250 COG0172, SerS, Seryl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 429

 Score = 27.2 bits (61), Expect = 9.4
 Identities = 15/59 (25%), Positives = 32/59 (54%), Gaps = 1/59 (1%)

Query: 172 EEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
           +  +E +E++ E NE  +E+ +   + E++ EE   E  E+ KE+ +  E   +E+  +
Sbjct: 40  KLLRELEELQAERNELSKEIGRALKRGEDDAEELIAEVKEL-KEKLKELEAALDELEAE 97


>gnl|CDD|226655 COG4192, COG4192, Signal transduction histidine kinase regulating
           phosphoglycerate transport system [Signal transduction
           mechanisms].
          Length = 673

 Score = 27.2 bits (60), Expect = 9.6
 Identities = 12/50 (24%), Positives = 28/50 (56%)

Query: 181 EEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQ 230
            +  NE+ + ++++  + + E EE ++ E  ++K +GE  +  +  VV Q
Sbjct: 404 LQRVNEKTQSLRERNQELQAEVEERQEAEAHLKKTQGELIQAAKLAVVGQ 453


>gnl|CDD|236995 PRK11824, PRK11824, polynucleotide phosphorylase/polyadenylase;
           Provisional.
          Length = 693

 Score = 27.3 bits (62), Expect = 9.6
 Identities = 19/70 (27%), Positives = 36/70 (51%)

Query: 160 EIFKFWENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGEN 219
           E    WE    E +++ +  ++E    + +E  Q  +K+E E   +  +E+ +E    E 
Sbjct: 225 EAGPKWEWQPPEVDEELKAAVKELAEAKLKEAYQITDKQEREAALDAIKEEVLEALAAEE 284

Query: 220 EEEEEEEVVK 229
           EEEE+E+ +K
Sbjct: 285 EEEEDEKEIK 294


>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
           Provisional.
          Length = 1123

 Score = 27.2 bits (61), Expect = 9.6
 Identities = 13/76 (17%), Positives = 31/76 (40%), Gaps = 1/76 (1%)

Query: 166 ENIAGEEEKKEEDEIEEEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKE-EGENEEEEE 224
           +        ++E    +++ E +   K +     E +++E    + +  E E + +E E 
Sbjct: 139 DPENLLHALQQEVLTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAELEEKQQELEA 198

Query: 225 EEVVKQKKNEEEEEEE 240
           +    Q+K  E  +E 
Sbjct: 199 QLEQLQEKAAETSQER 214


>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like).  This family
           includes outer membrane proteins such as OmpH among
           others. Skp (OmpH) has been characterized as a molecular
           chaperone that interacts with unfolded proteins as they
           emerge in the periplasm from the Sec translocation
           machinery.
          Length = 140

 Score = 26.4 bits (59), Expect = 9.7
 Identities = 18/80 (22%), Positives = 45/80 (56%), Gaps = 6/80 (7%)

Query: 171 EEEKKEEDEIEEEENE-EEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVK 229
           +E KK + E+E+ E E ++ + K +K+     E   +K+E E++K     + +E +   +
Sbjct: 25  KEFKKRQAELEKLEKELQKLKEKLQKDAATLSEAAREKKEKELQK-----KVQEFQRKQQ 79

Query: 230 QKKNEEEEEEEEVLIKLEEE 249
           + + + ++ ++E L K+ ++
Sbjct: 80  KLQQDLQKRQQEELQKILDK 99


>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota).  DNA
           Topoisomerase I (eukaryota), DNA topoisomerase V,
           Vaccina virus topoisomerase, Variola virus
           topoisomerase, Shope fibroma virus topoisomeras.
          Length = 391

 Score = 27.3 bits (61), Expect = 9.7
 Identities = 15/72 (20%), Positives = 38/72 (52%), Gaps = 1/72 (1%)

Query: 182 EEENEEEEEVKQKKNKEEEEEEEEKKEEDEIEKEEGENEEEEEEEVVKQKKNEEEEEEEE 241
           +E+ +  +   ++  K     E     + +++ +   + E+ + EV ++KK +++EE+++
Sbjct: 283 QEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKK 342

Query: 242 VLI-KLEEEEEN 252
             I +LEE  E 
Sbjct: 343 KQIERLEERIEK 354


>gnl|CDD|215153 PLN02271, PLN02271, serine hydroxymethyltransferase.
          Length = 586

 Score = 27.1 bits (60), Expect = 9.7
 Identities = 11/27 (40%), Positives = 18/27 (66%)

Query: 201 EEEEEKKEEDEIEKEEGENEEEEEEEV 227
           E++EEK+E+   E +E E E+ E+E  
Sbjct: 53  EQKEEKEEDAGEEGDEDEEEQGEDEHF 79


>gnl|CDD|215276 PLN02500, PLN02500, cytochrome P450 90B1.
          Length = 490

 Score = 27.1 bits (60), Expect = 9.9
 Identities = 10/24 (41%), Positives = 18/24 (75%)

Query: 205 EKKEEDEIEKEEGENEEEEEEEVV 228
           E+K E+ IEK + E+E  EE++++
Sbjct: 243 ERKMEERIEKLKEEDESVEEDDLL 266


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.306    0.130    0.357 

Gapped
Lambda     K      H
   0.267   0.0812    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 13,845,181
Number of extensions: 1476719
Number of successful extensions: 28792
Number of sequences better than 10.0: 1
Number of HSP's gapped: 14629
Number of HSP's successfully gapped: 3797
Length of query: 252
Length of database: 10,937,602
Length adjustment: 95
Effective length of query: 157
Effective length of database: 6,723,972
Effective search space: 1055663604
Effective search space used: 1055663604
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 43 (21.9 bits)
S2: 58 (26.0 bits)