RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy8459
         (106 letters)



>gnl|CDD|218556 pfam05327, RRN3, RNA polymerase I specific transcription initiation
           factor RRN3.  This family consists of several eukaryotic
           proteins which are homologous to the yeast RRN3 protein.
           RRN3 is one of the RRN genes specifically required for
           the transcription of rDNA by RNA polymerase I (Pol I) in
           Saccharomyces cerevisiae.
          Length = 554

 Score = 40.4 bits (95), Expect = 4e-05
 Identities = 20/54 (37%), Positives = 28/54 (51%), Gaps = 1/54 (1%)

Query: 43  NNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDT-EGKEVRGK 95
            NE  D +D+EE      E +D+E +  + D  DEE+ + E E T   KEV  K
Sbjct: 219 QNELDDIDDEEEERVLADEDDDDEDDMFDMDDDDEEESDPEVERTSTIKEVSEK 272



 Score = 33.0 bits (76), Expect = 0.017
 Identities = 11/47 (23%), Positives = 23/47 (48%), Gaps = 2/47 (4%)

Query: 50  EDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKK 96
           + D+E E      ED++ E++  D   ++D E+E +    +    K+
Sbjct: 224 DIDDEEEERVLADEDDDDEDDMFD--MDDDDEEESDPEVERTSTIKE 268



 Score = 27.7 bits (62), Expect = 1.3
 Identities = 16/54 (29%), Positives = 26/54 (48%), Gaps = 1/54 (1%)

Query: 48  DSEDDEESEGDDYEGEDEELEEEEDDGLDE-EDGEDEEEDTEGKEVRGKKRKHD 100
           D E   E +  D E E+  L +E+DD  D+  D +D++E+    EV       +
Sbjct: 215 DVEIQNELDDIDDEEEERVLADEDDDDEDDMFDMDDDDEEESDPEVERTSTIKE 268


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 40.4 bits (94), Expect = 5e-05
 Identities = 21/49 (42%), Positives = 32/49 (65%), Gaps = 1/49 (2%)

Query: 40  DAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTE 88
           +AK +E    +    S+G D E E+EE EEEE++  +EE+ E+EEE+ E
Sbjct: 844 EAKQDEK-GVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENE 891



 Score = 39.2 bits (91), Expect = 1e-04
 Identities = 20/49 (40%), Positives = 29/49 (59%), Gaps = 2/49 (4%)

Query: 45  EAPDSED--DEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKE 91
           EA   E   D     D  + E+EE EEEE++  +EE+ E+EEE+ E +E
Sbjct: 844 EAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892



 Score = 36.9 bits (85), Expect = 7e-04
 Identities = 19/63 (30%), Positives = 34/63 (53%)

Query: 44  NEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEE 103
           +E  + E + E++G+  + E         DG D E+ E+EEE+ E +E   ++ + +EEE
Sbjct: 830 DETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEE 889

Query: 104 EGE 106
             E
Sbjct: 890 NEE 892



 Score = 33.8 bits (77), Expect = 0.009
 Identities = 19/68 (27%), Positives = 34/68 (50%), Gaps = 5/68 (7%)

Query: 44  NEAPDSEDDEESEGDDYEGEDEELEEE-----EDDGLDEEDGEDEEEDTEGKEVRGKKRK 98
            E  + E   E++ DD E +DE  E+E     + +   +E G D    ++G +   ++ +
Sbjct: 810 GEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEE 869

Query: 99  HDEEEEGE 106
            +EEEE E
Sbjct: 870 EEEEEEEE 877



 Score = 33.8 bits (77), Expect = 0.010
 Identities = 19/69 (27%), Positives = 31/69 (44%), Gaps = 1/69 (1%)

Query: 38  GFDAKNNEAPDSEDDEESEGD-DYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKK 96
           G + +     + E+ EES G+ + EGE E   E E +G    + + E+E     E +   
Sbjct: 646 GEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEAD 705

Query: 97  RKHDEEEEG 105
            K + E E 
Sbjct: 706 HKGETEAEE 714



 Score = 31.9 bits (72), Expect = 0.049
 Identities = 18/45 (40%), Positives = 28/45 (62%)

Query: 40  DAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEE 84
           D K  +     D  +SE ++ E E+EE EEEE++  +EE+ E+EE
Sbjct: 848 DEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892



 Score = 31.5 bits (71), Expect = 0.054
 Identities = 18/69 (26%), Positives = 32/69 (46%), Gaps = 1/69 (1%)

Query: 38  GFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKR 97
           G +A+     +++ + ESEG+    E +  +E E +   +E     E + E  E  G+  
Sbjct: 664 GGEAEQEGETETKGENESEGE-IPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETE 722

Query: 98  KHDEEEEGE 106
               E+EGE
Sbjct: 723 AEGTEDEGE 731



 Score = 31.5 bits (71), Expect = 0.063
 Identities = 13/67 (19%), Positives = 28/67 (41%)

Query: 38  GFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKR 97
             +A+  E  D  + +  E  + +G++    + E +G  E   +DE E     +    + 
Sbjct: 769 ETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEV 828

Query: 98  KHDEEEE 104
           K +  E+
Sbjct: 829 KDETGEQ 835



 Score = 31.1 bits (70), Expect = 0.090
 Identities = 16/41 (39%), Positives = 26/41 (63%)

Query: 38  GFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEE 78
           G D  ++E  + E++EE E ++ E E+EE EEE ++ L  E
Sbjct: 857 GSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEEPLSLE 897



 Score = 30.3 bits (68), Expect = 0.13
 Identities = 21/63 (33%), Positives = 34/63 (53%), Gaps = 2/63 (3%)

Query: 45  EAPDSEDDEESEGDDYEGEDE-ELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEE 103
           +  D + + E+E  ++EGE E E  E+E +    E+GE+ E++ EG E  GK     E +
Sbjct: 702 KEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEG-EAEGKHEVETEGD 760

Query: 104 EGE 106
             E
Sbjct: 761 RKE 763



 Score = 30.3 bits (68), Expect = 0.17
 Identities = 25/68 (36%), Positives = 36/68 (52%), Gaps = 6/68 (8%)

Query: 45  EAPDSEDDEESEGDDYEGE---DEELEEEEDDGLDEEDGEDEEE---DTEGKEVRGKKRK 98
           E  + E + E+EG + EGE    EE EE ED+G  E +G+ E E   D +  E  G+   
Sbjct: 713 EEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEA 772

Query: 99  HDEEEEGE 106
             +E+E E
Sbjct: 773 EGKEDEDE 780



 Score = 30.0 bits (67), Expect = 0.21
 Identities = 21/71 (29%), Positives = 32/71 (45%), Gaps = 4/71 (5%)

Query: 40  DAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEE---DTEGK-EVRGK 95
           +    E     + E   G++  GE E+  E E  G +E +GE   E   + EG+ E+  K
Sbjct: 643 ERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAK 702

Query: 96  KRKHDEEEEGE 106
           +  H  E E E
Sbjct: 703 EADHKGETEAE 713



 Score = 29.6 bits (66), Expect = 0.25
 Identities = 16/61 (26%), Positives = 33/61 (54%), Gaps = 2/61 (3%)

Query: 48  DSEDDEESEGD-DYEGEDEELEEEEDDGLDEEDGEDEEEDTEGK-EVRGKKRKHDEEEEG 105
           + + ++E EG+ + +  D + E E ++   E + E E  + EG+ E   +  + ++E EG
Sbjct: 688 ERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEG 747

Query: 106 E 106
           E
Sbjct: 748 E 748



 Score = 29.2 bits (65), Expect = 0.33
 Identities = 13/37 (35%), Positives = 24/37 (64%)

Query: 37  DGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDD 73
            G  +   ++ + E++EE E ++ E E+EE EEEE++
Sbjct: 854 GGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEEN 890



 Score = 28.8 bits (64), Expect = 0.52
 Identities = 14/34 (41%), Positives = 22/34 (64%)

Query: 37  DGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEE 70
           DG D++  E  + E++EE E ++ E E+EE  EE
Sbjct: 859 DGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892



 Score = 28.8 bits (64), Expect = 0.57
 Identities = 17/61 (27%), Positives = 36/61 (59%), Gaps = 1/61 (1%)

Query: 45  EAPDSEDDEESEGDDYEGEDE-ELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEE 103
           +  ++E + E+E +  E EDE E++  ED  +  ++G + + + EG+   G+K +H+ + 
Sbjct: 760 DRKETEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQS 819

Query: 104 E 104
           E
Sbjct: 820 E 820



 Score = 28.0 bits (62), Expect = 0.91
 Identities = 18/60 (30%), Positives = 27/60 (45%), Gaps = 1/60 (1%)

Query: 48  DSEDDEESEG-DDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEEEGE 106
           +SE +  +E   + EGE E   +E D   + E  E E E     E    + + +  EEGE
Sbjct: 680 ESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGE 739



 Score = 28.0 bits (62), Expect = 0.91
 Identities = 16/56 (28%), Positives = 27/56 (48%)

Query: 48  DSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEE 103
           ++ ++ E   D+ EGE E   E E +G  +E   + E + EGKE   +      E+
Sbjct: 733 ETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGED 788



 Score = 27.3 bits (60), Expect = 2.0
 Identities = 15/56 (26%), Positives = 24/56 (42%)

Query: 51  DDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEEEGE 106
           D+      ++EGE E  E++E +G  E   +D E   E  E         E ++ E
Sbjct: 794 DEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDE 849



 Score = 26.9 bits (59), Expect = 2.6
 Identities = 22/67 (32%), Positives = 30/67 (44%), Gaps = 1/67 (1%)

Query: 40  DAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKH 99
           D    EA + + + E+EGD  E E E   E E    ++E      ED E K   G + K 
Sbjct: 743 DEGEGEA-EGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKV 801

Query: 100 DEEEEGE 106
           + E E E
Sbjct: 802 EHEGETE 808



 Score = 26.5 bits (58), Expect = 3.3
 Identities = 19/81 (23%), Positives = 28/81 (34%), Gaps = 11/81 (13%)

Query: 37  DGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEE----------- 85
           DG    +  A    + E       + E E   E + D  + +D   E+E           
Sbjct: 788 DGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQ 847

Query: 86  DTEGKEVRGKKRKHDEEEEGE 106
           D +G +  G     D EEE E
Sbjct: 848 DEKGVDGGGGSDGGDSEEEEE 868



 Score = 26.1 bits (57), Expect = 4.8
 Identities = 17/59 (28%), Positives = 29/59 (49%)

Query: 48  DSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEEEGE 106
           ++E+ E     + EG ++E E E  +  +E + E E E     EV  +  + + E EGE
Sbjct: 711 EAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGE 769


>gnl|CDD|197729 smart00446, LRRcap, occurring C-terminal to leucine-rich repeats.
           A motif occurring C-terminal to leucine-rich repeats
          in "sds22-like" and "typical" LRR-containing proteins.
          Length = 19

 Score = 36.9 bits (87), Expect = 6e-05
 Identities = 12/18 (66%), Positives = 14/18 (77%)

Query: 20 ENYREKVFELLPNLKYLD 37
           +YREKV ELLP L+ LD
Sbjct: 2  AHYREKVIELLPQLRKLD 19


>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein.  CDC45 is an essential gene
           required for initiation of DNA replication in S.
           cerevisiae, forming a complex with MCM5/CDC46.
           Homologues of CDC45 have been identified in human, mouse
           and smut fungus among others.
          Length = 583

 Score = 38.8 bits (91), Expect = 2e-04
 Identities = 20/63 (31%), Positives = 32/63 (50%)

Query: 40  DAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKH 99
           DA  +   D +DDEES+ +D E    E +E++DD  D++D    E   E +  R +  + 
Sbjct: 118 DAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATRERSLERRRRRREWEEK 177

Query: 100 DEE 102
             E
Sbjct: 178 RAE 180



 Score = 36.9 bits (86), Expect = 8e-04
 Identities = 17/71 (23%), Positives = 37/71 (52%)

Query: 36  LDGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGK 95
           L      ++   D E+D++ + +  E ++E  + E+D+  D++D +D+    E    R +
Sbjct: 110 LQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATRERSLERRR 169

Query: 96  KRKHDEEEEGE 106
           +R+  EE+  E
Sbjct: 170 RRREWEEKRAE 180



 Score = 32.6 bits (75), Expect = 0.023
 Identities = 15/62 (24%), Positives = 34/62 (54%)

Query: 37  DGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKK 96
           DG   +  +     DD   + ++ + +DEE +EE+++    ED ED+++D +  ++  ++
Sbjct: 103 DGDIEEELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATRE 162

Query: 97  RK 98
           R 
Sbjct: 163 RS 164



 Score = 30.0 bits (68), Expect = 0.18
 Identities = 14/57 (24%), Positives = 25/57 (43%), Gaps = 1/57 (1%)

Query: 48  DSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEEE 104
              ++E  +   Y+    +LEE++DD  +E D EDEE      +        D++  
Sbjct: 104 GDIEEELQDEPRYDDAYRDLEEDDDDD-EESDEEDEESSKSEDDEDDDDDDDDDDIA 159



 Score = 30.0 bits (68), Expect = 0.20
 Identities = 13/56 (23%), Positives = 27/56 (48%)

Query: 51  DDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEEEGE 106
            D E E  D    D+   + E+D  D+E+ ++E+E++   E        D++++  
Sbjct: 104 GDIEEELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIA 159


>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
           Provisional.
          Length = 2849

 Score = 38.1 bits (88), Expect = 3e-04
 Identities = 19/59 (32%), Positives = 33/59 (55%), Gaps = 8/59 (13%)

Query: 48  DSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEEEGE 106
             ED    +    + +DE+ +E++DD  D+E+ E+EEE+ +G +        DE+EE E
Sbjct: 141 AEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFD--------DEDEEDE 191



 Score = 30.0 bits (67), Expect = 0.19
 Identities = 22/53 (41%), Positives = 31/53 (58%), Gaps = 1/53 (1%)

Query: 32  NLKYLDGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEED-DGLDEEDGEDE 83
           +L   D     N    D ++DE+ + DD E ++EE EEEE+  G D+ED EDE
Sbjct: 139 HLAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDE 191



 Score = 29.6 bits (66), Expect = 0.32
 Identities = 18/70 (25%), Positives = 40/70 (57%)

Query: 34  KYLDGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVR 93
           ++L   D    +    +DD+E E +D + E+++ EEEE++   +   +++EED  G++  
Sbjct: 138 RHLAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGEDFT 197

Query: 94  GKKRKHDEEE 103
            +K + D+ +
Sbjct: 198 YEKSEVDKTD 207


>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein.  This is a family of
           fungal and plant proteins and contains many hypothetical
           proteins. VID27 is a cytoplasmic protein that plays a
           potential role in vacuolar protein degradation.
          Length = 794

 Score = 35.9 bits (83), Expect = 0.002
 Identities = 21/54 (38%), Positives = 26/54 (48%), Gaps = 5/54 (9%)

Query: 51  DDEESEGDDYEGEDEELEEEEDDGLDE-----EDGEDEEEDTEGKEVRGKKRKH 99
           +D  +E DD E EDEE EEEED+         +D E EE+D E K         
Sbjct: 382 EDANTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVESKYEDSDGNSS 435



 Score = 30.1 bits (68), Expect = 0.20
 Identities = 17/50 (34%), Positives = 28/50 (56%)

Query: 40  DAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEG 89
           DA      + E+DEE E ++ E E    E  +D+  +E+D E + ED++G
Sbjct: 383 DANTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVESKYEDSDG 432



 Score = 27.8 bits (62), Expect = 1.2
 Identities = 17/44 (38%), Positives = 25/44 (56%), Gaps = 1/44 (2%)

Query: 63  EDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEEEGE 106
              E+E+   +  DEE  EDEEE+ E  E  G  ++H ++EE E
Sbjct: 377 SALEIEDANTERDDEE-EEDEEEEEEEDEDEGPSKEHSDDEEFE 419



 Score = 26.6 bits (59), Expect = 2.7
 Identities = 18/58 (31%), Positives = 28/58 (48%), Gaps = 2/58 (3%)

Query: 49  SEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEEEGE 106
            E +++   D +   + E    E D  +EED E+EEE+ E +     K   D+EE  E
Sbjct: 365 KETEQDYILDAFSALEIEDANTERDDEEEEDEEEEEEEDEDEGPS--KEHSDDEEFEE 420


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 35.7 bits (83), Expect = 0.002
 Identities = 21/72 (29%), Positives = 38/72 (52%), Gaps = 2/72 (2%)

Query: 35  YLDGFDAKNNEAPDSEDDEESE--GDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEV 92
            L   +    E  D  DDE+ E   DD E E+E+++  +++  +E++  D+E+D E +E 
Sbjct: 315 GLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEE 374

Query: 93  RGKKRKHDEEEE 104
             +K+K    E 
Sbjct: 375 EKEKKKKKSAES 386



 Score = 35.0 bits (81), Expect = 0.003
 Identities = 22/71 (30%), Positives = 38/71 (53%), Gaps = 2/71 (2%)

Query: 38  GFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGE--DEEEDTEGKEVRGK 95
           G + +  E    +D++E + DD   E+EE  +  D+  DEED +  DE+++ E +E + K
Sbjct: 319 GEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEK 378

Query: 96  KRKHDEEEEGE 106
           K+K   E    
Sbjct: 379 KKKKSAESTRS 389



 Score = 34.2 bits (79), Expect = 0.006
 Identities = 27/72 (37%), Positives = 42/72 (58%), Gaps = 2/72 (2%)

Query: 37  DGFDAKNNEAPDSEDDEESEGDD--YEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRG 94
           +  +     A D +D+ E + DD    G+ EE EEEE+DG+D+ED ED+++D E +E   
Sbjct: 290 EEEEDSKESADDLDDEFEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDV 349

Query: 95  KKRKHDEEEEGE 106
                +E+EE E
Sbjct: 350 DLSDEEEDEEDE 361



 Score = 32.7 bits (75), Expect = 0.024
 Identities = 23/70 (32%), Positives = 38/70 (54%)

Query: 37  DGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKK 96
           D F+  +++       EE E ++ +G D+E EE++DD L+EE+ + +  D E  E     
Sbjct: 304 DEFEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDS 363

Query: 97  RKHDEEEEGE 106
              D+EEE E
Sbjct: 364 DDEDDEEEEE 373



 Score = 26.9 bits (60), Expect = 2.3
 Identities = 15/74 (20%), Positives = 36/74 (48%)

Query: 33  LKYLDGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEV 92
           L+ + G +  + E  DS++  +   D++E +D++         DEE+ ED  +D + ++ 
Sbjct: 279 LRRMRGEEEDDEEEEDSKESADDLDDEFEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDD 338

Query: 93  RGKKRKHDEEEEGE 106
                + +E+ +  
Sbjct: 339 DDDLEEEEEDVDLS 352



 Score = 25.7 bits (57), Expect = 6.3
 Identities = 11/48 (22%), Positives = 21/48 (43%), Gaps = 1/48 (2%)

Query: 38  GFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEE 85
               K+++  D +D ++ +  D    D+          DEED E++ E
Sbjct: 124 SEIDKDDDVRDDDDFDDDDLGD-LASDDRAAHFGGGEDDEEDEEEQPE 170


>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
            [Transcription / DNA replication, recombination, and
            repair / Chromatin structure and dynamics].
          Length = 1001

 Score = 34.6 bits (79), Expect = 0.005
 Identities = 22/62 (35%), Positives = 31/62 (50%), Gaps = 3/62 (4%)

Query: 40   DAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGED--EEEDTEGKEVRGKKR 97
               +++    E DE+ E D+   ED   +E E+D  DEEDGED  E E     + R  KR
Sbjct: 941  YEASSDDESDETDEDEESDESS-EDLSEDESENDSSDEEDGEDWDELESKAAYDSRPGKR 999

Query: 98   KH 99
            + 
Sbjct: 1000 RK 1001


>gnl|CDD|220972 pfam11081, DUF2890, Protein of unknown function (DUF2890).  This
          family is conserved in dsDNA adenoviruses of
          vertebrates. The function is not known.
          Length = 172

 Score = 33.7 bits (77), Expect = 0.008
 Identities = 20/42 (47%), Positives = 28/42 (66%), Gaps = 1/42 (2%)

Query: 47 PDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTE 88
           D E+D +S+ ++ E EDEE  E+ +D LDEED E EE + E
Sbjct: 18 KDEEEDWDSQAEEVE-EDEEEMEDWEDSLDEEDEEAEEVEEE 58


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 33.0 bits (76), Expect = 0.015
 Identities = 16/57 (28%), Positives = 31/57 (54%)

Query: 37  DGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVR 93
           D  D  +++  D EDD++ + DD + E +E +E E    D++   DE++    ++ R
Sbjct: 145 DDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQAR 201



 Score = 32.7 bits (75), Expect = 0.019
 Identities = 16/71 (22%), Positives = 39/71 (54%)

Query: 34  KYLDGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVR 93
           K +D  +  +++  D +DD+  + D  + +D+E ++E+DD  D +D ++E+++ +  E  
Sbjct: 122 KDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKL 181

Query: 94  GKKRKHDEEEE 104
                   +E+
Sbjct: 182 SDDDDFVWDED 192



 Score = 32.7 bits (75), Expect = 0.022
 Identities = 14/68 (20%), Positives = 34/68 (50%)

Query: 36  LDGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGK 95
            D  D  + +  D ++D++ + DD + +DE+ E++E   L++   +D+    E      +
Sbjct: 139 DDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALR 198

Query: 96  KRKHDEEE 103
           + + D + 
Sbjct: 199 QARKDAKL 206



 Score = 32.7 bits (75), Expect = 0.024
 Identities = 15/81 (18%), Positives = 41/81 (50%)

Query: 24  EKVFELLPNLKYLDGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDE 83
           +     + ++  L+  D  +++  D + D++   DD + ED++ ++++DD  DE++ + E
Sbjct: 115 DDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKE 174

Query: 84  EEDTEGKEVRGKKRKHDEEEE 104
            ++ E           +++ E
Sbjct: 175 AKELEKLSDDDDFVWDEDDSE 195



 Score = 31.9 bits (73), Expect = 0.042
 Identities = 16/70 (22%), Positives = 35/70 (50%)

Query: 37  DGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKK 96
           D       +  D +DD++   DD   +D++ E++++D  D++  +++EE  E KE+    
Sbjct: 123 DIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLS 182

Query: 97  RKHDEEEEGE 106
              D   + +
Sbjct: 183 DDDDFVWDED 192



 Score = 31.1 bits (71), Expect = 0.081
 Identities = 16/77 (20%), Positives = 38/77 (49%)

Query: 15  DVTTIENYREKVFELLPNLKYLDGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDG 74
           D     NY + +  L       D  D  + +  D +DD++ E DD + +D+++++E+++ 
Sbjct: 113 DKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEK 172

Query: 75  LDEEDGEDEEEDTEGKE 91
            + ++ E   +D +   
Sbjct: 173 KEAKELEKLSDDDDFVW 189



 Score = 31.1 bits (71), Expect = 0.083
 Identities = 15/67 (22%), Positives = 39/67 (58%)

Query: 40  DAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKH 99
            A +++  D +DD + +  D + +DE+ +E++DD   +++ E+++E  E +++       
Sbjct: 129 QADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFV 188

Query: 100 DEEEEGE 106
            +E++ E
Sbjct: 189 WDEDDSE 195



 Score = 29.2 bits (66), Expect = 0.31
 Identities = 18/70 (25%), Positives = 39/70 (55%), Gaps = 2/70 (2%)

Query: 37  DGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKK 96
              D   N   D +   +++ DD + +D++L++++ D  D++D ED++ED +  +V  + 
Sbjct: 112 LDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDID--DDDDDEDDDEDDDDDDVDDED 169

Query: 97  RKHDEEEEGE 106
            +  E +E E
Sbjct: 170 EEKKEAKELE 179



 Score = 28.8 bits (65), Expect = 0.46
 Identities = 17/67 (25%), Positives = 36/67 (53%)

Query: 40  DAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKH 99
           + KN    D + +   + D     D++ ++++DD LD++D +D+++D +  E        
Sbjct: 107 EKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVD 166

Query: 100 DEEEEGE 106
           DE+EE +
Sbjct: 167 DEDEEKK 173



 Score = 28.8 bits (65), Expect = 0.48
 Identities = 16/68 (23%), Positives = 36/68 (52%), Gaps = 1/68 (1%)

Query: 40  DAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGED-EEEDTEGKEVRGKKRK 98
           D    +  D  +  + + DD + +D + ++ +DD  DE+D ED +++D + ++   K+ K
Sbjct: 117 DLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAK 176

Query: 99  HDEEEEGE 106
             E+   +
Sbjct: 177 ELEKLSDD 184



 Score = 27.3 bits (61), Expect = 1.7
 Identities = 11/84 (13%), Positives = 33/84 (39%)

Query: 3   LTNLKHVDLFNNDVTTIENYREKVFELLPNLKYLDGFDAKNNEAPDSEDDEESEGDDYEG 62
           L  +K +D+ N      ++  +   +        D  D   ++  D  DDE+ E  + + 
Sbjct: 118 LNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKE 177

Query: 63  EDEELEEEEDDGLDEEDGEDEEED 86
            ++  ++++    +++     +  
Sbjct: 178 LEKLSDDDDFVWDEDDSEALRQAR 201


>gnl|CDD|215079 PLN00151, PLN00151, potassium transporter; Provisional.
          Length = 852

 Score = 33.2 bits (76), Expect = 0.017
 Identities = 18/74 (24%), Positives = 27/74 (36%), Gaps = 19/74 (25%)

Query: 48  DSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEE-------------------DTE 88
           DS +    +G + + ED    +++DD  D   G  EEE                   D E
Sbjct: 23  DSGESRWVDGSEDDSEDPASLDDDDDDRDSGGGMLEEEEDGNMRRRLIRTPPRVDSFDVE 82

Query: 89  GKEVRGKKRKHDEE 102
             E+ G  R   E+
Sbjct: 83  AMEIPGAHRHDSED 96



 Score = 27.4 bits (61), Expect = 1.4
 Identities = 8/66 (12%), Positives = 24/66 (36%), Gaps = 2/66 (3%)

Query: 41  AKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHD 100
            +++    S  +        +  +    +  +D  D ED    ++D + ++  G   + +
Sbjct: 3   EESSGGIGSSSNGSGGLVGTDSGESRWVDGSED--DSEDPASLDDDDDDRDSGGGMLEEE 60

Query: 101 EEEEGE 106
           E+    
Sbjct: 61  EDGNMR 66


>gnl|CDD|217783 pfam03896, TRAP_alpha, Translocon-associated protein (TRAP),
          alpha subunit.  The alpha-subunit of the TRAP complex
          (TRAP alpha) is a single-spanning membrane protein of
          the endoplasmic reticulum (ER) which is found in
          proximity of nascent polypeptide chains translocating
          across the membrane.
          Length = 281

 Score = 32.8 bits (75), Expect = 0.019
 Identities = 13/42 (30%), Positives = 24/42 (57%)

Query: 46 APDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDT 87
          A D  +DEE+E D  + ++E+    E+D  +  + E++EE  
Sbjct: 31 AQDLTEDEEAEDDVVDEDEEDEAVVEEDENELTEEEEDEEGE 72



 Score = 28.2 bits (63), Expect = 0.62
 Identities = 16/44 (36%), Positives = 25/44 (56%), Gaps = 7/44 (15%)

Query: 63  EDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEEEGE 106
           EDEE E++  D  +E++   EE++ E  E        +E+EEGE
Sbjct: 36  EDEEAEDDVVDEDEEDEAVVEEDENELTEE-------EEDEEGE 72



 Score = 27.0 bits (60), Expect = 1.7
 Identities = 15/41 (36%), Positives = 24/41 (58%), Gaps = 4/41 (9%)

Query: 53 EESEGDDYEGEDEELEEEEDD----GLDEEDGEDEEEDTEG 89
           +   +D E ED+ ++E+E+D      DE +  +EEED EG
Sbjct: 31 AQDLTEDEEAEDDVVDEDEEDEAVVEEDENELTEEEEDEEG 71



 Score = 26.7 bits (59), Expect = 2.8
 Identities = 15/41 (36%), Positives = 25/41 (60%), Gaps = 4/41 (9%)

Query: 45 EAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEE 85
          E  ++EDD   E    + EDE + EE+++ L EE+ ++E E
Sbjct: 36 EDEEAEDDVVDE----DEEDEAVVEEDENELTEEEEDEEGE 72


>gnl|CDD|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region.  The
          domain is found in the primary vegetative sigma factor.
          The function of this domain is unclear and can be
          removed without loss of function.
          Length = 211

 Score = 32.5 bits (75), Expect = 0.022
 Identities = 14/51 (27%), Positives = 26/51 (50%)

Query: 35 YLDGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEE 85
          ++D   A       + + E  E D  + +D++ +E+EDD  + + G D EE
Sbjct: 29 FIDPNAAAAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEADLGPDPEE 79



 Score = 28.3 bits (64), Expect = 0.69
 Identities = 14/43 (32%), Positives = 22/43 (51%)

Query: 47 PDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEG 89
          P++     +        DEE  E++DD  ++ED +DEEE   G
Sbjct: 32 PNAAAAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEADLG 74


>gnl|CDD|206026 pfam13855, LRR_8, Leucine rich repeat. 
          Length = 60

 Score = 30.6 bits (70), Expect = 0.022
 Identities = 16/37 (43%), Positives = 23/37 (62%), Gaps = 3/37 (8%)

Query: 1  KQLTNLKHVDLFNNDVTTIENYREKVFELLPNLKYLD 37
          K L NLK +DL  N++T+I     + F  LP+L+ LD
Sbjct: 21 KGLPNLKVLDLSGNNLTSIS---PEAFSGLPSLRSLD 54



 Score = 30.6 bits (70), Expect = 0.030
 Identities = 17/33 (51%), Positives = 21/33 (63%), Gaps = 3/33 (9%)

Query: 5  NLKHVDLFNNDVTTIENYREKVFELLPNLKYLD 37
          NLK +DL NN +T I    +  F+ LPNLK LD
Sbjct: 1  NLKSLDLSNNRLTVIP---DGAFKGLPNLKVLD 30


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 32.7 bits (74), Expect = 0.026
 Identities = 19/69 (27%), Positives = 32/69 (46%), Gaps = 2/69 (2%)

Query: 40   DAKNNEAPDSED--DEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKR 97
            DA+ +E    ED  +E +  D+   +D+  +  EDD    EDG +E      +      +
Sbjct: 4024 DAEKDEPMQDEDPLEENNTLDEDIQQDDFSDLAEDDEKMNEDGFEENVQENEESTEDGVK 4083

Query: 98   KHDEEEEGE 106
              +E E+GE
Sbjct: 4084 SDEELEQGE 4092



 Score = 28.8 bits (64), Expect = 0.53
 Identities = 17/51 (33%), Positives = 29/51 (56%), Gaps = 3/51 (5%)

Query: 41   AKNNEAP--DSEDDEES-EGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTE 88
            A NNE+     EDD ++ E  D + +++E E  +D G+D+E   D +E+  
Sbjct: 3928 AANNESDLVSKEDDNKALEDKDRQEKEDEEEMSDDVGIDDEIQPDIQENNS 3978



 Score = 28.0 bits (62), Expect = 0.94
 Identities = 13/71 (18%), Positives = 29/71 (40%), Gaps = 1/71 (1%)

Query: 37   DGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLD-EEDGEDEEEDTEGKEVRGK 95
             G  A++     +EDD+  E  + E    + + +E +  + E D     +D+   E    
Sbjct: 3820 SGVGAEDITNTLNEDDDLEELANEEDTANQSDLDESEARELESDMNGVTKDSVVSENENS 3879

Query: 96   KRKHDEEEEGE 106
              + + ++  E
Sbjct: 3880 DSEEENQDLDE 3890



 Score = 28.0 bits (62), Expect = 1.1
 Identities = 12/37 (32%), Positives = 22/37 (59%)

Query: 50   EDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEED 86
            E+ EE++ +  E   +E   EE++ LDE+  +D+  D
Sbjct: 4018 ENKEEADAEKDEPMQDEDPLEENNTLDEDIQQDDFSD 4054



 Score = 25.7 bits (56), Expect = 5.7
 Identities = 12/48 (25%), Positives = 23/48 (47%)

Query: 44   NEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKE 91
            NE    E +++S        + +L  +EDD    ED + +E++ E + 
Sbjct: 3912 NEEDLLETEQKSNEQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEM 3959


>gnl|CDD|177447 PHA02664, PHA02664, hypothetical protein; Provisional.
          Length = 534

 Score = 32.3 bits (73), Expect = 0.029
 Identities = 19/63 (30%), Positives = 30/63 (47%), Gaps = 12/63 (19%)

Query: 44  NEAPDSEDDEESEG--------DDYEGEDEELEEEEDDGLDEEDG----EDEEEDTEGKE 91
           +E  D EDD +S          D+ +  D+   +E D G++ +DG      EEE+ E + 
Sbjct: 464 DERADGEDDSDSSYSYSTTSSEDESDSADDSWGDESDSGIEHDDGGVGQAIEEEEEEERA 523

Query: 92  VRG 94
           V G
Sbjct: 524 VLG 526



 Score = 28.8 bits (64), Expect = 0.52
 Identities = 16/50 (32%), Positives = 26/50 (52%)

Query: 37  DGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEED 86
           D  D+  + +  S +DE    DD  G++ +   E DDG   +  E+EEE+
Sbjct: 471 DDSDSSYSYSTTSSEDESDSADDSWGDESDSGIEHDDGGVGQAIEEEEEE 520



 Score = 28.0 bits (62), Expect = 0.79
 Identities = 18/77 (23%), Positives = 33/77 (42%), Gaps = 12/77 (15%)

Query: 40  DAKNNEAPDSEDDEESEGDDYEGEDEELE----------EEEDDGLDEEDGEDEEEDTEG 89
            A   ++ + + DE+  GD+    +++ +          E+E D  D  D   +E D+  
Sbjct: 446 HADRADSDEDDMDEQESGDERADGEDDSDSSYSYSTTSSEDESDSAD--DSWGDESDSGI 503

Query: 90  KEVRGKKRKHDEEEEGE 106
           +   G   +  EEEE E
Sbjct: 504 EHDDGGVGQAIEEEEEE 520


>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
           Transcription initiation factor IIA (TFIIA) is a
           heterotrimer, the three subunits being known as alpha,
           beta, and gamma, in order of molecular weight. The N and
           C-terminal domains of the gamma subunit are represented
           in pfam02268 and pfam02751, respectively. This family
           represents the precursor that yields both the alpha and
           beta subunits. The TFIIA heterotrimer is an essential
           general transcription initiation factor for the
           expression of genes transcribed by RNA polymerase II.
           Together with TFIID, TFIIA binds to the promoter region;
           this is the first step in the formation of a
           pre-initiation complex (PIC). Binding of the rest of the
           transcription machinery follows this step. After
           initiation, the PIC does not completely dissociate from
           the promoter. Some components, including TFIIA, remain
           attached and re-initiate a subsequent round of
           transcription.
          Length = 332

 Score = 32.0 bits (73), Expect = 0.029
 Identities = 12/54 (22%), Positives = 22/54 (40%), Gaps = 1/54 (1%)

Query: 39  FDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEV 92
              +   A     D + EGD  + +D+E   E D   D +D   +E+  +  + 
Sbjct: 235 SSKRRTIAQIDGIDSDDEGDGSDDDDDEDAIESDL-DDSDDDVSDEDGEDLFDT 287



 Score = 31.3 bits (71), Expect = 0.069
 Identities = 11/42 (26%), Positives = 21/42 (50%), Gaps = 2/42 (4%)

Query: 37  DGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEE 78
              +   ++  D ED  ES+ DD   +D+  +E+ +D  D +
Sbjct: 249 SDDEGDGSDDDDDEDAIESDLDD--SDDDVSDEDGEDLFDTD 288



 Score = 29.7 bits (67), Expect = 0.19
 Identities = 11/45 (24%), Positives = 20/45 (44%), Gaps = 1/45 (2%)

Query: 45  EAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEG 89
           +  DS+D+ +   DD + ED    + +D   D  D + E+     
Sbjct: 245 DGIDSDDEGDGSDDDDD-EDAIESDLDDSDDDVSDEDGEDLFDTD 288


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
           Fas-binding protein) is thought to play a role in
           apoptosis, but precise role played by Daxx remains to be
           determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 32.2 bits (73), Expect = 0.035
 Identities = 18/59 (30%), Positives = 32/59 (54%)

Query: 48  DSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEEEGE 106
           +   +EE E ++ E E+E+  EEE+   +EE+ E E ++   +E+ G      + EE E
Sbjct: 442 EESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPE 500



 Score = 31.8 bits (72), Expect = 0.044
 Identities = 24/58 (41%), Positives = 37/58 (63%)

Query: 49  SEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEEEGE 106
           S++ EE E  + E E+EE EEEE+   +EE+GEDEEE+ E +   G + + +   EG+
Sbjct: 436 SQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGD 493



 Score = 31.0 bits (70), Expect = 0.074
 Identities = 17/67 (25%), Positives = 36/67 (53%)

Query: 40  DAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKH 99
           +++  E+ + E++EE E ++ E E EE E E+++  +E + ++  E+       G     
Sbjct: 438 ESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGE 497

Query: 100 DEEEEGE 106
           + EE+ E
Sbjct: 498 EPEEDAE 504



 Score = 30.7 bits (69), Expect = 0.10
 Identities = 21/67 (31%), Positives = 32/67 (47%), Gaps = 2/67 (2%)

Query: 40  DAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKH 99
             +  E  + E++EE E ++ EGEDEE EEE +   D    E+ E  +EG     +  + 
Sbjct: 445 VEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEA--DNGSEEEMEGSSEGDGDGEEPEED 502

Query: 100 DEEEEGE 106
            E    E
Sbjct: 503 AERRNSE 509



 Score = 28.7 bits (64), Expect = 0.57
 Identities = 20/50 (40%), Positives = 31/50 (62%)

Query: 42  KNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKE 91
            + E+ + E  EE E ++ E E+EE E EE++G DEE+ E+ E D   +E
Sbjct: 435 ASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEE 484



 Score = 26.4 bits (58), Expect = 2.9
 Identities = 14/53 (26%), Positives = 25/53 (47%)

Query: 45  EAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKR 97
           E  + ED+EE E  + +   EE  E   +G  + +  +E+ +    E+ G  R
Sbjct: 463 EEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEMAGISR 515



 Score = 25.3 bits (55), Expect = 9.1
 Identities = 13/65 (20%), Positives = 29/65 (44%)

Query: 42  KNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDE 101
           ++ E    +++EE E +   G +EE+E   +   D E+ E++ E    +     +    +
Sbjct: 461 ESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEMAGISRMSEGQ 520

Query: 102 EEEGE 106
           +  G 
Sbjct: 521 QPRGS 525



 Score = 25.3 bits (55), Expect = 9.7
 Identities = 18/61 (29%), Positives = 31/61 (50%)

Query: 46  APDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEEEG 105
           +  S D  ++     E      +E E++   EE+ E+EEE+ E ++   ++   DEEEE 
Sbjct: 415 SSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEE 474

Query: 106 E 106
           E
Sbjct: 475 E 475


>gnl|CDD|130728 TIGR01667, YCCS_YHJK, integral membrane protein, YccS/YhfK family. 
           This model represents two clades of putative
           transmembrane proteins including the E. coli YccS and
           YhfK proteins. The YccS hypothetical equivalog
           (TIGR01666) is found in beta and gamma proteobacteria,
           while the smaller YhfK group is only found in E. coli,
           Salmonella and Yersinia. TMHMM on the 19 hits to this
           model shows a consensus of 11 transmembrane helices
           separated into two clusters, an N-terminal cluster of 6
           and a central cluster of 5. This would indicate two
           non-membrane domains one on each side of the membrane.
          Length = 701

 Score = 31.0 bits (70), Expect = 0.083
 Identities = 14/57 (24%), Positives = 22/57 (38%), Gaps = 5/57 (8%)

Query: 22  YREKVFELLPNLKYLDGF----DAKNNEAPDSEDDEES-EGDDYEGEDEELEEEEDD 73
           +RE++       + L        A        E DEE+  GD    E +E++  ED 
Sbjct: 619 HRERLLNPELAAELLQACEIVAKAIQRCQARLEYDEEALAGDANIMEAQEMQPHEDA 675


>gnl|CDD|205079 pfam12799, LRR_4, Leucine Rich repeats (2 copies).  Leucine rich
          repeats are short sequence motifs present in a number
          of proteins with diverse functions and cellular
          locations. These repeats are usually involved in
          protein-protein interactions. Each Leucine Rich Repeat
          is composed of a beta-alpha unit. These units form
          elongated non-globular structures. Leucine Rich Repeats
          are often flanked by cysteine rich domains.
          Length = 43

 Score = 28.6 bits (65), Expect = 0.089
 Identities = 14/34 (41%), Positives = 19/34 (55%), Gaps = 5/34 (14%)

Query: 4  TNLKHVDLFNNDVTTIENYREKVFELLPNLKYLD 37
          TNL+ +DL NN +T +          LPNL+ LD
Sbjct: 1  TNLETLDLSNNQITDLPP-----LSNLPNLETLD 29



 Score = 27.5 bits (62), Expect = 0.32
 Identities = 7/24 (29%), Positives = 11/24 (45%)

Query: 1  KQLTNLKHVDLFNNDVTTIENYRE 24
            L NL+ +DL  N +T +     
Sbjct: 20 SNLPNLETLDLSGNKITDLSPLSN 43


>gnl|CDD|237284 PRK13108, PRK13108, prolipoprotein diacylglyceryl transferase;
           Reviewed.
          Length = 460

 Score = 30.7 bits (69), Expect = 0.090
 Identities = 10/70 (14%), Positives = 20/70 (28%), Gaps = 2/70 (2%)

Query: 37  DGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKK 96
           DG      E     D E  +  D  G+     + + +       E     +E  +    +
Sbjct: 356 DGESTPAVEETSEADIEREQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHD--ETE 413

Query: 97  RKHDEEEEGE 106
            +  E+    
Sbjct: 414 PEVPEKAAPI 423


>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi.  This family includes
           the fifth essential DNA polymerase in yeast EC:2.7.7.7.
           Pol5p is localised exclusively to the nucleolus and
           binds near or at the enhancer region of rRNA-encoding
           DNA repeating units.
          Length = 784

 Score = 31.0 bits (70), Expect = 0.097
 Identities = 18/52 (34%), Positives = 26/52 (50%), Gaps = 1/52 (1%)

Query: 39  FDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEED-GEDEEEDTEG 89
           F+ +  +  D E+ ++ E +    ED E E E D    EED  ED+ E  EG
Sbjct: 645 FEGEEEDEDDLEETDDDEDECEAIEDSESESESDGEDGEEDEQEDDAEANEG 696



 Score = 29.5 bits (66), Expect = 0.30
 Identities = 18/48 (37%), Positives = 27/48 (56%), Gaps = 2/48 (4%)

Query: 41  AKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTE 88
            +  E  + ++D+  E DD E E E +E+ E +   E DGED EED +
Sbjct: 642 QQLFEGEEEDEDDLEETDDDEDECEAIEDSESE--SESDGEDGEEDEQ 687



 Score = 27.5 bits (61), Expect = 1.3
 Identities = 15/52 (28%), Positives = 23/52 (44%)

Query: 40  DAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKE 91
           D   +      + EE + DD E  D++ +E E     E + E + ED E  E
Sbjct: 635 DENKSRHQQLFEGEEEDEDDLEETDDDEDECEAIEDSESESESDGEDGEEDE 686



 Score = 26.4 bits (58), Expect = 4.0
 Identities = 15/57 (26%), Positives = 28/57 (49%)

Query: 48  DSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEEE 104
           DSE + ES+G+D E +++E + E ++G+   D        +   +       D E+E
Sbjct: 670 DSESESESDGEDGEEDEQEDDAEANEGVVPIDKAVRRALPKVLNLPDALDGGDSEDE 726


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 30.5 bits (69), Expect = 0.12
 Identities = 21/57 (36%), Positives = 32/57 (56%)

Query: 50  EDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEEEGE 106
           E ++E   ++ E E EE +EEE+   D+E+  DEEE+ E K+ + KK K    E   
Sbjct: 30  EVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWEL 86


>gnl|CDD|217829 pfam03985, Paf1, Paf1.  Members of this family are components of
           the RNA polymerase II associated Paf1 complex. The Paf1
           complex functions during the elongation phase of
           transcription in conjunction with Spt4-Spt5 and
           Spt16-Pob3i.
          Length = 431

 Score = 30.5 bits (69), Expect = 0.12
 Identities = 16/59 (27%), Positives = 24/59 (40%)

Query: 48  DSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEEEGE 106
           + ED+EE +  D   E+E  + EE+     EDG  E     G +   K  K    +   
Sbjct: 372 EDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESASDSDS 430


>gnl|CDD|223003 PHA03169, PHA03169, hypothetical protein; Provisional.
          Length = 413

 Score = 30.3 bits (68), Expect = 0.13
 Identities = 12/68 (17%), Positives = 16/68 (23%)

Query: 37  DGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKK 96
               A       S + + S       ED   E E      E D     +          +
Sbjct: 146 PHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQ 205

Query: 97  RKHDEEEE 104
              DE  E
Sbjct: 206 SPPDEPGE 213



 Score = 29.9 bits (67), Expect = 0.20
 Identities = 6/70 (8%), Positives = 15/70 (21%)

Query: 37  DGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKK 96
                  +E    E    S         +   E+  +  +    E E +     +     
Sbjct: 140 PPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPT 199

Query: 97  RKHDEEEEGE 106
                +   +
Sbjct: 200 SSPPPQSPPD 209



 Score = 27.2 bits (60), Expect = 1.6
 Identities = 6/60 (10%), Positives = 10/60 (16%)

Query: 47  PDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEEEGE 106
           P++      E                               +      +    D  EE E
Sbjct: 120 PENTSGSSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPE 179



 Score = 25.7 bits (56), Expect = 5.8
 Identities = 6/52 (11%), Positives = 12/52 (23%)

Query: 40  DAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKE 91
            A ++  P                      ++     +   ED  E+ E   
Sbjct: 131 PASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPT 182



 Score = 25.3 bits (55), Expect = 7.1
 Identities = 5/70 (7%), Positives = 13/70 (18%)

Query: 37  DGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKK 96
            G ++  +  P      E        E+      E                  +    + 
Sbjct: 95  SGSESVGSPTPSPSGSAEELASGLSPENTSGSSPESPASHSPPPSPPSHPGPHEPAPPES 154

Query: 97  RKHDEEEEGE 106
                 ++  
Sbjct: 155 HNPSPNQQPS 164


>gnl|CDD|130712 TIGR01651, CobT, cobaltochelatase, CobT subunit.  This model
           describes Pseudomonas denitrificans CobT gene product,
           which is a cobalt chelatase subunit that functions in
           cobalamin biosynthesis. Cobalamin (vitamin B12) can be
           synthesized via several pathways, including an aerobic
           pathway (found in Pseudomonas denitrificans) and an
           anaerobic pathway (found in P. shermanii and Salmonella
           typhimurium). These pathways differ in the point of
           cobalt insertion during corrin ring formation. There are
           apparently a number of variations on these two pathways,
           where the major differences seem to be concerned with
           the process of ring contraction. Confusion regarding the
           functions of enzymes found in the aerobic vs. anaerobic
           pathways has arisen because nonhomologous genes in these
           different pathways were given the same gene symbols.
           Thus, cobT in the aerobic pathway (P. denitrificans) is
           not a homolog of cobT in the anaerobic pathway (S.
           typhimurium). It should be noted that E. coli
           synthesizes cobalamin only when it is supplied with the
           precursor cobinamide, which is a complex intermediate.
           Additionally, all E. coli cobalamin synthesis genes
           (cobU, cobS and cobT) were named after their Salmonella
           typhimurium homologs which function in the anaerobic
           cobalamin synthesis pathway. This model describes the
           aerobic cobalamin pathway Pseudomonas denitrificans CobT
           gene product, which is a cobalt chelatase subunit, with
           a MW ~70 kDa. The aerobic pathway cobalt chelatase is a
           heterotrimeric, ATP-dependent enzyme that catalyzes
           cobalt insertion during cobalamin biosynthesis. The
           other two subunits are the P. denitrificans CobS
           (TIGR01650) and CobN (pfam02514 CobN/Magnesium
           Chelatase) proteins. To avoid potential confusion with
           the nonhomologous Salmonella typhimurium/E.coli cobT
           gene product, the P. denitrificans gene symbol is not
           used in the name of this model [Biosynthesis of
           cofactors, prosthetic groups, and carriers, Heme,
           porphyrin, and cobalamin].
          Length = 600

 Score = 30.3 bits (68), Expect = 0.15
 Identities = 22/60 (36%), Positives = 29/60 (48%), Gaps = 2/60 (3%)

Query: 49  SEDDEESEGDDYEGEDEELEEEED--DGLDEEDGEDEEEDTEGKEVRGKKRKHDEEEEGE 106
           S +  E  GDD E EDEE  +++   +   EE GE E E  EG   +  +    E E GE
Sbjct: 196 SMELAEEMGDDTESEDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRESESGE 255



 Score = 29.1 bits (65), Expect = 0.35
 Identities = 11/54 (20%), Positives = 22/54 (40%)

Query: 48  DSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDE 101
           + E  E S   + E  D E E  E++ +  +  +  +E  +  E  G+  +   
Sbjct: 232 EGEGQEGSAPQESEATDRESESGEEEMVQSDQDDLPDESDDDSETPGEGARPAR 285



 Score = 26.4 bits (58), Expect = 3.3
 Identities = 13/65 (20%), Positives = 29/65 (44%), Gaps = 1/65 (1%)

Query: 40  DAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKH 99
           D   +E  +  DD++   ++ E + E  E E  +G   ++ E  + ++E  E    +   
Sbjct: 205 DDTESEDEEDGDDDQPTENEQEEQGEG-EGEGQEGSAPQESEATDRESESGEEEMVQSDQ 263

Query: 100 DEEEE 104
           D+  +
Sbjct: 264 DDLPD 268



 Score = 26.1 bits (57), Expect = 4.8
 Identities = 14/72 (19%), Positives = 30/72 (41%), Gaps = 3/72 (4%)

Query: 38  GFDAKNNEAPDSEDDE-ESEGDDYEGEDEELEEEEDDGLDEEDGEDEEE--DTEGKEVRG 94
           G D ++ +  D +DD+      + +GE E   +E     + E  + E E  + E  +   
Sbjct: 204 GDDTESEDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRESESGEEEMVQSDQ 263

Query: 95  KKRKHDEEEEGE 106
                + +++ E
Sbjct: 264 DDLPDESDDDSE 275


>gnl|CDD|233914 TIGR02538, type_IV_pilB, type IV-A pilus assembly ATPase PilB.
           This model describes a protein of type IV pilus
           biogenesis designated PilB in Pseudomonas aeruginosa but
           PilF in Neisseria gonorrhoeae; the more common usage,
           reflected here, is PilB. This protein is an ATPase
           involved in protein export for pilin assembly and is
           closely related to GspE (TIGR02533) of type II
           secretion, also called the main terminal branch of the
           general secretion pathway. Note that type IV pilus
           systems are often divided into type IV-A and IV-B, with
           the latter group including bundle-forming pilus,
           mannose-sensitive hemagglutinin, etc. Members of this
           family are found in type IV-A systems [Cell envelope,
           Surface structures, Protein fate, Protein and peptide
           secretion and trafficking].
          Length = 564

 Score = 30.0 bits (68), Expect = 0.20
 Identities = 12/42 (28%), Positives = 19/42 (45%), Gaps = 1/42 (2%)

Query: 51  DDEESEGDDYEGEDEELEEEED-DGLDEEDGEDEEEDTEGKE 91
              +S   +   ED    EE D D +D+E  +D E+D    +
Sbjct: 137 GGSDSLAKELGDEDIGDLEELDVDAIDDEGPDDIEQDAVDDD 178



 Score = 29.6 bits (67), Expect = 0.24
 Identities = 9/40 (22%), Positives = 22/40 (55%)

Query: 48  DSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDT 87
           DS   E  + D  + E+ +++  +D+G D+ + +  ++D 
Sbjct: 140 DSLAKELGDEDIGDLEELDVDAIDDEGPDDIEQDAVDDDA 179


>gnl|CDD|216289 pfam01080, Presenilin, Presenilin.  Mutations in presenilin-1 are a
           major cause of early onset Alzheimer's disease. It has
           been found that presenilin-1 binds to beta-catenin
           in-vivo. This family also contains SPE proteins from
           C.elegans.
          Length = 403

 Score = 29.8 bits (67), Expect = 0.24
 Identities = 15/61 (24%), Positives = 27/61 (44%), Gaps = 2/61 (3%)

Query: 45  EAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEEE 104
           E P+     E   D  E +  E E + D  L  E+   E+ + +   +   +++ +EEE 
Sbjct: 267 ELPEKSSTPELSDD--ESDSSETESQSDSSLAPEEDAAEQPEVQSNSLPSNEKREEEEER 324

Query: 105 G 105
           G
Sbjct: 325 G 325


>gnl|CDD|218538 pfam05285, SDA1, SDA1.  This family consists of several SDA1
           protein homologues. SDA1 is a Saccharomyces cerevisiae
           protein which is involved in the control of the actin
           cytoskeleton. The protein is essential for cell
           viability and is localised in the nucleus.
          Length = 317

 Score = 29.6 bits (67), Expect = 0.24
 Identities = 15/63 (23%), Positives = 32/63 (50%)

Query: 44  NEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEE 103
           ++  +  + EE E  D EGE  ++E +++    + + E+E+++   K       +  EE+
Sbjct: 98  DDEEEEWEVEEDEDSDDEGEWIDVESDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEED 157

Query: 104 EGE 106
           E E
Sbjct: 158 EEE 160



 Score = 26.2 bits (58), Expect = 4.2
 Identities = 27/90 (30%), Positives = 48/90 (53%), Gaps = 6/90 (6%)

Query: 23  REKVFELLPNLKYLDGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLD------ 76
            E+V + +P L+ L+ +  +  +  ++E   ES+ DD E E+ E+EE+ED   +      
Sbjct: 62  EEEVVDGIPGLELLEKWKEEERKKKEAEQGLESDDDDDEEEEWEVEEDEDSDDEGEWIDV 121

Query: 77  EEDGEDEEEDTEGKEVRGKKRKHDEEEEGE 106
           E D E E  D+E +E + +  K  +E+  E
Sbjct: 122 ESDKEIESSDSEDEEEKDEAAKKAKEDSDE 151


>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
           (nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
           phosphoribosyltransferase) [Coenzyme metabolism].
          Length = 620

 Score = 29.8 bits (67), Expect = 0.25
 Identities = 17/53 (32%), Positives = 26/53 (49%)

Query: 48  DSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHD 100
           +SE  +ESE D+ E  D E EE E D  +  +  + +E  E  E  G+  +  
Sbjct: 253 ESEGSDESEEDEAEATDGEGEEGEMDAAEASEDSESDESDEDTETPGEDARPA 305



 Score = 29.0 bits (65), Expect = 0.41
 Identities = 17/68 (25%), Positives = 29/68 (42%), Gaps = 8/68 (11%)

Query: 47  PDSEDDEESEGDDYEGEDEELEEEEDDGLDE--------EDGEDEEEDTEGKEVRGKKRK 98
            D+++++  +      ED E   EE +G DE         DGE EE + +  E       
Sbjct: 229 EDADEEDGDDDQPDNNEDSEAGREESEGSDESEEDEAEATDGEGEEGEMDAAEASEDSES 288

Query: 99  HDEEEEGE 106
            + +E+ E
Sbjct: 289 DESDEDTE 296


>gnl|CDD|219240 pfam06957, COPI_C, Coatomer (COPI) alpha subunit C-terminus.  This
           family represents the C-terminus (approximately 500
           residues) of the eukaryotic coatomer alpha subunit.
           Coatomer (COPI) is a large cytosolic protein complex
           which forms a coat around vesicles budding from the
           Golgi apparatus. Such coatomer-coated vesicles have been
           proposed to play a role in many distinct steps of
           intracellular transport. Note that many family members
           also contain the pfam04053 domain.
          Length = 421

 Score = 29.4 bits (66), Expect = 0.33
 Identities = 18/62 (29%), Positives = 29/62 (46%), Gaps = 10/62 (16%)

Query: 44  NEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEE 103
            +   + D     GDD+ GED +L  +ED  L+ EDG  ++E+           +  +EE
Sbjct: 25  GDKAAAADFAVMGGDDW-GEDADLGLDEDGYLEGEDGLLDDEEGP---------EGGDEE 74

Query: 104 EG 105
            G
Sbjct: 75  GG 76



 Score = 26.4 bits (58), Expect = 2.9
 Identities = 15/39 (38%), Positives = 20/39 (51%), Gaps = 4/39 (10%)

Query: 50 EDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTE 88
          ++D   EG+D   +DEE  E  D    EE G D E+D  
Sbjct: 50 DEDGYLEGEDGLLDDEEGPEGGD----EEGGWDVEDDLV 84


>gnl|CDD|237378 PRK13406, bchD, magnesium chelatase subunit D; Provisional.
          Length = 584

 Score = 29.2 bits (66), Expect = 0.36
 Identities = 9/43 (20%), Positives = 14/43 (32%), Gaps = 6/43 (13%)

Query: 47  PDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEG 89
           P      E E              EDD    ED E++++  + 
Sbjct: 259 PAPPQPPEEE------PPPPPPPPEDDDDPPEDEEEQDDAEDR 295



 Score = 26.9 bits (60), Expect = 2.5
 Identities = 10/37 (27%), Positives = 16/37 (43%)

Query: 45  EAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGE 81
           + P+ E        + + +  E EEE+DD  D    E
Sbjct: 263 QPPEEEPPPPPPPPEDDDDPPEDEEEQDDAEDRALEE 299


>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
          Length = 619

 Score = 29.0 bits (66), Expect = 0.40
 Identities = 10/39 (25%), Positives = 19/39 (48%)

Query: 48  DSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEED 86
           D   D  +E D      E  E ++D+  +EE+ E+++  
Sbjct: 171 DGFVDPNAEEDPAHVGSELEELDDDEDEEEEEDENDDSL 209



 Score = 28.6 bits (65), Expect = 0.50
 Identities = 10/39 (25%), Positives = 23/39 (58%)

Query: 47  PDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEE 85
           P++E+D    G + E  D++ +EEE++  +++    +E 
Sbjct: 176 PNAEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADES 214



 Score = 26.3 bits (59), Expect = 3.4
 Identities = 13/38 (34%), Positives = 19/38 (50%)

Query: 44 NEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGE 81
           EAPD++D   +E +     DE+ EEE    L   + E
Sbjct: 61 EEAPDADDLLLAENEADAQTDEDAEEEAAAALSSVESE 98



 Score = 25.5 bits (57), Expect = 6.4
 Identities = 10/45 (22%), Positives = 20/45 (44%)

Query: 59  DYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEE 103
           D   +    E+    G + E+ +D+E++ E ++        DE E
Sbjct: 171 DGFVDPNAEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADESE 215


>gnl|CDD|140234 PTZ00207, PTZ00207, hypothetical protein; Provisional.
          Length = 591

 Score = 29.0 bits (65), Expect = 0.41
 Identities = 11/49 (22%), Positives = 21/49 (42%), Gaps = 6/49 (12%)

Query: 29  LLPNLKY-LDGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLD 76
           + P + + L  FD K        DD + +  ++    EE+   ED  ++
Sbjct: 289 IFPFMAFPLTTFDGKR-----PHDDSDGKAKEHVEAGEEVSAAEDKVVE 332


>gnl|CDD|237047 PRK12298, obgE, GTPase CgtA; Reviewed.
          Length = 390

 Score = 29.1 bits (66), Expect = 0.42
 Identities = 20/60 (33%), Positives = 32/60 (53%), Gaps = 2/60 (3%)

Query: 33  LKYLDGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEV 92
           + +++    +  E  ++ +  E   DDY  E  E  EEEDD  D+ D + +E+D EG EV
Sbjct: 329 MTFIEENPREEAEEAEAPEKVEFMWDDYHREQLEEVEEEDD--DDWDDDWDEDDDEGVEV 386


>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457).  This
          is a family of uncharacterized proteins.
          Length = 449

 Score = 28.8 bits (64), Expect = 0.46
 Identities = 16/43 (37%), Positives = 32/43 (74%), Gaps = 2/43 (4%)

Query: 45 EAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDT 87
          EA + EDD+E + DD + EDE+ ++++DD  ++++ ED+++ T
Sbjct: 48 EAMEEEDDDEEDDDDDDDEDEDDDDDDDD--EDDEDEDDDDST 88



 Score = 28.0 bits (62), Expect = 0.94
 Identities = 20/61 (32%), Positives = 31/61 (50%)

Query: 45  EAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEEE 104
           E  D +DD+E E DD + +DE+ E+E+DD     D    ++  E     G     DE ++
Sbjct: 57  EEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLHDDSSADDGNETDNEAGFADSDDESDD 116

Query: 105 G 105
           G
Sbjct: 117 G 117



 Score = 26.9 bits (59), Expect = 2.3
 Identities = 17/43 (39%), Positives = 29/43 (67%)

Query: 50 EDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEV 92
          E+  E E DD E +D++ +E+EDD  D++D +DE+ED +   +
Sbjct: 47 EEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTL 89



 Score = 26.1 bits (57), Expect = 4.8
 Identities = 14/51 (27%), Positives = 30/51 (58%)

Query: 40  DAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGK 90
           D    +  D +D++E + DD + ED+E E+++D  L ++   D+  +T+ +
Sbjct: 54  DDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLHDDSSADDGNETDNE 104



 Score = 25.3 bits (55), Expect = 8.9
 Identities = 10/56 (17%), Positives = 32/56 (57%)

Query: 50  EDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEEEG 105
           ++ EE   ++ + ++E+ ++++D+  D++D +D+E+D +  +           ++G
Sbjct: 43  KEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLHDDSSADDG 98


>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1.
          Members of this family are necessary for accurate
          chromosome transmission during cell division.
          Length = 804

 Score = 29.0 bits (65), Expect = 0.46
 Identities = 12/53 (22%), Positives = 23/53 (43%), Gaps = 4/53 (7%)

Query: 37 DGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEG 89
                   + D+ D E++    Y+G  ++L+EE+D   DE  G   +   + 
Sbjct: 14 RAPAGGGGGSFDAIDFEDT----YDGLGDQLDEEDDALNDETFGGVADVGKDF 62



 Score = 27.4 bits (61), Expect = 1.4
 Identities = 14/49 (28%), Positives = 18/49 (36%), Gaps = 8/49 (16%)

Query: 38 GFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEED 86
          GFD           D    G      D    E+  DGL ++   DEE+D
Sbjct: 5  GFDTSLPP------DRAPAGGGGGSFDAIDFEDTYDGLGDQ--LDEEDD 45


>gnl|CDD|218598 pfam05470, eIF-3c_N, Eukaryotic translation initiation factor 3
           subunit 8 N-terminus.  The largest of the mammalian
           translation initiation factors, eIF3, consists of at
           least eight subunits ranging in mass from 35 to 170 kDa.
           eIF3 binds to the 40 S ribosome in an early step of
           translation initiation and promotes the binding of
           methionyl-tRNAi and mRNA.
          Length = 593

 Score = 28.6 bits (64), Expect = 0.56
 Identities = 28/69 (40%), Positives = 35/69 (50%), Gaps = 6/69 (8%)

Query: 42  KNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEED----TEGKEVRGKKR 97
           KNN+    EDD     +D E EDEE EE+EDD  D  D EDE+ED    TE      +  
Sbjct: 119 KNNK--QFEDDITRYREDPESEDEEEEEDEDDDDDGSDDEDEDEDGVGATEEVAASSESG 176

Query: 98  KHDEEEEGE 106
               +E+ E
Sbjct: 177 VDRVKEDDE 185



 Score = 27.8 bits (62), Expect = 0.94
 Identities = 20/80 (25%), Positives = 40/80 (50%), Gaps = 16/80 (20%)

Query: 41  AKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGL----------------DEEDGEDEE 84
            +  E P+SED+EE E +D + +  + E+E++DG+                 +ED E++E
Sbjct: 129 TRYREDPESEDEEEEEDEDDDDDGSDDEDEDEDGVGATEEVAASSESGVDRVKEDDEEDE 188

Query: 85  EDTEGKEVRGKKRKHDEEEE 104
           +    K+   ++ K  ++ E
Sbjct: 189 DADLSKKDVLEEPKMFKKPE 208


>gnl|CDD|220605 pfam10156, Med17, Subunit 17 of Mediator complex.  This Mediator
          complex subunit was formerly known as Srb4 in yeasts or
          Trap80 in Drosophila and human. The Med17 subunit is
          located within the head domain and is essential for
          cell viability to the extent that a mutant strain of
          cerevisiae lacking it shows all RNA polymerase
          II-dependent transcription ceasing at non-permissive
          temperatures.
          Length = 454

 Score = 28.5 bits (64), Expect = 0.58
 Identities = 14/38 (36%), Positives = 21/38 (55%)

Query: 51 DDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTE 88
            E ++ D  E  DEE ++EEDD  D E+ +D  E+  
Sbjct: 59 AKEAAKIDFSEESDEEEDDEEDDNDDSEENKDTVEEFP 96


>gnl|CDD|223052 PHA03346, PHA03346, US22 family homolog; Provisional.
          Length = 520

 Score = 28.6 bits (64), Expect = 0.60
 Identities = 11/63 (17%), Positives = 27/63 (42%), Gaps = 13/63 (20%)

Query: 40  DAKNNEAPDSEDDEESEGDD-------------YEGEDEELEEEEDDGLDEEDGEDEEED 86
             +    P++ED+ +  G               +  +D+E ++E+DD  ++   + +E+D
Sbjct: 392 RRRPRGPPENEDEADGGGGLAPPPEPPRPPPPPFPDDDDEGDDEDDDDWEDLGFDLDEDD 451

Query: 87  TEG 89
              
Sbjct: 452 VYD 454



 Score = 25.1 bits (55), Expect = 9.7
 Identities = 13/40 (32%), Positives = 20/40 (50%)

Query: 46  APDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEE 85
            P    D++ EGDD + +D E    + D  D  D +D +E
Sbjct: 421 PPPPFPDDDDEGDDEDDDDWEDLGFDLDEDDVYDLKDVDE 460


>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin.  Nucleoplasmins are also
           known as chromatin decondensation proteins. They bind to
           core histones and transfer DNA to them in a reaction
           that requires ATP. This is thought to play a role in the
           assembly of regular nucleosomal arrays.
          Length = 146

 Score = 28.1 bits (63), Expect = 0.60
 Identities = 17/38 (44%), Positives = 26/38 (68%)

Query: 56  EGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVR 93
             +D   +DEE EEEEDD  D+++ E EEE++  K+V+
Sbjct: 109 SEEDESDDDEEDEEEEDDEEDDDEDESEEEESPVKKVK 146



 Score = 25.7 bits (57), Expect = 4.0
 Identities = 14/31 (45%), Positives = 23/31 (74%)

Query: 55  SEGDDYEGEDEELEEEEDDGLDEEDGEDEEE 85
           SE D+ + ++E+ EEE+D+  D+ED  +EEE
Sbjct: 109 SEEDESDDDEEDEEEEDDEEDDDEDESEEEE 139


>gnl|CDD|130366 TIGR01299, synapt_SV2, synaptic vesicle protein SV2.  This model
           describes a tightly conserved subfamily of the larger
           family of sugar (and other) transporters described by
           PFAM model pfam00083. Members of this subfamily include
           closely related forms SV2A and SV2B of synaptic vesicle
           protein from vertebrates and a more distantly related
           homolog (below trusted cutoff) from Drosophila
           melanogaster. Members are predicted to have two sets of
           six transmembrane helices.
          Length = 742

 Score = 28.4 bits (63), Expect = 0.61
 Identities = 18/54 (33%), Positives = 27/54 (50%), Gaps = 1/54 (1%)

Query: 45  EAPDSEDDEESEGDDY-EGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKR 97
           E  D +DD  +  D Y  GE  + EEE +   D  +G DE+++    E +G  R
Sbjct: 50  EDEDDDDDFPAPADGYSRGEAADDEEEGEASSDATEGHDEDDEIYEGEYQGIPR 103


>gnl|CDD|221931 pfam13136, DUF3984, Protein of unknown function (DUF3984).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in eukaryotes. Proteins in
           this family are typically between 393 and 442 amino
           acids in length.
          Length = 301

 Score = 28.1 bits (63), Expect = 0.74
 Identities = 13/57 (22%), Positives = 18/57 (31%), Gaps = 8/57 (14%)

Query: 47  PDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEE 103
            +   D    G D+  ED   E    D        D E ++E +         DE E
Sbjct: 224 AEDSRDTVLLGPDFVDEDIRAEMASIDDESFSSLSDSESESEDEI--------DEAE 272


>gnl|CDD|223041 PHA03321, PHA03321, tegument protein VP11/12; Provisional.
          Length = 694

 Score = 28.4 bits (63), Expect = 0.77
 Identities = 13/72 (18%), Positives = 22/72 (30%), Gaps = 20/72 (27%)

Query: 46  APDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGED------------EEEDTEGKEVR 93
            P +    E   D Y        E +DD  D  DG               ++D   + V 
Sbjct: 527 GPPAAAPPEQMEDPYL-------EPDDDRFDRRDGAAAAATSHPREAPAPDDDPIYEGVS 579

Query: 94  -GKKRKHDEEEE 104
             ++  ++E   
Sbjct: 580 DSEEPVYEEIPT 591


>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein.  The proteins in this family
           are designated YL1. These proteins have been shown to be
           DNA-binding and may be a transcription factor.
          Length = 238

 Score = 28.1 bits (63), Expect = 0.78
 Identities = 15/50 (30%), Positives = 29/50 (58%)

Query: 53  EESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEE 102
           EE + +++E E+EE EEE D   D+ + ++ E D E +  +  +R+   +
Sbjct: 36  EEEDDEEFEIEEEEEEEEVDSDFDDSEDDEPESDDEEEGEKELQREERLK 85



 Score = 25.8 bits (57), Expect = 4.8
 Identities = 16/55 (29%), Positives = 31/55 (56%)

Query: 44 NEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRK 98
           E  D E+ E  E ++ E  D + ++ EDD  + +D E+ E++ + +E   KK++
Sbjct: 35 EEEEDDEEFEIEEEEEEEEVDSDFDDSEDDEPESDDEEEGEKELQREERLKKKKR 89


>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
          chain; Provisional.
          Length = 1033

 Score = 28.2 bits (63), Expect = 0.82
 Identities = 18/60 (30%), Positives = 31/60 (51%), Gaps = 1/60 (1%)

Query: 40 DAKNNEAPDSEDDEESE-GDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRK 98
          +  N +A + ED+EE E      G D + +E   +  DE++ +DEE ++  K    K+ K
Sbjct: 3  EQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKAEISKREK 62



 Score = 26.7 bits (59), Expect = 3.1
 Identities = 15/47 (31%), Positives = 17/47 (36%)

Query: 45 EAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKE 91
          E          E D+ E E        D   DE   EDE+ED E  E
Sbjct: 1  EEEQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDE 47



 Score = 25.5 bits (56), Expect = 6.5
 Identities = 13/46 (28%), Positives = 21/46 (45%)

Query: 61  EGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEEEGE 106
           E  + +  EEED+   E        D++  EV  +    DEE++ E
Sbjct: 3   EQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEE 48



 Score = 25.1 bits (55), Expect = 8.4
 Identities = 15/63 (23%), Positives = 31/63 (49%), Gaps = 9/63 (14%)

Query: 50  EDDEESEGDDYEGEDEELE---------EEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHD 100
           E+++ +   + E ++EELE          ++D+   E++ EDEE+D E +     +    
Sbjct: 1   EEEQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKAEISKR 60

Query: 101 EEE 103
           E+ 
Sbjct: 61  EKA 63



 Score = 25.1 bits (55), Expect = 9.9
 Identities = 9/52 (17%), Positives = 22/52 (42%)

Query: 54  ESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEEEG 105
           E E  + +  +EE EEE +        + ++++   ++    +   +E E  
Sbjct: 1   EEEQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESP 52


>gnl|CDD|233224 TIGR00993, 3a0901s04IAP86, chloroplast protein import component
           Toc86/159, G and M domains.  The long precursor of the
           86K protein originally described is proposed to have
           three domains. The N-terminal A-domain is acidic,
           repetitive, weakly conserved, readily removed by
           proteolysis during chloroplast isolation, and not
           required for protein translocation. The other domains
           are designated G (GTPase) and M (membrane anchor); this
           family includes most of the G domain and all of M
           [Transport and binding proteins, Amino acids, peptides
           and amines].
          Length = 763

 Score = 28.4 bits (63), Expect = 0.82
 Identities = 16/30 (53%), Positives = 18/30 (60%), Gaps = 2/30 (6%)

Query: 54  ESEGDDYEGEDEELEEEEDDGLDEEDGEDE 83
           E +G D E  D ELE+  D   DEE GEDE
Sbjct: 379 EQQGGDEEDSDIELEDSSDS--DEESGEDE 406


>gnl|CDD|240226 PTZ00007, PTZ00007, (NAP-L) nucleosome assembly protein -L;
           Provisional.
          Length = 337

 Score = 27.8 bits (62), Expect = 0.90
 Identities = 14/55 (25%), Positives = 24/55 (43%)

Query: 45  EAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKH 99
           EA D + D  S+ DD + +  +  +      + +   +EE+D   KE  G K   
Sbjct: 278 EAIDEDSDYSSDEDDDDYDSYDSSDSASSDSNSDVDTNEEDDRGEKESNGAKSNE 332



 Score = 25.1 bits (55), Expect = 9.2
 Identities = 8/53 (15%), Positives = 20/53 (37%)

Query: 50  EDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEE 102
           ++D +   D+ + + +  +  +    D     D  E+ +  E      K +E 
Sbjct: 281 DEDSDYSSDEDDDDYDSYDSSDSASSDSNSDVDTNEEDDRGEKESNGAKSNEL 333


>gnl|CDD|216269 pfam01056, Myc_N, Myc amino-terminal region.  The myc family
           belongs to the basic helix-loop-helix leucine zipper
           class of transcription factors, see pfam00010. Myc forms
           a heterodimer with Max, and this complex regulates cell
           growth through direct activation of genes involved in
           cell replication. Mutations in the C-terminal 20
           residues of this domain cause unique changes in the
           induction of apoptosis, transformation, and G2 arrest.
          Length = 329

 Score = 27.6 bits (61), Expect = 1.1
 Identities = 16/35 (45%), Positives = 19/35 (54%)

Query: 38  GFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEED 72
           G     N +  S  D ESE D+ E E+EE EEE D
Sbjct: 211 GLRTPPNSSSSSGSDSESEEDEEEEEEEEEEEEID 245



 Score = 27.6 bits (61), Expect = 1.2
 Identities = 14/40 (35%), Positives = 20/40 (50%), Gaps = 2/40 (5%)

Query: 62  GEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDE 101
           G D E EE+E++  +EE+ E+EE D    E R        
Sbjct: 223 GSDSESEEDEEE--EEEEEEEEEIDVVTVEKRRSSSNRKA 260


>gnl|CDD|115196 pfam06524, NOA36, NOA36 protein.  This family consists of several
           NOA36 proteins which contain 29 highly conserved
           cysteine residues. The function of this protein is
           unknown.
          Length = 314

 Score = 27.6 bits (61), Expect = 1.1
 Identities = 12/47 (25%), Positives = 23/47 (48%), Gaps = 2/47 (4%)

Query: 42  KNNEAPDSEDDEESEGDDYEGEDEELEE--EEDDGLDEEDGEDEEED 86
           K N + +   D +     YE +D+E     + D+  D +D ++E+ D
Sbjct: 253 KQNLSSNKYGDFKGVNYGYESDDDEGSSSNDYDEEEDGDDDDNEDND 299



 Score = 27.6 bits (61), Expect = 1.2
 Identities = 17/52 (32%), Positives = 25/52 (48%), Gaps = 3/52 (5%)

Query: 37  DGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTE 88
            G     N   +S+DDE S  +DY   DEE + ++DD  D +D      D +
Sbjct: 261 YGDFKGVNYGYESDDDEGSSSNDY---DEEEDGDDDDNEDNDDTNTNHGDED 309



 Score = 27.2 bits (60), Expect = 1.6
 Identities = 8/46 (17%), Positives = 20/46 (43%)

Query: 43  NNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTE 88
           +N+  D +        D +      + +E++  D++D ED ++   
Sbjct: 258 SNKYGDFKGVNYGYESDDDEGSSSNDYDEEEDGDDDDNEDNDDTNT 303


>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
           subunit 1; Provisional.
          Length = 319

 Score = 27.7 bits (62), Expect = 1.2
 Identities = 13/44 (29%), Positives = 26/44 (59%)

Query: 40  DAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDE 83
           +    E  +  ++EE E D  E EDE+ E+E+++  +++D  D+
Sbjct: 276 EEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGDK 319



 Score = 26.9 bits (60), Expect = 1.7
 Identities = 15/44 (34%), Positives = 28/44 (63%)

Query: 48  DSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKE 91
           + + +E  E  + E E+++  E ED+  ++ED E+EE+D EG +
Sbjct: 276 EEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGDK 319



 Score = 26.2 bits (58), Expect = 3.3
 Identities = 12/43 (27%), Positives = 26/43 (60%)

Query: 49  SEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKE 91
             D+E+ E    + E+EE E++  +  DE++ +++EE+ E  +
Sbjct: 273 GGDEEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDD 315



 Score = 26.2 bits (58), Expect = 3.9
 Identities = 14/43 (32%), Positives = 27/43 (62%)

Query: 43  NNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEE 85
            +E    E  E++E ++ E +  E E+E+++  DEE+ ED++E
Sbjct: 274 GDEEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDE 316



 Score = 25.4 bits (56), Expect = 7.5
 Identities = 12/43 (27%), Positives = 26/43 (60%)

Query: 62  GEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEEE 104
           G DEE  EE  +  +EE+ ED+  ++E ++   +  + +E+++
Sbjct: 273 GGDEEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDD 315


>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein.  The YqfQ-like protein family
           includes the B. subtilis YqfQ protein, also known as
           VrrA, which is functionally uncharacterized. This family
           of proteins is found in bacteria. Proteins in this
           family are typically between 146 and 237 amino acids in
           length. There are two conserved sequence motifs: QYGP
           and PKLY.
          Length = 155

 Score = 27.4 bits (61), Expect = 1.2
 Identities = 13/51 (25%), Positives = 25/51 (49%)

Query: 55  SEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEEEG 105
           S  DD E E EE   +E +  D  + + E ++ + +EV   K + ++ +  
Sbjct: 92  SSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTE 142



 Score = 26.3 bits (58), Expect = 2.7
 Identities = 10/31 (32%), Positives = 15/31 (48%)

Query: 76  DEEDGEDEEEDTEGKEVRGKKRKHDEEEEGE 106
           D+E+ E EEE T+  E         E +E +
Sbjct: 95  DDEEEETEEESTDETEQEDPPETKTESKEKK 125


>gnl|CDD|236987 PRK11798, PRK11798, ClpXP protease specificity-enhancing factor;
           Provisional.
          Length = 138

 Score = 27.1 bits (61), Expect = 1.2
 Identities = 8/31 (25%), Positives = 12/31 (38%)

Query: 69  EEEDDGLDEEDGEDEEEDTEGKEVRGKKRKH 99
           E E    ++E  +D + D E        R H
Sbjct: 103 EPEAAYDEDEHDDDTDPDDEPPPPPRGGRPH 133


>gnl|CDD|219572 pfam07780, Spb1_C, Spb1 C-terminal domain.  This presumed domain is
           found at the C-terminus of a family of FtsJ-like
           methyltransferases. Members of this family are involved
           in 60S ribosomal biogenesis.
          Length = 212

 Score = 27.3 bits (61), Expect = 1.2
 Identities = 15/53 (28%), Positives = 29/53 (54%)

Query: 48  DSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHD 100
            S++D+E E    E ++++++++EDD  D+ D    E     K +  KK + D
Sbjct: 3   KSDEDDEFEVVPAEEDEKKVDKDEDDDKDDIDDLTAEALALAKLLARKKTRED 55


>gnl|CDD|233870 TIGR02442, Cob-chelat-sub, cobaltochelatase subunit.
           Cobaltochelatase is responsible for the insertion of
           cobalt into the corrin ring of coenzyme B12 during its
           biosynthesis. Two versions have been well described.
           CbiK/CbiX is a monomeric, anaerobic version which acts
           early in the biosynthesis (pfam06180). CobNST is a
           trimeric, ATP-dependent, aerobic version which acts late
           in the biosynthesis (TIGR02257/TIGR01650/TIGR01651). A
           number of genomes (actinobacteria, cyanobacteria,
           betaproteobacteria and pseudomonads) which apparently
           biosynthesize B12, encode a cobN gene but are
           demonstrably lacking cobS and cobT. These genomes do,
           however contain a homolog (modelled here) of the
           magnesium chelatase subunits BchI/BchD family. Aside
           from the cyanobacteria (which have a separate magnesium
           chelatase trimer), these species do not make chlorins,
           so do not have any use for a magnesium chelatase.
           Furthermore, in nearly all cases the members of this
           family are proximal to either CobN itself or other genes
           involved in cobalt transport or B12 biosynthesis.
          Length = 633

 Score = 27.7 bits (62), Expect = 1.2
 Identities = 20/87 (22%), Positives = 36/87 (41%), Gaps = 3/87 (3%)

Query: 8   HVDLFNNDVTTIENYREKVFELLPNLKYLDGFDAKNNEAPDSEDDE-ESEGDDYEGEDEE 66
              L      T E+ RE    +LP+ +    F+    +     +++ E  G D E  DE 
Sbjct: 283 LAALDGRRRVTAEDVREAAELVLPHRRRRKPFEQPQGKDEKDLEEKPEEPGPDPEKPDEG 342

Query: 67  LEEEEDDG--LDEEDGEDEEEDTEGKE 91
            ++ E  G       G D+E++ + +E
Sbjct: 343 EDDAEQSGPRGHPTPGNDDEKEPDPQE 369



 Score = 26.6 bits (59), Expect = 2.9
 Identities = 14/48 (29%), Positives = 20/48 (41%)

Query: 48  DSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGK 95
           + EDD E  G          +E+E D  +E DG+    D  G   R +
Sbjct: 341 EGEDDAEQSGPRGHPTPGNDDEKEPDPQEEADGQGSSTDPAGDIFRIR 388


>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
           Genome duplication is precisely regulated by
           cyclin-dependent kinases CDKs, which bring about the
           onset of S phase by activating replication origins and
           then prevent relicensing of origins until mitosis is
           completed. The optimum sequence motif for CDK
           phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
           to have at least 11 potential phosphorylation sites.
           Drc1 is required for DNA synthesis and S-M replication
           checkpoint control. Drc1 associates with Cdc2 and is
           phosphorylated at the onset of S phase when Cdc2 is
           activated. Thus Cdc2 promotes DNA replication by
           phosphorylating Drc1 and regulating its association with
           Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
           substrates required for DNA replication.
          Length = 397

 Score = 27.5 bits (61), Expect = 1.2
 Identities = 12/46 (26%), Positives = 18/46 (39%)

Query: 53  EESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRK 98
           EE   DD      E+E EE      E+   + +    +  R  K+K
Sbjct: 242 EEESIDDELDVLREIEAEEAGIGPIEEEVVDSQAANDEPRRVFKKK 287


>gnl|CDD|217861 pfam04050, Upf2, Up-frameshift suppressor 2.  Transcripts
           harbouring premature signals for translation termination
           are recognised and rapidly degraded by eukaryotic cells
           through a pathway known as nonsense-mediated mRNA decay.
           In Saccharomyces cerevisiae, three trans-acting factors
           (Upf1 to Upf3) are required for nonsense-mediated mRNA
           decay.
          Length = 171

 Score = 27.3 bits (61), Expect = 1.3
 Identities = 18/64 (28%), Positives = 34/64 (53%)

Query: 43  NNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEE 102
            +E+   + +E+ E  + + +DE  +EEE D  D+E  E+ + + E   V  ++ + D E
Sbjct: 2   GSESESDDGEEDEELPEEDEDDESSDEEEVDLPDDEQDEESDSEEEQIFVTRQEEEVDPE 61

Query: 103 EEGE 106
            E E
Sbjct: 62  AEAE 65


>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740).  This
           family consists of several uncharacterized plant
           proteins of unknown function.
          Length = 565

 Score = 27.7 bits (61), Expect = 1.4
 Identities = 16/54 (29%), Positives = 26/54 (48%), Gaps = 7/54 (12%)

Query: 47  PDSEDDEESEGDDYEGEDEELEEEEDDGLDEED-------GEDEEEDTEGKEVR 93
            D E  +    D    E+EE+E EED+   E++         +EEE+ E K ++
Sbjct: 126 IDPEPRKPIVPDLVLEEEEEVEMEEDEEYYEKEPGKVVDEKSEEEEEEELKTMK 179



 Score = 25.4 bits (55), Expect = 7.9
 Identities = 12/36 (33%), Positives = 18/36 (50%)

Query: 37  DGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEED 72
           D    +  E    ED+E  E +  +  DE+ EEEE+
Sbjct: 137 DLVLEEEEEVEMEEDEEYYEKEPGKVVDEKSEEEEE 172


>gnl|CDD|222792 PHA00435, PHA00435, capsid assembly protein.
          Length = 306

 Score = 27.5 bits (61), Expect = 1.4
 Identities = 20/58 (34%), Positives = 29/58 (50%), Gaps = 3/58 (5%)

Query: 37 DGFDAKNNEAPDSEDDEESEGDDYEGEDEE--LEEEEDDGLDEEDGEDEEEDTEGKEV 92
          DG DA     P++ DD      D  GED+E  +E    +  +EE+ E+ EED E +  
Sbjct: 40 DGDDAIELAEPETSDDPYG-NPDPFGEDDEGRIEVRISEDGEEEEVEEGEEDEEEEGE 96


>gnl|CDD|217059 pfam02480, Herpes_gE, Alphaherpesvirus glycoprotein E.
           Glycoprotein E (gE) of Alphaherpesvirus forms a complex
           with glycoprotein I (gI) (pfam01688), functioning as an
           immunoglobulin G (IgG) Fc binding protein. gE is
           involved in virus spread but is not essential for
           propagation.
          Length = 437

 Score = 27.4 bits (61), Expect = 1.6
 Identities = 5/27 (18%), Positives = 12/27 (44%)

Query: 41  AKNNEAPDSEDDEESEGDDYEGEDEEL 67
             N    D   D ES+ ++++ + +  
Sbjct: 399 PTNELFEDVSSDSESDDEEFDSDSDRE 425


>gnl|CDD|217373 pfam03115, Astro_capsid, Astrovirus capsid protein precursor.  This
           product is encoded by astrovirus ORF2, one of the three
           astrovirus ORFs (1a, 1b, 2). The 87kD precursor protein
           undergoes an intracellular cleavage to form a 79kD
           protein. Subsequently, extracellular trypsin cleavage
           yields the three proteins forming the infectious virion.
          Length = 787

 Score = 27.4 bits (61), Expect = 1.7
 Identities = 16/51 (31%), Positives = 26/51 (50%), Gaps = 4/51 (7%)

Query: 45  EAPDSEDD----EESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKE 91
           +APD  +D    EE++ +D   EDE+ E +  D  D    E E++D   + 
Sbjct: 661 DAPDPYEDLISLEETDTEDESTEDEDDELDRFDLHDSSGSEPEDDDENNRV 711


>gnl|CDD|227504 COG5177, COG5177, Uncharacterized conserved protein [Function
           unknown].
          Length = 769

 Score = 27.4 bits (60), Expect = 1.7
 Identities = 16/59 (27%), Positives = 23/59 (38%), Gaps = 4/59 (6%)

Query: 52  DEESEGDDYEGEDEELEEEEDDGLDEEDGE----DEEEDTEGKEVRGKKRKHDEEEEGE 106
           DEE E      E+  +   +DD   E D E    DEE   +  E   +    +EE +  
Sbjct: 395 DEEEEDGQCNDEESTMSAIDDDDPKENDNEEVAGDEESAIDDNEGFEELSPEEEERQLR 453



 Score = 25.8 bits (56), Expect = 5.8
 Identities = 20/57 (35%), Positives = 28/57 (49%)

Query: 36  LDGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEV 92
           +D  D K N+  +   DEES  DD EG +E   EEE+  L E    ++E+     E 
Sbjct: 413 IDDDDPKENDNEEVAGDEESAIDDNEGFEELSPEEEERQLREFRDMEKEDREFPDEA 469



 Score = 25.4 bits (55), Expect = 7.7
 Identities = 16/65 (24%), Positives = 29/65 (44%)

Query: 40  DAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKH 99
           + ++ +  D E    +  DD   E++  E   D+    +D E  EE +  +E R  +   
Sbjct: 397 EEEDGQCNDEESTMSAIDDDDPKENDNEEVAGDEESAIDDNEGFEELSPEEEERQLREFR 456

Query: 100 DEEEE 104
           D E+E
Sbjct: 457 DMEKE 461



 Score = 25.0 bits (54), Expect = 9.6
 Identities = 19/81 (23%), Positives = 31/81 (38%), Gaps = 1/81 (1%)

Query: 24  EKVFELLPNLKYLDGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGL-DEEDGED 82
           +  ++ L  ++  D       E    +         Y+ +  E EEEED    DEE    
Sbjct: 352 DFEYDGLTTVRIDDHGFLPGREQTSKKAAVPKGTSFYQAKWAEDEEEEDGQCNDEESTMS 411

Query: 83  EEEDTEGKEVRGKKRKHDEEE 103
             +D + KE   ++   DEE 
Sbjct: 412 AIDDDDPKENDNEEVAGDEES 432


>gnl|CDD|216257 pfam01034, Syndecan, Syndecan domain.  Syndecans are
          transmembrane heparin sulfate proteoglycans which are
          implicated in the binding of extracellular matrix
          components and growth factors.
          Length = 207

 Score = 27.0 bits (60), Expect = 1.8
 Identities = 15/48 (31%), Positives = 21/48 (43%), Gaps = 6/48 (12%)

Query: 41 AKNNEAPDSEDDEESEGDDYEGEDEELEEEED------DGLDEEDGED 82
          A      +   DEE  GDD E  D+E+++E           D+ED E 
Sbjct: 23 AAAEYPDERYLDEEGSGDDDEFIDDEMDDEYSGSGSGATPSDDEDSEP 70


>gnl|CDD|223059 PHA03367, PHA03367, single-stranded DNA binding protein; Provisional.
          Length = 1115

 Score = 27.3 bits (61), Expect = 2.0
 Identities = 11/48 (22%), Positives = 18/48 (37%), Gaps = 4/48 (8%)

Query: 54   ESEGDDYEGEDEELEEEEDD---GLDEEDGEDEEEDTEGKEVRGKKRK 98
            E+  D   G++ E    E+    G+   D            + GKKR+
Sbjct: 1068 EAVADAPGGDELEFVAVEEPQSVGVQAGDEF-LLPGGGSGLLAGKKRR 1114


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
           the C subunit of DNA polymerase delta. It carries the
           essential residues for binding to the Pol1 subunit of
           polymerase alpha, from residues 293-332, which are
           characterized by the motif D--G--VT, referred to as the
           DPIM motif. The first 160 residues of the protein form
           the minimal domain for binding to the B subunit, Cdc1,
           of polymerase delta, the final 10 C-terminal residues,
           362-372, being the DNA sliding clamp, PCNA, binding
           motif.
          Length = 427

 Score = 27.1 bits (60), Expect = 2.1
 Identities = 16/59 (27%), Positives = 25/59 (42%)

Query: 40  DAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRK 98
             K     + ED+E     +   E+EE EE E   L +++ E EE        R + R+
Sbjct: 297 RLKKMMEDEDEDEEMEIVPESPVEEEESEEPEPPPLPKKEEEKEEVTVSPDGGRRRGRR 355


>gnl|CDD|179525 PRK03003, PRK03003, GTP-binding protein Der; Reviewed.
          Length = 472

 Score = 26.9 bits (60), Expect = 2.2
 Identities = 9/34 (26%), Positives = 14/34 (41%)

Query: 56 EGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEG 89
          + +     D    +E D  LD+ED  + E    G
Sbjct: 3  DDEAEMRADGTWADESDWELDDEDLAELEAAEGG 36


>gnl|CDD|215628 PLN03196, PLN03196, MOC1-like protein; Provisional.
          Length = 487

 Score = 27.0 bits (60), Expect = 2.4
 Identities = 14/45 (31%), Positives = 20/45 (44%), Gaps = 1/45 (2%)

Query: 48  DSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEV 92
           D  + EE E     G   E+   E    DEED +D++E    + V
Sbjct: 442 DFIEGEEMEPLFSMGGKLEMPGSESVS-DEEDDDDDDEVLYRRTV 485



 Score = 25.1 bits (55), Expect = 9.6
 Identities = 10/35 (28%), Positives = 14/35 (40%)

Query: 45  EAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEED 79
           E  + E      G       E + +EEDD  D+E 
Sbjct: 445 EGEEMEPLFSMGGKLEMPGSESVSDEEDDDDDDEV 479


>gnl|CDD|221173 pfam11702, DUF3295, Protein of unknown function (DUF3295).  This
           family is conserved in fungi but the function is not
           known.
          Length = 509

 Score = 26.8 bits (59), Expect = 2.4
 Identities = 13/35 (37%), Positives = 21/35 (60%)

Query: 49  SEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDE 83
           S+DDE++   + +  DE   E++DD  D ED  +E
Sbjct: 284 SDDDEDAIETEEDDVDESAIEDDDDDSDWEDSVEE 318


>gnl|CDD|218055 pfam04386, SspB, Stringent starvation protein B.  Escherichia coli
           stringent starvation protein B (SspB), is thought to
           enhance the specificity of degradation of tmRNA-tagged
           proteins by the ClpXP protease. The tmRNA tag, also
           known as ssrA, is an 11-aa peptide added to the C
           terminus of proteins stalled during translation, targets
           proteins for degradation by ClpXP and ClpAP. SspB a
           cytoplasmic protein that specifically binds to residues
           1-4 and 7 of the tag. Binding of SspB enhances
           degradation of tagged proteins by ClpX, and masks
           sequence elements important for ClpA interactions,
           inhibiting degradation by ClpA. However, more recent
           work has cast doubt on the importance of SspB in
           wild-type cells. SspB is encoded in an operon whose
           synthesis is stimulated by carbon, amino acid, and
           phosphate starvation. SspB may play a special role
           during nutrient stress, for example by ensuring rapid
           degradation of the products of stalled translation,
           without causing a global increase in degradation of all
           ClpXP substrates.
          Length = 153

 Score = 26.4 bits (59), Expect = 2.6
 Identities = 10/39 (25%), Positives = 19/39 (48%)

Query: 63  EDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDE 101
            +E  EEEE++  D +D  ++E+       + K R   +
Sbjct: 112 PEEADEEEEEEDDDADDSPEDEDPEPKDPPKTKGRPSLK 150



 Score = 24.9 bits (55), Expect = 9.5
 Identities = 9/33 (27%), Positives = 18/33 (54%)

Query: 58  DDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGK 90
           +  E E+EE ++ +D   DE+    +   T+G+
Sbjct: 114 EADEEEEEEDDDADDSPEDEDPEPKDPPKTKGR 146


>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT.  This family
           consists of several bacterial cobalamin biosynthesis
           (CobT) proteins. CobT is involved in the transformation
           of precorrin-3 into cobyrinic acid.
          Length = 282

 Score = 26.7 bits (59), Expect = 2.6
 Identities = 17/62 (27%), Positives = 31/62 (50%), Gaps = 2/62 (3%)

Query: 45  EAPDSEDDEESEGDDYEGEDEELEEEE--DDGLDEEDGEDEEEDTEGKEVRGKKRKHDEE 102
           E+ DSED+E+ +    + +D++ EEEE        ED +   E+ E  E+   +   D+ 
Sbjct: 217 ESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEMEAAEASADDT 276

Query: 103 EE 104
            +
Sbjct: 277 PD 278



 Score = 25.2 bits (55), Expect = 7.6
 Identities = 18/66 (27%), Positives = 32/66 (48%), Gaps = 4/66 (6%)

Query: 41  AKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHD 100
           +  + A +  D+ ES  D  + EDE+  +E++   D++ GE+EE  +            +
Sbjct: 204 SSMDMAEELGDEPESA-DSEDNEDEDDPKEDE---DDDQGEEEESGSSDSLSEDSDASSE 259

Query: 101 EEEEGE 106
           E E GE
Sbjct: 260 EMESGE 265


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
           family consists of several hypothetical bacterial
           proteins of around 200 residues in length. The function
           of this family is unknown.
          Length = 214

 Score = 26.6 bits (59), Expect = 2.6
 Identities = 15/64 (23%), Positives = 30/64 (46%)

Query: 40  DAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKH 99
           +    E    E+ E +  +D E + +  +E+E+   + E+ ++E  D   KE   K   +
Sbjct: 58  ETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTESN 117

Query: 100 DEEE 103
            E+E
Sbjct: 118 VEKE 121



 Score = 25.1 bits (55), Expect = 8.0
 Identities = 16/62 (25%), Positives = 32/62 (51%)

Query: 45  EAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEEE 104
           +A   E + +   D    E EE++EEE +  + ED ED+ +  +  E   ++ + ++EE 
Sbjct: 43  QAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEES 102

Query: 105 GE 106
            +
Sbjct: 103 SD 104


>gnl|CDD|132741 cd06943, NR_LBD_RXR_like, The ligand binding domain of the retinoid
           X receptor and Ultraspiracle, members of nuclear
           receptor superfamily.  The ligand binding domain of the
           retinoid X receptor (RXR) and Ultraspiracle (USP): This
           family includes two evolutionary related nuclear
           receptors: retinoid X receptor (RXR) and Ultraspiracle
           (USP). RXR is a nuclear receptor in mammalian and USP is
           its counterpart in invertebrates.  The native ligand of
           retinoid X receptor is 9-cis retinoic acid (RA). RXR
           functions as a DNA binding partner by forming
           heterodimers with other nuclear receptors including CAR,
           FXR, LXR, PPAR, PXR, RAR, TR, and VDR. RXRs can play
           different roles in these heterodimers. It acts  either
           as a structural component of the heterodimer complex,
           required for DNA binding but not acting as a receptor or
           as both a structural and a functional component of the
           heterodimer, allowing 9-cis RA to signal through the
           corresponding heterodimer. In addition, RXR can also
           form homodimers, functioning as a receptor for 9-cis RA,
           independently of other nuclear receptors. Ultraspiracle
           (USP) plays similar roles as DNA binding partner of
           other nuclear rec eptors in invertebrates. USP has no
           known high-affinity ligand and is thought to be a silent
           component in the heterodimeric complex with partner
           receptors. Like other members of the nuclear receptor
           (NR) superfamily of ligand-activated transcription
           factors, RXR and USP  have a central well conserved DNA
           binding domain (DBD), a variable N-terminal domain, a
           flexible hinge and a C-terminal ligand binding domain
           (LBD).
          Length = 207

 Score = 26.5 bits (59), Expect = 2.6
 Identities = 12/31 (38%), Positives = 17/31 (54%), Gaps = 6/31 (19%)

Query: 6   LKHVDLFNNDVTT------IENYREKVFELL 30
           L+ + LFN DV        +E+ REKV+  L
Sbjct: 137 LRAIILFNPDVKGLKSRQEVESLREKVYASL 167


>gnl|CDD|218555 pfam05320, Pox_RNA_Pol_19, Poxvirus DNA-directed RNA polymerase
          19 kDa subunit.  This family contains several
          DNA-directed RNA polymerase 19 kDa polypeptides. The
          Poxvirus DNA-directed RNA polymerase (EC: 2.7.7.6)
          catalyzes DNA-template-directed extension of the 3'-end
          of an RNA strand by one nucleotide at a time.
          Length = 167

 Score = 26.2 bits (58), Expect = 2.7
 Identities = 13/35 (37%), Positives = 21/35 (60%)

Query: 52 DEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEED 86
          ++  +  DYE +D++ EE E++  DEED E  E  
Sbjct: 2  EDSDDIIDYESDDDDSEEYEEEEEDEEDAESLESS 36


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 26.5 bits (59), Expect = 2.7
 Identities = 9/32 (28%), Positives = 17/32 (53%)

Query: 47   PDSEDDEESEGDDYEGEDEELEEEEDDGLDEE 78
               + D  SE DD    D+  +E+++D  D++
Sbjct: 1357 RKKKSDSSSEDDDDSEVDDSEDEDDEDDEDDD 1388


>gnl|CDD|220777 pfam10486, PI3K_1B_p101, Phosphoinositide 3-kinase gamma adapter
           protein p101 subunit.  Class I PI3Ks are dual-specific
           lipid and protein kinases involved in numerous
           intracellular signaling pathways. Class IB PI3K,
           p110gamma, is mainly activated by seven-transmembrane
           G-protein-coupled receptors (GPCRs), through its
           regulatory subunit p101 and G-protein beta-gamma
           subunits.
          Length = 856

 Score = 26.9 bits (59), Expect = 2.7
 Identities = 13/32 (40%), Positives = 19/32 (59%)

Query: 47  PDSEDDEESEGDDYEGEDEELEEEEDDGLDEE 78
            +S+  E    +D E E+EE E++E DGL  E
Sbjct: 303 KESQLLEPVLSEDEEVEEEEEEDDETDGLSPE 334



 Score = 26.5 bits (58), Expect = 3.3
 Identities = 14/26 (53%), Positives = 18/26 (69%), Gaps = 1/26 (3%)

Query: 62  GEDEELEEEEDDGLDEEDGEDEEEDT 87
            EDEE+EEEE++  DE DG   E D+
Sbjct: 313 SEDEEVEEEEEED-DETDGLSPERDS 337


>gnl|CDD|218734 pfam05758, Ycf1, Ycf1.  The chloroplast genomes of most higher
           plants contain two giant open reading frames designated
           ycf1 and ycf2. Although the function of Ycf1 is unknown,
           it is known to be an essential gene.
          Length = 832

 Score = 26.5 bits (59), Expect = 3.0
 Identities = 19/61 (31%), Positives = 28/61 (45%), Gaps = 1/61 (1%)

Query: 42  KNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDE 101
           +  E  D E +  SE    + E E    EED  L  E+ ED ++  +  ++   K K DE
Sbjct: 234 EREEETDVEIETTSETKGTKQEQEG-STEEDPSLFSEEKEDPDKTEDLDKLEILKEKKDE 292

Query: 102 E 102
           E
Sbjct: 293 E 293



 Score = 26.1 bits (58), Expect = 4.8
 Identities = 17/70 (24%), Positives = 29/70 (41%), Gaps = 4/70 (5%)

Query: 39  FDAKNNEAPDSEDDEESEGDDYEGEDE----ELEEEEDDGLDEEDGEDEEEDTEGKEVRG 94
           F  K  E  ++E+ EE    + E   E    + E+E     D     +E+ED +  E   
Sbjct: 222 FTKKLKETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLD 281

Query: 95  KKRKHDEEEE 104
           K     E+++
Sbjct: 282 KLEILKEKKD 291


>gnl|CDD|221855 pfam12921, ATP13, Mitochondrial ATPase expression.  ATP13 is
          necessary for the expression of subunit 9 of
          mitochondrial ATPase. The protein has a basic amino
          terminal signal sequence that is cleaved upon import
          into mitochondria.
          Length = 125

 Score = 26.1 bits (58), Expect = 3.1
 Identities = 8/39 (20%), Positives = 15/39 (38%)

Query: 12 FNNDVTTIENYREKVFELLPNLKYLDGFDAKNNEAPDSE 50
          +   +  +E Y E V+ +    K  +  D  +   P S 
Sbjct: 14 YEGQLELLEKYIETVWGIDSQGKSKERVDKGSPLYPSSR 52


>gnl|CDD|165442 PHA03171, PHA03171, UL37 tegument protein; Provisional.
          Length = 499

 Score = 26.6 bits (58), Expect = 3.2
 Identities = 10/31 (32%), Positives = 16/31 (51%)

Query: 41  AKNNEAPDSEDDEESEGDDYEGEDEELEEEE 71
           A+  E  + ED++    D    E++E EE E
Sbjct: 80  AEAGEEAEEEDNDRECPDTEAEEEDEEEEIE 110



 Score = 26.2 bits (57), Expect = 3.5
 Identities = 19/49 (38%), Positives = 26/49 (53%), Gaps = 3/49 (6%)

Query: 61  EGEDEELEEEED--DGLDEEDGED-EEEDTEGKEVRGKKRKHDEEEEGE 106
           +GE+ E E   D    L  E GE+ EEED + +    +  + DEEEE E
Sbjct: 62  DGENSETERPRDLTAALFAEAGEEAEEEDNDRECPDTEAEEEDEEEEIE 110



 Score = 25.0 bits (54), Expect = 9.2
 Identities = 15/40 (37%), Positives = 21/40 (52%), Gaps = 4/40 (10%)

Query: 50  EDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEG 89
           E+D + E  D E E+E+ EEE    ++  D E    D EG
Sbjct: 88  EEDNDRECPDTEAEEEDEEEE----IEAPDPEVNPLDAEG 123


>gnl|CDD|237063 PRK12329, nusA, transcription elongation factor NusA; Provisional.
          Length = 449

 Score = 26.3 bits (58), Expect = 3.3
 Identities = 14/44 (31%), Positives = 24/44 (54%)

Query: 48  DSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKE 91
           ++ED + +E      E+E L+ E ++ L+ E  E  EED   +E
Sbjct: 379 EAEDAKVAELISQREEEEALQREAEERLEAEQAERAEEDARLRE 422



 Score = 25.9 bits (57), Expect = 5.1
 Identities = 14/44 (31%), Positives = 21/44 (47%)

Query: 45  EAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTE 88
           E  ++E  E +E D    E   L E+E +  DE +    EE+ E
Sbjct: 404 ERLEAEQAERAEEDARLRELYPLPEDEFEDEDELEEAQPEEEEE 447



 Score = 25.5 bits (56), Expect = 6.0
 Identities = 14/53 (26%), Positives = 25/53 (47%)

Query: 39  FDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKE 91
            + +     ++E+  E+E  +   ED  L E      DE + EDE E+ + +E
Sbjct: 392 REEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDELEEAQPEE 444


>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97 family.
            This model family represents the major capsid protein
           component of the heads (capsids) of bacteriophage HK97,
           phi-105, P27, and related phage. This model represents
           one of several analogous families lacking detectable
           sequence similarity. The gene encoding this component is
           typically located in an operon encoding the small and
           large terminase subunits, the portal protein and the
           prohead or maturation protease [Mobile and
           extrachromosomal element functions, Prophage functions].
          Length = 384

 Score = 26.2 bits (58), Expect = 3.4
 Identities = 16/70 (22%), Positives = 25/70 (35%)

Query: 37  DGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKK 96
           +       E     +DE+    + E  + +  +EE D LD E    EE   E +      
Sbjct: 6   EALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAAS 65

Query: 97  RKHDEEEEGE 106
            +    EE E
Sbjct: 66  GEGGGGEEEE 75


>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62. 
          Length = 217

 Score = 26.3 bits (58), Expect = 3.4
 Identities = 12/69 (17%), Positives = 30/69 (43%)

Query: 25 KVFELLPNLKYLDGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEE 84
          +V   L + KY    D  N E  +  + ++   + ++   +    E    L  ++ ++E+
Sbjct: 11 RVVRALESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEK 70

Query: 85 EDTEGKEVR 93
          +  + K+V 
Sbjct: 71 KKPKKKKVP 79


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 26.5 bits (58), Expect = 3.4
 Identities = 15/52 (28%), Positives = 21/52 (40%)

Query: 51  DDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEE 102
           D   S  D  E ED+   E       E+D + EE + E  E  G   K  ++
Sbjct: 289 DSSASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKK 340



 Score = 25.3 bits (55), Expect = 7.6
 Identities = 13/51 (25%), Positives = 20/51 (39%)

Query: 48  DSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRK 98
           DS        +  +    E+  + +   DE+  E EEE  E +    KK K
Sbjct: 289 DSSASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGK 339


>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
           [Intracellular trafficking and secretion].
          Length = 776

 Score = 26.5 bits (58), Expect = 3.5
 Identities = 10/46 (21%), Positives = 20/46 (43%)

Query: 54  ESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKH 99
           E+   +    +EE+E+ ED+    +   D++E         +K  H
Sbjct: 376 ENRNPEESEHEEEVEDYEDENDHSKRICDDDELENHFRAADEKNSH 421


>gnl|CDD|217049 pfam02459, Adeno_terminal, Adenoviral DNA terminal protein.  This
           protein is covalently attached to the terminii of
           replicating DNA in vivo.
          Length = 548

 Score = 26.1 bits (58), Expect = 3.8
 Identities = 18/33 (54%), Positives = 22/33 (66%), Gaps = 1/33 (3%)

Query: 61  EGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVR 93
           E E+EE EEEE    +EE  E+EEE T  +EVR
Sbjct: 304 EPEEEEEEEEEVPEEEEE-EEEEEERTFEEEVR 335



 Score = 25.4 bits (56), Expect = 8.4
 Identities = 14/33 (42%), Positives = 20/33 (60%)

Query: 45  EAPDSEDDEESEGDDYEGEDEELEEEEDDGLDE 77
             P+ E++EE E +  E E+EE EEEE    +E
Sbjct: 301 SPPEPEEEEEEEEEVPEEEEEEEEEEERTFEEE 333



 Score = 25.0 bits (55), Expect = 9.6
 Identities = 14/38 (36%), Positives = 20/38 (52%), Gaps = 2/38 (5%)

Query: 46  APDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDE 83
            P S  + E E ++ E   EE EEEE++   E   E+E
Sbjct: 298 PPPSPPEPEEEEEEEEEVPEEEEEEEEEE--ERTFEEE 333


>gnl|CDD|218379 pfam05009, EBV-NA3, Epstein-Barr virus nuclear antigen 3 (EBNA-3). 
           This family contains EBNA-3A, -3B, and -3C which are
           latent infection nuclear proteins important for
           Epstein-Barr virus (EBV)-induced B-cell immortalisation
           and the immune response to EBV infection.
          Length = 254

 Score = 26.2 bits (58), Expect = 3.9
 Identities = 9/19 (47%), Positives = 11/19 (57%)

Query: 49  SEDDEESEGDDYEGEDEEL 67
            E   ESE  + E +DEEL
Sbjct: 216 VETSSESEDSESESDDEEL 234


>gnl|CDD|240433 PTZ00482, PTZ00482, membrane-attack complex/perforin (MACPF)
           Superfamily; Provisional.
          Length = 844

 Score = 26.0 bits (57), Expect = 4.2
 Identities = 6/52 (11%), Positives = 21/52 (40%)

Query: 40  DAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKE 91
           D  ++E     +D+E +  +    +   +++    L + D + + +    + 
Sbjct: 88  DDDDDEFDFLYEDDEDDAGNATSGESSTDDDSLLELPDRDEDADTQANNDQT 139



 Score = 25.2 bits (55), Expect = 9.0
 Identities = 12/67 (17%), Positives = 25/67 (37%), Gaps = 2/67 (2%)

Query: 40  DAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKH 99
           DA      D  +D + +       D+ L++  +    E+  E+++  TE           
Sbjct: 129 DADTQANNDQTNDFDQDDSSNSQTDQGLKQSVNLSSAEKLIEEKKGQTENT--FKFYNFG 186

Query: 100 DEEEEGE 106
           ++ EE  
Sbjct: 187 NDGEEAA 193


>gnl|CDD|235279 PRK04319, PRK04319, acetyl-CoA synthetase; Provisional.
          Length = 570

 Score = 26.0 bits (58), Expect = 4.5
 Identities = 15/69 (21%), Positives = 26/69 (37%), Gaps = 15/69 (21%)

Query: 16  VTTIENYREKVFELLPNLKYLDGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDD-- 73
           +TT      K  + LP+LK++               ++  EG      +  +E+  D+  
Sbjct: 150 ITTPALLERKPADDLPSLKHVLLV-----------GEDVEEGPGTLDFNALMEQASDEFD 198

Query: 74  --GLDEEDG 80
               D EDG
Sbjct: 199 IEWTDREDG 207


>gnl|CDD|235548 PRK05657, PRK05657, RNA polymerase sigma factor RpoS; Validated.
          Length = 325

 Score = 25.7 bits (57), Expect = 4.7
 Identities = 6/40 (15%), Positives = 16/40 (40%)

Query: 49 SEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTE 88
          +    +   D    E+     +E   ++EE  +++  + E
Sbjct: 2  TLKVHDLNEDADFDENGVEVFDEKALVEEEPSDNDLAEEE 41



 Score = 25.3 bits (56), Expect = 7.1
 Identities = 7/47 (14%), Positives = 21/47 (44%)

Query: 44 NEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGK 90
                + +E+++ D+   E  + +   ++   + D  +EE  ++G 
Sbjct: 1  MTLKVHDLNEDADFDENGVEVFDEKALVEEEPSDNDLAEEELLSQGA 47


>gnl|CDD|222927 PHA02774, PHA02774, E1; Provisional.
          Length = 613

 Score = 26.0 bits (58), Expect = 4.9
 Identities = 16/62 (25%), Positives = 27/62 (43%), Gaps = 15/62 (24%)

Query: 52 DEESEGDDYEGEDEELEEEE-------DDGLDEEDGED-------EEEDTEGKEVRGKKR 97
          + E+E  D E E E+L +         DD    + G         E E+ E ++++  KR
Sbjct: 20 EAEAECSDGEDELEDLFDTGSDISDLIDDAEVVQGGNSLALFHQQEAEEDE-QQIQALKR 78

Query: 98 KH 99
          K+
Sbjct: 79 KY 80



 Score = 25.2 bits (56), Expect = 7.7
 Identities = 11/39 (28%), Positives = 16/39 (41%), Gaps = 2/39 (5%)

Query: 52 DEESEGDDYEGEDEE--LEEEEDDGLDEEDGEDEEEDTE 88
            + +G D E       L E E +  D ED  ++  DT 
Sbjct: 1  MADDKGTDPEEGGSGWFLVEAEAECSDGEDELEDLFDTG 39


>gnl|CDD|237549 PRK13897, PRK13897, type IV secretion system component VirD4;
           Provisional.
          Length = 606

 Score = 25.9 bits (57), Expect = 5.0
 Identities = 12/35 (34%), Positives = 19/35 (54%)

Query: 39  FDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDD 73
           +D + N    SE+ E     +   EDEE ++E+DD
Sbjct: 572 YDPRANNNEASEETETITVPENNEEDEEDDDEDDD 606


>gnl|CDD|185764 cd09241, BRO1_ScRim20-like, Protein-interacting, N-terminal,
          Bro1-like domain of Saccharomyces cerevisiae Rim20 and
          related proteins.  This family contains the N-terminal,
          Bro1-like domain of Saccharomyces cerevisiae Rim20
          (also known as PalA) and related proteins. It belongs
          to the BRO1_Alix_like superfamily which also includes
          the Bro1-like domains of mammalian Alix
          (apoptosis-linked gene-2 interacting protein X),
          His-Domain type N23 protein tyrosine phosphatase
          (HD-PTP, also known as PTPN23), RhoA-binding proteins
          Rhophilin-1 and -2, Brox, Saccharomyces cerevisiae
          Bro1, Ustilago maydis Rim23 (also known as PalC), and
          related domains. Alix, HD-PTP, Brox, Bro1, Rim20, and
          Rim23, interact with the ESCRT (Endosomal Sorting
          Complexes Required for Transport) system. Rim20 and
          Rim23 participate in the response to the external pH
          via the Rim101 pathway. Bro1-like domains are
          boomerang-shaped, and part of the domain is a
          tetratricopeptide repeat (TPR)-like structure.
          Bro1-like domains bind components of the ESCRT-III
          complex: Snf7 in the case of Rim20. RIM20, and some
          other members of the BRO1_Alix_like  superfamily
          including Alix, also have a V-shaped (V) domain. In the
          case of Alix, the V-domain is a dimerization domain
          that also contains a binding site for the retroviral
          late assembly (L) domain YPXnL motif, which is
          partially conserved in the V-domain superfamily. Rim20
          localizes to endosomes under alkaline pH conditions. By
          binding Snf7, it may bring the protease Rim13 (a
          YPxL-containing transcription factor) into proximity
          with Rim101, and thus aid in the proteolytic activation
          of the latter. Rim20 and other intermediates in the
          Rim101 pathway play roles in the pathogenesis of fungal
          corneal infection during Candida albicans keratitis.
          Length = 355

 Score = 25.7 bits (57), Expect = 5.0
 Identities = 6/32 (18%), Positives = 12/32 (37%)

Query: 6  LKHVDLFNNDVTTIENYREKVFELLPNLKYLD 37
           +    F +D+  I+  R       P++  L 
Sbjct: 28 FQTPSSFEDDLAEIDKLRNDAINPEPSVNGLS 59


>gnl|CDD|147601 pfam05505, Ebola_NP, Ebola nucleoprotein.  This family consists of
           Ebola and Marburg virus nucleoproteins. These proteins
           are responsible for encapsidation of genomic RNA. It has
           been found that nucleoprotein DNA vaccines can offer
           protection from the virus.
          Length = 717

 Score = 26.1 bits (57), Expect = 5.1
 Identities = 13/33 (39%), Positives = 18/33 (54%)

Query: 50  EDDEESEGDDYEGEDEELEEEEDDGLDEEDGED 82
           ED+ E + DD +     LE ++D G D E G D
Sbjct: 547 EDEPEDQTDDDDSSLPPLESDDDPGSDNEQGVD 579


>gnl|CDD|131585 TIGR02533, type_II_gspE, type II secretion system protein E.  This
           family describes GspE, the E protein of the type II
           secretion system, also called the main terminal branch
           of the general secretion pathway. This model separates
           GspE from the PilB protein of type IV pilin biosynthesis
           [Protein fate, Protein and peptide secretion and
           trafficking, Cellular processes, Pathogenesis].
          Length = 486

 Score = 25.8 bits (57), Expect = 5.3
 Identities = 12/28 (42%), Positives = 15/28 (53%)

Query: 53  EESEGDDYEGEDEELEEEEDDGLDEEDG 80
           EE EG+D     E  E + +D LD ED 
Sbjct: 78  EEIEGEDDLSALELDEPKIEDLLDLEDD 105


>gnl|CDD|227516 COG5189, SFP1, Putative transcriptional repressor regulating G2/M
           transition [Transcription / Cell division and chromosome
           partitioning].
          Length = 423

 Score = 25.8 bits (56), Expect = 5.5
 Identities = 18/97 (18%), Positives = 34/97 (35%), Gaps = 11/97 (11%)

Query: 6   LKHVDLFNNDVTTIENYREKVFELLPNLKYLDGFDAKNNEAPDSEDDEESEGDDYEGEDE 65
           L+ VDL ++D+    N       + P+    + F+ +++   D E   +S G+       
Sbjct: 247 LEDVDLMDDDILGPSNEEMLYKYISPSQGSAELFE-ESSLGFDYEFIHKSVGNK------ 299

Query: 66  ELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEE 102
               E   G+   +  D  +          K  H  E
Sbjct: 300 ----EIRGGISTGEMIDVRKLPCTNSSSNGKLAHGGE 332


>gnl|CDD|220785 pfam10498, IFT57, Intra-flagellar transport protein 57.  Eukaryotic
           cilia and flagella are specialised organelles found at
           the periphery of cells of diverse organisms.
           Intra-flagellar transport (IFT) is required for the
           assembly and maintenance of eukaryotic cilia and
           flagella, and consists of the bidirectional movement of
           large protein particles between the base and the distal
           tip of the organelle. IFT particles contain multiple
           copies of two distinct protein complexes, A and B, which
           contain at least 6 and 11 protein subunits. IFT57 is
           part of complex B but is not, however, required for the
           core subunits to stay associated. This protein is known
           as Huntington-interacting protein-1 in humans.
          Length = 355

 Score = 25.8 bits (57), Expect = 5.7
 Identities = 13/42 (30%), Positives = 24/42 (57%)

Query: 50  EDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKE 91
           E+DEE   D+ + E    E EE+  ++E D ++  ++T+ K 
Sbjct: 129 EEDEEENVDEDDAEIILEEVEEEVEIEEVDDDEGTQETKYKR 170



 Score = 25.4 bits (56), Expect = 6.7
 Identities = 14/60 (23%), Positives = 31/60 (51%)

Query: 45  EAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEEE 104
           + P   ++E+ E +  E + E + EE ++ ++ E+ +D+E   E K  RG      + ++
Sbjct: 122 KRPKYPNEEDEEENVDEDDAEIILEEVEEEVEIEEVDDDEGTQETKYKRGDTSLTPQAKD 181


>gnl|CDD|222997 PHA03132, PHA03132, thymidine kinase; Provisional.
          Length = 580

 Score = 25.9 bits (57), Expect = 5.7
 Identities = 10/41 (24%), Positives = 15/41 (36%), Gaps = 1/41 (2%)

Query: 48 DSEDDEESEGDDYEGEDEE-LEEEEDDGLDEEDGEDEEEDT 87
              DE  EG   E  D E  +     G   E   ++++D 
Sbjct: 14 YDSSDESPEGSRDENFDAERDDFLTPLGSTSEATSEDDDDL 54


>gnl|CDD|235052 PRK02597, rpoC2, DNA-directed RNA polymerase subunit beta';
            Provisional.
          Length = 1331

 Score = 25.7 bits (57), Expect = 5.7
 Identities = 10/53 (18%), Positives = 21/53 (39%)

Query: 27   FELLPNLKYLDGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEED 79
             E   +   +D   +        + D+E   DD   +   LE  +++GL  ++
Sbjct: 1279 LEATRSRHGIDPSSSNFAAFARPDADDELLIDDQLPDPAALEGLQEEGLLSDE 1331


>gnl|CDD|216652 pfam01698, FLO_LFY, Floricaula / Leafy protein.  This family
           consists of various plant development proteins which are
           homologues of floricaula (FLO) and Leafy (LFY) proteins
           which are floral meristem identity proteins. Mutations
           in the sequences of these proteins affect flower and
           leaf development.
          Length = 382

 Score = 25.7 bits (57), Expect = 6.0
 Identities = 15/42 (35%), Positives = 24/42 (57%), Gaps = 6/42 (14%)

Query: 61  EGEDEELEEEEDDGLDEEDGEDEEED-----TEGKEV-RGKK 96
           E +D++ +E++D     E GE+ + +     TE  EV RGKK
Sbjct: 200 EDDDDDEDEDDDGEGGGEGGEERQREHPFIVTEPGEVARGKK 241


>gnl|CDD|201520 pfam00945, Rhabdo_ncap, Rhabdovirus nucleocapsid protein.  The
           Nucleocapsid (N) Protein is said to have a "tight"
           structure. The carboxyl end of the N-terminal domain
           possesses an RNA binding domain. Sequence alignments
           show 2 regions of reasonable conservation, approx.
           64-103 and 201-329. A whole functional protein is
           required for encapsidation to take place.
          Length = 406

 Score = 25.4 bits (56), Expect = 6.7
 Identities = 7/38 (18%), Positives = 17/38 (44%), Gaps = 3/38 (7%)

Query: 60  YEGEDEELEEEEDDGLDEEDGEDE---EEDTEGKEVRG 94
           +  E +  E  +DDG + +    +   +E  + ++V  
Sbjct: 341 FRDEKKYTEYRDDDGDNTDAPLADDGDDEAPKSRDVVE 378


>gnl|CDD|184899 PRK14906, PRK14906, DNA-directed RNA polymerase subunit beta'/alpha
           domain fusion protein; Provisional.
          Length = 1460

 Score = 25.6 bits (56), Expect = 6.9
 Identities = 10/42 (23%), Positives = 18/42 (42%)

Query: 52  DEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVR 93
           D   E D++  +  + E    + +  E  +  EE  E K +R
Sbjct: 181 DYVPEDDEFVDDIGDDERLTAEEVRAEVADIYEEYNERKALR 222


>gnl|CDD|227492 COG5163, NOP7, Protein required for biogenesis of the 60S ribosomal
           subunit [Translation, ribosomal structure and
           biogenesis].
          Length = 591

 Score = 25.4 bits (55), Expect = 7.0
 Identities = 19/58 (32%), Positives = 32/58 (55%), Gaps = 3/58 (5%)

Query: 50  EDDEESEGDDYEGEDEELEEEEDDGLDEED---GEDEEEDTEGKEVRGKKRKHDEEEE 104
              E   G++ + +DEEL+ +++  L+ +     E  E D +  + + KKRK DEEEE
Sbjct: 483 VRYEHVAGEEDDDDDEELQAQKELELEAQGIKYSETSEADKDVNKSKNKKRKVDEEEE 540


>gnl|CDD|237084 PRK12378, PRK12378, hypothetical protein; Provisional.
          Length = 235

 Score = 25.1 bits (56), Expect = 7.2
 Identities = 12/25 (48%), Positives = 15/25 (60%)

Query: 61  EGEDEELEEEEDDGLDEEDGEDEEE 85
           + EDE LE   D  +D ED E+EE 
Sbjct: 142 DDEDELLEALIDADVDVEDVEEEEG 166


>gnl|CDD|215153 PLN02271, PLN02271, serine hydroxymethyltransferase.
          Length = 586

 Score = 25.5 bits (56), Expect = 7.4
 Identities = 14/39 (35%), Positives = 19/39 (48%)

Query: 63  EDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDE 101
           E EE   EE D  +EE GEDE     G  +  K+ +  +
Sbjct: 57  EKEEDAGEEGDEDEEEQGEDEHFSILGHPMCLKRPRDGD 95


>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein.  This
           family includes the radial spoke head proteins RSP4 and
           RSP6 from Chlamydomonas reinhardtii, and several
           eukaryotic homologues, including mammalian RSHL1, the
           protein product of a familial ciliary dyskinesia
           candidate gene.
          Length = 481

 Score = 25.4 bits (56), Expect = 8.0
 Identities = 10/27 (37%), Positives = 17/27 (62%)

Query: 60  YEGEDEELEEEEDDGLDEEDGEDEEED 86
              E ++ EEE++D  +EE+ E+ EE 
Sbjct: 342 VNPEQKDEEEEQEDEEEEEEEEEPEEP 368



 Score = 25.0 bits (55), Expect = 8.7
 Identities = 11/25 (44%), Positives = 17/25 (68%)

Query: 61  EGEDEELEEEEDDGLDEEDGEDEEE 85
           + E+EE E+EE++  +EE  E E E
Sbjct: 347 KDEEEEQEDEEEEEEEEEPEEPEPE 371


>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
          Length = 1832

 Score = 25.4 bits (56), Expect = 8.6
 Identities = 9/27 (33%), Positives = 16/27 (59%)

Query: 73  DGLDEEDGEDEEEDTEGKEVRGKKRKH 99
           D L++ +GE+EE      ++R +K K 
Sbjct: 606 DELEQLEGENEENVPSDHQIRSRKAKR 632


>gnl|CDD|234402 TIGR03928, T7_EssCb_Firm, type VII secretion protein EssC,
           C-terminal domain.  This model describes the C-terminal
           domain, or longer subunit, of the Firmicutes type VII
           secretion protein EssC. This protein (homologous to EccC
           in Actinobacteria) and the WXG100 target proteins are
           the only homologous parts of type VII secretion between
           Firmicutes and Actinobacteria [Protein fate, Protein and
           peptide secretion and trafficking].
          Length = 1296

 Score = 25.3 bits (56), Expect = 8.7
 Identities = 12/47 (25%), Positives = 18/47 (38%), Gaps = 3/47 (6%)

Query: 11  LFNNDVTTIENYREKVFELLPN-LKYLDGFDAKNNEAPDSEDDEESE 56
                V +I  Y +   E LP  +  +D +DA   E     +D E  
Sbjct: 894 FSEYGVASISMYNKASGEKLPQIVIIIDNYDAVKEEP--FYEDFEEL 938


>gnl|CDD|227880 COG5593, COG5593, Nucleic-acid-binding protein possibly involved in
           ribosomal biogenesis [Translation, ribosomal structure
           and biogenesis].
          Length = 821

 Score = 25.0 bits (54), Expect = 9.2
 Identities = 23/76 (30%), Positives = 36/76 (47%), Gaps = 9/76 (11%)

Query: 31  PNLKYLDGFDAKNNEAPDSEDDEESEGDDYEGEDEEL--------EEEEDDGLDEEDGED 82
           P L  +D  DAK+  + +S D EE   + +   D E           EED+  +EE  E+
Sbjct: 726 PKLDAIDDEDAKSEGSQES-DQEEGLDEIFYSFDGEQDNSDSFAESSEEDESSEEEKEEE 784

Query: 83  EEEDTEGKEVRGKKRK 98
           E ++   K  + K+RK
Sbjct: 785 ENKEVSAKRAKKKQRK 800


>gnl|CDD|239286 cd02988, Phd_like_VIAF, Phosducin (Phd)-like family, Viral
          inhibitor of apoptosis (IAP)-associated factor (VIAF)
          subfamily; VIAF is a Phd-like protein that functions in
          caspase activation during apoptosis. It was identified
          as an IAP binding protein through a screen of a human
          B-cell library using a prototype IAP. VIAF lacks a
          consensus IAP binding motif and while it does not
          function as an IAP antagonist, it still plays a
          regulatory role in the complete activation of caspases.
          VIAF itself is a substrate for IAP-mediated
          ubiquitination, suggesting that it may be a target of
          IAPs in the prevention of cell death. The similarity of
          VIAF to Phd points to a potential role distinct from
          apoptosis regulation. Phd functions as a cytosolic
          regulator of G protein by specifically binding to G
          protein betagamma (Gbg)-subunits. The C-terminal domain
          of Phd adopts a thioredoxin fold, but it does not
          contain a CXXC motif. Phd interacts with G protein beta
          mostly through the N-terminal helical domain.
          Length = 192

 Score = 24.9 bits (55), Expect = 9.5
 Identities = 18/52 (34%), Positives = 29/52 (55%), Gaps = 1/52 (1%)

Query: 45 EAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKK 96
           +P  E++E  E    E  +  LE++  D LDEE  ++EE+D   +E R K+
Sbjct: 18 PSPKEEEEEALELAIQEAHENALEKKLLDELDEEL-DEEEDDRFLEEYRRKR 68


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 25.1 bits (55), Expect = 9.7
 Identities = 17/54 (31%), Positives = 26/54 (48%), Gaps = 2/54 (3%)

Query: 45  EAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRK 98
               S  DE SEG   E +D+  EE +D+     +G + E +   K VR K++ 
Sbjct: 228 APEASRLDEMSEGLLEESDDDGEEESDDE--SAWEGFESEYEPINKPVRPKRKT 279


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.297    0.129    0.345 

Gapped
Lambda     K      H
   0.267   0.0677    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 5,840,208
Number of extensions: 561981
Number of successful extensions: 5357
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2991
Number of HSP's successfully gapped: 1108
Length of query: 106
Length of database: 10,937,602
Length adjustment: 71
Effective length of query: 35
Effective length of database: 7,788,468
Effective search space: 272596380
Effective search space used: 272596380
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 17 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 44 (21.8 bits)
S2: 53 (24.3 bits)