RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy5944
         (309 letters)



>gnl|CDD|215999 pfam00573, Ribosomal_L4, Ribosomal protein L4/L1 family.  This
           family includes Ribosomal L4/L1 from eukaryotes and
           archaebacteria and L4 from eubacteria. L4 from yeast has
           been shown to bind rRNA.
          Length = 190

 Score = 57.9 bits (141), Expect = 4e-10
 Identities = 25/100 (25%), Positives = 46/100 (46%), Gaps = 5/100 (5%)

Query: 191 GLTSTLSIKLMQDDLHIVKDLVLPVDDPKYLENLVEE-RCWGPSVLFVDNTDYMERNISI 249
            L S LS K  +  L +V D  L +   K    L++        VL V +    + N+ +
Sbjct: 93  ALRSALSAKAREGKLVVVDDFELEIPKTKDAVKLLKNLGLKNKKVLIVVDEK--DENLYL 150

Query: 250 VTDQIKHYNIMPAYGEN--NMLKHDTLVITVDALKQIEDK 287
               +   +++   G N  ++L  D +VIT  AL+++E++
Sbjct: 151 SARNLPGVDVVTVDGLNVYDLLPADKVVITKSALEKLEER 190



 Score = 50.2 bits (121), Expect = 2e-07
 Identities = 19/53 (35%), Positives = 26/53 (49%)

Query: 72  VNLHPDIFGLVPRLDVITMNIEWQKKYKWVRFDHTKVTNEVERTGRKPWPQKG 124
           V L P +FG   R D++   +  Q   +      TK  +EV   G+KPW QKG
Sbjct: 1   VELPPAVFGAEIRPDLLHRAVVAQLANRRQGTASTKTRSEVSGGGKKPWRQKG 53


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 57.3 bits (138), Expect = 4e-09
 Identities = 26/29 (89%), Positives = 27/29 (93%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           G  EE+EEEEEEEEEEEEEEEEEEEEEEE
Sbjct: 861 GDSEEEEEEEEEEEEEEEEEEEEEEEEEE 889



 Score = 55.4 bits (133), Expect = 2e-08
 Identities = 26/39 (66%), Positives = 27/39 (69%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPCGL 178
             G + EEEEEEEEEEEEEEEEEEEEEEE      P  L
Sbjct: 858 SDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEEPLSL 896



 Score = 50.0 bits (119), Expect = 9e-07
 Identities = 22/29 (75%), Positives = 24/29 (82%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           G  +  + EEEEEEEEEEEEEEEEEEEEE
Sbjct: 856 GGSDGGDSEEEEEEEEEEEEEEEEEEEEE 884



 Score = 47.3 bits (112), Expect = 6e-06
 Identities = 22/29 (75%), Positives = 24/29 (82%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           G G +  + EEEEEEEEEEEEEEEEEEEE
Sbjct: 855 GGGSDGGDSEEEEEEEEEEEEEEEEEEEE 883



 Score = 45.4 bits (107), Expect = 3e-05
 Identities = 21/29 (72%), Positives = 22/29 (75%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           G G     + EEEEEEEEEEEEEEEEEEE
Sbjct: 854 GGGGSDGGDSEEEEEEEEEEEEEEEEEEE 882



 Score = 38.1 bits (88), Expect = 0.006
 Identities = 16/28 (57%), Positives = 19/28 (67%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           KG +     +  + EEEEEEEEEEEEEE
Sbjct: 850 KGVDGGGGSDGGDSEEEEEEEEEEEEEE 877



 Score = 31.9 bits (72), Expect = 0.52
 Identities = 12/30 (40%), Positives = 17/30 (56%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
             + E+  +     +  + EEEEEEEEEEE
Sbjct: 845 AKQDEKGVDGGGGSDGGDSEEEEEEEEEEE 874



 Score = 28.4 bits (63), Expect = 6.5
 Identities = 12/29 (41%), Positives = 16/29 (55%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
            KGE + E E   E + E+E E E E +E
Sbjct: 675 TKGENESEGEIPAERKGEQEGEGEIEAKE 703



 Score = 28.0 bits (62), Expect = 8.3
 Identities = 12/28 (42%), Positives = 17/28 (60%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
            +GEE E+E E E E + E E E + +E
Sbjct: 736 EEGEEVEDEGEGEAEGKHEVETEGDRKE 763


>gnl|CDD|234414 TIGR03953, rplD_bact, 50S ribosomal protein L4,
           bacterial/organelle.  Members of this protein family are
           ribosomal protein L4. This model recognizes bacterial
           and most organellar forms, but excludes homologs from
           the eukaryotic cytoplasm and from archaea [Protein
           synthesis, Ribosomal proteins: synthesis and
           modification].
          Length = 188

 Score = 51.7 bits (125), Expect = 5e-08
 Identities = 30/104 (28%), Positives = 53/104 (50%), Gaps = 16/104 (15%)

Query: 191 GLTSTLSIKLMQDDLHIVKDLVLPVDDPK------YLENLVEERCWGPSVLFVDNTDYME 244
            L S LS K  +  L +V D  L  ++PK       L+NL         VL V  TD ++
Sbjct: 93  ALRSALSEKAREGKLVVVDDFDL--EEPKTKDLAKILKNLG----LDKKVLIV--TDEVD 144

Query: 245 RNISIVTDQIKHYNIMPAYGEN--NMLKHDTLVITVDALKQIED 286
            N+ +    + +  ++PA G N  ++L+ D +V+T  A++++E+
Sbjct: 145 ENLYLSARNLPNVKVLPANGLNVYDLLRADKVVLTEAAVEKLEE 188



 Score = 42.9 bits (102), Expect = 5e-05
 Identities = 16/55 (29%), Positives = 28/55 (50%), Gaps = 4/55 (7%)

Query: 72  VNLHPDIFGLVPRLDVITMNIEWQ--KKYKWVRFDHTKVTNEVERTGRKPWPQKG 124
           + L+  +FG+ P   ++   ++ Q   + +      TK  +EV   G+KPW QKG
Sbjct: 1   IELNDAVFGIEPNEHLLHQVVKAQLANRRQGTH--KTKTRSEVSGGGKKPWRQKG 53


>gnl|CDD|235404 PRK05319, rplD, 50S ribosomal protein L4; Provisional.
          Length = 205

 Score = 48.5 bits (117), Expect = 7e-07
 Identities = 33/106 (31%), Positives = 58/106 (54%), Gaps = 17/106 (16%)

Query: 191 GLTSTLSIKLMQDDLHIVKDLVLPVDDPK------YLENLVEERCWGPSVLFVDNTDYME 244
            L S LS K  +  L +V DL L  + PK       L+NL  ++     VL V  TD ++
Sbjct: 108 ALRSALSEKAREGRLVVVDDLSL--EAPKTKELAAKLKNLGLKK-----VLIV--TDEVD 158

Query: 245 RNISIVTDQIKHYNIMPAYGEN--NMLKHDTLVITVDALKQIEDKL 288
            N+ +    + + +++PA G N  ++L+ D +V+T  A+K++E++L
Sbjct: 159 ENLYLSARNLPNVDVLPAQGLNVYDLLRADKVVLTKAAVKKLEERL 204



 Score = 43.2 bits (103), Expect = 5e-05
 Identities = 18/57 (31%), Positives = 26/57 (45%), Gaps = 4/57 (7%)

Query: 70  GFVNLHPDIFGLVPRLDVITMNIEWQ--KKYKWVRFDHTKVTNEVERTGRKPWPQKG 124
           G V L   +FG+ P   ++   +  Q     +      TK  +EV   G+KPW QKG
Sbjct: 14  GEVELSDAVFGVEPNEALLHQVVVAQLANARQGTH--ATKTRSEVSGGGKKPWRQKG 68


>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
          Length = 106

 Score = 45.3 bits (108), Expect = 2e-06
 Identities = 17/28 (60%), Positives = 20/28 (71%)

Query: 146 EEEEEEEEEEEEEEEEEEEEEEEITATL 173
               EE++EEEEEEEE+EE EEE  A L
Sbjct: 74  AAAAEEKKEEEEEEEEKEESEEEAAAGL 101



 Score = 43.0 bits (102), Expect = 2e-05
 Identities = 15/25 (60%), Positives = 18/25 (72%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEE 166
                 EE++EEEEEEEE+EE EEE
Sbjct: 72  AAAAAAEEKKEEEEEEEEKEESEEE 96



 Score = 42.2 bits (100), Expect = 2e-05
 Identities = 15/26 (57%), Positives = 18/26 (69%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEE 167
                  EE++EEEEEEEE+EE EEE
Sbjct: 71  AAAAAAAEEKKEEEEEEEEKEESEEE 96



 Score = 42.2 bits (100), Expect = 2e-05
 Identities = 15/26 (57%), Positives = 18/26 (69%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
               EE++EEEEEEEE+EE EEE   
Sbjct: 74  AAAAEEKKEEEEEEEEKEESEEEAAA 99



 Score = 41.5 bits (98), Expect = 5e-05
 Identities = 14/23 (60%), Positives = 17/23 (73%)

Query: 146 EEEEEEEEEEEEEEEEEEEEEEE 168
                EE++EEEEEEEE+EE EE
Sbjct: 73  AAAAAEEKKEEEEEEEEKEESEE 95



 Score = 40.3 bits (95), Expect = 1e-04
 Identities = 14/25 (56%), Positives = 18/25 (72%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEE 165
              E+++EEEEEEEE+EE EEE   
Sbjct: 75  AAAEEKKEEEEEEEEKEESEEEAAA 99



 Score = 39.2 bits (92), Expect = 3e-04
 Identities = 13/23 (56%), Positives = 16/23 (69%)

Query: 146 EEEEEEEEEEEEEEEEEEEEEEE 168
                 EE++EEEEEEEE+EE E
Sbjct: 72  AAAAAAEEKKEEEEEEEEKEESE 94



 Score = 36.9 bits (86), Expect = 0.002
 Identities = 12/23 (52%), Positives = 15/23 (65%)

Query: 146 EEEEEEEEEEEEEEEEEEEEEEE 168
                  EE++EEEEEEEE+EE 
Sbjct: 71  AAAAAAAEEKKEEEEEEEEKEES 93


>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P.  This model
           represents the L12P protein of the large (50S) subunit
           of the archaeal ribosome.
          Length = 105

 Score = 44.6 bits (106), Expect = 3e-06
 Identities = 21/28 (75%), Positives = 21/28 (75%)

Query: 146 EEEEEEEEEEEEEEEEEEEEEEEITATL 173
              EEEEEEEEEEEEEEEE EEE  A L
Sbjct: 73  AAAEEEEEEEEEEEEEEEESEEEAMAGL 100



 Score = 43.1 bits (102), Expect = 1e-05
 Identities = 19/26 (73%), Positives = 19/26 (73%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEE 167
                 EEEEEEEEEEEEEEEE EEE
Sbjct: 70  AAAAAAEEEEEEEEEEEEEEEESEEE 95



 Score = 41.6 bits (98), Expect = 4e-05
 Identities = 18/23 (78%), Positives = 18/23 (78%)

Query: 146 EEEEEEEEEEEEEEEEEEEEEEE 168
               EEEEEEEEEEEEEEEE EE
Sbjct: 72  AAAAEEEEEEEEEEEEEEEESEE 94



 Score = 40.8 bits (96), Expect = 8e-05
 Identities = 18/24 (75%), Positives = 19/24 (79%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEE 163
               E+EEEEEEEEEEEEEE EEE
Sbjct: 72  AAAAEEEEEEEEEEEEEEEESEEE 95



 Score = 40.8 bits (96), Expect = 8e-05
 Identities = 18/25 (72%), Positives = 19/25 (76%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEE 164
                +EEEEEEEEEEEEEEE EEE
Sbjct: 71  AAAAAEEEEEEEEEEEEEEEESEEE 95



 Score = 39.3 bits (92), Expect = 2e-04
 Identities = 17/23 (73%), Positives = 17/23 (73%)

Query: 146 EEEEEEEEEEEEEEEEEEEEEEE 168
                EEEEEEEEEEEEEEEE E
Sbjct: 71  AAAAAEEEEEEEEEEEEEEEESE 93


>gnl|CDD|217049 pfam02459, Adeno_terminal, Adenoviral DNA terminal protein.  This
           protein is covalently attached to the terminii of
           replicating DNA in vivo.
          Length = 548

 Score = 47.3 bits (113), Expect = 6e-06
 Identities = 27/45 (60%), Positives = 30/45 (66%)

Query: 126 IERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
           I+RL  +RR R        + EEEEEEEEE  EEEEEEEEEEE T
Sbjct: 285 IDRLPLRRRRRRRPPPSPPEPEEEEEEEEEVPEEEEEEEEEEERT 329



 Score = 41.2 bits (97), Expect = 5e-04
 Identities = 22/30 (73%), Positives = 24/30 (80%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
           EE+EEEE  EEEEEEEEEEE   EEE+ AT
Sbjct: 308 EEEEEEEVPEEEEEEEEEEERTFEEEVRAT 337



 Score = 37.7 bits (88), Expect = 0.007
 Identities = 19/32 (59%), Positives = 20/32 (62%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
           EE+EEE  EEEEEEEEEEE   EEE       
Sbjct: 309 EEEEEEVPEEEEEEEEEEERTFEEEVRATVAE 340


>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
           includes archaeal L12p, the protein that is functionally
           equivalent to L7/L12 in bacteria and the P1 and P2
           proteins in eukaryotes. L12p is homologous to P1 and P2
           but is not homologous to bacterial L7/L12. It is located
           in the L12 stalk, with proteins L10, L11, and 23S rRNA.
           L12p is the only protein in the ribosome to occur as
           multimers, always appearing as sets of dimers. Recent
           data indicate that most archaeal species contain six
           copies of L12p (three homodimers), while eukaryotes have
           four copies (two heterodimers), and bacteria may have
           four or six copies (two or three homodimers), depending
           on the species. The organization of proteins within the
           stalk has been characterized primarily in bacteria,
           where L7/L12 forms either two or three homodimers and
           each homodimer binds to the extended C-terminal helix of
           L10. L7/L12 is attached to the ribosome through L10 and
           is the only ribosomal protein that does not directly
           interact with rRNA. Archaeal L12p is believed to
           function in a similar fashion. However, hybrid ribosomes
           containing the large subunit from E. coli with an
           archaeal stalk are able to bind archaeal and eukaryotic
           elongation factors but not bacterial elongation factors.
           In several mesophilic and thermophilic archaeal species,
           the binding of 23S rRNA to protein L11 and to the
           L10/L12p pentameric complex was found to be
           temperature-dependent and cooperative.
          Length = 106

 Score = 43.3 bits (102), Expect = 1e-05
 Identities = 18/31 (58%), Positives = 24/31 (77%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATL 173
              EE+ EE+EEE+++EEE+EEEEEE  A L
Sbjct: 72  AAAEEKAEEKEEEKKKEEEKEEEEEEALAGL 102



 Score = 40.9 bits (96), Expect = 8e-05
 Identities = 17/25 (68%), Positives = 22/25 (88%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEE 165
             EEK EE+EEE+++EEE+EEEEEE
Sbjct: 73  AAEEKAEEKEEEKKKEEEKEEEEEE 97



 Score = 39.0 bits (91), Expect = 3e-04
 Identities = 15/29 (51%), Positives = 21/29 (72%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
               EE+ EE+EEE+++EEE+EEEEE   
Sbjct: 71  AAAAEEKAEEKEEEKKKEEEKEEEEEEAL 99



 Score = 36.3 bits (84), Expect = 0.003
 Identities = 13/24 (54%), Positives = 21/24 (87%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEE 163
            + + +E+EEE+++EEE+EEEEEE
Sbjct: 74  AEEKAEEKEEEKKKEEEKEEEEEE 97


>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein.  This
           family includes the radial spoke head proteins RSP4 and
           RSP6 from Chlamydomonas reinhardtii, and several
           eukaryotic homologues, including mammalian RSHL1, the
           protein product of a familial ciliary dyskinesia
           candidate gene.
          Length = 481

 Score = 45.4 bits (108), Expect = 2e-05
 Identities = 20/39 (51%), Positives = 26/39 (66%)

Query: 138 YTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPC 176
              + +++EEE+E+EEEEEEEEE EE E EE    L P 
Sbjct: 342 VNPEQKDEEEEQEDEEEEEEEEEPEEPEPEEGPPLLTPI 380



 Score = 43.9 bits (104), Expect = 6e-05
 Identities = 20/33 (60%), Positives = 24/33 (72%)

Query: 136 RLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           R      E+K+EEEE+E+EEEEEEEEE EE E 
Sbjct: 338 RCTWVNPEQKDEEEEQEDEEEEEEEEEPEEPEP 370



 Score = 43.1 bits (102), Expect = 1e-04
 Identities = 17/34 (50%), Positives = 23/34 (67%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
             EE++E+EEEEEEEEE EE E EE   +   ++
Sbjct: 348 DEEEEQEDEEEEEEEEEPEEPEPEEGPPLLTPIS 381



 Score = 33.9 bits (78), Expect = 0.098
 Identities = 15/29 (51%), Positives = 19/29 (65%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
           EE+EEEEEEEE  E +E EE  + E +  
Sbjct: 288 EEEEEEEEEEEPAERDELEENPDFEGLEV 316



 Score = 33.9 bits (78), Expect = 0.10
 Identities = 19/30 (63%), Positives = 23/30 (76%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
           +GEE+EEEEE  E +EE+E EEEEEEE   
Sbjct: 169 EGEEEEEEEEVGEADEEDEGEEEEEEEPEE 198



 Score = 33.9 bits (78), Expect = 0.11
 Identities = 20/39 (51%), Positives = 26/39 (66%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPCGL 178
           G+ EE+EEE  E +EE+E EEEEEEE EE+       G+
Sbjct: 170 GEEEEEEEEVGEADEEDEGEEEEEEEPEEVPKEEEGTGV 208



 Score = 32.3 bits (74), Expect = 0.33
 Identities = 16/28 (57%), Positives = 19/28 (67%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           +G  + EEEEEEEEEEE  E +E EE  
Sbjct: 282 RGLYQLEEEEEEEEEEEPAERDELEENP 309



 Score = 32.0 bits (73), Expect = 0.39
 Identities = 18/31 (58%), Positives = 22/31 (70%)

Query: 138 YTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           Y  + E +E EEEEEEEE  E +EE+E EEE
Sbjct: 161 YVAEVELREGEEEEEEEEVGEADEEDEGEEE 191



 Score = 32.0 bits (73), Expect = 0.43
 Identities = 15/26 (57%), Positives = 18/26 (69%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
            E+EEEEEEEEE  E +E EE  + E
Sbjct: 287 LEEEEEEEEEEEPAERDELEENPDFE 312



 Score = 31.6 bits (72), Expect = 0.49
 Identities = 15/38 (39%), Positives = 17/38 (44%)

Query: 147 EEEEEEEEEEEEEEEEEEEEEEITATLNPCGLCRILPV 184
           E E  E EEEEEEEE  E +EE             +P 
Sbjct: 164 EVELREGEEEEEEEEVGEADEEDEGEEEEEEEPEEVPK 201



 Score = 31.6 bits (72), Expect = 0.51
 Identities = 18/26 (69%), Positives = 20/26 (76%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
            E EEEEEEEE  E +EE+E EEEEE
Sbjct: 168 REGEEEEEEEEVGEADEEDEGEEEEE 193



 Score = 31.2 bits (71), Expect = 0.76
 Identities = 16/33 (48%), Positives = 19/33 (57%)

Query: 135 PRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
           PR      EE+EEEEEEE  E +E EE  + E 
Sbjct: 281 PRGLYQLEEEEEEEEEEEPAERDELEENPDFEG 313


>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
           [Translation, ribosomal structure and biogenesis].
          Length = 109

 Score = 42.4 bits (100), Expect = 3e-05
 Identities = 14/28 (50%), Positives = 19/28 (67%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
            G E   E +E EEEE+EEE EEE +++
Sbjct: 75  AGAEAAAEADEAEEEEKEEEAEEESDDD 102



 Score = 41.2 bits (97), Expect = 6e-05
 Identities = 13/30 (43%), Positives = 19/30 (63%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
            E   E +E EEEE+EEE EEE +++ +  
Sbjct: 77  AEAAAEADEAEEEEKEEEAEEESDDDMLFG 106



 Score = 39.3 bits (92), Expect = 3e-04
 Identities = 14/28 (50%), Positives = 16/28 (57%)

Query: 146 EEEEEEEEEEEEEEEEEEEEEEEITATL 173
            E   E +E EEEE+EEE EEE     L
Sbjct: 77  AEAAAEADEAEEEEKEEEAEEESDDDML 104



 Score = 38.5 bits (90), Expect = 5e-04
 Identities = 13/29 (44%), Positives = 15/29 (51%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
                   E   E +E EEEE+EEE EEE
Sbjct: 70  AAAAAAGAEAAAEADEAEEEEKEEEAEEE 98



 Score = 38.5 bits (90), Expect = 6e-04
 Identities = 13/27 (48%), Positives = 18/27 (66%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEE 166
           G     E +E EEEE+EEE EEE +++
Sbjct: 76  GAEAAAEADEAEEEEKEEEAEEESDDD 102



 Score = 34.7 bits (80), Expect = 0.013
 Identities = 12/29 (41%), Positives = 14/29 (48%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           G         E   E +E EEEE+EEE E
Sbjct: 68  GAAAAAAAGAEAAAEADEAEEEEKEEEAE 96


>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin.  Nucleoplasmins are also
           known as chromatin decondensation proteins. They bind to
           core histones and transfer DNA to them in a reaction
           that requires ATP. This is thought to play a role in the
           assembly of regular nucleosomal arrays.
          Length = 146

 Score = 42.7 bits (101), Expect = 3e-05
 Identities = 14/34 (41%), Positives = 27/34 (79%)

Query: 137 LYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
           + + + E  ++EE+EEEE++EE+++E+E EEE +
Sbjct: 107 VASEEDESDDDEEDEEEEDDEEDDDEDESEEEES 140


>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
          Length = 330

 Score = 42.9 bits (102), Expect = 9e-05
 Identities = 19/28 (67%), Positives = 19/28 (67%)

Query: 146 EEEEEEEEEEEEEEEEEEEEEEEITATL 173
               EEEEEEEEEEEEEE  EEE  A L
Sbjct: 298 AAAAEEEEEEEEEEEEEEPSEEEAAAGL 325



 Score = 41.4 bits (98), Expect = 3e-04
 Identities = 17/24 (70%), Positives = 18/24 (75%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEE 166
           +    EEEEEEEEEEEEEE  EEE
Sbjct: 297 QAAAAEEEEEEEEEEEEEEPSEEE 320



 Score = 40.2 bits (95), Expect = 7e-04
 Identities = 17/25 (68%), Positives = 18/25 (72%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEE 167
            +    EEEEEEEEEEEEEE  EEE
Sbjct: 296 AQAAAAEEEEEEEEEEEEEEPSEEE 320



 Score = 39.1 bits (92), Expect = 0.002
 Identities = 16/23 (69%), Positives = 17/23 (73%)

Query: 146 EEEEEEEEEEEEEEEEEEEEEEE 168
           +    EEEEEEEEEEEEEE  EE
Sbjct: 297 QAAAAEEEEEEEEEEEEEEPSEE 319



 Score = 34.1 bits (79), Expect = 0.068
 Identities = 14/19 (73%), Positives = 16/19 (84%)

Query: 141 KGEEKEEEEEEEEEEEEEE 159
           + EE+EEEEEEEEE  EEE
Sbjct: 302 EEEEEEEEEEEEEEPSEEE 320



 Score = 34.1 bits (79), Expect = 0.068
 Identities = 14/26 (53%), Positives = 17/26 (65%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           +E    + +    EEEEEEEEEEEEE
Sbjct: 289 KEVLSAQAQAAAAEEEEEEEEEEEEE 314



 Score = 33.7 bits (78), Expect = 0.11
 Identities = 14/21 (66%), Positives = 15/21 (71%)

Query: 148 EEEEEEEEEEEEEEEEEEEEE 168
            +    EEEEEEEEEEEEEE 
Sbjct: 296 AQAAAAEEEEEEEEEEEEEEP 316



 Score = 30.2 bits (69), Expect = 1.4
 Identities = 14/26 (53%), Positives = 16/26 (61%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           E KE    + +    EEEEEEEEEEE
Sbjct: 287 ELKEVLSAQAQAAAAEEEEEEEEEEE 312



 Score = 28.7 bits (65), Expect = 3.7
 Identities = 13/27 (48%), Positives = 16/27 (59%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
            EE +E    + +    EEEEEEEEEE
Sbjct: 285 DEELKEVLSAQAQAAAAEEEEEEEEEE 311


>gnl|CDD|217829 pfam03985, Paf1, Paf1.  Members of this family are components of
           the RNA polymerase II associated Paf1 complex. The Paf1
           complex functions during the elongation phase of
           transcription in conjunction with Spt4-Spt5 and
           Spt16-Pob3i.
          Length = 431

 Score = 43.2 bits (102), Expect = 1e-04
 Identities = 19/45 (42%), Positives = 26/45 (57%), Gaps = 1/45 (2%)

Query: 125 LIERLDGKRRPRLYT-GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
              ++  KRR RL      E  E+E+EEEE+  +E EEEE E+ E
Sbjct: 350 KESKMRDKRRARLDPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSE 394



 Score = 29.3 bits (66), Expect = 3.0
 Identities = 10/27 (37%), Positives = 14/27 (51%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
            E +EEE E+ EEE  +  E+   E  
Sbjct: 383 DEHEEEEGEDSEEEGSQSREDGSSESS 409



 Score = 28.5 bits (64), Expect = 4.9
 Identities = 9/29 (31%), Positives = 15/29 (51%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
            + EE+E E+ EEE  +  E+   E   +
Sbjct: 383 DEHEEEEGEDSEEEGSQSREDGSSESSSD 411



 Score = 27.8 bits (62), Expect = 9.4
 Identities = 8/30 (26%), Positives = 13/30 (43%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
            + EE E+ EEE  +  E+   E   +   
Sbjct: 385 HEEEEGEDSEEEGSQSREDGSSESSSDVGS 414


>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II).  Bone
           sialoprotein (BSP) is a major structural protein of the
           bone matrix that is specifically expressed by
           fully-differentiated osteoblasts. The expression of bone
           sialoprotein (BSP) is normally restricted to mineralised
           connective tissues of bones and teeth where it has been
           associated with mineral crystal formation. However, it
           has been found that ectopic expression of BSP occurs in
           various lesions, including oral and extraoral
           carcinomas, in which it has been associated with the
           formation of microcrystalline deposits and the
           metastasis of cancer cells to bone.
          Length = 291

 Score = 42.4 bits (99), Expect = 1e-04
 Identities = 19/32 (59%), Positives = 23/32 (71%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
           K +E +E+EEEEEEEEEEE E EE E+    T
Sbjct: 131 KEDESDEDEEEEEEEEEEEAEVEENEQGTNGT 162



 Score = 40.4 bits (94), Expect = 6e-04
 Identities = 21/36 (58%), Positives = 24/36 (66%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
            GK   KE+E +E+EEEEEEEEEEE E EE     N
Sbjct: 125 AGKKATKEDESDEDEEEEEEEEEEEAEVEENEQGTN 160



 Score = 39.3 bits (91), Expect = 0.001
 Identities = 17/30 (56%), Positives = 23/30 (76%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
           G   +K  +E+E +E+EEEEEEEEEEE E+
Sbjct: 123 GNAGKKATKEDESDEDEEEEEEEEEEEAEV 152



 Score = 35.8 bits (82), Expect = 0.018
 Identities = 17/33 (51%), Positives = 24/33 (72%)

Query: 135 PRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
           P+     G++  +E+E +E+EEEEEEEEEEE E
Sbjct: 119 PKKAGNAGKKATKEDESDEDEEEEEEEEEEEAE 151



 Score = 35.4 bits (81), Expect = 0.029
 Identities = 17/33 (51%), Positives = 22/33 (66%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
               +++EEEEEEEEEEE E EE E+    T+T
Sbjct: 132 EDESDEDEEEEEEEEEEEAEVEENEQGTNGTST 164



 Score = 32.7 bits (74), Expect = 0.21
 Identities = 21/72 (29%), Positives = 25/72 (34%), Gaps = 30/72 (41%)

Query: 131 GKRRPRLYTGKGEEKEEEEEEEEEEE------------------------------EEEE 160
           GK+  +      +E+EEEEEEEEE E                              EE E
Sbjct: 126 GKKATKEDESDEDEEEEEEEEEEEAEVEENEQGTNGTSTNSTEVDHGNGSSGGDNGEEGE 185

Query: 161 EEEEEEEEITAT 172
           EE   E E   T
Sbjct: 186 EESVTEAEAEGT 197



 Score = 32.0 bits (72), Expect = 0.36
 Identities = 14/28 (50%), Positives = 20/28 (71%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           K     ++  +E+E +E+EEEEEEEEEE
Sbjct: 121 KAGNAGKKATKEDESDEDEEEEEEEEEE 148



 Score = 31.6 bits (71), Expect = 0.47
 Identities = 13/29 (44%), Positives = 17/29 (58%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           G G+  EEE EEE   EEE  E+ +  E+
Sbjct: 55  GDGDSSEEEGEEETSNEEENNEDSDGNED 83



 Score = 30.0 bits (67), Expect = 1.3
 Identities = 14/32 (43%), Positives = 18/32 (56%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
           G   E+E EEE   EEE  E+ +  E+EE  A
Sbjct: 57  GDSSEEEGEEETSNEEENNEDSDGNEDEEAEA 88



 Score = 30.0 bits (67), Expect = 1.3
 Identities = 13/33 (39%), Positives = 16/33 (48%)

Query: 136 RLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           R     G +  EE  + +  EEE EEE   EEE
Sbjct: 41  RFSVQSGSDSSEENGDGDSSEEEGEEETSNEEE 73



 Score = 28.5 bits (63), Expect = 4.3
 Identities = 18/59 (30%), Positives = 19/59 (32%), Gaps = 30/59 (50%)

Query: 143 EEKEEEEEEEE------------------------------EEEEEEEEEEEEEEEITA 171
           EE+EEEEEE E                              EE EEE   E E E  T 
Sbjct: 141 EEEEEEEEEAEVEENEQGTNGTSTNSTEVDHGNGSSGGDNGEEGEEESVTEAEAEGTTV 199



 Score = 28.1 bits (62), Expect = 5.2
 Identities = 12/32 (37%), Positives = 16/32 (50%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
             +EE EEE   EEE  E+ +  E+E     N
Sbjct: 59  SSEEEGEEETSNEEENNEDSDGNEDEEAEAEN 90


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
           Fas-binding protein) is thought to play a role in
           apoptosis, but precise role played by Daxx remains to be
           determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 41.8 bits (98), Expect = 3e-04
 Identities = 26/58 (44%), Positives = 31/58 (53%)

Query: 111 EVERTGRKPWPQKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           E ER G                  P + + + EE+E  EEEEEEEEEEEEEE+E EEE
Sbjct: 408 ERERQGTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEE 465



 Score = 39.9 bits (93), Expect = 0.001
 Identities = 21/29 (72%), Positives = 25/29 (86%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
            + EE+EEEEEEEEE+E EEEE E+EEEE
Sbjct: 445 VEEEEEEEEEEEEEEQESEEEEGEDEEEE 473



 Score = 39.9 bits (93), Expect = 0.001
 Identities = 22/32 (68%), Positives = 25/32 (78%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
           EE+EEEEEEEE+E EEEE E+EEEEE     N
Sbjct: 449 EEEEEEEEEEEQESEEEEGEDEEEEEEVEADN 480



 Score = 36.8 bits (85), Expect = 0.013
 Identities = 26/70 (37%), Positives = 34/70 (48%), Gaps = 8/70 (11%)

Query: 109 TNEVERTGRKPWPQKGLIERLDGKRRPRLYTGKGE------EKEEEEEEEEEEEEEEEEE 162
           T E ER  R+   ++            +  +  GE      ++ EEEE  EEEEEEEEEE
Sbjct: 398 TEEEERRKRQE--RERQGTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEEEEE 455

Query: 163 EEEEEEITAT 172
           EEEE+E    
Sbjct: 456 EEEEQESEEE 465



 Score = 36.4 bits (84), Expect = 0.018
 Identities = 16/29 (55%), Positives = 20/29 (68%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
           +E EEEE E+EEEEEE E +   EEE+  
Sbjct: 460 QESEEEEGEDEEEEEEVEADNGSEEEMEG 488



 Score = 34.9 bits (80), Expect = 0.048
 Identities = 15/26 (57%), Positives = 19/26 (73%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           E++ EEEE E+EEEEEE E +   EE
Sbjct: 459 EQESEEEEGEDEEEEEEVEADNGSEE 484



 Score = 34.5 bits (79), Expect = 0.081
 Identities = 16/28 (57%), Positives = 20/28 (71%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           + EE+E EEEE E+EEEEEE E +   E
Sbjct: 456 EEEEQESEEEEGEDEEEEEEVEADNGSE 483



 Score = 34.1 bits (78), Expect = 0.094
 Identities = 16/27 (59%), Positives = 20/27 (74%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
           EE+E+E EEEE E+EEEEEE E +   
Sbjct: 456 EEEEQESEEEEGEDEEEEEEVEADNGS 482



 Score = 33.0 bits (75), Expect = 0.20
 Identities = 15/27 (55%), Positives = 19/27 (70%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEE 167
           + EE+E E+EEEEEE E +   EEE E
Sbjct: 461 ESEEEEGEDEEEEEEVEADNGSEEEME 487



 Score = 28.7 bits (64), Expect = 4.4
 Identities = 17/52 (32%), Positives = 24/52 (46%), Gaps = 10/52 (19%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEE----------EEEEEEITATLNPCGLCRI 181
            +GE++EEEEE E +   EEE E          E EE+         G+ R+
Sbjct: 465 EEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEMAGISRM 516


>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein.  This
           family includes archaebacterial L12, eukaryotic P0, P1
           and P2.
          Length = 88

 Score = 38.8 bits (91), Expect = 3e-04
 Identities = 11/22 (50%), Positives = 15/22 (68%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEE 161
                  EEE++EEEEEEEE++
Sbjct: 60  AAAAAAAEEEKKEEEEEEEEDD 81



 Score = 38.8 bits (91), Expect = 3e-04
 Identities = 11/25 (44%), Positives = 15/25 (60%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEE 166
                     EEE++EEEEEEEE++
Sbjct: 57  AAAAAAAAAAEEEKKEEEEEEEEDD 81



 Score = 38.4 bits (90), Expect = 4e-04
 Identities = 11/24 (45%), Positives = 15/24 (62%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEE 162
                    EEE++EEEEEEEE++
Sbjct: 58  AAAAAAAAAEEEKKEEEEEEEEDD 81



 Score = 38.4 bits (90), Expect = 4e-04
 Identities = 11/23 (47%), Positives = 17/23 (73%)

Query: 147 EEEEEEEEEEEEEEEEEEEEEEI 169
                 EEE++EEEEEEEE++++
Sbjct: 61  AAAAAAEEEKKEEEEEEEEDDDM 83



 Score = 37.6 bits (88), Expect = 6e-04
 Identities = 11/25 (44%), Positives = 15/25 (60%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEE 164
                     EEE++EEEEEEEE++
Sbjct: 57  AAAAAAAAAAEEEKKEEEEEEEEDD 81


>gnl|CDD|216269 pfam01056, Myc_N, Myc amino-terminal region.  The myc family
           belongs to the basic helix-loop-helix leucine zipper
           class of transcription factors, see pfam00010. Myc forms
           a heterodimer with Max, and this complex regulates cell
           growth through direct activation of genes involved in
           cell replication. Mutations in the C-terminal 20
           residues of this domain cause unique changes in the
           induction of apoptosis, transformation, and G2 arrest.
          Length = 329

 Score = 41.1 bits (96), Expect = 4e-04
 Identities = 19/37 (51%), Positives = 21/37 (56%)

Query: 133 RRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
            R  L T          + E EE+EEEEEEEEEEEEI
Sbjct: 208 PRLGLRTPPNSSSSSGSDSESEEDEEEEEEEEEEEEI 244



 Score = 36.1 bits (83), Expect = 0.019
 Identities = 16/25 (64%), Positives = 20/25 (80%)

Query: 146 EEEEEEEEEEEEEEEEEEEEEEEIT 170
            + E EE+EEEEEEEEEEEE + +T
Sbjct: 224 SDSESEEDEEEEEEEEEEEEIDVVT 248


>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
           Provisional.
          Length = 2849

 Score = 41.6 bits (97), Expect = 4e-04
 Identities = 15/27 (55%), Positives = 26/27 (96%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
           ++ +E+E+E++++EE++EEEEEEEEEI
Sbjct: 154 DDDDEDEDEDDDDEEDDEEEEEEEEEI 180



 Score = 41.2 bits (96), Expect = 7e-04
 Identities = 13/25 (52%), Positives = 24/25 (96%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEE 166
            ++++E+E++++EE++EEEEEEEEE
Sbjct: 155 DDDEDEDEDDDDEEDDEEEEEEEEE 179



 Score = 40.8 bits (95), Expect = 8e-04
 Identities = 16/33 (48%), Positives = 26/33 (78%)

Query: 135 PRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
           PR      ++ E+E+E++++EE++EEEEEEEEE
Sbjct: 147 PRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEE 179



 Score = 38.9 bits (90), Expect = 0.004
 Identities = 16/36 (44%), Positives = 24/36 (66%)

Query: 130 DGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEE 165
           D   R        +E E+E++++EE++EEEEEEEEE
Sbjct: 144 DMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEE 179



 Score = 38.1 bits (88), Expect = 0.006
 Identities = 19/52 (36%), Positives = 31/52 (59%), Gaps = 6/52 (11%)

Query: 123 KGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEE------EEEE 168
           + L E     R   +     E+++E++++EE++EEEEEEEEE      E+EE
Sbjct: 138 RHLAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEE 189



 Score = 36.6 bits (84), Expect = 0.018
 Identities = 10/31 (32%), Positives = 23/31 (74%)

Query: 147 EEEEEEEEEEEEEEEEEEEEEEITATLNPCG 177
           ++++E+E+E++++EE++EEEEE    +    
Sbjct: 154 DDDDEDEDEDDDDEEDDEEEEEEEEEIKGFD 184



 Score = 29.2 bits (65), Expect = 3.4
 Identities = 14/26 (53%), Positives = 23/26 (88%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           +E+++EEEEEEEEE +  ++E+EE+E
Sbjct: 166 DEEDDEEEEEEEEEIKGFDDEDEEDE 191



 Score = 28.9 bits (64), Expect = 5.1
 Identities = 8/37 (21%), Positives = 23/37 (62%)

Query: 132 KRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           +RR R    +     +    ++++E+E+E++++EE++
Sbjct: 134 RRRARHLAEEDMSPRDNFVIDDDDEDEDEDDDDEEDD 170


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
           Rpc31.  RNA polymerase III contains seventeen subunits
           in yeasts and in human cells. Twelve of these are akin
           to RNA polymerase I or II and the other five are RNA pol
           III-specific, and form the functionally distinct groups
           (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
           Rpc34 and Rpc82 form a cluster of enzyme-specific
           subunits that contribute to transcription initiation in
           S.cerevisiae and H.sapiens. There is evidence that these
           subunits are anchored at or near the N-terminal Zn-fold
           of Rpc1, itself prolonged by a highly conserved but RNA
           polymerase III-specific domain.
          Length = 221

 Score = 39.4 bits (92), Expect = 0.001
 Identities = 13/28 (46%), Positives = 26/28 (92%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           + +EK+EEEEEEEEEE+E+ ++++++++
Sbjct: 170 EEDEKDEEEEEEEEEEDEDFDDDDDDDD 197



 Score = 39.0 bits (91), Expect = 0.001
 Identities = 12/28 (42%), Positives = 25/28 (89%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
             E++++EEEEEEEEEE+E+ +++++++
Sbjct: 169 DEEDEKDEEEEEEEEEEDEDFDDDDDDD 196



 Score = 37.8 bits (88), Expect = 0.003
 Identities = 15/27 (55%), Positives = 24/27 (88%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
            E+ +EE+E++EEEEEEEEEE+E+ ++
Sbjct: 165 AEDVDEEDEKDEEEEEEEEEEDEDFDD 191



 Score = 37.4 bits (87), Expect = 0.004
 Identities = 12/28 (42%), Positives = 24/28 (85%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
            + E+ EEEEEEEEEE+E+ ++++++++
Sbjct: 170 EEDEKDEEEEEEEEEEDEDFDDDDDDDD 197



 Score = 37.0 bits (86), Expect = 0.006
 Identities = 17/34 (50%), Positives = 25/34 (73%)

Query: 136 RLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
           +L   + + KE E E+ +EE+E++EEEEEEEEE 
Sbjct: 152 KLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEE 185



 Score = 36.7 bits (85), Expect = 0.007
 Identities = 13/26 (50%), Positives = 24/26 (92%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
            ++E+E++EEEEEEEEEE+E+ ++++
Sbjct: 168 VDEEDEKDEEEEEEEEEEDEDFDDDD 193



 Score = 36.7 bits (85), Expect = 0.007
 Identities = 14/26 (53%), Positives = 23/26 (88%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           +  EE+E++EEEEEEEEEE+E+ +++
Sbjct: 167 DVDEEDEKDEEEEEEEEEEDEDFDDD 192



 Score = 36.7 bits (85), Expect = 0.009
 Identities = 17/28 (60%), Positives = 24/28 (85%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           K  E E+ +EE+E++EEEEEEEEEE+E+
Sbjct: 161 KELEAEDVDEEDEKDEEEEEEEEEEDED 188



 Score = 35.1 bits (81), Expect = 0.023
 Identities = 17/28 (60%), Positives = 24/28 (85%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           K +E E E+ +EE+E++EEEEEEEEEE+
Sbjct: 159 KLKELEAEDVDEEDEKDEEEEEEEEEED 186



 Score = 34.0 bits (78), Expect = 0.072
 Identities = 9/25 (36%), Positives = 20/25 (80%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEE 163
               EE+EEEEEE+E+ ++++++++
Sbjct: 173 EKDEEEEEEEEEEDEDFDDDDDDDD 197



 Score = 31.3 bits (71), Expect = 0.42
 Identities = 12/30 (40%), Positives = 20/30 (66%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
             K    E++ +E E E+ +EE+E++EEEE
Sbjct: 150 DEKLSMLEKKLKELEAEDVDEEDEKDEEEE 179



 Score = 30.5 bits (69), Expect = 0.88
 Identities = 12/38 (31%), Positives = 24/38 (63%)

Query: 131 GKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
            KR+  L+T + E+ +E+    E++ +E E E+ +EE+
Sbjct: 135 FKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEED 172


>gnl|CDD|220972 pfam11081, DUF2890, Protein of unknown function (DUF2890).  This
           family is conserved in dsDNA adenoviruses of
           vertebrates. The function is not known.
          Length = 172

 Score = 38.7 bits (90), Expect = 0.001
 Identities = 19/38 (50%), Positives = 23/38 (60%), Gaps = 4/38 (10%)

Query: 142 GEEKEEEEEEEEEEE----EEEEEEEEEEEEITATLNP 175
            EE EE+EEE E+ E    EE+EE EE EEE  A+   
Sbjct: 28  AEEVEEDEEEMEDWEDSLDEEDEEAEEVEEETAASSKA 65



 Score = 31.8 bits (72), Expect = 0.26
 Identities = 16/46 (34%), Positives = 21/46 (45%)

Query: 119 PWPQKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEE 164
           P P K   E  D +        +  E  E+  +EE+EE EE EEE 
Sbjct: 14  PPPTKDEEEDWDSQAEEVEEDEEEMEDWEDSLDEEDEEAEEVEEET 59



 Score = 29.5 bits (66), Expect = 1.7
 Identities = 12/25 (48%), Positives = 18/25 (72%)

Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEE 168
           + EEE+ + + EE EE+EEE E+ E
Sbjct: 18  KDEEEDWDSQAEEVEEDEEEMEDWE 42



 Score = 29.1 bits (65), Expect = 2.2
 Identities = 11/26 (42%), Positives = 20/26 (76%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           +++EE+ + + EE EE+EEE E+ E+
Sbjct: 18  KDEEEDWDSQAEEVEEDEEEMEDWED 43


>gnl|CDD|224530 COG1614, CdhC, CO dehydrogenase/acetyl-CoA synthase beta subunit
           [Energy production and conversion].
          Length = 470

 Score = 39.4 bits (92), Expect = 0.002
 Identities = 20/45 (44%), Positives = 22/45 (48%)

Query: 125 LIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
           L E L  K  P +     EE+EEEEEEEEE  E E   EE     
Sbjct: 386 LREFLKEKGHPVVERWAEEEEEEEEEEEEEAAEAEAPMEEPVPGF 430



 Score = 38.6 bits (90), Expect = 0.003
 Identities = 17/28 (60%), Positives = 17/28 (60%)

Query: 146 EEEEEEEEEEEEEEEEEEEEEEEITATL 173
           EEEEEEEEEEEEE  E E   EE     
Sbjct: 403 EEEEEEEEEEEEEAAEAEAPMEEPVPGF 430



 Score = 34.4 bits (79), Expect = 0.076
 Identities = 16/25 (64%), Positives = 16/25 (64%)

Query: 147 EEEEEEEEEEEEEEEEEEEEEEITA 171
           E   EEEEEEEEEEEEE  E E   
Sbjct: 399 ERWAEEEEEEEEEEEEEAAEAEAPM 423



 Score = 32.9 bits (75), Expect = 0.20
 Identities = 16/25 (64%), Positives = 17/25 (68%)

Query: 149 EEEEEEEEEEEEEEEEEEEEITATL 173
           E   EEEEEEEEEEEEE  E  A +
Sbjct: 399 ERWAEEEEEEEEEEEEEAAEAEAPM 423



 Score = 30.6 bits (69), Expect = 1.3
 Identities = 14/35 (40%), Positives = 16/35 (45%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPCG 177
           EE+EEEEEE  E E   EE     E     +   G
Sbjct: 407 EEEEEEEEEAAEAEAPMEEPVPGFEVPEMPMPAAG 441


>gnl|CDD|217450 pfam03247, Prothymosin, Prothymosin/parathymosin family.
           Prothymosin alpha and parathymosin are two ubiquitous
           small acidic nuclear proteins that are thought to be
           involved in cell cycle progression, proliferation, and
           cell differentiation.
          Length = 106

 Score = 36.5 bits (84), Expect = 0.002
 Identities = 23/58 (39%), Positives = 30/58 (51%)

Query: 111 EVERTGRKPWPQKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
             E+   K  P  G     +G +       + EE +E++EEEE E EEEE EEEEE E
Sbjct: 23  VEEKENGKNAPANGNENEENGAQEGDDEMEEEEEVDEDDEEEEGEGEEEEGEEEEETE 80



 Score = 33.4 bits (76), Expect = 0.032
 Identities = 19/31 (61%), Positives = 23/31 (74%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATL 173
           EE+ +E++EEEE E EEEE EEEEE   AT 
Sbjct: 54  EEEVDEDDEEEEGEGEEEEGEEEEETEGATG 84



 Score = 27.2 bits (60), Expect = 5.4
 Identities = 18/38 (47%), Positives = 21/38 (55%), Gaps = 6/38 (15%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEE------EEEEEITAT 172
             EE+E E EEEE EEEEE E        E+EE+   T
Sbjct: 60  DDEEEEGEGEEEEGEEEEETEGATGKRAAEDEEDDAET 97


>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 156 to 174 amino acids in length. This domain is
           found associated with pfam07780, pfam01728.
          Length = 154

 Score = 37.3 bits (87), Expect = 0.003
 Identities = 16/36 (44%), Positives = 24/36 (66%)

Query: 136 RLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
           R   G  ++++EEEEEEE E EE +EEE+ +E +  
Sbjct: 91  RKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEK 126



 Score = 35.3 bits (82), Expect = 0.012
 Identities = 17/34 (50%), Positives = 23/34 (67%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
           K +E+EEEEE E EE +EEE+ +E  E+  A L 
Sbjct: 99  KEKEEEEEEEVEVEELDEEEQIDELLEKELAKLK 132



 Score = 34.2 bits (79), Expect = 0.037
 Identities = 15/31 (48%), Positives = 22/31 (70%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
             + EE+EEEE E EE +EEE+ +E  E+E+
Sbjct: 98  KKEKEEEEEEEVEVEELDEEEQIDELLEKEL 128



 Score = 30.7 bits (70), Expect = 0.57
 Identities = 11/29 (37%), Positives = 17/29 (58%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
             + EE E EE +EEE+ +E  E+E  + 
Sbjct: 103 EEEEEEVEVEELDEEEQIDELLEKELAKL 131



 Score = 29.2 bits (66), Expect = 1.8
 Identities = 12/40 (30%), Positives = 22/40 (55%)

Query: 129 LDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           LD K +      + E +E +EEE+ +E  E+E  + + E+
Sbjct: 96  LDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREK 135



 Score = 26.9 bits (60), Expect = 8.6
 Identities = 8/26 (30%), Positives = 16/26 (61%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
            E+ +EEE+ +E  E+E  + + E+ 
Sbjct: 111 VEELDEEEQIDELLEKELAKLKREKR 136



 Score = 26.9 bits (60), Expect = 9.3
 Identities = 9/26 (34%), Positives = 16/26 (61%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           EE +EEE+ +E  E+E  + + E+  
Sbjct: 112 EELDEEEQIDELLEKELAKLKREKRR 137


>gnl|CDD|235302 PRK04456, PRK04456, acetyl-CoA decarbonylase/synthase complex
           subunit beta; Reviewed.
          Length = 463

 Score = 38.1 bits (89), Expect = 0.005
 Identities = 17/27 (62%), Positives = 18/27 (66%)

Query: 145 KEEEEEEEEEEEEEEEEEEEEEEEITA 171
             EEEEEEEEEEEEEEE   E   + A
Sbjct: 403 AAEEEEEEEEEEEEEEEPVAEVMMMPA 429



 Score = 37.4 bits (87), Expect = 0.008
 Identities = 17/24 (70%), Positives = 18/24 (75%)

Query: 144 EKEEEEEEEEEEEEEEEEEEEEEE 167
           E+   EEEEEEEEEEEEEEE   E
Sbjct: 400 ERWAAEEEEEEEEEEEEEEEPVAE 423



 Score = 36.2 bits (84), Expect = 0.016
 Identities = 20/42 (47%), Positives = 23/42 (54%), Gaps = 3/42 (7%)

Query: 125 LIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEE 166
           L + L  K  P +   +    EEEEEEEEEEEEEEE   E  
Sbjct: 387 LKKFLKEKEHPVV---ERWAAEEEEEEEEEEEEEEEPVAEVM 425



 Score = 35.4 bits (82), Expect = 0.031
 Identities = 15/22 (68%), Positives = 16/22 (72%)

Query: 151 EEEEEEEEEEEEEEEEEEITAT 172
             EEEEEEEEEEEEEEE +   
Sbjct: 403 AAEEEEEEEEEEEEEEEPVAEV 424



 Score = 34.7 bits (80), Expect = 0.053
 Identities = 15/27 (55%), Positives = 16/27 (59%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
             +EEEEEEEEEEEEEE   E      
Sbjct: 403 AAEEEEEEEEEEEEEEEPVAEVMMMPA 429



 Score = 33.9 bits (78), Expect = 0.11
 Identities = 14/23 (60%), Positives = 14/23 (60%)

Query: 153 EEEEEEEEEEEEEEEEITATLNP 175
             EEEEEEEEEEEEEE       
Sbjct: 403 AAEEEEEEEEEEEEEEEPVAEVM 425



 Score = 31.6 bits (72), Expect = 0.50
 Identities = 17/33 (51%), Positives = 18/33 (54%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
           +EKE    E    EEEEEEEEEEEEE       
Sbjct: 392 KEKEHPVVERWAAEEEEEEEEEEEEEEEPVAEV 424


>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
           This family represents Cwf15/Cwc15 (from
           Schizosaccharomyces pombe and Saccharomyces cerevisiae
           respectively) and their homologues. The function of
           these proteins is unknown, but they form part of the
           spliceosome and are thus thought to be involved in mRNA
           splicing.
          Length = 241

 Score = 37.4 bits (87), Expect = 0.005
 Identities = 17/31 (54%), Positives = 20/31 (64%)

Query: 145 KEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
           KE  EE+E EEEE+  EEE+  EE   T NP
Sbjct: 156 KERAEEKEREEEEKAAEEEKAREEEILTGNP 186



 Score = 27.8 bits (62), Expect = 7.8
 Identities = 3/31 (9%), Positives = 15/31 (48%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATL 173
           +     + +  +++ ++++ E+E   +   L
Sbjct: 121 DSDSSSDSDSSDDDSDDDDSEDETAALLREL 151


>gnl|CDD|221581 pfam12446, DUF3682, Protein of unknown function (DUF3682).  This
           domain family is found in eukaryotes, and is typically
           between 125 and 136 amino acids in length.
          Length = 133

 Score = 35.9 bits (83), Expect = 0.006
 Identities = 12/26 (46%), Positives = 20/26 (76%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEE 165
           G G  ++EEEEEEEE E++++ +E +
Sbjct: 90  GTGHTRQEEEEEEEENEKQQQSDEAQ 115



 Score = 34.4 bits (79), Expect = 0.023
 Identities = 12/29 (41%), Positives = 19/29 (65%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
           T  G     +EEEEEEEE E++++ +E +
Sbjct: 87  TTSGTGHTRQEEEEEEEENEKQQQSDEAQ 115



 Score = 32.1 bits (73), Expect = 0.14
 Identities = 10/23 (43%), Positives = 18/23 (78%)

Query: 147 EEEEEEEEEEEEEEEEEEEEEEI 169
             +EEEEEEEE E++++ +E ++
Sbjct: 94  TRQEEEEEEEENEKQQQSDEAQV 116



 Score = 32.1 bits (73), Expect = 0.15
 Identities = 10/23 (43%), Positives = 16/23 (69%)

Query: 146 EEEEEEEEEEEEEEEEEEEEEEE 168
               +EEEEEEEE E++++ +E 
Sbjct: 92  GHTRQEEEEEEEENEKQQQSDEA 114



 Score = 29.4 bits (66), Expect = 1.1
 Identities = 11/33 (33%), Positives = 23/33 (69%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
           T + EE+EEEE E++++ +E + ++ ++ E  A
Sbjct: 94  TRQEEEEEEEENEKQQQSDEAQVQQHQQHEHPA 126


>gnl|CDD|117592 pfam09026, Cenp-B_dimeris, Centromere protein B dimerisation
           domain.  The centromere protein B (CENP-B) dimerisation
           domain is composed of two alpha-helices, which are
           folded into an antiparallel configuration. Dimerisation
           of CENP-B is mediated by this domain, in which monomers
           dimerise to form a symmetrical, antiparallel, four-helix
           bundle structure with a large hydrophobic patch in which
           23 residues of one monomer form van der Waals contacts
           with the other monomer. This CENP-B dimer configuration
           may be suitable for capturing two distant CENP-B boxes
           during centromeric heterochromatin formation.
          Length = 101

 Score = 35.5 bits (81), Expect = 0.006
 Identities = 12/34 (35%), Positives = 27/34 (79%)

Query: 135 PRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           P L+  +GEE  + + +EEE++++E+EE+++E++
Sbjct: 1   PTLHFLEGEEDSDSDSDEEEDDDDEDEEDDDEDD 34



 Score = 33.2 bits (75), Expect = 0.032
 Identities = 8/27 (29%), Positives = 23/27 (85%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
            E+ + + +EEE++++E+EE+++E+++
Sbjct: 9   EEDSDSDSDEEEDDDDEDEEDDDEDDD 35



 Score = 32.0 bits (72), Expect = 0.091
 Identities = 8/24 (33%), Positives = 24/24 (100%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEE 166
           +E+E++++E+EE+++E+++E+++E
Sbjct: 17  DEEEDDDDEDEEDDDEDDDEDDDE 40



 Score = 32.0 bits (72), Expect = 0.091
 Identities = 8/24 (33%), Positives = 24/24 (100%)

Query: 144 EKEEEEEEEEEEEEEEEEEEEEEE 167
           ++EE++++E+EE+++E+++E+++E
Sbjct: 17  DEEEDDDDEDEEDDDEDDDEDDDE 40


>gnl|CDD|235396 PRK05299, rpsB, 30S ribosomal protein S2; Provisional.
          Length = 258

 Score = 37.1 bits (87), Expect = 0.006
 Identities = 19/29 (65%), Positives = 20/29 (68%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           G+     E  EEEEEE EEEEEEEEEEE 
Sbjct: 224 GRQGRLAEAAEEEEEEAEEEEEEEEEEEA 252



 Score = 35.5 bits (83), Expect = 0.023
 Identities = 20/37 (54%), Positives = 25/37 (67%)

Query: 129 LDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEE 165
           L+G++       + EE+E EEEEEEEEEEE EE E E
Sbjct: 222 LEGRQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258



 Score = 34.8 bits (81), Expect = 0.040
 Identities = 20/27 (74%), Positives = 21/27 (77%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEE 167
             EE+EEE EEEEEEEEEEE EE E E
Sbjct: 232 AAEEEEEEAEEEEEEEEEEEAEEAEAE 258



 Score = 34.4 bits (80), Expect = 0.050
 Identities = 21/29 (72%), Positives = 22/29 (75%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           G+  E  EEEEEE EEEEEEEEEEE EE 
Sbjct: 227 GRLAEAAEEEEEEAEEEEEEEEEEEAEEA 255



 Score = 33.6 bits (78), Expect = 0.083
 Identities = 19/26 (73%), Positives = 20/26 (76%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
             +EEEEE EEEEEEEEEEE EE E 
Sbjct: 232 AAEEEEEEAEEEEEEEEEEEAEEAEA 257


>gnl|CDD|223166 COG0088, RplD, Ribosomal protein L4 [Translation, ribosomal
           structure and biogenesis].
          Length = 214

 Score = 36.8 bits (86), Expect = 0.006
 Identities = 22/65 (33%), Positives = 28/65 (43%), Gaps = 1/65 (1%)

Query: 72  VNLHPDIFGLVPRLDVITMNIEWQKKYKWVRFDHTKVTNEVERTGRKPWPQKGL-IERLD 130
           V L P++FG     D+I   +  Q   +      TK   EV   G+KPW QKG    R  
Sbjct: 17  VELLPEVFGPERNEDLIHRAVLAQLANRRQGTHSTKTRGEVSGGGKKPWGQKGTGRARQG 76

Query: 131 GKRRP 135
             R P
Sbjct: 77  SIRSP 81



 Score = 34.1 bits (79), Expect = 0.052
 Identities = 28/107 (26%), Positives = 45/107 (42%), Gaps = 15/107 (14%)

Query: 191 GLTSTLSIKLMQDDLHIVKDLVLPVDDPK------YLENLVEERCWGPSVLFVDNTDYME 244
            L S LS K     L +V+  V   D PK      +L+ L  +      +L V      +
Sbjct: 110 ALRSALSAKARAGKLVVVRGHVF-EDAPKTKELVEFLKKLGLDV---KRLLIV--KGERD 163

Query: 245 RNISIVTDQIKHYNIMPAYGENN---MLKHDTLVITVDALKQIEDKL 288
            N  +    +K+  ++   G      +L+ D LVIT  A  +IE++L
Sbjct: 164 GNGKLSARNLKNVKVVLVVGGLPVVDVLRADKLVITKLAPGKIEERL 210


>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain.  This is a family of proteins of
           approximately 300 residues, found in plants and
           vertebrates. They contain a highly conserved DDRGK
           motif.
          Length = 189

 Score = 36.6 bits (85), Expect = 0.006
 Identities = 18/43 (41%), Positives = 28/43 (65%)

Query: 127 ERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
           ER + K+      G+ +E+EE EEE E+++EEEE +E EE+  
Sbjct: 27  EREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQAR 69



 Score = 36.2 bits (84), Expect = 0.010
 Identities = 21/52 (40%), Positives = 30/52 (57%)

Query: 117 RKPWPQKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           +    Q+   E    +R+      +GE KEEEE EEE E+++EEEE +E EE
Sbjct: 15  QARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREE 66



 Score = 33.5 bits (77), Expect = 0.065
 Identities = 19/60 (31%), Positives = 30/60 (50%)

Query: 111 EVERTGRKPWPQKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
           E ER  RK   +K   ER + +        K EE+E +E EE+  +E+EE E+ +   + 
Sbjct: 25  EEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQEEYEKLKSSFVV 84


>gnl|CDD|182840 PRK10921, PRK10921, twin-arginine protein translocation system
           subunit TatC; Provisional.
          Length = 258

 Score = 37.0 bits (86), Expect = 0.008
 Identities = 13/25 (52%), Positives = 16/25 (64%)

Query: 136 RLYTGKGEEKEEEEEEEEEEEEEEE 160
           R Y GKG  +EEE + E E E+ EE
Sbjct: 234 RFYVGKGRRREEENDAEAESEKTEE 258


>gnl|CDD|221490 pfam12253, CAF1A, Chromatin assembly factor 1 subunit A.  The CAF-1
           or chromatin assembly factor-1 consists of three
           subunits, and this is the first, or A. The A domain is
           uniquely required for the progression of S phase in
           mouse cells, independent of its ability to promote
           histone deposition but dependent on its ability to
           interact with HP1 - heterochromatin protein 1-rich
           heterochromatin domains next to centromeres that are
           crucial for chromosome segregation during mitosis. This
           HP1-CAF-1 interaction module functions as a built-in
           replication control for heterochromatin, which, like a
           control barrier, has an impact on S-phase progression in
           addition to DNA-based checkpoints.
          Length = 76

 Score = 34.5 bits (80), Expect = 0.008
 Identities = 14/27 (51%), Positives = 21/27 (77%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
             E EEEEE E+ E E+EE+EEE++++
Sbjct: 45  DAEWEEEEEGEDLESEDEEDEEEDDDD 71



 Score = 32.2 bits (74), Expect = 0.048
 Identities = 12/26 (46%), Positives = 22/26 (84%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           EE+EE E+ E E+EE+EEE+++++ +
Sbjct: 49  EEEEEGEDLESEDEEDEEEDDDDDMD 74



 Score = 28.3 bits (64), Expect = 0.95
 Identities = 9/21 (42%), Positives = 18/21 (85%)

Query: 142 GEEKEEEEEEEEEEEEEEEEE 162
           GE+ E E+EE+EEE+++++ +
Sbjct: 54  GEDLESEDEEDEEEDDDDDMD 74


>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
           subunit 1; Provisional.
          Length = 319

 Score = 36.9 bits (86), Expect = 0.008
 Identities = 15/39 (38%), Positives = 26/39 (66%)

Query: 130 DGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           +      L   + EE+E++  E E+E+EE+E+EEEEE++
Sbjct: 276 EEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDD 314



 Score = 35.0 bits (81), Expect = 0.037
 Identities = 18/47 (38%), Positives = 27/47 (57%), Gaps = 8/47 (17%)

Query: 132 KRRPRLYTGKGEE--------KEEEEEEEEEEEEEEEEEEEEEEEIT 170
           K  P +  G  E+        +EEEEE++  E E+E+EE+E+EEE  
Sbjct: 266 KGEPEVVGGDEEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEE 312



 Score = 34.6 bits (80), Expect = 0.044
 Identities = 17/39 (43%), Positives = 26/39 (66%)

Query: 130 DGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           D +    L     EE+EE++  E E+E+EE+E+EEEEE+
Sbjct: 275 DEEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEED 313



 Score = 34.6 bits (80), Expect = 0.055
 Identities = 14/26 (53%), Positives = 22/26 (84%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           EE +  E E+E+EE+E+EEEEE+++E
Sbjct: 291 EEDDYSESEDEDEEDEDEEEEEDDDE 316



 Score = 34.3 bits (79), Expect = 0.060
 Identities = 13/27 (48%), Positives = 21/27 (77%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
            E+   E E+E+EE+E+EEEEE+++E 
Sbjct: 291 EEDDYSESEDEDEEDEDEEEEEDDDEG 317



 Score = 33.1 bits (76), Expect = 0.15
 Identities = 12/29 (41%), Positives = 23/29 (79%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
            + +  E E+E+EE+E+EEEEE+++E ++
Sbjct: 291 EEDDYSESEDEDEEDEDEEEEEDDDEGDK 319



 Score = 28.5 bits (64), Expect = 4.1
 Identities = 10/25 (40%), Positives = 21/25 (84%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEE 163
             + E+++EE+E+EEEEE+++E ++
Sbjct: 295 YSESEDEDEEDEDEEEEEDDDEGDK 319


>gnl|CDD|111875 pfam03032, Brevenin, Brevenin/esculentin/gaegurin/rugosin family.
           This family contains a number of defence peptides
           secreted from the skin of amphibians, including the
           opiate-like dermorphins and deltorphins, and the
           antimicrobial dermoseptins and temporins. The alignment
           for this family includes the signal peptide.
          Length = 46

 Score = 33.5 bits (77), Expect = 0.009
 Identities = 14/23 (60%), Positives = 18/23 (78%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEE 165
           EEK E+EEE E+EEE EE+ E +
Sbjct: 23  EEKREDEEENEDEEEGEEQSEVK 45



 Score = 32.3 bits (74), Expect = 0.024
 Identities = 14/23 (60%), Positives = 18/23 (78%)

Query: 146 EEEEEEEEEEEEEEEEEEEEEEE 168
           EEE+ E+EEE E+EEE EE+ E 
Sbjct: 22  EEEKREDEEENEDEEEGEEQSEV 44



 Score = 31.9 bits (73), Expect = 0.034
 Identities = 13/24 (54%), Positives = 19/24 (79%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEE 166
           EE++ E+EEE E+EEE EE+ E +
Sbjct: 22  EEEKREDEEENEDEEEGEEQSEVK 45



 Score = 31.5 bits (72), Expect = 0.044
 Identities = 13/24 (54%), Positives = 19/24 (79%)

Query: 145 KEEEEEEEEEEEEEEEEEEEEEEE 168
           +EE+ E+EEE E+EEE EE+ E +
Sbjct: 22  EEEKREDEEENEDEEEGEEQSEVK 45



 Score = 31.2 bits (71), Expect = 0.056
 Identities = 13/24 (54%), Positives = 19/24 (79%)

Query: 144 EKEEEEEEEEEEEEEEEEEEEEEE 167
           E+E+ E+EEE E+EEE EE+ E +
Sbjct: 22  EEEKREDEEENEDEEEGEEQSEVK 45



 Score = 26.5 bits (59), Expect = 2.6
 Identities = 12/19 (63%), Positives = 15/19 (78%)

Query: 151 EEEEEEEEEEEEEEEEEEI 169
           EEE+ E+EEE E+EEE E 
Sbjct: 22  EEEKREDEEENEDEEEGEE 40



 Score = 26.5 bits (59), Expect = 2.6
 Identities = 11/21 (52%), Positives = 16/21 (76%)

Query: 141 KGEEKEEEEEEEEEEEEEEEE 161
           K E++EE E+EEE EE+ E +
Sbjct: 25  KREDEEENEDEEEGEEQSEVK 45



 Score = 25.4 bits (56), Expect = 5.9
 Identities = 11/19 (57%), Positives = 14/19 (73%)

Query: 152 EEEEEEEEEEEEEEEEEIT 170
           EEE+ E+EEE E+EEE   
Sbjct: 22  EEEKREDEEENEDEEEGEE 40


>gnl|CDD|222792 PHA00435, PHA00435, capsid assembly protein.
          Length = 306

 Score = 36.7 bits (85), Expect = 0.009
 Identities = 18/29 (62%), Positives = 19/29 (65%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
            E  EEEE EE EE+EEEE EEE EE   
Sbjct: 76  SEDGEEEEVEEGEEDEEEEGEEESEEFEP 104



 Score = 36.0 bits (83), Expect = 0.021
 Identities = 18/26 (69%), Positives = 20/26 (76%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEE 167
            E+ EEEE EE EE+EEEE EEE EE
Sbjct: 76  SEDGEEEEVEEGEEDEEEEGEEESEE 101



 Score = 35.6 bits (82), Expect = 0.027
 Identities = 17/30 (56%), Positives = 21/30 (70%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
            +  E+EE EE EE+EEEE EEE EE E +
Sbjct: 76  SEDGEEEEVEEGEEDEEEEGEEESEEFEPL 105



 Score = 34.8 bits (80), Expect = 0.047
 Identities = 22/50 (44%), Positives = 28/50 (56%), Gaps = 5/50 (10%)

Query: 127 ERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEE-----EEITA 171
           E  +G+   R+     EE+ EE EE+EEEE EEE EE E      EE+T 
Sbjct: 65  EDDEGRIEVRISEDGEEEEVEEGEEDEEEEGEEESEEFEPLGDTPEELTE 114



 Score = 33.3 bits (76), Expect = 0.13
 Identities = 19/38 (50%), Positives = 24/38 (63%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPCG 177
           G+ E +  E+ EEEE EE EE+EEEE EE +    P G
Sbjct: 69  GRIEVRISEDGEEEEVEEGEEDEEEEGEEESEEFEPLG 106



 Score = 32.1 bits (73), Expect = 0.32
 Identities = 18/36 (50%), Positives = 21/36 (58%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
            G+ EE EE EE+EEEE EEE EE E   +    L 
Sbjct: 78  DGEEEEVEEGEEDEEEEGEEESEEFEPLGDTPEELT 113



 Score = 28.6 bits (64), Expect = 3.6
 Identities = 14/31 (45%), Positives = 18/31 (58%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
           G+ +E   E    E+ EEEE EE EE+EE  
Sbjct: 64  GEDDEGRIEVRISEDGEEEEVEEGEEDEEEE 94


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 37.1 bits (86), Expect = 0.010
 Identities = 16/27 (59%), Positives = 23/27 (85%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEE 167
           + EEK+EEEE+  ++EEE +EEEE+EE
Sbjct: 43  EKEEKKEEEEKTTDKEEEVDEEEEKEE 69



 Score = 36.3 bits (84), Expect = 0.018
 Identities = 16/26 (61%), Positives = 23/26 (88%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           EEKEE++EEEE+  ++EEE +EEEE+
Sbjct: 42  EEKEEKKEEEEKTTDKEEEVDEEEEK 67



 Score = 35.9 bits (83), Expect = 0.021
 Identities = 17/35 (48%), Positives = 24/35 (68%)

Query: 136 RLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
            L+  K  EKE  +EEEEEE+EE++EEEE+  +  
Sbjct: 24  YLWVEKEVEKEVPDEEEEEEKEEKKEEEEKTTDKE 58



 Score = 35.9 bits (83), Expect = 0.026
 Identities = 15/26 (57%), Positives = 23/26 (88%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           EE++EE++EEEE+  ++EEE +EEEE
Sbjct: 41  EEEKEEKKEEEEKTTDKEEEVDEEEE 66



 Score = 35.5 bits (82), Expect = 0.034
 Identities = 14/26 (53%), Positives = 23/26 (88%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           E++E++EEEE+  ++EEE +EEEE+E
Sbjct: 43  EKEEKKEEEEKTTDKEEEVDEEEEKE 68



 Score = 35.1 bits (81), Expect = 0.037
 Identities = 15/26 (57%), Positives = 23/26 (88%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           EE+E+EE++EEEE+  ++EEE +EEE
Sbjct: 40  EEEEKEEKKEEEEKTTDKEEEVDEEE 65



 Score = 35.1 bits (81), Expect = 0.043
 Identities = 14/26 (53%), Positives = 23/26 (88%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           +E+++EEEE+  ++EEE +EEEE+EE
Sbjct: 44  KEEKKEEEEKTTDKEEEVDEEEEKEE 69



 Score = 35.1 bits (81), Expect = 0.046
 Identities = 18/28 (64%), Positives = 23/28 (82%)

Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEEITA 171
           EKE E+E  +EEEEEE+EE++EEEE T 
Sbjct: 28  EKEVEKEVPDEEEEEEKEEKKEEEEKTT 55



 Score = 34.7 bits (80), Expect = 0.058
 Identities = 15/26 (57%), Positives = 23/26 (88%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           EE+EE+EE++EEEE+  ++EEE +EE
Sbjct: 39  EEEEEKEEKKEEEEKTTDKEEEVDEE 64



 Score = 34.4 bits (79), Expect = 0.064
 Identities = 15/26 (57%), Positives = 23/26 (88%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           EE+EEE+EE++EEEE+  ++EEE +E
Sbjct: 38  EEEEEEKEEKKEEEEKTTDKEEEVDE 63



 Score = 32.8 bits (75), Expect = 0.21
 Identities = 14/26 (53%), Positives = 23/26 (88%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           +E+EEEE+EE++EEEE+  ++EEE +
Sbjct: 37  DEEEEEEKEEKKEEEEKTTDKEEEVD 62



 Score = 32.0 bits (73), Expect = 0.39
 Identities = 13/33 (39%), Positives = 26/33 (78%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEITATL 173
           + +++EEE+  ++EEE +EEEE+EE+++ T  +
Sbjct: 45  EEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKV 77



 Score = 30.1 bits (68), Expect = 1.8
 Identities = 12/23 (52%), Positives = 18/23 (78%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEE 162
            K EE++  ++EEE +EEEE+EE
Sbjct: 47  KKEEEEKTTDKEEEVDEEEEKEE 69



 Score = 29.7 bits (67), Expect = 2.1
 Identities = 13/30 (43%), Positives = 24/30 (80%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
           EEK  ++EEE +EEEE+EE++++ +++  T
Sbjct: 51  EEKTTDKEEEVDEEEEKEEKKKKTKKVKET 80



 Score = 29.4 bits (66), Expect = 2.6
 Identities = 12/29 (41%), Positives = 19/29 (65%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
           + EE++EE E E  E  EE+ E+ + EE+
Sbjct: 500 EVEEEDEEAEVETTEPAEEDAEDSKMEEV 528



 Score = 29.0 bits (65), Expect = 3.3
 Identities = 13/27 (48%), Positives = 16/27 (59%)

Query: 146 EEEEEEEEEEEEEEEEEEEEEEEITAT 172
           +E+EE EEE+EE E E  E  EE    
Sbjct: 496 DEDEEVEEEDEEAEVETTEPAEEDAED 522



 Score = 29.0 bits (65), Expect = 3.4
 Identities = 13/26 (50%), Positives = 18/26 (69%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           E++E EEE+EE E E  E  EE+ E+
Sbjct: 497 EDEEVEEEDEEAEVETTEPAEEDAED 522



 Score = 28.6 bits (64), Expect = 4.8
 Identities = 14/26 (53%), Positives = 17/26 (65%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           +E EE EEE+EE E E  E  EE+ E
Sbjct: 496 DEDEEVEEEDEEAEVETTEPAEEDAE 521



 Score = 27.8 bits (62), Expect = 9.6
 Identities = 13/26 (50%), Positives = 17/26 (65%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           EE EEE+EE E E  E  EE+ E+ +
Sbjct: 499 EEVEEEDEEAEVETTEPAEEDAEDSK 524


>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein.  This is a family of
           fungal and plant proteins and contains many hypothetical
           proteins. VID27 is a cytoplasmic protein that plays a
           potential role in vacuolar protein degradation.
          Length = 794

 Score = 37.0 bits (86), Expect = 0.012
 Identities = 15/29 (51%), Positives = 23/29 (79%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
             E+++EEEE+EEEEEEE+E+E   +E +
Sbjct: 385 NTERDDEEEEDEEEEEEEDEDEGPSKEHS 413



 Score = 36.3 bits (84), Expect = 0.019
 Identities = 15/26 (57%), Positives = 21/26 (80%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           E+   E ++EEEE+EEEEEEE+E+E 
Sbjct: 382 EDANTERDDEEEEDEEEEEEEDEDEG 407



 Score = 36.3 bits (84), Expect = 0.019
 Identities = 14/28 (50%), Positives = 19/28 (67%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
               + EEEE+EEEEEEE+E+E   +E 
Sbjct: 385 NTERDDEEEEDEEEEEEEDEDEGPSKEH 412



 Score = 36.3 bits (84), Expect = 0.020
 Identities = 15/26 (57%), Positives = 20/26 (76%)

Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEEI 169
           E    E ++EEEE+EEEEEEE+E+E 
Sbjct: 382 EDANTERDDEEEEDEEEEEEEDEDEG 407



 Score = 35.9 bits (83), Expect = 0.027
 Identities = 15/27 (55%), Positives = 20/27 (74%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
             E E+   E ++EEEE+EEEEEEE+E
Sbjct: 378 ALEIEDANTERDDEEEEDEEEEEEEDE 404



 Score = 35.5 bits (82), Expect = 0.033
 Identities = 14/27 (51%), Positives = 20/27 (74%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
            +   E ++EEEE+EEEEEEE+E+E  
Sbjct: 382 EDANTERDDEEEEDEEEEEEEDEDEGP 408



 Score = 35.1 bits (81), Expect = 0.046
 Identities = 16/40 (40%), Positives = 24/40 (60%), Gaps = 5/40 (12%)

Query: 140 GKGEEKEEEEEEEEEEEEEEE-----EEEEEEEEITATLN 174
              EE+E+EEEEEEE+E+E       ++EE EE+   +  
Sbjct: 388 RDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVESKY 427



 Score = 34.7 bits (80), Expect = 0.058
 Identities = 14/31 (45%), Positives = 20/31 (64%)

Query: 136 RLYTGKGEEKEEEEEEEEEEEEEEEEEEEEE 166
            L       + ++EEEE+EEEEEEE+E+E  
Sbjct: 378 ALEIEDANTERDDEEEEDEEEEEEEDEDEGP 408



 Score = 33.2 bits (76), Expect = 0.20
 Identities = 13/33 (39%), Positives = 20/33 (60%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
           +E+EEEEE+E+E   +E  ++EE EE       
Sbjct: 395 DEEEEEEEDEDEGPSKEHSDDEEFEEDDVESKY 427


>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32.  This family
           consists of several mammalian specific proacrosin
           binding protein sp32 sequences. sp32 is a sperm specific
           protein which is known to bind with with 55- and 53-kDa
           proacrosins and the 49-kDa acrosin intermediate. The
           exact function of sp32 is unclear, it is thought however
           that the binding of sp32 to proacrosin may be involved
           in packaging the acrosin zymogen into the acrosomal
           matrix.
          Length = 243

 Score = 36.2 bits (83), Expect = 0.014
 Identities = 19/42 (45%), Positives = 27/42 (64%)

Query: 127 ERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           E+L  K +  L   K EEK+ +EE+EEEE EEE ++EE +  
Sbjct: 200 EQLLSKLQEYLQEHKTEEKQPQEEQEEEEVEEEAKQEEGQGT 241



 Score = 33.1 bits (75), Expect = 0.12
 Identities = 12/29 (41%), Positives = 21/29 (72%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
           T + + +EE+EEEE EEE ++EE +  ++
Sbjct: 215 TEEKQPQEEQEEEEVEEEAKQEEGQGTDD 243



 Score = 32.7 bits (74), Expect = 0.18
 Identities = 17/60 (28%), Positives = 31/60 (51%), Gaps = 1/60 (1%)

Query: 107 KVTNEVERTGRKPWPQKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEE 166
            +   V+    KP  Q+ L+ +L    +      K  ++E+EEEE EEE ++EE +  ++
Sbjct: 185 SLGGSVQVKAPKP-KQEQLLSKLQEYLQEHKTEEKQPQEEQEEEEVEEEAKQEEGQGTDD 243



 Score = 31.9 bits (72), Expect = 0.34
 Identities = 21/78 (26%), Positives = 37/78 (47%), Gaps = 17/78 (21%)

Query: 105 HTKVTNEVERTGRKPWPQK------GLIE---RLDGK-----RRPRLYTGKGEEKEEEEE 150
           H  +T   E    +PWP++       L++    L G       +P+      + +E  +E
Sbjct: 156 HPTIT---ENQSFQPWPERLHNNVEELLQSSLSLGGSVQVKAPKPKQEQLLSKLQEYLQE 212

Query: 151 EEEEEEEEEEEEEEEEEE 168
            + EE++ +EE+EEEE E
Sbjct: 213 HKTEEKQPQEEQEEEEVE 230


>gnl|CDD|165442 PHA03171, PHA03171, UL37 tegument protein; Provisional.
          Length = 499

 Score = 36.6 bits (84), Expect = 0.014
 Identities = 28/84 (33%), Positives = 40/84 (47%), Gaps = 10/84 (11%)

Query: 100 WVRFDHTKVTNEVERTGRKPWPQKGLIE-RLDGKRRPR-----LYTGKGEEKEEEEEEEE 153
           W+R +      +  R   +   Q G+I+       RPR     L+   GEE EEE+ + E
Sbjct: 37  WLRKEKACALRQQRRHRLQR--QHGVIDGENSETERPRDLTAALFAEAGEEAEEEDNDRE 94

Query: 154 --EEEEEEEEEEEEEEEITATLNP 175
             + E EEE+EEEE E     +NP
Sbjct: 95  CPDTEAEEEDEEEEIEAPDPEVNP 118


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 36.5 bits (85), Expect = 0.015
 Identities = 14/30 (46%), Positives = 23/30 (76%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
           +  +EEE+EE+E+ ++E++EEEEEEE    
Sbjct: 350 DLSDEEEDEEDEDSDDEDDEEEEEEEKEKK 379



 Score = 35.7 bits (83), Expect = 0.029
 Identities = 13/32 (40%), Positives = 23/32 (71%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
             +  +  +EEE+EE+E+ ++E++EEEEEE  
Sbjct: 345 EEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEK 376



 Score = 35.7 bits (83), Expect = 0.031
 Identities = 14/30 (46%), Positives = 24/30 (80%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
            ++EE+EE+E+ ++E++EEEEEEE+E    
Sbjct: 352 SDEEEDEEDEDSDDEDDEEEEEEEKEKKKK 381



 Score = 35.4 bits (82), Expect = 0.035
 Identities = 15/32 (46%), Positives = 25/32 (78%)

Query: 137 LYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
              G+GEE EEEEE+  ++E+EE+++++ EEE
Sbjct: 314 FGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEE 345



 Score = 34.2 bits (79), Expect = 0.090
 Identities = 14/30 (46%), Positives = 25/30 (83%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
           +E+E+EE+E+ ++E++EEEEEEE+E+    
Sbjct: 353 DEEEDEEDEDSDDEDDEEEEEEEKEKKKKK 382



 Score = 33.8 bits (78), Expect = 0.11
 Identities = 14/26 (53%), Positives = 24/26 (92%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEE 167
            EE++EE+E+ ++E++EEEEEEE+E+
Sbjct: 353 DEEEDEEDEDSDDEDDEEEEEEEKEK 378



 Score = 33.4 bits (77), Expect = 0.14
 Identities = 12/29 (41%), Positives = 22/29 (75%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
            + EE ++++ EEEEE+ +  +EEE+EE+
Sbjct: 332 DEDEEDDDDDLEEEEEDVDLSDEEEDEED 360



 Score = 33.4 bits (77), Expect = 0.14
 Identities = 12/29 (41%), Positives = 25/29 (86%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
           + EE+EE+  ++E+EE+++++ EEEEE++
Sbjct: 321 EDEEEEEDGVDDEDEEDDDDDLEEEEEDV 349



 Score = 33.4 bits (77), Expect = 0.16
 Identities = 15/52 (28%), Positives = 29/52 (55%)

Query: 121 PQKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
             K   + LD +  P      G  + EE+EEEEE+  ++E+EE++++++   
Sbjct: 294 DSKESADDLDDEFEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEE 345



 Score = 33.0 bits (76), Expect = 0.19
 Identities = 11/28 (39%), Positives = 23/28 (82%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           + EE++ +  +EEE+EE+E+ ++E++EE
Sbjct: 343 EEEEEDVDLSDEEEDEEDEDSDDEDDEE 370



 Score = 33.0 bits (76), Expect = 0.19
 Identities = 12/30 (40%), Positives = 22/30 (73%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
           G  +E EE+++++ EEEEE+ +  +EEE+ 
Sbjct: 329 GVDDEDEEDDDDDLEEEEEDVDLSDEEEDE 358



 Score = 32.7 bits (75), Expect = 0.26
 Identities = 11/26 (42%), Positives = 21/26 (80%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
            E+EEE+ +  +EEE+EE+E+ ++E+
Sbjct: 342 LEEEEEDVDLSDEEEDEEDEDSDDED 367



 Score = 32.7 bits (75), Expect = 0.29
 Identities = 13/29 (44%), Positives = 25/29 (86%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           G+ +E+EEE+  ++E+EE+++++ EEEEE
Sbjct: 319 GEEDEEEEEDGVDDEDEEDDDDDLEEEEE 347



 Score = 32.7 bits (75), Expect = 0.31
 Identities = 11/28 (39%), Positives = 21/28 (75%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
            G + E+EE+++++ EEEEE+ +  +EE
Sbjct: 328 DGVDDEDEEDDDDDLEEEEEDVDLSDEE 355



 Score = 32.3 bits (74), Expect = 0.35
 Identities = 14/30 (46%), Positives = 25/30 (83%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
           EE EE+E+ ++E++EEEEEEE+E+++  + 
Sbjct: 355 EEDEEDEDSDDEDDEEEEEEEKEKKKKKSA 384



 Score = 32.3 bits (74), Expect = 0.38
 Identities = 10/26 (38%), Positives = 20/26 (76%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           EE   ++E+EE+++++ EEEEE+ + 
Sbjct: 326 EEDGVDDEDEEDDDDDLEEEEEDVDL 351



 Score = 32.3 bits (74), Expect = 0.38
 Identities = 11/27 (40%), Positives = 21/27 (77%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
            ++ EEEEE+ +  +EEE+EE+E+ ++
Sbjct: 339 DDDLEEEEEDVDLSDEEEDEEDEDSDD 365



 Score = 32.3 bits (74), Expect = 0.39
 Identities = 11/26 (42%), Positives = 21/26 (80%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           + +EEEE+ +  +EEE+EE+E+ ++E
Sbjct: 341 DLEEEEEDVDLSDEEEDEEDEDSDDE 366



 Score = 31.5 bits (72), Expect = 0.67
 Identities = 9/31 (29%), Positives = 21/31 (67%)

Query: 138 YTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
              + E+  ++E+EE+++++ EEEEE+ +  
Sbjct: 322 DEEEEEDGVDDEDEEDDDDDLEEEEEDVDLS 352



 Score = 31.1 bits (71), Expect = 0.83
 Identities = 12/27 (44%), Positives = 21/27 (77%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
            E+ +++ EEEEE+ +  +EEE+EE+E
Sbjct: 335 EEDDDDDLEEEEEDVDLSDEEEDEEDE 361



 Score = 30.7 bits (70), Expect = 1.0
 Identities = 11/26 (42%), Positives = 21/26 (80%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           ++ ++ EEEEE+ +  +EEE+EE+E+
Sbjct: 337 DDDDDLEEEEEDVDLSDEEEDEEDED 362



 Score = 30.4 bits (69), Expect = 1.5
 Identities = 13/33 (39%), Positives = 28/33 (84%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
            + E++E+E+ ++E++EEEEEEE+E++++ +A 
Sbjct: 353 DEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAE 385



 Score = 30.4 bits (69), Expect = 1.6
 Identities = 11/32 (34%), Positives = 26/32 (81%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
           +E++E+ ++E++EEEEEEE+E+++++   +  
Sbjct: 357 DEEDEDSDDEDDEEEEEEEKEKKKKKSAESTR 388



 Score = 29.6 bits (67), Expect = 2.2
 Identities = 11/26 (42%), Positives = 20/26 (76%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           ++ + EEEEE+ +  +EEE+EE+E+ 
Sbjct: 338 DDDDLEEEEEDVDLSDEEEDEEDEDS 363



 Score = 28.4 bits (64), Expect = 5.3
 Identities = 15/39 (38%), Positives = 21/39 (53%), Gaps = 10/39 (25%)

Query: 143 EEKEEEEEE-----EEE-----EEEEEEEEEEEEEEITA 171
           EE  +EE E     E E       EEE++EEEE+ + +A
Sbjct: 261 EELAKEEAERLKKLEAERLRRMRGEEEDDEEEEDSKESA 299


>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457).  This is
           a family of uncharacterized proteins.
          Length = 449

 Score = 36.5 bits (84), Expect = 0.015
 Identities = 12/28 (42%), Positives = 24/28 (85%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
            G+E EEE  EEE+++EE+++++++E+E
Sbjct: 41  LGKEAEEEAMEEEDDDEEDDDDDDDEDE 68



 Score = 28.8 bits (64), Expect = 3.6
 Identities = 6/26 (23%), Positives = 25/26 (96%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
            E+E+++EE+++++++E+E+++++++
Sbjct: 50  MEEEDDDEEDDDDDDDEDEDDDDDDD 75



 Score = 27.7 bits (61), Expect = 9.0
 Identities = 6/29 (20%), Positives = 26/29 (89%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
            ++ ++E+E++++++++E++E+E++++ T
Sbjct: 60  DDDDDDEDEDDDDDDDDEDDEDEDDDDST 88


>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein. 
          Length = 198

 Score = 35.5 bits (82), Expect = 0.015
 Identities = 12/36 (33%), Positives = 23/36 (63%)

Query: 133 RRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
            R    T K ++K++E+  EEE++ E ++++ EE E
Sbjct: 59  PRKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENE 94



 Score = 33.2 bits (76), Expect = 0.11
 Identities = 10/35 (28%), Positives = 21/35 (60%)

Query: 132 KRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEE 166
           ++       K ++KE+  EEE++ E ++++ EE E
Sbjct: 60  RKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENE 94



 Score = 32.8 bits (75), Expect = 0.13
 Identities = 15/62 (24%), Positives = 29/62 (46%), Gaps = 6/62 (9%)

Query: 115 TGRKPWPQKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
           T RKP   K    +   K + +L   + E+K E ++++ EE E + +  EE  +   + +
Sbjct: 58  TPRKPATTK----KSKKKDKEKL--TEEEKKPESDDDKTEENENDPDNNEESGDSQESAS 111

Query: 175 PC 176
             
Sbjct: 112 AN 113


>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
          Length = 619

 Score = 36.7 bits (86), Expect = 0.016
 Identities = 13/32 (40%), Positives = 19/32 (59%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEITATL 173
            EE       E EE +++E+EEEEE+E   +L
Sbjct: 178 AEEDPAHVGSELEELDDDEDEEEEEDENDDSL 209



 Score = 34.8 bits (81), Expect = 0.061
 Identities = 16/52 (30%), Positives = 27/52 (51%), Gaps = 5/52 (9%)

Query: 125 LIERL-DGKRRPR----LYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
             +RL +G+RR R     +     E++      E EE +++E+EEEEE+   
Sbjct: 155 WYDRLENGERRLRELIDGFVDPNAEEDPAHVGSELEELDDDEDEEEEEDEND 206



 Score = 31.3 bits (72), Expect = 0.63
 Identities = 11/33 (33%), Positives = 20/33 (60%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
                E EE +++E+EEEEE+E ++ + A  + 
Sbjct: 183 AHVGSELEELDDDEDEEEEEDENDDSLAADESE 215



 Score = 29.4 bits (67), Expect = 2.9
 Identities = 12/40 (30%), Positives = 24/40 (60%)

Query: 130 DGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
           + +  P     + EE +++E+EEEEE+E ++    +E E+
Sbjct: 177 NAEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADESEL 216



 Score = 29.4 bits (67), Expect = 2.9
 Identities = 10/35 (28%), Positives = 19/35 (54%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
               + EE +++E+EEEEE+E ++      + L  
Sbjct: 184 HVGSELEELDDDEDEEEEEDENDDSLAADESELPE 218



 Score = 27.5 bits (62), Expect = 9.8
 Identities = 7/26 (26%), Positives = 14/26 (53%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           EE  + ++    E E + + +E+ EE
Sbjct: 61  EEAPDADDLLLAENEADAQTDEDAEE 86


>gnl|CDD|129416 TIGR00316, cdhC, CO dehydrogenase/CO-methylating acetyl-CoA
           synthase complex, beta subunit.  Nomenclature follows
           the description for Methanosarcina thermophila. The
           CO-methylating acetyl-CoA synthase is considered the
           defining enzyme of the Wood-Ljungdahl pathway, used for
           acetate catabolism by sulfate reducing bacteria but for
           acetate biosynthesis by acetogenic bacteria such as
           oorella thermoacetica (f. Clostridium thermoaceticum)
           [Energy metabolism, Chemoautotrophy].
          Length = 458

 Score = 36.4 bits (84), Expect = 0.017
 Identities = 20/53 (37%), Positives = 25/53 (47%), Gaps = 1/53 (1%)

Query: 123 KGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
             L + L  K  P +      E +EEE EEEEE  + EE E E  E+ A   P
Sbjct: 382 DELRKFLKEKGHPVV-KRVVREVDEEEIEEEEEAMQPEEMEMEGFEVPALQMP 433



 Score = 29.1 bits (65), Expect = 3.4
 Identities = 14/48 (29%), Positives = 21/48 (43%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPCGLCRILPVMVHIVV 190
           +E+E EEEEE  + EE E E  E   +           I  V+ +  +
Sbjct: 404 DEEEIEEEEEAMQPEEMEMEGFEVPALQMPAASAAPAGIKIVLKNAKI 451


>gnl|CDD|218258 pfam04774, HABP4_PAI-RBP1, Hyaluronan / mRNA binding family.  This
           family includes the HABP4 family of hyaluronan-binding
           proteins, and the PAI-1 mRNA-binding protein, PAI-RBP1.
           HABP4 has been observed to bind hyaluronan (a
           glucosaminoglycan), but it is not known whether this is
           its primary role in vivo. It has also been observed to
           bind RNA, but with a lower affinity than that for
           hyaluronan. PAI-1 mRNA-binding protein specifically
           binds the mRNA of type-1 plasminogen activator inhibitor
           (PAI-1), and is thought to be involved in regulation of
           mRNA stability. However, in both cases, the sequence
           motifs predicted to be important for ligand binding are
           not conserved throughout the family, so it is not known
           whether members of this family share a common function.
          Length = 106

 Score = 34.3 bits (79), Expect = 0.017
 Identities = 26/75 (34%), Positives = 34/75 (45%), Gaps = 3/75 (4%)

Query: 102 RFDHTKVTNEVER--TGRKPW-PQKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEE 158
           R   +    E +R   GR  W   K  I  L  ++       + EEK+  EEE  +E   
Sbjct: 7   RHSGSGRGGEDKREGAGRGNWGSVKDEIAELTEEQGEEEEKNEVEEKQAVEEEANKEGVV 66

Query: 159 EEEEEEEEEEITATL 173
           EEEE EEEE+   TL
Sbjct: 67  EEEEVEEEEDKEMTL 81


>gnl|CDD|237799 PRK14715, PRK14715, DNA polymerase II large subunit; Provisional.
          Length = 1627

 Score = 36.4 bits (84), Expect = 0.018
 Identities = 16/28 (57%), Positives = 21/28 (75%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           K EEK+EE+ EE + EE +EE EEEE+ 
Sbjct: 282 KKEEKDEEKSEEVKTEEVDEEFEEEEKG 309



 Score = 34.4 bits (79), Expect = 0.075
 Identities = 15/27 (55%), Positives = 21/27 (77%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
           E+KEE++EE+ EE + EE +EE EEE 
Sbjct: 281 EKKEEKDEEKSEEVKTEEVDEEFEEEE 307



 Score = 33.7 bits (77), Expect = 0.14
 Identities = 16/34 (47%), Positives = 21/34 (61%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
           K EEK EE + EE +EE EEEE+    E+   +N
Sbjct: 286 KDEEKSEEVKTEEVDEEFEEEEKGFYYELYEKVN 319



 Score = 33.7 bits (77), Expect = 0.16
 Identities = 14/28 (50%), Positives = 22/28 (78%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           K +++E++EE+ EE + EE +EE EEEE
Sbjct: 280 KEKKEEKDEEKSEEVKTEEVDEEFEEEE 307



 Score = 30.6 bits (69), Expect = 1.2
 Identities = 13/26 (50%), Positives = 20/26 (76%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           E KE++EE++EE+ EE + EE +EE 
Sbjct: 278 ELKEKKEEKDEEKSEEVKTEEVDEEF 303



 Score = 28.7 bits (64), Expect = 5.0
 Identities = 11/24 (45%), Positives = 18/24 (75%)

Query: 147 EEEEEEEEEEEEEEEEEEEEEEIT 170
           E++EE++EE+ EE + EE +EE  
Sbjct: 281 EKKEEKDEEKSEEVKTEEVDEEFE 304


>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain.  This
           family represents the C-terminus (approximately 300
           residues) of proteins that are involved as binding
           partners for Prp19 as part of the nuclear pore complex.
           The family in Drosophila is necessary for pre-mRNA
           splicing, and the human protein has been found in
           purifications of the spliceosome. In the past this
           family was thought, erroneously, to be associated with
           microfibrillin.
          Length = 277

 Score = 35.7 bits (82), Expect = 0.019
 Identities = 17/33 (51%), Positives = 23/33 (69%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
            E+E+E  EEEEEE EEEEE + E+++   L P
Sbjct: 7   LEEEDESGEEEEEESEEEEETDSEDDMEPRLKP 39



 Score = 31.0 bits (70), Expect = 0.59
 Identities = 18/27 (66%), Positives = 19/27 (70%)

Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEEIT 170
           E E  E EEE+E  EEEEEE EEEE T
Sbjct: 1   ETEVLELEEEDESGEEEEEESEEEEET 27



 Score = 29.5 bits (66), Expect = 2.3
 Identities = 12/29 (41%), Positives = 17/29 (58%)

Query: 135 PRLYTGKGEEKEEEEEEEEEEEEEEEEEE 163
                 +  E+EEEE EEEEE + E++ E
Sbjct: 6   ELEEEDESGEEEEEESEEEEETDSEDDME 34



 Score = 28.7 bits (64), Expect = 3.5
 Identities = 14/25 (56%), Positives = 18/25 (72%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEE 165
           + E  EEEEEE EEEEE + E++ E
Sbjct: 10  EDESGEEEEEESEEEEETDSEDDME 34


>gnl|CDD|220284 pfam09538, FYDLN_acid, Protein of unknown function (FYDLN_acid).
           Members of this family are bacterial proteins with a
           conserved motif [KR]FYDLN, sometimes flanked by a pair
           of CXXC motifs, followed by a long region of low
           complexity sequence in which roughly half the residues
           are Asp and Glu, including multiple runs of five or more
           acidic residues. The function of members of this family
           is unknown.
          Length = 104

 Score = 33.8 bits (78), Expect = 0.021
 Identities = 7/38 (18%), Positives = 25/38 (65%)

Query: 132 KRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
            R P        +K+E+EE+E++   ++++++++++++
Sbjct: 43  SRAPAADAEDAAKKDEDEEDEDDVVLDDDDDDDDDDDL 80



 Score = 30.7 bits (70), Expect = 0.28
 Identities = 6/35 (17%), Positives = 23/35 (65%)

Query: 134 RPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           + R      E+  +++E+EE+E++   ++++++++
Sbjct: 42  KSRAPAADAEDAAKKDEDEEDEDDVVLDDDDDDDD 76



 Score = 29.6 bits (67), Expect = 0.76
 Identities = 5/27 (18%), Positives = 19/27 (70%)

Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEEIT 170
           + E+  +++E+EE+E++   +++++  
Sbjct: 49  DAEDAAKKDEDEEDEDDVVLDDDDDDD 75


>gnl|CDD|184735 PRK14547, rplD, 50S ribosomal protein L4; Provisional.
          Length = 298

 Score = 35.7 bits (82), Expect = 0.022
 Identities = 16/38 (42%), Positives = 18/38 (47%)

Query: 105 HTKVTNEVERTGRKPWPQKGLIERLDGKRRPRLYTGKG 142
           HTK   EV   G+KPW QKG      G  R   + G G
Sbjct: 50  HTKTRAEVRGGGKKPWRQKGTGRARQGSIRAPQWRGGG 87


>gnl|CDD|234456 TIGR04074, bacter_Hen1, 3' terminal RNA ribose
           2'-O-methyltransferase Hen1.  Members of this protein
           family are bacterial Hen1, a 3' terminal RNA ribose
           2'-O-methyltransferase that acts in bacterial RNA
           repair. All members of the seed alignment belong to a
           cassette with the RNA repair enzyme polynucleotide
           kinase-phosphatase (Pnkp). Chemically similar Hen1 in
           eukaryotes acts instead on small regulatory RNAs
           [Transcription, RNA processing, Protein synthesis, tRNA
           and rRNA base modification].
          Length = 462

 Score = 35.7 bits (83), Expect = 0.025
 Identities = 34/100 (34%), Positives = 43/100 (43%), Gaps = 21/100 (21%)

Query: 81  LVPRLDVITMNIEWQKKYKWVRFDHTKVTNEVERTGRK--PW----PQKGLIERLDGKRR 134
           L+P LD          K+ WV  D      EVE+  RK   W    P++ LI R   K R
Sbjct: 183 LIPVLD--------DAKHYWVGDD------EVEKLLRKGEGWLAEHPERELITRRYLKHR 228

Query: 135 PRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
            R        +  E +E E EE E EE +EE  E   +LN
Sbjct: 229 -RSLARDALARLAEADEAEPEEAETEEAQEEAAEKPPSLN 267


>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2.  PPP4R2 (protein phosphatase 4 core
           regulatory subunit R2) is the regulatory subunit of the
           histone H2A phosphatase complex. It has been shown to
           confer resistance to the anticancer drug cisplatin in
           yeast, and may confer resistance in higher eukaryotes.
          Length = 285

 Score = 35.6 bits (82), Expect = 0.026
 Identities = 18/30 (60%), Positives = 23/30 (76%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
              G+  EE+E +E+EEEEE EEEEEEE+E
Sbjct: 254 DQDGDYVEEKELKEDEEEEETEEEEEEEDE 283



 Score = 34.4 bits (79), Expect = 0.058
 Identities = 19/25 (76%), Positives = 23/25 (92%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEE 167
           EEKE +E+EEEEE EEEEEEE+E+E
Sbjct: 261 EEKELKEDEEEEETEEEEEEEDEDE 285



 Score = 33.6 bits (77), Expect = 0.094
 Identities = 15/29 (51%), Positives = 21/29 (72%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
              ++ +  EE+E +E+EEEEE EEEEEE
Sbjct: 252 DDDQDGDYVEEKELKEDEEEEETEEEEEE 280



 Score = 33.3 bits (76), Expect = 0.11
 Identities = 16/31 (51%), Positives = 20/31 (64%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
            +    EE+E +E+EEEEE EEEEEEE    
Sbjct: 255 QDGDYVEEKELKEDEEEEETEEEEEEEDEDE 285



 Score = 31.7 bits (72), Expect = 0.39
 Identities = 14/30 (46%), Positives = 20/30 (66%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
           ++  +  EE+E +E+EEEEE EEEEE    
Sbjct: 254 DQDGDYVEEKELKEDEEEEETEEEEEEEDE 283



 Score = 30.6 bits (69), Expect = 1.0
 Identities = 12/27 (44%), Positives = 20/27 (74%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
           E+ +++ +  EE+E +E+EEEEE EE 
Sbjct: 251 EDDDQDGDYVEEKELKEDEEEEETEEE 277



 Score = 29.4 bits (66), Expect = 2.3
 Identities = 11/28 (39%), Positives = 21/28 (75%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           K  + EE+++++ +  EE+E +E+EEEE
Sbjct: 245 KKSDDEEDDDQDGDYVEEKELKEDEEEE 272



 Score = 28.6 bits (64), Expect = 4.1
 Identities = 12/26 (46%), Positives = 20/26 (76%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           EE ++++ +  EE+E +E+EEEEE E
Sbjct: 250 EEDDDQDGDYVEEKELKEDEEEEETE 275



 Score = 28.3 bits (63), Expect = 4.7
 Identities = 15/23 (65%), Positives = 19/23 (82%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEE 162
            + +E EEEEE EEEEEEE+E+E
Sbjct: 263 KELKEDEEEEETEEEEEEEDEDE 285


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 35.7 bits (83), Expect = 0.029
 Identities = 17/33 (51%), Positives = 24/33 (72%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
             K     +++EEEEEEE+E++EEE+EEEE  A
Sbjct: 430 KKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEA 462



 Score = 35.3 bits (82), Expect = 0.032
 Identities = 20/30 (66%), Positives = 26/30 (86%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
            G++KEEEEEEE+E++EEE+EEEEEE E  
Sbjct: 436 AGKKKEEEEEEEKEKKEEEKEEEEEEAEEE 465



 Score = 34.9 bits (81), Expect = 0.049
 Identities = 15/40 (37%), Positives = 25/40 (62%)

Query: 131 GKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
            K   +    K E+K++    +++EEEEEEE+E++EEE  
Sbjct: 417 EKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKE 456



 Score = 34.9 bits (81), Expect = 0.054
 Identities = 20/28 (71%), Positives = 26/28 (92%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           K EE+EEE+E++EEE+EEEEEE EEE+E
Sbjct: 440 KEEEEEEEKEKKEEEKEEEEEEAEEEKE 467



 Score = 34.5 bits (80), Expect = 0.060
 Identities = 21/30 (70%), Positives = 26/30 (86%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
             + EEKE++EEE+EEEEEE EEE+EEEEE
Sbjct: 442 EEEEEEKEKKEEEKEEEEEEAEEEKEEEEE 471



 Score = 34.1 bits (79), Expect = 0.075
 Identities = 17/45 (37%), Positives = 27/45 (60%)

Query: 126 IERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
           I     K+R      K ++    +++EEEEEEE+E++EEE+EE  
Sbjct: 415 IVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEE 459



 Score = 34.1 bits (79), Expect = 0.079
 Identities = 18/37 (48%), Positives = 28/37 (75%)

Query: 132 KRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
             + +    + E++++EEE+EEEEEE EEE+EEEEE+
Sbjct: 436 AGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEK 472



 Score = 34.1 bits (79), Expect = 0.081
 Identities = 18/44 (40%), Positives = 31/44 (70%)

Query: 127 ERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
           ++ + +++ +       +K+EEEEEEE+E++EEE+EEEEEE   
Sbjct: 421 KKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEE 464



 Score = 34.1 bits (79), Expect = 0.091
 Identities = 22/33 (66%), Positives = 29/33 (87%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEITATL 173
           + E+KEEE+EEEEEE EEE+EEEEE+++  ATL
Sbjct: 447 EKEKKEEEKEEEEEEAEEEKEEEEEKKKKQATL 479



 Score = 33.0 bits (76), Expect = 0.22
 Identities = 19/60 (31%), Positives = 34/60 (56%), Gaps = 1/60 (1%)

Query: 110 NEVER-TGRKPWPQKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
            E+E  TG K   +K        +++      + ++K    +++EEEEEEE+E++EEE+E
Sbjct: 397 EEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKE 456



 Score = 32.6 bits (75), Expect = 0.23
 Identities = 19/30 (63%), Positives = 26/30 (86%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
             + +EK+EEE+EEEEEE EEE+EEEEE++
Sbjct: 444 EEEEKEKKEEEKEEEEEEAEEEKEEEEEKK 473



 Score = 32.2 bits (74), Expect = 0.34
 Identities = 12/32 (37%), Positives = 23/32 (71%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
           + EEK+E++++    +++EEEEEEE+E+    
Sbjct: 423 REEEKKEKKKKAFAGKKKEEEEEEEKEKKEEE 454



 Score = 31.8 bits (73), Expect = 0.44
 Identities = 10/30 (33%), Positives = 20/30 (66%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
            EK+ EEE++E++++    +++EEEE    
Sbjct: 419 AEKKREEEKKEKKKKAFAGKKKEEEEEEEK 448


>gnl|CDD|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region.  The domain
           is found in the primary vegetative sigma factor. The
           function of this domain is unclear and can be removed
           without loss of function.
          Length = 211

 Score = 34.9 bits (81), Expect = 0.030
 Identities = 10/27 (37%), Positives = 22/27 (81%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
             E +EE+ E++++++E+E+E++EEE 
Sbjct: 45  ESELDEEDLEDDDDDDEDEDEDDEEEA 71



 Score = 34.9 bits (81), Expect = 0.033
 Identities = 10/26 (38%), Positives = 22/26 (84%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
            E E +EE+ E++++++E+E+E++EE
Sbjct: 44  IESELDEEDLEDDDDDDEDEDEDDEE 69



 Score = 34.5 bits (80), Expect = 0.039
 Identities = 8/27 (29%), Positives = 22/27 (81%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
           +E++ E++++++E+E+E++EEE +   
Sbjct: 49  DEEDLEDDDDDDEDEDEDDEEEADLGP 75



 Score = 34.5 bits (80), Expect = 0.045
 Identities = 10/26 (38%), Positives = 22/26 (84%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           E  EE+ E++++++E+E+E++EEE +
Sbjct: 47  ELDEEDLEDDDDDDEDEDEDDEEEAD 72



 Score = 34.1 bits (79), Expect = 0.052
 Identities = 8/29 (27%), Positives = 20/29 (68%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
              E E +EE+ E++++++E+E+E++   
Sbjct: 42  AAIESELDEEDLEDDDDDDEDEDEDDEEE 70



 Score = 32.5 bits (75), Expect = 0.19
 Identities = 8/30 (26%), Positives = 18/30 (60%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
                E E +EE+ E++++++E+E+E    
Sbjct: 40  TAAAIESELDEEDLEDDDDDDEDEDEDDEE 69



 Score = 31.8 bits (73), Expect = 0.30
 Identities = 7/26 (26%), Positives = 17/26 (65%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
                 E E +EE+ E++++++E+E+
Sbjct: 39  ATAAAIESELDEEDLEDDDDDDEDED 64



 Score = 30.6 bits (70), Expect = 0.75
 Identities = 6/26 (23%), Positives = 20/26 (76%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEE 166
           + + +++++++E+E+E++EEE +   
Sbjct: 50  EEDLEDDDDDDEDEDEDDEEEADLGP 75



 Score = 30.6 bits (70), Expect = 0.79
 Identities = 7/26 (26%), Positives = 16/26 (61%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
                  E E +EE+ E++++++E+E
Sbjct: 38  AATAAAIESELDEEDLEDDDDDDEDE 63



 Score = 29.8 bits (68), Expect = 1.4
 Identities = 6/26 (23%), Positives = 15/26 (57%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
                   E E +EE+ E++++++E+
Sbjct: 37  AAATAAAIESELDEEDLEDDDDDDED 62



 Score = 29.8 bits (68), Expect = 1.5
 Identities = 8/29 (27%), Positives = 19/29 (65%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
            E +++++E+E+E++EEE +   + E   
Sbjct: 53  LEDDDDDDEDEDEDDEEEADLGPDPEEAR 81



 Score = 28.3 bits (64), Expect = 3.6
 Identities = 6/26 (23%), Positives = 14/26 (53%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
                    E E +EE+ E++++++E
Sbjct: 36  AAAATAAAIESELDEEDLEDDDDDDE 61



 Score = 28.3 bits (64), Expect = 3.7
 Identities = 5/26 (19%), Positives = 13/26 (50%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
                     E E +EE+ E++++++
Sbjct: 35  AAAAATAAAIESELDEEDLEDDDDDD 60



 Score = 27.9 bits (63), Expect = 6.2
 Identities = 5/26 (19%), Positives = 12/26 (46%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
                      E E +EE+ E+++++
Sbjct: 34  AAAAAATAAAIESELDEEDLEDDDDD 59


>gnl|CDD|221620 pfam12527, DUF3727, Protein of unknown function (DUF3727).  This
           domain family is found in bacteria and eukaryotes, and
           is approximately 100 amino acids in length.
          Length = 100

 Score = 33.3 bits (77), Expect = 0.035
 Identities = 20/43 (46%), Positives = 29/43 (67%), Gaps = 2/43 (4%)

Query: 132 KRRPRLYTGKGE--EKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
           KR     T +GE  E EE+E  E EEE++++E+EEEE E+ A+
Sbjct: 6   KRSAFTLTVRGELPEPEEDEILELEEEDDDDEDEEEEYELLAS 48


>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein.  The YqfQ-like protein family
           includes the B. subtilis YqfQ protein, also known as
           VrrA, which is functionally uncharacterized. This family
           of proteins is found in bacteria. Proteins in this
           family are typically between 146 and 237 amino acids in
           length. There are two conserved sequence motifs: QYGP
           and PKLY.
          Length = 155

 Score = 34.0 bits (78), Expect = 0.037
 Identities = 11/33 (33%), Positives = 19/33 (57%)

Query: 136 RLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           +++       +EEEE EEE  +E E+E+  E +
Sbjct: 86  KIFRELSSSDDEEEETEEESTDETEQEDPPETK 118



 Score = 33.2 bits (76), Expect = 0.082
 Identities = 12/32 (37%), Positives = 19/32 (59%)

Query: 137 LYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           L +   EE+E EEE  +E E+E+  E + E +
Sbjct: 91  LSSSDDEEEETEEESTDETEQEDPPETKTESK 122



 Score = 31.6 bits (72), Expect = 0.21
 Identities = 12/27 (44%), Positives = 18/27 (66%)

Query: 146 EEEEEEEEEEEEEEEEEEEEEEEITAT 172
           ++EEEE EEE  +E E+E+  E  T +
Sbjct: 95  DDEEEETEEESTDETEQEDPPETKTES 121



 Score = 30.9 bits (70), Expect = 0.41
 Identities = 11/26 (42%), Positives = 17/26 (65%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           EE EEE  +E E+E+  E + E +E+
Sbjct: 99  EETEEESTDETEQEDPPETKTESKEK 124



 Score = 29.3 bits (66), Expect = 1.7
 Identities = 12/28 (42%), Positives = 17/28 (60%)

Query: 145 KEEEEEEEEEEEEEEEEEEEEEEEITAT 172
              ++EEEE EEE  +E E+E+   T T
Sbjct: 92  SSSDDEEEETEEESTDETEQEDPPETKT 119


>gnl|CDD|145361 pfam02161, Prog_receptor, Progesterone receptor. 
          Length = 456

 Score = 35.0 bits (80), Expect = 0.042
 Identities = 12/40 (30%), Positives = 19/40 (47%)

Query: 129 LDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           LDG   PR   G+ E  E+ ++++   + E  E E E   
Sbjct: 44  LDGLLYPRSCQGQEEPDEKTQDQQSLSDVERAEPEVEASR 83


>gnl|CDD|214375 CHL00147, rpl4, ribosomal protein L4; Validated.
          Length = 215

 Score = 34.6 bits (80), Expect = 0.043
 Identities = 17/38 (44%), Positives = 19/38 (50%)

Query: 105 HTKVTNEVERTGRKPWPQKGLIERLDGKRRPRLYTGKG 142
            TK  +EV   GRKPW QKG      G  R  L+ G G
Sbjct: 55  STKTRSEVRGGGRKPWKQKGTGRARAGSNRSPLWKGGG 92


>gnl|CDD|217783 pfam03896, TRAP_alpha, Translocon-associated protein (TRAP), alpha
           subunit.  The alpha-subunit of the TRAP complex (TRAP
           alpha) is a single-spanning membrane protein of the
           endoplasmic reticulum (ER) which is found in proximity
           of nascent polypeptide chains translocating across the
           membrane.
          Length = 281

 Score = 34.4 bits (79), Expect = 0.049
 Identities = 15/32 (46%), Positives = 23/32 (71%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
           E++E+E   EE+E E  EEEE+EE E+ A+ +
Sbjct: 47  EDEEDEAVVEEDENELTEEEEDEEGEVKASPD 78



 Score = 32.4 bits (74), Expect = 0.24
 Identities = 14/33 (42%), Positives = 21/33 (63%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
            +++EE+E   EE+E E  EEEE+EE     +P
Sbjct: 45  VDEDEEDEAVVEEDENELTEEEEDEEGEVKASP 77



 Score = 29.4 bits (66), Expect = 2.2
 Identities = 13/26 (50%), Positives = 18/26 (69%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEE 167
            +E++E   EE+E E  EEEE+EE E
Sbjct: 47  EDEEDEAVVEEDENELTEEEEDEEGE 72



 Score = 29.0 bits (65), Expect = 3.0
 Identities = 11/27 (40%), Positives = 18/27 (66%)

Query: 146 EEEEEEEEEEEEEEEEEEEEEEEITAT 172
           +EE E++  +E+EE+E   EE+E   T
Sbjct: 37  DEEAEDDVVDEDEEDEAVVEEDENELT 63


>gnl|CDD|183731 PRK12766, PRK12766, 50S ribosomal protein L32e; Provisional.
          Length = 232

 Score = 34.4 bits (79), Expect = 0.050
 Identities = 18/42 (42%), Positives = 23/42 (54%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPCGLCRILP 183
           G  +  EE E E E+E  EEEEEE+ ++   L P GL    P
Sbjct: 57  GGLEVSEETEAEVEDEGGEEEEEEDADVETELRPRGLTEKTP 98


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 34.8 bits (80), Expect = 0.053
 Identities = 9/39 (23%), Positives = 20/39 (51%)

Query: 130 DGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
             KR       K +++ EE ++++  E+E  ++ E+E  
Sbjct: 73  SAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERL 111



 Score = 33.6 bits (77), Expect = 0.11
 Identities = 9/31 (29%), Positives = 21/31 (67%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
           K E+++ EE ++++  E+E  ++ E+E + A
Sbjct: 83  KKEQQQAEELQQKQAAEQERLKQLEKERLAA 113



 Score = 32.1 bits (73), Expect = 0.32
 Identities = 8/38 (21%), Positives = 21/38 (55%)

Query: 132 KRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
           ++R +    + EE ++++  E+E  ++ E+E    +E 
Sbjct: 79  EQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQ 116



 Score = 30.2 bits (68), Expect = 1.6
 Identities = 6/26 (23%), Positives = 20/26 (76%)

Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEEI 169
           E++ +++E+++ EE ++++  E+E +
Sbjct: 78  EEQRKKKEQQQAEELQQKQAAEQERL 103



 Score = 29.0 bits (65), Expect = 3.3
 Identities = 6/26 (23%), Positives = 17/26 (65%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           ++K+  E+E  ++ E+E    +E+++
Sbjct: 93  QQKQAAEQERLKQLEKERLAAQEQKK 118



 Score = 28.6 bits (64), Expect = 4.4
 Identities = 7/28 (25%), Positives = 16/28 (57%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           K  EKE    +E++++ EE  ++   ++
Sbjct: 104 KQLEKERLAAQEQKKQAEEAAKQAALKQ 131


>gnl|CDD|100109 cd05831, Ribosomal_P1, Ribosomal protein P1. This subfamily
           represents the eukaryotic large ribosomal protein P1.
           Eukaryotic P1 and P2 are functionally equivalent to the
           bacterial protein L7/L12, but are not homologous to
           L7/L12. P1 is located in the L12 stalk, with proteins
           P2, P0, L11, and 28S rRNA. P1 and P2 are the only
           proteins in the ribosome to occur as multimers, always
           appearing as sets of heterodimers. Recent data indicate
           that eukaryotes have four copies (two heterodimers),
           while most archaeal species contain six copies of L12p
           (three homodimers) and bacteria may have four or six
           copies (two or three homodimers), depending on the
           species. Experiments using S. cerevisiae P1 and P2
           indicate that P1 proteins are positioned more internally
           with limited reactivity in the C-terminal domains, while
           P2 proteins seem to be more externally located and are
           more likely to interact with other cellular components.
           In lower eukaryotes, P1 and P2 are further subdivided
           into P1A, P1B, P2A, and P2B, which form P1A/P2B and
           P1B/P2A heterodimers. Some plant species have a third
           P-protein, called P3, which is not homologous to P1 and
           P2. In humans, P1 and P2 are strongly autoimmunogenic.
           They play a significant role in the etiology and
           pathogenesis of systemic lupus erythema (SLE). In
           addition, the ribosome-inactivating protein
           trichosanthin (TCS) interacts with human P0, P1, and P2,
           with its primary binding site located in the C-terminal
           region of P2. TCS inactivates the ribosome by
           depurinating a specific adenine in the sarcin-ricin loop
           of 28S rRNA.
          Length = 103

 Score = 32.7 bits (75), Expect = 0.057
 Identities = 9/27 (33%), Positives = 16/27 (59%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEE 165
                    E ++EE++EEEEEE +++
Sbjct: 71  AAAAAAAAAEAKKEEKKEEEEEESDDD 97



 Score = 31.9 bits (73), Expect = 0.096
 Identities = 9/25 (36%), Positives = 17/25 (68%)

Query: 145 KEEEEEEEEEEEEEEEEEEEEEEEI 169
                 E ++EE++EEEEEE ++++
Sbjct: 74  AAAAAAEAKKEEKKEEEEEESDDDM 98


>gnl|CDD|224759 COG1846, MarR, Transcriptional regulators [Transcription].
          Length = 126

 Score = 33.0 bits (75), Expect = 0.059
 Identities = 19/57 (33%), Positives = 25/57 (43%), Gaps = 4/57 (7%)

Query: 122 QKGLIERL---DGKRRPRLY-TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
            KGLIERL   + +R   +  T KG E  E+     +E   E      EEE+   L 
Sbjct: 62  DKGLIERLRDPEDRRAVLVRLTEKGRELLEQLLPAAQELLAEILAGLSEEELRKLLR 118


>gnl|CDD|218538 pfam05285, SDA1, SDA1.  This family consists of several SDA1
           protein homologues. SDA1 is a Saccharomyces cerevisiae
           protein which is involved in the control of the actin
           cytoskeleton. The protein is essential for cell
           viability and is localised in the nucleus.
          Length = 317

 Score = 34.2 bits (79), Expect = 0.064
 Identities = 15/44 (34%), Positives = 29/44 (65%)

Query: 125 LIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           L+E+   + R +    +G E +++++EEEE E EE+E+ ++E E
Sbjct: 74  LLEKWKEEERKKKEAEQGLESDDDDDEEEEWEVEEDEDSDDEGE 117



 Score = 34.2 bits (79), Expect = 0.070
 Identities = 16/34 (47%), Positives = 21/34 (61%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATL 173
              E+ +EE  EE+EEE  EEEE E E+E  + L
Sbjct: 144 KAKEDSDEELSEEDEEEAAEEEEAEAEKEKASEL 177



 Score = 33.5 bits (77), Expect = 0.11
 Identities = 11/30 (36%), Positives = 20/30 (66%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
            K  E+  E +++++EEEE E EE+E+ + 
Sbjct: 85  KKEAEQGLESDDDDDEEEEWEVEEDEDSDD 114



 Score = 32.3 bits (74), Expect = 0.25
 Identities = 20/56 (35%), Positives = 26/56 (46%), Gaps = 7/56 (12%)

Query: 127 ERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPCGLCRIL 182
           E    +   +      EE  EE+EEE  EEEE E E+E+  E+  T       RIL
Sbjct: 135 EEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEKASELATT-------RIL 183



 Score = 31.2 bits (71), Expect = 0.58
 Identities = 12/43 (27%), Positives = 26/43 (60%)

Query: 126 IERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           +E L+  +       + E+  E +++++EEEE E EE+E+ ++
Sbjct: 72  LELLEKWKEEERKKKEAEQGLESDDDDDEEEEWEVEEDEDSDD 114



 Score = 30.4 bits (69), Expect = 1.1
 Identities = 11/42 (26%), Positives = 24/42 (57%)

Query: 127 ERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
            + + +++     G   + +++EEEE E EE+E+ ++E E  
Sbjct: 78  WKEEERKKKEAEQGLESDDDDDEEEEWEVEEDEDSDDEGEWI 119



 Score = 30.0 bits (68), Expect = 1.4
 Identities = 10/28 (35%), Positives = 17/28 (60%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
           GE  + E ++E E  + E+EEE++E   
Sbjct: 116 GEWIDVESDKEIESSDSEDEEEKDEAAK 143



 Score = 29.2 bits (66), Expect = 2.3
 Identities = 10/34 (29%), Positives = 20/34 (58%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPC 176
            +KE E  + E+EEE++E  ++ +E+    L+  
Sbjct: 123 SDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEE 156



 Score = 29.2 bits (66), Expect = 2.7
 Identities = 11/28 (39%), Positives = 19/28 (67%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
            EE+ E EE+E+ ++E E  + E ++EI
Sbjct: 100 EEEEWEVEEDEDSDDEGEWIDVESDKEI 127



 Score = 28.1 bits (63), Expect = 6.7
 Identities = 9/26 (34%), Positives = 17/26 (65%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           E  +E E  + E+EEE++E  ++ +E
Sbjct: 122 ESDKEIESSDSEDEEEKDEAAKKAKE 147



 Score = 28.1 bits (63), Expect = 6.9
 Identities = 8/29 (27%), Positives = 15/29 (51%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
           +  +E E  + E ++E E  + E+EE   
Sbjct: 111 DSDDEGEWIDVESDKEIESSDSEDEEEKD 139



 Score = 28.1 bits (63), Expect = 7.0
 Identities = 9/31 (29%), Positives = 19/31 (61%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
            + E + EE+E+ ++E E  + E ++E E +
Sbjct: 100 EEEEWEVEEDEDSDDEGEWIDVESDKEIESS 130



 Score = 27.7 bits (62), Expect = 7.7
 Identities = 10/32 (31%), Positives = 21/32 (65%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
            K  E  + E+EEE++E  ++ +E+ +EE++ 
Sbjct: 124 DKEIESSDSEDEEEKDEAAKKAKEDSDEELSE 155



 Score = 27.7 bits (62), Expect = 8.1
 Identities = 10/26 (38%), Positives = 18/26 (69%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
            E EEE++E  ++ +E+ +EE  EE+
Sbjct: 132 SEDEEEKDEAAKKAKEDSDEELSEED 157



 Score = 27.7 bits (62), Expect = 8.7
 Identities = 11/27 (40%), Positives = 18/27 (66%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
            ++ EEEE E EE+E+ ++E E  + E
Sbjct: 96  DDDDEEEEWEVEEDEDSDDEGEWIDVE 122


>gnl|CDD|185582 PTZ00373, PTZ00373, 60S Acidic ribosomal protein P2; Provisional.
          Length = 112

 Score = 32.6 bits (74), Expect = 0.070
 Identities = 12/19 (63%), Positives = 17/19 (89%)

Query: 141 KGEEKEEEEEEEEEEEEEE 159
           K E K+EE++EEEEEEE++
Sbjct: 87  KAEAKKEEKKEEEEEEEDD 105



 Score = 31.0 bits (70), Expect = 0.21
 Identities = 10/19 (52%), Positives = 15/19 (78%)

Query: 140 GKGEEKEEEEEEEEEEEEE 158
               +KEE++EEEEEEE++
Sbjct: 87  KAEAKKEEKKEEEEEEEDD 105



 Score = 31.0 bits (70), Expect = 0.22
 Identities = 11/21 (52%), Positives = 17/21 (80%)

Query: 143 EEKEEEEEEEEEEEEEEEEEE 163
             K E ++EE++EEEEEEE++
Sbjct: 85  GAKAEAKKEEKKEEEEEEEDD 105



 Score = 30.6 bits (69), Expect = 0.30
 Identities = 11/21 (52%), Positives = 18/21 (85%)

Query: 142 GEEKEEEEEEEEEEEEEEEEE 162
           G + E ++EE++EEEEEEE++
Sbjct: 85  GAKAEAKKEEKKEEEEEEEDD 105



 Score = 30.3 bits (68), Expect = 0.42
 Identities = 11/26 (42%), Positives = 18/26 (69%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEE 167
           G      + E ++EE++EEEEEEE++
Sbjct: 80  GAATAGAKAEAKKEEKKEEEEEEEDD 105



 Score = 30.3 bits (68), Expect = 0.50
 Identities = 11/26 (42%), Positives = 20/26 (76%)

Query: 148 EEEEEEEEEEEEEEEEEEEEEITATL 173
             + E ++EE++EEEEEEE+++  +L
Sbjct: 85  GAKAEAKKEEKKEEEEEEEDDLGFSL 110



 Score = 29.5 bits (66), Expect = 0.78
 Identities = 11/21 (52%), Positives = 17/21 (80%)

Query: 140 GKGEEKEEEEEEEEEEEEEEE 160
           G   E ++EE++EEEEEEE++
Sbjct: 85  GAKAEAKKEEKKEEEEEEEDD 105



 Score = 29.5 bits (66), Expect = 0.80
 Identities = 10/23 (43%), Positives = 16/23 (69%)

Query: 146 EEEEEEEEEEEEEEEEEEEEEEE 168
                + E ++EE++EEEEEEE+
Sbjct: 82  ATAGAKAEAKKEEKKEEEEEEED 104



 Score = 28.0 bits (62), Expect = 2.8
 Identities = 10/23 (43%), Positives = 18/23 (78%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEE 161
           T   + + ++EE++EEEEEEE++
Sbjct: 83  TAGAKAEAKKEEKKEEEEEEEDD 105


>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM).  This
           family consists of several Plasmodium falciparum SPAM
           (secreted polymorphic antigen associated with
           merozoites) proteins. Variation among SPAM alleles is
           the result of deletions and amino acid substitutions in
           non-repetitive sequences within and flanking the alanine
           heptad-repeat domain. Heptad repeats in which the a and
           d position contain hydrophobic residues generate
           amphipathic alpha-helices which give rise to helical
           bundles or coiled-coil structures in proteins. SPAM is
           an example of a P. falciparum antigen in which a
           repetitive sequence has features characteristic of a
           well-defined structural element.
          Length = 164

 Score = 33.3 bits (76), Expect = 0.070
 Identities = 18/27 (66%), Positives = 23/27 (85%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
           +E +EEEEEE+EEE EE E+ E+EEEI
Sbjct: 48  QEDDEEEEEEDEEEIEEPEDIEDEEEI 74



 Score = 32.9 bits (75), Expect = 0.11
 Identities = 16/30 (53%), Positives = 22/30 (73%)

Query: 138 YTGKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
              + EE E+ E+EEE  E+EEEEEE+EE+
Sbjct: 58  DEEEIEEPEDIEDEEEIVEDEEEEEEDEED 87



 Score = 32.5 bits (74), Expect = 0.15
 Identities = 16/25 (64%), Positives = 22/25 (88%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEE 167
           +EK+E++EEEEEE+EEE EE E+ E
Sbjct: 45  DEKQEDDEEEEEEDEEEIEEPEDIE 69



 Score = 31.8 bits (72), Expect = 0.26
 Identities = 18/27 (66%), Positives = 22/27 (81%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEE 167
           K E+ EEEEEE+EEE EE E+ E+EEE
Sbjct: 47  KQEDDEEEEEEDEEEIEEPEDIEDEEE 73



 Score = 31.0 bits (70), Expect = 0.41
 Identities = 17/28 (60%), Positives = 24/28 (85%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           K E++E++EEEEEE+EEE EE E+ E+E
Sbjct: 44  KDEKQEDDEEEEEEDEEEIEEPEDIEDE 71



 Score = 31.0 bits (70), Expect = 0.51
 Identities = 16/25 (64%), Positives = 23/25 (92%)

Query: 145 KEEEEEEEEEEEEEEEEEEEEEEEI 169
           K+E++E++EEEEEE+EEE EE E+I
Sbjct: 44  KDEKQEDDEEEEEEDEEEIEEPEDI 68



 Score = 30.6 bits (69), Expect = 0.58
 Identities = 18/28 (64%), Positives = 23/28 (82%), Gaps = 2/28 (7%)

Query: 143 EEKEEEEEEEEEEE--EEEEEEEEEEEE 168
           EE EE E+ E+EEE  E+EEEEEE+EE+
Sbjct: 60  EEIEEPEDIEDEEEIVEDEEEEEEDEED 87



 Score = 29.8 bits (67), Expect = 1.1
 Identities = 14/26 (53%), Positives = 22/26 (84%)

Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEEI 169
           ++++E++EEEEEE+EEE EE E+ E 
Sbjct: 45  DEKQEDDEEEEEEDEEEIEEPEDIED 70



 Score = 29.4 bits (66), Expect = 1.5
 Identities = 16/27 (59%), Positives = 21/27 (77%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
           EE+EE+EEE EE E+ E+EEE  E+E 
Sbjct: 53  EEEEEDEEEIEEPEDIEDEEEIVEDEE 79



 Score = 29.1 bits (65), Expect = 1.8
 Identities = 14/27 (51%), Positives = 20/27 (74%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
            E EEE  E+EEEEEE+EE+  + ++I
Sbjct: 68  IEDEEEIVEDEEEEEEDEEDNVDLKDI 94



 Score = 29.1 bits (65), Expect = 2.0
 Identities = 10/28 (35%), Positives = 20/28 (71%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
           E++EEEEE+EE+  + ++ E++   +I 
Sbjct: 76  EDEEEEEEDEEDNVDLKDIEKKNINDIF 103



 Score = 29.1 bits (65), Expect = 2.0
 Identities = 15/25 (60%), Positives = 23/25 (92%)

Query: 145 KEEEEEEEEEEEEEEEEEEEEEEEI 169
           KE E+ ++E++E++EEEEEE+EEEI
Sbjct: 38  KENEDVKDEKQEDDEEEEEEDEEEI 62



 Score = 29.1 bits (65), Expect = 2.2
 Identities = 14/25 (56%), Positives = 20/25 (80%)

Query: 146 EEEEEEEEEEEEEEEEEEEEEEEIT 170
           E++E++EEEEEE+EEE EE E+   
Sbjct: 46  EKQEDDEEEEEEDEEEIEEPEDIED 70



 Score = 28.7 bits (64), Expect = 2.9
 Identities = 12/26 (46%), Positives = 20/26 (76%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           EE  E+EEEEEE+EE+  + ++ E++
Sbjct: 72  EEIVEDEEEEEEDEEDNVDLKDIEKK 97



 Score = 27.9 bits (62), Expect = 5.3
 Identities = 12/27 (44%), Positives = 20/27 (74%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
           E  E+EEEEEE+EE+  + ++ E++ I
Sbjct: 73  EIVEDEEEEEEDEEDNVDLKDIEKKNI 99



 Score = 27.5 bits (61), Expect = 6.5
 Identities = 12/26 (46%), Positives = 20/26 (76%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           EE+  E+EEEEEE+EE+  + ++ E+
Sbjct: 71  EEEIVEDEEEEEEDEEDNVDLKDIEK 96


>gnl|CDD|162338 TIGR01398, FlhA, flagellar biosynthesis protein FlhA.  This model
           describes flagellar biosynthesis protein FlhA, one of a
           large number of genes associated with the biosynthesis
           of functional bacterial flagella. Homologs of many such
           proteins, including FlhA, function in type III protein
           secretion systems. A separate model describes InvA
           (Salmonella enterica), LcrD (Yersinia enterocolitica),
           HrcV (Xanthomonas), etc., all of which score below the
           noise cutoff for this model [Cellular processes,
           Chemotaxis and motility].
          Length = 678

 Score = 34.2 bits (79), Expect = 0.077
 Identities = 14/36 (38%), Positives = 19/36 (52%)

Query: 138 YTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATL 173
           +  +   K+EEE   E  + +EE  EEEEE I   L
Sbjct: 307 WYLRRRSKQEEEAAAEAAKAQEEAAEEEEESINDIL 342



 Score = 31.1 bits (71), Expect = 0.83
 Identities = 12/39 (30%), Positives = 19/39 (48%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPCGLC 179
           + +++EE   E  + +EE  EEEEE       L+   L 
Sbjct: 312 RSKQEEEAAAEAAKAQEEAAEEEEESINDILALDDLELE 350



 Score = 29.9 bits (68), Expect = 2.0
 Identities = 13/37 (35%), Positives = 18/37 (48%)

Query: 136 RLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
            L     +E+E   E  + +EE  EEEEE   +I A 
Sbjct: 308 YLRRRSKQEEEAAAEAAKAQEEAAEEEEESINDILAL 344


>gnl|CDD|240285 PTZ00135, PTZ00135, 60S acidic ribosomal protein P0; Provisional.
          Length = 310

 Score = 33.8 bits (78), Expect = 0.081
 Identities = 7/24 (29%), Positives = 9/24 (37%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEE 162
                          EEEEEEE++
Sbjct: 280 AAAAAAAAAAAAAPAEEEEEEEDD 303



 Score = 32.7 bits (75), Expect = 0.23
 Identities = 7/23 (30%), Positives = 9/23 (39%)

Query: 144 EKEEEEEEEEEEEEEEEEEEEEE 166
                         EEEEEEE++
Sbjct: 281 AAAAAAAAAAAAPAEEEEEEEDD 303



 Score = 31.9 bits (73), Expect = 0.33
 Identities = 7/21 (33%), Positives = 9/21 (42%)

Query: 147 EEEEEEEEEEEEEEEEEEEEE 167
                       EEEEEEE++
Sbjct: 283 AAAAAAAAAAPAEEEEEEEDD 303



 Score = 31.9 bits (73), Expect = 0.33
 Identities = 7/21 (33%), Positives = 9/21 (42%)

Query: 148 EEEEEEEEEEEEEEEEEEEEE 168
                       EEEEEEE++
Sbjct: 283 AAAAAAAAAAPAEEEEEEEDD 303



 Score = 31.9 bits (73), Expect = 0.35
 Identities = 7/21 (33%), Positives = 9/21 (42%)

Query: 143 EEKEEEEEEEEEEEEEEEEEE 163
                       EEEEEEE++
Sbjct: 283 AAAAAAAAAAPAEEEEEEEDD 303



 Score = 30.0 bits (68), Expect = 1.3
 Identities = 7/20 (35%), Positives = 7/20 (35%)

Query: 150 EEEEEEEEEEEEEEEEEEEI 169
                       EEEEEEE 
Sbjct: 283 AAAAAAAAAAPAEEEEEEED 302


>gnl|CDD|217861 pfam04050, Upf2, Up-frameshift suppressor 2.  Transcripts
           harbouring premature signals for translation termination
           are recognised and rapidly degraded by eukaryotic cells
           through a pathway known as nonsense-mediated mRNA decay.
           In Saccharomyces cerevisiae, three trans-acting factors
           (Upf1 to Upf3) are required for nonsense-mediated mRNA
           decay.
          Length = 171

 Score = 33.1 bits (76), Expect = 0.085
 Identities = 12/28 (42%), Positives = 19/28 (67%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
           GEE EE  EE+E++E  +EEE +  ++ 
Sbjct: 10  GEEDEELPEEDEDDESSDEEEVDLPDDE 37



 Score = 31.6 bits (72), Expect = 0.30
 Identities = 11/29 (37%), Positives = 21/29 (72%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
           + +E  +EEE +  ++E++EE + EEE+I
Sbjct: 21  EDDESSDEEEVDLPDDEQDEESDSEEEQI 49



 Score = 31.2 bits (71), Expect = 0.44
 Identities = 13/42 (30%), Positives = 24/42 (57%)

Query: 127 ERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           E  DG+    L     +++  +EEE +  ++E++EE + EEE
Sbjct: 6   ESDDGEEDEELPEEDEDDESSDEEEVDLPDDEQDEESDSEEE 47



 Score = 30.4 bits (69), Expect = 0.64
 Identities = 10/34 (29%), Positives = 22/34 (64%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATL 173
           G+ +E+  EE+E++E  +EEE +  ++E+   + 
Sbjct: 10  GEEDEELPEEDEDDESSDEEEVDLPDDEQDEESD 43



 Score = 30.4 bits (69), Expect = 0.77
 Identities = 13/33 (39%), Positives = 21/33 (63%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
           G   E ++ EE+EE  EE+E++E  +EEE+   
Sbjct: 2   GSESESDDGEEDEELPEEDEDDESSDEEEVDLP 34



 Score = 30.0 bits (68), Expect = 0.83
 Identities = 10/31 (32%), Positives = 18/31 (58%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
                 EEE +  ++E++EE + EEE+  +T
Sbjct: 22  DDESSDEEEVDLPDDEQDEESDSEEEQIFVT 52



 Score = 29.7 bits (67), Expect = 1.3
 Identities = 13/32 (40%), Positives = 20/32 (62%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEITATL 173
           G E E ++ EE+EE  EE+E++E  +E    L
Sbjct: 2   GSESESDDGEEDEELPEEDEDDESSDEEEVDL 33



 Score = 28.9 bits (65), Expect = 2.4
 Identities = 10/30 (33%), Positives = 18/30 (60%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
              E  +EEE +  ++E++EE + EEE+  
Sbjct: 21  EDDESSDEEEVDLPDDEQDEESDSEEEQIF 50


>gnl|CDD|100108 cd04411, Ribosomal_P1_P2_L12p, Ribosomal protein P1, P2, and L12p.
           Ribosomal proteins P1 and P2 are the eukaryotic proteins
           that are functionally equivalent to bacterial L7/L12.
           L12p is the archaeal homolog. Unlike other ribosomal
           proteins, the archaeal L12p and eukaryotic P1 and P2 do
           not share sequence similarity with their bacterial
           counterparts. They are part of the ribosomal stalk
           (called the L7/L12 stalk in bacteria), along with 28S
           rRNA and the proteins L11 and P0 in eukaryotes (23S
           rRNA, L11, and L10e in archaea). In bacterial ribosomes,
           L7/L12 homodimers bind the extended C-terminal helix of
           L10 to anchor the L7/L12 molecules to the ribosome.
           Eukaryotic P1/P2 heterodimers and archaeal L12p
           homodimers are believed to bind the L10 equivalent
           proteins, eukaryotic P0 and archaeal L10e, in a similar
           fashion. P1 and P2 (L12p, L7/L12) are the only proteins
           in the ribosome to occur as multimers, always appearing
           as sets of dimers. Recent data indicate that most
           archaeal species contain six copies of L12p (three
           homodimers), while eukaryotes have two copies each of P1
           and P2 (two heterodimers). Bacteria may have four or six
           copies (two or three homodimers), depending on the
           species. As in bacteria, the stalk is crucial for
           binding of initiation, elongation, and release factors
           in eukaryotes and archaea.
          Length = 105

 Score = 32.2 bits (73), Expect = 0.095
 Identities = 12/28 (42%), Positives = 16/28 (57%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
                E  E+ EE +EEEEEEE+E+   
Sbjct: 75  AAATAEPAEKAEEAKEEEEEEEDEDFGF 102



 Score = 31.5 bits (71), Expect = 0.14
 Identities = 13/23 (56%), Positives = 17/23 (73%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEE 161
           T +  EK EE +EEEEEEE+E+ 
Sbjct: 78  TAEPAEKAEEAKEEEEEEEDEDF 100



 Score = 31.5 bits (71), Expect = 0.16
 Identities = 12/27 (44%), Positives = 16/27 (59%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
                 E  E+ EE +EEEEEEE+E+ 
Sbjct: 74  TAAATAEPAEKAEEAKEEEEEEEDEDF 100



 Score = 30.7 bits (69), Expect = 0.29
 Identities = 13/30 (43%), Positives = 17/30 (56%)

Query: 135 PRLYTGKGEEKEEEEEEEEEEEEEEEEEEE 164
           P        E  E+ EE +EEEEEEE+E+ 
Sbjct: 71  PAATAAATAEPAEKAEEAKEEEEEEEDEDF 100



 Score = 29.9 bits (67), Expect = 0.55
 Identities = 12/27 (44%), Positives = 17/27 (62%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEE 165
           T     +  E+ EE +EEEEEEE+E+ 
Sbjct: 74  TAAATAEPAEKAEEAKEEEEEEEDEDF 100


>gnl|CDD|227701 COG5414, COG5414, TATA-binding protein-associated factor
           [Transcription].
          Length = 392

 Score = 33.5 bits (76), Expect = 0.11
 Identities = 11/27 (40%), Positives = 17/27 (62%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
             KEE + +  +E  EE+EE++E EE 
Sbjct: 306 AHKEEVQSDRPDEIGEEKEEDDENEEN 332



 Score = 32.4 bits (73), Expect = 0.33
 Identities = 10/27 (37%), Positives = 17/27 (62%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
            E  +EE + +  +E  EE+EE++E E
Sbjct: 304 AEAHKEEVQSDRPDEIGEEKEEDDENE 330



 Score = 32.0 bits (72), Expect = 0.40
 Identities = 13/32 (40%), Positives = 19/32 (59%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEITATL 173
            EE + +  +E  EE+EE++E EE E  T  L
Sbjct: 308 KEEVQSDRPDEIGEEKEEDDENEENERHTELL 339



 Score = 31.6 bits (71), Expect = 0.48
 Identities = 11/28 (39%), Positives = 17/28 (60%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           K E + +  +E  EE+EE++E EE E  
Sbjct: 308 KEEVQSDRPDEIGEEKEEDDENEENERH 335



 Score = 31.2 bits (70), Expect = 0.59
 Identities = 11/43 (25%), Positives = 20/43 (46%)

Query: 126 IERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           IE  +     +    +  E  E  +EE + +  +E  EE+EE+
Sbjct: 284 IENKEVSEGDKEQQQEEVENAEAHKEEVQSDRPDEIGEEKEED 326



 Score = 29.7 bits (66), Expect = 1.8
 Identities = 9/28 (32%), Positives = 18/28 (64%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
            +  ++E + +  +E  EE+EE++E EE
Sbjct: 304 AEAHKEEVQSDRPDEIGEEKEEDDENEE 331



 Score = 28.5 bits (63), Expect = 4.8
 Identities = 9/26 (34%), Positives = 16/26 (61%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           E  E  +EE + +  +E  EE+EE++
Sbjct: 302 ENAEAHKEEVQSDRPDEIGEEKEEDD 327



 Score = 28.5 bits (63), Expect = 5.1
 Identities = 9/29 (31%), Positives = 16/29 (55%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
             +  + +  +E  EE+EE++E EE E  
Sbjct: 307 HKEEVQSDRPDEIGEEKEEDDENEENERH 335



 Score = 28.5 bits (63), Expect = 5.1
 Identities = 11/32 (34%), Positives = 18/32 (56%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
            + +    +E  EE+EE++E EE E   E+ A
Sbjct: 309 EEVQSDRPDEIGEEKEEDDENEENERHTELLA 340


>gnl|CDD|218303 pfam04874, Mak16, Mak16 protein C-terminal region.  The precise
           function of this eukaryotic protein family is unknown.
           The yeast orthologues have been implicated in cell cycle
           progression and biogenesis of 60S ribosomal subunits.
           The Schistosoma mansoni Mak16 has been shown to target
           protein transport to the nucleolus.
          Length = 97

 Score = 31.8 bits (72), Expect = 0.11
 Identities = 19/28 (67%), Positives = 21/28 (75%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           K  E EE EE +EEEEEEEEEE+E E E
Sbjct: 53  KALEAEESEENDEEEEEEEEEEDEGEIE 80



 Score = 31.4 bits (71), Expect = 0.12
 Identities = 17/27 (62%), Positives = 19/27 (70%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
           EE EE +EEEEEEEEEE+E E E    
Sbjct: 58  EESEENDEEEEEEEEEEDEGEIEYVSD 84



 Score = 30.2 bits (68), Expect = 0.33
 Identities = 17/27 (62%), Positives = 21/27 (77%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
           EE+EEEEEE+E E E   ++EE EEEI
Sbjct: 66  EEEEEEEEEDEGEIEYVSDDEELEEEI 92



 Score = 30.2 bits (68), Expect = 0.36
 Identities = 17/26 (65%), Positives = 20/26 (76%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           E +E EE +EEEEEEEEEE+E E E 
Sbjct: 56  EAEESEENDEEEEEEEEEEDEGEIEY 81



 Score = 29.8 bits (67), Expect = 0.50
 Identities = 16/28 (57%), Positives = 19/28 (67%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
             E  EEEEEEEEEE+E E E   ++EE
Sbjct: 60  SEENDEEEEEEEEEEDEGEIEYVSDDEE 87



 Score = 29.5 bits (66), Expect = 0.61
 Identities = 17/25 (68%), Positives = 20/25 (80%)

Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEE 168
           +K  E EE EE +EEEEEEEEEE+E
Sbjct: 52  KKALEAEESEENDEEEEEEEEEEDE 76



 Score = 29.1 bits (65), Expect = 0.87
 Identities = 16/29 (55%), Positives = 20/29 (68%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
           EE +EEEEEEEEEE+E E E   ++E   
Sbjct: 61  EENDEEEEEEEEEEDEGEIEYVSDDEELE 89



 Score = 28.3 bits (63), Expect = 1.8
 Identities = 16/26 (61%), Positives = 20/26 (76%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           EE+EEEEEEE+E E E   ++EE EE
Sbjct: 65  EEEEEEEEEEDEGEIEYVSDDEELEE 90



 Score = 28.3 bits (63), Expect = 1.9
 Identities = 16/24 (66%), Positives = 18/24 (75%)

Query: 147 EEEEEEEEEEEEEEEEEEEEEEIT 170
           E EE EE +EEEEEEEEEE+E   
Sbjct: 56  EAEESEENDEEEEEEEEEEDEGEI 79



 Score = 27.5 bits (61), Expect = 3.4
 Identities = 16/29 (55%), Positives = 19/29 (65%)

Query: 145 KEEEEEEEEEEEEEEEEEEEEEEEITATL 173
           K+  E EE EE +EEEEEEEEEE+     
Sbjct: 52  KKALEAEESEENDEEEEEEEEEEDEGEIE 80



 Score = 27.1 bits (60), Expect = 3.8
 Identities = 15/27 (55%), Positives = 19/27 (70%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
            EE+EEEEE+E E E   ++EE EEE 
Sbjct: 66  EEEEEEEEEDEGEIEYVSDDEELEEEI 92



 Score = 27.1 bits (60), Expect = 4.9
 Identities = 14/27 (51%), Positives = 20/27 (74%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
           EE+EEE+E E E   ++EE EEE E++
Sbjct: 69  EEEEEEDEGEIEYVSDDEELEEEIEDL 95



 Score = 26.4 bits (58), Expect = 8.7
 Identities = 15/26 (57%), Positives = 19/26 (73%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           EE+EEEE+E E E   ++EE EEE E
Sbjct: 68  EEEEEEEDEGEIEYVSDDEELEEEIE 93


>gnl|CDD|237063 PRK12329, nusA, transcription elongation factor NusA; Provisional.
          Length = 449

 Score = 33.6 bits (77), Expect = 0.12
 Identities = 13/33 (39%), Positives = 17/33 (51%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
            ++EEEE  + E EE  E E+ E  E  A L  
Sbjct: 390 SQREEEEALQREAEERLEAEQAERAEEDARLRE 422



 Score = 32.4 bits (74), Expect = 0.30
 Identities = 13/33 (39%), Positives = 17/33 (51%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
           EE+E  + E EE  E E+ E  EE+     L P
Sbjct: 393 EEEEALQREAEERLEAEQAERAEEDARLRELYP 425



 Score = 32.4 bits (74), Expect = 0.31
 Identities = 14/45 (31%), Positives = 20/45 (44%)

Query: 127 ERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
           E L  +   RL   + E  EE+    E     E+E E+E+E   A
Sbjct: 396 EALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDELEEA 440



 Score = 30.5 bits (69), Expect = 1.1
 Identities = 12/30 (40%), Positives = 17/30 (56%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
           +  E     E+E E+E+E EE + EEEE  
Sbjct: 419 RLRELYPLPEDEFEDEDELEEAQPEEEEEA 448



 Score = 29.8 bits (67), Expect = 1.8
 Identities = 7/33 (21%), Positives = 11/33 (33%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
            E+  E E+ E  EE+    E          + 
Sbjct: 402 AEERLEAEQAERAEEDARLRELYPLPEDEFEDE 434



 Score = 29.8 bits (67), Expect = 1.9
 Identities = 11/26 (42%), Positives = 14/26 (53%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           E   + EEEE  + E EE  E E+ E
Sbjct: 387 ELISQREEEEALQREAEERLEAEQAE 412



 Score = 29.4 bits (66), Expect = 2.3
 Identities = 10/27 (37%), Positives = 13/27 (48%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
           EE     E     E+E E+E+E EE  
Sbjct: 415 EEDARLRELYPLPEDEFEDEDELEEAQ 441



 Score = 29.4 bits (66), Expect = 2.4
 Identities = 12/29 (41%), Positives = 17/29 (58%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
            +      E+E E+E+E EE + EEEEE 
Sbjct: 420 LRELYPLPEDEFEDEDELEEAQPEEEEEA 448



 Score = 29.4 bits (66), Expect = 2.5
 Identities = 12/26 (46%), Positives = 16/26 (61%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
               E+E E+E+E EE + EEEEE  
Sbjct: 424 YPLPEDEFEDEDELEEAQPEEEEEAR 449



 Score = 29.0 bits (65), Expect = 3.4
 Identities = 9/29 (31%), Positives = 14/29 (48%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
           E+    E     E+E E+E+E EE +   
Sbjct: 416 EDARLRELYPLPEDEFEDEDELEEAQPEE 444



 Score = 29.0 bits (65), Expect = 3.6
 Identities = 9/33 (27%), Positives = 13/33 (39%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
           E ++ E  EE+    E     E+E E    L  
Sbjct: 407 EAEQAERAEEDARLRELYPLPEDEFEDEDELEE 439



 Score = 29.0 bits (65), Expect = 4.0
 Identities = 8/33 (24%), Positives = 14/33 (42%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
           + +  EE+    E     E+E E+E+      P
Sbjct: 410 QAERAEEDARLRELYPLPEDEFEDEDELEEAQP 442



 Score = 28.2 bits (63), Expect = 7.0
 Identities = 10/31 (32%), Positives = 15/31 (48%)

Query: 138 YTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           Y  + E+ +  E   + EEEE  + E EE  
Sbjct: 376 YDQEAEDAKVAELISQREEEEALQREAEERL 406



 Score = 27.8 bits (62), Expect = 7.2
 Identities = 11/30 (36%), Positives = 14/30 (46%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
            E   + EEEE  + E EE  E E+   A 
Sbjct: 386 AELISQREEEEALQREAEERLEAEQAERAE 415


>gnl|CDD|191634 pfam06886, TPX2, Targeting protein for Xklp2 (TPX2).  This family
           represents a conserved region approximately 60 residues
           long within the eukaryotic targeting protein for Xklp2
           (TPX2). Xklp2 is a kinesin-like protein localised on
           centrosomes throughout the cell cycle and on spindle
           pole microtubules during metaphase. In Xenopus, it has
           been shown that Xklp2 protein is required for centrosome
           separation and maintenance of spindle bi-polarity. TPX2
           is a microtubule-associated protein that mediates the
           binding of the C-terminal domain of Xklp2 to
           microtubules. It is phosphorylated during mitosis in a
           microtubule-dependent way.
          Length = 57

 Score = 30.4 bits (69), Expect = 0.12
 Identities = 16/36 (44%), Positives = 22/36 (61%), Gaps = 1/36 (2%)

Query: 132 KRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
           ++R      K EEKE+  E E+EE E  ++EEEEE 
Sbjct: 8   EKRAEFDK-KLEEKEKALEAEKEEAEARQKEEEEEA 42


>gnl|CDD|184885 PRK14891, PRK14891, 50S ribosomal protein L24e/unknown domain
           fusion protein; Provisional.
          Length = 131

 Score = 32.2 bits (73), Expect = 0.12
 Identities = 8/33 (24%), Positives = 12/33 (36%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
            KG         E  EE E  + +E+ +E    
Sbjct: 59  EKGPAAAATAAAEAAEEAEAADADEDADEAAEA 91



 Score = 31.5 bits (71), Expect = 0.22
 Identities = 9/31 (29%), Positives = 16/31 (51%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
            EE E  + +E+ +E  E +  +E +E   T
Sbjct: 73  AEEAEAADADEDADEAAEADAADEADEEEET 103



 Score = 31.5 bits (71), Expect = 0.25
 Identities = 10/38 (26%), Positives = 16/38 (42%)

Query: 131 GKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
            K      T   E  EE E  + +E+ +E  E +  +E
Sbjct: 59  EKGPAAAATAAAEAAEEAEAADADEDADEAAEADAADE 96



 Score = 29.5 bits (66), Expect = 1.1
 Identities = 9/47 (19%), Positives = 17/47 (36%)

Query: 126 IERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
           +E  +  R  +          E  EE E  + +E+ +E  E +    
Sbjct: 50  LEWTEAGRAEKGPAAAATAAAEAAEEAEAADADEDADEAAEADAADE 96



 Score = 29.1 bits (65), Expect = 1.4
 Identities = 9/28 (32%), Positives = 16/28 (57%)

Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEEITA 171
           E  + +E+ +E  E +  +E +EEE T 
Sbjct: 77  EAADADEDADEAAEADAADEADEEEETD 104



 Score = 29.1 bits (65), Expect = 1.4
 Identities = 11/26 (42%), Positives = 17/26 (65%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEE 167
            +E  +E  +E + E EE +EEE+EE
Sbjct: 103 TDEAVDETADEADAEAEEADEEEDEE 128



 Score = 28.8 bits (64), Expect = 1.7
 Identities = 12/26 (46%), Positives = 17/26 (65%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           E  +E  +E + E EE +EEE+EE E
Sbjct: 105 EAVDETADEADAEAEEADEEEDEEAE 130



 Score = 28.4 bits (63), Expect = 2.2
 Identities = 11/25 (44%), Positives = 15/25 (60%)

Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEE 168
             E  +E + E EE +EEE+EE E 
Sbjct: 107 VDETADEADAEAEEADEEEDEEAEA 131



 Score = 28.4 bits (63), Expect = 2.9
 Identities = 7/38 (18%), Positives = 16/38 (42%)

Query: 131 GKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           G         +  E+ E  + +E+ +E  E +  +E +
Sbjct: 61  GPAAAATAAAEAAEEAEAADADEDADEAAEADAADEAD 98



 Score = 28.0 bits (62), Expect = 3.0
 Identities = 11/29 (37%), Positives = 18/29 (62%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
           T +  ++  +E + E EE +EEE+EE E 
Sbjct: 103 TDEAVDETADEADAEAEEADEEEDEEAEA 131



 Score = 27.6 bits (61), Expect = 4.1
 Identities = 9/30 (30%), Positives = 18/30 (60%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
            +++ +E  E +  +E +EEEE +E +  T
Sbjct: 81  ADEDADEAAEADAADEADEEEETDEAVDET 110



 Score = 27.6 bits (61), Expect = 4.1
 Identities = 11/25 (44%), Positives = 16/25 (64%)

Query: 147 EEEEEEEEEEEEEEEEEEEEEEITA 171
            +E  +E + E EE +EEE+EE  A
Sbjct: 107 VDETADEADAEAEEADEEEDEEAEA 131



 Score = 27.6 bits (61), Expect = 4.5
 Identities = 9/26 (34%), Positives = 16/26 (61%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           +  E+ +E  E +  +E +EEEE +E
Sbjct: 80  DADEDADEAAEADAADEADEEEETDE 105



 Score = 27.2 bits (60), Expect = 5.6
 Identities = 11/26 (42%), Positives = 16/26 (61%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           E  E  +E  +E + E EE +EEE+E
Sbjct: 102 ETDEAVDETADEADAEAEEADEEEDE 127



 Score = 27.2 bits (60), Expect = 5.6
 Identities = 8/33 (24%), Positives = 16/33 (48%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
            + +E+ +E  E +  +E +EEEE +       
Sbjct: 79  ADADEDADEAAEADAADEADEEEETDEAVDETA 111



 Score = 27.2 bits (60), Expect = 5.7
 Identities = 9/29 (31%), Positives = 15/29 (51%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
               +E+ +E  E +  +E +EEEE   A
Sbjct: 78  AADADEDADEAAEADAADEADEEEETDEA 106



 Score = 26.8 bits (59), Expect = 7.7
 Identities = 7/28 (25%), Positives = 13/28 (46%)

Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEEITA 171
            +  EE E  + +E+ +E  E +    A
Sbjct: 70  AEAAEEAEAADADEDADEAAEADAADEA 97



 Score = 26.8 bits (59), Expect = 9.4
 Identities = 11/30 (36%), Positives = 18/30 (60%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
            E+ +E  E +  +E +EEEE +E  + TA
Sbjct: 82  DEDADEAAEADAADEADEEEETDEAVDETA 111


>gnl|CDD|237622 PRK14140, PRK14140, heat shock protein GrpE; Provisional.
          Length = 191

 Score = 32.7 bits (75), Expect = 0.13
 Identities = 16/34 (47%), Positives = 20/34 (58%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
           + EE+ EE E EE  E+E EEE  EEE     L+
Sbjct: 7   QVEEEVEETEVEEAVEDEVEEETVEEESEAELLD 40



 Score = 32.7 bits (75), Expect = 0.14
 Identities = 16/28 (57%), Positives = 18/28 (64%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           K E+ EEE EE E EE  E+E EEE  E
Sbjct: 4   KNEQVEEEVEETEVEEAVEDEVEEETVE 31



 Score = 32.7 bits (75), Expect = 0.14
 Identities = 15/29 (51%), Positives = 17/29 (58%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
           EE E EE  E+E EEE  EEE E E +  
Sbjct: 13  EETEVEEAVEDEVEEETVEEESEAELLDE 41



 Score = 32.3 bits (74), Expect = 0.17
 Identities = 14/30 (46%), Positives = 17/30 (56%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
             + E +E E EE  E+E EEE  EEE E 
Sbjct: 7   QVEEEVEETEVEEAVEDEVEEETVEEESEA 36



 Score = 31.9 bits (73), Expect = 0.23
 Identities = 14/26 (53%), Positives = 17/26 (65%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           EE  E+E EEE  EEE E E  +EE+
Sbjct: 18  EEAVEDEVEEETVEEESEAELLDEEQ 43



 Score = 31.9 bits (73), Expect = 0.29
 Identities = 17/28 (60%), Positives = 19/28 (67%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
           EEK E+ EEE EE E EE  E+E EE T
Sbjct: 2   EEKNEQVEEEVEETEVEEAVEDEVEEET 29



 Score = 31.5 bits (72), Expect = 0.39
 Identities = 15/29 (51%), Positives = 18/29 (62%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
           E+ E+ EEE EE E EE  E+E EEE   
Sbjct: 3   EKNEQVEEEVEETEVEEAVEDEVEEETVE 31



 Score = 30.0 bits (68), Expect = 1.1
 Identities = 13/29 (44%), Positives = 15/29 (51%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
           E + EE  E+E EEE  EEE E E     
Sbjct: 14  ETEVEEAVEDEVEEETVEEESEAELLDEE 42



 Score = 30.0 bits (68), Expect = 1.3
 Identities = 14/32 (43%), Positives = 20/32 (62%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
           +E EEE  EEE E E  +EE+ +  E+ A L+
Sbjct: 23  DEVEEETVEEESEAELLDEEQAKIAELEAKLD 54



 Score = 29.6 bits (67), Expect = 1.6
 Identities = 12/27 (44%), Positives = 16/27 (59%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEE 167
           +  E E EEE  EEE E E  +EE+ +
Sbjct: 19  EAVEDEVEEETVEEESEAELLDEEQAK 45



 Score = 29.2 bits (66), Expect = 2.2
 Identities = 13/26 (50%), Positives = 16/26 (61%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           E  E+E EEE  EEE E E  +EE+ 
Sbjct: 19  EAVEDEVEEETVEEESEAELLDEEQA 44



 Score = 28.8 bits (65), Expect = 2.9
 Identities = 13/33 (39%), Positives = 17/33 (51%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
            E++ E+ EEE EE E EE  E+E E       
Sbjct: 1   MEEKNEQVEEEVEETEVEEAVEDEVEEETVEEE 33


>gnl|CDD|205206 pfam13025, DUF3886, Protein of unknown function (DUF3886).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are approximately 90 amino acids in length.
           There are two completely conserved L residues that may
           be functionally important.
          Length = 70

 Score = 30.8 bits (70), Expect = 0.14
 Identities = 18/44 (40%), Positives = 25/44 (56%)

Query: 125 LIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           L + L+     +L   K E K EEE+ EEEEE  + EE +E E+
Sbjct: 10  LKDMLNADLLAKLKAKKKELKAEEEKREEEEEARKREERKEREK 53



 Score = 29.2 bits (66), Expect = 0.52
 Identities = 11/25 (44%), Positives = 17/25 (68%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEE 167
           + +EE+ EEEEE  + EE +E E+ 
Sbjct: 30  KAEEEKREEEEEARKREERKEREKN 54



 Score = 29.2 bits (66), Expect = 0.54
 Identities = 12/25 (48%), Positives = 17/25 (68%)

Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEE 168
           + EEE+ EEEEE  + EE +E E+ 
Sbjct: 30  KAEEEKREEEEEARKREERKEREKN 54



 Score = 28.1 bits (63), Expect = 1.1
 Identities = 12/28 (42%), Positives = 19/28 (67%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           + E++EEEEE  + EE +E E+ +  EE
Sbjct: 32  EEEKREEEEEARKREERKEREKNKSFEE 59



 Score = 27.7 bits (62), Expect = 1.8
 Identities = 14/30 (46%), Positives = 20/30 (66%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
           K ++KE + EEE+ EEEEE  + EE +E  
Sbjct: 23  KAKKKELKAEEEKREEEEEARKREERKERE 52



 Score = 26.1 bits (58), Expect = 5.5
 Identities = 12/25 (48%), Positives = 17/25 (68%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEE 165
           K EE++ EEEEE  + EE +E E+ 
Sbjct: 30  KAEEEKREEEEEARKREERKEREKN 54


>gnl|CDD|184724 PRK14520, rpsP, 30S ribosomal protein S16; Provisional.
          Length = 155

 Score = 32.4 bits (74), Expect = 0.14
 Identities = 6/27 (22%), Positives = 6/27 (22%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEE 167
              E    E      E       EEE 
Sbjct: 129 AAAEAAAAEAAAPAAEAAAAAAAEEEA 155



 Score = 32.0 bits (73), Expect = 0.17
 Identities = 6/28 (21%), Positives = 7/28 (25%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           K +   E    E      E       EE
Sbjct: 126 KKKAAAEAAAAEAAAPAAEAAAAAAAEE 153



 Score = 31.6 bits (72), Expect = 0.26
 Identities = 6/28 (21%), Positives = 7/28 (25%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           K   +    E      E       EEE 
Sbjct: 128 KAAAEAAAAEAAAPAAEAAAAAAAEEEA 155



 Score = 31.2 bits (71), Expect = 0.32
 Identities = 7/28 (25%), Positives = 7/28 (25%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           K    E    E      E       EEE
Sbjct: 127 KKAAAEAAAAEAAAPAAEAAAAAAAEEE 154



 Score = 30.0 bits (68), Expect = 0.74
 Identities = 6/30 (20%), Positives = 8/30 (26%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           T K ++   E    E      E       E
Sbjct: 123 TPKKKKAAAEAAAAEAAAPAAEAAAAAAAE 152



 Score = 28.1 bits (63), Expect = 4.2
 Identities = 4/29 (13%), Positives = 7/29 (24%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
            +K++   E    E      E        
Sbjct: 124 PKKKKAAAEAAAAEAAAPAAEAAAAAAAE 152



 Score = 27.0 bits (60), Expect = 9.9
 Identities = 6/31 (19%), Positives = 8/31 (25%)

Query: 132 KRRPRLYTGKGEEKEEEEEEEEEEEEEEEEE 162
           K++         E      E       EEE 
Sbjct: 125 KKKKAAAEAAAAEAAAPAAEAAAAAAAEEEA 155


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 33.4 bits (76), Expect = 0.14
 Identities = 11/28 (39%), Positives = 18/28 (64%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           K E +++E+ EE EEE+ EEE    ++ 
Sbjct: 311 KPEIEQDEDSEESEEEKNEEEGGLSKKG 338



 Score = 33.0 bits (75), Expect = 0.19
 Identities = 12/28 (42%), Positives = 17/28 (60%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
              E   + E E++E+ EE EEE+ EEE
Sbjct: 304 LSPEIPAKPEIEQDEDSEESEEEKNEEE 331



 Score = 31.9 bits (72), Expect = 0.49
 Identities = 12/26 (46%), Positives = 16/26 (61%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
             K E E++E+ EE EEE+ EEE   
Sbjct: 309 PAKPEIEQDEDSEESEEEKNEEEGGL 334



 Score = 31.1 bits (70), Expect = 0.91
 Identities = 11/28 (39%), Positives = 16/28 (57%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
              + E E++E+ EE EEE+ EEE    
Sbjct: 308 IPAKPEIEQDEDSEESEEEKNEEEGGLS 335



 Score = 30.7 bits (69), Expect = 1.00
 Identities = 11/26 (42%), Positives = 16/26 (61%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           E+K   E   + E E++E+ EE EEE
Sbjct: 301 EDKLSPEIPAKPEIEQDEDSEESEEE 326



 Score = 30.3 bits (68), Expect = 1.5
 Identities = 10/26 (38%), Positives = 17/26 (65%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           E+ E+ EE EEE+ EEE    ++ ++
Sbjct: 315 EQDEDSEESEEEKNEEEGGLSKKGKK 340



 Score = 29.9 bits (67), Expect = 1.7
 Identities = 9/29 (31%), Positives = 17/29 (58%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
            + ++   E   + E E++E+ EE EEE+
Sbjct: 299 EREDKLSPEIPAKPEIEQDEDSEESEEEK 327



 Score = 29.2 bits (65), Expect = 3.5
 Identities = 9/26 (34%), Positives = 16/26 (61%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEE 166
           + +E  EE EEE+ EEE    ++ ++
Sbjct: 315 EQDEDSEESEEEKNEEEGGLSKKGKK 340



 Score = 28.8 bits (64), Expect = 4.5
 Identities = 10/40 (25%), Positives = 21/40 (52%)

Query: 126 IERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEE 165
            ++L  +   +    + E+ EE EEE+ EEE    ++ ++
Sbjct: 301 EDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKK 340



 Score = 28.4 bits (63), Expect = 5.8
 Identities = 9/26 (34%), Positives = 16/26 (61%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           EE+E++   E   + E E++E+ EE 
Sbjct: 298 EEREDKLSPEIPAKPEIEQDEDSEES 323


>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
          Length = 211

 Score = 32.8 bits (75), Expect = 0.14
 Identities = 14/34 (41%), Positives = 20/34 (58%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
           K E+K +EE+ E EE E+EE  E+ EE     + 
Sbjct: 24  KEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIE 57



 Score = 31.3 bits (71), Expect = 0.45
 Identities = 13/26 (50%), Positives = 19/26 (73%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           E KEE++ +EE+ E EE E+EE  E+
Sbjct: 22  ENKEEDKGKEEDLEFEEIEKEEIIED 47



 Score = 31.3 bits (71), Expect = 0.46
 Identities = 14/30 (46%), Positives = 18/30 (60%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
            GK E+ E EE E+EE  E+ EE  E + E
Sbjct: 28  KGKEEDLEFEEIEKEEIIEDSEESNEVKIE 57



 Score = 31.3 bits (71), Expect = 0.47
 Identities = 11/28 (39%), Positives = 18/28 (64%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           K E  EE+  +E E +EE++ +EE+ E 
Sbjct: 9   KHENMEEDCCKENENKEEDKGKEEDLEF 36



 Score = 30.9 bits (70), Expect = 0.60
 Identities = 11/26 (42%), Positives = 18/26 (69%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           E+  +E E +EE++ +EE+ E EE E
Sbjct: 15  EDCCKENENKEEDKGKEEDLEFEEIE 40



 Score = 30.9 bits (70), Expect = 0.63
 Identities = 12/27 (44%), Positives = 20/27 (74%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
            ++ E +EE++ +EE+ E EE E+EEI
Sbjct: 18  CKENENKEEDKGKEEDLEFEEIEKEEI 44



 Score = 30.5 bits (69), Expect = 0.84
 Identities = 13/27 (48%), Positives = 20/27 (74%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
           +E E +EE++ +EE+ E EE E+EE I
Sbjct: 19  KENENKEEDKGKEEDLEFEEIEKEEII 45



 Score = 30.5 bits (69), Expect = 0.85
 Identities = 12/28 (42%), Positives = 17/28 (60%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
           EE E+EE  E+ EE  E + EE ++E  
Sbjct: 37  EEIEKEEIIEDSEESNEVKIEELKDENN 64



 Score = 30.5 bits (69), Expect = 0.92
 Identities = 13/28 (46%), Positives = 19/28 (67%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           K  E +EE++ +EE+ E EE E+EE  E
Sbjct: 19  KENENKEEDKGKEEDLEFEEIEKEEIIE 46



 Score = 29.3 bits (66), Expect = 2.0
 Identities = 11/27 (40%), Positives = 17/27 (62%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
           + + EE E+EE  E+ EE  E + EE+
Sbjct: 33  DLEFEEIEKEEIIEDSEESNEVKIEEL 59



 Score = 28.2 bits (63), Expect = 4.8
 Identities = 10/26 (38%), Positives = 17/26 (65%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           + K E  EE+  +E E +EE++ +EE
Sbjct: 7   DAKHENMEEDCCKENENKEEDKGKEE 32



 Score = 27.8 bits (62), Expect = 5.8
 Identities = 10/29 (34%), Positives = 19/29 (65%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
           K  + E  EE+  +E E +EE++ +EE++
Sbjct: 6   KDAKHENMEEDCCKENENKEEDKGKEEDL 34



 Score = 27.8 bits (62), Expect = 6.6
 Identities = 11/25 (44%), Positives = 15/25 (60%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEE 166
             EKEE  E+ EE  E + EE ++E
Sbjct: 38  EIEKEEIIEDSEESNEVKIEELKDE 62



 Score = 27.4 bits (61), Expect = 7.5
 Identities = 8/27 (29%), Positives = 17/27 (62%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
           E++ ++ + E  EE+  +E E +EE+ 
Sbjct: 2   EKECKDAKHENMEEDCCKENENKEEDK 28



 Score = 27.0 bits (60), Expect = 9.6
 Identities = 10/26 (38%), Positives = 16/26 (61%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
            EKE ++ + E  EE+  +E E +EE
Sbjct: 1   MEKECKDAKHENMEEDCCKENENKEE 26


>gnl|CDD|236152 PRK08115, PRK08115, ribonucleotide-diphosphate reductase subunit
           alpha; Validated.
          Length = 858

 Score = 33.7 bits (77), Expect = 0.14
 Identities = 21/73 (28%), Positives = 32/73 (43%), Gaps = 15/73 (20%)

Query: 123 KGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATL--------- 173
           KG    +DG R  ++ T K EE + +E+     EE EEEE ++   +  T+         
Sbjct: 762 KGGTVYVDGSRDAQVLTLKAEENDLDEQLLLLPEELEEEETKKPVVLVDTIQDVRSTDVT 821

Query: 174 ------NPCGLCR 180
                 N C +CR
Sbjct: 822 IGSEVGNTCPVCR 834


>gnl|CDD|222648 pfam14283, DUF4366, Domain of unknown function (DUF4366).  This
           family of proteins is found in bacteria and eukaryotes.
           Proteins in this family are typically between 227 and
           387 amino acids in length.
          Length = 213

 Score = 32.7 bits (75), Expect = 0.18
 Identities = 11/36 (30%), Positives = 13/36 (36%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPCGL 178
                   E    E E E E EEE E  + + P  L
Sbjct: 128 PVCSVNMTECTGPEPEPEPEPEEEPEKKSGMGPLLL 163



 Score = 31.5 bits (72), Expect = 0.37
 Identities = 10/31 (32%), Positives = 20/31 (64%)

Query: 138 YTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           Y  K +EK   +++ +E +  +E+EEE++E 
Sbjct: 181 YKPKQQEKGAPDDDLDEYDYGDEDEEEDDEP 211



 Score = 30.0 bits (68), Expect = 1.2
 Identities = 9/34 (26%), Positives = 13/34 (38%)

Query: 138 YTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
           +   GE + E  + E  +  E   E   EEE   
Sbjct: 21  FAYGGEPEPEVPQTEATDPSEVVVETVTEEEGGP 54



 Score = 29.2 bits (66), Expect = 2.4
 Identities = 8/26 (30%), Positives = 16/26 (61%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEE 166
           KG   ++ +E +  +E+EEE++E   
Sbjct: 188 KGAPDDDLDEYDYGDEDEEEDDEPPW 213



 Score = 28.1 bits (63), Expect = 4.9
 Identities = 8/25 (32%), Positives = 8/25 (32%)

Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEE 168
                    E    E E E E EEE
Sbjct: 127 CPVCSVNMTECTGPEPEPEPEPEEE 151



 Score = 28.1 bits (63), Expect = 5.1
 Identities = 12/36 (33%), Positives = 14/36 (38%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPCG 177
           G E E E  + E  +  E   E   EE    L P G
Sbjct: 24  GGEPEPEVPQTEATDPSEVVVETVTEEEGGPLTPEG 59



 Score = 28.1 bits (63), Expect = 5.5
 Identities = 8/26 (30%), Positives = 8/26 (30%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
                    E    E E E E EEE 
Sbjct: 127 CPVCSVNMTECTGPEPEPEPEPEEEP 152


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 32.4 bits (74), Expect = 0.18
 Identities = 9/52 (17%), Positives = 31/52 (59%)

Query: 123 KGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
           K   E     +  +  + K ++K++++++++++++ E+++E+E E+    L 
Sbjct: 75  KKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLT 126



 Score = 32.0 bits (73), Expect = 0.21
 Identities = 14/69 (20%), Positives = 37/69 (53%), Gaps = 10/69 (14%)

Query: 107 KVTNEVERTGRKPWPQKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEE 166
           KV  E E   ++ W  K    +            K ++K++++++++ E+++E+E E++ 
Sbjct: 73  KVKKEYEE--KQKWKWKKKKSKKK--------KDKDKDKKDDKKDDKSEKKDEKEAEDKL 122

Query: 167 EEITATLNP 175
           E++T + + 
Sbjct: 123 EDLTKSYSE 131


>gnl|CDD|173502 PTZ00266, PTZ00266, NIMA-related protein kinase; Provisional.
          Length = 1021

 Score = 33.2 bits (75), Expect = 0.18
 Identities = 24/70 (34%), Positives = 30/70 (42%), Gaps = 5/70 (7%)

Query: 104 DHTKVTN-EVERTGRKPWPQKGL----IERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEE 158
           DH +    E E   RK    K L    IERL+ + R RL   + E  E E  E E  E E
Sbjct: 433 DHAERARIEKENAHRKALEMKILEKKRIERLEREERERLERERMERIERERLERERLERE 492

Query: 159 EEEEEEEEEE 168
             E +  E +
Sbjct: 493 RLERDRLERD 502


>gnl|CDD|237744 PRK14521, rpsP, 30S ribosomal protein S16; Provisional.
          Length = 186

 Score = 32.4 bits (74), Expect = 0.18
 Identities = 12/29 (41%), Positives = 13/29 (44%)

Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
           EK+  E      EE    EEEE EE  A 
Sbjct: 149 EKKAAEAAAVAAEEAAAAEEEEAEEAPAE 177



 Score = 29.4 bits (66), Expect = 1.8
 Identities = 13/32 (40%), Positives = 13/32 (40%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
           K  E      EE    EEEE EE   EE  A 
Sbjct: 151 KAAEAAAVAAEEAAAAEEEEAEEAPAEEAPAE 182



 Score = 28.6 bits (64), Expect = 3.5
 Identities = 11/25 (44%), Positives = 12/25 (48%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEE 167
           E+K  E      EE    EEEE EE
Sbjct: 149 EKKAAEAAAVAAEEAAAAEEEEAEE 173



 Score = 27.4 bits (61), Expect = 7.1
 Identities = 12/25 (48%), Positives = 12/25 (48%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEE 167
           E    EEEE EE   EE   EE  E
Sbjct: 162 EAAAAEEEEAEEAPAEEAPAEESAE 186



 Score = 27.4 bits (61), Expect = 8.3
 Identities = 11/26 (42%), Positives = 12/26 (46%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
             +E    EEEE EE   EE   EE 
Sbjct: 159 AAEEAAAAEEEEAEEAPAEEAPAEES 184



 Score = 27.4 bits (61), Expect = 8.3
 Identities = 13/26 (50%), Positives = 13/26 (50%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           EE    EEEE EE   EE   EE  E
Sbjct: 161 EEAAAAEEEEAEEAPAEEAPAEESAE 186


>gnl|CDD|115196 pfam06524, NOA36, NOA36 protein.  This family consists of several
           NOA36 proteins which contain 29 highly conserved
           cysteine residues. The function of this protein is
           unknown.
          Length = 314

 Score = 33.0 bits (75), Expect = 0.19
 Identities = 15/80 (18%), Positives = 37/80 (46%), Gaps = 3/80 (3%)

Query: 96  KKYKWVRFDHTKVTNEVERTGRKPWPQKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEE 155
           + YK+ R      +++ +  GR    ++ L     G  +   Y  + ++ E     + +E
Sbjct: 230 RSYKFGRQTRGFRSDDDQGYGRY---KQNLSSNKYGDFKGVNYGYESDDDEGSSSNDYDE 286

Query: 156 EEEEEEEEEEEEEITATLNP 175
           EE+ ++++ E+ + T T + 
Sbjct: 287 EEDGDDDDNEDNDDTNTNHG 306


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 33.1 bits (76), Expect = 0.19
 Identities = 11/39 (28%), Positives = 23/39 (58%), Gaps = 4/39 (10%)

Query: 131  GKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
              RRPR    K +     E++++ E ++ E+E++E++E 
Sbjct: 1352 LLRRPR----KKKSDSSSEDDDDSEVDDSEDEDDEDDED 1386



 Score = 32.3 bits (74), Expect = 0.42
 Identities = 8/38 (21%), Positives = 23/38 (60%)

Query: 131  GKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
             +   R    K +   E++++ E ++ E+E++E++E++
Sbjct: 1350 SRLLRRPRKKKSDSSSEDDDDSEVDDSEDEDDEDDEDD 1387



 Score = 30.8 bits (70), Expect = 1.2
 Identities = 7/48 (14%), Positives = 24/48 (50%)

Query: 121  PQKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
             ++    +     R           E++++ E ++ E+E++E++E+++
Sbjct: 1341 VKQASASQSSRLLRRPRKKKSDSSSEDDDDSEVDDSEDEDDEDDEDDD 1388


>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon. 
          Length = 431

 Score = 32.7 bits (74), Expect = 0.20
 Identities = 17/29 (58%), Positives = 19/29 (65%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
           G E E  EEEE+EE  EE EE EE E +T
Sbjct: 105 GAENETVEEEEKEESREEREEVEETEGVT 133



 Score = 32.3 bits (73), Expect = 0.29
 Identities = 16/40 (40%), Positives = 18/40 (45%)

Query: 133 RRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
             P     +    E E  EEEE+EE  EE EE EE    T
Sbjct: 94  SEPSRRMQEDSGAENETVEEEEKEESREEREEVEETEGVT 133



 Score = 31.5 bits (71), Expect = 0.59
 Identities = 15/37 (40%), Positives = 18/37 (48%)

Query: 128 RLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEE 164
               +R       + E  EEEE+EE  EE EE EE E
Sbjct: 94  SEPSRRMQEDSGAENETVEEEEKEESREEREEVEETE 130



 Score = 31.2 bits (70), Expect = 0.79
 Identities = 13/32 (40%), Positives = 20/32 (62%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
             K E+K +  + EE ++EE+E E EEEE+  
Sbjct: 132 VTKSEQKNDWRDAEECQKEEKEPEPEEEEKPK 163



 Score = 30.8 bits (69), Expect = 0.89
 Identities = 14/30 (46%), Positives = 18/30 (60%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           T + EEKEE  EE EE EE E   + E++ 
Sbjct: 110 TVEEEEKEESREEREEVEETEGVTKSEQKN 139



 Score = 30.8 bits (69), Expect = 0.98
 Identities = 17/35 (48%), Positives = 20/35 (57%)

Query: 131 GKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEE 165
             RR +  +G   E  EEEE+EE  EE EE EE E
Sbjct: 96  PSRRMQEDSGAENETVEEEEKEESREEREEVEETE 130



 Score = 29.6 bits (66), Expect = 2.4
 Identities = 13/36 (36%), Positives = 18/36 (50%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPC 176
             EE+E+EE  EE EE EE E   + E+     +  
Sbjct: 110 TVEEEEKEESREEREEVEETEGVTKSEQKNDWRDAE 145



 Score = 27.7 bits (61), Expect = 9.8
 Identities = 10/26 (38%), Positives = 16/26 (61%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           EE+EE EE E   + E++ +  + EE
Sbjct: 121 EEREEVEETEGVTKSEQKNDWRDAEE 146


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
           subunit.  This is a family of proteins which are
           subunits of the eukaryotic translation initiation factor
           3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
           cerevisiae protein eIF3j (HCR1) has been shown to be
           required for processing of 20S pre-rRNA and binds to 18S
           rRNA and eIF3 subunits Rpg1p and Prt1p.
          Length = 242

 Score = 32.3 bits (74), Expect = 0.22
 Identities = 13/30 (43%), Positives = 20/30 (66%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
           K EEKE+ + E+EE+   E EE+  E+E+ 
Sbjct: 65  KIEEKEKAKREKEEKGLRELEEDTPEDELA 94



 Score = 29.6 bits (67), Expect = 1.6
 Identities = 10/26 (38%), Positives = 19/26 (73%)

Query: 147 EEEEEEEEEEEEEEEEEEEEEEITAT 172
           E+E+++ ++  +EEE+EE+EEE    
Sbjct: 27  EDEDDDVKDSWDEEEDEEKEEEKAKV 52



 Score = 29.6 bits (67), Expect = 1.8
 Identities = 12/30 (40%), Positives = 18/30 (60%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
           K   K + EE+E+ + E+EE+   E EE T
Sbjct: 59  KKALKAKIEEKEKAKREKEEKGLRELEEDT 88



 Score = 29.2 bits (66), Expect = 2.1
 Identities = 10/32 (31%), Positives = 21/32 (65%)

Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
           + E+E+++ ++  +EEE+EE+EEE+       
Sbjct: 25  DDEDEDDDVKDSWDEEEDEEKEEEKAKVAAKA 56



 Score = 28.9 bits (65), Expect = 3.2
 Identities = 9/27 (33%), Positives = 21/27 (77%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
            E+++++ ++  +EEE+EE+EEE+ + 
Sbjct: 26  DEDEDDDVKDSWDEEEDEEKEEEKAKV 52



 Score = 28.5 bits (64), Expect = 4.5
 Identities = 11/32 (34%), Positives = 22/32 (68%)

Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
           E E+++ ++  +EEE+EE+EEE+ ++ A    
Sbjct: 27  EDEDDDVKDSWDEEEDEEKEEEKAKVAAKAKA 58



 Score = 28.1 bits (63), Expect = 6.0
 Identities = 11/40 (27%), Positives = 21/40 (52%)

Query: 132 KRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
           K + +       E++E+ + E+EE+   E EE+  E+  A
Sbjct: 55  KAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELA 94



 Score = 27.7 bits (62), Expect = 7.3
 Identities = 11/28 (39%), Positives = 17/28 (60%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           K + K E+EE+   E EE+  E+E  E+
Sbjct: 69  KEKAKREKEEKGLRELEEDTPEDELAEK 96


>gnl|CDD|220112 pfam09110, HAND, HAND.  The HAND domain adopts a secondary
           structure consisting of four alpha helices, three of
           which (H2, H3, H4) form an L-like configuration. Helix
           H2 runs antiparallel to helices H3 and H4, packing
           closely against helix H4, whilst helix H1 reposes in the
           concave surface formed by these three helices and runs
           perpendicular to them. The domain confers DNA and
           nucleosome binding properties to the protein.
          Length = 109

 Score = 31.1 bits (71), Expect = 0.23
 Identities = 14/42 (33%), Positives = 22/42 (52%), Gaps = 8/42 (19%)

Query: 134 RPRLYTGKGEEKEEEEEEEEEEEEEEEE--------EEEEEE 167
           +  L      ++E+EEE E + + E+E+        EEEEEE
Sbjct: 60  KVPLDDVGDGDEEDEEEREAKRKLEQEKIDNAEPLTEEEEEE 101



 Score = 27.2 bits (61), Expect = 4.5
 Identities = 13/35 (37%), Positives = 20/35 (57%), Gaps = 8/35 (22%)

Query: 142 GEEKEEEEEEEEEE--------EEEEEEEEEEEEE 168
            EE+E + + E+E+        EEEEEE++   EE
Sbjct: 74  EEEREAKRKLEQEKIDNAEPLTEEEEEEKQRLLEE 108



 Score = 26.5 bits (59), Expect = 8.8
 Identities = 9/25 (36%), Positives = 17/25 (68%)

Query: 145 KEEEEEEEEEEEEEEEEEEEEEEEI 169
            +  + +EE+EEE E + + E+E+I
Sbjct: 64  DDVGDGDEEDEEEREAKRKLEQEKI 88


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 728

 Score = 32.7 bits (75), Expect = 0.24
 Identities = 19/54 (35%), Positives = 31/54 (57%), Gaps = 3/54 (5%)

Query: 122 QKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
           +K L E+L   R+      K E K   EE+E+E+ + EEE+E+++E+     NP
Sbjct: 301 RKALEEQL---RQGEELRRKIEGKSVSEEDEDEDSDSEEEDEDDDEDDDDGENP 351



 Score = 30.4 bits (69), Expect = 1.4
 Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 5/39 (12%)

Query: 141 KGEEKEEEEEEEEEEE-----EEEEEEEEEEEEITATLN 174
           + E +++EE + E EE     E EEE +EEE E  +  N
Sbjct: 381 RAEARKKEENDAEIEELRRELEGEEESDEEENEEPSKKN 419



 Score = 28.5 bits (64), Expect = 6.3
 Identities = 11/29 (37%), Positives = 17/29 (58%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
           K + KEE +E EE  + + E   EE E++
Sbjct: 237 KEKLKEELKEFEELVKADPEAALEELEKL 265


>gnl|CDD|220172 pfam09309, FCP1_C, FCP1, C-terminal.  The C-terminal domain of
           FCP-1 is required for interaction with the carboxy
           terminal domain of RAP74. Interaction relies extensively
           on van der Waals contacts between hydrophobic residues
           situated within alpha-helices in both domains.
          Length = 263

 Score = 32.3 bits (73), Expect = 0.24
 Identities = 20/65 (30%), Positives = 28/65 (43%), Gaps = 7/65 (10%)

Query: 113 ERTGRKPWPQKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEE----EEEE 168
           E     P   +       G R PR +  K  E++EE+ E E   E   E+EE    E +E
Sbjct: 194 EPDEGTPSSSE---RSNIGGRGPRGHKRKLNEEDEEDAESESSFESSNEDEEGSSSEADE 250

Query: 169 ITATL 173
           + A L
Sbjct: 251 MAAAL 255


>gnl|CDD|220102 pfam09073, BUD22, BUD22.  BUD22 has been shown in yeast to be a
           nuclear protein involved in bud-site selection. It plays
           a role in positioning the proximal bud pole signal. More
           recently it has been shown to be involved in ribosome
           biogenesis.
          Length = 424

 Score = 32.5 bits (74), Expect = 0.25
 Identities = 9/31 (29%), Positives = 20/31 (64%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
             + E++ + EE  E++ ++EEEE+ + E+ 
Sbjct: 174 ESESEDESKSEESAEDDSDDEEEEDSDSEDY 204



 Score = 30.2 bits (68), Expect = 1.4
 Identities = 12/39 (30%), Positives = 21/39 (53%)

Query: 130 DGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
             + +        EE E E+E + EE  E++ ++EEEE+
Sbjct: 160 KKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEED 198



 Score = 29.4 bits (66), Expect = 2.3
 Identities = 12/33 (36%), Positives = 20/33 (60%)

Query: 138 YTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
                EE+ E E+E + EE  E++ ++EEEE +
Sbjct: 167 SDKDDEEESESEDESKSEESAEDDSDDEEEEDS 199



 Score = 29.4 bits (66), Expect = 2.6
 Identities = 14/54 (25%), Positives = 28/54 (51%), Gaps = 4/54 (7%)

Query: 116 GRKPWPQKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
           G +   +KG  ++   K +      + +E  ++++EEE E E+E + EE  E+ 
Sbjct: 141 GIETKAKKGKAKKKTKKSK----KKEAKESSDKDDEEESESEDESKSEESAEDD 190



 Score = 28.6 bits (64), Expect = 4.3
 Identities = 13/33 (39%), Positives = 23/33 (69%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
           T K ++KE +E  ++++EEE E E+E + E +A
Sbjct: 155 TKKSKKKEAKESSDKDDEEESESEDESKSEESA 187


>gnl|CDD|221203 pfam11748, DUF3306, Protein of unknown function (DUF3306).  This
           family of proteobacterial species proteins has no known
           function.
          Length = 115

 Score = 30.7 bits (70), Expect = 0.33
 Identities = 13/46 (28%), Positives = 19/46 (41%)

Query: 128 RLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATL 173
           +L  +        +  E+E          EEEEE E E+EE+   L
Sbjct: 10  KLAVRAEEPAEPAETAEEEAAAAAPAPAPEEEEEAELEDEELLEEL 55



 Score = 28.8 bits (65), Expect = 1.6
 Identities = 13/36 (36%), Positives = 13/36 (36%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPCGL 178
            E  EEE          EEEEE E E    L    L
Sbjct: 22  AETAEEEAAAAAPAPAPEEEEEAELEDEELLEELDL 57


>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex.  This
           entry is characterized by proteins with alternating
           conserved and low-complexity regions. Bud13 together
           with Snu17p and a newly identified factor,
           Pml1p/Ylr016c, form a novel trimeric complex. called The
           RES complex, pre-mRNA retention and splicing complex.
           Subunits of this complex are not essential for viability
           of yeasts but they are required for efficient splicing
           in vitro and in vivo. Furthermore, inactivation of this
           complex causes pre-mRNA leakage from the nucleus. Bud13
           contains a unique, phylogenetically conserved C-terminal
           region of unknown function.
          Length = 141

 Score = 31.1 bits (71), Expect = 0.36
 Identities = 14/30 (46%), Positives = 24/30 (80%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           +G+  + EE+ EE+E E+EE+E +EE+E+E
Sbjct: 8   SGRIIDIEEKREEKEREKEEKERKEEKEKE 37



 Score = 28.0 bits (63), Expect = 4.1
 Identities = 12/23 (52%), Positives = 19/23 (82%)

Query: 147 EEEEEEEEEEEEEEEEEEEEEEI 169
           EE+ EE+E E+EE+E +EE+E+ 
Sbjct: 15  EEKREEKEREKEEKERKEEKEKE 37


>gnl|CDD|223003 PHA03169, PHA03169, hypothetical protein; Provisional.
          Length = 413

 Score = 32.2 bits (73), Expect = 0.36
 Identities = 6/31 (19%), Positives = 10/31 (32%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
                 ++  E E+E  E E E        +
Sbjct: 222 APSPNTQQAVEHEDEPTEPEREGPPFPGHRS 252



 Score = 31.5 bits (71), Expect = 0.52
 Identities = 7/35 (20%), Positives = 11/35 (31%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATL 173
           T +       ++  E E+E  E E E         
Sbjct: 218 TPQQAPSPNTQQAVEHEDEPTEPEREGPPFPGHRS 252



 Score = 31.5 bits (71), Expect = 0.63
 Identities = 11/60 (18%), Positives = 14/60 (23%), Gaps = 3/60 (5%)

Query: 109 TNEVERTGRKPWPQKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           T E  R G K    +G      G     + +         EE       E       E  
Sbjct: 75  TAEESRHGEKEERGQGGPS---GSGSESVGSPTPSPSGSAEELASGLSPENTSGSSPESP 131



 Score = 31.1 bits (70), Expect = 0.66
 Identities = 6/36 (16%), Positives = 12/36 (33%), Gaps = 1/36 (2%)

Query: 142 GEEKEEEEEEEE-EEEEEEEEEEEEEEEITATLNPC 176
            ++      ++  E E+E  E E E        +  
Sbjct: 219 PQQAPSPNTQQAVEHEDEPTEPEREGPPFPGHRSHS 254



 Score = 30.7 bits (69), Expect = 1.0
 Identities = 4/29 (13%), Positives = 8/29 (27%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           G  E    E       ++     +   E+
Sbjct: 145 GPHEPAPPESHNPSPNQQPSSFLQPSHED 173



 Score = 30.3 bits (68), Expect = 1.4
 Identities = 6/29 (20%), Positives = 11/29 (37%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
               ++      ++  E E+E  E E E 
Sbjct: 216 SPTPQQAPSPNTQQAVEHEDEPTEPEREG 244



 Score = 30.3 bits (68), Expect = 1.4
 Identities = 4/29 (13%), Positives = 4/29 (13%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           G   E                  E    E
Sbjct: 125 GSSPESPASHSPPPSPPSHPGPHEPAPPE 153



 Score = 29.9 bits (67), Expect = 1.8
 Identities = 6/29 (20%), Positives = 10/29 (34%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
            +      +   E+  EE E    E E +
Sbjct: 160 NQQPSSFLQPSHEDSPEEPEPPTSEPEPD 188



 Score = 29.2 bits (65), Expect = 2.8
 Identities = 5/28 (17%), Positives = 5/28 (17%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
           G  EE       E       E       
Sbjct: 109 GSAEELASGLSPENTSGSSPESPASHSP 136



 Score = 28.8 bits (64), Expect = 3.7
 Identities = 5/27 (18%), Positives = 11/27 (40%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           GE +    ++      ++  E E+E  
Sbjct: 212 GEPQSPTPQQAPSPNTQQAVEHEDEPT 238



 Score = 28.8 bits (64), Expect = 3.8
 Identities = 5/28 (17%), Positives = 9/28 (32%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
               ++     +   E+  EE E    E
Sbjct: 157 PSPNQQPSSFLQPSHEDSPEEPEPPTSE 184



 Score = 28.8 bits (64), Expect = 4.4
 Identities = 4/29 (13%), Positives = 4/29 (13%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
                 EE       E       E     
Sbjct: 106 SPSGSAEELASGLSPENTSGSSPESPASH 134



 Score = 28.8 bits (64), Expect = 4.4
 Identities = 5/27 (18%), Positives = 10/27 (37%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
                ++      ++  E E+E  E E
Sbjct: 215 QSPTPQQAPSPNTQQAVEHEDEPTEPE 241



 Score = 28.4 bits (63), Expect = 5.2
 Identities = 6/29 (20%), Positives = 9/29 (31%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
                 E+  EE E    E E +     +
Sbjct: 166 FLQPSHEDSPEEPEPPTSEPEPDSPGPPQ 194



 Score = 28.4 bits (63), Expect = 5.8
 Identities = 8/31 (25%), Positives = 11/31 (35%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
              E+  EE E    E E +     + E  T
Sbjct: 169 PSHEDSPEEPEPPTSEPEPDSPGPPQSETPT 199



 Score = 28.0 bits (62), Expect = 7.2
 Identities = 6/29 (20%), Positives = 9/29 (31%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
                +   E+  EE E    E E +   
Sbjct: 163 PSSFLQPSHEDSPEEPEPPTSEPEPDSPG 191



 Score = 27.6 bits (61), Expect = 8.4
 Identities = 7/26 (26%), Positives = 12/26 (46%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           +   + E + E  EE    E+EE  +
Sbjct: 64  QGHRQTESDTETAEESRHGEKEERGQ 89



 Score = 27.6 bits (61), Expect = 8.8
 Identities = 2/27 (7%), Positives = 7/27 (25%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
              +       ++     +   E+  E
Sbjct: 150 APPESHNPSPNQQPSSFLQPSHEDSPE 176


>gnl|CDD|221756 pfam12757, DUF3812, Protein of unknown function (DUF3812).  This is
           a family of fungal proteins whose function is not known.
          Length = 126

 Score = 30.7 bits (70), Expect = 0.37
 Identities = 10/27 (37%), Positives = 16/27 (59%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
              EE++ +EEE + + EE +E E E 
Sbjct: 98  ARDEEKKLDEEEAKRQHEEAKEREREK 124



 Score = 29.6 bits (67), Expect = 0.94
 Identities = 11/26 (42%), Positives = 19/26 (73%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           EEK+ +EEE + + EE +E E E+++
Sbjct: 101 EEKKLDEEEAKRQHEEAKEREREKKK 126



 Score = 29.2 bits (66), Expect = 1.3
 Identities = 8/26 (30%), Positives = 17/26 (65%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
            ++  +EE++ +EEE + + EE +E 
Sbjct: 95  AQRARDEEKKLDEEEAKRQHEEAKER 120



 Score = 28.4 bits (64), Expect = 2.0
 Identities = 9/31 (29%), Positives = 18/31 (58%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
             + E +   +EE++ +EEE + + EE +E 
Sbjct: 90  DERAEAQRARDEEKKLDEEEAKRQHEEAKER 120



 Score = 26.5 bits (59), Expect = 9.4
 Identities = 9/26 (34%), Positives = 19/26 (73%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           +E+++ +EEE + + EE +E E E++
Sbjct: 100 DEEKKLDEEEAKRQHEEAKEREREKK 125


>gnl|CDD|223130 COG0052, RpsB, Ribosomal protein S2 [Translation, ribosomal
           structure and biogenesis].
          Length = 252

 Score = 31.5 bits (72), Expect = 0.38
 Identities = 17/48 (35%), Positives = 21/48 (43%), Gaps = 9/48 (18%)

Query: 130 DGKRRPRLYT---------GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           D  R   L           G+G   +EEE   EE+EE EE E +EE  
Sbjct: 204 DAIRSIALIYWLLARAILEGRGGALDEEEAAIEEDEEVEEFEAKEEAA 251


>gnl|CDD|215153 PLN02271, PLN02271, serine hydroxymethyltransferase.
          Length = 586

 Score = 32.1 bits (73), Expect = 0.39
 Identities = 14/26 (53%), Positives = 21/26 (80%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           E+KEE+EE+  EE +E+EEE+ E+E 
Sbjct: 53  EQKEEKEEDAGEEGDEDEEEQGEDEH 78



 Score = 30.2 bits (68), Expect = 1.5
 Identities = 12/29 (41%), Positives = 20/29 (68%)

Query: 136 RLYTGKGEEKEEEEEEEEEEEEEEEEEEE 164
           +L   K E++E+  EE +E+EEE+ E+E 
Sbjct: 50  QLLEQKEEKEEDAGEEGDEDEEEQGEDEH 78


>gnl|CDD|219956 pfam08658, Rad54_N, Rad54 N terminal.  This is the N terminal of
           the DNA repair protein Rad54.
          Length = 191

 Score = 31.2 bits (71), Expect = 0.40
 Identities = 12/36 (33%), Positives = 18/36 (50%), Gaps = 2/36 (5%)

Query: 137 LY--TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
           LY  T   + K EEE+ E+++E EE E +       
Sbjct: 136 LYDPTVDDKPKIEEEKAEKDQEPEESETKLSNGPKV 171



 Score = 30.0 bits (68), Expect = 1.2
 Identities = 9/30 (30%), Positives = 17/30 (56%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
           ++K + EEE+ E+++E EE E +       
Sbjct: 142 DDKPKIEEEKAEKDQEPEESETKLSNGPKV 171



 Score = 27.3 bits (61), Expect = 8.9
 Identities = 9/28 (32%), Positives = 17/28 (60%)

Query: 146 EEEEEEEEEEEEEEEEEEEEEEEITATL 173
           EEE+ E+++E EE E +     ++  +L
Sbjct: 148 EEEKAEKDQEPEESETKLSNGPKVHKSL 175


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 31.8 bits (72), Expect = 0.45
 Identities = 12/29 (41%), Positives = 24/29 (82%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           GK EEKE+E+ +EE+++++E+ +EE ++ 
Sbjct: 103 GKEEEKEKEQVKEEKKKKKEKPKEEPKDR 131



 Score = 29.9 bits (67), Expect = 2.1
 Identities = 12/36 (33%), Positives = 24/36 (66%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
            K E  +EEE+E+E+ +EE+++++E+ +E      P
Sbjct: 98  PKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKP 133



 Score = 29.1 bits (65), Expect = 3.6
 Identities = 11/40 (27%), Positives = 25/40 (62%)

Query: 129 LDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
              K         G+E+E+E+E+ +EE+++++E+ +EE +
Sbjct: 90  AKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPK 129



 Score = 28.3 bits (63), Expect = 6.7
 Identities = 13/43 (30%), Positives = 25/43 (58%)

Query: 126 IERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           +E+   K          E K E  +EEE+E+E+ +EE+++++E
Sbjct: 80  VEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKE 122



 Score = 27.9 bits (62), Expect = 8.9
 Identities = 11/36 (30%), Positives = 18/36 (50%)

Query: 133 RRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           +RP     K +EK+ EE  + EEE++ E    +   
Sbjct: 140 KRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRP 175



 Score = 27.5 bits (61), Expect = 9.1
 Identities = 9/30 (30%), Positives = 23/30 (76%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
               EE++E+E+ +EE+++++E+ +EE ++
Sbjct: 101 ESGKEEEKEKEQVKEEKKKKKEKPKEEPKD 130


>gnl|CDD|221668 pfam12619, MCM2_N, Mini-chromosome maintenance protein 2.  This
           domain family is found in eukaryotes, and is typically
           between 138 and 153 amino acids in length. The family is
           found in association with pfam00493. Mini-chromosome
           maintenance (MCM) proteins are essential for DNA
           replication. These proteins use ATPase activity to
           perform this function.
          Length = 145

 Score = 30.8 bits (70), Expect = 0.48
 Identities = 12/27 (44%), Positives = 20/27 (74%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
           +E +EEEE E++ ++ +EE EEEE+  
Sbjct: 2   DEPDEEEELEDDIDDLDEEAEEEEDGE 28



 Score = 30.4 bits (69), Expect = 0.55
 Identities = 12/26 (46%), Positives = 21/26 (80%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           E++ +EEEE E++ ++ +EE EEEE+
Sbjct: 1   EDEPDEEEELEDDIDDLDEEAEEEED 26



 Score = 30.0 bits (68), Expect = 0.91
 Identities = 11/26 (42%), Positives = 20/26 (76%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
            ++EEE E++ ++ +EE EEEE+ E+
Sbjct: 4   PDEEEELEDDIDDLDEEAEEEEDGED 29



 Score = 28.4 bits (64), Expect = 2.8
 Identities = 11/25 (44%), Positives = 19/25 (76%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEE 166
            EE+E E++ ++ +EE EEEE+ E+
Sbjct: 5   DEEEELEDDIDDLDEEAEEEEDGED 29


>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi.  This family includes
           the fifth essential DNA polymerase in yeast EC:2.7.7.7.
           Pol5p is localised exclusively to the nucleolus and
           binds near or at the enhancer region of rRNA-encoding
           DNA repeating units.
          Length = 784

 Score = 31.8 bits (72), Expect = 0.51
 Identities = 13/42 (30%), Positives = 21/42 (50%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPCGLCRILP 183
            E+ E E E + E+ EE+E+E++ E           + R LP
Sbjct: 668 IEDSESESESDGEDGEEDEQEDDAEANEGVVPIDKAVRRALP 709



 Score = 31.0 bits (70), Expect = 0.92
 Identities = 11/26 (42%), Positives = 16/26 (61%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           +E E E  E+ E E E + E+ EE+E
Sbjct: 661 DEDECEAIEDSESESESDGEDGEEDE 686



 Score = 30.6 bits (69), Expect = 1.3
 Identities = 10/26 (38%), Positives = 17/26 (65%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           E +  E+ E E E + E+ EE+E+E+
Sbjct: 664 ECEAIEDSESESESDGEDGEEDEQED 689



 Score = 29.8 bits (67), Expect = 2.3
 Identities = 10/31 (32%), Positives = 18/31 (58%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
             +  E E E + E+ EE+E+E++ E  E +
Sbjct: 667 AIEDSESESESDGEDGEEDEQEDDAEANEGV 697



 Score = 29.5 bits (66), Expect = 2.7
 Identities = 10/25 (40%), Positives = 17/25 (68%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEE 167
           E  E+ E E E + E+ EE+E+E++
Sbjct: 666 EAIEDSESESESDGEDGEEDEQEDD 690



 Score = 29.5 bits (66), Expect = 3.0
 Identities = 10/26 (38%), Positives = 17/26 (65%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           E++ E  E+ E E E + E+ EE+E+
Sbjct: 662 EDECEAIEDSESESESDGEDGEEDEQ 687



 Score = 28.7 bits (64), Expect = 5.2
 Identities = 10/29 (34%), Positives = 17/29 (58%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
           EE +++E+E E  E+ E E E + E+   
Sbjct: 656 EETDDDEDECEAIEDSESESESDGEDGEE 684



 Score = 28.7 bits (64), Expect = 5.4
 Identities = 10/29 (34%), Positives = 17/29 (58%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
            + ++ E+E E  E+ E E E + E+ EE
Sbjct: 656 EETDDDEDECEAIEDSESESESDGEDGEE 684



 Score = 28.3 bits (63), Expect = 6.9
 Identities = 11/27 (40%), Positives = 20/27 (74%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           GEE++E++ EE +++E+E E  E+ E 
Sbjct: 647 GEEEDEDDLEETDDDEDECEAIEDSES 673



 Score = 27.9 bits (62), Expect = 8.5
 Identities = 11/27 (40%), Positives = 20/27 (74%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
           ++  E EEE+E++ EE +++E+E E I
Sbjct: 642 QQLFEGEEEDEDDLEETDDDEDECEAI 668



 Score = 27.9 bits (62), Expect = 9.2
 Identities = 9/28 (32%), Positives = 15/28 (53%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
             E  ++E+E E  E+ E E E + E+ 
Sbjct: 655 LEETDDDEDECEAIEDSESESESDGEDG 682


>gnl|CDD|237378 PRK13406, bchD, magnesium chelatase subunit D; Provisional.
          Length = 584

 Score = 31.5 bits (72), Expect = 0.51
 Identities = 8/33 (24%), Positives = 15/33 (45%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
           +  EEE        E++++  E+EEE     + 
Sbjct: 263 QPPEEEPPPPPPPPEDDDDPPEDEEEQDDAEDR 295



 Score = 30.8 bits (70), Expect = 0.92
 Identities = 7/26 (26%), Positives = 13/26 (50%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
            +  EEE        E++++  E+EE
Sbjct: 262 PQPPEEEPPPPPPPPEDDDDPPEDEE 287



 Score = 29.2 bits (66), Expect = 2.7
 Identities = 6/26 (23%), Positives = 12/26 (46%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
             +  EEE        E++++  E+E
Sbjct: 261 PPQPPEEEPPPPPPPPEDDDDPPEDE 286



 Score = 28.8 bits (65), Expect = 3.9
 Identities = 5/26 (19%), Positives = 11/26 (42%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
              +  EEE        E++++  E+
Sbjct: 260 APPQPPEEEPPPPPPPPEDDDDPPED 285



 Score = 28.5 bits (64), Expect = 5.4
 Identities = 9/27 (33%), Positives = 18/27 (66%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
              E++++  E+EEE+++ E+   EEI
Sbjct: 274 PPPEDDDDPPEDEEEQDDAEDRALEEI 300


>gnl|CDD|151173 pfam10669, Phage_Gp23, Protein gp23 (Bacteriophage A118).  This is
           the highly conserved family of the major tail subunit
           protein.
          Length = 121

 Score = 30.1 bits (67), Expect = 0.55
 Identities = 12/32 (37%), Positives = 22/32 (68%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
           E KEE ++ E E E+ ++E +EE ++  +T+N
Sbjct: 47  EMKEERDKMETEREKRDKESKEERDKFISTMN 78


>gnl|CDD|217530 pfam03390, 2HCT, 2-hydroxycarboxylate transporter family.  The
           2-hydroxycarboxylate transporter family is a family of
           secondary transporters found exclusively in the
           bacterial kingdom. They function in the metabolism of
           the di- and tricarboxylates malate and citrate, mostly
           in fermentative pathways involving decarboxylation of
           malate or oxaloacetate.
          Length = 414

 Score = 31.3 bits (72), Expect = 0.55
 Identities = 16/36 (44%), Positives = 24/36 (66%), Gaps = 2/36 (5%)

Query: 124 GLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEE 159
           GL+ +L GK++P L TG G+   +E +   EEEE+E
Sbjct: 207 GLLNKL-GKKKPSL-TGNGQLVRKESDLLVEEEEKE 240


>gnl|CDD|227563 COG5238, RNA1, Ran GTPase-activating protein (RanGAP) involved in
           mRNA processing and transport [Signal transduction
           mechanisms / RNA processing and modification].
          Length = 388

 Score = 31.4 bits (71), Expect = 0.55
 Identities = 11/31 (35%), Positives = 17/31 (54%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
           T +  ++E E+E E +E   EE  E E  E+
Sbjct: 344 TEESTDEESEDEVEIDESVIEEVAEMELLEV 374



 Score = 30.7 bits (69), Expect = 1.1
 Identities = 11/30 (36%), Positives = 15/30 (50%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
              EE E+E E +E   EE  E E  E ++
Sbjct: 347 STDEESEDEVEIDESVIEEVAEMELLEVQV 376



 Score = 30.3 bits (68), Expect = 1.5
 Identities = 10/30 (33%), Positives = 17/30 (56%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
              E++E +   EE  +EE E+E E +E +
Sbjct: 333 EVVEKQEGDVVTEESTDEESEDEVEIDESV 362



 Score = 29.9 bits (67), Expect = 1.8
 Identities = 11/29 (37%), Positives = 15/29 (51%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           G    +E  +EE E+E E +E   EE  E
Sbjct: 340 GDVVTEESTDEESEDEVEIDESVIEEVAE 368



 Score = 29.1 bits (65), Expect = 3.0
 Identities = 10/27 (37%), Positives = 14/27 (51%)

Query: 146 EEEEEEEEEEEEEEEEEEEEEEEITAT 172
             EE  +EE E+E E +E   EE+   
Sbjct: 343 VTEESTDEESEDEVEIDESVIEEVAEM 369



 Score = 28.0 bits (62), Expect = 7.4
 Identities = 10/30 (33%), Positives = 14/30 (46%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           T +  E E E +E   EE  E E  E + +
Sbjct: 348 TDEESEDEVEIDESVIEEVAEMELLEVQVD 377



 Score = 28.0 bits (62), Expect = 7.7
 Identities = 9/26 (34%), Positives = 14/26 (53%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           +E +   EE  +EE E+E E +E   
Sbjct: 338 QEGDVVTEESTDEESEDEVEIDESVI 363



 Score = 27.6 bits (61), Expect = 8.8
 Identities = 12/32 (37%), Positives = 16/32 (50%), Gaps = 2/32 (6%)

Query: 144 EKEE--EEEEEEEEEEEEEEEEEEEEEITATL 173
           EK+E     EE  +EE E+E E +E  I    
Sbjct: 336 EKQEGDVVTEESTDEESEDEVEIDESVIEEVA 367



 Score = 27.6 bits (61), Expect = 9.9
 Identities = 9/31 (29%), Positives = 16/31 (51%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATL 173
           E++ E +E   EE  E E  E + +++   L
Sbjct: 353 EDEVEIDESVIEEVAEMELLEVQVDDLAERL 383


>gnl|CDD|129854 TIGR00771, DcuC, c4-dicarboxylate anaerobic carrier family protein.
            These proteins are members of the C4-dicarboxylate
           Uptake C (DcuC) Family (TC 2.A.61). The only
           functionally characterized member of this family is the
           anaerobic C4-dicarboxylate transporter (DcuC) of
           Escherichia coli. DcuC has 12 GES predicted
           transmembrane regions, is induced only under anaerobic
           conditions, and is not repressed by glucose. It may
           therefore function as a succinate efflux system during
           anaerobic glucose fermentation. However, when
           overexpressed, it can replace either DcuA or DcuB in
           catalyzing fumarate-succinate exchange and fumarate
           uptake [Transport and binding proteins, Carbohydrates,
           organic alcohols, and acids].
          Length = 388

 Score = 31.4 bits (71), Expect = 0.56
 Identities = 15/63 (23%), Positives = 29/63 (46%), Gaps = 3/63 (4%)

Query: 152 EEEEEEEEEEEEEEEEEITATLNPCGLCRILPVMVHIVVGLTSTLSIKLMQDDLHIVKDL 211
           E++++     E+E E ++   +    L  ILP++  I++ L       L Q  LH    +
Sbjct: 165 EDKKQAYIAHEKESEIKVIEGVP--VLYAILPLIPLIIL-LLFIGGTGLQQVPLHEWTKM 221

Query: 212 VLP 214
            +P
Sbjct: 222 GVP 224


>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355).  This
           family of proteins is found in bacteria and viruses.
           Proteins in this family are typically between 180 and
           214 amino acids in length.
          Length = 125

 Score = 30.3 bits (69), Expect = 0.56
 Identities = 14/30 (46%), Positives = 17/30 (56%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
           EEK E E E+ E+E EE E E    E+ A 
Sbjct: 44  EEKAEYELEKLEKELEELEAELARRELKAE 73



 Score = 26.8 bits (60), Expect = 7.5
 Identities = 11/24 (45%), Positives = 15/24 (62%)

Query: 147 EEEEEEEEEEEEEEEEEEEEEEIT 170
            EE+ E E E+ E+E EE E E+ 
Sbjct: 43  AEEKAEYELEKLEKELEELEAELA 66


>gnl|CDD|219905 pfam08564, CDC37_C, Cdc37 C terminal domain.  Cdc37 is a protein
           required for the activity of numerous eukaryotic protein
           kinases. This domains corresponds to the C terminal
           domain whose function is unclear. It is found C terminal
           to the Hsp90 chaperone (Heat shocked protein 90) binding
           domain pfam08565 and the N terminal kinase binding
           domain of Cdc37 pfam03234.
          Length = 89

 Score = 29.6 bits (67), Expect = 0.58
 Identities = 16/27 (59%), Positives = 21/27 (77%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           GE++ +E EEE  EEE E+EE EEE+E
Sbjct: 63  GEKEFKELEEEYNEEEAEKEEIEEEDE 89



 Score = 28.5 bits (64), Expect = 1.2
 Identities = 17/27 (62%), Positives = 20/27 (74%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEE 166
           G+ E KE EEE  EEE E+EE EEE+E
Sbjct: 63  GEKEFKELEEEYNEEEAEKEEIEEEDE 89



 Score = 28.5 bits (64), Expect = 1.5
 Identities = 16/29 (55%), Positives = 19/29 (65%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
               EKE +E EEE  EEE E+EE EEE+
Sbjct: 60  KIEGEKEFKELEEEYNEEEAEKEEIEEED 88



 Score = 27.7 bits (62), Expect = 2.2
 Identities = 15/30 (50%), Positives = 18/30 (60%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
             E E+E +E EEE  EEE E+EE EE   
Sbjct: 60  KIEGEKEFKELEEEYNEEEAEKEEIEEEDE 89



 Score = 27.7 bits (62), Expect = 2.4
 Identities = 16/29 (55%), Positives = 20/29 (68%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
            K E ++E +E EEE  EEE E+EE EEE
Sbjct: 59  AKIEGEKEFKELEEEYNEEEAEKEEIEEE 87



 Score = 27.3 bits (61), Expect = 2.9
 Identities = 15/27 (55%), Positives = 18/27 (66%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
             K E E+E +E EEE  EEE E+EEI
Sbjct: 58  NAKIEGEKEFKELEEEYNEEEAEKEEI 84


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 31.6 bits (71), Expect = 0.58
 Identities = 13/27 (48%), Positives = 18/27 (66%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEE 167
           + E+K  EE E E+E+E+E E E E E
Sbjct: 591 EAEQKAREEREREKEKEKERERERERE 617



 Score = 30.4 bits (68), Expect = 1.5
 Identities = 15/40 (37%), Positives = 23/40 (57%)

Query: 128 RLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
           +L  KR   +   K E +++  EE E E+E+E+E E E E
Sbjct: 576 KLAKKREEAVEKAKREAEQKAREEREREKEKEKERERERE 615



 Score = 30.4 bits (68), Expect = 1.6
 Identities = 12/22 (54%), Positives = 16/22 (72%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEE 164
           EE+E E+E+E+E E E E E E
Sbjct: 598 EEREREKEKEKEREREREREAE 619



 Score = 29.7 bits (66), Expect = 2.7
 Identities = 13/28 (46%), Positives = 18/28 (64%)

Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEEITA 171
           E E++  EE E E+E+E+E E E E  A
Sbjct: 591 EAEQKAREEREREKEKEKEREREREREA 618



 Score = 29.3 bits (65), Expect = 3.0
 Identities = 12/23 (52%), Positives = 16/23 (69%)

Query: 145 KEEEEEEEEEEEEEEEEEEEEEE 167
           +EE E E+E+E+E E E E E E
Sbjct: 597 REEREREKEKEKEREREREREAE 619



 Score = 29.3 bits (65), Expect = 3.0
 Identities = 13/29 (44%), Positives = 18/29 (62%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
           ++  EE E E+E+E+E E E E E E  A
Sbjct: 594 QKAREEREREKEKEKEREREREREAERAA 622


>gnl|CDD|220377 pfam09747, DUF2052, Coiled-coil domain containing protein
           (DUF2052).  This entry is of sequences of two conserved
           domains separated by a region of low complexity,
           spanning some 200 residues. The function is unknown.
          Length = 178

 Score = 30.9 bits (70), Expect = 0.60
 Identities = 15/31 (48%), Positives = 20/31 (64%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
           + E + E  E++EEEE  EEEEE+ EEE   
Sbjct: 66  REEARLELLEQQEEEESAEEEEEDPEEENEE 96



 Score = 30.1 bits (68), Expect = 0.85
 Identities = 19/26 (73%), Positives = 22/26 (84%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           E++EEEE  EEEEE+ EEE EEEEEE
Sbjct: 75  EQQEEEESAEEEEEDPEEENEEEEEE 100



 Score = 30.1 bits (68), Expect = 0.89
 Identities = 17/26 (65%), Positives = 21/26 (80%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           +++EEE  EEEEE+ EEE EEEEEE 
Sbjct: 76  QQEEEESAEEEEEDPEEENEEEEEEY 101



 Score = 29.7 bits (67), Expect = 1.4
 Identities = 18/26 (69%), Positives = 21/26 (80%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
            E++EEEE  EEEEE+ EEE EEEEE
Sbjct: 74  LEQQEEEESAEEEEEDPEEENEEEEE 99



 Score = 28.9 bits (65), Expect = 2.3
 Identities = 18/34 (52%), Positives = 22/34 (64%)

Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEEITATLNPCG 177
            +++EEEE  EEEEE+ EEE EEEE      P G
Sbjct: 74  LEQQEEEESAEEEEEDPEEENEEEEEEYQRGPFG 107



 Score = 28.2 bits (63), Expect = 4.4
 Identities = 21/46 (45%), Positives = 25/46 (54%), Gaps = 2/46 (4%)

Query: 125 LIERLDGK--RRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           L   LD +  R   L   + EE  EEEEE+ EEE EEEEEE +   
Sbjct: 60  LESSLDREEARLELLEQQEEEESAEEEEEDPEEENEEEEEEYQRGP 105



 Score = 27.4 bits (61), Expect = 7.3
 Identities = 14/31 (45%), Positives = 17/31 (54%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
             E+EEE+ EEE EEEEEE +     E    
Sbjct: 82  SAEEEEEDPEEENEEEEEEYQRGPFGEEEEE 112



 Score = 27.0 bits (60), Expect = 9.1
 Identities = 12/28 (42%), Positives = 17/28 (60%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
           + +E   E  E++EEEE  EEEEE+   
Sbjct: 65  DREEARLELLEQQEEEESAEEEEEDPEE 92



 Score = 27.0 bits (60), Expect = 9.9
 Identities = 16/29 (55%), Positives = 17/29 (58%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
           EE  EEE EEEEEE +     EEEEE   
Sbjct: 87  EEDPEEENEEEEEEYQRGPFGEEEEEDGD 115


>gnl|CDD|221586 pfam12457, TIP_N, Tuftelin interacting protein N terminal.  This
           domain family is found in eukaryotes, and is typically
           between 99 and 114 amino acids in length. The family is
           found in association with pfam08697, pfam01585. There
           are two completely conserved residues (G and F) that may
           be functionally important. TIP is involved in enamel
           assembly by interacting with one of the major proteins
           responsible for biomineralisation of enamel - tuftelin.
          Length = 106

 Score = 29.6 bits (67), Expect = 0.60
 Identities = 10/60 (16%), Positives = 30/60 (50%), Gaps = 11/60 (18%)

Query: 120 WPQKGLIERLDGKRRPRLYTG-----------KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
             ++         ++ + Y+              +E EE+++E+E E+++E +E+++E++
Sbjct: 47  EGERRRGRGRRRSKKKKDYSKPVAFVSGGIKQAAKEPEEDDKEDESEDDDESDEDDDEDD 106


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
           family consists of several hypothetical bacterial
           proteins of around 200 residues in length. The function
           of this family is unknown.
          Length = 214

 Score = 30.9 bits (70), Expect = 0.61
 Identities = 15/29 (51%), Positives = 20/29 (68%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
            + EE EEE EEE+EE  +E E+E EE+ 
Sbjct: 86  KEDEESEEENEEEDEESSDENEKETEEKT 114



 Score = 30.9 bits (70), Expect = 0.72
 Identities = 14/30 (46%), Positives = 21/30 (70%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
           + EE++EE  +E E+E EE+ E   E+EIT
Sbjct: 94  ENEEEDEESSDENEKETEEKTESNVEKEIT 123



 Score = 30.5 bits (69), Expect = 0.94
 Identities = 16/32 (50%), Positives = 23/32 (71%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
           KG+ ++E+EE EEE EEE+EE  +E E+ T  
Sbjct: 81  KGDAEKEDEESEEENEEEDEESSDENEKETEE 112



 Score = 30.1 bits (68), Expect = 1.00
 Identities = 14/28 (50%), Positives = 19/28 (67%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
             E +EE EEE+EE  +E E+E EE+ E
Sbjct: 88  DEESEEENEEEDEESSDENEKETEEKTE 115



 Score = 30.1 bits (68), Expect = 1.3
 Identities = 15/35 (42%), Positives = 19/35 (54%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
             EE EEE+EE  +E E+E EE+ E        NP
Sbjct: 91  SEEENEEEDEESSDENEKETEEKTESNVEKEITNP 125



 Score = 29.7 bits (67), Expect = 1.4
 Identities = 15/30 (50%), Positives = 22/30 (73%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
               E+KE++ + E+E+EE EEE EEE+EE
Sbjct: 72  AANSEDKEDKGDAEKEDEESEEENEEEDEE 101



 Score = 29.7 bits (67), Expect = 1.6
 Identities = 16/30 (53%), Positives = 20/30 (66%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
                EKE+EE EEE EEE+EE  +E E+E
Sbjct: 80  DKGDAEKEDEESEEENEEEDEESSDENEKE 109



 Score = 28.9 bits (65), Expect = 2.8
 Identities = 14/30 (46%), Positives = 20/30 (66%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
           E+K + E+E+EE EEE EEE+EE  +    
Sbjct: 79  EDKGDAEKEDEESEEENEEEDEESSDENEK 108



 Score = 28.9 bits (65), Expect = 3.0
 Identities = 14/30 (46%), Positives = 21/30 (70%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
             K ++ + E+E+EE EEE EEE+EE  +E
Sbjct: 76  EDKEDKGDAEKEDEESEEENEEEDEESSDE 105



 Score = 27.8 bits (62), Expect = 7.3
 Identities = 13/29 (44%), Positives = 20/29 (68%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
            K EEKE    E++E++ + E+E+EE EE
Sbjct: 65  VKEEEKEAANSEDKEDKGDAEKEDEESEE 93


>gnl|CDD|235562 PRK05687, fliH, flagellar assembly protein H; Validated.
          Length = 246

 Score = 31.1 bits (71), Expect = 0.62
 Identities = 10/31 (32%), Positives = 10/31 (32%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATL 173
            E    E      E  E EE  EEE    T 
Sbjct: 28  PEPPPPEPAAPPPEPPEPEEVPEEEAAPLTE 58



 Score = 30.3 bits (69), Expect = 1.1
 Identities = 8/27 (29%), Positives = 9/27 (33%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
             +    E      E  E EE  EEE 
Sbjct: 27  PPEPPPPEPAAPPPEPPEPEEVPEEEA 53



 Score = 29.1 bits (66), Expect = 2.4
 Identities = 11/29 (37%), Positives = 12/29 (41%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
                 E  E EE  EEE     EEE+ A
Sbjct: 35  PAAPPPEPPEPEEVPEEEAAPLTEEELEA 63



 Score = 28.7 bits (65), Expect = 2.9
 Identities = 12/28 (42%), Positives = 12/28 (42%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
               E  E EE  EEE     EEE E I
Sbjct: 37  APPPEPPEPEEVPEEEAAPLTEEELEAI 64



 Score = 28.7 bits (65), Expect = 3.2
 Identities = 8/27 (29%), Positives = 8/27 (29%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
               E    E      E  E EE  EE
Sbjct: 25  DPPPEPPPPEPAAPPPEPPEPEEVPEE 51


>gnl|CDD|237549 PRK13897, PRK13897, type IV secretion system component VirD4;
           Provisional.
          Length = 606

 Score = 31.3 bits (71), Expect = 0.67
 Identities = 10/33 (30%), Positives = 19/33 (57%)

Query: 135 PRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
           PR    +  E+ E     E  EE+EE+++E+++
Sbjct: 574 PRANNNEASEETETITVPENNEEDEEDDDEDDD 606



 Score = 29.4 bits (66), Expect = 3.0
 Identities = 10/28 (35%), Positives = 16/28 (57%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
             E  EE E     E  EE+EE+++E++
Sbjct: 578 NNEASEETETITVPENNEEDEEDDDEDD 605



 Score = 29.0 bits (65), Expect = 3.9
 Identities = 8/27 (29%), Positives = 16/27 (59%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
              +E E     E  EE+EE+++E+++
Sbjct: 580 EASEETETITVPENNEEDEEDDDEDDD 606


>gnl|CDD|100111 cd05833, Ribosomal_P2, Ribosomal protein P2. This subfamily
           represents the eukaryotic large ribosomal protein P2.
           Eukaryotic P1 and P2 are functionally equivalent to the
           bacterial protein L7/L12, but are not homologous to
           L7/L12. P2 is located in the L12 stalk, with proteins
           P1, P0, L11, and 28S rRNA. P1 and P2 are the only
           proteins in the ribosome to occur as multimers, always
           appearing as sets of heterodimers. Recent data indicate
           that eukaryotes have four copies (two heterodimers),
           while most archaeal species contain six copies of L12p
           (three homodimers). Bacteria may have four or six copies
           of L7/L12 (two or three homodimers) depending on the
           species. Experiments using S. cerevisiae P1 and P2
           indicate that P1 proteins are positioned more internally
           with limited reactivity in the C-terminal domains, while
           P2 proteins seem to be more externally located and are
           more likely to interact with other cellular components.
           In lower eukaryotes, P1 and P2 are further subdivided
           into P1A, P1B, P2A, and P2B, which form P1A/P2B and
           P1B/P2A heterodimers. Some plants have a third
           P-protein, called P3, which is not homologous to P1 and
           P2. In humans, P1 and P2 are strongly autoimmunogenic.
           They play a significant role in the etiology and
           pathogenesis of systemic lupus erythema (SLE). In
           addition, the ribosome-inactivating protein
           trichosanthin (TCS) interacts with human P0, P1, and P2,
           with its primary binding site in the C-terminal region
           of P2. TCS inactivates the ribosome by depurinating a
           specific adenine in the sarcin-ricin loop of 28S rRNA.
          Length = 109

 Score = 29.5 bits (67), Expect = 0.67
 Identities = 7/24 (29%), Positives = 13/24 (54%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEE 163
                    ++EE++EE EEE ++
Sbjct: 78  AAAAAAAAAKKEEKKEESEEESDD 101



 Score = 29.5 bits (67), Expect = 0.81
 Identities = 8/21 (38%), Positives = 13/21 (61%)

Query: 139 TGKGEEKEEEEEEEEEEEEEE 159
                 K+EE++EE EEE ++
Sbjct: 81  AAAAAAKKEEKKEESEEESDD 101



 Score = 28.4 bits (64), Expect = 1.8
 Identities = 7/23 (30%), Positives = 13/23 (56%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEE 161
                   ++EE++EE EEE ++
Sbjct: 79  AAAAAAAAKKEEKKEESEEESDD 101



 Score = 28.4 bits (64), Expect = 1.9
 Identities = 7/18 (38%), Positives = 13/18 (72%)

Query: 147 EEEEEEEEEEEEEEEEEE 164
              ++EE++EE EEE ++
Sbjct: 84  AAAKKEEKKEESEEESDD 101



 Score = 28.4 bits (64), Expect = 1.9
 Identities = 7/18 (38%), Positives = 13/18 (72%)

Query: 148 EEEEEEEEEEEEEEEEEE 165
              ++EE++EE EEE ++
Sbjct: 84  AAAKKEEKKEESEEESDD 101



 Score = 28.4 bits (64), Expect = 1.9
 Identities = 7/18 (38%), Positives = 13/18 (72%)

Query: 149 EEEEEEEEEEEEEEEEEE 166
              ++EE++EE EEE ++
Sbjct: 84  AAAKKEEKKEESEEESDD 101



 Score = 28.4 bits (64), Expect = 1.9
 Identities = 7/18 (38%), Positives = 13/18 (72%)

Query: 150 EEEEEEEEEEEEEEEEEE 167
              ++EE++EE EEE ++
Sbjct: 84  AAAKKEEKKEESEEESDD 101



 Score = 28.4 bits (64), Expect = 1.9
 Identities = 7/18 (38%), Positives = 13/18 (72%)

Query: 151 EEEEEEEEEEEEEEEEEE 168
              ++EE++EE EEE ++
Sbjct: 84  AAAKKEEKKEESEEESDD 101


>gnl|CDD|215973 pfam00527, E7, E7 protein, Early protein. 
          Length = 92

 Score = 29.1 bits (66), Expect = 0.67
 Identities = 19/68 (27%), Positives = 33/68 (48%), Gaps = 10/68 (14%)

Query: 147 EEEEEEEEEEEEEEEEEEEEEEITA--TLNPCGLCRILPVMVHIVVGLTSTLSIKLMQ-- 202
            E+ ++ EEEEE+EE+  +         + PCG C      + +VV L++   I+ +Q  
Sbjct: 24  YEQLDDSEEEEEDEEDRGQPPAQQPYRIVTPCGGCDKT---LRLVV-LSTDEDIRTLQQL 79

Query: 203 --DDLHIV 208
               L +V
Sbjct: 80  LLGTLSLV 87


>gnl|CDD|184886 PRK14892, PRK14892, putative transcription elongation factor Elf1;
           Provisional.
          Length = 99

 Score = 29.6 bits (67), Expect = 0.68
 Identities = 14/28 (50%), Positives = 15/28 (53%)

Query: 136 RLYTGKGEEKEEEEEEEEEEEEEEEEEE 163
               GK E KE + EE EEE E E E E
Sbjct: 72  LYLEGKIEIKERKNEETEEENEIEGESE 99



 Score = 28.1 bits (63), Expect = 2.2
 Identities = 13/25 (52%), Positives = 15/25 (60%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEE 167
           E K E +E + EE EEE E E E E
Sbjct: 75  EGKIEIKERKNEETEEENEIEGESE 99



 Score = 26.9 bits (60), Expect = 4.8
 Identities = 12/26 (46%), Positives = 15/26 (57%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
            E + E +E + EE EEE E E E E
Sbjct: 74  LEGKIEIKERKNEETEEENEIEGESE 99


>gnl|CDD|218555 pfam05320, Pox_RNA_Pol_19, Poxvirus DNA-directed RNA polymerase 19
           kDa subunit.  This family contains several DNA-directed
           RNA polymerase 19 kDa polypeptides. The Poxvirus
           DNA-directed RNA polymerase (EC: 2.7.7.6) catalyzes
           DNA-template-directed extension of the 3'-end of an RNA
           strand by one nucleotide at a time.
          Length = 167

 Score = 30.4 bits (69), Expect = 0.69
 Identities = 11/25 (44%), Positives = 18/25 (72%)

Query: 148 EEEEEEEEEEEEEEEEEEEEEITAT 172
           E ++++ EE EEEEE+EE+ E   +
Sbjct: 11  ESDDDDSEEYEEEEEDEEDAESLES 35



 Score = 30.1 bits (68), Expect = 0.93
 Identities = 12/24 (50%), Positives = 18/24 (75%)

Query: 144 EKEEEEEEEEEEEEEEEEEEEEEE 167
           E ++++ EE EEEEE+EE+ E  E
Sbjct: 11  ESDDDDSEEYEEEEEDEEDAESLE 34



 Score = 30.1 bits (68), Expect = 1.0
 Identities = 13/33 (39%), Positives = 20/33 (60%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
           E  +++ EE EEEEE+EE+ E  E    ++L  
Sbjct: 11  ESDDDDSEEYEEEEEDEEDAESLESSDVSSLKQ 43



 Score = 27.8 bits (62), Expect = 4.8
 Identities = 10/27 (37%), Positives = 18/27 (66%)

Query: 146 EEEEEEEEEEEEEEEEEEEEEEEITAT 172
            + E ++++ EE EEEEE+EE+  +  
Sbjct: 8   IDYESDDDDSEEYEEEEEDEEDAESLE 34



 Score = 27.4 bits (61), Expect = 6.7
 Identities = 11/25 (44%), Positives = 16/25 (64%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEE 166
            ++  EE EEEEE+EE+ E  E  +
Sbjct: 13  DDDDSEEYEEEEEDEEDAESLESSD 37



 Score = 27.4 bits (61), Expect = 7.3
 Identities = 11/31 (35%), Positives = 20/31 (64%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATL 173
           +  +  + E ++++ EE EEEEE+EE   +L
Sbjct: 3   DSDDIIDYESDDDDSEEYEEEEEDEEDAESL 33



 Score = 27.0 bits (60), Expect = 9.2
 Identities = 10/26 (38%), Positives = 19/26 (73%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           E+ ++  + E ++++ EE EEEEE+E
Sbjct: 2   EDSDDIIDYESDDDDSEEYEEEEEDE 27


>gnl|CDD|221250 pfam11831, Myb_Cef, pre-mRNA splicing factor component.  This
           family is a region of the Myb-Related Cdc5p/Cef1
           proteins, in fungi, and is part of the pre-mRNA splicing
           factor complex.
          Length = 363

 Score = 31.2 bits (71), Expect = 0.69
 Identities = 16/28 (57%), Positives = 18/28 (64%)

Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEEITA 171
           E E  EEEEEE EE EEE EE+  +  A
Sbjct: 146 ELELPEEEEEEPEEMEEELEEDAADRDA 173



 Score = 28.1 bits (63), Expect = 6.2
 Identities = 14/25 (56%), Positives = 17/25 (68%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEE 167
           E  EEEEEE EE EEE EE+  + +
Sbjct: 148 ELPEEEEEEPEEMEEELEEDAADRD 172



 Score = 28.1 bits (63), Expect = 7.1
 Identities = 9/27 (33%), Positives = 16/27 (59%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
            EE+ EE+  + +  +   EE +E+EE
Sbjct: 160 MEEELEEDAADRDARKRAAEEAKEQEE 186



 Score = 27.7 bits (62), Expect = 8.0
 Identities = 15/25 (60%), Positives = 17/25 (68%)

Query: 145 KEEEEEEEEEEEEEEEEEEEEEEEI 169
           K + E E E  EEEEEE EE EEE+
Sbjct: 140 KPKNEFELELPEEEEEEPEEMEEEL 164



 Score = 27.7 bits (62), Expect = 8.1
 Identities = 16/26 (61%), Positives = 17/26 (65%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
            + E E E  EEEEEE EE EEE EE
Sbjct: 141 PKNEFELELPEEEEEEPEEMEEELEE 166


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 31.2 bits (71), Expect = 0.71
 Identities = 12/33 (36%), Positives = 18/33 (54%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
           EE +++ EEE ++E   E  E E E I   + P
Sbjct: 243 EESDDDGEEESDDESAWEGFESEYEPINKPVRP 275



 Score = 30.1 bits (68), Expect = 1.4
 Identities = 10/32 (31%), Positives = 15/32 (46%)

Query: 145 KEEEEEEEEEEEEEEEEEEEEEEEITATLNPC 176
            EE +++ EEE ++E   E  E E      P 
Sbjct: 242 LEESDDDGEEESDDESAWEGFESEYEPINKPV 273



 Score = 29.7 bits (67), Expect = 2.3
 Identities = 11/43 (25%), Positives = 22/43 (51%)

Query: 126 IERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           +ER++ K+  ++        E  E   EE +++ EEE ++E  
Sbjct: 216 LERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESA 258



 Score = 28.9 bits (65), Expect = 3.1
 Identities = 10/31 (32%), Positives = 18/31 (58%)

Query: 145 KEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
            +EE E+E + E++ +E E  EE+    + P
Sbjct: 199 LQEEYEKEVKAEKKRQELERVEEKKLEKMAP 229



 Score = 28.9 bits (65), Expect = 3.4
 Identities = 9/26 (34%), Positives = 15/26 (57%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
            E+ +++ EEE ++E   E  E E E
Sbjct: 242 LEESDDDGEEESDDESAWEGFESEYE 267



 Score = 28.5 bits (64), Expect = 4.5
 Identities = 10/33 (30%), Positives = 16/33 (48%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
           E   EE +++ EEE ++E   E  E     +N 
Sbjct: 239 EGLLEESDDDGEEESDDESAWEGFESEYEPINK 271



 Score = 28.5 bits (64), Expect = 5.0
 Identities = 10/29 (34%), Positives = 19/29 (65%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
           EE E+E + E++ +E E  EE++ E++  
Sbjct: 201 EEYEKEVKAEKKRQELERVEEKKLEKMAP 229



 Score = 28.5 bits (64), Expect = 5.1
 Identities = 9/28 (32%), Positives = 16/28 (57%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           +G  +E +++ EEE ++E   E  E E 
Sbjct: 239 EGLLEESDDDGEEESDDESAWEGFESEY 266


>gnl|CDD|184900 PRK14907, rplD, 50S ribosomal protein L4; Provisional.
          Length = 295

 Score = 31.1 bits (70), Expect = 0.76
 Identities = 15/37 (40%), Positives = 18/37 (48%)

Query: 106 TKVTNEVERTGRKPWPQKGLIERLDGKRRPRLYTGKG 142
            K   EV  TG+KPW QKG      G  R  ++ G G
Sbjct: 139 VKTRAEVSGTGKKPWRQKGTGRARAGSTRSPIFVGGG 175


>gnl|CDD|184539 PRK14150, PRK14150, heat shock protein GrpE; Provisional.
          Length = 193

 Score = 30.3 bits (69), Expect = 0.77
 Identities = 12/28 (42%), Positives = 19/28 (67%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
           +E+  EE E E E++EE +E E E+E+ 
Sbjct: 10  QEQVSEEIEAEVEQQEEADEAELEDELD 37



 Score = 29.9 bits (68), Expect = 1.1
 Identities = 13/35 (37%), Positives = 21/35 (60%), Gaps = 1/35 (2%)

Query: 143 EEKEEEEE-EEEEEEEEEEEEEEEEEEITATLNPC 176
           E+K  +E+  EE E E E++EE +E E+   L+  
Sbjct: 5   EQKTPQEQVSEEIEAEVEQQEEADEAELEDELDEA 39



 Score = 29.1 bits (66), Expect = 2.0
 Identities = 11/30 (36%), Positives = 18/30 (60%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
           EE++  +E+  EE E E E++EE +E    
Sbjct: 4   EEQKTPQEQVSEEIEAEVEQQEEADEAELE 33



 Score = 27.2 bits (61), Expect = 8.9
 Identities = 11/32 (34%), Positives = 18/32 (56%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
             +E++  +E+  EE E E E++EE   A L 
Sbjct: 2   SNEEQKTPQEQVSEEIEAEVEQQEEADEAELE 33



 Score = 27.2 bits (61), Expect = 9.7
 Identities = 11/27 (40%), Positives = 17/27 (62%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
            E+ E E E++EE +E E E+E +E  
Sbjct: 14  SEEIEAEVEQQEEADEAELEDELDEAD 40


>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
           [Intracellular trafficking and secretion].
          Length = 776

 Score = 31.1 bits (70), Expect = 0.78
 Identities = 14/33 (42%), Positives = 17/33 (51%)

Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEEITATLNPC 176
           EK E E    EE E EEE E+ E+E   +   C
Sbjct: 371 EKMEIENRNPEESEHEEEVEDYEDENDHSKRIC 403



 Score = 30.3 bits (68), Expect = 1.6
 Identities = 12/25 (48%), Positives = 15/25 (60%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEE 167
           E+ E E    EE E EEE E+ E+E
Sbjct: 371 EKMEIENRNPEESEHEEEVEDYEDE 395



 Score = 30.0 bits (67), Expect = 2.1
 Identities = 11/29 (37%), Positives = 16/29 (55%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
           E +  EE E EEE E+ E+E +  + I  
Sbjct: 376 ENRNPEESEHEEEVEDYEDENDHSKRICD 404



 Score = 29.6 bits (66), Expect = 2.6
 Identities = 22/76 (28%), Positives = 33/76 (43%), Gaps = 5/76 (6%)

Query: 92  IEWQKKYK---WVRFDHTKVTNEVERTGRKPWPQKGLIERLDGKRRPRLYTGKGEEKEEE 148
           +++  KY    W   ++ K  NE     RK +        L+ +    LY  K E +   
Sbjct: 322 LQFLSKYIGCLWRNLNNEKWGNEEAE--RKDYILDSSSVPLEKQFDDILYFEKMEIENRN 379

Query: 149 EEEEEEEEEEEEEEEE 164
            EE E EEE E+ E+E
Sbjct: 380 PEESEHEEEVEDYEDE 395



 Score = 29.6 bits (66), Expect = 2.8
 Identities = 11/26 (42%), Positives = 13/26 (50%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
              E+ E E    EE E EEE E+ E
Sbjct: 368 LYFEKMEIENRNPEESEHEEEVEDYE 393


>gnl|CDD|223649 COG0576, GrpE, Molecular chaperone GrpE (heat shock protein)
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 193

 Score = 30.4 bits (69), Expect = 0.79
 Identities = 19/27 (70%), Positives = 21/27 (77%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
            EK EEEE EEEE EEE E EEE++EI
Sbjct: 19  AEKSEEEEAEEEEPEEENELEEEQQEI 45



 Score = 30.0 bits (68), Expect = 1.0
 Identities = 18/29 (62%), Positives = 19/29 (65%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
             EE EE E+ EEEE EEEE EEE E E 
Sbjct: 12  DAEETEEAEKSEEEEAEEEEPEEENELEE 40



 Score = 29.2 bits (66), Expect = 1.8
 Identities = 16/36 (44%), Positives = 19/36 (52%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
             +  + EE EE E+ EEEE EEEE EEE       
Sbjct: 7   KTEEPDAEETEEAEKSEEEEAEEEEPEEENELEEEQ 42



 Score = 28.8 bits (65), Expect = 2.5
 Identities = 18/34 (52%), Positives = 20/34 (58%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
             K EE + EE EE E+ EEEE EEEE EE    
Sbjct: 5   EQKTEEPDAEETEEAEKSEEEEAEEEEPEEENEL 38



 Score = 28.8 bits (65), Expect = 2.6
 Identities = 18/31 (58%), Positives = 22/31 (70%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATL 173
           E ++ EEEE EEEE EEE E EEE++  A L
Sbjct: 18  EAEKSEEEEAEEEEPEEENELEEEQQEIAEL 48



 Score = 28.1 bits (63), Expect = 4.4
 Identities = 17/32 (53%), Positives = 20/32 (62%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
           E+K EE + EE EE E+ EEEE EEE     N
Sbjct: 5   EQKTEEPDAEETEEAEKSEEEEAEEEEPEEEN 36



 Score = 27.7 bits (62), Expect = 6.4
 Identities = 17/28 (60%), Positives = 21/28 (75%)

Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEEITA 171
           E+EE EEEE EEE E EEE++E  E+ A
Sbjct: 23  EEEEAEEEEPEEENELEEEQQEIAELEA 50



 Score = 27.7 bits (62), Expect = 6.5
 Identities = 17/27 (62%), Positives = 20/27 (74%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEE 167
           K EE+E EEEE EEE E EEE++E  E
Sbjct: 21  KSEEEEAEEEEPEEENELEEEQQEIAE 47



 Score = 27.3 bits (61), Expect = 8.4
 Identities = 17/36 (47%), Positives = 20/36 (55%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
             + EE EEEE EEE E EEE++E  E E     L 
Sbjct: 21  KSEEEEAEEEEPEEENELEEEQQEIAELEAQLEELK 56


>gnl|CDD|129865 TIGR00783, ccs, citrate carrier protein, CCS family.  These
           proteins are members of the Citrate:Cation Symporter
           (CCS) Family (TC 2.A.24). These proteins have 12 GES
           predicted transmembrane regions. Most members of the CCS
           family catalyze citrate uptake with either Na+ or H+ as
           the cotransported cation. However, one member is
           specific for L-malate and probably functions by a proton
           symport mechanism [Unclassified, Role category not yet
           assigned].
          Length = 347

 Score = 30.8 bits (70), Expect = 0.81
 Identities = 16/40 (40%), Positives = 25/40 (62%), Gaps = 2/40 (5%)

Query: 124 GLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEE 163
           GL+ R+ GK+RP+L  G GE    E+ E+ E+ +E  E +
Sbjct: 138 GLLSRI-GKKRPKL-NGHGELVRSEKREDAEKAKEITEIK 175


>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
           [Transcription / Chromatin structure and dynamics].
          Length = 279

 Score = 30.7 bits (69), Expect = 0.90
 Identities = 13/38 (34%), Positives = 17/38 (44%)

Query: 130 DGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
           D    P        E+EE  EE +  EEEE+EE   + 
Sbjct: 163 DNDEAPPAQPDVDNEEEERLEESDGREEEEDEEVGSDS 200



 Score = 29.6 bits (66), Expect = 2.0
 Identities = 15/36 (41%), Positives = 18/36 (50%)

Query: 136 RLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
           R+   +GEE+E EEE     E E EEE   EE    
Sbjct: 232 RIDKKQGEEEEMEEEVINLFEIEWEEESPSEEVPRN 267



 Score = 29.2 bits (65), Expect = 2.9
 Identities = 12/27 (44%), Positives = 18/27 (66%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
            + + + EEEE  EE +  EEEE+EE+
Sbjct: 170 AQPDVDNEEEERLEESDGREEEEDEEV 196



 Score = 28.0 bits (62), Expect = 6.0
 Identities = 16/38 (42%), Positives = 21/38 (55%)

Query: 138 YTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
           Y G+  +K++ EEEE EEE     E E EEE  +   P
Sbjct: 228 YEGERIDKKQGEEEEMEEEVINLFEIEWEEESPSEEVP 265


>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
           subunit G; Reviewed.
          Length = 197

 Score = 30.4 bits (68), Expect = 0.90
 Identities = 17/44 (38%), Positives = 29/44 (65%), Gaps = 4/44 (9%)

Query: 125 LIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           L ER++ +RR      K +E+E++EE+E E EE+  EE+ ++ E
Sbjct: 138 LEERMEWERREE----KIDEREDQEEQEREREEQTIEEQSDDSE 177



 Score = 29.3 bits (65), Expect = 2.1
 Identities = 18/52 (34%), Positives = 29/52 (55%)

Query: 122 QKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATL 173
           +K LI R +   + R    + EE+ E E  EE+ +E E++EE+E E    T+
Sbjct: 118 KKSLIIRQEQIEKARQEREELEERMEWERREEKIDEREDQEEQEREREEQTI 169


>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 142

 Score = 29.6 bits (67), Expect = 0.90
 Identities = 12/29 (41%), Positives = 23/29 (79%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
           +EK ++E E+EE +++ EE++ ++EE TA
Sbjct: 56  DEKWKKETEDEEFQQKREEKKRKDEEKTA 84



 Score = 28.9 bits (65), Expect = 2.0
 Identities = 14/37 (37%), Positives = 21/37 (56%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
             KG +KEE+E  +  EE  +EEEE EE++    +  
Sbjct: 103 AKKGNKKEEKEGSKSSEESSDEEEEGEEDKQEEPVEI 139


>gnl|CDD|220379 pfam09750, DRY_EERY, Alternative splicing regulator.  This entry
           represents the conserved N-terminal region of SWAP
           (suppressor-of-white-apricot protein) proteins. This
           region contains two highly conserved motifs, viz: DRY
           and EERY, which appear to be the sites for alternative
           splicing of exons 2 and 3 of the SWAP mRNA. These
           proteins are thus thought to be involved in
           auto-regulation of pre-mRNA splicing. Most family
           members are associated with two Surp domains pfam01805
           and an Arginine- serine-rich binding region towards the
           C-terminus.
          Length = 121

 Score = 29.3 bits (66), Expect = 0.94
 Identities = 23/66 (34%), Positives = 29/66 (43%), Gaps = 16/66 (24%)

Query: 119 PW--PQKGLIERLDGK----RRPRLYTGKGEEKEEEEEEEEEEEEEEE----------EE 162
           PW   Q  LI+R D +      PR    +   K +EE+E EE    EE          EE
Sbjct: 30  PWQGDQDNLIDRFDVRALLDYIPRGSRNRRPYKTDEEDEAEERFLNEERYRDLIKNDFEE 89

Query: 163 EEEEEE 168
            E+EEE
Sbjct: 90  AEKEEE 95


>gnl|CDD|219922 pfam08595, RXT2_N, RXT2-like, N-terminal.  The family represents
           the N-terminal region of RXT2-like proteins. In S.
           cerevisiae, RXT2 has been demonstrated to be involved in
           conjugation with cellular fusion (mating) and invasive
           growth. A high throughput localisation study has
           localised RXT2 to the nucleus.
          Length = 141

 Score = 29.7 bits (67), Expect = 0.95
 Identities = 13/50 (26%), Positives = 26/50 (52%), Gaps = 7/50 (14%)

Query: 132 KRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEE-------EEITATLN 174
              P      G + ++++E++E++EE + E+++E        EEI A L 
Sbjct: 41  FNNPPRIDEDGGDIDDDDEDDEDDEEADAEDDDENPYKLIRLEEILAPLT 90



 Score = 28.5 bits (64), Expect = 2.2
 Identities = 9/35 (25%), Positives = 22/35 (62%)

Query: 147 EEEEEEEEEEEEEEEEEEEEEEITATLNPCGLCRI 181
           +E+  + ++++E++E++EE +      NP  L R+
Sbjct: 48  DEDGGDIDDDDEDDEDDEEADAEDDDENPYKLIRL 82


>gnl|CDD|147051 pfam04698, MOBP_C-Myrip, Myelin-associated oligodendrocytic basic
           protein (MOBP).  MOBP is abundantly expressed in central
           nervous system myelin, and shares several
           characteristics with myelin basic protein (MBP), in
           terms of regional distribution and function. This family
           is the middle and C-terminal regions of MOBP which has
           been shown to be essential for normal arrangement of the
           radial component in central nervous system myelin. Most
           member-proteins carry a FVHE-PHD type zinc-finger at
           their N-terminus.
          Length = 710

 Score = 30.9 bits (69), Expect = 0.99
 Identities = 23/90 (25%), Positives = 39/90 (43%), Gaps = 21/90 (23%)

Query: 106 TKVTNEVERTGRKPWPQKGLIERLDGKRRPRLY----------TGKGEEKEEEEEEEEEE 155
           T ++NE +         + L +  + K R RLY          T  GE++E E + E E 
Sbjct: 414 TDISNEAQDP-------QSLTDSTEEKLRNRLYELAMKMSEKETSSGEDQESEPKAEPEN 466

Query: 156 EEE----EEEEEEEEEEITATLNPCGLCRI 181
           ++E    E+  +  +EE+    +   LC I
Sbjct: 467 QKESLSSEDNNQGVQEELKKKCSAVSLCNI 496


>gnl|CDD|212055 cd11486, SLC5sbd_SGLT1, Na(+)/glucose cotransporter SGLT1;solute
           binding domain.  Human SGLT1 (hSGLT1) is a
           high-affinity/low-capacity glucose transporter, which
           can also transport galactose. In the transport
           mechanism, two Na+ ions first bind to the extracellular
           side of the transporter and induce a conformational
           change in the glucose binding site. This results in an
           increased affinity for glucose. A second conformational
           change in the transporter follows, bringing the Na+ and
           glucose binding sites to the inner surface of the
           membrane. Glucose is then released, followed by the Na+
           ions. In the process, hSGLT1 is also able to transport
           water and urea and may be a major pathway for transport
           of these across the intestinal brush-border membrane.
           hSGLT1 is encoded by the SLC5A1 gene and expressed
           mostly in the intestine, but also in the trachea,
           kidney, heart, brain, testis, and prostate. The
           WHO/UNICEF oral rehydration solution (ORS) for the
           treatment of secretory diarrhea contains salt and
           glucose. The glucose, along with sodium ions, is
           transported by hSGLT1 and water is either co-transported
           along with these or follows by osmosis. Mutations in
           SGLT1 are associated with intestinal glucose galactose
           malabsorption (GGM). Up-regulation of intestinal SGLT1
           may protect against enteric infections. SGLT1 is
           expressed in colorectal, head and neck, and prostate
           tumors. Epidermal growth factor receptor (EGFR)
           functions in cell survival by stabilizing SGLT1, and
           thereby maintaining intracellular glucose levels. SGLT1
           is predicted to have 14 membrane-spanning regions. This
           subgroup belongs to the solute carrier 5
           (SLC5)transporter family.
          Length = 635

 Score = 31.0 bits (70), Expect = 0.99
 Identities = 9/26 (34%), Positives = 17/26 (65%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           EE+ + + ++  E+E+E E E +EE 
Sbjct: 542 EERIDLDADDWTEDEDENEMETDEER 567



 Score = 28.3 bits (63), Expect = 6.0
 Identities = 7/22 (31%), Positives = 15/22 (68%)

Query: 144 EKEEEEEEEEEEEEEEEEEEEE 165
           + ++  E+E+E E E +EE ++
Sbjct: 548 DADDWTEDEDENEMETDEERKK 569


>gnl|CDD|221333 pfam11942, Spt5_N, Spt5 transcription elongation factor, acidic
           N-terminal.  This is the very acidic N-terminal region
           of the early transcription elongation factor Spt5. The
           Spt5-Spt4 complex regulates early transcription
           elongation by RNA polymerase II and has an imputed role
           in pre-mRNA processing via its physical association with
           mRNA capping enzymes. The actual function of this
           N-terminal domain is not known although it is
           dispensable for binding to Spt4.
          Length = 92

 Score = 28.9 bits (65), Expect = 1.00
 Identities = 12/26 (46%), Positives = 18/26 (69%)

Query: 146 EEEEEEEEEEEEEEEEEEEEEEEITA 171
           + E E ++EEEEEEEEE++ E+    
Sbjct: 3   DTEAEVDDEEEEEEEEEDDLEDLSDE 28



 Score = 28.9 bits (65), Expect = 1.0
 Identities = 13/26 (50%), Positives = 20/26 (76%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
            E E ++EEEEEEEEE++ E+  +E+
Sbjct: 4   TEAEVDDEEEEEEEEEDDLEDLSDED 29



 Score = 28.2 bits (63), Expect = 1.5
 Identities = 12/28 (42%), Positives = 17/28 (60%)

Query: 146 EEEEEEEEEEEEEEEEEEEEEEEITATL 173
            E E ++EEEEEEEEE++ E+       
Sbjct: 4   TEAEVDDEEEEEEEEEDDLEDLSDEDEF 31



 Score = 28.2 bits (63), Expect = 1.9
 Identities = 14/26 (53%), Positives = 21/26 (80%)

Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEEI 169
           E ++EEEEEEEEE++ E+  +E+E I
Sbjct: 7   EVDDEEEEEEEEEDDLEDLSDEDEFI 32



 Score = 27.4 bits (61), Expect = 3.1
 Identities = 13/26 (50%), Positives = 19/26 (73%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           E  +EEEEEEEEE++ E+  +E+E  
Sbjct: 7   EVDDEEEEEEEEEDDLEDLSDEDEFI 32



 Score = 27.4 bits (61), Expect = 3.7
 Identities = 12/26 (46%), Positives = 20/26 (76%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEE 167
            +E+EEEEEEE++ E+  +E+E  +E
Sbjct: 9   DDEEEEEEEEEDDLEDLSDEDEFIDE 34



 Score = 26.6 bits (59), Expect = 6.0
 Identities = 12/31 (38%), Positives = 19/31 (61%)

Query: 146 EEEEEEEEEEEEEEEEEEEEEEEITATLNPC 176
           EEEEEEEEE++ E+  +E+E  +     +  
Sbjct: 11  EEEEEEEEEDDLEDLSDEDEFIDEAEAEDDR 41



 Score = 26.6 bits (59), Expect = 6.3
 Identities = 13/29 (44%), Positives = 21/29 (72%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
           E + ++EEEEEEEEE++ E+  +E+E   
Sbjct: 5   EAEVDDEEEEEEEEEDDLEDLSDEDEFID 33



 Score = 26.3 bits (58), Expect = 8.0
 Identities = 12/25 (48%), Positives = 20/25 (80%)

Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEE 168
           ++EEEEEEEE++ E+  +E+E  +E
Sbjct: 10  DEEEEEEEEEDDLEDLSDEDEFIDE 34


>gnl|CDD|150531 pfam09871, DUF2098, Uncharacterized protein conserved in archaea
           (DUF2098).  This domain, found in various hypothetical
           prokaryotic proteins, has no known function.
          Length = 91

 Score = 28.8 bits (65), Expect = 1.0
 Identities = 15/32 (46%), Positives = 21/32 (65%)

Query: 138 YTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
            T K +EK+EE EE++EE  E  ++EEE  E 
Sbjct: 48  VTDKVKEKKEEREEDKEELIERIKKEEETFED 79



 Score = 27.7 bits (62), Expect = 2.3
 Identities = 13/27 (48%), Positives = 20/27 (74%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
           E+KEE EE++EE  E  ++EEE  E++
Sbjct: 54  EKKEEREEDKEELIERIKKEEETFEDV 80



 Score = 26.9 bits (60), Expect = 4.2
 Identities = 12/30 (40%), Positives = 19/30 (63%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
             K EE+EE++EE  E  ++EEE  E+ + 
Sbjct: 53  KEKKEEREEDKEELIERIKKEEETFEDVDL 82


>gnl|CDD|219009 pfam06404, PSK, Phytosulfokine precursor protein (PSK).  This
           family consists of several plant specific phytosulfokine
           precursor proteins. Phytosulfokines, are active as
           either a pentapeptide or a C-terminally truncated
           tetrapeptide. These compounds were first isolated
           because of their ability to stimulate cell division in
           somatic embryo cultures of Asparagus officinalis.
          Length = 74

 Score = 28.5 bits (64), Expect = 1.1
 Identities = 11/28 (39%), Positives = 15/28 (53%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEE 166
           + K E  E  + +E+E   E E E EEE
Sbjct: 26  SSKIEAAESSKVDEDEVVSECEGEGEEE 53



 Score = 27.7 bits (62), Expect = 2.0
 Identities = 9/27 (33%), Positives = 14/27 (51%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
            + +  E  + +E+E   E E E EEE
Sbjct: 27  SKIEAAESSKVDEDEVVSECEGEGEEE 53



 Score = 26.9 bits (60), Expect = 3.4
 Identities = 10/29 (34%), Positives = 13/29 (44%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
                E  E  + +E+E   E E E EEE
Sbjct: 25  ESSKIEAAESSKVDEDEVVSECEGEGEEE 53


>gnl|CDD|221775 pfam12794, MscS_TM, Mechanosensitive ion channel inner membrane
           domain 1.  The small mechanosensitive channel, MscS, is
           a part of the turgor-driven solute efflux system that
           protects bacteria from lysis in the event of osmotic
           shock. The MscS protein alone is sufficient to form a
           functional mechanosensitive channel gated directly by
           tension in the lipid bilayer. The MscS proteins are
           heptamers of three transmembrane subunits with seven
           converging M3 domains, and this domain is one of the
           inner membrane domains.
          Length = 339

 Score = 30.3 bits (69), Expect = 1.1
 Identities = 11/27 (40%), Positives = 13/27 (48%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
            E   +  EEEEE  E   E  EE E+
Sbjct: 268 AEILAQRAEEEEESSEGAAETIEEPEL 294


>gnl|CDD|222366 pfam13764, E3_UbLigase_R4, E3 ubiquitin-protein ligase UBR4.  This
           is a family of E## ubiquitin ligase enzymes.
          Length = 794

 Score = 30.8 bits (70), Expect = 1.1
 Identities = 15/36 (41%), Positives = 20/36 (55%)

Query: 147 EEEEEEEEEEEEEEEEEEEEEEITATLNPCGLCRIL 182
           EE  EE E++ EEEE+ E E  I + L  CG    +
Sbjct: 87  EEFIEELEDDREEEEDPEVEFAIASVLQECGGLETM 122


>gnl|CDD|237084 PRK12378, PRK12378, hypothetical protein; Provisional.
          Length = 235

 Score = 30.2 bits (69), Expect = 1.2
 Identities = 10/30 (33%), Positives = 19/30 (63%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
           +G++++E  E   + + + E+ EEEE  IT
Sbjct: 140 EGDDEDELLEALIDADVDVEDVEEEEGTIT 169


>gnl|CDD|114603 pfam05887, Trypan_PARP, Procyclic acidic repetitive protein (PARP).
            This family consists of several Trypanosoma brucei
           procyclic acidic repetitive protein (PARP) like
           sequences. The procyclic acidic repetitive protein
           (parp) genes of Trypanosoma brucei encode a small family
           of abundant surface proteins whose expression is
           restricted to the procyclic form of the parasite. They
           are found at two unlinked loci, parpA and parpB;
           transcription of both loci is developmentally regulated.
          Length = 145

 Score = 29.5 bits (65), Expect = 1.2
 Identities = 18/46 (39%), Positives = 24/46 (52%)

Query: 123 KGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           KG+ +   GK       G  +    + ++E EEEEE E EEE EEE
Sbjct: 34  KGITKGGKGKGEKGTKVGADDTNGTDPDDEPEEEEEPEPEEEGEEE 79



 Score = 29.2 bits (64), Expect = 1.5
 Identities = 18/27 (66%), Positives = 19/27 (70%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
             E+EEE E EEE EEE E EEE EEE
Sbjct: 63  EPEEEEEPEPEEEGEEEPEPEEEGEEE 89



 Score = 28.8 bits (63), Expect = 2.3
 Identities = 19/37 (51%), Positives = 19/37 (51%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
               EE EEE E EEE EEE E EE  EEE      P
Sbjct: 70  PEPEEEGEEEPEPEEEGEEEPEPEETGEEEPEPEPEP 106



 Score = 28.0 bits (61), Expect = 3.7
 Identities = 24/60 (40%), Positives = 26/60 (43%)

Query: 116 GRKPWPQKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
           G K   +KG     D            EE+E E EEE EEE E EEE EEE E   T   
Sbjct: 39  GGKGKGEKGTKVGADDTNGTDPDDEPEEEEEPEPEEEGEEEPEPEEEGEEEPEPEETGEE 98


>gnl|CDD|217940 pfam04177, TAP42, TAP42-like family.  The TOR signalling pathway
           activates a cell-growth program in response to
           nutrients. TIP41 (pfam04176) interacts with TAP42 and
           negatively regulates the TOR signaling pathway.
          Length = 335

 Score = 30.3 bits (69), Expect = 1.2
 Identities = 10/28 (35%), Positives = 19/28 (67%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           +G  +     EEEE++E++EE+++EE  
Sbjct: 295 QGGGEAAASAEEEEDDEDDEEDDDEETL 322



 Score = 30.0 bits (68), Expect = 1.5
 Identities = 11/27 (40%), Positives = 19/27 (70%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           G E     EEEE++E++EE+++EE  +
Sbjct: 297 GGEAAASAEEEEDDEDDEEDDDEETLK 323



 Score = 28.8 bits (65), Expect = 3.6
 Identities = 12/28 (42%), Positives = 20/28 (71%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
           G GE     EEEE++E++EE+++EE  +
Sbjct: 296 GGGEAAASAEEEEDDEDDEEDDDEETLK 323



 Score = 27.6 bits (62), Expect = 8.0
 Identities = 11/29 (37%), Positives = 19/29 (65%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           G  +   E     EEEE++E++EE+++EE
Sbjct: 292 GVPQGGGEAAASAEEEEDDEDDEEDDDEE 320



 Score = 27.6 bits (62), Expect = 8.8
 Identities = 11/29 (37%), Positives = 19/29 (65%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
              E     EEEE++E++EE+++EE + A
Sbjct: 296 GGGEAAASAEEEEDDEDDEEDDDEETLKA 324


>gnl|CDD|149439 pfam08375, Rpn3_C, Proteasome regulatory subunit C-terminal.  This
           eukaryotic domain is found at the C-terminus of 26S
           proteasome regulatory subunits such as the non-ATPase
           Rpn3 subunit which is essential for proteasomal
           function. It occurs together with the PCI/PINT domain
           (pfam01399).
          Length = 68

 Score = 28.1 bits (63), Expect = 1.3
 Identities = 13/35 (37%), Positives = 19/35 (54%), Gaps = 3/35 (8%)

Query: 138 YTGKGEEKEEEEEEEEEEEEEEEEE---EEEEEEI 169
           Y    E+KEEE+ EE  E ++ E E   E  E ++
Sbjct: 32  YPPDKEKKEEEKAEEARERDQLELELAKELSEGDL 66


>gnl|CDD|226501 COG4014, COG4014, Uncharacterized protein conserved in archaea
           [Function unknown].
          Length = 97

 Score = 28.7 bits (64), Expect = 1.3
 Identities = 15/40 (37%), Positives = 20/40 (50%)

Query: 138 YTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPCG 177
              K E KEE EE+ +E  E   +E+EE E +   L  C 
Sbjct: 55  VVDKSEIKEEREEDIDEVVERIRKEKEEFELVDLGLGVCC 94


>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT.  This family
           consists of several bacterial cobalamin biosynthesis
           (CobT) proteins. CobT is involved in the transformation
           of precorrin-3 into cobyrinic acid.
          Length = 282

 Score = 30.2 bits (68), Expect = 1.4
 Identities = 9/26 (34%), Positives = 19/26 (73%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           E+ E+E++ +E+E++++ EEEE    
Sbjct: 222 EDNEDEDDPKEDEDDDQGEEEESGSS 247



 Score = 29.4 bits (66), Expect = 2.0
 Identities = 8/26 (30%), Positives = 20/26 (76%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEE 167
           G+E E  + E+ E+E++ +E+E++++
Sbjct: 213 GDEPESADSEDNEDEDDPKEDEDDDQ 238



 Score = 29.4 bits (66), Expect = 2.0
 Identities = 9/26 (34%), Positives = 20/26 (76%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
            + E+ E+E++ +E+E++++ EEEE 
Sbjct: 219 ADSEDNEDEDDPKEDEDDDQGEEEES 244



 Score = 29.0 bits (65), Expect = 2.8
 Identities = 9/26 (34%), Positives = 20/26 (76%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           E  + E+ E+E++ +E+E++++ EEE
Sbjct: 217 ESADSEDNEDEDDPKEDEDDDQGEEE 242



 Score = 29.0 bits (65), Expect = 3.3
 Identities = 9/32 (28%), Positives = 24/32 (75%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
           + ++ E+E++ +E+E++++ EEEE   + +L+
Sbjct: 220 DSEDNEDEDDPKEDEDDDQGEEEESGSSDSLS 251



 Score = 28.6 bits (64), Expect = 4.3
 Identities = 9/29 (31%), Positives = 20/29 (68%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           G   E  + E+ E+E++ +E+E++++ EE
Sbjct: 213 GDEPESADSEDNEDEDDPKEDEDDDQGEE 241



 Score = 28.2 bits (63), Expect = 5.3
 Identities = 8/26 (30%), Positives = 16/26 (61%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           + KE+E++++ EEEE    +   E+ 
Sbjct: 229 DPKEDEDDDQGEEEESGSSDSLSEDS 254



 Score = 28.2 bits (63), Expect = 5.9
 Identities = 10/30 (33%), Positives = 15/30 (50%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
           GEE+E    +   E+ +   EE E  E+ A
Sbjct: 239 GEEEESGSSDSLSEDSDASSEEMESGEMEA 268



 Score = 27.9 bits (62), Expect = 6.7
 Identities = 9/29 (31%), Positives = 13/29 (44%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           G+ EE    +   E+ +   EE E  E E
Sbjct: 239 GEEEESGSSDSLSEDSDASSEEMESGEME 267



 Score = 27.9 bits (62), Expect = 7.7
 Identities = 8/26 (30%), Positives = 13/26 (50%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           ++ EEEE    +   E+ +   EE E
Sbjct: 237 DQGEEEESGSSDSLSEDSDASSEEME 262


>gnl|CDD|237192 PRK12758, PRK12758, DNA topoisomerase IV subunit A; Provisional.
          Length = 869

 Score = 30.4 bits (69), Expect = 1.4
 Identities = 13/49 (26%), Positives = 22/49 (44%), Gaps = 6/49 (12%)

Query: 126 IERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
           I  L+             E+E  EE EE + E+ + E++ ++E T  L 
Sbjct: 825 INELEPLPYEE------PEEETAEEPEEVDPEDVKSEDDIDDEDTGQLG 867


>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein.  CDC45 is an essential gene
           required for initiation of DNA replication in S.
           cerevisiae, forming a complex with MCM5/CDC46.
           Homologues of CDC45 have been identified in human, mouse
           and smut fungus among others.
          Length = 583

 Score = 30.3 bits (69), Expect = 1.4
 Identities = 9/25 (36%), Positives = 21/25 (84%)

Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEE 168
           E++++++EE +EE+EE  + E++E+
Sbjct: 124 EEDDDDDEESDEEDEESSKSEDDED 148



 Score = 29.6 bits (67), Expect = 2.3
 Identities = 9/29 (31%), Positives = 20/29 (68%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
            + EE+++++EE +EE+EE  + E++   
Sbjct: 121 RDLEEDDDDDEESDEEDEESSKSEDDEDD 149



 Score = 29.6 bits (67), Expect = 2.7
 Identities = 8/27 (29%), Positives = 21/27 (77%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
            ++ ++EE +EE+EE  + E++E++++
Sbjct: 125 EDDDDDEESDEEDEESSKSEDDEDDDD 151



 Score = 29.6 bits (67), Expect = 2.7
 Identities = 10/31 (32%), Positives = 24/31 (77%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
            EE +EE+EE  + E++E+++++++++  AT
Sbjct: 130 DEESDEEDEESSKSEDDEDDDDDDDDDDIAT 160



 Score = 28.4 bits (64), Expect = 5.0
 Identities = 8/26 (30%), Positives = 21/26 (80%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           ++ +EE +EE+EE  + E++E+++++
Sbjct: 127 DDDDEESDEEDEESSKSEDDEDDDDD 152



 Score = 28.4 bits (64), Expect = 5.2
 Identities = 10/47 (21%), Positives = 26/47 (55%)

Query: 122 QKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           +     R D   R        +E+ +EE+EE  + E++E++++++++
Sbjct: 109 ELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDD 155


>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740).  This
           family consists of several uncharacterized plant
           proteins of unknown function.
          Length = 565

 Score = 30.4 bits (68), Expect = 1.4
 Identities = 17/31 (54%), Positives = 21/31 (67%), Gaps = 5/31 (16%)

Query: 143 EEKEEEEEEEEEEEE-----EEEEEEEEEEE 168
           EE E EE+EE  E+E     +E+ EEEEEEE
Sbjct: 144 EEVEMEEDEEYYEKEPGKVVDEKSEEEEEEE 174



 Score = 29.6 bits (66), Expect = 2.2
 Identities = 19/44 (43%), Positives = 25/44 (56%), Gaps = 9/44 (20%)

Query: 133 RRPRLYTGKGEEKEEEEEEEEEEE---------EEEEEEEEEEE 167
           R+P +     EE+EE E EE+EE          +E+ EEEEEEE
Sbjct: 131 RKPIVPDLVLEEEEEVEMEEDEEYYEKEPGKVVDEKSEEEEEEE 174



 Score = 28.8 bits (64), Expect = 4.6
 Identities = 12/23 (52%), Positives = 16/23 (69%)

Query: 147 EEEEEEEEEEEEEEEEEEEEEEI 169
            EEEEE E EE+EE  E+E  ++
Sbjct: 140 LEEEEEVEMEEDEEYYEKEPGKV 162



 Score = 28.5 bits (63), Expect = 6.0
 Identities = 12/25 (48%), Positives = 15/25 (60%)

Query: 145 KEEEEEEEEEEEEEEEEEEEEEEEI 169
            +   EEEEE E EE+EE  E+E  
Sbjct: 136 PDLVLEEEEEVEMEEDEEYYEKEPG 160


>gnl|CDD|233692 TIGR02031, BchD-ChlD, magnesium chelatase ATPase subunit D.  This
           model represents one of two ATPase subunits of the
           trimeric magnesium chelatase responsible for insertion
           of magnesium ion into protoporphyrin IX. This is an
           essential step in the biosynthesis of both chlorophyll
           and bacteriochlorophyll. This subunit is found in green
           plants, photosynthetic algae, cyanobacteria and other
           photosynthetic bacteria. Unlike subunit I (TIGR02030),
           this subunit is not found in archaea [Biosynthesis of
           cofactors, prosthetic groups, and carriers, Chlorophyll
           and bacteriochlorphyll].
          Length = 589

 Score = 30.2 bits (68), Expect = 1.4
 Identities = 9/27 (33%), Positives = 17/27 (62%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
            E  E EEE +E ++ + ++ EE ++I
Sbjct: 280 PEPPEPEEEPDEPDQTDPDDGEETDQI 306



 Score = 28.2 bits (63), Expect = 5.7
 Identities = 7/34 (20%), Positives = 13/34 (38%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
                  E  E EEE +E ++ + ++       P
Sbjct: 274 PPPPPPPEPPEPEEEPDEPDQTDPDDGEETDQIP 307



 Score = 28.2 bits (63), Expect = 6.3
 Identities = 8/29 (27%), Positives = 14/29 (48%)

Query: 146 EEEEEEEEEEEEEEEEEEEEEEEITATLN 174
            E  E EEE +E ++ + ++ EE      
Sbjct: 280 PEPPEPEEEPDEPDQTDPDDGEETDQIPE 308



 Score = 27.8 bits (62), Expect = 7.5
 Identities = 7/25 (28%), Positives = 16/25 (64%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEE 166
            EE+ +E ++ + ++ EE ++  EE
Sbjct: 285 PEEEPDEPDQTDPDDGEETDQIPEE 309


>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
          Length = 191

 Score = 29.8 bits (67), Expect = 1.4
 Identities = 12/29 (41%), Positives = 16/29 (55%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
             K +EKEE  E   EE  + EE EE ++
Sbjct: 68  KKKKKEKEEVPELAAEELSDSEENEENDK 96



 Score = 29.4 bits (66), Expect = 1.6
 Identities = 11/30 (36%), Positives = 18/30 (60%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           T K ++++EE  E   EE  + EE EE ++
Sbjct: 67  TKKKKKEKEEVPELAAEELSDSEENEENDK 96



 Score = 29.0 bits (65), Expect = 2.5
 Identities = 12/48 (25%), Positives = 20/48 (41%)

Query: 132 KRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPCGLC 179
           K+  +    K E  E   EE  + EE EE +++ + E+    N     
Sbjct: 65  KKTKKKKKEKEEVPELAAEELSDSEENEENDKKVDYELPKVQNTAAEV 112



 Score = 28.6 bits (64), Expect = 3.1
 Identities = 12/37 (32%), Positives = 19/37 (51%)

Query: 132 KRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
            R+ +      ++K+E+EE  E   EE  + EE EE 
Sbjct: 58  PRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEEN 94



 Score = 28.2 bits (63), Expect = 4.7
 Identities = 13/31 (41%), Positives = 19/31 (61%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
           T K ++K++E+EE  E   EE  + EE EE 
Sbjct: 64  TKKTKKKKKEKEEVPELAAEELSDSEENEEN 94



 Score = 27.5 bits (61), Expect = 8.0
 Identities = 10/40 (25%), Positives = 16/40 (40%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPCGLCR 180
           K E++E  E   EE  + EE EE +++             
Sbjct: 71  KKEKEEVPELAAEELSDSEENEENDKKVDYELPKVQNTAA 110


>gnl|CDD|237875 PRK14974, PRK14974, cell division protein FtsY; Provisional.
          Length = 336

 Score = 29.9 bits (68), Expect = 1.4
 Identities = 22/30 (73%), Positives = 23/30 (76%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
             K EEK EEEEEEE  E EEEEEEE+EEE
Sbjct: 13  VEKVEEKIEEEEEEEAPEAEEEEEEEDEEE 42



 Score = 29.2 bits (66), Expect = 2.8
 Identities = 18/25 (72%), Positives = 22/25 (88%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEE 167
           EE+EEEE  E EEEEEEE+EEE++E
Sbjct: 21  EEEEEEEAPEAEEEEEEEDEEEKKE 45



 Score = 28.4 bits (64), Expect = 4.4
 Identities = 18/25 (72%), Positives = 22/25 (88%)

Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEE 168
           E+EEEEE  E EEEEEEE+EEE++E
Sbjct: 21  EEEEEEEAPEAEEEEEEEDEEEKKE 45



 Score = 28.0 bits (63), Expect = 7.5
 Identities = 17/28 (60%), Positives = 23/28 (82%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           + + +EEEEEE  E EEEEEEE+EEE++
Sbjct: 17  EEKIEEEEEEEAPEAEEEEEEEDEEEKK 44


>gnl|CDD|216249 pfam01025, GrpE, GrpE. 
          Length = 165

 Score = 29.5 bits (67), Expect = 1.4
 Identities = 21/27 (77%), Positives = 22/27 (81%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
           EEKEEEEE E+EEE  EEE EE EEEI
Sbjct: 1   EEKEEEEELEDEEEALEEELEELEEEI 27


>gnl|CDD|222948 PHA02941, PHA02941, hypothetical protein; Provisional.
          Length = 356

 Score = 29.9 bits (67), Expect = 1.4
 Identities = 20/48 (41%), Positives = 25/48 (52%), Gaps = 16/48 (33%)

Query: 142 GEEKEEEEE----------------EEEEEEEEEEEEEEEEEEITATL 173
           GEEKE ++E                EE +E++EEEE EEEEE  T  L
Sbjct: 308 GEEKEPKQESQEQLFNPFAIDEEMLEETQEQQEEEENEEEEENDTVKL 355


>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
           (nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
           phosphoribosyltransferase) [Coenzyme metabolism].
          Length = 620

 Score = 30.2 bits (68), Expect = 1.5
 Identities = 13/30 (43%), Positives = 18/30 (60%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
           G E+ E  +E EE+E E  + E EE E+ A
Sbjct: 250 GREESEGSDESEEDEAEATDGEGEEGEMDA 279



 Score = 29.8 bits (67), Expect = 2.2
 Identities = 11/29 (37%), Positives = 17/29 (58%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           G+ E +  +E EE+E E  + E EE E +
Sbjct: 250 GREESEGSDESEEDEAEATDGEGEEGEMD 278



 Score = 29.0 bits (65), Expect = 3.7
 Identities = 10/30 (33%), Positives = 15/30 (50%)

Query: 138 YTGKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
              + +  E+ E   EE E  +E EE+E E
Sbjct: 237 DDDQPDNNEDSEAGREESEGSDESEEDEAE 266



 Score = 28.7 bits (64), Expect = 4.4
 Identities = 11/26 (42%), Positives = 14/26 (53%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           E   EE E  +E EE+E E  + E E
Sbjct: 248 EAGREESEGSDESEEDEAEATDGEGE 273



 Score = 28.3 bits (63), Expect = 6.5
 Identities = 11/25 (44%), Positives = 14/25 (56%)

Query: 147 EEEEEEEEEEEEEEEEEEEEEEITA 171
           E+ E   EE E  +E EE+E E T 
Sbjct: 245 EDSEAGREESEGSDESEEDEAEATD 269



 Score = 28.3 bits (63), Expect = 7.1
 Identities = 10/30 (33%), Positives = 14/30 (46%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
             +  E   EE E  +E EE+E E  + E 
Sbjct: 243 NNEDSEAGREESEGSDESEEDEAEATDGEG 272



 Score = 27.9 bits (62), Expect = 7.4
 Identities = 11/27 (40%), Positives = 15/27 (55%)

Query: 146 EEEEEEEEEEEEEEEEEEEEEEEITAT 172
           +  E+ E   EE E  +E EE+E  AT
Sbjct: 242 DNNEDSEAGREESEGSDESEEDEAEAT 268



 Score = 27.9 bits (62), Expect = 8.0
 Identities = 11/26 (42%), Positives = 15/26 (57%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
             +EE E  +E EE+E E  + E EE
Sbjct: 249 AGREESEGSDESEEDEAEATDGEGEE 274



 Score = 27.5 bits (61), Expect = 10.0
 Identities = 11/28 (39%), Positives = 17/28 (60%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
           G+GEE E +  E  E+ E +E +E+ E 
Sbjct: 270 GEGEEGEMDAAEASEDSESDESDEDTET 297


>gnl|CDD|223626 COG0552, FtsY, Signal recognition particle GTPase [Intracellular
           trafficking and secretion].
          Length = 340

 Score = 29.9 bits (68), Expect = 1.5
 Identities = 14/31 (45%), Positives = 21/31 (67%)

Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
           E+ E E+E+ EEE+EEEE+E   E +   L+
Sbjct: 12  EELEAEKEKIEEEDEEEEKEGWFERLKQGLS 42



 Score = 29.1 bits (66), Expect = 2.6
 Identities = 16/28 (57%), Positives = 21/28 (75%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
           E+ EEE EE E E+E+ EEE+EEEE+  
Sbjct: 5   EKLEEELEELEAEKEKIEEEDEEEEKEG 32



 Score = 29.1 bits (66), Expect = 2.6
 Identities = 17/26 (65%), Positives = 20/26 (76%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
            EK EEE EE E E+E+ EEE+EEEE
Sbjct: 4   FEKLEEELEELEAEKEKIEEEDEEEE 29



 Score = 28.8 bits (65), Expect = 4.2
 Identities = 15/27 (55%), Positives = 20/27 (74%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           G  ++ EEE EE E E+E+ EEE+EEE
Sbjct: 2   GFFEKLEEELEELEAEKEKIEEEDEEE 28



 Score = 27.6 bits (62), Expect = 7.7
 Identities = 16/32 (50%), Positives = 21/32 (65%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
            E+E EE E E+E+ EEE+EEEE+E     L 
Sbjct: 7   LEEELEELEAEKEKIEEEDEEEEKEGWFERLK 38


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 30.3 bits (69), Expect = 1.5
 Identities = 7/28 (25%), Positives = 24/28 (85%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
           ++ E++++++ ++E+EE++E +E E+++
Sbjct: 155 DDDEDDDDDDVDDEDEEKKEAKELEKLS 182



 Score = 29.6 bits (67), Expect = 2.5
 Identities = 6/30 (20%), Positives = 25/30 (83%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
            ++++++E++++++ ++E+EE++E +E+  
Sbjct: 151 DDDEDDDEDDDDDDVDDEDEEKKEAKELEK 180



 Score = 29.2 bits (66), Expect = 3.5
 Identities = 8/33 (24%), Positives = 20/33 (60%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
           +E EE++E +E E+  ++++   +E+ +  L  
Sbjct: 167 DEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQ 199



 Score = 28.8 bits (65), Expect = 3.6
 Identities = 6/30 (20%), Positives = 24/30 (80%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
              ++ ++E+++E++++++ ++E+EE++E 
Sbjct: 146 DIDDDDDDEDDDEDDDDDDVDDEDEEKKEA 175



 Score = 28.8 bits (65), Expect = 3.7
 Identities = 7/29 (24%), Positives = 22/29 (75%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
              +E ++E++++++ ++E+EE++E +E 
Sbjct: 150 DDDDEDDDEDDDDDDVDDEDEEKKEAKEL 178



 Score = 28.4 bits (64), Expect = 5.3
 Identities = 5/29 (17%), Positives = 23/29 (79%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
              + ++++E+++E++++++ ++E+EE+ 
Sbjct: 145 DDIDDDDDDEDDDEDDDDDDVDDEDEEKK 173



 Score = 28.0 bits (63), Expect = 7.1
 Identities = 6/30 (20%), Positives = 20/30 (66%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
               ++ +++ ++E+EE++E +E E+  ++
Sbjct: 155 DDDEDDDDDDVDDEDEEKKEAKELEKLSDD 184


>gnl|CDD|218771 pfam05835, Synaphin, Synaphin protein.  This family consists of
           several eukaryotic synaphin 1 and 2 proteins.
           Synaphin/complexin is a cytosolic protein that
           preferentially binds to syntaxin within the SNARE
           complex. Synaphin promotes SNAREs to form precomplexes
           that oligomerise into higher order structures. A peptide
           from the central, syntaxin binding domain of synaphin
           competitively inhibits these two proteins from
           interacting and prevents SNARE complexes from
           oligomerising. It is thought that oligomerisation of
           SNARE complexes into a higher order structure creates a
           SNARE scaffold for efficient, regulated fusion of
           synaptic vesicles. Synaphin promotes neuronal exocytosis
           by promoting interaction between the complementary
           syntaxin and synaptobrevin transmembrane regions that
           reside in opposing membranes prior to fusion.
          Length = 139

 Score = 29.1 bits (65), Expect = 1.5
 Identities = 12/29 (41%), Positives = 19/29 (65%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
            +G+E + EEE+EE +E   E EEE + +
Sbjct: 25  DEGDESDAEEEDEEIQEALREAEEERKAK 53



 Score = 28.3 bits (63), Expect = 2.7
 Identities = 13/27 (48%), Positives = 18/27 (66%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           G+E E +E + EEE+EE +E   E EE
Sbjct: 22  GKEDEGDESDAEEEDEEIQEALREAEE 48



 Score = 28.3 bits (63), Expect = 3.0
 Identities = 14/28 (50%), Positives = 19/28 (67%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
           GK +E +E + EEE+EE +E   E EEE
Sbjct: 22  GKEDEGDESDAEEEDEEIQEALREAEEE 49



 Score = 27.9 bits (62), Expect = 3.4
 Identities = 13/29 (44%), Positives = 19/29 (65%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           G  E++ +E + EEE+EE +E   E EEE
Sbjct: 21  GGKEDEGDESDAEEEDEEIQEALREAEEE 49


>gnl|CDD|221429 pfam12118, SprA-related, SprA-related family.  This protein is
           found in bacteria. Proteins in this family are typically
           between 234 to 465 amino acids in length. There is a
           conserved GEV sequence motif.Most members are annotated
           as being SprA-related.
          Length = 261

 Score = 29.8 bits (67), Expect = 1.6
 Identities = 7/31 (22%), Positives = 21/31 (67%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
              GEE+ E+++ E  ++  ++++E++++E 
Sbjct: 80  EVVGEEEPEKDQAEAGKQLSQKQQEQQQQEE 110


>gnl|CDD|218328 pfam04921, XAP5, XAP5, circadian clock regulator.  This protein is
           found in a wide range of eukaryotes. It is a nuclear
           protein and is suggested to be DNA binding. In plants,
           this family is essential for correct circadian clock
           functioning by acting as a light-quality regulator
           coordinating the activities of blue and red light
           signalling pathways during plant growth - inhibiting
           growth in red light but promoting growth in blue light.
          Length = 233

 Score = 29.6 bits (67), Expect = 1.6
 Identities = 14/46 (30%), Positives = 26/46 (56%), Gaps = 3/46 (6%)

Query: 132 KRRPRLYTGK---GEEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
           K++ +    K   G++ EEE+E+E E+E++  +E  E +E     N
Sbjct: 1   KKKKKKKKSKLSFGDDDEEEDEDEGEDEKKVPKESSEPDEANVNPN 46



 Score = 28.9 bits (65), Expect = 3.2
 Identities = 16/38 (42%), Positives = 20/38 (52%)

Query: 135 PRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
             L     EEKE E  EE  EE  +++E  +EEEI  T
Sbjct: 59  SFLPDKAREEKEAELREELREEFLKKQEAVKEEEIEIT 96



 Score = 27.7 bits (62), Expect = 6.6
 Identities = 9/30 (30%), Positives = 18/30 (60%)

Query: 146 EEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
            +++EEE+E+E E+E++  +E       N 
Sbjct: 14  GDDDEEEDEDEGEDEKKVPKESSEPDEANV 43



 Score = 27.7 bits (62), Expect = 7.8
 Identities = 12/30 (40%), Positives = 21/30 (70%)

Query: 146 EEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
           ++EEE+E+E E+E++  +E  E   A +NP
Sbjct: 16  DDEEEDEDEGEDEKKVPKESSEPDEANVNP 45


>gnl|CDD|178493 PLN02905, PLN02905, beta-amylase.
          Length = 702

 Score = 30.3 bits (68), Expect = 1.6
 Identities = 11/33 (33%), Positives = 19/33 (57%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
           G  EE +EEE + + +EE++ +    E+ I  T
Sbjct: 10  GTSEEDDEEEMDMDVKEEDDGDRRNREKHIPVT 42


>gnl|CDD|227509 COG5182, CUS1, Splicing factor 3b, subunit 2 [RNA processing and
           modification].
          Length = 429

 Score = 30.0 bits (67), Expect = 1.7
 Identities = 10/38 (26%), Positives = 16/38 (42%)

Query: 131 GKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
            +  P    G+  + E EE   +EE+      E E+ E
Sbjct: 346 FEVGPETAEGELLQFEAEEIYNDEEQYTRPVYENEDFE 383



 Score = 27.6 bits (61), Expect = 9.5
 Identities = 11/37 (29%), Positives = 16/37 (43%)

Query: 131 GKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
           G + P  +    E   E  E E  + E EE   +EE+
Sbjct: 335 GYQPPGWHEPLFEVGPETAEGELLQFEAEEIYNDEEQ 371


>gnl|CDD|224217 COG1298, FlhA, Flagellar biosynthesis pathway, component FlhA [Cell
           motility and secretion / Intracellular trafficking and
           secretion].
          Length = 696

 Score = 29.9 bits (68), Expect = 1.7
 Identities = 15/32 (46%), Positives = 23/32 (71%)

Query: 138 YTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
              K +E++ EE+E+  EE ++EEEEEEEE +
Sbjct: 322 LLSKRKEQQAEEKEKPAEEAKKEEEEEEEESV 353



 Score = 28.8 bits (65), Expect = 4.6
 Identities = 14/31 (45%), Positives = 22/31 (70%)

Query: 138 YTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           Y     ++++ EE+E+  EE ++EEEEEEEE
Sbjct: 321 YLLSKRKEQQAEEKEKPAEEAKKEEEEEEEE 351



 Score = 28.0 bits (63), Expect = 8.9
 Identities = 14/36 (38%), Positives = 21/36 (58%)

Query: 131 GKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEE 166
                +    + EEKE+  EE ++EEEEEEEE  ++
Sbjct: 320 AYLLSKRKEQQAEEKEKPAEEAKKEEEEEEEESVDD 355


>gnl|CDD|237028 PRK12267, PRK12267, methionyl-tRNA synthetase; Reviewed.
          Length = 648

 Score = 30.2 bits (69), Expect = 1.7
 Identities = 10/28 (35%), Positives = 17/28 (60%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
              +E+ E    +E EE+E++ E+ EIT
Sbjct: 517 AYIKEQMEGSAPKEPEEKEKKPEKPEIT 544


>gnl|CDD|214923 smart00937, PCRF, This domain is found in peptide chain release
           factors. 
          Length = 116

 Score = 28.6 bits (65), Expect = 1.7
 Identities = 16/31 (51%), Positives = 20/31 (64%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATL 173
           EE+E++E  E  EEE EE E E EEE+   L
Sbjct: 4   EEEEDDEMRELAEEELEELEAELEEELEELL 34



 Score = 27.0 bits (61), Expect = 6.2
 Identities = 15/30 (50%), Positives = 18/30 (60%)

Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEEITATL 173
           E  EEEE++E  E  EEE EE E E+   L
Sbjct: 1   ELLEEEEDDEMRELAEEELEELEAELEEEL 30



 Score = 27.0 bits (61), Expect = 6.2
 Identities = 14/28 (50%), Positives = 17/28 (60%)

Query: 148 EEEEEEEEEEEEEEEEEEEEEITATLNP 175
           E  EEEE++E  E  EEE EE+ A L  
Sbjct: 1   ELLEEEEDDEMRELAEEELEELEAELEE 28


>gnl|CDD|235033 PRK02363, PRK02363, DNA-directed RNA polymerase subunit delta;
           Reviewed.
          Length = 129

 Score = 28.8 bits (65), Expect = 1.7
 Identities = 9/26 (34%), Positives = 21/26 (80%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           ++   +++ +EE+ +EE++E+EE+EE
Sbjct: 104 DDILPDDDFDEEDLDEEDDEDEEDEE 129



 Score = 28.1 bits (63), Expect = 3.1
 Identities = 8/26 (30%), Positives = 20/26 (76%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           ++    +++ +EE+ +EE++E+EE+E
Sbjct: 103 DDDILPDDDFDEEDLDEEDDEDEEDE 128



 Score = 27.7 bits (62), Expect = 4.4
 Identities = 9/43 (20%), Positives = 26/43 (60%)

Query: 126 IERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           +E    K++ +   G  +  +++   +++ +EE+ +EE++E+E
Sbjct: 83  LEEKFDKKKKKFMDGDDDIIDDDILPDDDFDEEDLDEEDDEDE 125



 Score = 27.7 bits (62), Expect = 4.8
 Identities = 11/44 (25%), Positives = 26/44 (59%)

Query: 125 LIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           L E+ D K++  +        ++   +++ +EE+ +EE++E+EE
Sbjct: 83  LEEKFDKKKKKFMDGDDDIIDDDILPDDDFDEEDLDEEDDEDEE 126


>gnl|CDD|215009 smart01069, CDC37_C, Cdc37 C terminal domain.  Cdc37 is a protein
           required for the activity of numerous eukaryotic protein
           kinases. This domains corresponds to the C terminal
           domain whose function is unclear. It is found C terminal
           to the Hsp90 chaperone (Heat shocked protein 90) binding
           domain pfam08565 and the N terminal kinase binding
           domain of Cdc37.
          Length = 93

 Score = 28.1 bits (63), Expect = 1.8
 Identities = 14/30 (46%), Positives = 20/30 (66%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
              +E E +E +E+ E EEE E+E+EEEE 
Sbjct: 63  AIEDETEFKELQEQYEVEEEAEKEDEEEED 92



 Score = 27.8 bits (62), Expect = 2.3
 Identities = 15/33 (45%), Positives = 22/33 (66%)

Query: 135 PRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
           P     + E KE +E+ E EEE E+E+EEEE++
Sbjct: 61  PNAIEDETEFKELQEQYEVEEEAEKEDEEEEDD 93



 Score = 27.0 bits (60), Expect = 4.3
 Identities = 13/26 (50%), Positives = 20/26 (76%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
            E +E +E+ E EEE E+E+EEEE++
Sbjct: 68  TEFKELQEQYEVEEEAEKEDEEEEDD 93



 Score = 26.2 bits (58), Expect = 8.6
 Identities = 11/30 (36%), Positives = 17/30 (56%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
              E+E E +E +E+ E EEE E+E+    
Sbjct: 62  NAIEDETEFKELQEQYEVEEEAEKEDEEEE 91


>gnl|CDD|216403 pfam01271, Granin, Granin (chromogranin or secretogranin). 
          Length = 585

 Score = 30.0 bits (67), Expect = 1.8
 Identities = 16/46 (34%), Positives = 21/46 (45%), Gaps = 3/46 (6%)

Query: 133 RRPRLYTGKG-EEKEEEEEEEEEEEEEEEEEE--EEEEEITATLNP 175
            R     G+G  +   E E  EE EE+E +     +E EITA L  
Sbjct: 291 LRWVNARGRGRSQNRAERERSEESEEKELDRASPYQEIEITANLQI 336



 Score = 28.0 bits (62), Expect = 7.8
 Identities = 11/40 (27%), Positives = 18/40 (45%)

Query: 135 PRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
           P      GE+    EE EE++ +EE +  +E      +L 
Sbjct: 215 PYEDVVGGEDWNPIEEGEEDQTQEEVKRSKERTHKGRSLP 254


>gnl|CDD|215544 PLN03029, PLN03029, type-a response regulator protein; Provisional.
          Length = 222

 Score = 29.6 bits (66), Expect = 1.9
 Identities = 10/42 (23%), Positives = 25/42 (59%)

Query: 134 RPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
           +P +   K + +++E +E++E+ EE E + E++E+ +     
Sbjct: 142 KPHMMKTKSKNQKQENQEKQEKLEESEIQSEKQEQPSQQPQS 183



 Score = 28.8 bits (64), Expect = 3.2
 Identities = 12/48 (25%), Positives = 27/48 (56%)

Query: 128 RLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
           +L    R + +  K + K +++E +E++E+ EE E + E++   +  P
Sbjct: 134 QLSDLNRLKPHMMKTKSKNQKQENQEKQEKLEESEIQSEKQEQPSQQP 181


>gnl|CDD|234352 TIGR03779, Bac_Flav_CT_M, Bacteroides conjugative transposon TraM
           protein.  Members of this protein family are designated
           TraM and are found in a proposed transfer region of a
           class of conjugative transposon found in the Bacteroides
           lineage [Cellular processes, DNA transformation].
          Length = 410

 Score = 29.7 bits (67), Expect = 2.0
 Identities = 9/34 (26%), Positives = 11/34 (32%)

Query: 135 PRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
                 + E   E EE E     E     E EE+
Sbjct: 139 YPKTDEEKELLREVEELESRLATEPSPAPELEEQ 172



 Score = 29.7 bits (67), Expect = 2.1
 Identities = 8/31 (25%), Positives = 11/31 (35%)

Query: 138 YTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           Y     ++E+E   E EE E     E     
Sbjct: 137 YEYPKTDEEKELLREVEELESRLATEPSPAP 167



 Score = 29.3 bits (66), Expect = 2.9
 Identities = 9/31 (29%), Positives = 13/31 (41%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
             K +E++E   E EE E     E     E+
Sbjct: 139 YPKTDEEKELLREVEELESRLATEPSPAPEL 169



 Score = 28.9 bits (65), Expect = 3.9
 Identities = 10/27 (37%), Positives = 10/27 (37%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
           EEKE   E EE E     E     E  
Sbjct: 144 EEKELLREVEELESRLATEPSPAPELE 170


>gnl|CDD|222440 pfam13897, GOLD_2, Golgi-dynamics membrane-trafficking.  Sec14-like
           Golgi-trafficking domain The GOLD domain is always found
           combined with lipid- or membrane-association domains.
          Length = 136

 Score = 28.9 bits (65), Expect = 2.0
 Identities = 16/37 (43%), Positives = 21/37 (56%), Gaps = 1/37 (2%)

Query: 148 EEEEEEEEEEEEEEEEEEEEEITATLNPCGLCRILPV 184
           +EEEEEE EEEE E  + E    +   P  +  I+PV
Sbjct: 56  DEEEEEEAEEEEAETGDVEAGSKSQSRPL-VDEIIPV 91



 Score = 27.0 bits (60), Expect = 7.7
 Identities = 13/23 (56%), Positives = 15/23 (65%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEE 164
            E  +EEEEEE EEEE E  + E
Sbjct: 52  SESSDEEEEEEAEEEEAETGDVE 74



 Score = 27.0 bits (60), Expect = 8.7
 Identities = 13/22 (59%), Positives = 15/22 (68%)

Query: 147 EEEEEEEEEEEEEEEEEEEEEE 168
           E  +EEEEEE EEEE E  + E
Sbjct: 53  ESSDEEEEEEAEEEEAETGDVE 74



 Score = 27.0 bits (60), Expect = 8.9
 Identities = 13/22 (59%), Positives = 15/22 (68%)

Query: 144 EKEEEEEEEEEEEEEEEEEEEE 165
           E  +EEEEEE EEEE E  + E
Sbjct: 53  ESSDEEEEEEAEEEEAETGDVE 74


>gnl|CDD|235229 PRK04156, gltX, glutamyl-tRNA synthetase; Provisional.
          Length = 567

 Score = 29.8 bits (68), Expect = 2.0
 Identities = 13/26 (50%), Positives = 18/26 (69%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           EE+ E  EE   E  EEEEE++EE++
Sbjct: 64  EEQRERLEELAPELLEEEEEKKEEKK 89


>gnl|CDD|220605 pfam10156, Med17, Subunit 17 of Mediator complex.  This Mediator
           complex subunit was formerly known as Srb4 in yeasts or
           Trap80 in Drosophila and human. The Med17 subunit is
           located within the head domain and is essential for cell
           viability to the extent that a mutant strain of
           cerevisiae lacking it shows all RNA polymerase
           II-dependent transcription ceasing at non-permissive
           temperatures.
          Length = 454

 Score = 29.6 bits (67), Expect = 2.1
 Identities = 11/28 (39%), Positives = 20/28 (71%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
            EE +EEE++EE++ ++ EE ++  EE 
Sbjct: 68  SEESDEEEDDEEDDNDDSEENKDTVEEF 95



 Score = 28.5 bits (64), Expect = 5.7
 Identities = 9/35 (25%), Positives = 22/35 (62%)

Query: 132 KRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEE 166
           K   ++   +  ++EE++EE++ ++ EE ++  EE
Sbjct: 60  KEAAKIDFSEESDEEEDDEEDDNDDSEENKDTVEE 94



 Score = 28.1 bits (63), Expect = 6.2
 Identities = 12/33 (36%), Positives = 21/33 (63%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEITATL 173
           K   K +  EE +EEE++EE++ ++ EE   T+
Sbjct: 60  KEAAKIDFSEESDEEEDDEEDDNDDSEENKDTV 92


>gnl|CDD|184416 PRK13955, mscL, large-conductance mechanosensitive channel;
           Provisional.
          Length = 130

 Score = 28.6 bits (64), Expect = 2.1
 Identities = 12/25 (48%), Positives = 15/25 (60%)

Query: 138 YTGKGEEKEEEEEEEEEEEEEEEEE 162
              K   K+EEE+EEE  E  +EEE
Sbjct: 89  VFNKLTSKKEEEKEEEIPEPTKEEE 113



 Score = 28.2 bits (63), Expect = 2.9
 Identities = 11/26 (42%), Positives = 17/26 (65%)

Query: 136 RLYTGKGEEKEEEEEEEEEEEEEEEE 161
           +++     +KEEE+EEE  E  +EEE
Sbjct: 88  KVFNKLTSKKEEEKEEEIPEPTKEEE 113



 Score = 27.5 bits (61), Expect = 4.5
 Identities = 12/25 (48%), Positives = 16/25 (64%)

Query: 145 KEEEEEEEEEEEEEEEEEEEEEEEI 169
             ++EEE+EEE  E  +EEE   EI
Sbjct: 94  TSKKEEEKEEEIPEPTKEEELLGEI 118


>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
           unknown].
          Length = 652

 Score = 29.7 bits (67), Expect = 2.3
 Identities = 12/31 (38%), Positives = 16/31 (51%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
           K +E+E   E+E  EEEE  E    E+ I  
Sbjct: 396 KVKEEERPREKEGTEEEERREITVYEKRIKK 426



 Score = 29.3 bits (66), Expect = 2.7
 Identities = 11/34 (32%), Positives = 14/34 (41%)

Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEEITATLNPCG 177
            K +EEE   E+E  EEEE  E       +    
Sbjct: 395 SKVKEEERPREKEGTEEEERREITVYEKRIKKLE 428


>gnl|CDD|227277 COG4941, COG4941, Predicted RNA polymerase sigma factor containing
           a TPR repeat domain [Transcription].
          Length = 415

 Score = 29.7 bits (67), Expect = 2.3
 Identities = 16/65 (24%), Positives = 25/65 (38%), Gaps = 13/65 (20%)

Query: 118 KPWPQKG--------LIE-----RLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEE 164
           + WP+ G        LI       +D  RR         E    +E+EE EE E  ++E 
Sbjct: 45  ERWPRAGPPRNPAAWLIAVGRNRAIDRVRRRARRDAAPPELLLSDEDEEMEEAEALDDEH 104

Query: 165 EEEEI 169
             ++ 
Sbjct: 105 IRDDR 109


>gnl|CDD|148810 pfam07412, Geminin, Geminin.  This family contains the eukaryotic
           protein geminin (approximately 200 residues long).
           Geminin inhibits DNA replication by preventing the
           incorporation of MCM complex into prereplication
           complex, and is degraded during the mitotic phase of the
           cell cycle. It has been proposed that geminin inhibits
           DNA replication during S, G2, and M phases and that
           geminin destruction at the metaphase-anaphase transition
           permits replication in the succeeding cell cycle.
          Length = 200

 Score = 29.1 bits (65), Expect = 2.3
 Identities = 17/37 (45%), Positives = 23/37 (62%)

Query: 125 LIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEE 161
           +IERL G+    L + K +E + EEEE E+  EEE E
Sbjct: 164 MIERLTGEPPDNLESLKSQEFDSEEEEAEDSLEEESE 200


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 29.4 bits (66), Expect = 2.3
 Identities = 11/72 (15%), Positives = 31/72 (43%), Gaps = 10/72 (13%)

Query: 106 TKVTNEVERTGRKPWPQKGLIER----------LDGKRRPRLYTGKGEEKEEEEEEEEEE 155
            ++  + +   +K   ++  +E+           +  R+  L      EK  ++ E+  +
Sbjct: 53  NRIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAK 112

Query: 156 EEEEEEEEEEEE 167
           + EE++++ EE 
Sbjct: 113 QAEEKQKQAEEA 124


>gnl|CDD|237798 PRK14714, PRK14714, DNA polymerase II large subunit; Provisional.
          Length = 1337

 Score = 29.7 bits (67), Expect = 2.4
 Identities = 11/31 (35%), Positives = 16/31 (51%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
           T +G+    EE EEEE   + +E E+    I
Sbjct: 280 TIEGKSDAGEEGEEEELSTDADEYEKGPPRI 310


>gnl|CDD|198151 smart01083, Cir_N, N-terminal domain of CBF1 interacting
           co-repressor CIR.  This is a 45 residue conserved region
           at the N-terminal end of a family of proteins referred
           to as CIRs (CBF1-interacting co-repressors). CBF1
           (centromere-binding factor 1) acts as a transcription
           factor that causes repression by binding specifically to
           GTGGGAA motifs in responsive promoters, and it requires
           CIR as a co-repressor. CIR binds to histone deacetylase
           and to SAP30 and serves as a linker between CBF1 and the
           histone deacetylase complex.
          Length = 37

 Score = 26.3 bits (59), Expect = 2.4
 Identities = 10/24 (41%), Positives = 17/24 (70%)

Query: 144 EKEEEEEEEEEEEEEEEEEEEEEE 167
            K E++ EEE+++ EE  +E E+E
Sbjct: 14  WKAEQKAEEEKKKIEERRKEIEKE 37



 Score = 26.0 bits (58), Expect = 3.3
 Identities = 11/23 (47%), Positives = 17/23 (73%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEE 163
           K E+K EEE+++ EE  +E E+E
Sbjct: 15  KAEQKAEEEKKKIEERRKEIEKE 37



 Score = 25.6 bits (57), Expect = 4.2
 Identities = 9/23 (39%), Positives = 17/23 (73%)

Query: 146 EEEEEEEEEEEEEEEEEEEEEEE 168
           + E++ EEE+++ EE  +E E+E
Sbjct: 15  KAEQKAEEEKKKIEERRKEIEKE 37



 Score = 25.6 bits (57), Expect = 4.2
 Identities = 9/24 (37%), Positives = 17/24 (70%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEE 166
            + E++ EEE+++ EE  +E E+E
Sbjct: 14  WKAEQKAEEEKKKIEERRKEIEKE 37


>gnl|CDD|218734 pfam05758, Ycf1, Ycf1.  The chloroplast genomes of most higher
           plants contain two giant open reading frames designated
           ycf1 and ycf2. Although the function of Ycf1 is unknown,
           it is known to be an essential gene.
          Length = 832

 Score = 29.6 bits (67), Expect = 2.4
 Identities = 12/32 (37%), Positives = 19/32 (59%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
           T + EE+EEE + E E   E +  ++E+E  T
Sbjct: 229 TSETEEREEETDVEIETTSETKGTKQEQEGST 260



 Score = 28.8 bits (65), Expect = 4.9
 Identities = 12/43 (27%), Positives = 25/43 (58%)

Query: 127 ERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
           E +  K+  R +  K +E +  +  EE+ ++EE++E+ + EE 
Sbjct: 635 ELMKIKQIFRNWIEKKKEFKILDYTEEKTKKEEKKEKNKREEK 677



 Score = 28.4 bits (64), Expect = 5.5
 Identities = 10/32 (31%), Positives = 18/32 (56%)

Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
           E+ EEE + E E   E +  ++E+E +   +P
Sbjct: 233 EEREEETDVEIETTSETKGTKQEQEGSTEEDP 264


>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
           [General function prediction only].
          Length = 239

 Score = 29.3 bits (66), Expect = 2.5
 Identities = 12/34 (35%), Positives = 17/34 (50%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
            E +   EEE  E  EE +E   + EE+   L+P
Sbjct: 141 AEAEARLEEEVAEIREEGQELSSKREELKEKLDP 174


>gnl|CDD|216641 pfam01673, Herpes_env, Herpesvirus putative major envelope
           glycoprotein.  This family consists of probable major
           envelope glycoproteins from members of the herpesviridae
           including herpes simplex virus, human cytomegalovirus
           and varicella-zoster virus. Members of the herpesviridae
           have a dsDNA genome and do not have a RNA stage during
           there replication.
          Length = 483

 Score = 29.6 bits (67), Expect = 2.6
 Identities = 11/39 (28%), Positives = 14/39 (35%), Gaps = 3/39 (7%)

Query: 147 EEEEEEEEEEEEEEEEEEEEEEITATLN---PCGLCRIL 182
              E+ E     E         + A L    PC +CRIL
Sbjct: 25  GTPEDAELAPFVETALNNVLTHLAAALALDSPCNVCRIL 63


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
           the C subunit of DNA polymerase delta. It carries the
           essential residues for binding to the Pol1 subunit of
           polymerase alpha, from residues 293-332, which are
           characterized by the motif D--G--VT, referred to as the
           DPIM motif. The first 160 residues of the protein form
           the minimal domain for binding to the B subunit, Cdc1,
           of polymerase delta, the final 10 C-terminal residues,
           362-372, being the DNA sliding clamp, PCNA, binding
           motif.
          Length = 427

 Score = 29.4 bits (66), Expect = 2.7
 Identities = 11/28 (39%), Positives = 20/28 (71%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
           EEKE+E+ +  ++  E+E+E+EE E + 
Sbjct: 288 EEKEKEKRKRLKKMMEDEDEDEEMEIVP 315



 Score = 29.4 bits (66), Expect = 2.7
 Identities = 13/27 (48%), Positives = 18/27 (66%)

Query: 146 EEEEEEEEEEEEEEEEEEEEEEEITAT 172
            EEEE EE E     ++EEE+EE+T +
Sbjct: 319 VEEEESEEPEPPPLPKKEEEKEEVTVS 345



 Score = 27.9 bits (62), Expect = 7.5
 Identities = 11/26 (42%), Positives = 20/26 (76%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           EE EE+E+E+ +  ++  E+E+E+EE
Sbjct: 285 EETEEKEKEKRKRLKKMMEDEDEDEE 310


>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
           Transcription initiation factor IIA (TFIIA) is a
           heterotrimer, the three subunits being known as alpha,
           beta, and gamma, in order of molecular weight. The N and
           C-terminal domains of the gamma subunit are represented
           in pfam02268 and pfam02751, respectively. This family
           represents the precursor that yields both the alpha and
           beta subunits. The TFIIA heterotrimer is an essential
           general transcription initiation factor for the
           expression of genes transcribed by RNA polymerase II.
           Together with TFIID, TFIIA binds to the promoter region;
           this is the first step in the formation of a
           pre-initiation complex (PIC). Binding of the rest of the
           transcription machinery follows this step. After
           initiation, the PIC does not completely dissociate from
           the promoter. Some components, including TFIIA, remain
           attached and re-initiate a subsequent round of
           transcription.
          Length = 332

 Score = 29.3 bits (66), Expect = 2.7
 Identities = 10/62 (16%), Positives = 25/62 (40%), Gaps = 9/62 (14%)

Query: 122 QKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEE---------EEEEEEITAT 172
           Q     +   +R      G   + E +  +++++E+  E +         +E+ E++  T
Sbjct: 228 QPKKQAKSSKRRTIAQIDGIDSDDEGDGSDDDDDEDAIESDLDDSDDDVSDEDGEDLFDT 287

Query: 173 LN 174
            N
Sbjct: 288 DN 289


>gnl|CDD|145910 pfam03012, PP_M1, Phosphoprotein.  This family includes the M1
           phosphoprotein non-structural RNA polymerase alpha
           subunit, which is thought to be a component of the
           active polymerase, and may be involved in template
           binding.
          Length = 298

 Score = 28.9 bits (65), Expect = 2.8
 Identities = 11/38 (28%), Positives = 19/38 (50%)

Query: 130 DGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
           +   R ++   K  +  EEE+E+E   EE+    E E+
Sbjct: 51  EDMSRLQISDAKPSQFTEEEDEDEGSSEEDFYMTEGED 88


>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207).  This
           family is found in eukaryotes; it has several conserved
           tryptophan residues. The function is not known.
          Length = 261

 Score = 28.9 bits (65), Expect = 2.8
 Identities = 7/25 (28%), Positives = 20/25 (80%)

Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEE 168
           +K ++++++ EEE  ++ ++++EEE
Sbjct: 190 KKLKQQQQKREEERRKQRKKQQEEE 214



 Score = 28.9 bits (65), Expect = 2.8
 Identities = 7/26 (26%), Positives = 21/26 (80%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           ++ ++++++ EEE  ++ ++++EEEE
Sbjct: 190 KKLKQQQQKREEERRKQRKKQQEEEE 215



 Score = 28.5 bits (64), Expect = 4.2
 Identities = 13/60 (21%), Positives = 28/60 (46%), Gaps = 5/60 (8%)

Query: 110 NEVERTGRKPWPQKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
            ++ER  ++ +        L  K+  R    +   +E++++E E+E EE E  +   +E 
Sbjct: 79  VKLERQAQEAYE-----NWLSAKQAQRQKKLQKLLEEKQKQEREKEREEAELRQRLAKEK 133



 Score = 27.8 bits (62), Expect = 6.5
 Identities = 8/29 (27%), Positives = 22/29 (75%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
           K ++++++ EEE  ++ ++++EEEE ++ 
Sbjct: 191 KLKQQQQKREEERRKQRKKQQEEEERKQK 219


>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572).  Family of
           eukaryotic proteins with undetermined function.
          Length = 321

 Score = 29.3 bits (66), Expect = 2.8
 Identities = 11/27 (40%), Positives = 17/27 (62%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           G  +  E ++ +EE+EE  E+E EEE 
Sbjct: 103 GATRNYEADKLDEEQEERVEKEREEEL 129



 Score = 28.6 bits (64), Expect = 4.2
 Identities = 10/23 (43%), Positives = 15/23 (65%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEE 165
            E ++ +EE+EE  E+E EEE  
Sbjct: 108 YEADKLDEEQEERVEKEREEELA 130



 Score = 28.6 bits (64), Expect = 4.3
 Identities = 10/23 (43%), Positives = 15/23 (65%)

Query: 146 EEEEEEEEEEEEEEEEEEEEEEE 168
            E ++ +EE+EE  E+E EEE  
Sbjct: 108 YEADKLDEEQEERVEKEREEELA 130



 Score = 28.2 bits (63), Expect = 5.4
 Identities = 10/21 (47%), Positives = 15/21 (71%)

Query: 149 EEEEEEEEEEEEEEEEEEEEI 169
             E++EEEEEEEE+E   + +
Sbjct: 178 RREKKEEEEEEEEDEALIKSL 198



 Score = 28.2 bits (63), Expect = 5.5
 Identities = 11/20 (55%), Positives = 15/20 (75%)

Query: 152 EEEEEEEEEEEEEEEEEITA 171
             E++EEEEEEEE+E  I +
Sbjct: 178 RREKKEEEEEEEEDEALIKS 197


>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
          Length = 158

 Score = 28.6 bits (64), Expect = 2.9
 Identities = 9/31 (29%), Positives = 22/31 (70%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATL 173
           +E  +E EE+E  + ++E++E++E+++   L
Sbjct: 100 KEVSKETEEKEAIKAKKEKKEKKEKKVAEKL 130



 Score = 28.2 bits (63), Expect = 3.3
 Identities = 10/30 (33%), Positives = 19/30 (63%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
            EE E+E++E  +E EE+E  + ++E+   
Sbjct: 92  PEESEKEQKEVSKETEEKEAIKAKKEKKEK 121



 Score = 27.8 bits (62), Expect = 4.5
 Identities = 8/29 (27%), Positives = 20/29 (68%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
           E++++E  +E EE+E  + ++E++E+   
Sbjct: 96  EKEQKEVSKETEEKEAIKAKKEKKEKKEK 124



 Score = 27.4 bits (61), Expect = 7.3
 Identities = 10/31 (32%), Positives = 20/31 (64%)

Query: 138 YTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
              +  EKE++E  +E EE+E  + ++E++E
Sbjct: 90  LFPEESEKEQKEVSKETEEKEAIKAKKEKKE 120


>gnl|CDD|220376 pfam09745, DUF2040, Coiled-coil domain-containing protein 55
           (DUF2040).  This entry is a conserved domain of
           approximately 130 residues of proteins conserved from
           fungi to humans. The proteins do contain a coiled-coil
           domain, but the function is unknown.
          Length = 128

 Score = 28.1 bits (63), Expect = 2.9
 Identities = 14/26 (53%), Positives = 19/26 (73%)

Query: 145 KEEEEEEEEEEEEEEEEEEEEEEEIT 170
           K+ EE  + EEEE+E EE EEE ++T
Sbjct: 97  KQLEENRKLEEEEKEREELEEENDVT 122



 Score = 27.4 bits (61), Expect = 4.9
 Identities = 13/24 (54%), Positives = 17/24 (70%)

Query: 145 KEEEEEEEEEEEEEEEEEEEEEEE 168
           K++ EE  + EEEE+E EE EEE 
Sbjct: 96  KKQLEENRKLEEEEKEREELEEEN 119



 Score = 27.4 bits (61), Expect = 6.1
 Identities = 13/26 (50%), Positives = 17/26 (65%)

Query: 138 YTGKGEEKEEEEEEEEEEEEEEEEEE 163
           Y  + EE  + EEEE+E EE EEE +
Sbjct: 95  YKKQLEENRKLEEEEKEREELEEEND 120



 Score = 27.0 bits (60), Expect = 8.1
 Identities = 13/25 (52%), Positives = 15/25 (60%)

Query: 150 EEEEEEEEEEEEEEEEEEEITATLN 174
           EE  + EEEE+E EE EEE   T  
Sbjct: 100 EENRKLEEEEKEREELEEENDVTKG 124


>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin.  Trichoplein
           or mitostatin, was first defined as a meiosis-specific
           nuclear structural protein. It has since been linked
           with mitochondrial movement. It is associated with the
           mitochondrial outer membrane, and over-expression leads
           to reduction in mitochondrial motility whereas lack of
           it enhances mitochondrial movement. The activity appears
           to be mediated through binding the mitochondria to the
           actin intermediate filaments (IFs).
          Length = 349

 Score = 29.1 bits (66), Expect = 2.9
 Identities = 12/27 (44%), Positives = 14/27 (51%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
           E +  E EEE EE E   EEE E +  
Sbjct: 302 ERRAAEREEELEEGERLREEEAERQAR 328



 Score = 28.7 bits (65), Expect = 3.8
 Identities = 14/27 (51%), Positives = 16/27 (59%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
           EEKEEE   +E  EEE  +   EEEE 
Sbjct: 35  EEKEEERRIDEMMEEERLKALAEEEER 61



 Score = 28.3 bits (64), Expect = 5.0
 Identities = 15/26 (57%), Positives = 15/26 (57%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           EEKEE    E EEE EE E   EEE 
Sbjct: 298 EEKEERRAAEREEELEEGERLREEEA 323



 Score = 28.0 bits (63), Expect = 7.2
 Identities = 11/26 (42%), Positives = 15/26 (57%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           EE+  +E  EEE  +   EEEE E +
Sbjct: 39  EERRIDEMMEEERLKALAEEEERERK 64


>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa).  Members of this
           family of proteins are part of the yeast nuclear pore
           complex-associated pre-60S ribosomal subunit. The family
           functions as a highly conserved exonuclease that is
           required for the 5'-end maturation of 5.8S and 25S
           rRNAs, demonstrating that 5'-end processing also has a
           redundant pathway. Nop25 binds late pre-60S ribosomes,
           accompanying them from the nucleolus to the nuclear
           periphery; and there is evidence for both physical and
           functional links between late 60S subunit processing and
           export.
          Length = 134

 Score = 28.1 bits (63), Expect = 2.9
 Identities = 10/28 (35%), Positives = 19/28 (67%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           +  +  EEE ++EE+ E E+ E+ E++E
Sbjct: 74  EALKLLEEENDDEEDAETEDTEDVEDDE 101



 Score = 28.1 bits (63), Expect = 3.8
 Identities = 10/27 (37%), Positives = 18/27 (66%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
           +  EEE ++EE+ E E+ E+ E++E  
Sbjct: 77  KLLEEENDDEEDAETEDTEDVEDDEWE 103



 Score = 27.7 bits (62), Expect = 4.7
 Identities = 10/27 (37%), Positives = 18/27 (66%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
              +EE ++EE+ E E+ E+ E++E E
Sbjct: 77  KLLEEENDDEEDAETEDTEDVEDDEWE 103



 Score = 27.3 bits (61), Expect = 6.0
 Identities = 10/26 (38%), Positives = 17/26 (65%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           +E  +  EEE ++EE+ E E+ E+ E
Sbjct: 73  KEALKLLEEENDDEEDAETEDTEDVE 98



 Score = 26.9 bits (60), Expect = 8.9
 Identities = 11/27 (40%), Positives = 19/27 (70%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEE 167
           K  E+E ++EE+ E E+ E+ E++E E
Sbjct: 77  KLLEEENDDEEDAETEDTEDVEDDEWE 103


>gnl|CDD|218421 pfam05086, Dicty_REP, Dictyostelium (Slime Mold) REP protein.  This
           family consists of REP proteins from Dictyostelium
           (Slime molds). REP protein is likely involved in
           transcription regulation and control of DNA replication,
           specifically amplification of plasmid at low copy
           numbers. The formation of homomultimers may be required
           for their regulatory activity.
          Length = 910

 Score = 29.4 bits (66), Expect = 3.0
 Identities = 8/28 (28%), Positives = 21/28 (75%)

Query: 137 LYTGKGEEKEEEEEEEEEEEEEEEEEEE 164
            +    ++  +E+E+++E+E+E+E+E+E
Sbjct: 883 QFQENEDDDADEDEDQDEDEDEDEDEDE 910



 Score = 28.6 bits (64), Expect = 5.4
 Identities = 10/31 (32%), Positives = 25/31 (80%)

Query: 137 LYTGKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
           L +   E ++++ +E+E+++E+E+E+E+E+E
Sbjct: 880 LISQFQENEDDDADEDEDQDEDEDEDEDEDE 910


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 29.1 bits (66), Expect = 3.1
 Identities = 10/29 (34%), Positives = 22/29 (75%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
           EE+ +EE +E++EE+++EE E +  +++ 
Sbjct: 277 EEERQEEAQEKKEEKKKEEREAKLAKLSP 305



 Score = 28.4 bits (64), Expect = 5.3
 Identities = 14/30 (46%), Positives = 22/30 (73%)

Query: 146 EEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
            EEE +EE +E++EE+++EE E   A L+P
Sbjct: 276 AEEERQEEAQEKKEEKKKEEREAKLAKLSP 305


>gnl|CDD|236154 PRK08119, PRK08119, flagellar motor switch protein; Validated.
          Length = 382

 Score = 29.1 bits (66), Expect = 3.2
 Identities = 11/37 (29%), Positives = 15/37 (40%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
            G+ EE+EEE EEEE +     E    +         
Sbjct: 224 LGEEEEEEEEVEEEEAQASPAAEPATAQAAPAPKQEQ 260


>gnl|CDD|222762 pfam14456, alpha-hel2, Alpha-helical domain 2.  An alpha-helical
           domain found in gene neighborhoods encoding genes
           containing bacterial homologs of components of the
           ubiquitin modification pathway such as the E1, E2, Ub
           and JAB peptidase proteins.
          Length = 322

 Score = 29.0 bits (65), Expect = 3.3
 Identities = 14/48 (29%), Positives = 19/48 (39%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPCGLCRILPVMVHIV 189
           GE+ EEE  E  E E E++EE   EE  +            P +    
Sbjct: 139 GEDDEEEALEVLEGEGEDDEEAAAEEYFSRVRPALAPDDGHPHLFKWG 186


>gnl|CDD|130712 TIGR01651, CobT, cobaltochelatase, CobT subunit.  This model
           describes Pseudomonas denitrificans CobT gene product,
           which is a cobalt chelatase subunit that functions in
           cobalamin biosynthesis. Cobalamin (vitamin B12) can be
           synthesized via several pathways, including an aerobic
           pathway (found in Pseudomonas denitrificans) and an
           anaerobic pathway (found in P. shermanii and Salmonella
           typhimurium). These pathways differ in the point of
           cobalt insertion during corrin ring formation. There are
           apparently a number of variations on these two pathways,
           where the major differences seem to be concerned with
           the process of ring contraction. Confusion regarding the
           functions of enzymes found in the aerobic vs. anaerobic
           pathways has arisen because nonhomologous genes in these
           different pathways were given the same gene symbols.
           Thus, cobT in the aerobic pathway (P. denitrificans) is
           not a homolog of cobT in the anaerobic pathway (S.
           typhimurium). It should be noted that E. coli
           synthesizes cobalamin only when it is supplied with the
           precursor cobinamide, which is a complex intermediate.
           Additionally, all E. coli cobalamin synthesis genes
           (cobU, cobS and cobT) were named after their Salmonella
           typhimurium homologs which function in the anaerobic
           cobalamin synthesis pathway. This model describes the
           aerobic cobalamin pathway Pseudomonas denitrificans CobT
           gene product, which is a cobalt chelatase subunit, with
           a MW ~70 kDa. The aerobic pathway cobalt chelatase is a
           heterotrimeric, ATP-dependent enzyme that catalyzes
           cobalt insertion during cobalamin biosynthesis. The
           other two subunits are the P. denitrificans CobS
           (TIGR01650) and CobN (pfam02514 CobN/Magnesium
           Chelatase) proteins. To avoid potential confusion with
           the nonhomologous Salmonella typhimurium/E.coli cobT
           gene product, the P. denitrificans gene symbol is not
           used in the name of this model [Biosynthesis of
           cofactors, prosthetic groups, and carriers, Heme,
           porphyrin, and cobalamin].
          Length = 600

 Score = 29.1 bits (65), Expect = 3.4
 Identities = 11/29 (37%), Positives = 19/29 (65%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           G   E E+EE+ ++++  E E+EE+ E E
Sbjct: 204 GDDTESEDEEDGDDDQPTENEQEEQGEGE 232


>gnl|CDD|191365 pfam05756, S-antigen, S-antigen protein.  S-antigens are heat
           stable proteins that are found in the blood of
           individuals infected with malaria.
          Length = 94

 Score = 27.6 bits (61), Expect = 3.4
 Identities = 12/30 (40%), Positives = 19/30 (63%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
           GKG++ E+ +EE E E ++EE+   EE   
Sbjct: 52  GKGQKYEDLQEEGEGENDDEEDSNSEESNN 81


>gnl|CDD|235366 PRK05218, PRK05218, heat shock protein 90; Provisional.
          Length = 613

 Score = 28.9 bits (66), Expect = 3.5
 Identities = 14/36 (38%), Positives = 22/36 (61%), Gaps = 2/36 (5%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEE--EEITATLN 174
           +G+    +E+EEE+EE+EE EEE +   E +   L 
Sbjct: 471 RGDLDLGKEDEEEKEEKEEAEEEFKPLLERLKEALG 506


>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
          Length = 1068

 Score = 29.2 bits (66), Expect = 3.5
 Identities = 14/59 (23%), Positives = 26/59 (44%), Gaps = 10/59 (16%)

Query: 110 NEVERTGRKPWPQKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
            E  RT  +         R + +RR      + +EK + ++E +    EE+  +E E+E
Sbjct: 662 TEKARTQDEQQQA----PRRERQRR------RNDEKRQAQQEAKALNVEEQSVQETEQE 710


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 29.0 bits (65), Expect = 3.6
 Identities = 9/26 (34%), Positives = 18/26 (69%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           +E +  EE E + +EE+ ++EE E++
Sbjct: 390 DETDASEEAEAKAKEEKLKQEENEKK 415



 Score = 28.2 bits (63), Expect = 5.4
 Identities = 8/28 (28%), Positives = 19/28 (67%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
               +E E + +EE+ ++EE E++++E+
Sbjct: 392 TDASEEAEAKAKEEKLKQEENEKKQKEQ 419


>gnl|CDD|237628 PRK14156, PRK14156, heat shock protein GrpE; Provisional.
          Length = 177

 Score = 28.3 bits (63), Expect = 3.6
 Identities = 15/26 (57%), Positives = 15/26 (57%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           E K EE EEE E  E EE  EE  EE
Sbjct: 4   EIKNEEVEEEVEATETEETVEEVVEE 29



 Score = 28.3 bits (63), Expect = 3.7
 Identities = 15/27 (55%), Positives = 15/27 (55%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEE 167
           K EE EEE E  E EE  EE  EE  E
Sbjct: 6   KNEEVEEEVEATETEETVEEVVEETPE 32



 Score = 28.3 bits (63), Expect = 4.4
 Identities = 15/32 (46%), Positives = 17/32 (53%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
           EE + EE EEE E  E EE  EE  E T   +
Sbjct: 3   EEIKNEEVEEEVEATETEETVEEVVEETPEKS 34


>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein.  The proteins in this family
           are designated YL1. These proteins have been shown to be
           DNA-binding and may be a transcription factor.
          Length = 238

 Score = 28.5 bits (64), Expect = 3.6
 Identities = 15/24 (62%), Positives = 19/24 (79%)

Query: 146 EEEEEEEEEEEEEEEEEEEEEEEI 169
           EEEE++EE E EEEEEEEE + + 
Sbjct: 35  EEEEDDEEFEIEEEEEEEEVDSDF 58



 Score = 28.5 bits (64), Expect = 4.0
 Identities = 18/35 (51%), Positives = 22/35 (62%), Gaps = 9/35 (25%)

Query: 144 EKEEEEEE---------EEEEEEEEEEEEEEEEEI 169
           E+E EE+E         EEE++EE E EEEEEEE 
Sbjct: 19  EEELEEDEFFWTYLLFEEEEDDEEFEIEEEEEEEE 53



 Score = 28.1 bits (63), Expect = 5.1
 Identities = 15/34 (44%), Positives = 23/34 (67%), Gaps = 6/34 (17%)

Query: 142 GEEKEEEEEEE------EEEEEEEEEEEEEEEEI 169
            EE+EEEEE +      E++E E ++EEE E+E+
Sbjct: 45  IEEEEEEEEVDSDFDDSEDDEPESDDEEEGEKEL 78



 Score = 28.1 bits (63), Expect = 6.0
 Identities = 14/22 (63%), Positives = 18/22 (81%)

Query: 145 KEEEEEEEEEEEEEEEEEEEEE 166
           +EEE++EE E EEEEEEEE + 
Sbjct: 35  EEEEDDEEFEIEEEEEEEEVDS 56



 Score = 27.3 bits (61), Expect = 8.7
 Identities = 10/23 (43%), Positives = 17/23 (73%)

Query: 146 EEEEEEEEEEEEEEEEEEEEEEE 168
           + E++E E ++EEE E+E + EE
Sbjct: 60  DSEDDEPESDDEEEGEKELQREE 82


>gnl|CDD|234055 TIGR02907, spore_VI_D, stage VI sporulation protein D.  SpoVID, the
           stage VI sporulation protein D, is restricted to
           endospore-forming members of the bacteria, all of which
           are found among the Firmicutes. It is widely distributed
           but not quite universal in this group. Between
           well-conserved N-terminal and C-terminal domains is a
           poorly conserved, low-complexity region of variable
           length, rich enough in glutamic acid to cause spurious
           BLAST search results unless a filter is used. The seed
           alignment for this model was trimmed, in effect, by
           choosing member sequences in which these regions are
           relatively short. SpoVID is involved in spore coat
           assembly by the mother cell compartment late in the
           process of sporulation [Cellular processes, Sporulation
           and germination].
          Length = 338

 Score = 28.7 bits (64), Expect = 3.6
 Identities = 13/30 (43%), Positives = 17/30 (56%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
           E   EEE  EE E+E E E E+EE+ +   
Sbjct: 215 EASPEEENYEEYEDETELEVEDEEKALDEQ 244


>gnl|CDD|232958 TIGR00399, metG_C_term, methionyl-tRNA synthetase C-terminal
           region/beta chain.  The methionyl-tRNA synthetase (metG)
           is a class I amino acyl-tRNA ligase. This model
           describes a region of the methionyl-tRNA synthetase that
           is present at the C-terminus of MetG in some species (E.
           coli, B. subtilis, Thermotoga maritima, Methanobacterium
           thermoautotrophicum), and as a separate beta chain in
           Aquifex aeolicus. It is absent in a number of other
           species (e.g. Mycoplasma genitalium, Mycobacterium
           tuberculosis), while Pyrococcus horikoshii has both a
           full length MetG and a second protein homologous to the
           beta chain only. Proteins hit by This model should
           called methionyl-tRNA synthetase beta chain if and only
           if the model metG hits a separate protein not also hit
           by This model [Protein synthesis, tRNA aminoacylation].
          Length = 137

 Score = 27.8 bits (62), Expect = 3.7
 Identities = 9/28 (32%), Positives = 19/28 (67%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
           E K +  +++E+++E E+  E ++E IT
Sbjct: 6   ELKLKGAKKKEKKDEGEKALEPQKETIT 33


>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO.  Bacillus
           spores are protected by a protein shell consisting of
           over 50 different polypeptides, known as the coat. This
           family of proteins has an important morphogenetic role
           in coat assembly, it is involved in the assembly of at
           least 5 different coat proteins including CotB, CotG,
           CotS, CotSA and CotW. It is likely to act at a late
           stage of coat assembly.
          Length = 185

 Score = 28.3 bits (63), Expect = 3.8
 Identities = 19/75 (25%), Positives = 36/75 (48%)

Query: 95  QKKYKWVRFDHTKVTNEVERTGRKPWPQKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEE 154
           QK +   + D  +   E      K   ++  IE  + ++       + E+  E++E+EE 
Sbjct: 28  QKTFIIKKADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEI 87

Query: 155 EEEEEEEEEEEEEEI 169
            +EEE+EEE E+ + 
Sbjct: 88  AQEEEKEEEAEDVKQ 102



 Score = 27.5 bits (61), Expect = 7.4
 Identities = 11/35 (31%), Positives = 21/35 (60%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
           + +  E++E+EE  +EEE+EEE E+ ++       
Sbjct: 75  REDIAEQQEKEEIAQEEEKEEEAEDVKQQEVFSFK 109


>gnl|CDD|235378 PRK05256, PRK05256, condesin subunit E; Provisional.
          Length = 238

 Score = 28.4 bits (64), Expect = 3.8
 Identities = 10/27 (37%), Positives = 16/27 (59%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEE 165
             +   ++E    +E EEE+ +EEEEE
Sbjct: 212 PIENHLQQESSLNDETEEEQPDEEEEE 238



 Score = 28.0 bits (63), Expect = 5.2
 Identities = 11/25 (44%), Positives = 16/25 (64%)

Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEE 168
           E   ++E    +E EEE+ +EEEEE
Sbjct: 214 ENHLQQESSLNDETEEEQPDEEEEE 238


>gnl|CDD|185618 PTZ00438, PTZ00438, gamete antigen 27/25-like protein; Provisional.
          Length = 374

 Score = 28.9 bits (64), Expect = 3.8
 Identities = 17/28 (60%), Positives = 21/28 (75%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           +G +KEEEE+E+ EE EE EE E  EEE
Sbjct: 103 RGTQKEEEEDEDVEEIEEVEEVEVVEEE 130


>gnl|CDD|218114 pfam04501, Baculo_VP39, Baculovirus major capsid protein VP39.
           This family constitutes the 39 kDa major capsid protein
           of the Baculoviridae.
          Length = 300

 Score = 28.5 bits (64), Expect = 3.9
 Identities = 9/29 (31%), Positives = 15/29 (51%), Gaps = 1/29 (3%)

Query: 218 PKYLENLVEERCWGPSVLFVDNTDYMERN 246
           P +++NL+  +   P  L +DN     RN
Sbjct: 197 PPFIKNLIN-KLVAPETLTIDNETLRLRN 224


>gnl|CDD|221746 pfam12739, TRAPPC-Trs85, ER-Golgi trafficking TRAPP I complex 85
           kDa subunit.  This family is one of the subunits of the
           TRAPP Golgi trafficking complex. TRAPP subunits are
           found in two different sized complexes, TRAPP I and
           TRAPP II, and this Trs85 is in the smaller complex.
           TRAPP I, but Not TRAPP II, functions in ER-Golgi
           transport. Trs85p was reported to function in the
           cytosol-to-vacuole targeting pathway, suggesting a role
           for this subunit in autophagy as well as in secretion.
           The overall architecture of TRAPP I shows the other
           components to be Bet3p (TRAPPC3), Bet5p (TRAPPC1),
           Trs20p (TRAPPC2), Trs23p (TRAPPC4), Trs31p (TRAPPC5),
           Trs33p (TRAPPC6a and b) and Trs85p.
          Length = 390

 Score = 28.9 bits (65), Expect = 4.0
 Identities = 22/115 (19%), Positives = 40/115 (34%), Gaps = 21/115 (18%)

Query: 152 EEEEEEEEEEEEEEEEEITATLNPCGLCRILPV--------MVHIVVGLTSTLSIKLMQD 203
           +E + ++EE  +   E++  T   C  C +L +         +          S      
Sbjct: 40  DESDGDDEERSQALLEQLKRTFGLCN-CHLLRLNSSSSESAALTDDDPPDEWSSALEDLQ 98

Query: 204 DLHIVKDLV--LPVDDPKYLENLVEERCWGPSVLFVDNT--DYMERNISIVTDQI 254
           ++         L V D + +   V E        FV  +   +MER +  + DQI
Sbjct: 99  EISSAGPEGVYLSVSDVEAIRAFVRE--------FVTQSLIPFMERKVRFLNDQI 145


>gnl|CDD|216831 pfam01988, VIT1, VIT family.  This family includes the vacuolar
           Fe2+/Mn2+ uptake transporter, Ccc1 and the vacuolar iron
           transporter VIT1.
          Length = 209

 Score = 28.4 bits (64), Expect = 4.0
 Identities = 15/38 (39%), Positives = 18/38 (47%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPCGL 178
           K E   EE E E E+ E E + E E EE+       GL
Sbjct: 57  KSERDLEEAELEREKRELENDPEGEREELAEIYRARGL 94


>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
          Length = 651

 Score = 28.8 bits (65), Expect = 4.0
 Identities = 8/33 (24%), Positives = 21/33 (63%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
           E  +E++   + +++EE++++E E+   A + P
Sbjct: 561 EALKEQKRLRKLKKQEEKKKKELEKLEKAKIPP 593


>gnl|CDD|241523 cd13372, PH_CAPRI, Ca2+ promoted Ras inactivator pleckstrin
           homology (PH) domain.  CAPRI (also called RASA4/RAS p21
           protein activator (GTPase activating protein)
           4/GAPL/FLJ59070/KIAA0538/MGC131890) is a member of the
           GAP1 family of GTPase-activating proteins. CAPRI
           contains two fully conserved C2 domains, a PH domain, a
           RasGAP domain, and a BTK domain. Its catalytic GAP
           domain has dual RasGAP and RapGAP activities, while its
           C2 domains bind phospholipids in the presence of Ca2+.
           Both CAPRI and RASAL are calcium-activated RasGAPs that
           inactivate Ras at the plasma membrane. Thereby enhancing
           the weak intrinsic GTPase activity of RAS proteins
           resulting in the inactive GDP-bound form of RAS and
           allowing control of cellular proliferation and
           differentiation. CAPRI and RASAL differ in that CAPRI is
           an amplitude sensor while RASAL senses calcium
           oscillations. This difference between them resides not
           in their C2 domains, but in their PH domains leading to
           speculation that this might reflect an association with
           either phosphoinositides and/or proteins. PH domains
           have diverse functions, but in general are involved in
           targeting proteins to the appropriate cellular location
           or in the interaction with a binding partner. They share
           little sequence conservation, but all have a common
           fold, which is electrostatically polarized. Less than
           10% of PH domains bind phosphoinositide phosphates
           (PIPs) with high affinity and specificity. PH domains
           are distinguished from other PIP-binding domains by
           their specific high-affinity binding to PIPs with two
           vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2
           or PtdIns(3,4,5)P3 which results in targeting some PH
           domain proteins to the plasma membrane. A few display
           strong specificity in lipid binding. Any specificity is
           usually determined by loop regions or insertions in the
           N-terminus of the domain, which are not conserved across
           all PH domains. PH domains are found in cellular
           signaling proteins such as serine/threonine kinase,
           tyrosine kinases, regulators of G-proteins, endocytotic
           GTPases, adaptors, as well as cytoskeletal associated
           molecules and in lipid associated enzymes.
          Length = 140

 Score = 27.9 bits (62), Expect = 4.1
 Identities = 23/101 (22%), Positives = 42/101 (41%), Gaps = 19/101 (18%)

Query: 162 EEEEEEEITATLNPCG-LCRILPVMVHIVVG----LTST---LSIKLMQDDLHIVKDL-- 211
           EEE+E ++T  L     + +   + +H   G    + S+   L   L +D L   K    
Sbjct: 9   EEEDELDLTRMLLLQAPMVKEGFLFIHRTKGKGPLMASSFKKLYFTLTKDALSFAKTPHS 68

Query: 212 ----VLPVDDPKYLENLVEERCWGPS----VLFVDNTDYME 244
                + +   +  E  VEE+C+G S    +++ D+    E
Sbjct: 69  KKSSSISLAKIRAAEK-VEEKCFGSSNVMQIIYTDDAGQQE 108


>gnl|CDD|192445 pfam10018, Med4, Vitamin-D-receptor interacting Mediator subunit 4.
            Members of this family function as part of the Mediator
           (Med) complex, which links DNA-bound transcriptional
           regulators and the general transcription machinery,
           particularly the RNA polymerase II enzyme. They play a
           role in basal transcription by mediating activation or
           repression according to the specific complement of
           transcriptional regulators bound to the promoter.
          Length = 176

 Score = 28.1 bits (63), Expect = 4.2
 Identities = 18/64 (28%), Positives = 28/64 (43%), Gaps = 6/64 (9%)

Query: 114 RTGRKPWPQ-----KGLIERL-DGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
               +PWP      +GL+ +     +      G   E+EEE +  E E + EEE+E E +
Sbjct: 113 GDLFRPWPAEDEMRRGLLAQSSLLSKEGIEPPGFDPEEEEERKAREAEAKTEEEDEREAQ 172

Query: 168 EITA 171
           E   
Sbjct: 173 ERNQ 176


>gnl|CDD|227880 COG5593, COG5593, Nucleic-acid-binding protein possibly involved in
           ribosomal biogenesis [Translation, ribosomal structure
           and biogenesis].
          Length = 821

 Score = 28.9 bits (64), Expect = 4.4
 Identities = 13/25 (52%), Positives = 18/25 (72%)

Query: 148 EEEEEEEEEEEEEEEEEEEEEITAT 172
            E  EE+E  EEE+EEEE +E++A 
Sbjct: 768 AESSEEDESSEEEKEEEENKEVSAK 792


>gnl|CDD|214818 smart00784, SPT2, SPT2 chromatin protein.  This entry includes the
           Saccharomyces cerevisiae protein SPT2 which is a
           chromatin protein involved in transcriptional
           regulation.
          Length = 106

 Score = 27.3 bits (61), Expect = 4.4
 Identities = 9/20 (45%), Positives = 13/20 (65%)

Query: 145 KEEEEEEEEEEEEEEEEEEE 164
           + E+ EEE  E+EEE E+  
Sbjct: 83  RLEDREEERLEKEEEREKRA 102



 Score = 27.0 bits (60), Expect = 5.6
 Identities = 11/31 (35%), Positives = 16/31 (51%), Gaps = 6/31 (19%)

Query: 144 EKEEE------EEEEEEEEEEEEEEEEEEEE 168
           ++EE         E+ EEE  E+EEE E+  
Sbjct: 72  QEEERRSARLARLEDREEERLEKEEEREKRA 102



 Score = 26.2 bits (58), Expect = 10.0
 Identities = 13/33 (39%), Positives = 18/33 (54%), Gaps = 6/33 (18%)

Query: 142 GEEKEEEEE------EEEEEEEEEEEEEEEEEE 168
           G + +EEE         E+ EEE  E+EEE E+
Sbjct: 68  GADIQEEERRSARLARLEDREEERLEKEEEREK 100


>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein.  This family includes proteins
           related to Mpp10 (M phase phosphoprotein 10). The U3
           small nucleolar ribonucleoprotein (snoRNP) is required
           for three cleavage events that generate the mature 18S
           rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
           depletion of Mpp10, a U3 snoRNP-specific protein, halts
           18S rRNA production and impairs cleavage at the three U3
           snoRNP-dependent sites.
          Length = 613

 Score = 28.8 bits (64), Expect = 4.4
 Identities = 10/27 (37%), Positives = 17/27 (62%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
            E E   E+  ++E E++EE++ EEE 
Sbjct: 142 NESESSLEDLSDDETEDDEEKKMEEEE 168



 Score = 28.4 bits (63), Expect = 5.2
 Identities = 12/29 (41%), Positives = 22/29 (75%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
           + + K EEE+EE++E+E++++EEE  E  
Sbjct: 288 EADSKPEEEDEEDDEQEDDQDEEEPPEAA 316



 Score = 28.4 bits (63), Expect = 5.9
 Identities = 11/29 (37%), Positives = 19/29 (65%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
            E   ++E E++EE++ EEEE  EE+ + 
Sbjct: 148 LEDLSDDETEDDEEKKMEEEEAGEEKESV 176



 Score = 28.4 bits (63), Expect = 6.3
 Identities = 10/28 (35%), Positives = 17/28 (60%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
            E   E+  ++E E++EE++ EEEE   
Sbjct: 144 SESSLEDLSDDETEDDEEKKMEEEEAGE 171



 Score = 28.0 bits (62), Expect = 6.6
 Identities = 10/29 (34%), Positives = 19/29 (65%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
            +   ++  ++E E++EE++ EEEE  EE
Sbjct: 144 SESSLEDLSDDETEDDEEKKMEEEEAGEE 172



 Score = 27.6 bits (61), Expect = 9.2
 Identities = 10/27 (37%), Positives = 18/27 (66%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           G +  + + E+  ++EEEEEE+E  E+
Sbjct: 103 GSDGSDMDSEDSADDEEEEEEDESLED 129


>gnl|CDD|217509 pfam03353, Lin-8, Ras-mediated vulval-induction antagonist.  LIN-8
           is a nuclear protein, present at the sites of
           transcriptional repressor complexes, which interacts
           with LIN-35 Rb.Lin35 Rb is a product of the class B
           synMuv gene lin-35 which silences genes required for
           vulval specification through chromatin modification and
           remodelling. The biological role of the interaction has
           not yet been determined however predictions have been
           made. The interaction shows that class A synMuv genes
           control vulval induction through the transcriptional
           regulation of gene expression. LIN-8 normally functions
           as part of a protein complex however when the complex is
           absent, other family members can partially replace LIN-8
           activity.
          Length = 316

 Score = 28.5 bits (64), Expect = 4.5
 Identities = 11/36 (30%), Positives = 19/36 (52%)

Query: 137 LYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
           +Y G  E++  E ++ EE +E +E E EE  +    
Sbjct: 145 IYDGIFEDRTRESQDMEENDEVDEVEVEEVPDDYGA 180


>gnl|CDD|217827 pfam03979, Sigma70_r1_1, Sigma-70 factor, region 1.1.  Region 1.1
           modulates DNA binding by region 2 and 4 when sigma is
           unbound by the core RNA polymerase. Region 1.1 is also
           involved in promoter binding.
          Length = 79

 Score = 26.9 bits (60), Expect = 4.5
 Identities = 11/23 (47%), Positives = 15/23 (65%)

Query: 146 EEEEEEEEEEEEEEEEEEEEEEE 168
           EE + EE +EE   +E+ EEE E
Sbjct: 57  EEADSEELDEETSSDEDAEEEAE 79



 Score = 26.1 bits (58), Expect = 6.9
 Identities = 11/23 (47%), Positives = 15/23 (65%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEE 165
           EE + EE +EE   +E+ EEE E
Sbjct: 57  EEADSEELDEETSSDEDAEEEAE 79



 Score = 26.1 bits (58), Expect = 8.0
 Identities = 10/23 (43%), Positives = 15/23 (65%)

Query: 144 EKEEEEEEEEEEEEEEEEEEEEE 166
           E+ + EE +EE   +E+ EEE E
Sbjct: 57  EEADSEELDEETSSDEDAEEEAE 79


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 28.9 bits (65), Expect = 4.6
 Identities = 15/34 (44%), Positives = 21/34 (61%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
            G   E+ E+E EE E+ +EE EEE  +ITA + 
Sbjct: 382 TGLTPEKLEKELEELEKAKEEIEEEISKITARIG 415


>gnl|CDD|234064 TIGR02936, fdxN_nitrog, ferredoxin III, nif-specific.  Members of
           this family are homodimeric ferredoxins from nitrogen
           fixation regions of many nitrogen-fixing bacteria. As
           characterized in Rhodobacter capsulatus, these proteins
           are homodimeric, with two 4Fe-4S clusters bound per
           monomer. Although nif-specific, this protein family is
           not usiveral, as other nitrogenase systems may
           substitute flavodoxins, or different types of ferredoxin
           [Central intermediary metabolism, Nitrogen fixation].
          Length = 91

 Score = 27.0 bits (60), Expect = 4.8
 Identities = 12/43 (27%), Positives = 21/43 (48%), Gaps = 6/43 (13%)

Query: 147 EEEEEEEEEEEEEEEEEEEEEEITATLNP-----CGLC-RILP 183
           +   EE E    +++++E E ++    NP     CG C R+ P
Sbjct: 41  KGINEEGELVASDDDDDEIERKVMVVANPGNCIGCGACARVCP 83


>gnl|CDD|215637 PLN03223, PLN03223, Polycystin cation channel protein; Provisional.
          Length = 1634

 Score = 28.8 bits (64), Expect = 4.8
 Identities = 22/67 (32%), Positives = 32/67 (47%), Gaps = 6/67 (8%)

Query: 114  RTGRKPWPQKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEE----EEEEEEEEEI 169
            R+  K W  K  I   + + R +L   KGE  +EEEEE   EE+E+      +E +E  +
Sbjct: 1448 RSMFKGWFYKNHIP--EARVRRQLRIWKGENPDEEEEEAFREEKEKVFTYLNKELDEAGL 1505

Query: 170  TATLNPC 176
               L  C
Sbjct: 1506 KRVLRRC 1512


>gnl|CDD|177755 PLN00152, PLN00152, DNA-directed RNA polymerase; Provisional.
          Length = 130

 Score = 27.5 bits (61), Expect = 4.9
 Identities = 11/28 (39%), Positives = 13/28 (46%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
           G E E  E E EE  EE+ E    +E  
Sbjct: 8   GYEDEPSEPEIEEGAEEDSESNNNDEVK 35



 Score = 27.1 bits (60), Expect = 5.8
 Identities = 9/32 (28%), Positives = 16/32 (50%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
             ++E  E E EE  EE+ E    +E+  ++ 
Sbjct: 8   GYEDEPSEPEIEEGAEEDSESNNNDEVKGSIG 39


>gnl|CDD|235449 PRK05415, PRK05415, hypothetical protein; Provisional.
          Length = 341

 Score = 28.3 bits (64), Expect = 5.0
 Identities = 13/30 (43%), Positives = 16/30 (53%)

Query: 146 EEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
           E+E       +EEEEEE E E  + A L P
Sbjct: 32  EQEAFAPAAPDEEEEEEGELEAAVEAALRP 61


>gnl|CDD|177433 PHA02608, 67, prohead core protein; Provisional.
          Length = 80

 Score = 26.7 bits (59), Expect = 5.2
 Identities = 5/27 (18%), Positives = 26/27 (96%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           GEE E+++++E++++++++++++++++
Sbjct: 47  GEEPEDDDDDEDDDDDDDKDDKDDDDD 73


>gnl|CDD|235548 PRK05657, PRK05657, RNA polymerase sigma factor RpoS; Validated.
          Length = 325

 Score = 28.4 bits (64), Expect = 5.3
 Identities = 8/32 (25%), Positives = 13/32 (40%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
           E  +E+   EEE  + +  EEE   +      
Sbjct: 20  EVFDEKALVEEEPSDNDLAEEELLSQGATQRV 51


>gnl|CDD|227615 COG5296, COG5296, Transcription factor involved in TATA site
           selection and in elongation by RNA polymerase II
           [Transcription].
          Length = 521

 Score = 28.5 bits (63), Expect = 5.3
 Identities = 11/32 (34%), Positives = 19/32 (59%)

Query: 133 RRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEE 164
           R  RLY+ +  E +  ++ +E EE E+  +EE
Sbjct: 157 REERLYSERHIELQRFKDYKELEESEQGLQEE 188


>gnl|CDD|240413 PTZ00423, PTZ00423, glideosome-associated protein 45; Provisional.
          Length = 193

 Score = 27.8 bits (61), Expect = 5.4
 Identities = 13/27 (48%), Positives = 20/27 (74%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
            E+ E++ EE  E+EEE EE++ +EEI
Sbjct: 43  VEELEDQPEEPPEQEEENEEQKPKEEI 69



 Score = 27.4 bits (60), Expect = 8.4
 Identities = 13/26 (50%), Positives = 18/26 (69%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           E+K E+  EE E++ EE  E+EEE E
Sbjct: 36  EQKPEDIVEELEDQPEEPPEQEEENE 61


>gnl|CDD|233117 TIGR00759, aceE, pyruvate dehydrogenase E1 component, homodimeric
           type.  Most members of this family are pyruvate
           dehydrogenase complex, E1 component. Note: this family
           was classified as subfamily rather than equivalog
           because it includes a counterexample from Pseudomonas
           putida, MdeB, that is active as an E1 component of an
           alpha-ketoglutarate dehydrogenase complex rather than a
           pyruvate dehydrogase complex. The second pyruvate
           dehydrogenase complex E1 protein from Alcaligenes
           eutrophus, PdhE, complements an aceE mutant of E. coli
           but is not part of a pyruvate dehydrogenase complex
           operon, is more similar to the Pseudomonas putida MdeB
           than to E. coli AceE, and may have also have a different
           primary specificity.
          Length = 885

 Score = 28.6 bits (64), Expect = 5.4
 Identities = 16/51 (31%), Positives = 22/51 (43%), Gaps = 1/51 (1%)

Query: 249 IVTDQIKHYNIMPAYGENNMLKHDTLVITVDALKQIEDKLLLALNRSDNER 299
           I+   IK Y  M    E+    H    + VDALK   D+  L L+ +  E 
Sbjct: 383 ILAKTIKGYG-MGDAAESRNTAHQVKKLEVDALKNFRDRFELPLSDAQVEE 432


>gnl|CDD|240253 PTZ00068, PTZ00068, 60S ribosomal protein L13a; Provisional.
          Length = 202

 Score = 28.1 bits (63), Expect = 5.4
 Identities = 11/60 (18%), Positives = 25/60 (41%), Gaps = 3/60 (5%)

Query: 120 WPQKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPCGLC 179
           W    ++ +L+ KR+ R        K++ +  +  +E  ++   +  + I A L   G  
Sbjct: 146 WKYADVVAKLEEKRKER---AAAYYKKKVKLRKAWKEARKKALAKLPKAIVAVLKKFGYA 202


>gnl|CDD|224559 COG1645, COG1645, Uncharacterized Zn-finger containing protein
           [General function prediction only].
          Length = 131

 Score = 27.4 bits (61), Expect = 5.4
 Identities = 12/30 (40%), Positives = 14/30 (46%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
           G  E   EEEEEE E E +E+      E  
Sbjct: 51  GYREVVVEEEEEEVEAEVQEQLRRSRPELP 80


>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
          Length = 1832

 Score = 28.5 bits (64), Expect = 5.5
 Identities = 13/45 (28%), Positives = 27/45 (60%), Gaps = 6/45 (13%)

Query: 131 GKRRPRLYTGKGEEK------EEEEEEEEEEEEEEEEEEEEEEEI 169
            K+  R + GK  E       EE+ ++++++E+++EEE + EE+ 
Sbjct: 721 MKKIFRNWNGKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKA 765



 Score = 27.8 bits (62), Expect = 9.8
 Identities = 9/30 (30%), Positives = 21/30 (70%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
           E+ ++++++E+++EEE + EE+   EI   
Sbjct: 743 EKTKKKKKKEKKKEEEYKREEKARIEIAEA 772


>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
          Length = 880

 Score = 28.5 bits (64), Expect = 5.6
 Identities = 19/48 (39%), Positives = 22/48 (45%), Gaps = 6/48 (12%)

Query: 127 ERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
           ER +  R       +  E E E EE+ E   E EEE EE  E  A LN
Sbjct: 537 ERAEELRE------RAAELEAEAEEKREAAAEAEEEAEEAREEVAELN 578


>gnl|CDD|215369 PLN02685, PLN02685, iron superoxide dismutase.
          Length = 299

 Score = 28.0 bits (62), Expect = 5.6
 Identities = 12/31 (38%), Positives = 17/31 (54%)

Query: 136 RLYTGKGEEKEEEEEEEEEEEEEEEEEEEEE 166
           RL + K    + E+EE   E+EEE + E  E
Sbjct: 256 RLESAKARAAQREQEETRTEDEEEPDSEAVE 286


>gnl|CDD|217148 pfam02620, DUF177, Uncharacterized ACR, COG1399. 
          Length = 118

 Score = 27.3 bits (61), Expect = 5.7
 Identities = 13/24 (54%), Positives = 18/24 (75%)

Query: 146 EEEEEEEEEEEEEEEEEEEEEEEI 169
           EEEE E+EE E+++EE   E +EI
Sbjct: 35  EEEEAEDEELEDDDEEILVEGDEI 58


>gnl|CDD|222662 pfam14300, DUF4375, Domain of unknown function (DUF4375).  This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 156 and 204 amino
           acids in length. There is a single completely conserved
           residue G that may be functionally important.
          Length = 123

 Score = 27.2 bits (61), Expect = 5.8
 Identities = 13/30 (43%), Positives = 16/30 (53%)

Query: 137 LYTGKGEEKEEEEEEEEEEEEEEEEEEEEE 166
           LY   GE  E EE +EE     E+ EE +E
Sbjct: 67  LYGKHGEPIEREERDEEFMALFEQFEEFDE 96


>gnl|CDD|191675 pfam07047, OPA3, Optic atrophy 3 protein (OPA3).  This family
           consists of several optic atrophy 3 (OPA3) proteins.
           OPA3 deficiency causes type III 3-methylglutaconic
           aciduria (MGA) in humans. This disease manifests with
           early bilateral optic atrophy, spasticity,
           extrapyramidal dysfunction, ataxia, and cognitive
           deficits, but normal longevity.
          Length = 134

 Score = 27.2 bits (61), Expect = 5.8
 Identities = 14/32 (43%), Positives = 17/32 (53%), Gaps = 1/32 (3%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEE-EEEEEEITA 171
           K  +KEEE ++E EE E    E E E E   A
Sbjct: 102 KEAKKEEELQQELEELEARVGELELEIERQKA 133


>gnl|CDD|236410 PRK09202, nusA, transcription elongation factor NusA; Validated.
          Length = 470

 Score = 28.3 bits (64), Expect = 5.8
 Identities = 23/90 (25%), Positives = 36/90 (40%), Gaps = 15/90 (16%)

Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPCGLCRILPVMVHIVVGLTSTLSI 198
            G  EE  EE  E  +E  E E   +EE+     L+              + GL   L+ 
Sbjct: 395 EGFDEETVEELRERAKEALETEALAQEEKLADDLLS--------------LEGLDRELAF 440

Query: 199 KLMQDDLHIVKDLV-LPVDDPKYLENLVEE 227
           KL +  +  ++DL    VD+   +E   E+
Sbjct: 441 KLAEKGIKTLEDLAEQAVDELIDIEGDEEK 470


>gnl|CDD|239583 cd03506, Delta6-FADS-like, The Delta6 Fatty Acid Desaturase
           (Delta6-FADS)-like CD includes the integral-membrane
           enzymes: delta-4, delta-5, delta-6, delta-8,
           delta-8-sphingolipid, and delta-11 desaturases found in
           vertebrates, higher plants, fungi, and bacteria. These
           desaturases are required for the synthesis of highly
           unsaturated fatty acids (HUFAs), which are mainly
           esterified into phospholipids and contribute to
           maintaining membrane fluidity. While HUFAs may be
           required for cold tolerance in bacteria, plants and
           fish, the primary role of HUFAs in mammals is cell
           signaling. These enzymes are described as front-end
           desaturases because they introduce a double bond between
           the pre-exiting double bond and the carboxyl (front) end
           of the fatty acid. Various substrates are involved, with
           both acyl-coenzyme A (CoA) and acyl-lipid desaturases
           present in this CD. Acyl-lipid desaturases are localized
           in the membranes of cyanobacterial thylakoid, plant
           endoplasmic reticulum (ER), and plastid; and acyl-CoA
           desaturases are present in ER membrane. ER-bound plant
           acyl-lipid desaturases and acyl-CoA desaturases require
           cytochrome b5 as an electron donor. Most of the
           eukaryotic desaturase domains have an adjacent
           N-terminal cytochrome b5-like domain. This domain family
           has extensive hydrophobic regions that would be capable
           of spanning the membrane bilayer at least twice.
           Comparison of sequences also reveals the existence of
           three regions of conserved histidine cluster motifs that
           contain the residues: HXXXH, HXX(X)HH, and Q/HXXHH.
           These histidine residues are reported to be
           catalytically essential and proposed to be the ligands
           for the iron atoms contained within the homolog,
           stearoyl CoA desaturase.
          Length = 204

 Score = 28.0 bits (63), Expect = 5.9
 Identities = 11/42 (26%), Positives = 15/42 (35%), Gaps = 3/42 (7%)

Query: 67  KHFGFVNLH---PDIFGLVPRLDVITMNIEWQKKYKWVRFDH 105
            H  + N+    PDI  L           + QKK    R+ H
Sbjct: 57  VHHAYTNILGHDPDIDTLPLLARSEPAFGKDQKKRFLHRYQH 98


>gnl|CDD|113290 pfam04514, BTV_NS2, Bluetongue virus non-structural protein NS2.
           This family includes NS2 proteins from other members of
           the Orbivirus genus. NS2 is a non-specific
           single-stranded RNA-binding protein that forms large
           homomultimers and accumulates in viral inclusion bodies
           of infected cells. Three RNA binding regions have been
           identified in Bluetongue virus serotype 17 at residues
           2-11, 153-166 and 274-286. NS2 multimers also possess
           nucleotidyl phosphatase activity. The precise function
           of NS2 is not known, but it may be involved in the
           transport and condensation of viral mRNAs.
          Length = 363

 Score = 28.3 bits (63), Expect = 6.0
 Identities = 12/41 (29%), Positives = 22/41 (53%)

Query: 128 RLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
            L+ +RR      + EE  E + ++E+E  E+ E+ E + E
Sbjct: 234 NLEERRRDLRKEQELEENVERDSDDEDEHGEDSEDGETKPE 274


>gnl|CDD|224304 COG1386, scpB, Chromosome segregation and condensation protein B
           [DNA replication, recombination and repair].
          Length = 184

 Score = 27.7 bits (62), Expect = 6.0
 Identities = 16/55 (29%), Positives = 23/55 (41%), Gaps = 9/55 (16%)

Query: 123 KGLIE---RLDGKRRPRLY--TGKGEE----KEEEEEEEEEEEEEEEEEEEEEEE 168
           +GLI    R D   RP LY  T K  +       +E  + EE ++     EE+  
Sbjct: 130 RGLIREVGRRDTPGRPYLYGTTEKFLDYFGLDSLDELPDLEELKDAGLLSEEDLL 184


>gnl|CDD|219069 pfam06512, Na_trans_assoc, Sodium ion transport-associated.
           Members of this family contain a region found
           exclusively in eukaryotic sodium channels or their
           subunits, many of which are voltage-gated. Members very
           often also contain between one and four copies of
           pfam00520 and, less often, one copy of pfam00612.
          Length = 230

 Score = 27.7 bits (62), Expect = 6.1
 Identities = 14/22 (63%), Positives = 16/22 (72%)

Query: 144 EKEEEEEEEEEEEEEEEEEEEE 165
           E  EEEEEE  EEEEE +E E+
Sbjct: 160 EPPEEEEEEIAEEEEEVKEPED 181



 Score = 27.7 bits (62), Expect = 7.1
 Identities = 13/24 (54%), Positives = 17/24 (70%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEE 166
           + +  EEEEEE  EEEEE +E E+
Sbjct: 158 DLEPPEEEEEEIAEEEEEVKEPED 181



 Score = 27.3 bits (61), Expect = 8.1
 Identities = 14/23 (60%), Positives = 16/23 (69%)

Query: 146 EEEEEEEEEEEEEEEEEEEEEEE 168
           + E  EEEEEE  EEEEE +E E
Sbjct: 158 DLEPPEEEEEEIAEEEEEVKEPE 180



 Score = 27.3 bits (61), Expect = 8.9
 Identities = 14/22 (63%), Positives = 16/22 (72%)

Query: 146 EEEEEEEEEEEEEEEEEEEEEE 167
           E  EEEEEE  EEEEE +E E+
Sbjct: 160 EPPEEEEEEIAEEEEEVKEPED 181



 Score = 27.3 bits (61), Expect = 8.9
 Identities = 14/22 (63%), Positives = 16/22 (72%)

Query: 147 EEEEEEEEEEEEEEEEEEEEEE 168
           E  EEEEEE  EEEEE +E E+
Sbjct: 160 EPPEEEEEEIAEEEEEVKEPED 181


>gnl|CDD|224798 COG1886, FliN, Flagellar motor switch/type III secretory pathway
           protein [Cell motility and secretion / Intracellular
           trafficking and secretion].
          Length = 136

 Score = 27.4 bits (61), Expect = 6.2
 Identities = 12/82 (14%), Positives = 22/82 (26%), Gaps = 7/82 (8%)

Query: 119 PWPQKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPCGL 178
                 LI     +  P           +  +E    E E  E    E    +       
Sbjct: 12  EVGFDLLIGLGWAQAVPGSEATADSLLYKSVKEVAFAEVELTESTVLESLNESIDL---- 67

Query: 179 CRILPVMVHIVVGL-TSTLSIK 199
             +L + V + V L  + + + 
Sbjct: 68  --LLDIPVRLSVELGRTKMPLG 87


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 28.6 bits (63), Expect = 6.2
 Identities = 10/29 (34%), Positives = 19/29 (65%)

Query: 141  KGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
            K EE ++ EE+E++  E  ++E EE ++ 
Sbjct: 1676 KAEEAKKAEEDEKKAAEALKKEAEEAKKA 1704



 Score = 28.6 bits (63), Expect = 6.5
 Identities = 12/29 (41%), Positives = 24/29 (82%)

Query: 141  KGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
            K EE ++E EE++++ EE +++EEE+++I
Sbjct: 1731 KAEEAKKEAEEDKKKAEEAKKDEEEKKKI 1759



 Score = 28.2 bits (62), Expect = 8.2
 Identities = 15/62 (24%), Positives = 30/62 (48%)

Query: 107  KVTNEVERTGRKPWPQKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEE 166
            +   +VE+  +K   +K   E L             E K+ EE++++ EE ++ EE+E++
Sbjct: 1630 EEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKK 1689

Query: 167  EE 168
              
Sbjct: 1690 AA 1691



 Score = 28.2 bits (62), Expect = 8.5
 Identities = 9/25 (36%), Positives = 19/25 (76%)

Query: 144  EKEEEEEEEEEEEEEEEEEEEEEEE 168
            +K EE++++ EE ++ EE+E++  E
Sbjct: 1668 KKAEEDKKKAEEAKKAEEDEKKAAE 1692


>gnl|CDD|225029 COG2118, COG2118, DNA-binding protein [General function prediction
           only].
          Length = 116

 Score = 26.9 bits (60), Expect = 6.2
 Identities = 16/49 (32%), Positives = 28/49 (57%)

Query: 127 ERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
           E L+  RR +L   + + K EE+ E +EE+  +EEE +++  +   L P
Sbjct: 4   EELEEIRRRKLAELQRQAKLEEQREAQEEQARQEEEAQKQAILRQILTP 52


>gnl|CDD|240228 PTZ00010, PTZ00010, tubulin beta chain; Provisional.
          Length = 445

 Score = 28.2 bits (63), Expect = 6.3
 Identities = 16/48 (33%), Positives = 21/48 (43%), Gaps = 14/48 (29%)

Query: 138 YTGKG-EEKEEEEEE-------------EEEEEEEEEEEEEEEEEITA 171
           YTG+G +E E  E E             ++   EEE E +EEEE    
Sbjct: 398 YTGEGMDEMEFTEAESNMNDLVSEYQQYQDATVEEEGEFDEEEEAYEI 445


>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
           chain; Provisional.
          Length = 1033

 Score = 28.2 bits (63), Expect = 6.4
 Identities = 7/27 (25%), Positives = 13/27 (48%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
             +  EEE+EEE E        + +++
Sbjct: 6   NTQANEEEDEEELEAVARSAGSDSDDD 32



 Score = 28.2 bits (63), Expect = 7.2
 Identities = 10/27 (37%), Positives = 15/27 (55%)

Query: 149 EEEEEEEEEEEEEEEEEEEEITATLNP 175
           EEE+   +  EEE+EEE E +  +   
Sbjct: 1   EEEQVNTQANEEEDEEELEAVARSAGS 27


>gnl|CDD|220063 pfam08895, DUF1840, Domain of unknown function (DUF1840).  This
           family of proteins are functionally uncharacterized.
          Length = 105

 Score = 26.8 bits (60), Expect = 6.4
 Identities = 8/27 (29%), Positives = 16/27 (59%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
             +E       E+E+++E+EE+EE  +
Sbjct: 54  AAEEARRAPPAEDEDDDEDEEDEEPAV 80



 Score = 26.4 bits (59), Expect = 9.7
 Identities = 10/32 (31%), Positives = 17/32 (53%)

Query: 136 RLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
           RL      E+       E+E+++E+EE+EE  
Sbjct: 48  RLEAAIAAEEARRAPPAEDEDDDEDEEDEEPA 79


>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1.  All
           proteins in this family for which functions are known
           are cyclin dependent protein kinases that are components
           of TFIIH, a complex that is involved in nucleotide
           excision repair and transcription initiation. Also known
           as MAT1 (menage a trois 1). This family is based on the
           phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
           Stanford University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 309

 Score = 27.8 bits (62), Expect = 6.4
 Identities = 12/27 (44%), Positives = 19/27 (70%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
           EE  E E+EEEE+     ++EEEE+++
Sbjct: 148 EEALEFEKEEEEQRRLLLQKEEEEQQM 174


>gnl|CDD|235224 PRK04140, PRK04140, hypothetical protein; Provisional.
          Length = 317

 Score = 27.9 bits (63), Expect = 6.4
 Identities = 10/31 (32%), Positives = 16/31 (51%)

Query: 148 EEEEEEEEEEEEEEEEEEEEEITATLNPCGL 178
           E+  ++E   E+ EE+  E+EI   L   G 
Sbjct: 190 EKVRDDEVTPEDPEEDPFEKEILNVLTRLGF 220


>gnl|CDD|221805 pfam12848, ABC_tran_2, ABC transporter.  This domain is related to
           pfam00005.
          Length = 85

 Score = 26.4 bits (59), Expect = 6.8
 Identities = 10/26 (38%), Positives = 20/26 (76%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           E+KEE  E++E+  E++++E ++ EE
Sbjct: 20  EQKEERLEQQEKAYEKQQKEIKKLEE 45


>gnl|CDD|218590 pfam05438, TRH, Thyrotropin-releasing hormone (TRH).  This family
           consists of several thyrotropin-releasing hormone (TRH)
           proteins. Thyrotropin-Releasing Hormone (TRH;
           pyroGlu-His-Pro-NH2), originally isolated as a
           hypothalamic neuropeptide hormone, most likely acts also
           as a neuromodulator and/or neurotransmitter in the
           central nervous system (CNS). This interpretation is
           supported by the identification of a peptidase localised
           on the surface of neuronal cells which has been termed
           TRH-degrading ectoenzyme (TRH-DE) since it selectively
           inactivates TRH. TRH has been used clinically for the
           treatment of spinocerebellar degeneration and
           disturbance of consciousness in humans.
          Length = 210

 Score = 27.6 bits (61), Expect = 6.8
 Identities = 19/85 (22%), Positives = 31/85 (36%)

Query: 93  EWQKKYKWVRFDHTKVTNEVERTGRKPWPQKGLIERLDGKRRPRLYTGKGEEKEEEEEEE 152
           EW       +  H      +E     PW    L +R    +R      K +     E EE
Sbjct: 114 EWDSDAGQQKRQHPGKRLSLEHMLEDPWLLSELSKRQHPGKRYLDPLHKRQHPGRRELEE 173

Query: 153 EEEEEEEEEEEEEEEEITATLNPCG 177
           E++ + +E E+ +     A  +PC 
Sbjct: 174 EDDGDSKELEKRQHPGKRALGSPCE 198


>gnl|CDD|222117 pfam13422, DUF4110, Domain of unknown function (DUF4110).  This is
           a family that is found predominantly at the C-terminus
           of Kelch-containing proteins. However, the exact
           function of this region is not known.
          Length = 99

 Score = 26.5 bits (59), Expect = 6.9
 Identities = 12/27 (44%), Positives = 16/27 (59%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
           EE+  E  EE    E+EEEE+   EE+
Sbjct: 56  EERYWELREELRALEDEEEEQAGIEEV 82



 Score = 26.2 bits (58), Expect = 9.3
 Identities = 10/23 (43%), Positives = 13/23 (56%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEE 165
           E +EE    E+EEEE+   EE  
Sbjct: 61  ELREELRALEDEEEEQAGIEEVV 83



 Score = 26.2 bits (58), Expect = 9.6
 Identities = 10/25 (40%), Positives = 13/25 (52%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEE 167
           EE    E+EEEE+   EE     +E
Sbjct: 64  EELRALEDEEEEQAGIEEVVSLAKE 88


>gnl|CDD|222508 pfam14033, DUF4246, Protein of unknown function (DUF4246).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria and fungi.
           Proteins in this family are typically between 392 and
           644 amino acids in length.
          Length = 496

 Score = 28.0 bits (63), Expect = 6.9
 Identities = 11/36 (30%), Positives = 18/36 (50%)

Query: 131 GKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEE 166
           G    R+     E   +E+E EE + E  E+E+ E+
Sbjct: 250 GHEPLRIEYYGVEYHPKEKEPEEPKPEPGEDEDYED 285


>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6.  The surfeit locus
           protein SURF-6 is shown to be a component of the
           nucleolar matrix and has a strong binding capacity for
           nucleic acids.
          Length = 206

 Score = 27.7 bits (62), Expect = 6.9
 Identities = 20/108 (18%), Positives = 43/108 (39%), Gaps = 13/108 (12%)

Query: 126 IERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPCGLCRILPVM 185
              L+ +RR R    + + +++++ +E +++E+ ++ E EE +     +      I    
Sbjct: 5   EALLEQRRRKR---EQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPIENAE 61

Query: 186 VHIVVGLTSTLSIKLMQD-----DLHIVKDLVLPVDDPKYLENLVEER 228
            +IV        ++         DL + K       D K L   +E R
Sbjct: 62  GNIVFS-----KVEFADGEQAKKDLKLKKKKKKKKTDYKQLLKKLEAR 104


>gnl|CDD|220600 pfam10147, CR6_interact, Growth arrest and DNA-damage-inducible
           proteins-interacting protein 1.  Members of this family
           of proteins act as negative regulators of G1 to S cell
           cycle phase progression by inhibiting cyclin-dependent
           kinases. Inhibitory effects are additive with GADD45
           proteins but occur also in the absence of GADD45
           proteins. Furthermore, they act as a repressor of the
           orphan nuclear receptor NR4A1 by inhibiting AB
           domain-mediated transcriptional activity.
          Length = 217

 Score = 27.9 bits (62), Expect = 7.0
 Identities = 12/60 (20%), Positives = 27/60 (45%), Gaps = 3/60 (5%)

Query: 117 RKPWPQKGLIERLDGKRR---PRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATL 173
            + +P    +E L+ + R   P L     E +E+++E+E   +  E E  +   ++   +
Sbjct: 78  EELFPSAEKLEELEEEEREWYPSLNQMLEENREQQKEKEARRQAREAEIAKNMAKMPQMI 137


>gnl|CDD|182633 PRK10669, PRK10669, putative cation:proton antiport protein;
           Provisional.
          Length = 558

 Score = 28.1 bits (63), Expect = 7.2
 Identities = 19/64 (29%), Positives = 30/64 (46%), Gaps = 15/64 (23%)

Query: 149 EEEEEEEEEEEEEEEEEEEEITATLNPCGLCRILPVMVH-IVVG---LTSTLSIKLMQDD 204
            + E  EE+  EE  EEE++I     P  +C       H ++VG   + S L  KL+   
Sbjct: 392 AKTETLEEQTLEEAIEEEKQI-----PVDICN------HALLVGYGRVGSLLGEKLLAAG 440

Query: 205 LHIV 208
           + +V
Sbjct: 441 IPLV 444


>gnl|CDD|234338 TIGR03742, PRTRC_F, PRTRC system protein F.  A novel genetic system
           characterized by seven (usually) major proteins,
           including a ParB homolog and a ThiF homolog, is commonly
           found on plasmids or in bacterial chromosomal regions
           near phage, plasmid, or transposon markers. It is most
           common among the beta Proteobacteria. We designate the
           system PRTRC, or ParB-Related,ThiF-Related Cassette.
           This protein family is designated protein F. It is the
           most divergent of the families.
          Length = 342

 Score = 27.8 bits (62), Expect = 7.2
 Identities = 11/34 (32%), Positives = 16/34 (47%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
           GE  EEE  EE  +E++E+ E      +   L  
Sbjct: 165 GETDEEEALEELCDEDDEDREAYLPSVVEQALLE 198


>gnl|CDD|200340 TIGR03927, T7SS_EssA_Firm, type VII secretion protein EssA.
           Members of this family are associated with type VII
           secretion of WXG100 family targets in the Firmicutes,
           but not in the Actinobacteria. This highly divergent
           protein family consists largely of a central region of
           highly polar low-complexity sequence containing
           occasional LF motifs in weak repeats about 17 residues
           in length, flanked by hydrophobic N- and C-terminal
           regions [Protein fate, Protein and peptide secretion and
           trafficking].
          Length = 150

 Score = 27.3 bits (61), Expect = 7.4
 Identities = 11/35 (31%), Positives = 18/35 (51%)

Query: 138 YTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
           Y  K  E   +  +EE E ++E    EE+++IT  
Sbjct: 31  YEKKDIEINTDYLQEETELDKELFTPEEQKKITFQ 65


>gnl|CDD|218049 pfam04373, DUF511, Protein of unknown function (DUF511).  Bacterial
           protein of unknown function.
          Length = 310

 Score = 27.7 bits (62), Expect = 7.5
 Identities = 13/36 (36%), Positives = 20/36 (55%)

Query: 131 GKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEE 166
            ++ PR Y  K  E E E +  +E + EE+EE+ E 
Sbjct: 64  QEKPPRRYYLKSREDELELKALDEIKSEEDEEQSEA 99


>gnl|CDD|191022 pfam04538, BEX, Brain expressed X-linked like family.  This is a
           family of transcription elongation factors which
           includes those referred to as Bex proteins as well as
           those named TCEAL7. Bex1 was shown to be a novel link
           between neurotrophin signalling, the cell cycle, and
           neuronal differentiation, suggesting it might function
           by coordinating internal cellular states with the
           ability of cells to respond to external signals. TCEAL7
           has been shown negatively to regulate the NF-kappaB
           pathway, hence being important in ovarian cancer as it
           one of the genes frequently downregulated in this
           cancer. A closely related protein, TFIIS/TCEA, found in
           pfam07500 is involved in transcription elongation and
           transcript fidelity. TFIIS/TCEA promotes 3'
           endoribonuclease activity of RNA polymerase II (pol II)
           and allows pol II to bypass transcript pause or 'arrest'
           during elongation process. It is thus possible that BEX
           is also acting in this way.
          Length = 97

 Score = 26.5 bits (59), Expect = 7.6
 Identities = 11/25 (44%), Positives = 15/25 (60%)

Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEE 168
           E E + E E +EEEE+   EE E +
Sbjct: 7   ENEGKPESEPKEEEEKRPLEEGEGK 31



 Score = 26.2 bits (58), Expect = 8.5
 Identities = 12/26 (46%), Positives = 16/26 (61%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
           E K E E +EEEE+   EE E ++ E
Sbjct: 9   EGKPESEPKEEEEKRPLEEGEGKKPE 34


>gnl|CDD|236081 PRK07735, PRK07735, NADH dehydrogenase subunit C; Validated.
          Length = 430

 Score = 28.0 bits (62), Expect = 7.6
 Identities = 13/52 (25%), Positives = 21/52 (40%)

Query: 117 RKPWPQKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
                +K    R   + R RL    G E  + EEE  E+E+   + ++   E
Sbjct: 6   DLEDLKKEAARRAKEEARKRLVAKHGAEISKLEEENREKEKALPKNDDMTIE 57


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 28.0 bits (62), Expect = 7.6
 Identities = 9/34 (26%), Positives = 20/34 (58%), Gaps = 5/34 (14%)

Query: 143  EEKEEEEEEEEEEEE-----EEEEEEEEEEEITA 171
            E+K+ +E+E+EEE       ++E + + +E  + 
Sbjct: 3946 EDKDRQEKEDEEEMSDDVGIDDEIQPDIQENNSQ 3979


>gnl|CDD|218435 pfam05104, Rib_recp_KP_reg, Ribosome receptor lysine/proline rich
           region.  This highly conserved region is found towards
           the C-terminus of the transmembrane domain. The function
           is unclear.
          Length = 151

 Score = 27.3 bits (60), Expect = 7.8
 Identities = 17/63 (26%), Positives = 32/63 (50%)

Query: 127 ERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPCGLCRILPVMV 186
           ++ D K++ ++   KG+ K++EE+   +  E E  +E  E E+     P     + PV V
Sbjct: 23  QKSDKKKKEKVSEKKGKSKKKEEKPNGKIPEHEPNQEVTEVEVIIEKEPVPAVAVAPVPV 82

Query: 187 HIV 189
            +V
Sbjct: 83  AVV 85


>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
           [Transcription / DNA replication, recombination, and
           repair / Chromatin structure and dynamics].
          Length = 1001

 Score = 28.1 bits (62), Expect = 7.9
 Identities = 11/34 (32%), Positives = 18/34 (52%)

Query: 135 PRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
               T + EE +E  E+  E+E E +  +EE+ E
Sbjct: 948 ESDETDEDEESDESSEDLSEDESENDSSDEEDGE 981


>gnl|CDD|217952 pfam04189, Gcd10p, Gcd10p family.  eIF-3 is a multi-subunit complex
           that stimulates translation initiation in vitro at
           several different steps. This family corresponds to the
           gamma subunit if eIF3. The Yeast protein Gcd10p has also
           been shown to be part of a complex with the
           methyltransferase Gcd14p that is involved in modifying
           tRNA.
          Length = 299

 Score = 27.6 bits (62), Expect = 7.9
 Identities = 11/27 (40%), Positives = 12/27 (44%)

Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
            E   EE EEEE E   EEE    +  
Sbjct: 273 FEPPAEEIEEEEPEVLPEEELNTMKPS 299


>gnl|CDD|218744 pfam05781, MRVI1, MRVI1 protein.  This family consists of mammalian
           MRVI1 proteins which are related to the
           lymphoid-restricted membrane protein (JAW1) and the IP3
           receptor associated cGMP kinase substrates A and B
           (IRAGA and IRAGB). The function of MRVI1 is unknown
           although mutations in the Mrvi1 gene induces myeloid
           leukaemia by altering the expression of a gene important
           for myeloid cell growth and/or differentiation so it has
           been speculated that Mrvi1 is a tumour suppressor gene.
           IRAG is very similar in sequence to MRVI1 and is an
           essential NO/cGKI-dependent regulator of IP3-induced
           calcium release. Activation of cGKI decreases
           IP3-stimulated elevations in intracellular calcium,
           induces smooth muscle relaxation and contributes to the
           antiproliferative and pro-apoptotic effects of NO/cGMP.
           Jaw1 is a member of a class of proteins with
           COOH-terminal hydrophobic membrane anchors and is
           structurally similar to proteins involved in vesicle
           targeting and fusion. This suggests that the function
           and/or the structure of the ER in lymphocytes may be
           modified by lymphoid-restricted resident ER proteins.
          Length = 538

 Score = 28.1 bits (62), Expect = 7.9
 Identities = 15/67 (22%), Positives = 28/67 (41%), Gaps = 9/67 (13%)

Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPCG---------LCRILPVMVHIVVG 191
           +GEE  E   +    EE  E EE ++E+ ++ L               + L +M   ++ 
Sbjct: 430 EGEEAVERTRKPSLSEEVAETEEWDKEQRSSKLASWVTFLQVSYRKANQALWLMAAAILL 489

Query: 192 LTSTLSI 198
           L + +S 
Sbjct: 490 LAALMSF 496


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
           found at the N terminus of SMC proteins. The SMC
           (structural maintenance of chromosomes) superfamily
           proteins have ATP-binding domains at the N- and
           C-termini, and two extended coiled-coil domains
           separated by a hinge in the middle. The eukaryotic SMC
           proteins form two kind of heterodimers: the SMC1/SMC3
           and the SMC2/SMC4 types. These heterodimers constitute
           an essential part of higher order complexes, which are
           involved in chromatin and DNA dynamics. This family also
           includes the RecF and RecN proteins that are involved in
           DNA metabolism and recombination.
          Length = 1162

 Score = 28.0 bits (62), Expect = 8.0
 Identities = 13/44 (29%), Positives = 23/44 (52%)

Query: 125 LIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
            I+ L    R      +  ++E E+EEE   +  +E +EEE+E+
Sbjct: 239 RIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEK 282


>gnl|CDD|236876 PRK11191, PRK11191, RNase E inhibitor protein; Provisional.
          Length = 138

 Score = 26.8 bits (60), Expect = 8.1
 Identities = 7/30 (23%), Positives = 16/30 (53%)

Query: 138 YTGKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
           Y G G   E+   E+ ++E+  +E+++   
Sbjct: 108 YDGWGTYFEDPNAEDGDDEDFVDEDDDGVR 137


>gnl|CDD|225752 COG3211, PhoX, Predicted phosphatase [General function prediction
           only].
          Length = 616

 Score = 27.9 bits (62), Expect = 8.1
 Identities = 19/90 (21%), Positives = 28/90 (31%), Gaps = 7/90 (7%)

Query: 29  FNDFTSNKSEILHETKEFQNAFLQPRQAWVENLDTTEEKHFGF-VNLHPDIFGLVPRLDV 87
           F+ +   + E L             R        T  +    F     P+ FG V  +D 
Sbjct: 241 FDGYFGGEDEELEPE------DDLKRYGIPGGDYTWGKFDARFDYVKEPNRFGWVVEIDP 294

Query: 88  ITMNIEWQKKYKWVRFDHTKVTNEVERTGR 117
              N    K+    RF H     +V + GR
Sbjct: 295 NDPNSTPVKRTALGRFKHEAAAVDVNKDGR 324


>gnl|CDD|236498 PRK09401, PRK09401, reverse gyrase; Reviewed.
          Length = 1176

 Score = 28.0 bits (63), Expect = 8.3
 Identities = 16/38 (42%), Positives = 20/38 (52%), Gaps = 5/38 (13%)

Query: 138 YTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
             G+  EKEE E+      E E+ EE+EEE     LNP
Sbjct: 849 LEGEFSEKEEAEKFYNNLIEVEKVEEKEEE-----LNP 881


>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3.  This family of
           proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). SH3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of SH3, AAPs are retained in the ER.
          Length = 196

 Score = 27.3 bits (61), Expect = 8.3
 Identities = 11/31 (35%), Positives = 13/31 (41%)

Query: 138 YTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
           Y    + K  EE   EE E     +EEE  E
Sbjct: 156 YAEWKDAKLLEEFAAEEAEAAAAAKEEESAE 186


>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
           RPA34.5.  This is a family of proteins conserved from
           yeasts to human. Subunit A34.5 of RNA polymerase I is a
           non-essential subunit which is thought to help Pol I
           overcome topological constraints imposed on ribosomal
           DNA during the process of transcription.
          Length = 193

 Score = 27.4 bits (61), Expect = 8.4
 Identities = 20/61 (32%), Positives = 23/61 (37%), Gaps = 10/61 (16%)

Query: 114 RTGRKPWPQ-KGLIERLDGKRRP-----RLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
              R P PQ KGL  R      P               E E  E+E   + E+E E EEE
Sbjct: 97  VVPRPPVPQPKGLKMRF----FPTGYGAPDGPPSELGSESETSEKETTAKVEKEAEVEEE 152

Query: 168 E 168
           E
Sbjct: 153 E 153


>gnl|CDD|224035 COG1110, COG1110, Reverse gyrase [DNA replication, recombination,
           and repair].
          Length = 1187

 Score = 28.1 bits (63), Expect = 8.5
 Identities = 12/43 (27%), Positives = 17/43 (39%), Gaps = 8/43 (18%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI--------TATLNPCG 177
           +   E+  E   EE  E E E E++          +AT  P G
Sbjct: 248 KLYGEKRAERVREELREVEREREKKRRKLGILVVSSATGKPRG 290


>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413).  This is
           a family of proteins conserved in fungi. The function is
           not known.
          Length = 436

 Score = 27.8 bits (62), Expect = 8.5
 Identities = 16/67 (23%), Positives = 19/67 (28%), Gaps = 3/67 (4%)

Query: 106 TKVTNEVERTGRKPWPQKGLIERLDGKRR---PRLYTGKGEEKEEEEEEEEEEEEEEEEE 162
            K   E  R G    P+K                    K  E       + +    E EE
Sbjct: 50  PKKPKEASRPGTPRNPKKSSKPTESSAASSEEKPAKPRKSAESTRSSHPKSKAPSTESEE 109

Query: 163 EEEEEEI 169
           EEE EE 
Sbjct: 110 EEEPEET 116


>gnl|CDD|219405 pfam07418, PCEMA1, Acidic phosphoprotein precursor PCEMA1.  This
           family consists of several acidic phosphoprotein
           precursor PCEMA1 sequences which appear to be found
           exclusively in Plasmodium chabaudi. PCEMA1 is an antigen
           that is associated with the membrane of the infected
           erythrocyte throughout the entire intraerythrocytic
           cycle. The exact function of this family is unclear.
          Length = 286

 Score = 27.6 bits (61), Expect = 8.7
 Identities = 14/34 (41%), Positives = 17/34 (50%)

Query: 136 RLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
           R Y     +K    E E E+E E+E EEE  EE 
Sbjct: 245 REYREWKGDKANTNETEIEDESEDEYEEEAGEEQ 278


>gnl|CDD|227225 COG4888, COG4888, Uncharacterized Zn ribbon-containing protein
           [General function prediction only].
          Length = 104

 Score = 26.3 bits (58), Expect = 8.7
 Identities = 8/23 (34%), Positives = 13/23 (56%)

Query: 140 GKGEEKEEEEEEEEEEEEEEEEE 162
           G+G  +   +EE ++E E   EE
Sbjct: 80  GRGCGESGTDEENDQEIESPGEE 102


>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen.  This
           family consists of several Theileria P67 surface
           antigens. A stage specific surface antigen of Theileria
           parva, p67, is the basis for the development of an
           anti-sporozoite vaccine for the control of East Coast
           fever (ECF) in cattle. The antigen has been shown to
           contain five distinct linear peptide sequences
           recognised by sporozoite-neutralising murine monoclonal
           antibodies.
          Length = 727

 Score = 27.7 bits (61), Expect = 9.1
 Identities = 9/41 (21%), Positives = 25/41 (60%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPCGLCRILP 183
           E+ ++ +  + + EE++++ EEE+ + T++ +  G  +  P
Sbjct: 104 EQNQDTKGSKTDSEEDDDDSEEEDNKSTSSKDGKGSKKTQP 144


>gnl|CDD|240578 cd12951, RRP7_Rrp7A, RRP7 domain ribosomal RNA-processing protein 7
           homolog A (Rrp7A) and similar proteins.  The family
           corresponds to the RRP7 domain of Rrp7A, also termed
           gastric cancer antigen Zg14, and similar proteins which
           are yeast ribosomal RNA-processing protein 7 (Rrp7p)
           homologs mainly found in Metazoans. The cellular
           function of Rrp7A remains unclear currently. Rrp7A
           harbors an N-terminal RNA recognition motif (RRM), also
           termed RBD (RNA binding domain) or RNP
           (ribonucleoprotein domain), and a C-terminal RRP7
           domain.
          Length = 129

 Score = 26.5 bits (59), Expect = 9.3
 Identities = 13/23 (56%), Positives = 20/23 (86%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEE 165
           EE ++EEEEE+EE+E+E E +E+
Sbjct: 28  EEYDKEEEEEKEEKEKEAEPDED 50


>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
           Genome duplication is precisely regulated by
           cyclin-dependent kinases CDKs, which bring about the
           onset of S phase by activating replication origins and
           then prevent relicensing of origins until mitosis is
           completed. The optimum sequence motif for CDK
           phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
           to have at least 11 potential phosphorylation sites.
           Drc1 is required for DNA synthesis and S-M replication
           checkpoint control. Drc1 associates with Cdc2 and is
           phosphorylated at the onset of S phase when Cdc2 is
           activated. Thus Cdc2 promotes DNA replication by
           phosphorylating Drc1 and regulating its association with
           Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
           substrates required for DNA replication.
          Length = 397

 Score = 27.5 bits (61), Expect = 9.7
 Identities = 14/45 (31%), Positives = 19/45 (42%), Gaps = 8/45 (17%)

Query: 134 RPRLYTGKG--------EEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
           RP+   GKG        E  ++E +   E E EE      EEE+ 
Sbjct: 227 RPQRPVGKGLSELVQEEESIDDELDVLREIEAEEAGIGPIEEEVV 271


>gnl|CDD|218556 pfam05327, RRN3, RNA polymerase I specific transcription initiation
           factor RRN3.  This family consists of several eukaryotic
           proteins which are homologous to the yeast RRN3 protein.
           RRN3 is one of the RRN genes specifically required for
           the transcription of rDNA by RNA polymerase I (Pol I) in
           Saccharomyces cerevisiae.
          Length = 554

 Score = 27.7 bits (62), Expect = 9.8
 Identities = 7/31 (22%), Positives = 19/31 (61%)

Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
           E ++ ++EEEE    +E++++E++      +
Sbjct: 221 ELDDIDDEEEERVLADEDDDDEDDMFDMDDD 251


>gnl|CDD|221473 pfam12230, PRP21_like_P, Pre-mRNA splicing factor PRP21 like
           protein.  This domain family is found in eukaryotes, and
           is typically between 212 and 238 amino acids in length.
           The family is found in association with pfam01805. There
           are two completely conserved residues (W and H) that may
           be functionally important. PRP21 is required for
           assembly of the prespliceosome and it interacts with U2
           snRNP and/or pre-mRNA in the prespliceosome. This family
           also contains proteins similar to PRP21, such as the
           mammalian SF3a. SF3a also interacts with U2 snRNP from
           the prespliceosome, converting it to its active form.
          Length = 230

 Score = 27.4 bits (61), Expect = 9.9
 Identities = 15/33 (45%), Positives = 20/33 (60%)

Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
           +     EEEEE EEEEE+ +EE++EE      P
Sbjct: 100 DAAAAPEEEEEMEEEEEDIDEEKKEESAKKSTP 132


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.317    0.137    0.406 

Gapped
Lambda     K      H
   0.267   0.0812    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 16,641,512
Number of extensions: 1677397
Number of successful extensions: 20188
Number of sequences better than 10.0: 1
Number of HSP's gapped: 13607
Number of HSP's successfully gapped: 1935
Length of query: 309
Length of database: 10,937,602
Length adjustment: 97
Effective length of query: 212
Effective length of database: 6,635,264
Effective search space: 1406675968
Effective search space used: 1406675968
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 59 (26.3 bits)