RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy6875
         (201 letters)



>gnl|CDD|215754 pfam00153, Mito_carr, Mitochondrial carrier protein. 
          Length = 96

 Score = 75.4 bits (186), Expect = 4e-18
 Identities = 23/67 (34%), Positives = 33/67 (49%), Gaps = 4/67 (5%)

Query: 136 WWRHLVAGGVAGAVSRTCTAPLDRLKVYLQ----VHGNKTTGIKKCLLHLLHEGGFLSLW 191
           +   L+AGG+AGA++ T T PLD +K  LQ        K  GI  C   +  E G   L+
Sbjct: 5   FLASLLAGGIAGAIAATVTYPLDVVKTRLQSSAAGGSRKYKGILDCFKKIYKEEGIRGLY 64

Query: 192 RGNGSKI 198
           +G    +
Sbjct: 65  KGLLPNL 71



 Score = 67.7 bits (166), Expect = 4e-15
 Identities = 18/59 (30%), Positives = 28/59 (47%)

Query: 41 GVHGNKTTGIKKCLLHLLHEGGFLSLWRGNGINVLKIAPESALKFMAYEQAKRLIRGNY 99
               K  GI  C   +  E G   L++G   N+L++AP +A+ F  YE  K+L+    
Sbjct: 38 AGGSRKYKGILDCFKKIYKEEGIRGLYKGLLPNLLRVAPAAAIYFGTYETLKKLLLKKL 96


>gnl|CDD|240302 PTZ00169, PTZ00169, ADP/ATP transporter on adenylate translocase;
           Provisional.
          Length = 300

 Score = 54.0 bits (130), Expect = 6e-09
 Identities = 42/151 (27%), Positives = 68/151 (45%), Gaps = 33/151 (21%)

Query: 48  TGIKKCLLHLLHEGGFLSLWRGNGINVLKIAPESALKFMAYEQAKRLIRGNYTRELSIYE 107
           +GI  C   +  E G LSLWRGN  NV++  P  A  F A++   + +   Y ++   ++
Sbjct: 53  SGIVNCFRRVSKEQGVLSLWRGNTANVIRYFPTQAFNF-AFKDYFKNMFPKYNQKTDFWK 111

Query: 108 RFVAGSLAGGVSLNVPDDFTAKEMLTGMWWRHLVAGGVAGAVSRTCTAPLD----RLKVY 163
            F                             ++++GG+AGA S     PLD    RL   
Sbjct: 112 FFGV---------------------------NILSGGLAGASSLLIVYPLDFARTRLASD 144

Query: 164 LQVHGNKT-TGIKKCLLHLLHEGGFLSLWRG 193
           +   G++  TG+  CL+ +  + GFLSL++G
Sbjct: 145 IGKGGDREFTGLFDCLMKISKQTGFLSLYQG 175



 Score = 50.9 bits (122), Expect = 7e-08
 Identities = 23/64 (35%), Positives = 36/64 (56%), Gaps = 10/64 (15%)

Query: 141 VAGGVAGAVSRTCTAPLDRLKVYLQVH--------GN--KTTGIKKCLLHLLHEGGFLSL 190
           + GG++ A+S+T  AP++R+K+ +Q          G   + +GI  C   +  E G LSL
Sbjct: 12  LMGGISAAISKTAVAPIERVKMLIQTQDSIPEIKSGKVPRYSGIVNCFRRVSKEQGVLSL 71

Query: 191 WRGN 194
           WRGN
Sbjct: 72  WRGN 75



 Score = 37.4 bits (87), Expect = 0.002
 Identities = 30/88 (34%), Positives = 46/88 (52%), Gaps = 5/88 (5%)

Query: 48  TGIKKCLLHLLHEGGFLSLWRGNGINVLKIAPESALKFMAYEQAKRLIRGNYTRELSIYE 107
           TG+  CL+ +  + GFLSL++G G++V  I       F  Y+ AK L+ GN      +Y+
Sbjct: 154 TGLFDCLMKISKQTGFLSLYQGFGVSVQGIIVYRGAYFGLYDSAKALLFGNDKNTNILYK 213

Query: 108 RFVAGS---LAGGVSLNVPDDFTAKEML 132
             VA +   LAG +S   P D   + M+
Sbjct: 214 WAVAQTVTILAGLISY--PFDTVRRRMM 239


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 35.4 bits (81), Expect = 0.016
 Identities = 16/28 (57%), Positives = 24/28 (85%)

Query: 12  EEEEEKKEEEKEKEEEKKKKEGGREGEE 39
           EEEEE++EEE+E+EEE++++E   E EE
Sbjct: 865 EEEEEEEEEEEEEEEEEEEEEEEEENEE 892



 Score = 35.4 bits (81), Expect = 0.017
 Identities = 16/29 (55%), Positives = 24/29 (82%)

Query: 11  SEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
           SEEEEE++EEE+E+EEE++++E   E  E
Sbjct: 863 SEEEEEEEEEEEEEEEEEEEEEEEEEENE 891



 Score = 34.2 bits (78), Expect = 0.038
 Identities = 19/47 (40%), Positives = 30/47 (63%)

Query: 12  EEEEEKKEEEKEKEEEKKKKEGGREGEELGVHGNKTTGIKKCLLHLL 58
           EEEEE++EEE+E+EEE++++E     E L +   +T   +   L LL
Sbjct: 866 EEEEEEEEEEEEEEEEEEEEEEEENEEPLSLEWPETRQKQAIYLFLL 912



 Score = 33.0 bits (75), Expect = 0.10
 Identities = 14/26 (53%), Positives = 23/26 (88%)

Query: 7   ERKISEEEEEKKEEEKEKEEEKKKKE 32
           E +  EEEEE++EEE+E+EEE++++E
Sbjct: 864 EEEEEEEEEEEEEEEEEEEEEEEEEE 889



 Score = 32.7 bits (74), Expect = 0.12
 Identities = 14/26 (53%), Positives = 22/26 (84%)

Query: 7   ERKISEEEEEKKEEEKEKEEEKKKKE 32
           E +  EEEEE++EEE+E+EEE++ +E
Sbjct: 867 EEEEEEEEEEEEEEEEEEEEEEENEE 892



 Score = 32.3 bits (73), Expect = 0.15
 Identities = 14/26 (53%), Positives = 22/26 (84%)

Query: 7   ERKISEEEEEKKEEEKEKEEEKKKKE 32
           E +  EEEEE++EEE+E+EEE+++ E
Sbjct: 866 EEEEEEEEEEEEEEEEEEEEEEEENE 891



 Score = 31.9 bits (72), Expect = 0.21
 Identities = 14/28 (50%), Positives = 23/28 (82%)

Query: 12  EEEEEKKEEEKEKEEEKKKKEGGREGEE 39
           + EEE++EEE+E+EEE++++E   E EE
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEE 889


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 35.1 bits (81), Expect = 0.018
 Identities = 14/34 (41%), Positives = 25/34 (73%)

Query: 6  IERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
          +E+++ +EEEE+++EEK++EEEK   +     EE
Sbjct: 31 VEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEE 64



 Score = 32.0 bits (73), Expect = 0.16
 Identities = 17/39 (43%), Positives = 25/39 (64%), Gaps = 4/39 (10%)

Query: 12 EEEEEKKEEEKEKEEEKKKKEGGREGEELGVHGNKTTGI 50
          EE +E++E+E++K++ KK KE   E E L    NKT  I
Sbjct: 59 EEVDEEEEKEEKKKKTKKVKETTTEWELL----NKTKPI 93



 Score = 31.3 bits (71), Expect = 0.30
 Identities = 18/46 (39%), Positives = 27/46 (58%)

Query: 7  ERKISEEEEEKKEEEKEKEEEKKKKEGGREGEELGVHGNKTTGIKK 52
          E    EEEEEK+E+++E+E+   K+E   E EE      KT  +K+
Sbjct: 34 EVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKE 79


>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
          Length = 106

 Score = 32.6 bits (75), Expect = 0.031
 Identities = 12/26 (46%), Positives = 18/26 (69%)

Query: 12  EEEEEKKEEEKEKEEEKKKKEGGREG 37
              EEKKEEE+E+EE+++ +E    G
Sbjct: 75  AAAEEKKEEEEEEEEKEESEEEAAAG 100



 Score = 30.7 bits (70), Expect = 0.15
 Identities = 10/21 (47%), Positives = 16/21 (76%)

Query: 12 EEEEEKKEEEKEKEEEKKKKE 32
              E+K+EE+E+EEEK++ E
Sbjct: 74 AAAAEEKKEEEEEEEEKEESE 94



 Score = 29.9 bits (68), Expect = 0.34
 Identities = 8/30 (26%), Positives = 17/30 (56%)

Query: 12  EEEEEKKEEEKEKEEEKKKKEGGREGEELG 41
                ++++E+E+EEE+K++        LG
Sbjct: 73  AAAAAEEKKEEEEEEEEKEESEEEAAAGLG 102



 Score = 28.0 bits (63), Expect = 1.6
 Identities = 8/28 (28%), Positives = 16/28 (57%)

Query: 12 EEEEEKKEEEKEKEEEKKKKEGGREGEE 39
                +E+++E+EEE++K+E   E   
Sbjct: 72 AAAAAAEEKKEEEEEEEEKEESEEEAAA 99



 Score = 27.2 bits (61), Expect = 2.8
 Identities = 8/29 (27%), Positives = 15/29 (51%)

Query: 13 EEEEKKEEEKEKEEEKKKKEGGREGEELG 41
                 EEK++EEE+++++   E E   
Sbjct: 71 AAAAAAAEEKKEEEEEEEEKEESEEEAAA 99



 Score = 26.8 bits (60), Expect = 3.2
 Identities = 13/33 (39%), Positives = 18/33 (54%), Gaps = 3/33 (9%)

Query: 12  EEEEEKKEEEKEKEEEKKKKEGGREGEELGVHG 44
                  EE+KE+EEE+++KE   E EE    G
Sbjct: 71  AAAAAAAEEKKEEEEEEEEKE---ESEEEAAAG 100



 Score = 25.7 bits (57), Expect = 8.7
 Identities = 12/21 (57%), Positives = 16/21 (76%)

Query: 5  AIERKISEEEEEKKEEEKEKE 25
          A E K  EEEEE+++EE E+E
Sbjct: 76 AAEEKKEEEEEEEEKEESEEE 96


>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P.  This model
           represents the L12P protein of the large (50S) subunit
           of the archaeal ribosome.
          Length = 105

 Score = 32.3 bits (74), Expect = 0.041
 Identities = 10/30 (33%), Positives = 18/30 (60%)

Query: 12  EEEEEKKEEEKEKEEEKKKKEGGREGEELG 41
               E++EEE+E+EEE++++        LG
Sbjct: 72  AAAAEEEEEEEEEEEEEEEESEEEAMAGLG 101



 Score = 29.2 bits (66), Expect = 0.51
 Identities = 9/27 (33%), Positives = 18/27 (66%)

Query: 11 SEEEEEKKEEEKEKEEEKKKKEGGREG 37
          +     ++EEE+E+EEE++++E   E 
Sbjct: 70 AAAAAAEEEEEEEEEEEEEEEESEEEA 96



 Score = 26.9 bits (60), Expect = 3.2
 Identities = 10/27 (37%), Positives = 17/27 (62%)

Query: 18 KEEEKEKEEEKKKKEGGREGEELGVHG 44
             E+E+EEE++++E   E EE  + G
Sbjct: 73 AAAEEEEEEEEEEEEEEEESEEEAMAG 99



 Score = 26.5 bits (59), Expect = 5.0
 Identities = 9/25 (36%), Positives = 15/25 (60%)

Query: 15 EEKKEEEKEKEEEKKKKEGGREGEE 39
               EE+E+EEE++++E     EE
Sbjct: 71 AAAAAEEEEEEEEEEEEEEEESEEE 95


>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
          includes archaeal L12p, the protein that is
          functionally equivalent to L7/L12 in bacteria and the
          P1 and P2 proteins in eukaryotes. L12p is homologous to
          P1 and P2 but is not homologous to bacterial L7/L12. It
          is located in the L12 stalk, with proteins L10, L11,
          and 23S rRNA. L12p is the only protein in the ribosome
          to occur as multimers, always appearing as sets of
          dimers. Recent data indicate that most archaeal species
          contain six copies of L12p (three homodimers), while
          eukaryotes have four copies (two heterodimers), and
          bacteria may have four or six copies (two or three
          homodimers), depending on the species. The organization
          of proteins within the stalk has been characterized
          primarily in bacteria, where L7/L12 forms either two or
          three homodimers and each homodimer binds to the
          extended C-terminal helix of L10. L7/L12 is attached to
          the ribosome through L10 and is the only ribosomal
          protein that does not directly interact with rRNA.
          Archaeal L12p is believed to function in a similar
          fashion. However, hybrid ribosomes containing the large
          subunit from E. coli with an archaeal stalk are able to
          bind archaeal and eukaryotic elongation factors but not
          bacterial elongation factors. In several mesophilic and
          thermophilic archaeal species, the binding of 23S rRNA
          to protein L11 and to the L10/L12p pentameric complex
          was found to be temperature-dependent and cooperative.
          Length = 106

 Score = 31.7 bits (72), Expect = 0.071
 Identities = 15/23 (65%), Positives = 19/23 (82%)

Query: 5  AIERKISEEEEEKKEEEKEKEEE 27
          A E+   +EEE+KKEEEKE+EEE
Sbjct: 74 AEEKAEEKEEEKKKEEEKEEEEE 96



 Score = 31.7 bits (72), Expect = 0.077
 Identities = 15/31 (48%), Positives = 22/31 (70%)

Query: 3   PIAIERKISEEEEEKKEEEKEKEEEKKKKEG 33
             A E K  E+EEEKK+EE+++EEE++   G
Sbjct: 71  AAAAEEKAEEKEEEKKKEEEKEEEEEEALAG 101



 Score = 31.7 bits (72), Expect = 0.080
 Identities = 16/30 (53%), Positives = 23/30 (76%)

Query: 3  PIAIERKISEEEEEKKEEEKEKEEEKKKKE 32
          P A     +EE+ E+KEEEK+KEEEK+++E
Sbjct: 66 PAAAAAAAAEEKAEEKEEEKKKEEEKEEEE 95



 Score = 29.4 bits (66), Expect = 0.44
 Identities = 11/27 (40%), Positives = 21/27 (77%)

Query: 11  SEEEEEKKEEEKEKEEEKKKKEGGREG 37
            E+ EEK+EE+K++EE+++++E    G
Sbjct: 75  EEKAEEKEEEKKKEEEKEEEEEEALAG 101



 Score = 28.6 bits (64), Expect = 0.79
 Identities = 11/31 (35%), Positives = 22/31 (70%)

Query: 11  SEEEEEKKEEEKEKEEEKKKKEGGREGEELG 41
           + EE+ +++EE++K+EE+K++E       LG
Sbjct: 73  AAEEKAEEKEEEKKKEEEKEEEEEEALAGLG 103



 Score = 28.6 bits (64), Expect = 0.89
 Identities = 14/31 (45%), Positives = 19/31 (61%)

Query: 14  EEEKKEEEKEKEEEKKKKEGGREGEELGVHG 44
               +E+ +EKEEEKKK+E   E EE  + G
Sbjct: 71  AAAAEEKAEEKEEEKKKEEEKEEEEEEALAG 101


>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein.  This is a family of
           fungal and plant proteins and contains many hypothetical
           proteins. VID27 is a cytoplasmic protein that plays a
           potential role in vacuolar protein degradation.
          Length = 794

 Score = 33.2 bits (76), Expect = 0.082
 Identities = 22/79 (27%), Positives = 34/79 (43%)

Query: 7   ERKISEEEEEKKEEEKEKEEEKKKKEGGREGEELGVHGNKTTGIKKCLLHLLHEGGFLSL 66
                EE+EE++EEE E E   K+     E EE  V            L + ++     +
Sbjct: 388 RDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVESKYEDSDGNSSLAVGYKNDRSYV 447

Query: 67  WRGNGINVLKIAPESALKF 85
            RG+ I V K   + +L+F
Sbjct: 448 VRGDKIGVFKHTDDDSLEF 466


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 33.1 bits (75), Expect = 0.084
 Identities = 12/25 (48%), Positives = 20/25 (80%)

Query: 7   ERKISEEEEEKKEEEKEKEEEKKKK 31
           E+K  EE E +KE+EKE+E E++++
Sbjct: 593 EQKAREEREREKEKEKERERERERE 617



 Score = 31.6 bits (71), Expect = 0.26
 Identities = 17/35 (48%), Positives = 24/35 (68%), Gaps = 2/35 (5%)

Query: 5   AIERKISEEEEEKKEEEKEKEEEK-KKKEGGREGE 38
           A+E K   E E+K  EE+E+E+EK K++E  RE E
Sbjct: 584 AVE-KAKREAEQKAREEREREKEKEKERERERERE 617


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 33.2 bits (75), Expect = 0.091
 Identities = 13/36 (36%), Positives = 20/36 (55%)

Query: 5    AIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEEL 40
            A E K   +E +K  E K+K +E KK E  ++ +E 
Sbjct: 1492 AEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEA 1527



 Score = 32.8 bits (74), Expect = 0.12
 Identities = 12/32 (37%), Positives = 21/32 (65%)

Query: 8    RKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
            +K +EE ++K +E K+  E KKK +  ++ EE
Sbjct: 1489 KKKAEEAKKKADEAKKAAEAKKKADEAKKAEE 1520



 Score = 32.0 bits (72), Expect = 0.24
 Identities = 13/36 (36%), Positives = 24/36 (66%)

Query: 5    AIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEEL 40
            A  +K ++E ++K EE+K+ +E KKK E  ++ +E 
Sbjct: 1414 AAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEA 1449



 Score = 32.0 bits (72), Expect = 0.25
 Identities = 15/36 (41%), Positives = 23/36 (63%)

Query: 5    AIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEEL 40
            A E K +EE ++K EE K+ +E KKK E  ++ +E 
Sbjct: 1453 AEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEA 1488



 Score = 31.6 bits (71), Expect = 0.25
 Identities = 15/36 (41%), Positives = 24/36 (66%)

Query: 5    AIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEEL 40
            A E+K ++E ++K EE K+ +E KKK E  ++ EE 
Sbjct: 1427 AEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEA 1462



 Score = 31.6 bits (71), Expect = 0.32
 Identities = 15/35 (42%), Positives = 23/35 (65%), Gaps = 2/35 (5%)

Query: 7    ERKISEEEEEKKEEEKEKEEEKKKKE--GGREGEE 39
            E K +EE+++K EE K+ EE++KK      +E EE
Sbjct: 1666 EAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEE 1700



 Score = 31.3 bits (70), Expect = 0.36
 Identities = 15/36 (41%), Positives = 23/36 (63%)

Query: 5    AIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEEL 40
            A E K ++E ++K EE K+ EE KKK E  ++ +E 
Sbjct: 1440 AEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEA 1475



 Score = 31.3 bits (70), Expect = 0.44
 Identities = 15/36 (41%), Positives = 22/36 (61%), Gaps = 1/36 (2%)

Query: 5    AIERKISEEEEEKKEEE-KEKEEEKKKKEGGREGEE 39
            A E K +EE+E+K  E  K++ EE KK E  ++ E 
Sbjct: 1677 AEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEA 1712



 Score = 30.9 bits (69), Expect = 0.59
 Identities = 15/33 (45%), Positives = 20/33 (60%), Gaps = 1/33 (3%)

Query: 8    RKISEEEEEKKEEEKEKEEEKKKK-EGGREGEE 39
            +K  EE + K  EE +K EE KKK E  ++ EE
Sbjct: 1653 KKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEE 1685



 Score = 30.5 bits (68), Expect = 0.66
 Identities = 11/27 (40%), Positives = 17/27 (62%)

Query: 13   EEEEKKEEEKEKEEEKKKKEGGREGEE 39
             E +KK +E +K EE KK +  ++ EE
Sbjct: 1506 AEAKKKADEAKKAEEAKKADEAKKAEE 1532



 Score = 30.1 bits (67), Expect = 0.85
 Identities = 18/40 (45%), Positives = 25/40 (62%), Gaps = 4/40 (10%)

Query: 5    AIERKISEE----EEEKKEEEKEKEEEKKKKEGGREGEEL 40
            A E K ++E    EE KK +E +K EEKKK +  ++ EEL
Sbjct: 1518 AEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEEL 1557



 Score = 30.1 bits (67), Expect = 0.95
 Identities = 13/37 (35%), Positives = 20/37 (54%), Gaps = 1/37 (2%)

Query: 5    AIERKISEEEEEKKEEEKEKEEE-KKKKEGGREGEEL 40
            A E K   +E +K    K+K +E KKK E  ++ +E 
Sbjct: 1400 AEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEA 1436



 Score = 29.7 bits (66), Expect = 1.1
 Identities = 16/33 (48%), Positives = 22/33 (66%)

Query: 7    ERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
            E K   EE++KK EE +K+EE+KKK    + EE
Sbjct: 1734 EAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEE 1766



 Score = 29.7 bits (66), Expect = 1.3
 Identities = 16/42 (38%), Positives = 20/42 (47%), Gaps = 9/42 (21%)

Query: 7    ERKISEEEEEKKEEE---------KEKEEEKKKKEGGREGEE 39
            E KI   EE KK EE         K +E+EKK  E  ++  E
Sbjct: 1658 ENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAE 1699



 Score = 29.7 bits (66), Expect = 1.4
 Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)

Query: 5    AIERKISEEEEEKKEEE--KEKEEEKKKKEGGREGEE 39
            A E    E EE KK EE  K++ EEKKK E  ++ EE
Sbjct: 1690 AAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEE 1726



 Score = 29.3 bits (65), Expect = 1.4
 Identities = 13/37 (35%), Positives = 21/37 (56%), Gaps = 4/37 (10%)

Query: 8    RKISEE----EEEKKEEEKEKEEEKKKKEGGREGEEL 40
            +K +E     +E KK EE +K +E KK E  ++ +E 
Sbjct: 1503 KKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEA 1539



 Score = 29.3 bits (65), Expect = 1.5
 Identities = 13/34 (38%), Positives = 21/34 (61%)

Query: 7    ERKISEEEEEKKEEEKEKEEEKKKKEGGREGEEL 40
            E K   +E +K EE+K+ +E KKK E  ++ +E 
Sbjct: 1287 EEKKKADEAKKAEEKKKADEAKKKAEEAKKADEA 1320



 Score = 29.3 bits (65), Expect = 1.8
 Identities = 17/35 (48%), Positives = 22/35 (62%)

Query: 5    AIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
            A E K +EE + K EE K+ EEEKKK E  ++ E 
Sbjct: 1609 AEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEA 1643



 Score = 29.0 bits (64), Expect = 2.1
 Identities = 14/39 (35%), Positives = 22/39 (56%)

Query: 1    MTPIAIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
            M   A  +   + EE +K +E +K EEKKK +  ++ EE
Sbjct: 1262 MAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEE 1300



 Score = 29.0 bits (64), Expect = 2.1
 Identities = 11/32 (34%), Positives = 18/32 (56%)

Query: 8    RKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
            +K +EE ++K +  K+K EE KK     + E 
Sbjct: 1321 KKKAEEAKKKADAAKKKAEEAKKAAEAAKAEA 1352



 Score = 29.0 bits (64), Expect = 2.2
 Identities = 12/35 (34%), Positives = 23/35 (65%)

Query: 5    AIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
            A E+K ++E ++K EE+K+K +E KK    ++  +
Sbjct: 1387 AEEKKKADEAKKKAEEDKKKADELKKAAAAKKKAD 1421



 Score = 29.0 bits (64), Expect = 2.3
 Identities = 15/37 (40%), Positives = 20/37 (54%), Gaps = 4/37 (10%)

Query: 7    ERKISE----EEEEKKEEEKEKEEEKKKKEGGREGEE 39
            ERK  E    E+ +K E  K+ EE KK  E  ++ EE
Sbjct: 1211 ERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEE 1247



 Score = 29.0 bits (64), Expect = 2.4
 Identities = 10/32 (31%), Positives = 20/32 (62%)

Query: 8    RKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
            +K +EE+++K +E K+    KKK +  ++  E
Sbjct: 1397 KKKAEEDKKKADELKKAAAAKKKADEAKKKAE 1428



 Score = 28.6 bits (63), Expect = 2.6
 Identities = 14/37 (37%), Positives = 23/37 (62%), Gaps = 1/37 (2%)

Query: 5    AIERKISEEEEEKKEEEKEKEEE-KKKKEGGREGEEL 40
            A E K ++E ++K EE K+K +E KK  E  ++ +E 
Sbjct: 1479 AEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEA 1515



 Score = 28.2 bits (62), Expect = 3.4
 Identities = 12/36 (33%), Positives = 21/36 (58%)

Query: 5    AIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEEL 40
            A E K    E++K+E +K+ +  KKK E  ++ +E 
Sbjct: 1361 AAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEA 1396



 Score = 28.2 bits (62), Expect = 3.5
 Identities = 13/48 (27%), Positives = 16/48 (33%)

Query: 5    AIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEELGVHGNKTTGIKK 52
            A   K   EE +K  E  + E E    E     E+      K    KK
Sbjct: 1331 ADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKK 1378



 Score = 28.2 bits (62), Expect = 3.5
 Identities = 12/27 (44%), Positives = 17/27 (62%)

Query: 13   EEEEKKEEEKEKEEEKKKKEGGREGEE 39
            EE  K EE K+K E+ +K E  R+ E+
Sbjct: 1112 EEARKAEEAKKKAEDARKAEEARKAED 1138



 Score = 28.2 bits (62), Expect = 3.7
 Identities = 19/39 (48%), Positives = 24/39 (61%), Gaps = 4/39 (10%)

Query: 5    AIERKISEE----EEEKKEEEKEKEEEKKKKEGGREGEE 39
            A E K +EE    +E KK EE +K EEKKK E  ++ EE
Sbjct: 1536 ADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEE 1574



 Score = 28.2 bits (62), Expect = 4.0
 Identities = 15/35 (42%), Positives = 20/35 (57%)

Query: 5    AIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
            A E K   EE +K +E K+K EE KK E  ++  E
Sbjct: 1433 ADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAE 1467



 Score = 28.2 bits (62), Expect = 4.0
 Identities = 14/48 (29%), Positives = 19/48 (39%)

Query: 5    AIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEELGVHGNKTTGIKK 52
            A E K   +  +KK EE +K  E  K E     +E      K    +K
Sbjct: 1324 AEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEK 1371



 Score = 27.8 bits (61), Expect = 4.4
 Identities = 16/34 (47%), Positives = 21/34 (61%), Gaps = 2/34 (5%)

Query: 8    RKISEEEEEKKEEEKEKEEEKKKK--EGGREGEE 39
            +K   EE++K EE K+ EEE K K  E  +E EE
Sbjct: 1708 KKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEE 1741



 Score = 27.8 bits (61), Expect = 4.6
 Identities = 16/39 (41%), Positives = 21/39 (53%)

Query: 1    MTPIAIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
            M     E+K+  EE +K EE K K EE KK E  ++  E
Sbjct: 1598 MKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVE 1636



 Score = 27.8 bits (61), Expect = 4.9
 Identities = 12/27 (44%), Positives = 18/27 (66%)

Query: 14   EEEKKEEEKEKEEEKKKKEGGREGEEL 40
            +E KK EEK+K +E KK E  ++ +E 
Sbjct: 1281 DELKKAEEKKKADEAKKAEEKKKADEA 1307



 Score = 27.4 bits (60), Expect = 6.0
 Identities = 17/46 (36%), Positives = 25/46 (54%), Gaps = 3/46 (6%)

Query: 7    ERKISEEEEEKKEEEKEKEEEKKKKEGGREGEELGVHGNKTTGIKK 52
            E KI  EE +K+ EE +K+ E+ KK+   E E+  +   K    KK
Sbjct: 1727 ENKIKAEEAKKEAEEDKKKAEEAKKD---EEEKKKIAHLKKEEEKK 1769



 Score = 27.4 bits (60), Expect = 6.1
 Identities = 19/41 (46%), Positives = 23/41 (56%), Gaps = 8/41 (19%)

Query: 7    ERKISEEEEEKKEEEKEK--------EEEKKKKEGGREGEE 39
            E KI  EE +K EEEK+K         EEKKK E  ++ EE
Sbjct: 1617 EAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEE 1657



 Score = 27.4 bits (60), Expect = 6.5
 Identities = 14/37 (37%), Positives = 24/37 (64%), Gaps = 1/37 (2%)

Query: 5    AIERKISEEEEEKKEEEKEKEEEKKK-KEGGREGEEL 40
            A E K ++E ++K EE K+ +E KKK +E  ++ +E 
Sbjct: 1466 AEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEA 1502



 Score = 27.4 bits (60), Expect = 6.7
 Identities = 13/36 (36%), Positives = 20/36 (55%), Gaps = 2/36 (5%)

Query: 7    ERKISEEEEEKKEEEKEKEEEKKKK--EGGREGEEL 40
            E K   +E +KK EE +K +E KKK  E  ++ +  
Sbjct: 1299 EEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAA 1334



 Score = 27.4 bits (60), Expect = 6.9
 Identities = 14/35 (40%), Positives = 21/35 (60%)

Query: 5    AIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
            A E K   EE++K +E K+K EE KK +  ++  E
Sbjct: 1420 ADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAE 1454



 Score = 27.4 bits (60), Expect = 7.1
 Identities = 15/36 (41%), Positives = 21/36 (58%), Gaps = 1/36 (2%)

Query: 5    AIERKISEEEEEKKEEEKEK-EEEKKKKEGGREGEE 39
            A E K   EE +K +E K+K EE KKK +  ++  E
Sbjct: 1472 ADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAE 1507



 Score = 27.0 bits (59), Expect = 8.1
 Identities = 12/24 (50%), Positives = 16/24 (66%)

Query: 8    RKISEEEEEKKEEEKEKEEEKKKK 31
            +K  EE + K EE K++ EE KKK
Sbjct: 1722 KKAEEENKIKAEEAKKEAEEDKKK 1745



 Score = 27.0 bits (59), Expect = 8.6
 Identities = 15/37 (40%), Positives = 19/37 (51%), Gaps = 4/37 (10%)

Query: 7    ERKISEE----EEEKKEEEKEKEEEKKKKEGGREGEE 39
                +EE    E  K EE ++ EE KKK E  R+ EE
Sbjct: 1096 AFGKAEEAKKTETGKAEEARKAEEAKKKAEDARKAEE 1132



 Score = 27.0 bits (59), Expect = 8.7
 Identities = 12/27 (44%), Positives = 18/27 (66%)

Query: 5    AIERKISEEEEEKKEEEKEKEEEKKKK 31
            A E K ++E ++K EE K+K +  KKK
Sbjct: 1311 AEEAKKADEAKKKAEEAKKKADAAKKK 1337



 Score = 27.0 bits (59), Expect = 8.7
 Identities = 12/35 (34%), Positives = 19/35 (54%)

Query: 5    AIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
            A   +    EE+ +  EK+KEE KKK +  ++  E
Sbjct: 1354 AAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAE 1388



 Score = 27.0 bits (59), Expect = 9.3
 Identities = 12/33 (36%), Positives = 19/33 (57%)

Query: 7    ERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
              K  EE ++K +  K+K EEKKK +  ++  E
Sbjct: 1369 AEKKKEEAKKKADAAKKKAEEKKKADEAKKKAE 1401


>gnl|CDD|235229 PRK04156, gltX, glutamyl-tRNA synthetase; Provisional.
          Length = 567

 Score = 32.9 bits (76), Expect = 0.10
 Identities = 11/30 (36%), Positives = 17/30 (56%)

Query: 1  MTPIAIERKISEEEEEKKEEEKEKEEEKKK 30
          ++      ++ E   E  EEE+EK+EEKK 
Sbjct: 61 LSLEEQRERLEELAPELLEEEEEKKEEKKG 90


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 32.6 bits (75), Expect = 0.11
 Identities = 14/28 (50%), Positives = 21/28 (75%)

Query: 5   AIERKISEEEEEKKEEEKEKEEEKKKKE 32
             E+ +   EEE++EE +EK+EEKKK+E
Sbjct: 268 EEEKILKAAEEERQEEAQEKKEEKKKEE 295



 Score = 28.0 bits (63), Expect = 3.0
 Identities = 11/26 (42%), Positives = 18/26 (69%), Gaps = 1/26 (3%)

Query: 5   AIERKISEEEEEKKEEEKEKEEEKKK 30
           A   K+S EE+ K  EEKE++++ +K
Sbjct: 298 AKLAKLSPEEQRKL-EEKERKKQARK 322



 Score = 28.0 bits (63), Expect = 3.7
 Identities = 11/21 (52%), Positives = 16/21 (76%)

Query: 12  EEEEEKKEEEKEKEEEKKKKE 32
           EE +EKKEE+K++E E K  +
Sbjct: 282 EEAQEKKEEKKKEEREAKLAK 302



 Score = 27.2 bits (61), Expect = 6.1
 Identities = 11/26 (42%), Positives = 17/26 (65%)

Query: 7   ERKISEEEEEKKEEEKEKEEEKKKKE 32
            R+  EE+  K  EE+ +EE ++KKE
Sbjct: 264 TREEEEEKILKAAEEERQEEAQEKKE 289



 Score = 26.8 bits (60), Expect = 8.7
 Identities = 10/30 (33%), Positives = 18/30 (60%), Gaps = 4/30 (13%)

Query: 7   ERKISEEEEEKKEEEKEKE----EEKKKKE 32
           E +  +EE++K+E E +      EE++K E
Sbjct: 283 EAQEKKEEKKKEEREAKLAKLSPEEQRKLE 312



 Score = 26.8 bits (60), Expect = 9.3
 Identities = 10/25 (40%), Positives = 17/25 (68%)

Query: 7   ERKISEEEEEKKEEEKEKEEEKKKK 31
           E +  E +E+K+E++KE+ E K  K
Sbjct: 278 EERQEEAQEKKEEKKKEEREAKLAK 302


>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
          Length = 330

 Score = 32.5 bits (75), Expect = 0.11
 Identities = 11/25 (44%), Positives = 17/25 (68%)

Query: 13  EEEEKKEEEKEKEEEKKKKEGGREG 37
             EE++EEE+E+EEE+  +E    G
Sbjct: 300 AAEEEEEEEEEEEEEEPSEEEAAAG 324



 Score = 30.6 bits (70), Expect = 0.45
 Identities = 9/31 (29%), Positives = 18/31 (58%)

Query: 11  SEEEEEKKEEEKEKEEEKKKKEGGREGEELG 41
           ++    ++EEE+E+EEE+++         LG
Sbjct: 296 AQAAAAEEEEEEEEEEEEEEPSEEEAAAGLG 326



 Score = 30.2 bits (69), Expect = 0.56
 Identities = 10/40 (25%), Positives = 21/40 (52%)

Query: 2   TPIAIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEELG 41
               ++  +S + +    EE+E+EEE++++E   E E   
Sbjct: 284 LDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEEEAAA 323


>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
           RPA34.5.  This is a family of proteins conserved from
           yeasts to human. Subunit A34.5 of RNA polymerase I is a
           non-essential subunit which is thought to help Pol I
           overcome topological constraints imposed on ribosomal
           DNA during the process of transcription.
          Length = 193

 Score = 32.0 bits (73), Expect = 0.13
 Identities = 19/48 (39%), Positives = 32/48 (66%), Gaps = 1/48 (2%)

Query: 6   IERKISEEEEEKKEEEKEKEEEKKKKEG-GREGEELGVHGNKTTGIKK 52
           +E++   EEEEKKE++K+KE +K+KKE   ++ + +   G+K    KK
Sbjct: 143 VEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKK 190


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 32.6 bits (75), Expect = 0.14
 Identities = 14/34 (41%), Positives = 21/34 (61%)

Query: 6   IERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
           I  K  ++ EE+K+E+K+K    KKKE   E E+
Sbjct: 415 IVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEK 448



 Score = 31.8 bits (73), Expect = 0.23
 Identities = 14/31 (45%), Positives = 21/31 (67%)

Query: 9   KISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
           K   E+ EKK EE++KE++KK   G ++ EE
Sbjct: 413 KKIVEKAEKKREEEKKEKKKKAFAGKKKEEE 443



 Score = 31.4 bits (72), Expect = 0.30
 Identities = 13/34 (38%), Positives = 23/34 (67%)

Query: 6   IERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
             +KI E+ E+K+EEEK+++++K      +E EE
Sbjct: 411 KIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEE 444



 Score = 30.7 bits (70), Expect = 0.51
 Identities = 14/33 (42%), Positives = 26/33 (78%)

Query: 7   ERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
           +++  EEEE++K+EE+++EEE++ +E   E EE
Sbjct: 439 KKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEE 471



 Score = 30.7 bits (70), Expect = 0.52
 Identities = 11/36 (30%), Positives = 22/36 (61%)

Query: 4   IAIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
           I    + +E++ E++++EK+K+    KK+   E EE
Sbjct: 412 IKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEE 447



 Score = 30.3 bits (69), Expect = 0.73
 Identities = 19/43 (44%), Positives = 29/43 (67%)

Query: 5   AIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEELGVHGNKT 47
           A ++K  EEEEEK+++E+EKEEE+++ E  +E EE       T
Sbjct: 436 AGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKKQAT 478



 Score = 29.9 bits (68), Expect = 0.84
 Identities = 14/34 (41%), Positives = 22/34 (64%)

Query: 6   IERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
           +E+   + EEEKKE++K+    KKK+E   E +E
Sbjct: 416 VEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKE 449



 Score = 29.9 bits (68), Expect = 0.89
 Identities = 16/33 (48%), Positives = 23/33 (69%)

Query: 7   ERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
           E+K      +KKEEE+E+E+EKK++E   E EE
Sbjct: 429 EKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEE 461



 Score = 29.9 bits (68), Expect = 0.99
 Identities = 13/33 (39%), Positives = 24/33 (72%)

Query: 7   ERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
           +++  ++    K++E+E+EEEK+KKE  +E EE
Sbjct: 427 KKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEE 459



 Score = 29.9 bits (68), Expect = 1.0
 Identities = 9/46 (19%), Positives = 24/46 (52%)

Query: 7   ERKISEEEEEKKEEEKEKEEEKKKKEGGREGEELGVHGNKTTGIKK 52
             +  +E+++K    K+KEEE+++++  +E E+           ++
Sbjct: 423 REEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEE 468



 Score = 29.5 bits (67), Expect = 1.3
 Identities = 7/33 (21%), Positives = 21/33 (63%)

Query: 7   ERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
             K  EEE+++K+++    ++K+++E   + ++
Sbjct: 419 AEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKK 451



 Score = 28.7 bits (65), Expect = 2.1
 Identities = 16/35 (45%), Positives = 26/35 (74%)

Query: 5   AIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
           A   K  EEEEE+++E+KE+E+E++++E   E EE
Sbjct: 434 AFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEE 468



 Score = 28.7 bits (65), Expect = 2.1
 Identities = 13/37 (35%), Positives = 23/37 (62%)

Query: 3   PIAIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
             A ++     E+ +K+ E+EK+E+KKK   G++ EE
Sbjct: 406 KKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEE 442



 Score = 28.3 bits (64), Expect = 3.0
 Identities = 14/35 (40%), Positives = 24/35 (68%)

Query: 5   AIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
             +    +++EE++EEEKEK+EE+K++E     EE
Sbjct: 431 KKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEE 465



 Score = 27.6 bits (62), Expect = 5.4
 Identities = 10/33 (30%), Positives = 24/33 (72%)

Query: 7   ERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
           +R+  ++E++KK    +K+EE++++E  ++ EE
Sbjct: 422 KREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEE 454



 Score = 27.2 bits (61), Expect = 6.7
 Identities = 9/32 (28%), Positives = 19/32 (59%)

Query: 8   RKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
            K + ++ +K  E+ EK+ E++KKE  ++   
Sbjct: 405 SKKATKKIKKIVEKAEKKREEEKKEKKKKAFA 436


>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex.  This
          entry is characterized by proteins with alternating
          conserved and low-complexity regions. Bud13 together
          with Snu17p and a newly identified factor,
          Pml1p/Ylr016c, form a novel trimeric complex. called
          The RES complex, pre-mRNA retention and splicing
          complex. Subunits of this complex are not essential for
          viability of yeasts but they are required for efficient
          splicing in vitro and in vivo. Furthermore,
          inactivation of this complex causes pre-mRNA leakage
          from the nucleus. Bud13 contains a unique,
          phylogenetically conserved C-terminal region of unknown
          function.
          Length = 141

 Score = 31.1 bits (71), Expect = 0.16
 Identities = 14/30 (46%), Positives = 23/30 (76%)

Query: 4  IAIERKISEEEEEKKEEEKEKEEEKKKKEG 33
          I IE K  E+E EK+E+E+++E+EK+  +G
Sbjct: 12 IDIEEKREEKEREKEEKERKEEKEKEWGKG 41



 Score = 30.7 bits (70), Expect = 0.25
 Identities = 8/25 (32%), Positives = 20/25 (80%)

Query: 8  RKISEEEEEKKEEEKEKEEEKKKKE 32
          R I  EE+ +++E +++E+E+K+++
Sbjct: 10 RIIDIEEKREEKEREKEEKERKEEK 34


>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
           This family represents Cwf15/Cwc15 (from
           Schizosaccharomyces pombe and Saccharomyces cerevisiae
           respectively) and their homologues. The function of
           these proteins is unknown, but they form part of the
           spliceosome and are thus thought to be involved in mRNA
           splicing.
          Length = 241

 Score = 31.6 bits (72), Expect = 0.18
 Identities = 12/21 (57%), Positives = 17/21 (80%)

Query: 12  EEEEEKKEEEKEKEEEKKKKE 32
            EE+E++EEEK  EEEK ++E
Sbjct: 159 AEEKEREEEEKAAEEEKAREE 179



 Score = 27.0 bits (60), Expect = 6.7
 Identities = 13/21 (61%), Positives = 15/21 (71%)

Query: 7   ERKISEEEEEKKEEEKEKEEE 27
           E K  EEEE+  EEEK +EEE
Sbjct: 160 EEKEREEEEKAAEEEKAREEE 180


>gnl|CDD|227084 COG4741, COG4741, Predicted secreted endonuclease distantly
          related to archaeal Holliday junction resolvase
          [Nucleotide transport and metabolism].
          Length = 175

 Score = 31.3 bits (71), Expect = 0.19
 Identities = 14/32 (43%), Positives = 19/32 (59%)

Query: 5  AIERKISEEEEEKKEEEKEKEEEKKKKEGGRE 36
            ER ++E +  K+EE K KE  +KK E  RE
Sbjct: 46 ERERLVNEAQARKEEEWKLKEWIEKKIEEARE 77


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 31.9 bits (72), Expect = 0.21
 Identities = 14/46 (30%), Positives = 23/46 (50%)

Query: 7   ERKISEEEEEKKEEEKEKEEEKKKKEGGREGEELGVHGNKTTGIKK 52
           E K+S E   K E E++++ E+ ++E   E   L   G K   +K 
Sbjct: 301 EDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKG 346



 Score = 28.4 bits (63), Expect = 3.2
 Identities = 10/30 (33%), Positives = 16/30 (53%)

Query: 3   PIAIERKISEEEEEKKEEEKEKEEEKKKKE 32
              IE+    EE E+++ E+E    KK K+
Sbjct: 311 KPEIEQDEDSEESEEEKNEEEGGLSKKGKK 340


>gnl|CDD|235850 PRK06669, fliH, flagellar assembly protein H; Validated.
          Length = 281

 Score = 31.5 bits (72), Expect = 0.25
 Identities = 16/38 (42%), Positives = 23/38 (60%), Gaps = 1/38 (2%)

Query: 4   IAIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEELG 41
             ++ +I E E+E+ EEE E+  E+ K EG  EG E G
Sbjct: 95  EKLQMQI-EREQEEWEEELERLIEEAKAEGYEEGYEKG 131


>gnl|CDD|224530 COG1614, CdhC, CO dehydrogenase/acetyl-CoA synthase beta subunit
           [Energy production and conversion].
          Length = 470

 Score = 31.7 bits (72), Expect = 0.26
 Identities = 15/36 (41%), Positives = 21/36 (58%)

Query: 6   IERKISEEEEEKKEEEKEKEEEKKKKEGGREGEELG 41
           +ER   EEEEE++EEE+E  E +   E    G E+ 
Sbjct: 398 VERWAEEEEEEEEEEEEEAAEAEAPMEEPVPGFEVP 433


>gnl|CDD|179580 PRK03449, PRK03449, putative inner membrane protein translocase
           component YidC; Provisional.
          Length = 304

 Score = 31.2 bits (71), Expect = 0.27
 Identities = 10/28 (35%), Positives = 15/28 (53%)

Query: 8   RKISEEEEEKKEEEKEKEEEKKKKEGGR 35
            KI +EEE KK+ + E+      K G +
Sbjct: 268 GKIDKEEEAKKQAKAERRAANAPKPGAK 295


>gnl|CDD|205206 pfam13025, DUF3886, Protein of unknown function (DUF3886).  This
          family of proteins is functionally uncharacterized.
          This family of proteins is found in bacteria. Proteins
          in this family are approximately 90 amino acids in
          length. There are two completely conserved L residues
          that may be functionally important.
          Length = 70

 Score = 28.8 bits (65), Expect = 0.36
 Identities = 14/29 (48%), Positives = 21/29 (72%)

Query: 12 EEEEEKKEEEKEKEEEKKKKEGGREGEEL 40
          EEE+ ++EEE  K EE+K++E  +  EEL
Sbjct: 32 EEEKREEEEEARKREERKEREKNKSFEEL 60


>gnl|CDD|206063 pfam13892, DBINO, DNA-binding domain.  DBINO is a DNA-binding
           domain found on global transcription activator SNF2L1
           proteins and chromatin re-modelling proteins.
          Length = 140

 Score = 30.3 bits (69), Expect = 0.36
 Identities = 13/25 (52%), Positives = 16/25 (64%)

Query: 12  EEEEEKKEEEKEKEEEKKKKEGGRE 36
           EE E +K  EKE  E+ KK+E  RE
Sbjct: 84  EERELRKRAEKEALEQAKKEEELRE 108


>gnl|CDD|222948 PHA02941, PHA02941, hypothetical protein; Provisional.
          Length = 356

 Score = 31.1 bits (70), Expect = 0.36
 Identities = 12/26 (46%), Positives = 21/26 (80%)

Query: 3   PIAIERKISEEEEEKKEEEKEKEEEK 28
           P AI+ ++ EE +E++EEE+ +EEE+
Sbjct: 324 PFAIDEEMLEETQEQQEEEENEEEEE 349


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 31.0 bits (70), Expect = 0.37
 Identities = 16/34 (47%), Positives = 23/34 (67%), Gaps = 1/34 (2%)

Query: 7   ERKISEE-EEEKKEEEKEKEEEKKKKEGGREGEE 39
           +RK  EE +E++  +EKEKE+EKK +E     EE
Sbjct: 130 DRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEE 163



 Score = 30.6 bits (69), Expect = 0.57
 Identities = 12/32 (37%), Positives = 19/32 (59%)

Query: 8   RKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
                + E  KEEEKEKE+ K++K+  +E  +
Sbjct: 94  PAKEPKNESGKEEEKEKEQVKEEKKKKKEKPK 125



 Score = 29.1 bits (65), Expect = 1.5
 Identities = 18/39 (46%), Positives = 25/39 (64%)

Query: 2   TPIAIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEEL 40
           T  A E K    +EE+KE+E+ KEE+KKKKE  +E  + 
Sbjct: 92  TKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKD 130



 Score = 27.5 bits (61), Expect = 5.0
 Identities = 11/31 (35%), Positives = 23/31 (74%)

Query: 7   ERKISEEEEEKKEEEKEKEEEKKKKEGGREG 37
           + ++ EE+++KKE+ KE+ +++K KE  +E 
Sbjct: 110 KEQVKEEKKKKKEKPKEEPKDRKPKEEAKEK 140


>gnl|CDD|239572 cd03490, Topoisomer_IB_N_1, Topoisomer_IB_N_1: A subgroup of the
           N-terminal DNA binding fragment found in eukaryotic DNA
           topoisomerase (topo) IB. Topo IB proteins include the
           monomeric yeast and human topo I and heterodimeric topo
           I from Leishmania donvanni. Topo I enzymes are divided
           into:  topo type IA (bacterial) and type IB
           (eukaryotic). Topo I relaxes superhelical tension in
           duplex DNA by creating a single-strand nick, the broken
           strand can then rotate around the unbroken strand to
           remove DNA supercoils and, the nick is religated,
           liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit religation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  In addition to differences in
           structure and some biochemical properties,
           Trypanosomatid parasite topos I differ from human topo I
           in their sensitivity to CPTs and other classical topo I
           inhibitors. Trypanosomatid topos I have putative roles
           in organizing the kinetoplast DNA network unique to
           these parasites.  This family may represent more than
           one structural domain.
          Length = 217

 Score = 30.6 bits (69), Expect = 0.38
 Identities = 14/31 (45%), Positives = 21/31 (67%)

Query: 6   IERKISEEEEEKKEEEKEKEEEKKKKEGGRE 36
           I+  + EE+E+KK   KE++E KKK+   RE
Sbjct: 89  IKNHLEEEKEKKKNLNKEEKEAKKKERAKRE 119


>gnl|CDD|217829 pfam03985, Paf1, Paf1.  Members of this family are components of
           the RNA polymerase II associated Paf1 complex. The Paf1
           complex functions during the elongation phase of
           transcription in conjunction with Spt4-Spt5 and
           Spt16-Pob3i.
          Length = 431

 Score = 30.9 bits (70), Expect = 0.38
 Identities = 11/33 (33%), Positives = 20/33 (60%)

Query: 7   ERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
             +  +EEEE++ +E E+EE +  +E G +  E
Sbjct: 370 VDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSRE 402



 Score = 28.5 bits (64), Expect = 2.5
 Identities = 8/35 (22%), Positives = 18/35 (51%)

Query: 12  EEEEEKKEEEKEKEEEKKKKEGGREGEELGVHGNK 46
            +E+E +EEE+  +E ++++    E E      + 
Sbjct: 370 VDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDG 404



 Score = 27.8 bits (62), Expect = 4.8
 Identities = 10/29 (34%), Positives = 20/29 (68%)

Query: 11  SEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
            E +E++ EEE+++ +E +++EG    EE
Sbjct: 368 EEVDEDEDEEEEQRSDEHEEEEGEDSEEE 396



 Score = 26.6 bits (59), Expect = 8.9
 Identities = 7/29 (24%), Positives = 13/29 (44%)

Query: 11  SEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
            EE E+ +EE  +  E+   +     G +
Sbjct: 387 EEEGEDSEEEGSQSREDGSSESSSDVGSD 415



 Score = 26.6 bits (59), Expect = 9.2
 Identities = 7/34 (20%), Positives = 18/34 (52%)

Query: 6   IERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
            E +  E+  ++ EEE+ ++ E++  +   +G  
Sbjct: 373 DEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSS 406



 Score = 26.6 bits (59), Expect = 9.2
 Identities = 9/30 (30%), Positives = 17/30 (56%)

Query: 10  ISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
           I  EE ++ E+E+E++   + +E   E  E
Sbjct: 365 IDFEEVDEDEDEEEEQRSDEHEEEEGEDSE 394


>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein.  This
          family includes archaebacterial L12, eukaryotic P0, P1
          and P2.
          Length = 88

 Score = 29.1 bits (66), Expect = 0.47
 Identities = 11/16 (68%), Positives = 14/16 (87%)

Query: 12 EEEEEKKEEEKEKEEE 27
            EEEKKEEE+E+EE+
Sbjct: 65 AAEEEKKEEEEEEEED 80



 Score = 29.1 bits (66), Expect = 0.49
 Identities = 9/16 (56%), Positives = 13/16 (81%)

Query: 12 EEEEEKKEEEKEKEEE 27
             EE+K+EE+E+EEE
Sbjct: 64 AAAEEEKKEEEEEEEE 79



 Score = 27.6 bits (62), Expect = 1.6
 Identities = 9/16 (56%), Positives = 15/16 (93%)

Query: 12 EEEEEKKEEEKEKEEE 27
           EEE+K+EEE+E+E++
Sbjct: 66 AEEEKKEEEEEEEEDD 81



 Score = 26.4 bits (59), Expect = 3.8
 Identities = 6/17 (35%), Positives = 13/17 (76%)

Query: 11 SEEEEEKKEEEKEKEEE 27
          +    E++++E+E+EEE
Sbjct: 62 AAAAAEEEKKEEEEEEE 78



 Score = 26.0 bits (58), Expect = 5.8
 Identities = 7/21 (33%), Positives = 14/21 (66%)

Query: 13 EEEEKKEEEKEKEEEKKKKEG 33
                EEEK++EEE+++++ 
Sbjct: 61 AAAAAAEEEKKEEEEEEEEDD 81



 Score = 25.7 bits (57), Expect = 7.9
 Identities = 7/17 (41%), Positives = 11/17 (64%)

Query: 11 SEEEEEKKEEEKEKEEE 27
          +      +EE+KE+EEE
Sbjct: 60 AAAAAAAEEEKKEEEEE 76


>gnl|CDD|148679 pfam07218, RAP1, Rhoptry-associated protein 1 (RAP-1).  This family
           consists of several rhoptry-associated protein 1 (RAP-1)
           sequences which appear to be specific to Plasmodium
           falciparum.
          Length = 790

 Score = 30.8 bits (69), Expect = 0.48
 Identities = 15/38 (39%), Positives = 22/38 (57%)

Query: 9   KISEEEEEKKEEEKEKEEEKKKKEGGREGEELGVHGNK 46
           K++EEEE + +EE E+ EE KK+   + G E      K
Sbjct: 239 KVAEEEEFELDEEHEEAEEDKKEALEKIGAEGDEEKFK 276



 Score = 29.3 bits (65), Expect = 1.8
 Identities = 12/30 (40%), Positives = 17/30 (56%)

Query: 7   ERKISEEEEEKKEEEKEKEEEKKKKEGGRE 36
           E    +EE E+ EE+K++  EK   EG  E
Sbjct: 244 EEFELDEEHEEAEEDKKEALEKIGAEGDEE 273


>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207).  This
           family is found in eukaryotes; it has several conserved
           tryptophan residues. The function is not known.
          Length = 261

 Score = 30.1 bits (68), Expect = 0.66
 Identities = 9/26 (34%), Positives = 21/26 (80%)

Query: 7   ERKISEEEEEKKEEEKEKEEEKKKKE 32
           E K  +++++K+EEE+ K+ +K+++E
Sbjct: 188 ELKKLKQQQQKREEERRKQRKKQQEE 213



 Score = 27.8 bits (62), Expect = 4.3
 Identities = 9/38 (23%), Positives = 26/38 (68%)

Query: 2   TPIAIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
           +    ++++ E E +K +++++K EE+++K+  ++ EE
Sbjct: 176 SQEEAKKRLQEWELKKLKQQQQKREEERRKQRKKQQEE 213



 Score = 26.6 bits (59), Expect = 8.3
 Identities = 9/27 (33%), Positives = 20/27 (74%)

Query: 7   ERKISEEEEEKKEEEKEKEEEKKKKEG 33
           ++K  EE  +++++++E+EE K+K E 
Sbjct: 196 QQKREEERRKQRKKQQEEEERKQKAEE 222


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 30.3 bits (69), Expect = 0.70
 Identities = 13/33 (39%), Positives = 18/33 (54%)

Query: 7   ERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
             K +E+EE +K   K+K E   KK  G +GE 
Sbjct: 421 AEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGET 453



 Score = 27.2 bits (61), Expect = 7.5
 Identities = 7/35 (20%), Positives = 17/35 (48%)

Query: 12  EEEEEKKEEEKEKEEEKKKKEGGREGEELGVHGNK 46
            E++ +KEE ++   +KK +   ++ +       K
Sbjct: 421 AEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKK 455



 Score = 26.8 bits (60), Expect = 9.1
 Identities = 10/29 (34%), Positives = 17/29 (58%)

Query: 5   AIERKISEEEEEKKEEEKEKEEEKKKKEG 33
            + +K  + E++ ++EE EK   KKK E 
Sbjct: 413 KLRKKQRKAEKKAEKEEAEKAAAKKKAEA 441


>gnl|CDD|235425 PRK05349, PRK05349, Na(+)-translocating NADH-quinone reductase
           subunit B; Provisional.
          Length = 405

 Score = 30.2 bits (69), Expect = 0.73
 Identities = 17/49 (34%), Positives = 23/49 (46%), Gaps = 10/49 (20%)

Query: 108 RFVAGSLAGGVS----LN-VPDDFTAKEMLTGM-WWRHLVAGGVA-GAV 149
           R VAG + G ++     N +  D      +  M W+ HLV GG A G V
Sbjct: 287 RIVAGVMIGMIATSLLFNLIGSDTNP---MFAMPWYWHLVLGGFAFGMV 332


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 30.0 bits (68), Expect = 0.81
 Identities = 13/42 (30%), Positives = 23/42 (54%), Gaps = 2/42 (4%)

Query: 7   ERKISEEEEEKKEEE--KEKEEEKKKKEGGREGEELGVHGNK 46
           +R  +EEE  K+E E  K+ E E+ ++  G E ++     +K
Sbjct: 255 DRTKTEEELAKEEAERLKKLEAERLRRMRGEEEDDEEEEDSK 296



 Score = 28.0 bits (63), Expect = 4.1
 Identities = 12/30 (40%), Positives = 20/30 (66%)

Query: 11  SEEEEEKKEEEKEKEEEKKKKEGGREGEEL 40
             ++E+ +EEE+E++E+KKKK       EL
Sbjct: 362 DSDDEDDEEEEEEEKEKKKKKSAESTRSEL 391


>gnl|CDD|100109 cd05831, Ribosomal_P1, Ribosomal protein P1. This subfamily
          represents the eukaryotic large ribosomal protein P1.
          Eukaryotic P1 and P2 are functionally equivalent to the
          bacterial protein L7/L12, but are not homologous to
          L7/L12. P1 is located in the L12 stalk, with proteins
          P2, P0, L11, and 28S rRNA. P1 and P2 are the only
          proteins in the ribosome to occur as multimers, always
          appearing as sets of heterodimers. Recent data indicate
          that eukaryotes have four copies (two heterodimers),
          while most archaeal species contain six copies of L12p
          (three homodimers) and bacteria may have four or six
          copies (two or three homodimers), depending on the
          species. Experiments using S. cerevisiae P1 and P2
          indicate that P1 proteins are positioned more
          internally with limited reactivity in the C-terminal
          domains, while P2 proteins seem to be more externally
          located and are more likely to interact with other
          cellular components. In lower eukaryotes, P1 and P2 are
          further subdivided into P1A, P1B, P2A, and P2B, which
          form P1A/P2B and P1B/P2A heterodimers. Some plant
          species have a third P-protein, called P3, which is not
          homologous to P1 and P2. In humans, P1 and P2 are
          strongly autoimmunogenic. They play a significant role
          in the etiology and pathogenesis of systemic lupus
          erythema (SLE). In addition, the ribosome-inactivating
          protein trichosanthin (TCS) interacts with human P0,
          P1, and P2, with its primary binding site located in
          the C-terminal region of P2. TCS inactivates the
          ribosome by depurinating a specific adenine in the
          sarcin-ricin loop of 28S rRNA.
          Length = 103

 Score = 28.4 bits (64), Expect = 0.84
 Identities = 8/16 (50%), Positives = 14/16 (87%)

Query: 12 EEEEEKKEEEKEKEEE 27
           ++EEKKEEE+E+ ++
Sbjct: 81 AKKEEKKEEEEEESDD 96


>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
          Length = 651

 Score = 30.0 bits (68), Expect = 0.92
 Identities = 12/33 (36%), Positives = 21/33 (63%), Gaps = 3/33 (9%)

Query: 3   PIAIERKISEEEEEKKEEE---KEKEEEKKKKE 32
              ++R+  E+E  K+++     +K+EEKKKKE
Sbjct: 550 KEELQREKEEKEALKEQKRLRKLKKQEEKKKKE 582



 Score = 28.5 bits (64), Expect = 2.7
 Identities = 10/28 (35%), Positives = 19/28 (67%)

Query: 13  EEEEKKEEEKEKEEEKKKKEGGREGEEL 40
           E+EEK+  +++K   K KK+  ++ +EL
Sbjct: 556 EKEEKEALKEQKRLRKLKKQEEKKKKEL 583



 Score = 26.9 bits (60), Expect = 8.9
 Identities = 9/27 (33%), Positives = 19/27 (70%)

Query: 6   IERKISEEEEEKKEEEKEKEEEKKKKE 32
            E K + +E+++  + K++EE+KKK+ 
Sbjct: 557 KEEKEALKEQKRLRKLKKQEEKKKKEL 583


>gnl|CDD|177433 PHA02608, 67, prohead core protein; Provisional.
          Length = 80

 Score = 28.2 bits (63), Expect = 0.92
 Identities = 7/39 (17%), Positives = 20/39 (51%), Gaps = 3/39 (7%)

Query: 4  IAIERKI---SEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
          + I R +    EE E+  ++E + +++ K  +   + ++
Sbjct: 37 VEIARSVMIEGEEPEDDDDDEDDDDDDDKDDKDDDDDDD 75


>gnl|CDD|204614 pfam11221, Med21, Subunit 21 of Mediator complex.  Med21 has been
           known as Srb7 in yeasts, hSrb7 in humans and Trap 19 in
           Drosophila. The heterodimer of the two subunits Med7 and
           Med21 appears to act as a hinge between the middle and
           the tail regions of Mediator.
          Length = 132

 Score = 28.8 bits (65), Expect = 0.93
 Identities = 14/26 (53%), Positives = 18/26 (69%)

Query: 7   ERKISEEEEEKKEEEKEKEEEKKKKE 32
            R+I E EEE +E E E+EE  K+KE
Sbjct: 91  LRRIKELEEELREVEAEREEAVKEKE 116



 Score = 27.2 bits (61), Expect = 3.1
 Identities = 11/24 (45%), Positives = 16/24 (66%)

Query: 7   ERKISEEEEEKKEEEKEKEEEKKK 30
           E ++ E E E++E  KEKE+  KK
Sbjct: 98  EEELREVEAEREEAVKEKEKLLKK 121



 Score = 26.5 bits (59), Expect = 5.8
 Identities = 13/33 (39%), Positives = 22/33 (66%), Gaps = 3/33 (9%)

Query: 11  SEEEEE---KKEEEKEKEEEKKKKEGGREGEEL 40
           SEEE+    K+ EE+ +E E +++E  +E E+L
Sbjct: 86  SEEEQLRRIKELEEELREVEAEREEAVKEKEKL 118


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
           Rpc31.  RNA polymerase III contains seventeen subunits
           in yeasts and in human cells. Twelve of these are akin
           to RNA polymerase I or II and the other five are RNA pol
           III-specific, and form the functionally distinct groups
           (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
           Rpc34 and Rpc82 form a cluster of enzyme-specific
           subunits that contribute to transcription initiation in
           S.cerevisiae and H.sapiens. There is evidence that these
           subunits are anchored at or near the N-terminal Zn-fold
           of Rpc1, itself prolonged by a highly conserved but RNA
           polymerase III-specific domain.
          Length = 221

 Score = 29.3 bits (66), Expect = 0.93
 Identities = 13/39 (33%), Positives = 21/39 (53%)

Query: 1   MTPIAIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
                I+ K+S  E++ KE E E  +E+ +K+   E EE
Sbjct: 144 EEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEE 182



 Score = 27.8 bits (62), Expect = 3.5
 Identities = 10/28 (35%), Positives = 17/28 (60%)

Query: 12  EEEEEKKEEEKEKEEEKKKKEGGREGEE 39
            E++ K+ E ++ +EE +K E   E EE
Sbjct: 156 LEKKLKELEAEDVDEEDEKDEEEEEEEE 183



 Score = 26.3 bits (58), Expect = 9.8
 Identities = 10/16 (62%), Positives = 15/16 (93%)

Query: 12  EEEEEKKEEEKEKEEE 27
           EE+E+ +EEE+E+EEE
Sbjct: 170 EEDEKDEEEEEEEEEE 185


>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
          Length = 1066

 Score = 29.9 bits (67), Expect = 0.94
 Identities = 17/33 (51%), Positives = 21/33 (63%), Gaps = 3/33 (9%)

Query: 1  MTPIAIERKISEEEE---EKKEEEKEKEEEKKK 30
           T    E+KI  EEE   +KK+EEK KE+E KK
Sbjct: 3  RTESEAEKKILTEEELERKKKKEEKAKEKELKK 35



 Score = 29.5 bits (66), Expect = 1.3
 Identities = 12/32 (37%), Positives = 18/32 (56%)

Query: 8  RKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
           +   E E+K   E+E E +KKK+E  +E E 
Sbjct: 2  SRTESEAEKKILTEEELERKKKKEEKAKEKEL 33



 Score = 27.6 bits (61), Expect = 5.5
 Identities = 13/32 (40%), Positives = 20/32 (62%), Gaps = 1/32 (3%)

Query: 1  MTPIAIERKISEEEEEK-KEEEKEKEEEKKKK 31
          +T   +ERK  +EE+ K KE +K K  +K+ K
Sbjct: 13 LTEEELERKKKKEEKAKEKELKKLKAAQKEAK 44


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 29.8 bits (67), Expect = 0.97
 Identities = 14/29 (48%), Positives = 20/29 (68%)

Query: 11  SEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
           SEE E K +EEK K+EE +KK+  +  E+
Sbjct: 395 SEEAEAKAKEEKLKQEENEKKQKEQADED 423



 Score = 29.4 bits (66), Expect = 1.2
 Identities = 12/33 (36%), Positives = 24/33 (72%)

Query: 5   AIERKISEEEEEKKEEEKEKEEEKKKKEGGREG 37
           A E K+ +EE EKK++E+  E+++K+++  R+ 
Sbjct: 402 AKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434


>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32.  This family
           consists of several mammalian specific proacrosin
           binding protein sp32 sequences. sp32 is a sperm specific
           protein which is known to bind with with 55- and 53-kDa
           proacrosins and the 49-kDa acrosin intermediate. The
           exact function of sp32 is unclear, it is thought however
           that the binding of sp32 to proacrosin may be involved
           in packaging the acrosin zymogen into the acrosomal
           matrix.
          Length = 243

 Score = 29.6 bits (66), Expect = 0.98
 Identities = 13/33 (39%), Positives = 24/33 (72%), Gaps = 2/33 (6%)

Query: 7   ERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
           E K  E++ ++++EE+E EEE K++EG  +G +
Sbjct: 212 EHKTEEKQPQEEQEEEEVEEEAKQEEG--QGTD 242


>gnl|CDD|215656 pfam00012, HSP70, Hsp70 protein.  Hsp70 chaperones help to fold
           many proteins. Hsp70 assisted folding involves repeated
           cycles of substrate binding and release. Hsp70 activity
           is ATP dependent. Hsp70 proteins are made up of two
           regions: the amino terminus is the ATPase domain and the
           carboxyl terminus is the substrate binding region.
          Length = 598

 Score = 29.9 bits (68), Expect = 0.98
 Identities = 10/24 (41%), Positives = 17/24 (70%), Gaps = 2/24 (8%)

Query: 11  SEEEEEK--KEEEKEKEEEKKKKE 32
           S++E E+  K+ E+   E+KK+KE
Sbjct: 502 SDDEIERMVKDAEEYAAEDKKRKE 525


>gnl|CDD|221756 pfam12757, DUF3812, Protein of unknown function (DUF3812).  This is
           a family of fungal proteins whose function is not known.
          Length = 126

 Score = 28.8 bits (65), Expect = 1.1
 Identities = 10/21 (47%), Positives = 17/21 (80%)

Query: 12  EEEEEKKEEEKEKEEEKKKKE 32
           +EEE K++ E+ KE E++KK+
Sbjct: 106 DEEEAKRQHEEAKEREREKKK 126



 Score = 28.0 bits (63), Expect = 1.7
 Identities = 8/35 (22%), Positives = 19/35 (54%)

Query: 6   IERKISEEEEEKKEEEKEKEEEKKKKEGGREGEEL 40
           I+ +   +    +E++ ++EE K++ E  +E E  
Sbjct: 89  IDERAEAQRARDEEKKLDEEEAKRQHEEAKERERE 123



 Score = 27.2 bits (61), Expect = 3.5
 Identities = 11/26 (42%), Positives = 16/26 (61%)

Query: 5   AIERKISEEEEEKKEEEKEKEEEKKK 30
             ++   EE + + EE KE+E EKKK
Sbjct: 101 EEKKLDEEEAKRQHEEAKEREREKKK 126


>gnl|CDD|185494 PTZ00168, PTZ00168, mitochondrial carrier protein; Provisional.
          Length = 259

 Score = 29.5 bits (66), Expect = 1.1
 Identities = 13/54 (24%), Positives = 28/54 (51%), Gaps = 6/54 (11%)

Query: 141 VAGGVAGAVSRTCTAPLDRLKVYLQVHGNK-TTGIKKCLLHLLHEGGFLSLWRG 193
           + GG+AG ++   T P+D +K    ++G      + +     + E G+L+ ++G
Sbjct: 184 ICGGLAGGIAGFLTTPVDVIKSRQIIYGKSYIETVTE-----IAEEGYLTFYKG 232


>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain.  This is a family of proteins of
          approximately 300 residues, found in plants and
          vertebrates. They contain a highly conserved DDRGK
          motif.
          Length = 189

 Score = 29.3 bits (66), Expect = 1.1
 Identities = 16/33 (48%), Positives = 26/33 (78%)

Query: 7  ERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
          ERK  EE EE++E++KE+EE K+++E  R+ +E
Sbjct: 41 ERKEEEELEEEREKKKEEEERKEREEQARKEQE 73



 Score = 27.4 bits (61), Expect = 3.8
 Identities = 15/35 (42%), Positives = 23/35 (65%), Gaps = 3/35 (8%)

Query: 6  IERKISEEEEEKKEEEKEKEEEKKKKEGGREGEEL 40
           E ++ EE E+KKEEE+ KE E++ +   +E EE 
Sbjct: 44 EEEELEEEREKKKEEEERKEREEQAR---KEQEEY 75



 Score = 27.4 bits (61), Expect = 4.5
 Identities = 15/28 (53%), Positives = 19/28 (67%)

Query: 12 EEEEEKKEEEKEKEEEKKKKEGGREGEE 39
           E E K+EEE E+E EKKK+E  R+  E
Sbjct: 38 REGERKEEEELEEEREKKKEEEERKERE 65


>gnl|CDD|189762 pfam00901, Orbi_VP5, Orbivirus outer capsid protein VP5.
           cryoelectron microscopy indicates that VP5 is a trimer
           implying that there are 360 copies of VP5 per virion.
          Length = 507

 Score = 29.6 bits (67), Expect = 1.2
 Identities = 14/38 (36%), Positives = 25/38 (65%), Gaps = 1/38 (2%)

Query: 4   IAIERKISE-EEEEKKEEEKEKEEEKKKKEGGREGEEL 40
             ++RK+ E E E+K+EE +EK  +K  ++ G + EE+
Sbjct: 87  QGLQRKLKELEREQKEEEVREKHNKKIIEKFGEDLEEV 124


>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.
          This domain is found to the N-terminus of bacterial
          signal peptidases of the S49 family (pfam01343).
          Length = 154

 Score = 28.6 bits (65), Expect = 1.2
 Identities = 12/26 (46%), Positives = 17/26 (65%)

Query: 7  ERKISEEEEEKKEEEKEKEEEKKKKE 32
          E K  E+ E+K E+ K K E+KK K+
Sbjct: 67 ELKAWEKAEKKAEKAKAKAEKKKAKK 92


>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
          Length = 1832

 Score = 29.7 bits (67), Expect = 1.2
 Identities = 16/33 (48%), Positives = 22/33 (66%), Gaps = 4/33 (12%)

Query: 7   ERKISEEEEEKKEE----EKEKEEEKKKKEGGR 35
           E KIS+  EEK ++    EK+KEEE K++E  R
Sbjct: 734 EFKISDSVEEKTKKKKKKEKKKEEEYKREEKAR 766



 Score = 28.5 bits (64), Expect = 3.0
 Identities = 11/32 (34%), Positives = 21/32 (65%)

Query: 1   MTPIAIERKISEEEEEKKEEEKEKEEEKKKKE 32
           ++    E+   ++++EKK+EE+ K EEK + E
Sbjct: 737 ISDSVEEKTKKKKKKEKKKEEEYKREEKARIE 768


>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
           [Transcription / Chromatin structure and dynamics].
          Length = 279

 Score = 29.2 bits (65), Expect = 1.3
 Identities = 12/40 (30%), Positives = 19/40 (47%)

Query: 10  ISEEEEEKKEEEKEKEEEKKKKEGGREGEELGVHGNKTTG 49
           +  EEEE+ EE   +EEE+ ++ G     E     N+   
Sbjct: 174 VDNEEEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEE 213



 Score = 27.3 bits (60), Expect = 6.3
 Identities = 12/33 (36%), Positives = 17/33 (51%)

Query: 7   ERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
           E +  EE + ++EEE E+       EG RE  E
Sbjct: 178 EEERLEESDGREEEEDEEVGSDSYGEGNRELNE 210


>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR).  This
           family consists of several bovine specific leukaemia
           virus receptors which are thought to function as
           transmembrane proteins, although their exact function is
           unknown.
          Length = 561

 Score = 29.3 bits (65), Expect = 1.3
 Identities = 12/34 (35%), Positives = 24/34 (70%)

Query: 3   PIAIERKISEEEEEKKEEEKEKEEEKKKKEGGRE 36
           P   ++    +++EKKE+EKE++++KKK+  G +
Sbjct: 193 PAVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFK 226



 Score = 27.7 bits (61), Expect = 4.6
 Identities = 14/36 (38%), Positives = 26/36 (72%), Gaps = 1/36 (2%)

Query: 5   AIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEEL 40
           A+E+K S++ ++K+++EKEKE +K KK+     + L
Sbjct: 194 AVEKK-SKKPKKKEKKEKEKERDKDKKKEVEGFKSL 228


>gnl|CDD|221121 pfam11489, DUF3210, Protein of unknown function (DUF3210).  This is
           a family of proteins conserved in yeasts. The function
           is not known. The Schizosaccharomyces pombe member is
           SPBC18E5.07 and the Saccharomyces cerevisiae member is
           AIM21.
          Length = 671

 Score = 29.5 bits (66), Expect = 1.4
 Identities = 13/46 (28%), Positives = 23/46 (50%), Gaps = 2/46 (4%)

Query: 3   PIAIERKISEEEEEKKEEEKEKEEEKKKKEGG--REGEELGVHGNK 46
           P A ++ +   +E  KE+++E E+ K+K      R+G   G    K
Sbjct: 598 PQAPKKVLESPKEPSKEKKEEDEDTKEKAPLSDARKGRARGPARRK 643


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
           Fas-binding protein) is thought to play a role in
           apoptosis, but precise role played by Daxx remains to be
           determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 29.5 bits (66), Expect = 1.4
 Identities = 17/33 (51%), Positives = 26/33 (78%)

Query: 7   ERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
           E +  EEEEE++EEE+E+E+E +++EG  E EE
Sbjct: 441 EEESVEEEEEEEEEEEEEEQESEEEEGEDEEEE 473



 Score = 29.1 bits (65), Expect = 2.0
 Identities = 14/30 (46%), Positives = 21/30 (70%)

Query: 11  SEEEEEKKEEEKEKEEEKKKKEGGREGEEL 40
              EEE++EEE+E+EEE++ +E   E EE 
Sbjct: 443 ESVEEEEEEEEEEEEEEQESEEEEGEDEEE 472



 Score = 28.3 bits (63), Expect = 2.9
 Identities = 15/34 (44%), Positives = 21/34 (61%)

Query: 6   IERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
            E +  EEE+E +EEE E EEE+++ E     EE
Sbjct: 451 EEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEE 484


>gnl|CDD|226040 COG3509, LpqC, Poly(3-hydroxybutyrate) depolymerase [Secondary
           metabolites biosynthesis, transport, and catabolism].
          Length = 312

 Score = 29.3 bits (66), Expect = 1.4
 Identities = 21/90 (23%), Positives = 31/90 (34%), Gaps = 25/90 (27%)

Query: 109 FVAGSLAGG-----VSLNVPDDFTAKEMLTGMWWRHLVAGGVAGAVSRTCTAPLDRLKVY 163
           +V G   GG     ++   PD F A            VAG +A  V+ T   P+  +   
Sbjct: 147 YVTGLSNGGRMANRLACEYPDIFAAIA---------PVAGLLALGVACTPPRPVSVMA-- 195

Query: 164 LQVHGNKTTGIKKCLLHLLHEGGFLSLWRG 193
              HG              + GG + + RG
Sbjct: 196 --FHGTADPLNP-------YHGGGVPIGRG 216


>gnl|CDD|234715 PRK00290, dnaK, molecular chaperone DnaK; Provisional.
          Length = 627

 Score = 29.3 bits (67), Expect = 1.5
 Identities = 12/24 (50%), Positives = 17/24 (70%), Gaps = 2/24 (8%)

Query: 11  SEEEEEK--KEEEKEKEEEKKKKE 32
           S+EE E+  K+ E   EE+KK+KE
Sbjct: 501 SDEEIERMVKDAEANAEEDKKRKE 524


>gnl|CDD|222792 PHA00435, PHA00435, capsid assembly protein.
          Length = 306

 Score = 29.0 bits (65), Expect = 1.5
 Identities = 17/37 (45%), Positives = 26/37 (70%)

Query: 6   IERKISEEEEEKKEEEKEKEEEKKKKEGGREGEELGV 42
           IE +ISE+ EE++ EE E++EE++ +E   E E LG 
Sbjct: 71  IEVRISEDGEEEEVEEGEEDEEEEGEEESEEFEPLGD 107


>gnl|CDD|187630 cd05372, ENR_SDR, Enoyl acyl carrier protein (ACP) reductase (ENR),
           divergent SDR.  This bacterial subgroup of ENRs includes
           Escherichia coli ENR. ENR catalyzes the
           NAD(P)H-dependent reduction of enoyl-ACP in the last
           step of fatty acid biosynthesis. De novo fatty acid
           biosynthesis is catalyzed by the fatty acid synthetase
           complex, through the serial addition of 2-carbon
           subunits. In bacteria and plants,ENR catalyzes one of
           six synthetic steps in this process. Oilseed rape ENR,
           and also apparently the NADH-specific form of
           Escherichia coli ENR, is tetrameric.  Although similar
           to the classical SDRs, this group does not have the
           canonical catalytic tetrad, nor does it have the typical
           Gly-rich NAD-binding pattern. Such so-called divergent
           SDRs have a GXXXXXSXA NAD-binding motif and a YXXMXXXK
           (or YXXXMXXXK) active site motif. SDRs are a
           functionally diverse family of oxidoreductases that have
           a single domain with a structurally conserved Rossmann
           fold (alpha/beta folding pattern with a central
           beta-sheet), an NAD(P)(H)-binding region, and a
           structurally diverse C-terminal region. Classical SDRs
           are typically about 250 residues long, while extended
           SDRs are approximately 350 residues. Sequence identity
           between different SDR enzymes are typically in the
           15-30% range, but the enzymes share the Rossmann fold
           NAD-binding motif and characteristic NAD-binding and
           catalytic sequence patterns. These enzymes catalyze a
           wide range of activities including the metabolism of
           steroids, cofactors, carbohydrates, lipids, aromatic
           compounds, and amino acids, and act in redox sensing.
           Classical SDRs have an TGXXX[AG]XG cofactor binding
           motif and a YXXXK active site motif, with the Tyr
           residue of the active site motif serving as a critical
           catalytic residue (Tyr-151, human
           15-hydroxyprostaglandin dehydrogenase (15-PGDH)
           numbering). In addition to the Tyr and Lys, there is
           often an upstream Ser (Ser-138, 15-PGDH numbering)
           and/or an Asn (Asn-107, 15-PGDH numbering) contributing
           to the active site; while substrate binding is in the
           C-terminal region, which determines specificity. The
           standard reaction mechanism is a 4-pro-S hydride
           transfer and proton relay involving the conserved Tyr
           and Lys, a water molecule stabilized by Asn, and
           nicotinamide. Extended SDRs have additional elements in
           the C-terminal region, and typically have a TGXXGXXG
           cofactor binding motif. Complex (multidomain) SDRs such
           as ketoreductase domains of fatty acid synthase have a
           GGXGXXG NAD(P)-binding motif and an altered active site
           motif (YXXXN). Fungal type ketoacyl reductases have a
           TGXXXGX(1-2)G NAD(P)-binding motif. Some atypical SDRs
           have lost catalytic activity and/or have an unusual
           NAD(P)-binding motif and missing or unusual active site
           residues. Reactions catalyzed within the SDR family
           include isomerization, decarboxylation, epimerization,
           C=N bond reduction, dehydratase activity,
           dehalogenation, Enoyl-CoA reduction, and
           carbonyl-alcohol oxidoreduction.
          Length = 250

 Score = 28.7 bits (65), Expect = 1.7
 Identities = 12/29 (41%), Positives = 20/29 (68%)

Query: 70  NGINVLKIAPESALKFMAYEQAKRLIRGN 98
           N + V K A ES+++++AYE  ++ IR N
Sbjct: 152 NVMGVAKAALESSVRYLAYELGRKGIRVN 180


>gnl|CDD|167649 PRK03963, PRK03963, V-type ATP synthase subunit E; Provisional.
          Length = 198

 Score = 28.6 bits (64), Expect = 1.7
 Identities = 15/37 (40%), Positives = 22/37 (59%), Gaps = 3/37 (8%)

Query: 6  IERKISEEEEEKKE---EEKEKEEEKKKKEGGREGEE 39
          I ++I+ E E+K E   EE +KE EK K+E  +  E 
Sbjct: 7  IIQEINREAEQKIEYILEEAQKEAEKIKEEARKRAES 43


>gnl|CDD|181163 PRK07906, PRK07906, hypothetical protein; Provisional.
          Length = 426

 Score = 29.0 bits (66), Expect = 1.7
 Identities = 19/55 (34%), Positives = 23/55 (41%), Gaps = 18/55 (32%)

Query: 116 GGVSLNVPDDF------TAKEMLTGMWWRHLVAGGVAG---------AVSRTCTA 155
           GG SL VP         TA++   G+ W  L A G AG         AV+R   A
Sbjct: 169 GGFSLTVPGRDRLYLIETAEK---GLAWMRLTARGRAGHGSMVNDDNAVTRLAEA 220


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
          subunit.  This is a family of proteins which are
          subunits of the eukaryotic translation initiation
          factor 3 (eIF3). In yeast it is called Hcr1. The
          Saccharomyces cerevisiae protein eIF3j (HCR1) has been
          shown to be required for processing of 20S pre-rRNA and
          binds to 18S rRNA and eIF3 subunits Rpg1p and Prt1p.
          Length = 242

 Score = 28.9 bits (65), Expect = 1.7
 Identities = 12/20 (60%), Positives = 14/20 (70%)

Query: 12 EEEEEKKEEEKEKEEEKKKK 31
          EEE+E+KEEEK K   K K 
Sbjct: 39 EEEDEEKEEEKAKVAAKAKA 58



 Score = 27.3 bits (61), Expect = 4.9
 Identities = 9/21 (42%), Positives = 14/21 (66%)

Query: 11 SEEEEEKKEEEKEKEEEKKKK 31
           EE+EEK+EE+ +   + K K
Sbjct: 39 EEEDEEKEEEKAKVAAKAKAK 59



 Score = 26.9 bits (60), Expect = 6.8
 Identities = 12/25 (48%), Positives = 18/25 (72%)

Query: 16 EKKEEEKEKEEEKKKKEGGREGEEL 40
          + K EEKEK + +K+++G RE EE 
Sbjct: 63 KAKIEEKEKAKREKEEKGLRELEED 87


>gnl|CDD|219956 pfam08658, Rad54_N, Rad54 N terminal.  This is the N terminal of
           the DNA repair protein Rad54.
          Length = 191

 Score = 28.5 bits (64), Expect = 1.7
 Identities = 15/45 (33%), Positives = 22/45 (48%), Gaps = 11/45 (24%)

Query: 12  EEEEEKKEEEKEKEEEKKKKEGGREGEELGVHGNKT----TGIKK 52
           + EEEK E+++E EE + K          G   +K+     GIKK
Sbjct: 146 KIEEEKAEKDQEPEESETK-------LSNGPKVHKSLAEILGIKK 183


>gnl|CDD|234366 TIGR03825, FliH_bacil, flagellar assembly protein FliH.  This
           bacillus clade of FliH proteins is not found by the Pfam
           FliH model pfam02108, but is closely related to the
           sequences identified by that model. Sequences identified
           by this model are observed in flagellar operons in an
           analogous position relative to other flagellar operon
           genes.
          Length = 255

 Score = 28.6 bits (64), Expect = 1.8
 Identities = 14/37 (37%), Positives = 23/37 (62%), Gaps = 1/37 (2%)

Query: 5   AIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEELG 41
           AI  +I E+E  + EEE+E+  ++ K+EG   G + G
Sbjct: 66  AIREQI-EQERAQWEEERERLIQEAKQEGYEAGFQAG 101


>gnl|CDD|218688 pfam05680, ATP-synt_E, ATP synthase E chain.  This family
          consists of several ATP synthase E chain sequences
          which are components of the CF(0) subunit.
          Length = 83

 Score = 27.4 bits (61), Expect = 1.8
 Identities = 14/35 (40%), Positives = 19/35 (54%)

Query: 6  IERKISEEEEEKKEEEKEKEEEKKKKEGGREGEEL 40
           E KI E E ++K  EK K    KKK+  +E + L
Sbjct: 35 KEEKIREYEAQEKLIEKAKAAYAKKKQAKQEAKSL 69


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 28.5 bits (64), Expect = 1.9
 Identities = 7/33 (21%), Positives = 23/33 (69%)

Query: 7   ERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
           ++   ++++ KK+++K+K+++  KK+   E ++
Sbjct: 82  QKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKD 114



 Score = 27.3 bits (61), Expect = 4.2
 Identities = 7/29 (24%), Positives = 20/29 (68%)

Query: 4   IAIERKISEEEEEKKEEEKEKEEEKKKKE 32
               +K  +++++KK+++K+ + EKK ++
Sbjct: 88  KKKSKKKKDKDKDKKDDKKDDKSEKKDEK 116



 Score = 26.6 bits (59), Expect = 8.5
 Identities = 7/28 (25%), Positives = 22/28 (78%)

Query: 12  EEEEEKKEEEKEKEEEKKKKEGGREGEE 39
           ++ ++KK+++K+K+++KK  +  ++ E+
Sbjct: 89  KKSKKKKDKDKDKKDDKKDDKSEKKDEK 116


>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II).  Bone
           sialoprotein (BSP) is a major structural protein of the
           bone matrix that is specifically expressed by
           fully-differentiated osteoblasts. The expression of bone
           sialoprotein (BSP) is normally restricted to mineralised
           connective tissues of bones and teeth where it has been
           associated with mineral crystal formation. However, it
           has been found that ectopic expression of BSP occurs in
           various lesions, including oral and extraoral
           carcinomas, in which it has been associated with the
           formation of microcrystalline deposits and the
           metastasis of cancer cells to bone.
          Length = 291

 Score = 28.5 bits (63), Expect = 2.0
 Identities = 18/49 (36%), Positives = 29/49 (59%), Gaps = 4/49 (8%)

Query: 5   AIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEELGV----HGNKTTG 49
           A +   S+E+EE++EEE+E+E E ++ E G  G         HGN ++G
Sbjct: 129 ATKEDESDEDEEEEEEEEEEEAEVEENEQGTNGTSTNSTEVDHGNGSSG 177



 Score = 27.7 bits (61), Expect = 4.0
 Identities = 15/43 (34%), Positives = 27/43 (62%), Gaps = 2/43 (4%)

Query: 5   AIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEELGVHGNKT 47
              +K ++E+E  ++EE+E+EEE+++ E   E  E G +G  T
Sbjct: 124 NAGKKATKEDESDEDEEEEEEEEEEEAE--VEENEQGTNGTST 164


>gnl|CDD|214818 smart00784, SPT2, SPT2 chromatin protein.  This entry includes the
           Saccharomyces cerevisiae protein SPT2 which is a
           chromatin protein involved in transcriptional
           regulation.
          Length = 106

 Score = 27.3 bits (61), Expect = 2.2
 Identities = 11/19 (57%), Positives = 14/19 (73%)

Query: 12  EEEEEKKEEEKEKEEEKKK 30
           EEE  +KEEE+EK   K+K
Sbjct: 88  EEERLEKEEEREKRARKRK 106



 Score = 27.0 bits (60), Expect = 3.3
 Identities = 7/20 (35%), Positives = 16/20 (80%)

Query: 12  EEEEEKKEEEKEKEEEKKKK 31
           E+ EE++ E++E+ E++ +K
Sbjct: 85  EDREEERLEKEEEREKRARK 104



 Score = 25.8 bits (57), Expect = 8.5
 Identities = 8/19 (42%), Positives = 15/19 (78%)

Query: 13  EEEEKKEEEKEKEEEKKKK 31
            EEE+ E+E+E+E+  +K+
Sbjct: 87  REEERLEKEEEREKRARKR 105



 Score = 25.8 bits (57), Expect = 8.7
 Identities = 9/20 (45%), Positives = 15/20 (75%)

Query: 13  EEEEKKEEEKEKEEEKKKKE 32
             E+++EE  EKEEE++K+ 
Sbjct: 83  RLEDREEERLEKEEEREKRA 102


>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 156 to 174 amino acids in length. This domain is
           found associated with pfam07780, pfam01728.
          Length = 154

 Score = 28.0 bits (63), Expect = 2.2
 Identities = 10/24 (41%), Positives = 17/24 (70%)

Query: 9   KISEEEEEKKEEEKEKEEEKKKKE 32
           ++ E+E  K + EK +E E+K+KE
Sbjct: 122 ELLEKELAKLKREKRRENERKQKE 145



 Score = 27.3 bits (61), Expect = 4.6
 Identities = 13/27 (48%), Positives = 17/27 (62%)

Query: 12  EEEEEKKEEEKEKEEEKKKKEGGREGE 38
            +EEE+ +E  EKE  K K+E  RE E
Sbjct: 114 LDEEEQIDELLEKELAKLKREKRRENE 140



 Score = 26.1 bits (58), Expect = 8.9
 Identities = 7/27 (25%), Positives = 16/27 (59%)

Query: 6   IERKISEEEEEKKEEEKEKEEEKKKKE 32
           I+  + +E  + K E++ + E K+K+ 
Sbjct: 120 IDELLEKELAKLKREKRRENERKQKEI 146


>gnl|CDD|215645 PLN03246, PLN03246, 26S proteasome regulatory subunit; Provisional.
          Length = 303

 Score = 28.6 bits (64), Expect = 2.3
 Identities = 11/29 (37%), Positives = 14/29 (48%)

Query: 6   IERKISEEEEEKKEEEKEKEEEKKKKEGG 34
           I  KI  +E EK E+ K   +  KK  G 
Sbjct: 275 INNKILNKEHEKAEDAKPAAKPAKKGSGK 303


>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
           [Translation, ribosomal structure and biogenesis].
          Length = 109

 Score = 27.4 bits (61), Expect = 2.3
 Identities = 9/23 (39%), Positives = 15/23 (65%)

Query: 12  EEEEEKKEEEKEKEEEKKKKEGG 34
            E +E +EEEKE+E E++  +  
Sbjct: 81  AEADEAEEEEKEEEAEEESDDDM 103



 Score = 27.4 bits (61), Expect = 2.7
 Identities = 10/37 (27%), Positives = 16/37 (43%)

Query: 3   PIAIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
             A     +    E   E  E EEE+K++E   E ++
Sbjct: 65  AAAGAAAAAAAGAEAAAEADEAEEEEKEEEAEEESDD 101



 Score = 25.8 bits (57), Expect = 8.3
 Identities = 8/33 (24%), Positives = 17/33 (51%)

Query: 2   TPIAIERKISEEEEEKKEEEKEKEEEKKKKEGG 34
              A     +E +E ++EE++E+ EE+   +  
Sbjct: 72  AAAAGAEAAAEADEAEEEEKEEEAEEESDDDML 104



 Score = 25.8 bits (57), Expect = 8.5
 Identities = 6/29 (20%), Positives = 15/29 (51%)

Query: 13  EEEEKKEEEKEKEEEKKKKEGGREGEELG 41
             E   E ++ +EEEK+++      +++ 
Sbjct: 76  GAEAAAEADEAEEEEKEEEAEEESDDDML 104


>gnl|CDD|117592 pfam09026, Cenp-B_dimeris, Centromere protein B dimerisation
          domain.  The centromere protein B (CENP-B) dimerisation
          domain is composed of two alpha-helices, which are
          folded into an antiparallel configuration. Dimerisation
          of CENP-B is mediated by this domain, in which monomers
          dimerise to form a symmetrical, antiparallel,
          four-helix bundle structure with a large hydrophobic
          patch in which 23 residues of one monomer form van der
          Waals contacts with the other monomer. This CENP-B
          dimer configuration may be suitable for capturing two
          distant CENP-B boxes during centromeric heterochromatin
          formation.
          Length = 101

 Score = 27.4 bits (60), Expect = 2.4
 Identities = 7/23 (30%), Positives = 16/23 (69%)

Query: 11 SEEEEEKKEEEKEKEEEKKKKEG 33
          S+EEE+  +E++E ++E   ++ 
Sbjct: 16 SDEEEDDDDEDEEDDDEDDDEDD 38



 Score = 26.3 bits (57), Expect = 5.6
 Identities = 6/32 (18%), Positives = 21/32 (65%)

Query: 11 SEEEEEKKEEEKEKEEEKKKKEGGREGEELGV 42
          S+ + +++E++ +++EE   ++   + +E+ V
Sbjct: 12 SDSDSDEEEDDDDEDEEDDDEDDDEDDDEVPV 43


>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
          Length = 158

 Score = 27.8 bits (62), Expect = 2.4
 Identities = 13/26 (50%), Positives = 19/26 (73%)

Query: 7   ERKISEEEEEKKEEEKEKEEEKKKKE 32
           E+K   +E E+KE  K K+E+K+KKE
Sbjct: 98  EQKEVSKETEEKEAIKAKKEKKEKKE 123



 Score = 27.8 bits (62), Expect = 2.9
 Identities = 9/23 (39%), Positives = 15/23 (65%)

Query: 10  ISEEEEEKKEEEKEKEEEKKKKE 32
           +  EE EK+++E  KE E+K+  
Sbjct: 90  LFPEESEKEQKEVSKETEEKEAI 112



 Score = 27.4 bits (61), Expect = 3.9
 Identities = 11/25 (44%), Positives = 17/25 (68%)

Query: 7   ERKISEEEEEKKEEEKEKEEEKKKK 31
           E     EE+E  + +KEK+E+K+KK
Sbjct: 101 EVSKETEEKEAIKAKKEKKEKKEKK 125



 Score = 26.3 bits (58), Expect = 8.5
 Identities = 9/31 (29%), Positives = 19/31 (61%)

Query: 2   TPIAIERKISEEEEEKKEEEKEKEEEKKKKE 32
            P   E++  E  +E +E+E  K +++KK++
Sbjct: 91  FPEESEKEQKEVSKETEEKEAIKAKKEKKEK 121


>gnl|CDD|218738 pfam05766, NinG, Bacteriophage Lambda NinG protein.  NinG or Rap
          is involved in recombination. Rap (recombination adept
          with plasmid) increases lambda-by-plasmid recombination
          catalyzed by Escherichia coli's RecBCD pathway.
          Length = 188

 Score = 28.1 bits (63), Expect = 2.5
 Identities = 10/29 (34%), Positives = 21/29 (72%)

Query: 4  IAIERKISEEEEEKKEEEKEKEEEKKKKE 32
          +A++R+ ++E++ K E + E+ E K +KE
Sbjct: 32 LALKREKAQEKKRKAEAQAERRELKARKE 60


>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3.  This family
           of proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). Shr3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of Shr3, AAPs are retained in the ER.
          Length = 196

 Score = 28.1 bits (63), Expect = 2.5
 Identities = 11/25 (44%), Positives = 13/25 (52%)

Query: 7   ERKISEEEEEKKEEEKEKEEEKKKK 31
           ERK +     KK    +K E KKKK
Sbjct: 172 ERKEALAAAAKKSATPQKVETKKKK 196


>gnl|CDD|235302 PRK04456, PRK04456, acetyl-CoA decarbonylase/synthase complex
           subunit beta; Reviewed.
          Length = 463

 Score = 28.5 bits (64), Expect = 2.8
 Identities = 12/32 (37%), Positives = 20/32 (62%)

Query: 9   KISEEEEEKKEEEKEKEEEKKKKEGGREGEEL 40
           + + EEEE++EEE+E+EEE   +       E+
Sbjct: 401 RWAAEEEEEEEEEEEEEEEPVAEVMMMPAPEM 432


>gnl|CDD|169428 PRK08404, PRK08404, V-type ATP synthase subunit H; Validated.
          Length = 103

 Score = 27.1 bits (60), Expect = 2.8
 Identities = 19/39 (48%), Positives = 25/39 (64%)

Query: 8  RKISEEEEEKKEEEKEKEEEKKKKEGGREGEELGVHGNK 46
          +KI EE  +K EEE +K  EKKKKEG  E +++   G K
Sbjct: 38 KKIEEEIIKKAEEEAQKLIEKKKKEGEEEAKKILEEGEK 76


>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812).  This
           family consists of several eukaryotic proteins of
           unknown function.
          Length = 536

 Score = 28.6 bits (64), Expect = 2.8
 Identities = 9/25 (36%), Positives = 17/25 (68%)

Query: 8   RKISEEEEEKKEEEKEKEEEKKKKE 32
           ++++EE    ++E +E EEE K K+
Sbjct: 301 KQVNEELTTVRQENEELEEEYKIKK 325


>gnl|CDD|187737 cd09616, Peptidase_C12_UCH_L1_L3, Cysteine peptidase C12 containing
           ubiquitin carboxyl-terminal hydrolase (UCH) families L1
           and L3.  This ubiquitin C-terminal hydrolase (UCH)
           family includes UCH-L1 and UCH-L3, the two members
           sharing around 53% sequence identity as well as
           conserved catalytic residues. Both enzymes hydrolyze
           carboxyl terminal esters and amides of ubiquitin (Ub).
           UCH-L1, in dimeric form, has additional enzymatic
           activity as a ubiquitin ligase. It is highly abundant in
           the brain, constituting up to 2% of total protein, and
           is expressed exclusively in neurons and testes. Abnormal
           expression of UCH-L1 has been shown to correlate with
           several forms of cancer, including several primary lung
           tumors, lung tumor cell lines, and colorectal cancers.
           Mutations in the UCH-L1 gene have been linked to
           susceptibility to and protection from Parkinson's
           disease (PD); dysfunction of the hydrolase activity can
           lead to an accumulation of alpha-synuclein, which is
           linked to Parkinson's disease (PD), while accumulation
           of neurofibrillary tangles is linked to Alzheimer's
           disease (AD).  UCH-L3 hydrolyzes isopeptide bonds at the
           C-terminal glycine of either Ub or Nedd8, a
           ubiquitin-like protein. It can also interact with
           Lys48-linked Ub dimers to protect them from degradation
           while inhibiting its hydrolase activity at the same
           time.  Unlike UCH-L1, neither dimerization nor ligase
           activity have been observed for UCH-L3. It has been
           shown that levels of Nedd8 and the apoptotic protein p53
           and Bax are elevated in UCH-L3 knockout mice upon
           cryptorchid injury, possibly contributing to profound
           germ cell loss via apoptosis.
          Length = 222

 Score = 28.0 bits (63), Expect = 3.0
 Identities = 18/70 (25%), Positives = 33/70 (47%), Gaps = 10/70 (14%)

Query: 16  EKKEEEKEKEEEKKKKEGGREGEELGVHGNKTTGIKKC-LLHLLHEGGFLSLWRGNGINV 74
           +  EE +++EEE+ K++G    E +     +T G   C  + L+H     ++   N  + 
Sbjct: 55  KAYEEFRKEEEEEIKEKGQEVSESV-YFMKQTIG-NACGTIALIH-----AV--ANNEDR 105

Query: 75  LKIAPESALK 84
           + I   S LK
Sbjct: 106 INILEGSFLK 115


>gnl|CDD|223696 COG0623, FabI, Enoyl-[acyl-carrier-protein].
          Length = 259

 Score = 27.9 bits (63), Expect = 3.1
 Identities = 10/29 (34%), Positives = 18/29 (62%)

Query: 70  NGINVLKIAPESALKFMAYEQAKRLIRGN 98
           N + V K A E++++++A +  K  IR N
Sbjct: 156 NVMGVAKAALEASVRYLAADLGKEGIRVN 184


>gnl|CDD|165026 PHA02644, PHA02644, hypothetical protein; Provisional.
          Length = 112

 Score = 27.3 bits (59), Expect = 3.1
 Identities = 14/33 (42%), Positives = 20/33 (60%)

Query: 7  ERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
          E    E E+EKK E+ E E+EKK ++   E E+
Sbjct: 57 EHIKKENEDEKKPEKPENEDEKKPEKPENEDEK 89



 Score = 27.3 bits (59), Expect = 3.3
 Identities = 13/28 (46%), Positives = 19/28 (67%)

Query: 12  EEEEEKKEEEKEKEEEKKKKEGGREGEE 39
           E E+EKK E+ E E+EKK ++   E E+
Sbjct: 73  ENEDEKKPEKPENEDEKKPEKPENEDEK 100


>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
            biogenesis [Translation, ribosomal structure and
            biogenesis].
          Length = 1077

 Score = 28.2 bits (62), Expect = 3.3
 Identities = 16/34 (47%), Positives = 20/34 (58%), Gaps = 1/34 (2%)

Query: 4    IAIER-KISEEEEEKKEEEKEKEEEKKKKEGGRE 36
            I  ER K  EE+E  +  ++ KEEE  KKE  RE
Sbjct: 1021 IVKERIKDQEEKERMESLQRAKEEEIGKKEKERE 1054



 Score = 28.2 bits (62), Expect = 4.0
 Identities = 9/28 (32%), Positives = 15/28 (53%)

Query: 5    AIERKISEEEEEKKEEEKEKEEEKKKKE 32
               ++  EEE  KKE+E+E+   K   +
Sbjct: 1036 ESLQRAKEEEIGKKEKEREQRIRKTIHD 1063


>gnl|CDD|212050 cd11480, SLC5sbd_u4, Uncharacterized bacterial solute carrier 5
           subfamily; putative solute-binding domain.  SLC5 (also
           called the sodium/glucose cotransporter family or solute
           sodium symporter family) is a family of proteins that
           co-transports Na+ with sugars, amino acids, inorganic
           ions or vitamins. Prokaryotic members of this family
           include Vibrio parahaemolyticus glucose/galactose
           (vSGLT), and Escherichia coli proline (PutP) and
           pantothenate (PutF) cotransporters. One member of the
           SLC5 family, human SGLT3, has been characterized as a
           glucose sensor and not a transporter. This subfamily
           belongs to the solute carrier 5 (SLC5) transporter
           family.
          Length = 484

 Score = 27.9 bits (63), Expect = 3.6
 Identities = 14/63 (22%), Positives = 23/63 (36%), Gaps = 22/63 (34%)

Query: 109 FVAGSLAGGVSLNVPDDFTAKEM----------------LTGMWWRHL-----VAGGVAG 147
            V G +A  ++L  P    A  +                L G++WR       +AG + G
Sbjct: 377 VVIGVVAILLALLAPPQNVA-FLVALAFAIAASAFFPVLLLGIFWRRFTTAGAIAGMLVG 435

Query: 148 AVS 150
            +S
Sbjct: 436 LLS 438


>gnl|CDD|220376 pfam09745, DUF2040, Coiled-coil domain-containing protein 55
           (DUF2040).  This entry is a conserved domain of
           approximately 130 residues of proteins conserved from
           fungi to humans. The proteins do contain a coiled-coil
           domain, but the function is unknown.
          Length = 128

 Score = 27.0 bits (60), Expect = 3.6
 Identities = 10/27 (37%), Positives = 18/27 (66%)

Query: 7   ERKISEEEEEKKEEEKEKEEEKKKKEG 33
           E +  EEEE+++EE +E+ +  K K+ 
Sbjct: 101 ENRKLEEEEKEREELEEENDVTKGKDL 127


>gnl|CDD|219761 pfam08243, SPT2, SPT2 chromatin protein.  This family includes the
           Saccharomyces cerevisiae protein SPT2 which is a
           chromatin protein involved in transcriptional
           regulation.
          Length = 116

 Score = 27.2 bits (60), Expect = 3.7
 Identities = 13/36 (36%), Positives = 21/36 (58%), Gaps = 9/36 (25%)

Query: 5   AIERKISEEE---------EEKKEEEKEKEEEKKKK 31
           A   +I +EE         E+++E  +E+EEEK+KK
Sbjct: 76  ATFMEIQKEERRSARMARLEDERELAREEEEEKRKK 111



 Score = 26.4 bits (58), Expect = 5.8
 Identities = 10/23 (43%), Positives = 17/23 (73%)

Query: 9   KISEEEEEKKEEEKEKEEEKKKK 31
           ++ +E E  +EEE+EK ++KKK 
Sbjct: 93  RLEDERELAREEEEEKRKKKKKN 115


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
           family consists of several hypothetical bacterial
           proteins of around 200 residues in length. The function
           of this family is unknown.
          Length = 214

 Score = 27.8 bits (62), Expect = 3.8
 Identities = 13/33 (39%), Positives = 23/33 (69%)

Query: 7   ERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
           ++  +E+E+E+ EEE E+E+E+   E  +E EE
Sbjct: 80  DKGDAEKEDEESEEENEEEDEESSDENEKETEE 112



 Score = 27.0 bits (60), Expect = 5.8
 Identities = 12/27 (44%), Positives = 16/27 (59%)

Query: 13 EEEEKKEEEKEKEEEKKKKEGGREGEE 39
          E EE KEEEKE    + K++ G   +E
Sbjct: 61 EIEEVKEEEKEAANSEDKEDKGDAEKE 87



 Score = 27.0 bits (60), Expect = 6.8
 Identities = 11/29 (37%), Positives = 18/29 (62%)

Query: 11  SEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
           SE++E+K + EKE EE +++ E   E   
Sbjct: 75  SEDKEDKGDAEKEDEESEEENEEEDEESS 103


>gnl|CDD|222613 pfam14235, DUF4337, Domain of unknown function (DUF4337).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 187 and 201 amino
           acids in length. There is a single completely conserved
           residue Q that may be functionally important.
          Length = 158

 Score = 27.2 bits (61), Expect = 4.4
 Identities = 11/37 (29%), Positives = 18/37 (48%), Gaps = 1/37 (2%)

Query: 5   AIERKISEEEEEKKEEEKEKEE-EKKKKEGGREGEEL 40
            ++ KI+  ++EK     E +E E K KE   E +  
Sbjct: 71  ELQAKIARYKKEKARYRSEAKELEAKAKEAEAESDHA 107


>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1.  All
           proteins in this family for which functions are known
           are cyclin dependent protein kinases that are components
           of TFIIH, a complex that is involved in nucleotide
           excision repair and transcription initiation. Also known
           as MAT1 (menage a trois 1). This family is based on the
           phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
           Stanford University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 309

 Score = 27.5 bits (61), Expect = 4.4
 Identities = 10/28 (35%), Positives = 18/28 (64%)

Query: 5   AIERKISEEEEEKKEEEKEKEEEKKKKE 32
           A+E +  EEE+ +   +KE+EE++  K 
Sbjct: 150 ALEFEKEEEEQRRLLLQKEEEEQQMNKR 177


>gnl|CDD|111875 pfam03032, Brevenin, Brevenin/esculentin/gaegurin/rugosin family.
           This family contains a number of defence peptides
          secreted from the skin of amphibians, including the
          opiate-like dermorphins and deltorphins, and the
          antimicrobial dermoseptins and temporins. The alignment
          for this family includes the signal peptide.
          Length = 46

 Score = 25.4 bits (56), Expect = 4.6
 Identities = 10/23 (43%), Positives = 16/23 (69%)

Query: 7  ERKISEEEEEKKEEEKEKEEEKK 29
          E++  EEE E +EE +E+ E K+
Sbjct: 24 EKREDEEENEDEEEGEEQSEVKR 46



 Score = 24.6 bits (54), Expect = 8.4
 Identities = 11/27 (40%), Positives = 17/27 (62%)

Query: 4  IAIERKISEEEEEKKEEEKEKEEEKKK 30
          +  E K  +EEE + EEE E++ E K+
Sbjct: 20 LCEEEKREDEEENEDEEEGEEQSEVKR 46



 Score = 24.2 bits (53), Expect = 9.6
 Identities = 10/20 (50%), Positives = 17/20 (85%)

Query: 14 EEEKKEEEKEKEEEKKKKEG 33
          EEEK+E+E+E E+E++ +E 
Sbjct: 22 EEEKREDEEENEDEEEGEEQ 41


>gnl|CDD|181605 PRK08993, PRK08993, 2-deoxy-D-gluconate 3-dehydrogenase; Validated.
          Length = 253

 Score = 27.5 bits (61), Expect = 4.7
 Identities = 16/46 (34%), Positives = 22/46 (47%), Gaps = 5/46 (10%)

Query: 67  WRGNGINVLKIAPESALKFMAYEQAKRLIRGNYTRELSIYERFVAG 112
           W  + INV  IAP     +MA    ++L R +  R   I +R  AG
Sbjct: 176 WAKHNINVNAIAP----GYMATNNTQQL-RADEQRSAEILDRIPAG 216


>gnl|CDD|100111 cd05833, Ribosomal_P2, Ribosomal protein P2. This subfamily
           represents the eukaryotic large ribosomal protein P2.
           Eukaryotic P1 and P2 are functionally equivalent to the
           bacterial protein L7/L12, but are not homologous to
           L7/L12. P2 is located in the L12 stalk, with proteins
           P1, P0, L11, and 28S rRNA. P1 and P2 are the only
           proteins in the ribosome to occur as multimers, always
           appearing as sets of heterodimers. Recent data indicate
           that eukaryotes have four copies (two heterodimers),
           while most archaeal species contain six copies of L12p
           (three homodimers). Bacteria may have four or six copies
           of L7/L12 (two or three homodimers) depending on the
           species. Experiments using S. cerevisiae P1 and P2
           indicate that P1 proteins are positioned more internally
           with limited reactivity in the C-terminal domains, while
           P2 proteins seem to be more externally located and are
           more likely to interact with other cellular components.
           In lower eukaryotes, P1 and P2 are further subdivided
           into P1A, P1B, P2A, and P2B, which form P1A/P2B and
           P1B/P2A heterodimers. Some plants have a third
           P-protein, called P3, which is not homologous to P1 and
           P2. In humans, P1 and P2 are strongly autoimmunogenic.
           They play a significant role in the etiology and
           pathogenesis of systemic lupus erythema (SLE). In
           addition, the ribosome-inactivating protein
           trichosanthin (TCS) interacts with human P0, P1, and P2,
           with its primary binding site in the C-terminal region
           of P2. TCS inactivates the ribosome by depurinating a
           specific adenine in the sarcin-ricin loop of 28S rRNA.
          Length = 109

 Score = 26.5 bits (59), Expect = 4.7
 Identities = 7/17 (41%), Positives = 14/17 (82%)

Query: 11  SEEEEEKKEEEKEKEEE 27
           + ++EEKKEE +E+ ++
Sbjct: 85  AAKKEEKKEESEEESDD 101


>gnl|CDD|185582 PTZ00373, PTZ00373, 60S Acidic ribosomal protein P2; Provisional.
          Length = 112

 Score = 26.8 bits (59), Expect = 4.8
 Identities = 11/19 (57%), Positives = 15/19 (78%)

Query: 15  EEKKEEEKEKEEEKKKKEG 33
           E KKEE+KE+EEE++   G
Sbjct: 89  EAKKEEKKEEEEEEEDDLG 107


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
           found at the N terminus of SMC proteins. The SMC
           (structural maintenance of chromosomes) superfamily
           proteins have ATP-binding domains at the N- and
           C-termini, and two extended coiled-coil domains
           separated by a hinge in the middle. The eukaryotic SMC
           proteins form two kind of heterodimers: the SMC1/SMC3
           and the SMC2/SMC4 types. These heterodimers constitute
           an essential part of higher order complexes, which are
           involved in chromatin and DNA dynamics. This family also
           includes the RecF and RecN proteins that are involved in
           DNA metabolism and recombination.
          Length = 1162

 Score = 27.6 bits (61), Expect = 4.9
 Identities = 13/35 (37%), Positives = 19/35 (54%)

Query: 6   IERKISEEEEEKKEEEKEKEEEKKKKEGGREGEEL 40
           +   I + EE K +E K KE+ KK  E  +  E+L
Sbjct: 186 LAELIIDLEELKLQELKLKEQAKKALEYYQLKEKL 220


>gnl|CDD|204918 pfam12436, USP7, Ubiquitin-specific protease 7.  This domain
          family is found in eukaryotes, and is approximately 40
          amino acids in length. The family is found in
          association with pfam00443, pfam00917. USP7 regulates
          the turnover of p53.
          Length = 35

 Score = 24.8 bits (55), Expect = 5.0
 Identities = 8/22 (36%), Positives = 14/22 (63%)

Query: 6  IERKISEEEEEKKEEEKEKEEE 27
          +  ++ EE EE++   KE+EE 
Sbjct: 9  LPFRLEEEREERERRRKEREEA 30


>gnl|CDD|221250 pfam11831, Myb_Cef, pre-mRNA splicing factor component.  This
           family is a region of the Myb-Related Cdc5p/Cef1
           proteins, in fungi, and is part of the pre-mRNA splicing
           factor complex.
          Length = 363

 Score = 27.3 bits (61), Expect = 5.1
 Identities = 6/29 (20%), Positives = 16/29 (55%)

Query: 4   IAIERKISEEEEEKKEEEKEKEEEKKKKE 32
             +E   ++ +  K+  E+ KE+E+ ++ 
Sbjct: 162 EELEEDAADRDARKRAAEEAKEQEELRRR 190


>gnl|CDD|225368 COG2811, NtpF, Archaeal/vacuolar-type H+-ATPase subunit H [Energy
          production and conversion].
          Length = 108

 Score = 26.6 bits (59), Expect = 5.2
 Identities = 12/33 (36%), Positives = 20/33 (60%)

Query: 7  ERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
           R+I EE EE+ E+  ++  E+ ++E   E EE
Sbjct: 41 AREIIEEAEEEAEKLAQEILEEAREEAEEEAEE 73


>gnl|CDD|220379 pfam09750, DRY_EERY, Alternative splicing regulator.  This entry
           represents the conserved N-terminal region of SWAP
           (suppressor-of-white-apricot protein) proteins. This
           region contains two highly conserved motifs, viz: DRY
           and EERY, which appear to be the sites for alternative
           splicing of exons 2 and 3 of the SWAP mRNA. These
           proteins are thus thought to be involved in
           auto-regulation of pre-mRNA splicing. Most family
           members are associated with two Surp domains pfam01805
           and an Arginine- serine-rich binding region towards the
           C-terminus.
          Length = 121

 Score = 26.6 bits (59), Expect = 5.5
 Identities = 14/27 (51%), Positives = 18/27 (66%)

Query: 9   KISEEEEEKKEEEKEKEEEKKKKEGGR 35
           K   EE EK+EEEK +  +KKK +GG 
Sbjct: 84  KNDFEEAEKEEEEKRELAQKKKSKGGY 110


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
           the C subunit of DNA polymerase delta. It carries the
           essential residues for binding to the Pol1 subunit of
           polymerase alpha, from residues 293-332, which are
           characterized by the motif D--G--VT, referred to as the
           DPIM motif. The first 160 residues of the protein form
           the minimal domain for binding to the B subunit, Cdc1,
           of polymerase delta, the final 10 C-terminal residues,
           362-372, being the DNA sliding clamp, PCNA, binding
           motif.
          Length = 427

 Score = 27.5 bits (61), Expect = 5.5
 Identities = 11/26 (42%), Positives = 16/26 (61%)

Query: 7   ERKISEEEEEKKEEEKEKEEEKKKKE 32
             + S+ EEE +E+EKEK +  KK  
Sbjct: 277 SGERSDSEEETEEKEKEKRKRLKKMM 302


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 27.4 bits (61), Expect = 5.6
 Identities = 14/33 (42%), Positives = 23/33 (69%)

Query: 7  ERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
          ++K  +  E KK+ EK+K+++K+KKE   EGE 
Sbjct: 57 DKKEDKNNESKKKSEKKKKKKKEKKEPKSEGET 89


>gnl|CDD|233830 TIGR02350, prok_dnaK, chaperone protein DnaK.  Members of this
           family are the chaperone DnaK, of the DnaK-DnaJ-GrpE
           chaperone system. All members of the seed alignment were
           taken from completely sequenced bacterial or archaeal
           genomes and (except for Mycoplasma sequence) found
           clustered with other genes of this systems. This model
           excludes DnaK homologs that are not DnaK itself, such as
           the heat shock cognate protein HscA (TIGR01991).
           However, it is not designed to distinguish among DnaK
           paralogs in eukaryotes. Note that a number of dnaK genes
           have shadow ORFs in the same reverse (relative to dnaK)
           reading frame, a few of which have been assigned
           glutamate dehydrogenase activity. The significance of
           this observation is unclear; lengths of such shadow ORFs
           are highly variable as if the presumptive protein
           product is not conserved [Protein fate, Protein folding
           and stabilization].
          Length = 595

 Score = 27.7 bits (62), Expect = 5.7
 Identities = 14/26 (53%), Positives = 18/26 (69%), Gaps = 2/26 (7%)

Query: 9   KISEEEEEK--KEEEKEKEEEKKKKE 32
            +SEEE E+  KE E   EE+KK+KE
Sbjct: 497 GLSEEEIERMVKEAEANAEEDKKRKE 522


>gnl|CDD|218303 pfam04874, Mak16, Mak16 protein C-terminal region.  The precise
          function of this eukaryotic protein family is unknown.
          The yeast orthologues have been implicated in cell
          cycle progression and biogenesis of 60S ribosomal
          subunits. The Schistosoma mansoni Mak16 has been shown
          to target protein transport to the nucleolus.
          Length = 97

 Score = 26.0 bits (57), Expect = 5.7
 Identities = 14/40 (35%), Positives = 24/40 (60%), Gaps = 1/40 (2%)

Query: 5  AIERKISEEE-EEKKEEEKEKEEEKKKKEGGREGEELGVH 43
          A+E + SEE  EE++EEE+E++E + +     E  E  + 
Sbjct: 54 ALEAEESEENDEEEEEEEEEEDEGEIEYVSDDEELEEEIE 93


>gnl|CDD|217206 pfam02731, SKIP_SNW, SKIP/SNW domain.  This domain is found in
           chromatin proteins.
          Length = 158

 Score = 26.9 bits (60), Expect = 5.8
 Identities = 11/27 (40%), Positives = 19/27 (70%), Gaps = 1/27 (3%)

Query: 7   ERKISEEEEEKKEEEKE-KEEEKKKKE 32
           ERK  EE  ++ E +++  E+EK++KE
Sbjct: 118 ERKAREEVRQRAELQRQLAEKEKQEKE 144


>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein.  The proteins in this family
          are designated YL1. These proteins have been shown to
          be DNA-binding and may be a transcription factor.
          Length = 238

 Score = 27.0 bits (60), Expect = 6.0
 Identities = 9/21 (42%), Positives = 16/21 (76%)

Query: 11 SEEEEEKKEEEKEKEEEKKKK 31
           +EEE +KE ++E+  +KKK+
Sbjct: 69 DDEEEGEKELQREERLKKKKR 89


>gnl|CDD|217553 pfam03430, TATR, Trans-activating transcriptional regulator.  This
           family of trans-activating transcriptional regulator
           (TATR), also known as intermediate early protein 1, are
           common to the Nucleopolyhedroviruses.
          Length = 571

 Score = 27.6 bits (61), Expect = 6.0
 Identities = 8/46 (17%), Positives = 19/46 (41%)

Query: 8   RKISEEEEEKKEEEKEKEEEKKKKEGGREGEELGVHGNKTTGIKKC 53
           RK SE + +    +  K ++   K   R+  +     + T+  ++ 
Sbjct: 127 RKGSELDSDSDSSDSSKGKKLVNKPKKRQKYKKATIQSPTSLTEEE 172


>gnl|CDD|192481 pfam10197, Cir_N, N-terminal domain of CBF1 interacting
          co-repressor CIR.  This is a 45 residue conserved
          region at the N-terminal end of a family of proteins
          referred to as CIRs (CBF1-interacting co-repressors).
          CBF1 (centromere-binding factor 1) acts as a
          transcription factor that causes repression by binding
          specifically to GTGGGAA motifs in responsive promoters,
          and it requires CIR as a co-repressor. CIR binds to
          histone deacetylase and to SAP30 and serves as a linker
          between CBF1 and the histone deacetylase complex.
          Length = 37

 Score = 24.5 bits (54), Expect = 6.2
 Identities = 9/19 (47%), Positives = 14/19 (73%)

Query: 13 EEEEKKEEEKEKEEEKKKK 31
          E E+K  EE++K EE +K+
Sbjct: 15 EAEQKALEEQKKIEELRKE 33


>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase;
          Provisional.
          Length = 330

 Score = 27.1 bits (61), Expect = 6.4
 Identities = 8/25 (32%), Positives = 15/25 (60%)

Query: 7  ERKISEEEEEKKEEEKEKEEEKKKK 31
          E K   + ++KKE+++ K  + K K
Sbjct: 65 ELKAWHKAQKKKEKQEAKAAKAKSK 89


>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein.  This
           family includes the radial spoke head proteins RSP4 and
           RSP6 from Chlamydomonas reinhardtii, and several
           eukaryotic homologues, including mammalian RSHL1, the
           protein product of a familial ciliary dyskinesia
           candidate gene.
          Length = 481

 Score = 27.3 bits (61), Expect = 6.4
 Identities = 13/29 (44%), Positives = 22/29 (75%)

Query: 12  EEEEEKKEEEKEKEEEKKKKEGGREGEEL 40
           +EEEE+++EE+E+EEE+ ++    EG  L
Sbjct: 348 DEEEEQEDEEEEEEEEEPEEPEPEEGPPL 376


>gnl|CDD|191022 pfam04538, BEX, Brain expressed X-linked like family.  This is a
          family of transcription elongation factors which
          includes those referred to as Bex proteins as well as
          those named TCEAL7. Bex1 was shown to be a novel link
          between neurotrophin signalling, the cell cycle, and
          neuronal differentiation, suggesting it might function
          by coordinating internal cellular states with the
          ability of cells to respond to external signals. TCEAL7
          has been shown negatively to regulate the NF-kappaB
          pathway, hence being important in ovarian cancer as it
          one of the genes frequently downregulated in this
          cancer. A closely related protein, TFIIS/TCEA, found in
          pfam07500 is involved in transcription elongation and
          transcript fidelity. TFIIS/TCEA promotes 3'
          endoribonuclease activity of RNA polymerase II (pol II)
          and allows pol II to bypass transcript pause or
          'arrest' during elongation process. It is thus possible
          that BEX is also acting in this way.
          Length = 97

 Score = 26.2 bits (58), Expect = 6.5
 Identities = 11/31 (35%), Positives = 17/31 (54%)

Query: 11 SEEEEEKKEEEKEKEEEKKKKEGGREGEELG 41
            +E E K E + KEEE+K+     EG++  
Sbjct: 4  PCKENEGKPESEPKEEEEKRPLEEGEGKKPE 34


>gnl|CDD|218734 pfam05758, Ycf1, Ycf1.  The chloroplast genomes of most higher
           plants contain two giant open reading frames designated
           ycf1 and ycf2. Although the function of Ycf1 is unknown,
           it is known to be an essential gene.
          Length = 832

 Score = 27.3 bits (61), Expect = 6.5
 Identities = 13/33 (39%), Positives = 17/33 (51%)

Query: 8   RKISEEEEEKKEEEKEKEEEKKKKEGGREGEEL 40
            K  E +     EEK K+EEKK+K    E E +
Sbjct: 648 EKKKEFKILDYTEEKTKKEEKKEKNKREEKERI 680



 Score = 26.9 bits (60), Expect = 8.3
 Identities = 10/29 (34%), Positives = 15/29 (51%)

Query: 11  SEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
           +EE EE+ + E E   E K  +  +EG  
Sbjct: 232 TEEREEETDVEIETTSETKGTKQEQEGST 260


>gnl|CDD|218482 pfam05178, Kri1, KRI1-like family.  The yeast member of this
          family (Kri1p) is found to be required for 40S ribosome
          biogenesis in the nucleolus.
          Length = 99

 Score = 26.1 bits (58), Expect = 6.7
 Identities = 11/26 (42%), Positives = 19/26 (73%), Gaps = 2/26 (7%)

Query: 7  ERKISEEEEEKKEEEKEKEEEKKKKE 32
          ERK  EEE+ ++EEE ++ +  K++E
Sbjct: 2  ERK--EEEKAQREEELKRLKNLKREE 25



 Score = 26.1 bits (58), Expect = 6.9
 Identities = 8/18 (44%), Positives = 14/18 (77%)

Query: 14 EEEKKEEEKEKEEEKKKK 31
          +E K+EE+ ++EEE K+ 
Sbjct: 1  KERKEEEKAQREEELKRL 18


>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon. 
          Length = 431

 Score = 27.3 bits (60), Expect = 6.7
 Identities = 11/35 (31%), Positives = 19/35 (54%)

Query: 7   ERKISEEEEEKKEEEKEKEEEKKKKEGGREGEELG 41
           ++    + EE ++EEKE E E+++K      EE  
Sbjct: 137 QKNDWRDAEECQKEEKEPEPEEEEKPKRGSLEENN 171


>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
           [Intracellular trafficking and secretion].
          Length = 776

 Score = 27.3 bits (60), Expect = 6.7
 Identities = 20/85 (23%), Positives = 38/85 (44%), Gaps = 6/85 (7%)

Query: 7   ERKISEEEEEKKEEEKEKEEEKKKKEGGREGEELGVHGNKTTGIKKCLLHLL----HEGG 62
           + +I     E+ E E+E E+ + + +  +   +     N      +   HL+    +E  
Sbjct: 372 KMEIENRNPEESEHEEEVEDYEDENDHSKRICDDDELENHFRAADEKNSHLVVGFRNERS 431

Query: 63  FLSLWRGNGINVLKIAPESALKFMA 87
           +++  RGN I V K   E +L+F A
Sbjct: 432 YVT--RGNSIGVFKNTDEGSLEFKA 454


>gnl|CDD|235963 PRK07203, PRK07203, putative chlorohydrolase/aminohydrolase;
           Validated.
          Length = 442

 Score = 27.2 bits (61), Expect = 7.0
 Identities = 11/28 (39%), Positives = 15/28 (53%), Gaps = 4/28 (14%)

Query: 113 SLAGGVSLNV--PDDFTAKEMLTGMWWR 138
            LA G+  N+  P DF    +L  +WWR
Sbjct: 71  GLARGMMANIPPPPDFI--SILKNLWWR 96


>gnl|CDD|220172 pfam09309, FCP1_C, FCP1, C-terminal.  The C-terminal domain of
           FCP-1 is required for interaction with the carboxy
           terminal domain of RAP74. Interaction relies extensively
           on van der Waals contacts between hydrophobic residues
           situated within alpha-helices in both domains.
          Length = 263

 Score = 26.9 bits (59), Expect = 7.2
 Identities = 12/31 (38%), Positives = 18/31 (58%)

Query: 3   PIAIERKISEEEEEKKEEEKEKEEEKKKKEG 33
           P   +RK++EE+EE  E E   E   + +EG
Sbjct: 213 PRGHKRKLNEEDEEDAESESSFESSNEDEEG 243


>gnl|CDD|240341 PTZ00272, PTZ00272, heat shock protein 83 kDa (Hsp83); Provisional.
          Length = 701

 Score = 27.3 bits (60), Expect = 7.2
 Identities = 12/24 (50%), Positives = 15/24 (62%)

Query: 8   RKISEEEEEKKEEEKEKEEEKKKK 31
           +   + EE K EE KE +E KKKK
Sbjct: 225 KADEDGEEPKVEEVKEGDEGKKKK 248


>gnl|CDD|220818 pfam10595, UPF0564, Uncharacterized protein family UPF0564.  This
           family of proteins has no known function. However, one
           of the members is annotated as an EF-hand family
           protein.
          Length = 349

 Score = 27.1 bits (60), Expect = 7.3
 Identities = 9/28 (32%), Positives = 19/28 (67%)

Query: 8   RKISEEEEEKKEEEKEKEEEKKKKEGGR 35
            +   +E+E KE++++KE E++KK+   
Sbjct: 278 LRTERKEKEAKEQQEKKELEQRKKKKKE 305


>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355).  This
          family of proteins is found in bacteria and viruses.
          Proteins in this family are typically between 180 and
          214 amino acids in length.
          Length = 125

 Score = 26.1 bits (58), Expect = 7.3
 Identities = 9/26 (34%), Positives = 15/26 (57%)

Query: 7  ERKISEEEEEKKEEEKEKEEEKKKKE 32
          E+K  E++ E ++  K   EEK + E
Sbjct: 25 EKKQEEKKSEAEKLAKMSAEEKAEYE 50


>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein.  The YqfQ-like protein family
           includes the B. subtilis YqfQ protein, also known as
           VrrA, which is functionally uncharacterized. This family
           of proteins is found in bacteria. Proteins in this
           family are typically between 146 and 237 amino acids in
           length. There are two conserved sequence motifs: QYGP
           and PKLY.
          Length = 155

 Score = 26.6 bits (59), Expect = 7.4
 Identities = 10/22 (45%), Positives = 14/22 (63%)

Query: 11  SEEEEEKKEEEKEKEEEKKKKE 32
            E E+E   E K + +EKKK+E
Sbjct: 107 DETEQEDPPETKTESKEKKKRE 128



 Score = 26.3 bits (58), Expect = 8.5
 Identities = 8/22 (36%), Positives = 15/22 (68%)

Query: 11  SEEEEEKKEEEKEKEEEKKKKE 32
           S +E E+++  + K E K+KK+
Sbjct: 105 STDETEQEDPPETKTESKEKKK 126


>gnl|CDD|162945 TIGR02604, Piru_Ver_Nterm, putative membrane-bound dehydrogenase
           domain.  All proteins that score above the trusted
           cutoff score of 45 to this model are large proteins of
           either Pirellula sp. 1 or Verrucomicrobium spinosum.
           These proteins all contain, in addition to this domain,
           several hundred residues of highly variable sequence,
           and then a well-conserved C-terminal domain (TIGR02603)
           that features a putative cytochrome c-type heme binding
           motif CXXCH. The membrane-bound L-sorbosone
           dehydrogenase from Acetobacter liquefaciens
           (Gluconacetobacter liquefaciens) (SP|Q44091) is
           homologous to this domain but lacks additional sequence
           regions shared by members of this family and belongs to
           a different clade of the larger family of homologs. It
           and its closely related homologs are excluded from the
           this model by scoring between the trusted (45) and noise
           (18) cutoffs.
          Length = 367

 Score = 27.0 bits (60), Expect = 7.5
 Identities = 12/59 (20%), Positives = 17/59 (28%), Gaps = 1/59 (1%)

Query: 105 IYERFVAGSLAGGVSLNVPDDFTAKEMLTGMWWRHLVAGGVAGAVSRTCTAPLDRLKVY 163
                       G   +  D     E+ TG W +        G V+   TAP   +  Y
Sbjct: 214 AEGGRNGYQSFNGRRYDHADRGADHEVPTGEWRQDDRGVETVGDVAGGGTAPCG-IAFY 271


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 26.9 bits (60), Expect = 7.5
 Identities = 7/29 (24%), Positives = 17/29 (58%)

Query: 11  SEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
            ++E+E+K+E KE E+     +   + ++
Sbjct: 165 VDDEDEEKKEAKELEKLSDDDDFVWDEDD 193


>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain.  Axonemal
           dynein light chain proteins play a dynamic role in
           flagellar and cilia motility. Eukaryotic cilia and
           flagella are complex organelles consisting of a core
           structure, the axoneme, which is composed of nine
           microtubule doublets forming a cylinder that surrounds a
           pair of central singlet microtubules. This
           ultra-structural arrangement seems to be one of the most
           stable micro-tubular assemblies known and is responsible
           for the flagellar and ciliary movement of a large number
           of organisms ranging from protozoan to mammals. This
           light chain interacts directly with the N-terminal half
           of the heavy chains.
          Length = 189

 Score = 26.8 bits (60), Expect = 7.6
 Identities = 12/28 (42%), Positives = 18/28 (64%)

Query: 6   IERKISEEEEEKKEEEKEKEEEKKKKEG 33
           +E++I + EEEK+E EK   E + K E 
Sbjct: 125 LEQEIKKLEEEKEELEKRVAELEAKLEA 152


>gnl|CDD|227701 COG5414, COG5414, TATA-binding protein-associated factor
           [Transcription].
          Length = 392

 Score = 27.0 bits (59), Expect = 7.7
 Identities = 9/25 (36%), Positives = 15/25 (60%)

Query: 12  EEEEEKKEEEKEKEEEKKKKEGGRE 36
           +E  E+KEE+ E EE ++  E   +
Sbjct: 317 DEIGEEKEEDDENEENERHTELLAD 341


>gnl|CDD|224236 COG1317, FliH, Flagellar biosynthesis/type III secretory pathway
          protein [Cell motility and secretion / Intracellular
          trafficking and secretion].
          Length = 234

 Score = 26.6 bits (59), Expect = 7.8
 Identities = 11/38 (28%), Positives = 19/38 (50%)

Query: 5  AIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEELGV 42
          +  +++ E  EE   E  E+  +   +EG  EG+E G 
Sbjct: 55 SAAQELQEGIEEGAREGYEEGFQLGYEEGFEEGQEEGR 92


>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
           This is a family of fungal proteins whose function is
           unknown.
          Length = 130

 Score = 26.1 bits (58), Expect = 8.2
 Identities = 9/35 (25%), Positives = 17/35 (48%), Gaps = 2/35 (5%)

Query: 4   IAIERKISEEEEEKKEEEKEKEEEKK--KKEGGRE 36
               R+   +E  +KE+   K  EKK  +++  +E
Sbjct: 95  ALRLRRERTKERAEKEKRTRKNREKKFKRRQKEKE 129


>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3.  This family of
           proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). SH3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of SH3, AAPs are retained in the ER.
          Length = 196

 Score = 26.5 bits (59), Expect = 8.3
 Identities = 10/25 (40%), Positives = 16/25 (64%)

Query: 7   ERKISEEEEEKKEEEKEKEEEKKKK 31
           E + +   +E++  E EK+E KKKK
Sbjct: 172 EAEAAAAAKEEESAEGEKKESKKKK 196


>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
           complex aNOP56 subunit; Provisional.
          Length = 414

 Score = 26.9 bits (60), Expect = 8.5
 Identities = 10/34 (29%), Positives = 21/34 (61%)

Query: 6   IERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
           I+ K  +  ++K+EE+K ++ +KKKK   +  + 
Sbjct: 375 IKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKR 408


>gnl|CDD|218538 pfam05285, SDA1, SDA1.  This family consists of several SDA1
           protein homologues. SDA1 is a Saccharomyces cerevisiae
           protein which is involved in the control of the actin
           cytoskeleton. The protein is essential for cell
           viability and is localised in the nucleus.
          Length = 317

 Score = 26.5 bits (59), Expect = 8.9
 Identities = 12/25 (48%), Positives = 19/25 (76%)

Query: 6   IERKISEEEEEKKEEEKEKEEEKKK 30
            + ++SEE+EE+  EE+E E EK+K
Sbjct: 149 SDEELSEEDEEEAAEEEEAEAEKEK 173



 Score = 26.5 bits (59), Expect = 8.9
 Identities = 11/29 (37%), Positives = 18/29 (62%)

Query: 11  SEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
           SE+EEEK E  K+ +E+  ++    + EE
Sbjct: 132 SEDEEEKDEAAKKAKEDSDEELSEEDEEE 160


>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM).
          This family consists of several Plasmodium falciparum
          SPAM (secreted polymorphic antigen associated with
          merozoites) proteins. Variation among SPAM alleles is
          the result of deletions and amino acid substitutions in
          non-repetitive sequences within and flanking the
          alanine heptad-repeat domain. Heptad repeats in which
          the a and d position contain hydrophobic residues
          generate amphipathic alpha-helices which give rise to
          helical bundles or coiled-coil structures in proteins.
          SPAM is an example of a P. falciparum antigen in which
          a repetitive sequence has features characteristic of a
          well-defined structural element.
          Length = 164

 Score = 26.4 bits (58), Expect = 8.9
 Identities = 8/19 (42%), Positives = 14/19 (73%)

Query: 12 EEEEEKKEEEKEKEEEKKK 30
          EEEEE +E+  + ++ +KK
Sbjct: 79 EEEEEDEEDNVDLKDIEKK 97



 Score = 26.4 bits (58), Expect = 9.8
 Identities = 9/26 (34%), Positives = 15/26 (57%)

Query: 7  ERKISEEEEEKKEEEKEKEEEKKKKE 32
          + +  EE  E +EEE+E EE+    +
Sbjct: 67 DIEDEEEIVEDEEEEEEDEEDNVDLK 92


>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
           Provisional.
          Length = 2849

 Score = 26.9 bits (59), Expect = 8.9
 Identities = 10/28 (35%), Positives = 22/28 (78%)

Query: 12  EEEEEKKEEEKEKEEEKKKKEGGREGEE 39
           E++++++++E+E+EEE++ K    E EE
Sbjct: 162 EDDDDEEDDEEEEEEEEEIKGFDDEDEE 189


>gnl|CDD|240413 PTZ00423, PTZ00423, glideosome-associated protein 45;
          Provisional.
          Length = 193

 Score = 26.6 bits (58), Expect = 8.9
 Identities = 12/30 (40%), Positives = 20/30 (66%)

Query: 3  PIAIERKISEEEEEKKEEEKEKEEEKKKKE 32
          P  I  ++ ++ EE  E+E+E EE+K K+E
Sbjct: 39 PEDIVEELEDQPEEPPEQEEENEEQKPKEE 68


>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
           eukaryotic snRNP [Transcription].
          Length = 564

 Score = 27.0 bits (60), Expect = 9.0
 Identities = 10/28 (35%), Positives = 16/28 (57%), Gaps = 1/28 (3%)

Query: 6   IERKIS-EEEEEKKEEEKEKEEEKKKKE 32
            E K + EE  E+  EE   + +KKK++
Sbjct: 406 AEGKKAIEEIREELIEEGLLKSKKKKRK 433



 Score = 26.6 bits (59), Expect = 9.7
 Identities = 10/25 (40%), Positives = 18/25 (72%)

Query: 6   IERKISEEEEEKKEEEKEKEEEKKK 30
           +E K+ ++E+E +E EK  EE ++K
Sbjct: 298 LENKLEKQEDELEELEKAAEELRQK 322


>gnl|CDD|222222 pfam13561, adh_short_C2, Enoyl-(Acyl carrier protein) reductase. 
          Length = 239

 Score = 26.4 bits (59), Expect = 9.1
 Identities = 12/29 (41%), Positives = 18/29 (62%)

Query: 70  NGINVLKIAPESALKFMAYEQAKRLIRGN 98
            G+ V K A ES  +++AYE  ++ IR N
Sbjct: 144 GGMGVAKAALESLARYLAYELGRKGIRVN 172


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 26.7 bits (59), Expect = 9.4
 Identities = 9/35 (25%), Positives = 21/35 (60%)

Query: 6   IERKISEEEEEKKEEEKEKEEEKKKKEGGREGEEL 40
           +++K + E+E  K+ EKE+   +++K+   E  + 
Sbjct: 92  LQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQ 126


>gnl|CDD|241121 cd12677, RRM4_Nop4p, RNA recognition motif 4 in yeast nucleolar
          protein 4 (Nop4p) and similar proteins.  This subgroup
          corresponds to the RRM4 of Nop4p (also known as
          Nop77p), encoded by YPL043W from Saccharomyces
          cerevisiae. It is an essential nucleolar protein
          involved in processing and maturation of 27S pre-rRNA
          and biogenesis of 60S ribosomal subunits. Nop4p has
          four RNA recognition motifs (RRMs), also termed RBDs
          (RNA binding domains) or RNPs (ribonucleoprotein
          domains). .
          Length = 156

 Score = 26.1 bits (57), Expect = 9.5
 Identities = 9/25 (36%), Positives = 16/25 (64%)

Query: 10 ISEEEEEKKEEEKEKEEEKKKKEGG 34
          +S+EEE + +  + KE + KK + G
Sbjct: 44 LSKEEENRDKGHRYKEAQLKKGKSG 68


>gnl|CDD|212055 cd11486, SLC5sbd_SGLT1, Na(+)/glucose cotransporter SGLT1;solute
           binding domain.  Human SGLT1 (hSGLT1) is a
           high-affinity/low-capacity glucose transporter, which
           can also transport galactose. In the transport
           mechanism, two Na+ ions first bind to the extracellular
           side of the transporter and induce a conformational
           change in the glucose binding site. This results in an
           increased affinity for glucose. A second conformational
           change in the transporter follows, bringing the Na+ and
           glucose binding sites to the inner surface of the
           membrane. Glucose is then released, followed by the Na+
           ions. In the process, hSGLT1 is also able to transport
           water and urea and may be a major pathway for transport
           of these across the intestinal brush-border membrane.
           hSGLT1 is encoded by the SLC5A1 gene and expressed
           mostly in the intestine, but also in the trachea,
           kidney, heart, brain, testis, and prostate. The
           WHO/UNICEF oral rehydration solution (ORS) for the
           treatment of secretory diarrhea contains salt and
           glucose. The glucose, along with sodium ions, is
           transported by hSGLT1 and water is either co-transported
           along with these or follows by osmosis. Mutations in
           SGLT1 are associated with intestinal glucose galactose
           malabsorption (GGM). Up-regulation of intestinal SGLT1
           may protect against enteric infections. SGLT1 is
           expressed in colorectal, head and neck, and prostate
           tumors. Epidermal growth factor receptor (EGFR)
           functions in cell survival by stabilizing SGLT1, and
           thereby maintaining intracellular glucose levels. SGLT1
           is predicted to have 14 membrane-spanning regions. This
           subgroup belongs to the solute carrier 5
           (SLC5)transporter family.
          Length = 635

 Score = 26.8 bits (59), Expect = 9.6
 Identities = 9/26 (34%), Positives = 15/26 (57%)

Query: 6   IERKISEEEEEKKEEEKEKEEEKKKK 31
           I+    +  E++ E E E +EE+KK 
Sbjct: 545 IDLDADDWTEDEDENEMETDEERKKP 570


>gnl|CDD|227238 COG4901, COG4901, Ribosomal protein S25 [Translation, ribosomal
          structure and biogenesis].
          Length = 107

 Score = 25.6 bits (56), Expect = 9.7
 Identities = 9/31 (29%), Positives = 16/31 (51%)

Query: 9  KISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
          K    +E+K E+ K    + KKK   ++ +E
Sbjct: 4  KSQLSKEKKAEKAKAGTAKDKKKWSKKKKKE 34


>gnl|CDD|113290 pfam04514, BTV_NS2, Bluetongue virus non-structural protein NS2.
           This family includes NS2 proteins from other members of
           the Orbivirus genus. NS2 is a non-specific
           single-stranded RNA-binding protein that forms large
           homomultimers and accumulates in viral inclusion bodies
           of infected cells. Three RNA binding regions have been
           identified in Bluetongue virus serotype 17 at residues
           2-11, 153-166 and 274-286. NS2 multimers also possess
           nucleotidyl phosphatase activity. The precise function
           of NS2 is not known, but it may be involved in the
           transport and condensation of viral mRNAs.
          Length = 363

 Score = 26.8 bits (59), Expect = 10.0
 Identities = 8/43 (18%), Positives = 19/43 (44%)

Query: 1   MTPIAIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEELGVH 43
           M    +E +  +  +E++ EE  + +   + E G + E+    
Sbjct: 230 MINWNLEERRRDLRKEQELEENVERDSDDEDEHGEDSEDGETK 272


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.315    0.135    0.399 

Gapped
Lambda     K      H
   0.267   0.0812    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 10,545,201
Number of extensions: 1004688
Number of successful extensions: 6336
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5054
Number of HSP's successfully gapped: 726
Length of query: 201
Length of database: 10,937,602
Length adjustment: 92
Effective length of query: 109
Effective length of database: 6,857,034
Effective search space: 747416706
Effective search space used: 747416706
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 56 (25.2 bits)