RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy9274
         (318 letters)



>gnl|CDD|215646 pfam00001, 7tm_1, 7 transmembrane receptor (rhodopsin family).
           This family contains, amongst other G-protein-coupled
           receptors (GCPRs), members of the opsin family, which
           have been considered to be typical members of the
           rhodopsin superfamily. They share several motifs, mainly
           the seven transmembrane helices, GCPRs of the rhodopsin
           superfamily. All opsins bind a chromophore, such as
           11-cis-retinal. The function of most opsins other than
           the photoisomerases is split into two steps: light
           absorption and G-protein activation. Photoisomerases, on
           the other hand, are not coupled to G-proteins - they are
           thought to generate and supply the chromophore that is
           used by visual opsins.
          Length = 251

 Score =  131 bits (332), Expect = 9e-37
 Identities = 48/134 (35%), Positives = 76/134 (56%), Gaps = 4/134 (2%)

Query: 85  VVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTATDYVTTCWPFGDVWCK--MYLIVV 142
           +V+   +++R+ TN+ ++NLA+ADLLF++   P+     V   WPFGD  CK   +L VV
Sbjct: 1   LVILRTKKLRTPTNIFLLNLAVADLLFLLTLPPWALYYLVGGDWPFGDALCKLVGFLFVV 60

Query: 143 TAYASVYTLVLMSLDRFLAVVHPIASMSVRTERNALKAILITWIVIVLTAVPVFNMHGEV 202
             YAS+  L  +S+DR+LA+VHP+    +RT R A   IL+ W++ +L ++P        
Sbjct: 61  NGYASILLLTAISIDRYLAIVHPLRYRRIRTPRRAKVLILVVWVLALLLSLPPL--LFSW 118

Query: 203 SYTYASELHTACVF 216
             T      T C+ 
Sbjct: 119 LRTVEEGNVTTCLI 132


>gnl|CDD|222976 PHA03087, PHA03087, G protein-coupled chemokine receptor-like
           protein; Provisional.
          Length = 335

 Score = 94.5 bits (235), Expect = 3e-22
 Identities = 50/210 (23%), Positives = 93/210 (44%), Gaps = 24/210 (11%)

Query: 13  IVDIQHHYATCNISYNEYNRSNIPAICRNFSHEYINDNDVDAEAMERMVSIIVPIFFGII 72
            + I          Y +       A C         D            S I+ + +  I
Sbjct: 4   TLTINTTIENTTDYYYDTYYDEDYADC---DLNIGYDT----------NSTILIVVYSTI 50

Query: 73  IILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTATDYVTTCWPFGD 132
              G  GN ++V+ V    ++++  ++ ++NLA++DLLF++  +PF    Y+   W FG+
Sbjct: 51  FFFGLVGN-IIVIYVLTKTKIKTPMDIYLLNLAVSDLLFVMT-LPFQIYYYILFQWSFGE 108

Query: 133 VWCKM--YLIVVTAYASVYTLVLMSLDRFLAVVHPIASMSVRTERNALKAILITWIVIVL 190
             CK+   L  +  Y S+  + +MS+DR++A+VHP+ S  + T +      L+ WI+ ++
Sbjct: 109 FACKIVSGLYYIGFYNSMNFITVMSVDRYIAIVHPVKSNKINTVKYGYIVSLVIWIISII 168

Query: 191 TAVPVFNMHGEVSYTYASELHT--ACVFLD 218
              P+        YT   +  T   C+F +
Sbjct: 169 ETTPIL-----FVYTTKKDHETLICCMFYN 193


>gnl|CDD|165021 PHA02638, PHA02638, CC chemokine receptor-like protein;
           Provisional.
          Length = 417

 Score = 93.5 bits (232), Expect = 2e-21
 Identities = 51/173 (29%), Positives = 101/173 (58%), Gaps = 12/173 (6%)

Query: 41  NFSHEYINDNDVDAEAMERMVSIIVPIFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLL 100
           N ++E IN   +  + M   +S  + IF+ II ILG FGNA +++++ F +++++ T++ 
Sbjct: 80  NITYELIN---IKNKCMYPSISEYIKIFYIIIFILGLFGNAAIIMIL-FCKKIKTITDIY 135

Query: 101 IINLAIADLLFIVCCVPFTATDYVTTCWPFGDVWCKMYLIVVTAY----ASVYTLVLMSL 156
           I NLAI+DL+F++   PF   +     W FGD  CK  +I  + Y    ++++ + LMS+
Sbjct: 136 IFNLAISDLIFVID-FPFIIYNEFDQ-WIFGDFMCK--VISASYYIGFFSNMFLITLMSI 191

Query: 157 DRFLAVVHPIASMSVRTERNALKAILITWIVIVLTAVPVFNMHGEVSYTYASE 209
           DR+ A+++PI+    RT    +   +I+WI+ ++   P + +    +  ++++
Sbjct: 192 DRYFAILYPISFQKYRTFNIGIILCIISWILSLIITSPAYFIFEASNIIFSAQ 244


>gnl|CDD|220695 pfam10328, 7TM_GPCR_Srx, Serpentine type 7TM GPCR chemoreceptor
           Srx.  Chemoreception is mediated in Caenorhabditis
           elegans by members of the seven-transmembrane
           G-protein-coupled receptor class (7TM GPCRs) of proteins
           which are of the serpentine type. Srx is part of the Srg
           superfamily of chemoreceptors. Chemoperception is one of
           the central senses of soil nematodes like C. elegans
           which are otherwise 'blind' and 'deaf'.
          Length = 275

 Score = 59.1 bits (144), Expect = 4e-10
 Identities = 32/150 (21%), Positives = 61/150 (40%), Gaps = 14/150 (9%)

Query: 71  IIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIAD----LLFIVCCVPFTATDYVTT 126
           +I + G   N L+  +      ++++  +L  N AI++     +F+   VP T  D    
Sbjct: 2   LISLFGIILNWLIFYIFLKLPSLKNSFGILCANKAISNAIICTIFLFYVVPMTLLDL--- 58

Query: 127 CWPFGDVWCKMYLIVVTAY----ASVYTLVLMSLDRFLAVVHPIASMSVRTERNALKAIL 182
              F       ++  +        S  T +L+SL+RF AV  P     + + +N    I 
Sbjct: 59  --SFLPELLNSHIGGLILLGLYEISPLTHLLISLNRFCAVFFPFKYKKIFSIKNTKIIIT 116

Query: 183 ITWIVIVLTAVPVFNMHGEVSYTYASELHT 212
             WI+ ++ +   +   G   + Y+ E  T
Sbjct: 117 FIWIIAIIISTLFYFPLG-CHFYYSPESLT 145


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 57.3 bits (138), Expect = 4e-09
 Identities = 33/63 (52%), Positives = 39/63 (61%), Gaps = 8/63 (12%)

Query: 244 EREVHEEEEEEEEEEEETKAK---ETKLKEKKKLGGGGG-----EEEEEEEEEEEEEEEE 295
           E +  + E ++E  E+E  A+   E K  EK   GGGG      EEEEEEEEEEEEEEEE
Sbjct: 820 ETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEE 879

Query: 296 EEE 298
           EEE
Sbjct: 880 EEE 882



 Score = 54.6 bits (131), Expect = 3e-08
 Identities = 30/57 (52%), Positives = 32/57 (56%), Gaps = 19/57 (33%)

Query: 242 GGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           GG  +  + EEEEEEEEEE                   EEEEEEEEEEEEEEEE EE
Sbjct: 855 GGGSDGGDSEEEEEEEEEE-------------------EEEEEEEEEEEEEEEENEE 892



 Score = 47.3 bits (112), Expect = 7e-06
 Identities = 28/62 (45%), Positives = 30/62 (48%), Gaps = 22/62 (35%)

Query: 241 KGGEREVHEEEE----EEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
           K  E+ V         + EEEEEE                   EEEEEEEEEEEEEEEEE
Sbjct: 846 KQDEKGVDGGGGSDGGDSEEEEEEE------------------EEEEEEEEEEEEEEEEE 887

Query: 297 EE 298
           EE
Sbjct: 888 EE 889



 Score = 43.8 bits (103), Expect = 9e-05
 Identities = 21/56 (37%), Positives = 31/56 (55%)

Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
             +    E ++E  E+E     + + K+ +K   GGG  +  + EEEEEEEEEEEE
Sbjct: 820 ETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEE 875



 Score = 41.1 bits (96), Expect = 7e-04
 Identities = 27/92 (29%), Positives = 32/92 (34%), Gaps = 29/92 (31%)

Query: 236 ENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEE------------ 283
           E+   KG E    + E E E E  E    E + + +        E  E            
Sbjct: 787 EDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAK 846

Query: 284 -----------------EEEEEEEEEEEEEEE 298
                            EEEEEEEEEEEEEEE
Sbjct: 847 QDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEE 878



 Score = 38.4 bits (89), Expect = 0.004
 Identities = 20/65 (30%), Positives = 31/65 (47%)

Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
             E D  +G      ++ E ++E  E+    E + + K+   G  G    +  + EEEEE
Sbjct: 809 AGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEE 868

Query: 294 EEEEE 298
           EEEEE
Sbjct: 869 EEEEE 873



 Score = 37.7 bits (87), Expect = 0.008
 Identities = 26/79 (32%), Positives = 40/79 (50%), Gaps = 2/79 (2%)

Query: 225 PDGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGG--GEEE 282
            +G  +T  ++E++     ER+  +E E E E +E     ET+ +E +  G     G E+
Sbjct: 669 QEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTED 728

Query: 283 EEEEEEEEEEEEEEEEIGG 301
           E E E  EE EE E+E  G
Sbjct: 729 EGEIETGEEGEEVEDEGEG 747



 Score = 37.7 bits (87), Expect = 0.008
 Identities = 26/74 (35%), Positives = 33/74 (44%), Gaps = 9/74 (12%)

Query: 234 QSENDYCKGGEREVHEEEEE--EEEEEEETKAK-------ETKLKEKKKLGGGGGEEEEE 284
           + EN    GGE E   E E   E E E E  A+       E +++ K+    G  E EE 
Sbjct: 656 EGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEV 715

Query: 285 EEEEEEEEEEEEEE 298
           E E E E E  E+E
Sbjct: 716 EHEGETEAEGTEDE 729



 Score = 35.7 bits (82), Expect = 0.029
 Identities = 21/60 (35%), Positives = 26/60 (43%)

Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
           +  E +   E E EE E E     E    E +   G  GEE E+E E E E + E E  G
Sbjct: 700 EAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEG 759



 Score = 34.6 bits (79), Expect = 0.083
 Identities = 22/64 (34%), Positives = 26/64 (40%), Gaps = 5/64 (7%)

Query: 238 DYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
           D  KG   E     E   EE E     E +  E+      GGE E+E E E + E E E 
Sbjct: 629 DLSKGDVAEAEHTGERTGEEGERPTEAEGENGEE-----SGGEAEQEGETETKGENESEG 683

Query: 298 EIGG 301
           EI  
Sbjct: 684 EIPA 687



 Score = 34.6 bits (79), Expect = 0.085
 Identities = 19/61 (31%), Positives = 26/61 (42%)

Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
              E E  EE   E E+E ET+ K     E +      GE+E E E E +E + + E   
Sbjct: 653 TEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEA 712

Query: 301 G 301
            
Sbjct: 713 E 713



 Score = 34.2 bits (78), Expect = 0.095
 Identities = 16/25 (64%), Positives = 17/25 (68%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKL 268
           E E  EEEEEEEEEEEE +  E  L
Sbjct: 870 EEEEEEEEEEEEEEEEEEEENEEPL 894



 Score = 34.2 bits (78), Expect = 0.11
 Identities = 20/59 (33%), Positives = 28/59 (47%)

Query: 242 GGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
           G E E  E+E E E E +     E   KE +  G    E +E+E+E E +  E+ E  G
Sbjct: 735 GEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDGEMKG 793



 Score = 33.8 bits (77), Expect = 0.12
 Identities = 18/58 (31%), Positives = 23/58 (39%)

Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
           GE       E E+E E ETK +     E      G  E E E E +E + + E E   
Sbjct: 657 GENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEE 714



 Score = 33.8 bits (77), Expect = 0.12
 Identities = 21/60 (35%), Positives = 28/60 (46%), Gaps = 1/60 (1%)

Query: 243 GEREVHEE-EEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
           GE E  EE EE E+E E E + K     E  +       E E E +E+E+E E +    G
Sbjct: 730 GEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDG 789



 Score = 33.8 bits (77), Expect = 0.13
 Identities = 18/59 (30%), Positives = 29/59 (49%)

Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
            +   H+ E E EE E E + +    +++ ++  G   EE E+E E E E + E E  G
Sbjct: 701 AKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEG 759



 Score = 33.4 bits (76), Expect = 0.19
 Identities = 13/50 (26%), Positives = 19/50 (38%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           E   EE E   E + +  +    +    G  E + E E E E   E + E
Sbjct: 643 ERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGE 692



 Score = 33.0 bits (75), Expect = 0.25
 Identities = 16/26 (61%), Positives = 18/26 (69%)

Query: 235 SENDYCKGGEREVHEEEEEEEEEEEE 260
           SE +  +  E E  EEEEEEEEEEEE
Sbjct: 863 SEEEEEEEEEEEEEEEEEEEEEEEEE 888



 Score = 32.3 bits (73), Expect = 0.42
 Identities = 21/57 (36%), Positives = 29/57 (50%)

Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
           +G E E   E  EE EE E+    E + K + +  G   E E E E E E +E+E+E
Sbjct: 724 EGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDE 780



 Score = 32.3 bits (73), Expect = 0.43
 Identities = 20/59 (33%), Positives = 29/59 (49%), Gaps = 1/59 (1%)

Query: 244 EREVHEEEEEEEEEEE-ETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
           E E+   EE EE E+E E +A+     E +        E E E E +E+E+E E + G 
Sbjct: 729 EGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGE 787



 Score = 31.9 bits (72), Expect = 0.49
 Identities = 22/75 (29%), Positives = 36/75 (48%), Gaps = 2/75 (2%)

Query: 241 KGGEREVHEEEEEEEEEEEETKA--KETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           +G E E   E E E + E ET+   KET+ + + +  G   E+E E +  E+ E + +E 
Sbjct: 737 EGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEG 796

Query: 299 IGGVVVSAVDFGSGG 313
             G V    +  +G 
Sbjct: 797 AEGKVEHEGETEAGE 811



 Score = 31.5 bits (71), Expect = 0.64
 Identities = 19/62 (30%), Positives = 25/62 (40%)

Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
            G E E   E E E  EE   +A++    E K      GE   E + E+E E E E +  
Sbjct: 645 TGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEA 704

Query: 301 GV 302
             
Sbjct: 705 DH 706



 Score = 31.5 bits (71), Expect = 0.70
 Identities = 21/62 (33%), Positives = 30/62 (48%), Gaps = 1/62 (1%)

Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLK-EKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           + GE    E E+E E E +     E ++  E+K    G GE E +E + + E E EE E 
Sbjct: 658 ENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEH 717

Query: 300 GG 301
            G
Sbjct: 718 EG 719



 Score = 29.6 bits (66), Expect = 2.7
 Identities = 18/47 (38%), Positives = 24/47 (51%), Gaps = 7/47 (14%)

Query: 226 DGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEE-------ETKAKE 265
           DG +    + E +  +  E E  EEEEEEEE EE       ET+ K+
Sbjct: 859 DGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEEPLSLEWPETRQKQ 905



 Score = 28.0 bits (62), Expect = 8.3
 Identities = 19/67 (28%), Positives = 27/67 (40%), Gaps = 1/67 (1%)

Query: 236 ENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
           E +  +       E E  E+E E ET   E +  E +  G   G+ E E E + +E E E
Sbjct: 709 ETEAEEVEHEGETEAEGTEDEGEIETGE-EGEEVEDEGEGEAEGKHEVETEGDRKETEHE 767

Query: 296 EEEIGGV 302
            E     
Sbjct: 768 GETEAEG 774


>gnl|CDD|165177 PHA02834, PHA02834, chemokine receptor-like protein; Provisional.
          Length = 323

 Score = 54.1 bits (130), Expect = 2e-08
 Identities = 40/161 (24%), Positives = 88/161 (54%), Gaps = 10/161 (6%)

Query: 43  SHEYINDNDVDAEAMERMVSIIVPIFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLII 102
           ++ Y  + + D E +   V+  V +F+ ++ I G  GN LV+ V+   + M    ++ + 
Sbjct: 9   NYSYYFEEECDFEMVNSDVNYFVIVFYILLFIFGLIGNVLVIAVLIVKRFM-FVVDVYLF 67

Query: 103 NLAIADLLFIVCCVPFTATDYVTTCWPFGDVWCKMYLIV--VTAYASVYTLVLMSLDRFL 160
           N+A++DL+ +V   PF   + +   W FG+  CK+ L V  V  +++++ + L+S+DR++
Sbjct: 68  NIAMSDLM-LVFSFPFIIHNDLNE-WIFGEFMCKLVLGVYFVGFFSNMFFVTLISIDRYI 125

Query: 161 AVVHPIASMSVRTERNALKAIL--ITWIVIVLTAVPVFNMH 199
            VV+   +  ++ +  +L  +L    W+  V+ ++P   ++
Sbjct: 126 LVVN---ATKIKNKSISLSVLLSVAAWVCSVILSMPAMVLY 163


>gnl|CDD|192535 pfam10320, 7TM_GPCR_Srsx, Serpentine type 7TM GPCR chemoreceptor
           Srsx.  Chemoreception is mediated in Caenorhabditis
           elegans by members of the seven-transmembrane
           G-protein-coupled receptor class (7TM GPCRs) of proteins
           which are of the serpentine type. Srsx is a solo family
           amongst the superfamilies of chemoreceptors.
           Chemoperception is one of the central senses of soil
           nematodes like C. elegans which are otherwise 'blind'
           and 'deaf'.
          Length = 257

 Score = 50.7 bits (122), Expect = 3e-07
 Identities = 27/110 (24%), Positives = 51/110 (46%), Gaps = 14/110 (12%)

Query: 73  IILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTA---TDYVTT--- 126
            ++G FGN +++++    +++RS  + LI    +ADLL +   + F     T    T   
Sbjct: 1   SVIGIFGNVIMIILTFKKKKLRSKCSYLICVQCLADLLCLSGEIIFVVLLFTGTQLTRNE 60

Query: 127 CWPFGDVWCKMYLIVVTAYASVYTLVL-MSLDRFLAVVHPIASMSVRTER 175
           C+         Y+   TA +    L+L + +DR +AV  PI    + + +
Sbjct: 61  CFLIIIP----YIFGQTAQS---PLMLMIGIDRLIAVKFPIFYRLLSSSK 103


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
           subunit.  This is a family of proteins which are
           subunits of the eukaryotic translation initiation factor
           3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
           cerevisiae protein eIF3j (HCR1) has been shown to be
           required for processing of 20S pre-rRNA and binds to 18S
           rRNA and eIF3 subunits Rpg1p and Prt1p.
          Length = 242

 Score = 49.3 bits (118), Expect = 7e-07
 Identities = 21/50 (42%), Positives = 29/50 (58%)

Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           +EEE+EE+EEE      K K KK L     E+E+ + E+EE+   E EE 
Sbjct: 38  DEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEED 87



 Score = 41.2 bits (97), Expect = 3e-04
 Identities = 21/60 (35%), Positives = 34/60 (56%), Gaps = 14/60 (23%)

Query: 249 EEEEEEEEEEEETK---------AKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           +EEE+EE+EEE+ K         A + K++EK+K      + E+EE+   E EE+  E+ 
Sbjct: 38  DEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKA-----KREKEEKGLRELEEDTPEDE 92



 Score = 31.2 bits (71), Expect = 0.58
 Identities = 20/70 (28%), Positives = 32/70 (45%), Gaps = 19/70 (27%)

Query: 246 EVHEEEEEEEEE-----------------EEETKAKETKLKEKKKLGGGGGEEEEEEEEE 288
           +  E+EE+EEE+                 EE+ KAK  + KE+K L     +  E+E  E
Sbjct: 38  DEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAK--REKEEKGLRELEEDTPEDELAE 95

Query: 289 EEEEEEEEEE 298
           +    + +EE
Sbjct: 96  KLRLRKLQEE 105



 Score = 30.8 bits (70), Expect = 0.84
 Identities = 15/44 (34%), Positives = 25/44 (56%)

Query: 255 EEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           ++EE E  A   K   K K      +++ ++  +EEE+EE+EEE
Sbjct: 5   DDEEFEPPAPPAKAVVKDKWDDEDEDDDVKDSWDEEEDEEKEEE 48


>gnl|CDD|218538 pfam05285, SDA1, SDA1.  This family consists of several SDA1
           protein homologues. SDA1 is a Saccharomyces cerevisiae
           protein which is involved in the control of the actin
           cytoskeleton. The protein is essential for cell
           viability and is localised in the nucleus.
          Length = 317

 Score = 47.3 bits (113), Expect = 4e-06
 Identities = 20/57 (35%), Positives = 31/57 (54%)

Query: 242 GGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           G   +V  ++E E  + E+ + K+   K+ K+       EE+EEE  EEEE E E+E
Sbjct: 116 GEWIDVESDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKE 172



 Score = 43.9 bits (104), Expect = 5e-05
 Identities = 24/58 (41%), Positives = 32/58 (55%), Gaps = 7/58 (12%)

Query: 242 GGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
             E E  + E+EEE++E   KAKE   +E         EE+EEE  EEEE E E+E+ 
Sbjct: 124 DKEIESSDSEDEEEKDEAAKKAKEDSDEELS-------EEDEEEAAEEEEAEAEKEKA 174



 Score = 40.4 bits (95), Expect = 7e-04
 Identities = 18/56 (32%), Positives = 30/56 (53%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           E +   E  + E+EEE+ +A +   ++  +      EEE  EEEE E E+E+  E+
Sbjct: 122 ESDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEKASEL 177



 Score = 36.2 bits (84), Expect = 0.015
 Identities = 17/66 (25%), Positives = 30/66 (45%), Gaps = 7/66 (10%)

Query: 236 ENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
           E    K  E+ +  +++++EEEE E +  E    E       G   + E ++E E  + E
Sbjct: 81  EERKKKEAEQGLESDDDDDEEEEWEVEEDEDSDDE-------GEWIDVESDKEIESSDSE 133

Query: 296 EEEIGG 301
           +EE   
Sbjct: 134 DEEEKD 139



 Score = 36.2 bits (84), Expect = 0.017
 Identities = 20/68 (29%), Positives = 33/68 (48%), Gaps = 11/68 (16%)

Query: 243 GEREVHEEEE-----EEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
           GE EV +        E+ +EEE  K +  +  E         +++EEEE E EE+E+ ++
Sbjct: 61  GEEEVVDGIPGLELLEKWKEEERKKKEAEQGLE------SDDDDDEEEEWEVEEDEDSDD 114

Query: 298 EIGGVVVS 305
           E   + V 
Sbjct: 115 EGEWIDVE 122



 Score = 31.5 bits (72), Expect = 0.49
 Identities = 14/65 (21%), Positives = 26/65 (40%), Gaps = 13/65 (20%)

Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
           K  ER+  E E+  E ++++                   E EE+E+ ++E E  + E   
Sbjct: 79  KEEERKKKEAEQGLESDDDDD-------------EEEEWEVEEDEDSDDEGEWIDVESDK 125

Query: 301 GVVVS 305
            +  S
Sbjct: 126 EIESS 130



 Score = 28.5 bits (64), Expect = 4.9
 Identities = 10/38 (26%), Positives = 22/38 (57%)

Query: 236 ENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKK 273
           E +  +  ++   + +EE  EE+EE  A+E + + +K+
Sbjct: 135 EEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKE 172



 Score = 28.1 bits (63), Expect = 6.5
 Identities = 22/87 (25%), Positives = 35/87 (40%), Gaps = 19/87 (21%)

Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKE------------TKLKE-------KKKL 274
             E+   +  E +  E  EEEE E E+ KA E             K++E        K L
Sbjct: 145 AKEDSDEELSEEDEEEAAEEEEAEAEKEKASELATTRILTPADFAKIQELRLEKGVDKAL 204

Query: 275 GGGGGEEEEEEEEEEEEEEEEEEEIGG 301
           GG     +++  E   +E  + ++I G
Sbjct: 205 GGKLKRRDKDAPERHSDELVDADDIEG 231



 Score = 27.3 bits (61), Expect = 9.9
 Identities = 12/39 (30%), Positives = 22/39 (56%)

Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKK 272
           + E D      +E  +EE  EE+EEE  + +E + +++K
Sbjct: 135 EEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEK 173


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
           Rpc31.  RNA polymerase III contains seventeen subunits
           in yeasts and in human cells. Twelve of these are akin
           to RNA polymerase I or II and the other five are RNA pol
           III-specific, and form the functionally distinct groups
           (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
           Rpc34 and Rpc82 form a cluster of enzyme-specific
           subunits that contribute to transcription initiation in
           S.cerevisiae and H.sapiens. There is evidence that these
           subunits are anchored at or near the N-terminal Zn-fold
           of Rpc1, itself prolonged by a highly conserved but RNA
           polymerase III-specific domain.
          Length = 221

 Score = 46.3 bits (110), Expect = 6e-06
 Identities = 22/49 (44%), Positives = 32/49 (65%)

Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
             EEEE+ +E+    E KLKE +       +E++EEEEEEEEEE+E+ +
Sbjct: 142 FTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFD 190



 Score = 42.0 bits (99), Expect = 2e-04
 Identities = 12/54 (22%), Positives = 31/54 (57%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
           E E   +E+    E++  + +   + E+ +      EEEEEE+E+ ++++++++
Sbjct: 144 EEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDDD 197



 Score = 39.0 bits (91), Expect = 0.001
 Identities = 12/63 (19%), Positives = 36/63 (57%)

Query: 233 FQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEE 292
           F+ +       E ++ E+    E++ +E +A++   +++K       EEEE+E+ +++++
Sbjct: 135 FKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDD 194

Query: 293 EEE 295
           +++
Sbjct: 195 DDD 197


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 46.6 bits (111), Expect = 9e-06
 Identities = 22/86 (25%), Positives = 36/86 (41%), Gaps = 6/86 (6%)

Query: 222 VNRPDG---YNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEK---KKLG 275
           V  P     YN +    +    +  E+EV  E++ +E E  E K  E    E     ++ 
Sbjct: 179 VEVPHAGASYNPSFEDHQELLQEEYEKEVKAEKKRQELERVEEKKLEKMAPEASRLDEMS 238

Query: 276 GGGGEEEEEEEEEEEEEEEEEEEIGG 301
            G  EE +++ EEE ++E   E    
Sbjct: 239 EGLLEESDDDGEEESDDESAWEGFES 264



 Score = 36.2 bits (84), Expect = 0.020
 Identities = 17/72 (23%), Positives = 27/72 (37%), Gaps = 13/72 (18%)

Query: 236 ENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGE--------EEEEEEE 287
           E     G E    E   E  E E E   K  + K K K      +        E E E +
Sbjct: 243 EESDDDGEEESDDESAWEGFESEYEPINKPVRPKRKTK-----AQRNKEKRRKELEREAK 297

Query: 288 EEEEEEEEEEEI 299
           EE++ +++  ++
Sbjct: 298 EEKQLKKKLAQL 309



 Score = 33.1 bits (76), Expect = 0.16
 Identities = 12/55 (21%), Positives = 29/55 (52%), Gaps = 6/55 (10%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           ERE  EE++ +++  +  + KE   +  +K      E+    ++E+ +E  E+++
Sbjct: 293 EREAKEEKQLKKKLAQLARLKEIAKEVAQK------EKARARKKEQRKERGEKKK 341



 Score = 31.2 bits (71), Expect = 0.69
 Identities = 7/25 (28%), Positives = 14/25 (56%)

Query: 279 GEEEEEEEEEEEEEEEEEEEIGGVV 303
            E+E + E++ +E E  EE+    +
Sbjct: 203 YEKEVKAEKKRQELERVEEKKLEKM 227


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
           Fas-binding protein) is thought to play a role in
           apoptosis, but precise role played by Daxx remains to be
           determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 46.4 bits (110), Expect = 1e-05
 Identities = 28/66 (42%), Positives = 35/66 (53%), Gaps = 6/66 (9%)

Query: 236 ENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
           E+      E E  E  EEEEEEEEE + +E + +E        GE+EEEEEE E +   E
Sbjct: 430 ESPSMASQESEEEESVEEEEEEEEEEEEEEQESEE------EEGEDEEEEEEVEADNGSE 483

Query: 296 EEEIGG 301
           EE  G 
Sbjct: 484 EEMEGS 489



 Score = 41.8 bits (98), Expect = 4e-04
 Identities = 22/72 (30%), Positives = 35/72 (48%)

Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
           + E +  +  E E  E EEEE E+EEE +  E     ++++ G    + + EE EE+ E 
Sbjct: 446 EEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAER 505

Query: 294 EEEEEIGGVVVS 305
              E  G   +S
Sbjct: 506 RNSEMAGISRMS 517



 Score = 38.4 bits (89), Expect = 0.005
 Identities = 21/58 (36%), Positives = 25/58 (43%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
           E E  E E+EEEEEE E      +  E    G G GEE EE+ E    E      +  
Sbjct: 461 ESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEMAGISRMSE 518



 Score = 37.6 bits (87), Expect = 0.007
 Identities = 25/69 (36%), Positives = 31/69 (44%), Gaps = 16/69 (23%)

Query: 233 FQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEE 292
            Q   +     E E  EEEEEEEE+E E +                  E+EEEEEE E +
Sbjct: 436 SQESEEEESVEEEEEEEEEEEEEEQESEEEE----------------GEDEEEEEEVEAD 479

Query: 293 EEEEEEIGG 301
              EEE+ G
Sbjct: 480 NGSEEEMEG 488



 Score = 37.2 bits (86), Expect = 0.011
 Identities = 19/66 (28%), Positives = 27/66 (40%)

Query: 236 ENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
           E +  +    E   E+EEEEEE E     E +++   +  G G E EE+ E    E    
Sbjct: 454 EEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEMAGI 513

Query: 296 EEEIGG 301
                G
Sbjct: 514 SRMSEG 519



 Score = 35.7 bits (82), Expect = 0.028
 Identities = 22/55 (40%), Positives = 29/55 (52%), Gaps = 5/55 (9%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGE-----EEEEEEEEEEEEEEEEEE 298
           EEEE  + +E E +   ++  +  K     GE      +E EEEE  EEEEEEEE
Sbjct: 399 EEEERRKRQERERQGTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEEE 453


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 46.1 bits (110), Expect = 2e-05
 Identities = 25/50 (50%), Positives = 37/50 (74%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           E++ EEE++E++ KA   K KE+++      +EEE+EEEEEE EEE+EEE
Sbjct: 420 EKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEE 469



 Score = 45.7 bits (109), Expect = 2e-05
 Identities = 24/53 (45%), Positives = 36/53 (67%)

Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
            V + E++ EEE++E K K    K+K++      E++EEE+EEEEEE EEE+E
Sbjct: 415 IVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKE 467



 Score = 45.7 bits (109), Expect = 2e-05
 Identities = 24/58 (41%), Positives = 39/58 (67%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
           E+   + EEE++E++++  A + K +E+++      EE+EEEEEE EEE+EEEEE   
Sbjct: 417 EKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKK 474



 Score = 44.5 bits (106), Expect = 4e-05
 Identities = 26/85 (30%), Positives = 45/85 (52%), Gaps = 5/85 (5%)

Query: 214 CVFLDLDPVNRPDGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKK 273
             FL+L      +   + +  S+    K  +     E++ EEE++E+ K      K++++
Sbjct: 389 AAFLELT-----EEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEE 443

Query: 274 LGGGGGEEEEEEEEEEEEEEEEEEE 298
                 ++EEE+EEEEEE EEE+EE
Sbjct: 444 EEEEKEKKEEEKEEEEEEAEEEKEE 468



 Score = 44.5 bits (106), Expect = 5e-05
 Identities = 22/55 (40%), Positives = 36/55 (65%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
             +  ++ EEE++E+++      K +E+++      EEE+EEEEEE EEE+EEEE
Sbjct: 416 VEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEE 470



 Score = 40.3 bits (95), Expect = 0.001
 Identities = 20/53 (37%), Positives = 33/53 (62%), Gaps = 6/53 (11%)

Query: 247 VHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           + +  E+ E++ EE K ++ K     K      ++EEEEEEE+E++EEE+EE 
Sbjct: 412 IKKIVEKAEKKREEEKKEKKKKAFAGK------KKEEEEEEEKEKKEEEKEEE 458



 Score = 39.5 bits (93), Expect = 0.002
 Identities = 19/57 (33%), Positives = 38/57 (66%), Gaps = 4/57 (7%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
           E++  +++    +++EE + +E + KE++K      EEEE EEE+EEEEE+++++  
Sbjct: 426 EKKEKKKKAFAGKKKEEEEEEEKEKKEEEK----EEEEEEAEEEKEEEEEKKKKQAT 478



 Score = 38.0 bits (89), Expect = 0.006
 Identities = 17/60 (28%), Positives = 34/60 (56%)

Query: 247 VHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVSA 306
              ++  ++ ++   KA++ + +EKK+        +++EEEEEEE+E++EEE       A
Sbjct: 403 TGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEA 462



 Score = 33.0 bits (76), Expect = 0.23
 Identities = 24/57 (42%), Positives = 35/57 (61%), Gaps = 12/57 (21%)

Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
           +  ++    +++EEEEEEE+ K +E K            EEEEEE EEE+EEEEE++
Sbjct: 429 EKKKKAFAGKKKEEEEEEEKEKKEEEK------------EEEEEEAEEEKEEEEEKK 473



 Score = 29.9 bits (68), Expect = 2.0
 Identities = 13/40 (32%), Positives = 25/40 (62%)

Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGE 280
            G ++E  EEEE+E++EEE+ + +E   +EK++      +
Sbjct: 436 AGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475



 Score = 29.1 bits (66), Expect = 3.1
 Identities = 14/47 (29%), Positives = 26/47 (55%), Gaps = 2/47 (4%)

Query: 253 EEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
            EEE E  T +K+   K KK +     E++ EEE++E++++    + 
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKA--EKKREEEKKEKKKKAFAGKK 439


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 44.2 bits (104), Expect = 6e-05
 Identities = 16/63 (25%), Positives = 31/63 (49%), Gaps = 2/63 (3%)

Query: 243 GEREVHEEEEEEEEEEEETKAKETK--LKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
            E E  E+ EE EEE+ E +   +K   K KK  G   G ++++ +  ++ ++ + +   
Sbjct: 312 PEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSGDDSDDSDIDGED 371

Query: 301 GVV 303
            V 
Sbjct: 372 SVS 374



 Score = 42.3 bits (99), Expect = 2e-04
 Identities = 17/64 (26%), Positives = 26/64 (40%), Gaps = 11/64 (17%)

Query: 243 GEREVHEEE--------EEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
           G+ E  EE+          + EE E+  + E      K       + EE EEE+ EEE  
Sbjct: 277 GDDEGREEDYISDSSASGNDPEEREDKLSPE---IPAKPEIEQDEDSEESEEEKNEEEGG 333

Query: 295 EEEE 298
             ++
Sbjct: 334 LSKK 337



 Score = 37.2 bits (86), Expect = 0.009
 Identities = 17/66 (25%), Positives = 34/66 (51%), Gaps = 3/66 (4%)

Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
           +  + E  EEE+ EEE     K K+ K K K K    G ++++ +  ++ ++ + + E  
Sbjct: 316 QDEDSEESEEEKNEEEGGLSKKGKKLK-KLKGK--KNGLDKDDSDSGDDSDDSDIDGEDS 372

Query: 301 GVVVSA 306
             +V+A
Sbjct: 373 VSLVTA 378



 Score = 35.7 bits (82), Expect = 0.026
 Identities = 17/56 (30%), Positives = 32/56 (57%), Gaps = 5/56 (8%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKL-----GGGGGEEEEEEEEEEEEEEEEEEEI 299
           ++  E+ +EE+  K K+   K KKKL     G  GG+++ +E + ++ ++E  EE 
Sbjct: 230 DKGGEDGDEEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDADEYDSDDGDDEGREED 285



 Score = 34.6 bits (79), Expect = 0.059
 Identities = 13/45 (28%), Positives = 25/45 (55%)

Query: 253 EEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
           +E+EE E  K    + K+ K     G +E++ +E ++  E+ +EE
Sbjct: 195 DEDEETEGEKGGGGRGKDLKIKDLEGDDEDDGDESDKGGEDGDEE 239



 Score = 33.0 bits (75), Expect = 0.20
 Identities = 13/45 (28%), Positives = 23/45 (51%)

Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
           +E+EE E E+    +   LK K   G    + +E ++  E+ +EE
Sbjct: 195 DEDEETEGEKGGGGRGKDLKIKDLEGDDEDDGDESDKGGEDGDEE 239



 Score = 29.9 bits (67), Expect = 2.1
 Identities = 13/65 (20%), Positives = 28/65 (43%), Gaps = 3/65 (4%)

Query: 236 ENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
           + D  K G+R   ++ +E + ++ + + +E             G + EE E++   E   
Sbjct: 254 KLDDDKKGKRGGDDDADEYDSDDGDDEGREEDYISD---SSASGNDPEEREDKLSPEIPA 310

Query: 296 EEEIG 300
           + EI 
Sbjct: 311 KPEIE 315



 Score = 29.9 bits (67), Expect = 2.2
 Identities = 11/58 (18%), Positives = 25/58 (43%), Gaps = 7/58 (12%)

Query: 236 ENDYCKGGEREVHEEEE-------EEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEE 286
           E DY        ++ EE       E   + E  + ++++  E++K    GG  ++ ++
Sbjct: 283 EEDYISDSSASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKK 340


>gnl|CDD|223003 PHA03169, PHA03169, hypothetical protein; Provisional.
          Length = 413

 Score = 44.2 bits (104), Expect = 6e-05
 Identities = 14/55 (25%), Positives = 22/55 (40%), Gaps = 3/55 (5%)

Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
           + H + E + E  EE++  E   KE++  GG  G   E             EE+ 
Sbjct: 64  QGHRQTESDTETAEESRHGE---KEERGQGGPSGSGSESVGSPTPSPSGSAEELA 115



 Score = 40.0 bits (93), Expect = 0.001
 Identities = 13/83 (15%), Positives = 18/83 (21%), Gaps = 9/83 (10%)

Query: 219 LDPVNRPDGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGG 278
           L P N      ++                HE    E       +                
Sbjct: 118 LSPENTSGSSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPS---------SFLQ 168

Query: 279 GEEEEEEEEEEEEEEEEEEEIGG 301
              E+  EE E    E E +  G
Sbjct: 169 PSHEDSPEEPEPPTSEPEPDSPG 191



 Score = 40.0 bits (93), Expect = 0.001
 Identities = 9/64 (14%), Positives = 20/64 (31%)

Query: 235 SENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
           S N       +  HE+  EE E        ++    + +        +   +E  E +  
Sbjct: 158 SPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPPDEPGEPQSP 217

Query: 295 EEEE 298
             ++
Sbjct: 218 TPQQ 221



 Score = 39.2 bits (91), Expect = 0.002
 Identities = 6/57 (10%), Positives = 9/57 (15%)

Query: 242 GGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
            G  E        E     +                 G  E    E       ++  
Sbjct: 108 SGSAEELASGLSPENTSGSSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPS 164



 Score = 39.2 bits (91), Expect = 0.002
 Identities = 13/103 (12%), Positives = 30/103 (29%), Gaps = 8/103 (7%)

Query: 220 DPVNRPDGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKA--------KETKLKEK 271
            P  +P  + +   +   +  +    E   +     + E  T +        +  + +  
Sbjct: 158 SPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPPDEPGEPQSP 217

Query: 272 KKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVSAVDFGSGGW 314
                     ++  E E+E  E E E        +  +   GW
Sbjct: 218 TPQQAPSPNTQQAVEHEDEPTEPEREGPPFPGHRSHSYTVVGW 260



 Score = 38.8 bits (90), Expect = 0.002
 Identities = 7/62 (11%), Positives = 13/62 (20%)

Query: 236 ENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
             +  + GE+E   +        E   +         +    G   E       E     
Sbjct: 75  TAEESRHGEKEERGQGGPSGSGSESVGSPTPSPSGSAEELASGLSPENTSGSSPESPASH 134

Query: 296 EE 297
             
Sbjct: 135 SP 136



 Score = 38.8 bits (90), Expect = 0.003
 Identities = 5/59 (8%), Positives = 8/59 (13%)

Query: 242 GGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
                  EE       E  + +                     E    E       +  
Sbjct: 105 PSPSGSAEELASGLSPENTSGSSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQP 163



 Score = 37.3 bits (86), Expect = 0.009
 Identities = 7/64 (10%), Positives = 12/64 (18%)

Query: 235 SENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
            + +  +GG      E            A+E       +   G   E             
Sbjct: 83  EKEERGQGGPSGSGSESVGSPTPSPSGSAEELASGLSPENTSGSSPESPASHSPPPSPPS 142

Query: 295 EEEE 298
               
Sbjct: 143 HPGP 146



 Score = 36.5 bits (84), Expect = 0.013
 Identities = 5/64 (7%), Positives = 12/64 (18%)

Query: 235 SENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
           S                         +     +    +       ++     +   E+  
Sbjct: 116 SGLSPENTSGSSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSP 175

Query: 295 EEEE 298
           EE E
Sbjct: 176 EEPE 179



 Score = 36.5 bits (84), Expect = 0.015
 Identities = 6/62 (9%), Positives = 13/62 (20%), Gaps = 1/62 (1%)

Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGG-GEEEEEEEEEEEEEEEEEEEI 299
            G   E+      E       ++  +           G  E    E       ++    +
Sbjct: 108 SGSAEELASGLSPENTSGSSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFL 167

Query: 300 GG 301
             
Sbjct: 168 QP 169



 Score = 36.5 bits (84), Expect = 0.016
 Identities = 10/59 (16%), Positives = 13/59 (22%)

Query: 242 GGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
             E   H E+EE  +        E+         G   E       E       E    
Sbjct: 75  TAEESRHGEKEERGQGGPSGSGSESVGSPTPSPSGSAEELASGLSPENTSGSSPESPAS 133



 Score = 36.5 bits (84), Expect = 0.017
 Identities = 16/70 (22%), Positives = 26/70 (37%), Gaps = 5/70 (7%)

Query: 237 NDYCKGGEREVHEEEEEEEEEEEETK-AKETKLKEKKKLGGGG----GEEEEEEEEEEEE 291
                 G +     E+   + E +T+ A+E++  EK++ G GG    G E          
Sbjct: 49  PAPTTSGPQVRAVAEQGHRQTESDTETAEESRHGEKEERGQGGPSGSGSESVGSPTPSPS 108

Query: 292 EEEEEEEIGG 301
              EE   G 
Sbjct: 109 GSAEELASGL 118



 Score = 34.9 bits (80), Expect = 0.042
 Identities = 6/56 (10%), Positives = 13/56 (23%)

Query: 242 GGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
                       +   E+  +  E    E +    G  + E        +   +E 
Sbjct: 156 NPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPPDEP 211



 Score = 33.8 bits (77), Expect = 0.099
 Identities = 8/65 (12%), Positives = 15/65 (23%)

Query: 236 ENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
              +     ++     +   E+  E     T   E    G    E        +   +E 
Sbjct: 152 PESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPPDEP 211

Query: 296 EEEIG 300
            E   
Sbjct: 212 GEPQS 216


>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM).  This
           family consists of several Plasmodium falciparum SPAM
           (secreted polymorphic antigen associated with
           merozoites) proteins. Variation among SPAM alleles is
           the result of deletions and amino acid substitutions in
           non-repetitive sequences within and flanking the alanine
           heptad-repeat domain. Heptad repeats in which the a and
           d position contain hydrophobic residues generate
           amphipathic alpha-helices which give rise to helical
           bundles or coiled-coil structures in proteins. SPAM is
           an example of a P. falciparum antigen in which a
           repetitive sequence has features characteristic of a
           well-defined structural element.
          Length = 164

 Score = 42.2 bits (99), Expect = 8e-05
 Identities = 24/63 (38%), Positives = 40/63 (63%), Gaps = 7/63 (11%)

Query: 236 ENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
           EN+  K  ++E  ++EEEEEE+EEE +  E    E++ +     E+EEEEEE+EE+  + 
Sbjct: 39  ENEDVKDEKQE--DDEEEEEEDEEEIEEPEDIEDEEEIV-----EDEEEEEEDEEDNVDL 91

Query: 296 EEE 298
           ++ 
Sbjct: 92  KDI 94



 Score = 34.8 bits (80), Expect = 0.026
 Identities = 21/55 (38%), Positives = 31/55 (56%), Gaps = 9/55 (16%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           + E  EEE+EEE EE E    E ++ E         +EEEEEE+EE+  + ++ E
Sbjct: 50  DDEEEEEEDEEEIEEPEDIEDEEEIVE---------DEEEEEEDEEDNVDLKDIE 95



 Score = 31.0 bits (70), Expect = 0.52
 Identities = 23/79 (29%), Positives = 38/79 (48%), Gaps = 13/79 (16%)

Query: 221 PVNRPDGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGE 280
           P  +P+   K     E+      ++E   +E E+ ++E++   +E              E
Sbjct: 9   PEKKPEEEKKDENLLEHVKITSWDKEDIIKENEDVKDEKQEDDEEE-------------E 55

Query: 281 EEEEEEEEEEEEEEEEEEI 299
           EE+EEE EE E+ E+EEEI
Sbjct: 56  EEDEEEIEEPEDIEDEEEI 74



 Score = 30.6 bits (69), Expect = 0.57
 Identities = 26/63 (41%), Positives = 37/63 (58%), Gaps = 5/63 (7%)

Query: 242 GGEREVHEEEEEEEEEEEE-----TKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
           GG  E   EEE+++E   E     +  KE  +KE + +     E++EEEEEE+EEE EE 
Sbjct: 6   GGAPEKKPEEEKKDENLLEHVKITSWDKEDIIKENEDVKDEKQEDDEEEEEEDEEEIEEP 65

Query: 297 EEI 299
           E+I
Sbjct: 66  EDI 68



 Score = 29.8 bits (67), Expect = 1.1
 Identities = 17/55 (30%), Positives = 28/55 (50%), Gaps = 14/55 (25%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           E +  E EE E+ E+EE   ++              EEEEEE+EE+  + ++ E+
Sbjct: 56  EEDEEEIEEPEDIEDEEEIVED--------------EEEEEEDEEDNVDLKDIEK 96



 Score = 29.8 bits (67), Expect = 1.1
 Identities = 12/27 (44%), Positives = 18/27 (66%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKE 270
           E E+ E+EEEEEE+EE+    +   K+
Sbjct: 71  EEEIVEDEEEEEEDEEDNVDLKDIEKK 97


>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein.  This
           family includes the radial spoke head proteins RSP4 and
           RSP6 from Chlamydomonas reinhardtii, and several
           eukaryotic homologues, including mammalian RSHL1, the
           protein product of a familial ciliary dyskinesia
           candidate gene.
          Length = 481

 Score = 43.1 bits (102), Expect = 1e-04
 Identities = 29/86 (33%), Positives = 34/86 (39%), Gaps = 35/86 (40%)

Query: 249 EEEEEEEEEEEETKAKE-----------------------------------TKLKEKKK 273
           EEEEEEEEEEE  +  E                                   T +  ++K
Sbjct: 288 EEEEEEEEEEEPAERDELEENPDFEGLEVRELADSLDNWVHHVQHILPQGRCTWVNPEQK 347

Query: 274 LGGGGGEEEEEEEEEEEEEEEEEEEI 299
                 E+EEEEEEEEE EE E EE 
Sbjct: 348 DEEEEQEDEEEEEEEEEPEEPEPEEG 373



 Score = 42.4 bits (100), Expect = 2e-04
 Identities = 22/73 (30%), Positives = 28/73 (38%), Gaps = 18/73 (24%)

Query: 246 EVHEEEEEEEEEEEETKAKETKLKE------------------KKKLGGGGGEEEEEEEE 287
           E  E EE  + E  E +     L                     ++      +E+EEEEE
Sbjct: 301 ERDELEENPDFEGLEVRELADSLDNWVHHVQHILPQGRCTWVNPEQKDEEEEQEDEEEEE 360

Query: 288 EEEEEEEEEEEIG 300
           EEEE EE E E G
Sbjct: 361 EEEEPEEPEPEEG 373



 Score = 35.4 bits (82), Expect = 0.033
 Identities = 19/49 (38%), Positives = 24/49 (48%), Gaps = 19/49 (38%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
           +++EEEE+E+EE                   EEEEEEE EE E EE   
Sbjct: 346 QKDEEEEQEDEE-------------------EEEEEEEPEEPEPEEGPP 375



 Score = 34.3 bits (79), Expect = 0.081
 Identities = 30/72 (41%), Positives = 36/72 (50%), Gaps = 15/72 (20%)

Query: 231 TIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEE 290
            I  ++ DY    E E+ E EEEEEEEE     +E              +E EEEEEEE 
Sbjct: 152 KILGTQKDYYVA-EVELREGEEEEEEEEVGEADEE--------------DEGEEEEEEEP 196

Query: 291 EEEEEEEEIGGV 302
           EE  +EEE  GV
Sbjct: 197 EEVPKEEEGTGV 208



 Score = 33.9 bits (78), Expect = 0.096
 Identities = 21/56 (37%), Positives = 24/56 (42%), Gaps = 19/56 (33%)

Query: 240 CKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
           C     E  +EEEE+E+EEEE                   EEEEE EE E EE   
Sbjct: 339 CTWVNPEQKDEEEEQEDEEEE-------------------EEEEEPEEPEPEEGPP 375



 Score = 31.6 bits (72), Expect = 0.65
 Identities = 15/30 (50%), Positives = 16/30 (53%)

Query: 237 NDYCKGGEREVHEEEEEEEEEEEETKAKET 266
           N   K  E E  +EEEEEEEEE E    E 
Sbjct: 343 NPEQKDEEEEQEDEEEEEEEEEPEEPEPEE 372


>gnl|CDD|216403 pfam01271, Granin, Granin (chromogranin or secretogranin). 
          Length = 585

 Score = 43.4 bits (102), Expect = 1e-04
 Identities = 20/57 (35%), Positives = 27/57 (47%)

Query: 242 GGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           G E E  EE+EE+ +E     A +T L       G G  +   E E  EE EE+E +
Sbjct: 264 GLEDEAEEEKEEKGQESRGLSAVQTYLLRWVNARGRGRSQNRAERERSEESEEKELD 320



 Score = 40.7 bits (95), Expect = 8e-04
 Identities = 23/61 (37%), Positives = 32/61 (52%), Gaps = 6/61 (9%)

Query: 242 GGEREVHEEEEEEEEEEEETKA------KETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
           GGE     EE EE++ +EE K       K   L ++ K  G  G E+E EEE+EE+ +E 
Sbjct: 221 GGEDWNPIEEGEEDQTQEEVKRSKERTHKGRSLPDESKRSGQLGLEDEAEEEKEEKGQES 280

Query: 296 E 296
            
Sbjct: 281 R 281



 Score = 36.1 bits (83), Expect = 0.023
 Identities = 14/58 (24%), Positives = 26/58 (44%)

Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           K       E E+  +++E++T  +     E    G      EE EE++ +EE +  +E
Sbjct: 188 KQKRERSDEREKSYQDDEDDTYRQNNIPYEDVVGGEDWNPIEEGEEDQTQEEVKRSKE 245



 Score = 31.9 bits (72), Expect = 0.44
 Identities = 13/62 (20%), Positives = 24/62 (38%), Gaps = 4/62 (6%)

Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEK----KKLGGGGGEEEEEEEEEEEEEEEEEEE 298
              E H +   E  E       +  LK       KL G   ++ E  +E E+  +++E++
Sbjct: 148 NSEEKHSKRTNEIVEVFYNPQSQATLKSVFEEVGKLDGQSKQKRERSDEREKSYQDDEDD 207

Query: 299 IG 300
             
Sbjct: 208 TY 209



 Score = 27.7 bits (61), Expect = 9.0
 Identities = 15/77 (19%), Positives = 24/77 (31%), Gaps = 19/77 (24%)

Query: 241 KGGEREVHEEEEEEEEEE-------------------EETKAKETKLKEKKKLGGGGGEE 281
           +   R   E  EE EE+E                        +  +  + +K  G   EE
Sbjct: 301 RSQNRAERERSEESEEKELDRASPYQEIEITANLQIPPSDLIRMLRKGDGRKPRGRVEEE 360

Query: 282 EEEEEEEEEEEEEEEEE 298
              E  E  +E+ E + 
Sbjct: 361 GALEAPEALDEKRELDL 377


>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 142

 Score = 40.8 bits (96), Expect = 1e-04
 Identities = 24/84 (28%), Positives = 36/84 (42%), Gaps = 19/84 (22%)

Query: 244 EREVHEEEEEEE----------EEEEET---KAKETKLKEKK------KLGGGGGEEEEE 284
           + +  +E E+EE          ++EE+T   +AK  K K+KK      K G    E+E  
Sbjct: 56  DEKWKKETEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGS 115

Query: 285 EEEEEEEEEEEEEEIGGVVVSAVD 308
           +  EE  +EEEE E          
Sbjct: 116 KSSEESSDEEEEGEEDKQEEPVEI 139



 Score = 33.1 bits (76), Expect = 0.078
 Identities = 19/86 (22%), Positives = 31/86 (36%), Gaps = 26/86 (30%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKL--------------------------GGGGGEEE 282
           E  +E+ ++E E +  + K +EKK+                            G   EE+
Sbjct: 53  ELMDEKWKKETEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEK 112

Query: 283 EEEEEEEEEEEEEEEEIGGVVVSAVD 308
           E  +  EE  +EEEE         V+
Sbjct: 113 EGSKSSEESSDEEEEGEEDKQEEPVE 138


>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II).  Bone
           sialoprotein (BSP) is a major structural protein of the
           bone matrix that is specifically expressed by
           fully-differentiated osteoblasts. The expression of bone
           sialoprotein (BSP) is normally restricted to mineralised
           connective tissues of bones and teeth where it has been
           associated with mineral crystal formation. However, it
           has been found that ectopic expression of BSP occurs in
           various lesions, including oral and extraoral
           carcinomas, in which it has been associated with the
           formation of microcrystalline deposits and the
           metastasis of cancer cells to bone.
          Length = 291

 Score = 42.4 bits (99), Expect = 2e-04
 Identities = 24/65 (36%), Positives = 29/65 (44%), Gaps = 4/65 (6%)

Query: 246 EVHEEEEEEEEEEEETKAKETKLK----EKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
           E  EEEEEEEEEE E +  E           ++  G G    +  EE EEE   E E  G
Sbjct: 137 EDEEEEEEEEEEEAEVEENEQGTNGTSTNSTEVDHGNGSSGGDNGEEGEEESVTEAEAEG 196

Query: 302 VVVSA 306
             V+ 
Sbjct: 197 TTVAG 201



 Score = 41.2 bits (96), Expect = 4e-04
 Identities = 26/65 (40%), Positives = 28/65 (43%), Gaps = 8/65 (12%)

Query: 241 KGGEREVHEEEEEEEEEEEE--------TKAKETKLKEKKKLGGGGGEEEEEEEEEEEEE 292
           K  E +  EEEEEEEEEEE         T    T   E     G  G +  EE EEE   
Sbjct: 131 KEDESDEDEEEEEEEEEEEAEVEENEQGTNGTSTNSTEVDHGNGSSGGDNGEEGEEESVT 190

Query: 293 EEEEE 297
           E E E
Sbjct: 191 EAEAE 195



 Score = 37.0 bits (85), Expect = 0.009
 Identities = 17/29 (58%), Positives = 19/29 (65%)

Query: 270 EKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
            KK       +E+EEEEEEEEEEE E EE
Sbjct: 126 GKKATKEDESDEDEEEEEEEEEEEAEVEE 154



 Score = 35.8 bits (82), Expect = 0.019
 Identities = 22/73 (30%), Positives = 30/73 (41%), Gaps = 20/73 (27%)

Query: 240 CKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
            K G       +E+E +E+EE                    EEEEEEEEE E EE E+  
Sbjct: 120 KKAGNAGKKATKEDESDEDEE--------------------EEEEEEEEEAEVEENEQGT 159

Query: 300 GGVVVSAVDFGSG 312
            G   ++ +   G
Sbjct: 160 NGTSTNSTEVDHG 172



 Score = 30.4 bits (68), Expect = 1.1
 Identities = 20/63 (31%), Positives = 25/63 (39%), Gaps = 18/63 (28%)

Query: 251 EEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVSAVDFG 310
           +  EEE EEET                    EEE  E+ +  E+EE E     +S V  G
Sbjct: 58  DSSEEEGEEET------------------SNEEENNEDSDGNEDEEAEAENTTLSTVTLG 99

Query: 311 SGG 313
            GG
Sbjct: 100 YGG 102


>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein.  This
           family includes archaebacterial L12, eukaryotic P0, P1
           and P2.
          Length = 88

 Score = 39.5 bits (93), Expect = 2e-04
 Identities = 12/27 (44%), Positives = 18/27 (66%)

Query: 275 GGGGGEEEEEEEEEEEEEEEEEEEIGG 301
                   EEE++EEEEEEEE++++G 
Sbjct: 59  AAAAAAAAEEEKKEEEEEEEEDDDMGF 85



 Score = 37.6 bits (88), Expect = 9e-04
 Identities = 11/24 (45%), Positives = 15/24 (62%)

Query: 275 GGGGGEEEEEEEEEEEEEEEEEEE 298
                    EEE++EEEEEEEE++
Sbjct: 58  AAAAAAAAAEEEKKEEEEEEEEDD 81



 Score = 27.2 bits (61), Expect = 3.7
 Identities = 10/20 (50%), Positives = 12/20 (60%)

Query: 241 KGGEREVHEEEEEEEEEEEE 260
                   EE++EEEEEEEE
Sbjct: 60  AAAAAAAEEEKKEEEEEEEE 79



 Score = 26.4 bits (59), Expect = 6.4
 Identities = 9/12 (75%), Positives = 12/12 (100%)

Query: 249 EEEEEEEEEEEE 260
           E++EEEEEEEE+
Sbjct: 69  EKKEEEEEEEED 80



 Score = 26.0 bits (58), Expect = 8.5
 Identities = 8/12 (66%), Positives = 12/12 (100%)

Query: 249 EEEEEEEEEEEE 260
           ++EEEEEEEE++
Sbjct: 70  KKEEEEEEEEDD 81


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 42.3 bits (100), Expect = 2e-04
 Identities = 20/53 (37%), Positives = 33/53 (62%)

Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           E  +EEEE+ +E  +    E +  +    G G GEE+EEEEE+  ++E+EE++
Sbjct: 286 EEDDEEEEDSKESADDLDDEFEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDD 338



 Score = 41.5 bits (98), Expect = 4e-04
 Identities = 20/57 (35%), Positives = 36/57 (63%)

Query: 242 GGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           G   E  EEEE+  ++E+E    +   +E++ +     EE+EE+E+ ++E++EEEEE
Sbjct: 317 GQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEE 373



 Score = 41.5 bits (98), Expect = 5e-04
 Identities = 17/56 (30%), Positives = 34/56 (60%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           + E  E+  ++E+EE++    E + ++         EE+E+ ++E++EEEEEEE+ 
Sbjct: 322 DEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKE 377



 Score = 40.4 bits (95), Expect = 0.001
 Identities = 21/78 (26%), Positives = 44/78 (56%)

Query: 218 DLDPVNRPDGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGG 277
           DLD    PD  +        +  +  E  V +E+EE+++++ E + ++  L ++++    
Sbjct: 301 DLDDEFEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEED 360

Query: 278 GGEEEEEEEEEEEEEEEE 295
              ++E++EEEEEEE+E+
Sbjct: 361 EDSDDEDDEEEEEEEKEK 378



 Score = 39.6 bits (93), Expect = 0.002
 Identities = 22/64 (34%), Positives = 34/64 (53%), Gaps = 8/64 (12%)

Query: 251 EEEEEEEEEETKAKETKLK--EKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVSAVD 308
           EEE  +EE E      +LK  E ++L    GEEE++EEEE+ +E  ++ +         +
Sbjct: 260 EEELAKEEAE------RLKKLEAERLRRMRGEEEDDEEEEDSKESADDLDDEFEPDDDDN 313

Query: 309 FGSG 312
           FG G
Sbjct: 314 FGLG 317



 Score = 39.2 bits (92), Expect = 0.002
 Identities = 20/56 (35%), Positives = 32/56 (57%), Gaps = 3/56 (5%)

Query: 248 HEEEEEEEEEEEETKAKET---KLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
            EEE++EEEE+ +  A +       +     G G  EE+EEEEE+  ++E+EE+  
Sbjct: 284 GEEEDDEEEEDSKESADDLDDEFEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDD 339



 Score = 39.2 bits (92), Expect = 0.003
 Identities = 19/57 (33%), Positives = 32/57 (56%)

Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
           R   E++EEEE+ +E     + + +       G G+ EE+EEEEE+  ++E+EE   
Sbjct: 283 RGEEEDDEEEEDSKESADDLDDEFEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDD 339



 Score = 36.1 bits (84), Expect = 0.021
 Identities = 15/64 (23%), Positives = 30/64 (46%), Gaps = 13/64 (20%)

Query: 235 SENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
            ++++    +      + EE+EEEEE              G    +EE+++++ EEEEE+
Sbjct: 302 LDDEFEPDDDDNFGLGQGEEDEEEEE-------------DGVDDEDEEDDDDDLEEEEED 348

Query: 295 EEEE 298
            +  
Sbjct: 349 VDLS 352



 Score = 36.1 bits (84), Expect = 0.022
 Identities = 19/69 (27%), Positives = 34/69 (49%), Gaps = 12/69 (17%)

Query: 233 FQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEE 292
           F+ ++D   G  +   +EEEEE+  ++E +  +              + EEEEE+ +  +
Sbjct: 306 FEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDD------------DDDLEEEEEDVDLSD 353

Query: 293 EEEEEEIGG 301
           EEE+EE   
Sbjct: 354 EEEDEEDED 362



 Score = 30.4 bits (69), Expect = 1.5
 Identities = 11/53 (20%), Positives = 29/53 (54%), Gaps = 8/53 (15%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE--EEI 299
           ++ ++++  +  +  +          GGG  +EE+EEE+ E ++ ++E  +E+
Sbjct: 136 DDFDDDDLGDLASDDRAAHF------GGGEDDEEDEEEQPERKKSKKEVMKEV 182



 Score = 29.2 bits (66), Expect = 4.0
 Identities = 12/26 (46%), Positives = 19/26 (73%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKL 274
           E++EEEEEEE+E K K++    + +L
Sbjct: 366 EDDEEEEEEEKEKKKKKSAESTRSEL 391



 Score = 28.8 bits (65), Expect = 4.5
 Identities = 10/30 (33%), Positives = 19/30 (63%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKK 273
           + +  +E++EEEEEEE+ K K+   +  + 
Sbjct: 360 DEDSDDEDDEEEEEEEKEKKKKKSAESTRS 389



 Score = 28.4 bits (64), Expect = 5.4
 Identities = 11/48 (22%), Positives = 26/48 (54%), Gaps = 2/48 (4%)

Query: 254 EEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE--EEEI 299
           ++ ++++     +  +     GG   EE+EEE+ E ++ ++E  +E I
Sbjct: 136 DDFDDDDLGDLASDDRAAHFGGGEDDEEDEEEQPERKKSKKEVMKEVI 183


>gnl|CDD|110514 pfam01517, HDV_ag, Hepatitis delta virus delta antigen.  The
           hepatitis delta virus (HDV) encodes a single protein,
           the hepatitis delta antigen (HDAg). The central region
           of this protein has been shown to bind RNA. Several
           interactions are also mediated by a coiled-coil region
           at the N terminus of the protein.
          Length = 194

 Score = 41.0 bits (96), Expect = 3e-04
 Identities = 25/66 (37%), Positives = 32/66 (48%), Gaps = 7/66 (10%)

Query: 252 EEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEE----EEEEEEEEEEIGGVVVSAV 307
           ++E  +    KA E K   KK+L  GG     EEEEE     EE+EE E  + G  V  V
Sbjct: 95  DQERRDHRRRKALENK---KKQLSSGGKHLSREEEEELRRLTEEDEERERRVAGPRVGGV 151

Query: 308 DFGSGG 313
           +   GG
Sbjct: 152 NPLDGG 157


>gnl|CDD|218258 pfam04774, HABP4_PAI-RBP1, Hyaluronan / mRNA binding family.  This
           family includes the HABP4 family of hyaluronan-binding
           proteins, and the PAI-1 mRNA-binding protein, PAI-RBP1.
           HABP4 has been observed to bind hyaluronan (a
           glucosaminoglycan), but it is not known whether this is
           its primary role in vivo. It has also been observed to
           bind RNA, but with a lower affinity than that for
           hyaluronan. PAI-1 mRNA-binding protein specifically
           binds the mRNA of type-1 plasminogen activator inhibitor
           (PAI-1), and is thought to be involved in regulation of
           mRNA stability. However, in both cases, the sequence
           motifs predicted to be important for ligand binding are
           not conserved throughout the family, so it is not known
           whether members of this family share a common function.
          Length = 106

 Score = 39.3 bits (92), Expect = 3e-04
 Identities = 17/50 (34%), Positives = 27/50 (54%)

Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
            ++E  E  EE   +E K + ++K        +E   EEEE EEEE++E+
Sbjct: 30  VKDEIAELTEEQGEEEEKNEVEEKQAVEEEANKEGVVEEEEVEEEEDKEM 79



 Score = 33.2 bits (76), Expect = 0.036
 Identities = 18/51 (35%), Positives = 27/51 (52%)

Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
           G  +    E  EE+ EEE K +  + +  ++     G  EEEE EEEE++E
Sbjct: 28  GSVKDEIAELTEEQGEEEEKNEVEEKQAVEEEANKEGVVEEEEVEEEEDKE 78



 Score = 33.2 bits (76), Expect = 0.039
 Identities = 20/51 (39%), Positives = 28/51 (54%)

Query: 242 GGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEE 292
           G  ++   E  EE+ EEEE    E K   +++    G  EEEE EEEE++E
Sbjct: 28  GSVKDEIAELTEEQGEEEEKNEVEEKQAVEEEANKEGVVEEEEVEEEEDKE 78



 Score = 32.0 bits (73), Expect = 0.12
 Identities = 20/51 (39%), Positives = 25/51 (49%), Gaps = 9/51 (17%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
             E   EEEE+ E EE+   +E   KE            EEEE EEEE++E
Sbjct: 37  LTEEQGEEEEKNEVEEKQAVEEEANKEG---------VVEEEEVEEEEDKE 78



 Score = 31.2 bits (71), Expect = 0.19
 Identities = 22/55 (40%), Positives = 25/55 (45%), Gaps = 17/55 (30%)

Query: 246 EVHEE--EEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           E+ EE  EEEE+ E EE                   EEE  +E   EEEE EEEE
Sbjct: 36  ELTEEQGEEEEKNEVEEK---------------QAVEEEANKEGVVEEEEVEEEE 75



 Score = 31.2 bits (71), Expect = 0.20
 Identities = 15/41 (36%), Positives = 24/41 (58%)

Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKL 274
           ++E +  +  E E ++E   EEEE EE + KE  L+E +K 
Sbjct: 47  KNEVEEKQAVEEEANKEGVVEEEEVEEEEDKEMTLEEYEKA 87


>gnl|CDD|217829 pfam03985, Paf1, Paf1.  Members of this family are components of
           the RNA polymerase II associated Paf1 complex. The Paf1
           complex functions during the elongation phase of
           transcription in conjunction with Spt4-Spt5 and
           Spt16-Pob3i.
          Length = 431

 Score = 41.6 bits (98), Expect = 3e-04
 Identities = 17/68 (25%), Positives = 32/68 (47%), Gaps = 4/68 (5%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVV 303
             +  + EE +E+E+EE + +  + +E++    G   EEE  +  E+   E   ++G   
Sbjct: 361 RLDPIDFEEVDEDEDEEEEQRSDEHEEEE----GEDSEEEGSQSREDGSSESSSDVGSDS 416

Query: 304 VSAVDFGS 311
            S  D  S
Sbjct: 417 ESKADKES 424



 Score = 35.9 bits (83), Expect = 0.021
 Identities = 12/56 (21%), Positives = 25/56 (44%)

Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
            E +  E+EEEE+  +E  + +    +E+       G  E   +   + E + ++E
Sbjct: 368 EEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKE 423



 Score = 35.5 bits (82), Expect = 0.034
 Identities = 15/52 (28%), Positives = 26/52 (50%)

Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
              + HEEEE E+ EEE ++++E    E     G   E + ++E   + + E
Sbjct: 380 QRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESASDSDSE 431



 Score = 35.1 bits (81), Expect = 0.041
 Identities = 12/55 (21%), Positives = 26/55 (47%)

Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
            E +  +E EEEE E+ E +  +++     +     G + E + ++E   + + E
Sbjct: 377 EEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESASDSDSE 431



 Score = 34.7 bits (80), Expect = 0.050
 Identities = 13/56 (23%), Positives = 26/56 (46%)

Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
           +  E+   E EEEE E+ EE  ++  +    +     G + E + ++E   + + E
Sbjct: 376 EEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESASDSDSE 431



 Score = 33.6 bits (77), Expect = 0.13
 Identities = 14/55 (25%), Positives = 25/55 (45%), Gaps = 9/55 (16%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
              +   + EE +E+E+         E+++      EEEE E+ EEE  +  E+ 
Sbjct: 359 RARLDPIDFEEVDEDED---------EEEEQRSDEHEEEEGEDSEEEGSQSREDG 404



 Score = 32.0 bits (73), Expect = 0.35
 Identities = 11/55 (20%), Positives = 22/55 (40%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           E E   +E EEEE E+  +      ++           + E + ++E   + + E
Sbjct: 377 EEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESASDSDSE 431



 Score = 32.0 bits (73), Expect = 0.40
 Identities = 13/60 (21%), Positives = 23/60 (38%), Gaps = 5/60 (8%)

Query: 244 EREVHEEEEE-----EEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
             E  +EEEE      EEEE E   +E     +        +   + E + ++E   + +
Sbjct: 370 VDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESASDSD 429



 Score = 31.2 bits (71), Expect = 0.65
 Identities = 11/55 (20%), Positives = 25/55 (45%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           E E    +E EEEE E+++ + ++ +E            + E + ++E   + + 
Sbjct: 376 EEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESASDSDS 430


>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
          Length = 106

 Score = 38.8 bits (91), Expect = 4e-04
 Identities = 15/24 (62%), Positives = 18/24 (75%)

Query: 278 GGEEEEEEEEEEEEEEEEEEEIGG 301
             EE++EEEEEEEE+EE EEE   
Sbjct: 76  AAEEKKEEEEEEEEKEESEEEAAA 99



 Score = 37.2 bits (87), Expect = 0.002
 Identities = 15/25 (60%), Positives = 18/25 (72%)

Query: 277 GGGEEEEEEEEEEEEEEEEEEEIGG 301
              E++EEEEEEEE+EE EEE   G
Sbjct: 76  AAEEKKEEEEEEEEKEESEEEAAAG 100



 Score = 33.0 bits (76), Expect = 0.045
 Identities = 12/23 (52%), Positives = 15/23 (65%)

Query: 276 GGGGEEEEEEEEEEEEEEEEEEE 298
                  EE++EEEEEEEE+EE 
Sbjct: 71  AAAAAAAEEKKEEEEEEEEKEES 93



 Score = 31.5 bits (72), Expect = 0.15
 Identities = 14/22 (63%), Positives = 16/22 (72%)

Query: 280 EEEEEEEEEEEEEEEEEEEIGG 301
           EEEEEEEE+EE EEE    +G 
Sbjct: 82  EEEEEEEEKEESEEEAAAGLGA 103



 Score = 28.8 bits (65), Expect = 1.4
 Identities = 15/46 (32%), Positives = 18/46 (39%), Gaps = 19/46 (41%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
                EE++EEE                   EEEEE+EE EEE   
Sbjct: 73  AAAAAEEKKEEE-------------------EEEEEKEESEEEAAA 99



 Score = 26.8 bits (60), Expect = 7.2
 Identities = 12/25 (48%), Positives = 15/25 (60%)

Query: 241 KGGEREVHEEEEEEEEEEEETKAKE 265
              E +  EEEEEEE+EE E +A  
Sbjct: 75  AAAEEKKEEEEEEEEKEESEEEAAA 99


>gnl|CDD|223017 PHA03235, PHA03235, DNA packaging protein UL33; Provisional.
          Length = 409

 Score = 41.3 bits (97), Expect = 4e-04
 Identities = 44/182 (24%), Positives = 74/182 (40%), Gaps = 28/182 (15%)

Query: 23  CNISYNEYNRSNIPAICRNFSHEYINDNDVDAEAMERMVSIIVPIFFGIIIIL-GFFGNA 81
             + +NE N  N      +  H  +ND  V  E +    +     F  ++II  G   N 
Sbjct: 2   VLLVHNETNDEN------DLLH--VNDTCVPTEGLS--AARTTETFINLLIISVGGPLNL 51

Query: 82  LVVVVVAFNQQMR--STTNLLIINLAIADLLFIVCCVPFTATD---YVTTCWPFGDVWCK 136
           +V+V      ++   ST  L + NL +A+LL  V  +PF        ++     G   CK
Sbjct: 52  IVLVTQLLANRVHGFSTPTLYMTNLYLANLL-TVFVLPFIMLSNQGLLSGS-VAG---CK 106

Query: 137 MYLIVVTAYAS----VYTLVLMSLDRFLAVVHPIASMSVRTERNALKAILITWIVIVLTA 192
                +  YAS      T+ L++ DR+  V+H          R+  K + +TW   ++ +
Sbjct: 107 --FASLLYYASCTVGFATVALIAADRY-RVIHQRTRARSSAYRSTYKILGLTWFASLICS 163

Query: 193 VP 194
            P
Sbjct: 164 GP 165


>gnl|CDD|216269 pfam01056, Myc_N, Myc amino-terminal region.  The myc family
           belongs to the basic helix-loop-helix leucine zipper
           class of transcription factors, see pfam00010. Myc forms
           a heterodimer with Max, and this complex regulates cell
           growth through direct activation of genes involved in
           cell replication. Mutations in the C-terminal 20
           residues of this domain cause unique changes in the
           induction of apoptosis, transformation, and G2 arrest.
          Length = 329

 Score = 41.1 bits (96), Expect = 5e-04
 Identities = 19/27 (70%), Positives = 21/27 (77%)

Query: 278 GGEEEEEEEEEEEEEEEEEEEIGGVVV 304
           G + E EE+EEEEEEEEEEEEI  V V
Sbjct: 223 GSDSESEEDEEEEEEEEEEEEIDVVTV 249



 Score = 35.3 bits (81), Expect = 0.032
 Identities = 16/21 (76%), Positives = 17/21 (80%)

Query: 277 GGGEEEEEEEEEEEEEEEEEE 297
           G   E EE+EEEEEEEEEEEE
Sbjct: 223 GSDSESEEDEEEEEEEEEEEE 243



 Score = 30.3 bits (68), Expect = 1.2
 Identities = 14/17 (82%), Positives = 14/17 (82%)

Query: 244 EREVHEEEEEEEEEEEE 260
           E E  EEEEEEEEEEEE
Sbjct: 227 ESEEDEEEEEEEEEEEE 243



 Score = 28.4 bits (63), Expect = 4.8
 Identities = 13/19 (68%), Positives = 15/19 (78%)

Query: 242 GGEREVHEEEEEEEEEEEE 260
           G + E  E+EEEEEEEEEE
Sbjct: 223 GSDSESEEDEEEEEEEEEE 241


>gnl|CDD|220691 pfam10323, 7TM_GPCR_Srv, Serpentine type 7TM GPCR chemoreceptor
           Srv.  Chemoreception is mediated in Caenorhabditis
           elegans by members of the seven-transmembrane
           G-protein-coupled receptor class (7TM GPCRs) of proteins
           which are of the serpentine type. Srv is a member of the
           Srg superfamily of chemoreceptors. Chemoperception is
           one of the central senses of soil nematodes like C.
           elegans which are otherwise 'blind' and 'deaf'.
          Length = 283

 Score = 40.7 bits (96), Expect = 6e-04
 Identities = 32/154 (20%), Positives = 62/154 (40%), Gaps = 35/154 (22%)

Query: 60  MVSIIVPIFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFT 119
           +  + +P++F I+I L    N        F    ++T   L++   IAD++        T
Sbjct: 4   LSIVTLPLYFLILICLLKLRN----RSKTF----KTTFYTLLLQHCIADII--------T 47

Query: 120 ATDYVTTC----WPFGDV-------WC-KMYLIVVTAYASVY----TLVLMSLDRFLAVV 163
             +Y          +          +    Y     AY ++Y     +  +S +R+LA+ 
Sbjct: 48  LLNYYFGAMFPKIGWFRDFYFNYQHYYAAAYY--NIAYYTLYIRCTGISFLSTNRYLAIC 105

Query: 164 HPIASMSVRTERNAL-KAILITWIVIVLTAVPVF 196
            PI+ ++   +   L K ILI W+  +L ++ V 
Sbjct: 106 FPISRLTKIWQSARLWKIILIYWLPGLLISLVVL 139


>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO.  Bacillus
           spores are protected by a protein shell consisting of
           over 50 different polypeptides, known as the coat. This
           family of proteins has an important morphogenetic role
           in coat assembly, it is involved in the assembly of at
           least 5 different coat proteins including CotB, CotG,
           CotS, CotSA and CotW. It is likely to act at a late
           stage of coat assembly.
          Length = 185

 Score = 39.8 bits (93), Expect = 6e-04
 Identities = 19/76 (25%), Positives = 38/76 (50%)

Query: 223 NRPDGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEE 282
                  +  F  +    K  E+E  +E  + +EEE++ + +E + +++          E
Sbjct: 21  EEAKANMQKTFIIKKADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAE 80

Query: 283 EEEEEEEEEEEEEEEE 298
           ++E+EE  +EEE+EEE
Sbjct: 81  QQEKEEIAQEEEKEEE 96



 Score = 36.0 bits (83), Expect = 0.011
 Identities = 20/55 (36%), Positives = 32/55 (58%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           E    +EEE++ E EE  K KE    E++ +     +EE  +EEE+EEE E+ ++
Sbjct: 48  EHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEEEKEEEAEDVKQ 102



 Score = 28.6 bits (64), Expect = 2.8
 Identities = 17/60 (28%), Positives = 27/60 (45%), Gaps = 8/60 (13%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVSAVD 308
            +E+EEE+E  +   K  + ++K         E EE E+E+E  E E E+I         
Sbjct: 36  ADEKEEEKENSDEHVKSKEEEQK--------IEYEEAEKEKEAGEPEREDIAEQQEKEEI 87


>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
          Length = 330

 Score = 40.6 bits (96), Expect = 7e-04
 Identities = 20/49 (40%), Positives = 26/49 (53%), Gaps = 5/49 (10%)

Query: 253 EEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
           +++  +EE K   +   +         EEEEEEEEEEEEEE  EEE   
Sbjct: 280 DKDALDEELKEVLSAQAQAAA-----AEEEEEEEEEEEEEEPSEEEAAA 323



 Score = 31.0 bits (71), Expect = 0.71
 Identities = 16/50 (32%), Positives = 24/50 (48%), Gaps = 8/50 (16%)

Query: 242 GGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEE 291
             +  + EE +E    + +  A E + +E+        EEEEEEE  EEE
Sbjct: 279 ADKDALDEELKEVLSAQAQAAAAEEEEEEE--------EEEEEEEPSEEE 320


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 40.5 bits (95), Expect = 7e-04
 Identities = 20/46 (43%), Positives = 30/46 (65%), Gaps = 6/46 (13%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
           E E+E  +EEEE + +E K +E+K        ++EEE +EEEE+EE
Sbjct: 30  EVEKEVPDEEEEEEKEEKKEEEEKT------TDKEEEVDEEEEKEE 69



 Score = 40.1 bits (94), Expect = 0.001
 Identities = 22/55 (40%), Positives = 33/55 (60%), Gaps = 17/55 (30%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           E+EV +EEEEEE+EE++ +                 EE+  ++EEE +EEEE+EE
Sbjct: 32  EKEVPDEEEEEEKEEKKEE-----------------EEKTTDKEEEVDEEEEKEE 69



 Score = 40.1 bits (94), Expect = 0.001
 Identities = 22/54 (40%), Positives = 30/54 (55%), Gaps = 14/54 (25%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
           E E    +EEEEEE+EE K +E              E+  ++EEE +EEEE+EE
Sbjct: 30  EVEKEVPDEEEEEEKEEKKEEE--------------EKTTDKEEEVDEEEEKEE 69



 Score = 32.8 bits (75), Expect = 0.22
 Identities = 16/43 (37%), Positives = 21/43 (48%), Gaps = 6/43 (13%)

Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEE 283
           +  +    EEE +EEEE+EE K K  K+KE         E E 
Sbjct: 50  EEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTT------EWEL 86



 Score = 32.0 bits (73), Expect = 0.36
 Identities = 17/53 (32%), Positives = 28/53 (52%), Gaps = 5/53 (9%)

Query: 236 ENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEE 288
           E +  K  ++E  E+  ++EEE +E + KE K K+ KK+     +E   E E 
Sbjct: 39  EEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKV-----KETTTEWEL 86



 Score = 29.4 bits (66), Expect = 3.0
 Identities = 13/24 (54%), Positives = 18/24 (75%)

Query: 280 EEEEEEEEEEEEEEEEEEEIGGVV 303
           EEEEEE+EE++EEEE+  +    V
Sbjct: 38  EEEEEEKEEKKEEEEKTTDKEEEV 61



 Score = 29.0 bits (65), Expect = 4.3
 Identities = 17/39 (43%), Positives = 23/39 (58%)

Query: 259 EETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
           E+ KA  +++    KLG    E+EE EEE+EE E E  E
Sbjct: 476 EDPKAFASRIYRMIKLGLSIDEDEEVEEEDEEAEVETTE 514



 Score = 28.6 bits (64), Expect = 5.2
 Identities = 15/29 (51%), Positives = 19/29 (65%)

Query: 280 EEEEEEEEEEEEEEEEEEEIGGVVVSAVD 308
           E  +EEEEEE+EE++EEEE        VD
Sbjct: 34  EVPDEEEEEEKEEKKEEEEKTTDKEEEVD 62


>gnl|CDD|217049 pfam02459, Adeno_terminal, Adenoviral DNA terminal protein.  This
           protein is covalently attached to the terminii of
           replicating DNA in vivo.
          Length = 548

 Score = 40.8 bits (96), Expect = 8e-04
 Identities = 26/60 (43%), Positives = 28/60 (46%), Gaps = 19/60 (31%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVSAVD 308
            E EEEEEEEEE                     EEEEEEEEEEE   EEE+   V  A+ 
Sbjct: 303 PEPEEEEEEEEE-------------------VPEEEEEEEEEEERTFEEEVRATVAEAIR 343



 Score = 39.2 bits (92), Expect = 0.002
 Identities = 24/63 (38%), Positives = 25/63 (39%), Gaps = 19/63 (30%)

Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVV 304
           R        E EEEEE                   EEEE  EEEEEEEEEEE      V 
Sbjct: 295 RRRPPPSPPEPEEEEE-------------------EEEEVPEEEEEEEEEEERTFEEEVR 335

Query: 305 SAV 307
           + V
Sbjct: 336 ATV 338



 Score = 28.5 bits (64), Expect = 5.1
 Identities = 16/23 (69%), Positives = 17/23 (73%)

Query: 243 GEREVHEEEEEEEEEEEETKAKE 265
            E EV EEEEEEEEEEE T  +E
Sbjct: 311 EEEEVPEEEEEEEEEEERTFEEE 333


>gnl|CDD|220102 pfam09073, BUD22, BUD22.  BUD22 has been shown in yeast to be a
           nuclear protein involved in bud-site selection. It plays
           a role in positioning the proximal bud pole signal. More
           recently it has been shown to be involved in ribosome
           biogenesis.
          Length = 424

 Score = 40.2 bits (94), Expect = 0.001
 Identities = 13/59 (22%), Positives = 23/59 (38%), Gaps = 1/59 (1%)

Query: 241 KGGEREVHEEEEEE-EEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           +  +    EE  E+  ++EEE  +      +   +     +EEE EE       E+  E
Sbjct: 176 ESEDESKSEESAEDDSDDEEEEDSDSEDYSQYDGMLVDSSDEEEGEEAPSINYNEDTSE 234



 Score = 37.5 bits (87), Expect = 0.008
 Identities = 13/56 (23%), Positives = 20/56 (35%), Gaps = 6/56 (10%)

Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
              +  ++EEEE+ + E+             L     EEE EE       E+  E 
Sbjct: 186 SAEDDSDDEEEEDSDSEDYSQ------YDGMLVDSSDEEEGEEAPSINYNEDTSES 235



 Score = 36.0 bits (83), Expect = 0.021
 Identities = 10/50 (20%), Positives = 28/50 (56%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           ++ ++ +++E +  + +   +E +       EE  E++ ++EEEE+ + E
Sbjct: 153 KKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSE 202



 Score = 34.4 bits (79), Expect = 0.078
 Identities = 9/57 (15%), Positives = 31/57 (54%)

Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
            ++   +++ ++ +++E  ++ +   +E+ +       EE  E++ ++EEEE+ +  
Sbjct: 146 AKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSE 202



 Score = 34.0 bits (78), Expect = 0.082
 Identities = 16/79 (20%), Positives = 34/79 (43%), Gaps = 9/79 (11%)

Query: 230 KTIFQS-ENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKK--------LGGGGGE 280
           K + +S      K    E   ++ + +++ +++K KE K    K               E
Sbjct: 126 KNVLESIMEGLDKILGIETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEE 185

Query: 281 EEEEEEEEEEEEEEEEEEI 299
             E++ ++EEEE+ + E+ 
Sbjct: 186 SAEDDSDDEEEEDSDSEDY 204



 Score = 32.9 bits (75), Expect = 0.23
 Identities = 11/51 (21%), Positives = 25/51 (49%), Gaps = 6/51 (11%)

Query: 253 EEEEEEEETKAKETKLKEKKKL----GG--GGGEEEEEEEEEEEEEEEEEE 297
           + EE    +K  + K      L    GG   G E+E++++E+ + ++  ++
Sbjct: 252 DSEESSPPSKKPKEKKTSSTFLPSLMGGYFSGSEDEDDDDEDIDPDQVVKK 302



 Score = 32.1 bits (73), Expect = 0.32
 Identities = 14/69 (20%), Positives = 36/69 (52%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVV 303
           +++  + +++E +E  +   +E    E +       E++ ++EEEE+ + E+  +  G++
Sbjct: 152 KKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSEDYSQYDGML 211

Query: 304 VSAVDFGSG 312
           V + D   G
Sbjct: 212 VDSSDEEEG 220



 Score = 29.0 bits (65), Expect = 3.1
 Identities = 13/47 (27%), Positives = 21/47 (44%)

Query: 262 KAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVSAVD 308
           K K  K  +K K        ++++EEE E E+E + E      S  +
Sbjct: 148 KGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDE 194


>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain.  This is a family of proteins of
           approximately 300 residues, found in plants and
           vertebrates. They contain a highly conserved DDRGK
           motif.
          Length = 189

 Score = 39.3 bits (92), Expect = 0.001
 Identities = 21/59 (35%), Positives = 35/59 (59%), Gaps = 4/59 (6%)

Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           K   R+  E EEEE EE ++ + K    +++++      EEE E+++EEEE +E EE+ 
Sbjct: 14  KQARRQQREAEEEEREERKKLEEKREGERKEEEEL----EEEREKKKEEEERKEREEQA 68



 Score = 34.7 bits (80), Expect = 0.030
 Identities = 19/49 (38%), Positives = 32/49 (65%)

Query: 251 EEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
            EE++   ++ +A+E + +E+KKL      E +EEEE EEE E+++EE 
Sbjct: 11  LEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEE 59



 Score = 33.5 bits (77), Expect = 0.087
 Identities = 19/56 (33%), Positives = 27/56 (48%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           E    ++EEEE +E EE   KE +  EK K      EE  ++   +EE  E  E+ 
Sbjct: 50  EEREKKKEEEERKEREEQARKEQEEYEKLKSSFVVEEEGTDKLSADEESNELLEDF 105



 Score = 32.7 bits (75), Expect = 0.14
 Identities = 22/68 (32%), Positives = 32/68 (47%), Gaps = 18/68 (26%)

Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEE----------E 290
              +RE   +EEEE EEE E K +E + KE+        EE+  +E+EE          E
Sbjct: 34  LEEKREGERKEEEELEEEREKKKEEEERKER--------EEQARKEQEEYEKLKSSFVVE 85

Query: 291 EEEEEEEE 298
           EE  ++  
Sbjct: 86  EEGTDKLS 93



 Score = 31.6 bits (72), Expect = 0.30
 Identities = 21/57 (36%), Positives = 31/57 (54%), Gaps = 14/57 (24%)

Query: 244 EREVHEEEEEE--EEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           ER+  EE+ E   +EEEE  + +E K            +EEEE +E EE+  +E+EE
Sbjct: 30  ERKKLEEKREGERKEEEELEEEREKK------------KEEEERKEREEQARKEQEE 74


>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
           This family represents Cwf15/Cwc15 (from
           Schizosaccharomyces pombe and Saccharomyces cerevisiae
           respectively) and their homologues. The function of
           these proteins is unknown, but they form part of the
           spliceosome and are thus thought to be involved in mRNA
           splicing.
          Length = 241

 Score = 39.7 bits (93), Expect = 0.001
 Identities = 18/67 (26%), Positives = 32/67 (47%), Gaps = 8/67 (11%)

Query: 235 SENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEE--EEEE 292
             +        +   ++++ E+E      +  K+K+++       EE+E EEEE   EEE
Sbjct: 121 DSDSSSDSDSSDDDSDDDDSEDETAALLRELEKIKKERA------EEKEREEEEKAAEEE 174

Query: 293 EEEEEEI 299
           +  EEEI
Sbjct: 175 KAREEEI 181



 Score = 29.3 bits (66), Expect = 2.3
 Identities = 7/54 (12%), Positives = 27/54 (50%)

Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           E  E+++EE+E + +   ++ +  +          +  +++ ++++ E+E   +
Sbjct: 94  EGDEDDDEEDEIKRKRIEEDARNSDADDSDSSSDSDSSDDDSDDDDSEDETAAL 147



 Score = 28.9 bits (65), Expect = 3.3
 Identities = 14/28 (50%), Positives = 19/28 (67%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGG 276
           E  EE+E EEEE  A+E K +E++ L G
Sbjct: 157 ERAEEKEREEEEKAAEEEKAREEEILTG 184


>gnl|CDD|220605 pfam10156, Med17, Subunit 17 of Mediator complex.  This Mediator
           complex subunit was formerly known as Srb4 in yeasts or
           Trap80 in Drosophila and human. The Med17 subunit is
           located within the head domain and is essential for cell
           viability to the extent that a mutant strain of
           cerevisiae lacking it shows all RNA polymerase
           II-dependent transcription ceasing at non-permissive
           temperatures.
          Length = 454

 Score = 40.0 bits (94), Expect = 0.001
 Identities = 14/58 (24%), Positives = 31/58 (53%), Gaps = 6/58 (10%)

Query: 242 GGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           G  R++ EE   EE  +E  K   ++  ++++      ++EE++ ++ EE ++  EE 
Sbjct: 44  GSFRDLTEESLREEIAKEAAKIDFSEESDEEE------DDEEDDNDDSEENKDTVEEF 95


>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
          Length = 211

 Score = 39.4 bits (92), Expect = 0.001
 Identities = 18/64 (28%), Positives = 29/64 (45%), Gaps = 14/64 (21%)

Query: 236 ENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
           E D CK  E +  ++ +EE+ E EE + +E                E+ EE  E + EE 
Sbjct: 14  EEDCCKENENKEEDKGKEEDLEFEEIEKEEII--------------EDSEESNEVKIEEL 59

Query: 296 EEEI 299
           ++E 
Sbjct: 60  KDEN 63



 Score = 37.8 bits (88), Expect = 0.003
 Identities = 18/59 (30%), Positives = 29/59 (49%), Gaps = 5/59 (8%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGV 302
           +   HE  EE+  +E E K ++   +E  +      EE E+EE  E+ EE  E +I  +
Sbjct: 6   KDAKHENMEEDCCKENENKEEDKGKEEDLEF-----EEIEKEEIIEDSEESNEVKIEEL 59



 Score = 30.9 bits (70), Expect = 0.57
 Identities = 18/51 (35%), Positives = 27/51 (52%), Gaps = 9/51 (17%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           E ++ + E  EE   KE + KE         E++ +EE+ E EE E+EE I
Sbjct: 4   ECKDAKHENMEEDCCKENENKE---------EDKGKEEDLEFEEIEKEEII 45


>gnl|CDD|227458 COG5129, MAK16, Nuclear protein with HMG-like acidic region
           [General function prediction only].
          Length = 303

 Score = 39.6 bits (92), Expect = 0.001
 Identities = 26/71 (36%), Positives = 34/71 (47%), Gaps = 1/71 (1%)

Query: 228 YNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEE 287
           +NK   + E    +  ER V EEEE + E E  T   E +  +KK L    G ++  E  
Sbjct: 186 WNKAATEREKRQDEK-ERYVEEEEESDTELEAVTDDSEKEKTKKKDLEKWLGSDQSMETS 244

Query: 288 EEEEEEEEEEE 298
           E EEEE  E E
Sbjct: 245 ESEEEESSESE 255



 Score = 31.6 bits (71), Expect = 0.49
 Identities = 18/57 (31%), Positives = 28/57 (49%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVS 305
           EEEE  E E +E + ++ K K +K+      +  +     E E+E E E+I  V  S
Sbjct: 247 EEEESSESESDEDEDEDNKGKIRKRKTDDAKKSRKPHIHIEYEQERENEKIPAVQHS 303



 Score = 29.2 bits (65), Expect = 2.8
 Identities = 16/65 (24%), Positives = 27/65 (41%)

Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
           QS        E     E +E+E+E+ + K ++ K  + KK        E E+E E E+  
Sbjct: 239 QSMETSESEEEESSESESDEDEDEDNKGKIRKRKTDDAKKSRKPHIHIEYEQERENEKIP 298

Query: 294 EEEEE 298
             +  
Sbjct: 299 AVQHS 303


>gnl|CDD|219461 pfam07543, PGA2, Protein trafficking PGA2.  A Saccharomyces
           cerevisiae member of this family (PGA2) is an ER protein
           which has been implicated in protein trafficking.
          Length = 139

 Score = 37.9 bits (88), Expect = 0.001
 Identities = 20/63 (31%), Positives = 31/63 (49%), Gaps = 1/63 (1%)

Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
           K G +   +E E+E  E EE + K+ K+     L GG      EE+ ++EE+EE+     
Sbjct: 36  KLGAKAQEKEHEKERAEREEAREKKAKI-SPNALRGGATAGHGEEDTDDEEDEEDFATPS 94

Query: 301 GVV 303
            V 
Sbjct: 95  AVP 97



 Score = 31.3 bits (71), Expect = 0.30
 Identities = 21/71 (29%), Positives = 37/71 (52%), Gaps = 18/71 (25%)

Query: 241 KGGEREVHEEEEEEEEEEEET------------KAKETKLKEKKKLGGGGGEEEEEEEEE 288
           +GG    H EE+ ++EE+EE             KA++ + K  +KL       E EE+  
Sbjct: 69  RGGATAGHGEEDTDDEEDEEDFATPSAVPQWGKKARKRQRKVIRKL------LEAEEQLR 122

Query: 289 EEEEEEEEEEI 299
           E++ ++E+E+I
Sbjct: 123 EDQYDDEDEDI 133


>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
           domain.  This domain is found in a number of different
           types of plant proteins including NAM-like proteins.
          Length = 147

 Score = 38.1 bits (89), Expect = 0.002
 Identities = 19/57 (33%), Positives = 32/57 (56%)

Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           G     E E+E++E   E+K  E + K K+KL     + ++EE E+E+E+EE   + 
Sbjct: 41  GSTSNEENEDEDDESTAESKRPEGRKKAKEKLRRDKLKAKKEEAEKEKEKEERFMKA 97



 Score = 37.7 bits (88), Expect = 0.002
 Identities = 21/95 (22%), Positives = 43/95 (45%), Gaps = 12/95 (12%)

Query: 205 TYASELHTACVFLDLDPVNRPDGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAK 264
           + ++E +            RP+G  K       +  +  + +  +EE E+E+E+EE   K
Sbjct: 42  STSNEENEDEDDESTAESKRPEGRKK-----AKEKLRRDKLKAKKEEAEKEKEKEERFMK 96

Query: 265 ETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
                EK++        E E+++ E +  +EE++I
Sbjct: 97  ALAEAEKERA-------ELEKKKAEAKLMKEEKKI 124


>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P.  This model
           represents the L12P protein of the large (50S) subunit
           of the archaeal ribosome.
          Length = 105

 Score = 36.9 bits (86), Expect = 0.002
 Identities = 18/22 (81%), Positives = 18/22 (81%)

Query: 280 EEEEEEEEEEEEEEEEEEEIGG 301
           EEEEEEEEEEEEEEE EEE   
Sbjct: 77  EEEEEEEEEEEEEEESEEEAMA 98



 Score = 36.2 bits (84), Expect = 0.004
 Identities = 18/22 (81%), Positives = 19/22 (86%)

Query: 280 EEEEEEEEEEEEEEEEEEEIGG 301
           EEEEEEEEEEEEEE EEE + G
Sbjct: 78  EEEEEEEEEEEEEESEEEAMAG 99



 Score = 36.2 bits (84), Expect = 0.004
 Identities = 21/36 (58%), Positives = 23/36 (63%), Gaps = 1/36 (2%)

Query: 275 GGGGGEEEEEEEEEEEEEEEEEEEIGGVVVSAVDFG 310
                EEEEEEEEEEEEEEEE EE     + A+ FG
Sbjct: 71  AAAAAEEEEEEEEEEEEEEEESEEEAMAGLGAL-FG 105



 Score = 34.6 bits (80), Expect = 0.012
 Identities = 17/24 (70%), Positives = 17/24 (70%)

Query: 275 GGGGGEEEEEEEEEEEEEEEEEEE 298
                 EEEEEEEEEEEEEEEE E
Sbjct: 70  AAAAAAEEEEEEEEEEEEEEEESE 93



 Score = 32.7 bits (75), Expect = 0.055
 Identities = 15/24 (62%), Positives = 15/24 (62%)

Query: 275 GGGGGEEEEEEEEEEEEEEEEEEE 298
                    EEEEEEEEEEEEEEE
Sbjct: 67  AAAAAAAAAEEEEEEEEEEEEEEE 90



 Score = 31.6 bits (72), Expect = 0.16
 Identities = 14/24 (58%), Positives = 14/24 (58%)

Query: 275 GGGGGEEEEEEEEEEEEEEEEEEE 298
                     EEEEEEEEEEEEEE
Sbjct: 66  AAAAAAAAAAEEEEEEEEEEEEEE 89



 Score = 27.3 bits (61), Expect = 4.5
 Identities = 19/41 (46%), Positives = 19/41 (46%), Gaps = 19/41 (46%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEE 289
             EEEEEEEEEE                   EEEEEE EEE
Sbjct: 74  AAEEEEEEEEEE-------------------EEEEEESEEE 95



 Score = 26.9 bits (60), Expect = 6.6
 Identities = 13/24 (54%), Positives = 14/24 (58%)

Query: 242 GGEREVHEEEEEEEEEEEETKAKE 265
                  EEEEEEEEEEEE +  E
Sbjct: 70  AAAAAAEEEEEEEEEEEEEEEESE 93



 Score = 26.5 bits (59), Expect = 9.0
 Identities = 13/24 (54%), Positives = 16/24 (66%)

Query: 242 GGEREVHEEEEEEEEEEEETKAKE 265
                  EEEEEEEEEEEE +++E
Sbjct: 71  AAAAAEEEEEEEEEEEEEEEESEE 94


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
           family consists of several hypothetical bacterial
           proteins of around 200 residues in length. The function
           of this family is unknown.
          Length = 214

 Score = 38.6 bits (90), Expect = 0.002
 Identities = 21/61 (34%), Positives = 33/61 (54%)

Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
           K  ++E  E EE +EEE+E   +++ + K   +      EEE EEE+EE  +E E+E   
Sbjct: 53  KSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEE 112

Query: 301 G 301
            
Sbjct: 113 K 113



 Score = 37.8 bits (88), Expect = 0.003
 Identities = 14/56 (25%), Positives = 26/56 (46%)

Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
            + E  E    E++E++    KE +  E++         +E E+E EE+ E   E+
Sbjct: 65  VKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTESNVEK 120



 Score = 37.4 bits (87), Expect = 0.005
 Identities = 21/55 (38%), Positives = 32/55 (58%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           E +  +++E  E EE + + KE    E K+  G   +E+EE EEE EEE+EE  +
Sbjct: 50  EAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSD 104



 Score = 35.9 bits (83), Expect = 0.015
 Identities = 22/59 (37%), Positives = 29/59 (49%)

Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVSAVD 308
           E EE +EEE+E    E K  +          EEE EEE+EE  +E E+E      S V+
Sbjct: 61  EIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTESNVE 119



 Score = 34.3 bits (79), Expect = 0.049
 Identities = 14/55 (25%), Positives = 31/55 (56%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           E+E    E++E++ + E + +E++ + +++      E E+E EE+ E   E+E  
Sbjct: 69  EKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTESNVEKEIT 123



 Score = 33.6 bits (77), Expect = 0.097
 Identities = 13/49 (26%), Positives = 27/49 (55%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
           E++E++ + E+E +  E + +E+ +      E+E EE+ E   E+E   
Sbjct: 76  EDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTESNVEKEITN 124



 Score = 30.5 bits (69), Expect = 0.85
 Identities = 17/54 (31%), Positives = 24/54 (44%), Gaps = 6/54 (11%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
           E +   E+E+EE EEE  +  E    E +K      E EE+ E   E+E     
Sbjct: 79  EDKGDAEKEDEESEEENEEEDEESSDENEK------ETEEKTESNVEKEITNPS 126


>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin.  Trichoplein
           or mitostatin, was first defined as a meiosis-specific
           nuclear structural protein. It has since been linked
           with mitochondrial movement. It is associated with the
           mitochondrial outer membrane, and over-expression leads
           to reduction in mitochondrial motility whereas lack of
           it enhances mitochondrial movement. The activity appears
           to be mediated through binding the mitochondria to the
           actin intermediate filaments (IFs).
          Length = 349

 Score = 39.1 bits (92), Expect = 0.002
 Identities = 24/56 (42%), Positives = 29/56 (51%), Gaps = 8/56 (14%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           ER   +EEE+E E EEE K  E   +EK        E EEE E E  E +EE+E  
Sbjct: 136 ERIERKEEEKEREREEELKILE-YQREK-------AEREEEREAERRERKEEKERE 183



 Score = 35.7 bits (83), Expect = 0.028
 Identities = 20/50 (40%), Positives = 27/50 (54%), Gaps = 6/50 (12%)

Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
           RE  E EEE E E  E K +  K +E  +L      ++EE E+E EE +E
Sbjct: 160 REKAEREEEREAERRERKEE--KEREVARL----RAQQEEAEDEREELDE 203



 Score = 33.0 bits (76), Expect = 0.17
 Identities = 17/56 (30%), Positives = 27/56 (48%), Gaps = 7/56 (12%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           E++  + EE+EEE   +   +E +LK          EEEE E + +EE  E    +
Sbjct: 28  EKKRIKAEEKEEERRIDEMMEEERLKALA-------EEEERERKRKEERREGRAVL 76



 Score = 31.8 bits (73), Expect = 0.48
 Identities = 19/63 (30%), Positives = 30/63 (47%), Gaps = 8/63 (12%)

Query: 245 REVHEEEEEEEEEEEETKAKETK--------LKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
            +  E+EE E+E  E+ + K  +        ++EK++      EEE EE E   EEE E 
Sbjct: 266 EKQAEDEELEQENAEKRRMKRLEHRRELEQQIEEKEERRAAEREEELEEGERLREEEAER 325

Query: 297 EEI 299
           +  
Sbjct: 326 QAR 328



 Score = 31.4 bits (72), Expect = 0.55
 Identities = 18/51 (35%), Positives = 31/51 (60%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           +EE E +E ++E +  E + ++K++L     E+ EE+EE  +EE  EEE  
Sbjct: 210 QEEYERKERQKEKEEAEKRRRQKQELQRAREEQIEEKEERLQEERAEEEAE 260



 Score = 31.0 bits (71), Expect = 0.72
 Identities = 17/57 (29%), Positives = 29/57 (50%), Gaps = 7/57 (12%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKE--KKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           E E  +   EEEE E + K +  + +   ++++     EE E+  +EE EE  +E E
Sbjct: 48  EEERLKALAEEEERERKRKEERREGRAVLQEQI-----EEREKRRQEEYEERLQERE 99



 Score = 31.0 bits (71), Expect = 0.72
 Identities = 25/64 (39%), Positives = 35/64 (54%), Gaps = 13/64 (20%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEE--------EEEEEEEEEEEEEE 295
           ERE  +E  E  +EE+E +A+E + K+KK       EE         E +EEE+E E EE
Sbjct: 97  EREQMDEIIERIQEEDEAEAQEKREKQKKLR-----EEIDEFNEERIERKEEEKEREREE 151

Query: 296 EEEI 299
           E +I
Sbjct: 152 ELKI 155



 Score = 30.6 bits (70), Expect = 1.1
 Identities = 21/58 (36%), Positives = 32/58 (55%), Gaps = 3/58 (5%)

Query: 244 EREVHEE-EEEEEEEEEETKAKETKLKEKKKLGGGGGEE--EEEEEEEEEEEEEEEEE 298
           ER + E  EEE  +   E + +E K KE+++ G    +E  EE E+  +EE EE  +E
Sbjct: 40  ERRIDEMMEEERLKALAEEEERERKRKEERREGRAVLQEQIEEREKRRQEEYEERLQE 97



 Score = 28.7 bits (65), Expect = 3.6
 Identities = 18/64 (28%), Positives = 31/64 (48%), Gaps = 14/64 (21%)

Query: 244 EREVHEEEEEEE--------EEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
           ER   +EE+E E        EE E+ + +  +L+          +EE E +E ++E+EE 
Sbjct: 172 ERRERKEEKEREVARLRAQQEEAEDEREELDELRADL------YQEEYERKERQKEKEEA 225

Query: 296 EEEI 299
           E+  
Sbjct: 226 EKRR 229



 Score = 28.7 bits (65), Expect = 3.7
 Identities = 18/54 (33%), Positives = 30/54 (55%)

Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           E +E +E ++E+EE  K +  K + ++       E+EE  +EE  EEE E E +
Sbjct: 211 EEYERKERQKEKEEAEKRRRQKQELQRAREEQIEEKEERLQEERAEEEAERERM 264



 Score = 28.0 bits (63), Expect = 6.7
 Identities = 18/63 (28%), Positives = 27/63 (42%), Gaps = 13/63 (20%)

Query: 244 EREVHEEEEEEEEE-EEETKAKETKLKEKKKLGGGGGEEEEEEE------EEEEEEEEEE 296
           E E   E  E +EE E E      + +E +       E EE +E      +EE E +E +
Sbjct: 166 EEEREAERRERKEEKEREVARLRAQQEEAED------EREELDELRADLYQEEYERKERQ 219

Query: 297 EEI 299
           +E 
Sbjct: 220 KEK 222



 Score = 28.0 bits (63), Expect = 7.9
 Identities = 20/68 (29%), Positives = 28/68 (41%), Gaps = 19/68 (27%)

Query: 245 REVHEEEEEEEEEEEETKAKETKL--------------KEKKKLGGGGGEEEEEEEEEEE 290
             + E++ E+EE E+E   K                  KE+++       E EEE EE E
Sbjct: 262 ERMLEKQAEDEELEQENAEKRRMKRLEHRRELEQQIEEKEERRA-----AEREEELEEGE 316

Query: 291 EEEEEEEE 298
              EEE E
Sbjct: 317 RLREEEAE 324


>gnl|CDD|235302 PRK04456, PRK04456, acetyl-CoA decarbonylase/synthase complex
           subunit beta; Reviewed.
          Length = 463

 Score = 39.7 bits (93), Expect = 0.002
 Identities = 27/62 (43%), Positives = 34/62 (54%), Gaps = 5/62 (8%)

Query: 257 EEEETKAKETK--LKEKKK--LGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVSAVDF-GS 311
           EE+    +E K  LKEK+   +     EEEEEEEEEEEEEEE   E+  +    +    S
Sbjct: 378 EEDVKDIEELKKFLKEKEHPVVERWAAEEEEEEEEEEEEEEEPVAEVMMMPAPEMQLPAS 437

Query: 312 GG 313
           GG
Sbjct: 438 GG 439



 Score = 28.5 bits (64), Expect = 5.7
 Identities = 14/23 (60%), Positives = 15/23 (65%)

Query: 244 EREVHEEEEEEEEEEEETKAKET 266
           ER   EEEEEEEEEEEE +    
Sbjct: 400 ERWAAEEEEEEEEEEEEEEEPVA 422



 Score = 27.7 bits (62), Expect = 9.6
 Identities = 15/37 (40%), Positives = 17/37 (45%)

Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGG 279
              E  EEEEEEEEEEE           + +L   GG
Sbjct: 403 AAEEEEEEEEEEEEEEEPVAEVMMMPAPEMQLPASGG 439


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 39.7 bits (92), Expect = 0.002
 Identities = 20/58 (34%), Positives = 34/58 (58%), Gaps = 2/58 (3%)

Query: 244  EREVHEEEEEEEEEEEETKAKETKLKE--KKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
            E ++   EE ++ EE++ KA+E K  E  +KK      +E EE ++ EE +++E EE 
Sbjct: 1658 ENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEK 1715



 Score = 37.4 bits (86), Expect = 0.011
 Identities = 21/56 (37%), Positives = 34/56 (60%), Gaps = 2/56 (3%)

Query: 244  EREVHEEEEEEEEEEEETKAKETKLKE--KKKLGGGGGEEEEEEEEEEEEEEEEEE 297
            E +  + EE ++ EE + KA+E K  E  KKK+     +E EE+++ EE ++ EEE
Sbjct: 1603 EEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEE 1658



 Score = 37.0 bits (85), Expect = 0.012
 Identities = 21/60 (35%), Positives = 34/60 (56%), Gaps = 3/60 (5%)

Query: 241  KGGEREVHEEEEEEEEEEEETKAKET---KLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
            K  E +  + EE ++ EE+E KA E    + +E KK      +E EE+++ EE ++ EEE
Sbjct: 1668 KKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEE 1727



 Score = 37.0 bits (85), Expect = 0.014
 Identities = 23/57 (40%), Positives = 35/57 (61%), Gaps = 2/57 (3%)

Query: 244  EREVHEEEEEEEEEEEETKAKETKLK--EKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
            E ++  EE ++E EE++ KA+E K    EKKK+     EEE++ EE  +E+E   EE
Sbjct: 1727 ENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEE 1783



 Score = 35.9 bits (82), Expect = 0.028
 Identities = 20/62 (32%), Positives = 32/62 (51%), Gaps = 4/62 (6%)

Query: 241  KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE----EE 296
            K  E E  ++ EE ++ EEE K K  +  +K +      EE ++ EE+E++  E    E 
Sbjct: 1639 KKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEA 1698

Query: 297  EE 298
            EE
Sbjct: 1699 EE 1700



 Score = 35.9 bits (82), Expect = 0.035
 Identities = 22/66 (33%), Positives = 38/66 (57%), Gaps = 4/66 (6%)

Query: 234  QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
            ++E D  K  E +  EE+E++  E  + +A+E K  E+ K      E EE+++ EE ++ 
Sbjct: 1669 KAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELK----KKEAEEKKKAEELKKA 1724

Query: 294  EEEEEI 299
            EEE +I
Sbjct: 1725 EEENKI 1730



 Score = 35.5 bits (81), Expect = 0.039
 Identities = 25/75 (33%), Positives = 40/75 (53%), Gaps = 10/75 (13%)

Query: 234  QSENDYCKGGEREVHEEEE-------EEEEEEEETKAKETKLKE---KKKLGGGGGEEEE 283
            ++E D  K  E    E EE       +++E EE+ KA+E K  E   K K      E EE
Sbjct: 1682 KAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEE 1741

Query: 284  EEEEEEEEEEEEEEE 298
            ++++ EE +++EEE+
Sbjct: 1742 DKKKAEEAKKDEEEK 1756



 Score = 34.3 bits (78), Expect = 0.092
 Identities = 18/54 (33%), Positives = 36/54 (66%)

Query: 246  EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
            E+ ++E EE+++ EE K  E + K K +      EE++++ EE +++EEE+++I
Sbjct: 1706 ELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKI 1759



 Score = 34.3 bits (78), Expect = 0.10
 Identities = 25/68 (36%), Positives = 34/68 (50%), Gaps = 9/68 (13%)

Query: 241  KGGEREVHEEEEEEEEEEEETKAKETKLK---EKKKLGGGGGEEE------EEEEEEEEE 291
            K  E E    E  ++E EE  KA+E K K   EKKK       EE      EE ++E EE
Sbjct: 1682 KAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEE 1741

Query: 292  EEEEEEEI 299
            ++++ EE 
Sbjct: 1742 DKKKAEEA 1749



 Score = 34.0 bits (77), Expect = 0.11
 Identities = 16/57 (28%), Positives = 33/57 (57%)

Query: 241  KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
            K  E+   +E EE+++ EE  KA+E    +  +      E++++ EE ++ EE+E++
Sbjct: 1633 KKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKK 1689



 Score = 33.2 bits (75), Expect = 0.20
 Identities = 16/59 (27%), Positives = 32/59 (54%), Gaps = 4/59 (6%)

Query: 244  EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEE----EEEEEEEEE 298
              E  ++ E+ +++E E K K  +LK+ ++       EE ++ EE+    EE ++ EE+
Sbjct: 1628 AEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEED 1686



 Score = 33.2 bits (75), Expect = 0.21
 Identities = 21/61 (34%), Positives = 32/61 (52%), Gaps = 3/61 (4%)

Query: 241  KGGEREVHEEEEEEEEEEEETKAKETKLK---EKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
            K  E    + EE ++ EEE+ K ++ K K   EKKK       EEE + +  EE ++ EE
Sbjct: 1613 KKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEE 1672

Query: 298  E 298
            +
Sbjct: 1673 D 1673



 Score = 32.8 bits (74), Expect = 0.27
 Identities = 20/64 (31%), Positives = 35/64 (54%), Gaps = 11/64 (17%)

Query: 246  EVHEEEEE--------EEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE---E 294
            E+ + EEE        ++E EE+ KA+E K  E++       E ++ EE++++ EE    
Sbjct: 1624 ELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKA 1683

Query: 295  EEEE 298
            EE+E
Sbjct: 1684 EEDE 1687



 Score = 32.0 bits (72), Expect = 0.46
 Identities = 17/57 (29%), Positives = 31/57 (54%), Gaps = 4/57 (7%)

Query: 246  EVHEEEEEEEEEEEET----KAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
            E  + EE ++ EE++     KA+E K  E+ ++       EEE++ + EE ++ EE 
Sbjct: 1562 EKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEA 1618



 Score = 31.6 bits (71), Expect = 0.70
 Identities = 20/55 (36%), Positives = 32/55 (58%)

Query: 241  KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
            K  E +  + EE +++EEE+ K    K +E+KK      E+E   EEE +EE+E+
Sbjct: 1737 KEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEK 1791



 Score = 31.3 bits (70), Expect = 0.76
 Identities = 19/61 (31%), Positives = 31/61 (50%), Gaps = 2/61 (3%)

Query: 238  DYCKGGEREVHEEEEEEEEEEEETKAKETKLK--EKKKLGGGGGEEEEEEEEEEEEEEEE 295
            D  K    E  + +E +++ EE  KA E K K  E KK      + EE ++ +E +++ E
Sbjct: 1421 DEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAE 1480

Query: 296  E 296
            E
Sbjct: 1481 E 1481



 Score = 31.3 bits (70), Expect = 0.90
 Identities = 19/58 (32%), Positives = 34/58 (58%)

Query: 241  KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
            K  E    EE  +  EEE++ KA+E K  E+ K+     ++ EEE+++ E+ +++E E
Sbjct: 1587 KKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAE 1644



 Score = 30.5 bits (68), Expect = 1.4
 Identities = 15/58 (25%), Positives = 32/58 (55%)

Query: 241  KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
            K  E +   +E +++ EE++ KA E K     K      +++ EE+++ +E +++ EE
Sbjct: 1385 KKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEE 1442



 Score = 30.5 bits (68), Expect = 1.6
 Identities = 15/55 (27%), Positives = 27/55 (49%)

Query: 241  KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
            K  E    + +E ++  E + KA E K  E+ K      + EE ++ +E ++ EE
Sbjct: 1490 KKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEE 1544



 Score = 30.1 bits (67), Expect = 1.7
 Identities = 19/63 (30%), Positives = 31/63 (49%), Gaps = 2/63 (3%)

Query: 238  DYCKGGEREVHEEEEEEEEEEEETKAKETKLK--EKKKLGGGGGEEEEEEEEEEEEEEEE 295
            D  K    E  + EE +++ EE  KA E K K  E KK      + EE +++ +E ++  
Sbjct: 1447 DEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAA 1506

Query: 296  EEE 298
            E +
Sbjct: 1507 EAK 1509



 Score = 30.1 bits (67), Expect = 2.1
 Identities = 20/67 (29%), Positives = 36/67 (53%), Gaps = 8/67 (11%)

Query: 241  KGGEREVHEEEEEEEEE--------EEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEE 292
            K  E++  EE ++ EE+        EE  KA+E +++E  KL     + + EE ++ EE 
Sbjct: 1559 KAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEA 1618

Query: 293  EEEEEEI 299
            + + EE+
Sbjct: 1619 KIKAEEL 1625



 Score = 29.7 bits (66), Expect = 2.4
 Identities = 14/58 (24%), Positives = 31/58 (53%)

Query: 241  KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
            K  E +  + +E ++    + KA E K K ++K      +++ EE ++ +E +++ EE
Sbjct: 1398 KKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEE 1455



 Score = 29.7 bits (66), Expect = 2.4
 Identities = 12/51 (23%), Positives = 25/51 (49%)

Query: 246  EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
            E  ++  E  + E E  A E +  E+K       +EE +++ +  +++ EE
Sbjct: 1339 EEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEE 1389



 Score = 29.7 bits (66), Expect = 2.8
 Identities = 16/61 (26%), Positives = 32/61 (52%)

Query: 238  DYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
            D  K       + +E +++ EE+ KA E K K ++       +++ EE ++ EE +++ E
Sbjct: 1408 DELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAE 1467

Query: 298  E 298
            E
Sbjct: 1468 E 1468



 Score = 29.7 bits (66), Expect = 2.8
 Identities = 15/56 (26%), Positives = 28/56 (50%)

Query: 241  KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
            K  E     +E +++ EE + KA E K   + K      ++ EE ++ +E ++ EE
Sbjct: 1477 KKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEE 1532



 Score = 29.3 bits (65), Expect = 3.3
 Identities = 14/65 (21%), Positives = 35/65 (53%)

Query: 234  QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
            ++  D  +  E +    E+++EE +++  A + K +EKKK      + EE++++ +E ++
Sbjct: 1353 EAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKK 1412

Query: 294  EEEEE 298
                +
Sbjct: 1413 AAAAK 1417



 Score = 29.3 bits (65), Expect = 3.5
 Identities = 17/54 (31%), Positives = 27/54 (50%), Gaps = 2/54 (3%)

Query: 246  EVHEEEEEEEEEEEETKAKETKLK--EKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
            E  + +E +++ EE  KA E K K  E KK      ++ EE ++  E  + E E
Sbjct: 1300 EKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAE 1353



 Score = 29.0 bits (64), Expect = 4.2
 Identities = 16/53 (30%), Positives = 28/53 (52%), Gaps = 2/53 (3%)

Query: 246  EVHEEEEEEEEEEEETKAKETKLK--EKKKLGGGGGEEEEEEEEEEEEEEEEE 296
            E  + +E +++ EE  KA E K K  E KK      +  E +++ +E ++ EE
Sbjct: 1468 EAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEE 1520



 Score = 29.0 bits (64), Expect = 4.3
 Identities = 14/58 (24%), Positives = 27/58 (46%)

Query: 241  KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
            K  E    + +  +++ EE  KA E    E +         EE+ E  E+++EE +++
Sbjct: 1322 KKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKK 1379



 Score = 29.0 bits (64), Expect = 4.4
 Identities = 16/55 (29%), Positives = 31/55 (56%)

Query: 244  EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
            + E     +E E  EE+ +A E K +E KK      ++ EE+++ +E +++ EE+
Sbjct: 1349 KAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEED 1403



 Score = 29.0 bits (64), Expect = 4.9
 Identities = 13/59 (22%), Positives = 31/59 (52%)

Query: 241  KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
            K  E    + EE ++ +E + KA+E K  ++ K      +++ +E ++  E +++ +E 
Sbjct: 1457 KKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEA 1515



 Score = 28.6 bits (63), Expect = 6.7
 Identities = 18/59 (30%), Positives = 26/59 (44%), Gaps = 2/59 (3%)

Query: 241  KGGEREVHEEEEEEEEEEEETKAKETK--LKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
            K  E     +E +++ EE + KA   K   +E KK       E E   +E E  EE+ E
Sbjct: 1309 KKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAE 1367



 Score = 28.6 bits (63), Expect = 6.7
 Identities = 18/66 (27%), Positives = 33/66 (50%), Gaps = 4/66 (6%)

Query: 234  QSENDYCKGGEREVHEEEEEEEEE----EEETKAKETKLKEKKKLGGGGGEEEEEEEEEE 289
            ++E    K  E +   E +++ +E    EE  KA E K  E+ K      + EE+++ +E
Sbjct: 1491 KAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADE 1550

Query: 290  EEEEEE 295
             ++ EE
Sbjct: 1551 LKKAEE 1556



 Score = 28.2 bits (62), Expect = 7.1
 Identities = 12/58 (20%), Positives = 24/58 (41%)

Query: 238  DYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
            D  K    E  +  E  + E E    +    +EK +      EE +++ +  +++ EE
Sbjct: 1332 DAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEE 1389



 Score = 28.2 bits (62), Expect = 7.2
 Identities = 16/61 (26%), Positives = 33/61 (54%)

Query: 238  DYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
            D  K    E  + +E +++ EE  KA+E K K ++       +++ EE ++ +E +++ E
Sbjct: 1434 DEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAE 1493

Query: 298  E 298
            E
Sbjct: 1494 E 1494



 Score = 28.2 bits (62), Expect = 7.6
 Identities = 10/54 (18%), Positives = 29/54 (53%)

Query: 246  EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
            E  +EE +++ +  + KA+E K  ++ K      +++ +E ++    +++ +E 
Sbjct: 1370 EKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEA 1423



 Score = 28.2 bits (62), Expect = 7.7
 Identities = 8/56 (14%), Positives = 29/56 (51%)

Query: 244  EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
            +++  E ++  E  + E +A   + +  ++      +++EE +++ +  +++ EE 
Sbjct: 1335 KKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEK 1390



 Score = 27.8 bits (61), Expect = 9.6
 Identities = 17/61 (27%), Positives = 29/61 (47%), Gaps = 3/61 (4%)

Query: 241  KGGEREVHEEEEEEEEEEEETKAKETKLKE---KKKLGGGGGEEEEEEEEEEEEEEEEEE 297
            K  E +   EE +++ +  + KA+E K      K +      E E  EE+ E  E+++EE
Sbjct: 1316 KADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEE 1375

Query: 298  E 298
             
Sbjct: 1376 A 1376


>gnl|CDD|218684 pfam05672, MAP7, MAP7 (E-MAP-115) family.  The organisation of
           microtubules varies with the cell type and is presumably
           controlled by tissue-specific microtubule-associated
           proteins (MAPs). The 115-kDa epithelial MAP
           (E-MAP-115/MAP7) has been identified as a
           microtubule-stabilising protein predominantly expressed
           in cell lines of epithelial origin. The binding of this
           microtubule associated protein is nucleotide
           independent.
          Length = 171

 Score = 38.2 bits (88), Expect = 0.002
 Identities = 21/65 (32%), Positives = 37/65 (56%)

Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
           Q E D  +  E +    EE    EEE  + +E + +EK++      EEEE++E+EE+E  
Sbjct: 49  QEEQDRLEREELKRRAAEERLRREEEARRQEEERAREKEEKAKRKAEEEEKQEQEEQERI 108

Query: 294 EEEEE 298
           ++++E
Sbjct: 109 QKQKE 113



 Score = 35.5 bits (81), Expect = 0.015
 Identities = 18/55 (32%), Positives = 32/55 (58%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           E+E  E+EE++  E EE K +  + + +++      EEE   E+EE+ + + EEE
Sbjct: 43  EQERREQEEQDRLEREELKRRAAEERLRREEEARRQEEERAREKEEKAKRKAEEE 97



 Score = 35.1 bits (80), Expect = 0.021
 Identities = 20/58 (34%), Positives = 32/58 (55%), Gaps = 4/58 (6%)

Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           +  E  +  EEE   +EEE  + KE    EK K      E++E+EE+E  ++++EE E
Sbjct: 63  RAAEERLRREEEARRQEEERAREKE----EKAKRKAEEEEKQEQEEQERIQKQKEEAE 116



 Score = 32.4 bits (73), Expect = 0.16
 Identities = 16/57 (28%), Positives = 30/57 (52%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
           +RE  E+E  E+EE++  + +E K +  ++      E   +EEE   E+EE+ +   
Sbjct: 38  QREQEEQERREQEEQDRLEREELKRRAAEERLRREEEARRQEEERAREKEEKAKRKA 94



 Score = 32.4 bits (73), Expect = 0.19
 Identities = 17/54 (31%), Positives = 27/54 (50%), Gaps = 4/54 (7%)

Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           R+  E+ E+EE+E  E + ++   +E+ K        EE    EEE   +EEE 
Sbjct: 33  RQAREQREQEEQERREQEEQDRLEREELK----RRAAEERLRREEEARRQEEER 82



 Score = 31.2 bits (70), Expect = 0.36
 Identities = 19/58 (32%), Positives = 30/58 (51%), Gaps = 11/58 (18%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEE----EEEEEEEEEEEE 297
           E E   +EEE   E+EE   ++ + +EK+       E+EE+E    ++EE E    EE
Sbjct: 72  EEEARRQEEERAREKEEKAKRKAEEEEKQ-------EQEEQERIQKQKEEAEARAREE 122



 Score = 31.2 bits (70), Expect = 0.37
 Identities = 15/55 (27%), Positives = 27/55 (49%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           +R    E+ E+EE+E   + ++ +L+ ++       E    EEE   +EEE   E
Sbjct: 31  KRRQAREQREQEEQERREQEEQDRLEREELKRRAAEERLRREEEARRQEEERARE 85



 Score = 30.1 bits (67), Expect = 1.0
 Identities = 17/54 (31%), Positives = 28/54 (51%)

Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           R   EE   E+EE+ + KA+E + +E+++      ++EE E    EE E    E
Sbjct: 76  RRQEEERAREKEEKAKRKAEEEEKQEQEEQERIQKQKEEAEARAREEAERMRLE 129



 Score = 28.9 bits (64), Expect = 2.8
 Identities = 18/68 (26%), Positives = 37/68 (54%), Gaps = 7/68 (10%)

Query: 234 QSENDYCKGGEREVHEEEEEE--EEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEE 291
           + E    K  E E  E+EE+E  ++++EE +A+  +  E+ +L     E E+  ++ E+E
Sbjct: 86  KEEKAKRKAEEEEKQEQEEQERIQKQKEEAEARAREEAERMRL-----EREKHFQQIEQE 140

Query: 292 EEEEEEEI 299
             E ++ +
Sbjct: 141 RLERKKRL 148



 Score = 28.5 bits (63), Expect = 3.2
 Identities = 14/54 (25%), Positives = 26/54 (48%)

Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           +E     E+EE+ + + + +E + +E+++      EE E    EE E    E E
Sbjct: 78  QEEERAREKEEKAKRKAEEEEKQEQEEQERIQKQKEEAEARAREEAERMRLERE 131



 Score = 28.1 bits (62), Expect = 4.7
 Identities = 15/65 (23%), Positives = 30/65 (46%), Gaps = 9/65 (13%)

Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
           + E    +  E E  +E+EE+E  +++ +  E + +E         E E    E E+  +
Sbjct: 85  EKEEKAKRKAEEEEKQEQEEQERIQKQKEEAEARARE---------EAERMRLEREKHFQ 135

Query: 294 EEEEE 298
           + E+E
Sbjct: 136 QIEQE 140


>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain.  This
           family represents the C-terminus (approximately 300
           residues) of proteins that are involved as binding
           partners for Prp19 as part of the nuclear pore complex.
           The family in Drosophila is necessary for pre-mRNA
           splicing, and the human protein has been found in
           purifications of the spliceosome. In the past this
           family was thought, erroneously, to be associated with
           microfibrillin.
          Length = 277

 Score = 38.7 bits (90), Expect = 0.002
 Identities = 21/68 (30%), Positives = 33/68 (48%), Gaps = 5/68 (7%)

Query: 230 KTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEE 289
           K +F  + D     ERE    +E+  EEE + KA+E K +  K +     EEE ++E E 
Sbjct: 38  KPVFTRKKDRITIQEREREAAKEKALEEEAKRKAEERKRETLKIV-----EEEVKKELEL 92

Query: 290 EEEEEEEE 297
           ++     E
Sbjct: 93  KKRNTLLE 100



 Score = 36.0 bits (83), Expect = 0.016
 Identities = 16/34 (47%), Positives = 23/34 (67%)

Query: 265 ETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           ET++ E ++    G EEEEE EEEEE + E++ E
Sbjct: 1   ETEVLELEEEDESGEEEEEESEEEEETDSEDDME 34



 Score = 36.0 bits (83), Expect = 0.019
 Identities = 23/65 (35%), Positives = 28/65 (43%), Gaps = 19/65 (29%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVV 303
           E EV E EEE+E  EEE                   EEE EEEEE + E++ E  +  V 
Sbjct: 1   ETEVLELEEEDESGEEE-------------------EEESEEEEETDSEDDMEPRLKPVF 41

Query: 304 VSAVD 308
               D
Sbjct: 42  TRKKD 46



 Score = 35.3 bits (81), Expect = 0.027
 Identities = 24/65 (36%), Positives = 32/65 (49%), Gaps = 16/65 (24%)

Query: 244 EREVHEEEEEEEEEEEET----------KAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
           E E  EEEEEE EEEEET          K   T+ K++  +      +E E E  +E+  
Sbjct: 10  EDESGEEEEEESEEEEETDSEDDMEPRLKPVFTRKKDRITI------QEREREAAKEKAL 63

Query: 294 EEEEE 298
           EEE +
Sbjct: 64  EEEAK 68



 Score = 33.0 bits (75), Expect = 0.17
 Identities = 19/65 (29%), Positives = 32/65 (49%)

Query: 232 IFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEE 291
           + + E +   G E E   EEEEE + E++ + +   +  +KK      E E E  +E+  
Sbjct: 4   VLELEEEDESGEEEEEESEEEEETDSEDDMEPRLKPVFTRKKDRITIQEREREAAKEKAL 63

Query: 292 EEEEE 296
           EEE +
Sbjct: 64  EEEAK 68



 Score = 29.5 bits (66), Expect = 1.9
 Identities = 22/102 (21%), Positives = 36/102 (35%), Gaps = 28/102 (27%)

Query: 235 SENDYCKGGEREVHEEEEEEEEEEE---------------------ETKAKETKLKEKKK 273
              +  + GE E  E EEEEE + E                     E + +  K K  ++
Sbjct: 6   ELEEEDESGEEEEEESEEEEETDSEDDMEPRLKPVFTRKKDRITIQEREREAAKEKALEE 65

Query: 274 LGGGGGEEEE-------EEEEEEEEEEEEEEEIGGVVVSAVD 308
                 EE +       EEE ++E E ++   +    +  VD
Sbjct: 66  EAKRKAEERKRETLKIVEEEVKKELELKKRNTLLEANIDDVD 107


>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2.  PPP4R2 (protein phosphatase 4 core
           regulatory subunit R2) is the regulatory subunit of the
           histone H2A phosphatase complex. It has been shown to
           confer resistance to the anticancer drug cisplatin in
           yeast, and may confer resistance in higher eukaryotes.
          Length = 285

 Score = 38.7 bits (90), Expect = 0.002
 Identities = 23/72 (31%), Positives = 37/72 (51%), Gaps = 7/72 (9%)

Query: 227 GYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEE 286
            Y K    +      G +    + ++ ++EE+++      + KE K       E+EEEEE
Sbjct: 221 YYEKESSDAAASQDDGPKGSDVKNKKSDDEEDDDQDGDYVEEKELK-------EDEEEEE 273

Query: 287 EEEEEEEEEEEE 298
            EEEEEEE+E+E
Sbjct: 274 TEEEEEEEDEDE 285



 Score = 36.7 bits (85), Expect = 0.010
 Identities = 23/81 (28%), Positives = 34/81 (41%), Gaps = 6/81 (7%)

Query: 221 PVNRPDGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLK--EKKKLGGGG 278
               PD         E    +  E+E  +    +++  + +  K  K    E     G  
Sbjct: 204 TNGLPDSSQDKNKSLE----EYYEKESSDAAASQDDGPKGSDVKNKKSDDEEDDDQDGDY 259

Query: 279 GEEEEEEEEEEEEEEEEEEEI 299
            EE+E +E+EEEEE EEEEE 
Sbjct: 260 VEEKELKEDEEEEETEEEEEE 280


>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin.  Nucleoplasmins are also
           known as chromatin decondensation proteins. They bind to
           core histones and transfer DNA to them in a reaction
           that requires ATP. This is thought to play a role in the
           assembly of regular nucleosomal arrays.
          Length = 146

 Score = 37.7 bits (88), Expect = 0.002
 Identities = 19/60 (31%), Positives = 31/60 (51%), Gaps = 20/60 (33%)

Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGV 302
           G+  V  EE+E +++EE+                    EEEE++EE+++E+E EEE   V
Sbjct: 103 GQHLVASEEDESDDDEED--------------------EEEEDDEEDDDEDESEEEESPV 142



 Score = 33.1 bits (76), Expect = 0.082
 Identities = 13/38 (34%), Positives = 26/38 (68%)

Query: 235 SENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKK 272
           SE D     E +  EE++EE+++E+E++ +E+ +K+ K
Sbjct: 109 SEEDESDDDEEDEEEEDDEEDDDEDESEEEESPVKKVK 146



 Score = 30.8 bits (70), Expect = 0.53
 Identities = 17/60 (28%), Positives = 27/60 (45%), Gaps = 19/60 (31%)

Query: 232 IFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEE 291
           +  S        E E  ++EE+EEEE++E                   E+++E+E EEEE
Sbjct: 99  VHISGQHLVASEEDESDDDEEDEEEEDDE-------------------EDDDEDESEEEE 139


>gnl|CDD|220692 pfam10324, 7TM_GPCR_Srw, Serpentine type 7TM GPCR chemoreceptor
           Srw.  Chemoreception is mediated in Caenorhabditis
           elegans by members of the seven-transmembrane
           G-protein-coupled receptor class (7TM GPCRs) of proteins
           which are of the serpentine type. Srw is a solo family
           amongst the superfamilies of chemoreceptors.
           Chemoperception is one of the central senses of soil
           nematodes like C. elegans which are otherwise 'blind'
           and 'deaf'. The genes encoding Srw do not appear to be
           under as strong an adaptive evolutionary pressure as
           those of Srz.
          Length = 317

 Score = 38.7 bits (91), Expect = 0.003
 Identities = 32/137 (23%), Positives = 63/137 (45%), Gaps = 21/137 (15%)

Query: 71  IIIILGFFGNALVVVVVAFNQQMR-STTNLLIINLAIADLLFIVCCV-----PFTATDYV 124
           I+ I+GF  N +  +++   + MR S+ N+++I +AI D++ ++  +      F  +   
Sbjct: 5   ILSIIGFIIN-IFHLIILTRKSMRTSSINIIMIGIAICDIITMLLTIYNFIPEFIISYEN 63

Query: 125 TTCWPF---GDVWCKMYLIVVTAYA---SVYTLVLMSLDRFLAVVHPIASMSVRTER--- 175
           + C P      V     L  +   +   S +  V M+L R L V +P   MS + ++   
Sbjct: 64  SECIPPDSYLKVLLDWILESLQDISRRCSTWLGVFMALIRTLVVKNP---MSNKIQKLSK 120

Query: 176 --NALKAILITWIVIVL 190
               L  I+I +I+ + 
Sbjct: 121 PKFGLIIIIIVFILSLP 137


>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572).  Family of
           eukaryotic proteins with undetermined function.
          Length = 321

 Score = 38.6 bits (90), Expect = 0.003
 Identities = 18/54 (33%), Positives = 28/54 (51%), Gaps = 8/54 (14%)

Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
               ++EEEEEEEE+E           K L  G   EE+    ++E+ E++EE+
Sbjct: 177 FRREKKEEEEEEEEDE--------ALIKSLSFGPETEEDRRRADDEDSEDDEED 222



 Score = 34.3 bits (79), Expect = 0.062
 Identities = 22/93 (23%), Positives = 36/93 (38%), Gaps = 24/93 (25%)

Query: 233 FQSENDYCKGGEREVHEEEEEEE--------------EEEEETKAKETKLKEKK------ 272
           ++++    +  ER   E EEE                + E E   +  +LKE +      
Sbjct: 108 YEADKLDEEQEERVEKEREEELAGDAMKKLENRTADSKREMEVLERLEELKELQSRRADV 167

Query: 273 ----KLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
                L      E++EEEEEEEE+E   + +  
Sbjct: 168 DVNSMLEALFRREKKEEEEEEEEDEALIKSLSF 200



 Score = 28.9 bits (65), Expect = 3.2
 Identities = 18/60 (30%), Positives = 27/60 (45%), Gaps = 7/60 (11%)

Query: 242 GGEREVHEEEEEEEEEEEETKAKETKLKEK--KKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           G  R    ++ +EE+EE   K +E +L     KKL     E    + + E E  E  EE+
Sbjct: 103 GATRNYEADKLDEEQEERVEKEREEELAGDAMKKL-----ENRTADSKREMEVLERLEEL 157


>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon. 
          Length = 431

 Score = 38.5 bits (89), Expect = 0.003
 Identities = 22/54 (40%), Positives = 32/54 (59%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGV 302
           EE+EE  EE EE +  E   K ++K      EE ++EE+E E EEEE+ + G +
Sbjct: 114 EEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKEPEPEEEEKPKRGSL 167



 Score = 38.5 bits (89), Expect = 0.003
 Identities = 23/64 (35%), Positives = 31/64 (48%), Gaps = 9/64 (14%)

Query: 235 SENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
           +EN+  +  E+E   EE EE EE E     E K             + EE ++EE+E E 
Sbjct: 106 AENETVEEEEKEESREEREEVEETEGVTKSEQKNDW---------RDAEECQKEEKEPEP 156

Query: 295 EEEE 298
           EEEE
Sbjct: 157 EEEE 160



 Score = 38.1 bits (88), Expect = 0.005
 Identities = 20/58 (34%), Positives = 30/58 (51%), Gaps = 9/58 (15%)

Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           +  + E  EE EE EE E  TK+++              + EE ++EE+E E EEEE+
Sbjct: 113 EEEKEESREEREEVEETEGVTKSEQKNDWR---------DAEECQKEEKEPEPEEEEK 161



 Score = 34.6 bits (79), Expect = 0.068
 Identities = 16/55 (29%), Positives = 31/55 (56%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           E    E E  EEEE+EE++ +  +++E + +     + +  + EE ++EE+E E 
Sbjct: 102 EDSGAENETVEEEEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKEPEP 156



 Score = 33.9 bits (77), Expect = 0.093
 Identities = 14/60 (23%), Positives = 29/60 (48%)

Query: 239 YCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
             +   R   +   E E  EEE K +  + +E+ +   G  + E++ +  + EE ++EE+
Sbjct: 93  LSEPSRRMQEDSGAENETVEEEEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEK 152



 Score = 33.9 bits (77), Expect = 0.10
 Identities = 15/55 (27%), Positives = 26/55 (47%), Gaps = 7/55 (12%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
            ++  EE  +  EEEE+ + +E   ++ +       EEEE+   +EE E    E 
Sbjct: 219 LKKKREERRKVLEEEEQRRKQEEADRKSR-------EEEEKRRLKEEIERRRAEA 266



 Score = 32.7 bits (74), Expect = 0.25
 Identities = 15/50 (30%), Positives = 24/50 (48%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           E EE +++ EE  K  E + + +K+        EEEE+   +EE E    
Sbjct: 215 ELEELKKKREERRKVLEEEEQRRKQEEADRKSREEEEKRRLKEEIERRRA 264



 Score = 31.9 bits (72), Expect = 0.42
 Identities = 19/60 (31%), Positives = 27/60 (45%)

Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGV 302
              E  E ++E +    +    E   + ++  G      EEEE+EE  EE EE EE  GV
Sbjct: 73  RFSEALERQKEFKPTSTDQSLSEPSRRMQEDSGAENETVEEEEKEESREEREEVEETEGV 132



 Score = 31.9 bits (72), Expect = 0.49
 Identities = 19/57 (33%), Positives = 29/57 (50%), Gaps = 4/57 (7%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
           ER    EEEE+  ++EE   K  + +EK++L     EE E    E  E+ ++  E G
Sbjct: 225 ERRKVLEEEEQRRKQEEADRKSREEEEKRRL----KEEIERRRAEAAEKRQKVPEDG 277



 Score = 30.4 bits (68), Expect = 1.3
 Identities = 13/53 (24%), Positives = 24/53 (45%), Gaps = 8/53 (15%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
           E      EEEE+  ++E   ++++ +E+K+          +EE E    E  E
Sbjct: 224 EERRKVLEEEEQRRKQEEADRKSREEEEKR--------RLKEEIERRRAEAAE 268



 Score = 30.0 bits (67), Expect = 1.6
 Identities = 19/58 (32%), Positives = 25/58 (43%), Gaps = 5/58 (8%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
             E  E EE E   + E K      +E +K      E+E E EEEE+ +    EE  G
Sbjct: 120 REEREEVEETEGVTKSEQKNDWRDAEECQK-----EEKEPEPEEEEKPKRGSLEENNG 172



 Score = 28.5 bits (63), Expect = 5.7
 Identities = 18/68 (26%), Positives = 34/68 (50%), Gaps = 7/68 (10%)

Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAK-ETKLKEKKKLGGGGGEEEEEEEEEEEEE 292
            +EN + +GG      E  +E E+ ++ + +   +L+E KK      + EE  +  EEEE
Sbjct: 181 HTENTFSRGGAEGAQVEAGKEFEKLKQKQQEAALELEELKK------KREERRKVLEEEE 234

Query: 293 EEEEEEIG 300
           +  ++E  
Sbjct: 235 QRRKQEEA 242



 Score = 27.7 bits (61), Expect = 8.5
 Identities = 16/57 (28%), Positives = 24/57 (42%)

Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
           +   RE   +E   E  E + + K T   +         +E+   E E  EEEE+EE
Sbjct: 62  RLARREERRDERFSEALERQKEFKPTSTDQSLSEPSRRMQEDSGAENETVEEEEKEE 118


>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 156 to 174 amino acids in length. This domain is
           found associated with pfam07780, pfam01728.
          Length = 154

 Score = 37.3 bits (87), Expect = 0.003
 Identities = 17/60 (28%), Positives = 29/60 (48%), Gaps = 8/60 (13%)

Query: 240 CKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
               + +  EEEEE E EE + + +  +L EK        E  + + E+  E E +++EI
Sbjct: 95  GLDKKEKEEEEEEEVEVEELDEEEQIDELLEK--------ELAKLKREKRRENERKQKEI 146



 Score = 33.8 bits (78), Expect = 0.041
 Identities = 18/33 (54%), Positives = 22/33 (66%)

Query: 267 KLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           + K +K LG    E+EEEEEEE E EE +EEE 
Sbjct: 87  RKKVRKLLGLDKKEKEEEEEEEVEVEELDEEEQ 119



 Score = 33.4 bits (77), Expect = 0.066
 Identities = 17/38 (44%), Positives = 24/38 (63%), Gaps = 6/38 (15%)

Query: 262 KAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           K ++    +KK+      +EEEEEEE E EE +EEE+I
Sbjct: 89  KVRKLLGLDKKE------KEEEEEEEVEVEELDEEEQI 120



 Score = 31.1 bits (71), Expect = 0.35
 Identities = 13/35 (37%), Positives = 19/35 (54%)

Query: 264 KETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           K   L +K+K      E E EE +EEE+ +E  E+
Sbjct: 92  KLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEK 126



 Score = 29.6 bits (67), Expect = 1.3
 Identities = 13/37 (35%), Positives = 19/37 (51%)

Query: 262 KAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           K      KEK++      E EE +EEE+ +E  E+E 
Sbjct: 92  KLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKEL 128



 Score = 28.8 bits (65), Expect = 2.0
 Identities = 16/31 (51%), Positives = 22/31 (70%)

Query: 268 LKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           LK +KK+    G +++E+EEEEEEE E EE 
Sbjct: 84  LKWRKKVRKLLGLDKKEKEEEEEEEVEVEEL 114



 Score = 27.3 bits (61), Expect = 7.6
 Identities = 9/31 (29%), Positives = 18/31 (58%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKL 274
           E ++ E  E+E  + +  K +E + K+K+ L
Sbjct: 117 EEQIDELLEKELAKLKREKRRENERKQKEIL 147


>gnl|CDD|184724 PRK14520, rpsP, 30S ribosomal protein S16; Provisional.
          Length = 155

 Score = 37.0 bits (86), Expect = 0.003
 Identities = 10/49 (20%), Positives = 12/49 (24%)

Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
                E +   T    T  K+K        E      E       EEE 
Sbjct: 107 NAALAEADGGPTAEATTPKKKKAAAEAAAAEAAAPAAEAAAAAAAEEEA 155



 Score = 35.4 bits (82), Expect = 0.012
 Identities = 9/53 (16%), Positives = 13/53 (24%)

Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
            E+      E +     +A   K K+               E       EEE 
Sbjct: 103 LELFNAALAEADGGPTAEATTPKKKKAAAEAAAAEAAAPAAEAAAAAAAEEEA 155


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 38.7 bits (90), Expect = 0.003
 Identities = 20/66 (30%), Positives = 42/66 (63%)

Query: 233 FQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEE 292
            ++E+   +  E+E  +EE+++++E+ + + K+ K KE+ K      E+E+E+E++ EE 
Sbjct: 98  PKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEP 157

Query: 293 EEEEEE 298
            + EEE
Sbjct: 158 RDREEE 163



 Score = 38.3 bits (89), Expect = 0.004
 Identities = 17/50 (34%), Positives = 29/50 (58%), Gaps = 5/50 (10%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
            +E + E  +EE K KE   +EKKK      +E+ +EE ++ + +EE +E
Sbjct: 95  AKEPKNESGKEEEKEKEQVKEEKKK-----KKEKPKEEPKDRKPKEEAKE 139



 Score = 36.8 bits (85), Expect = 0.014
 Identities = 16/69 (23%), Positives = 30/69 (43%), Gaps = 11/69 (15%)

Query: 241 KGGEREVHEEEEEEEEEEEETKAKET-----------KLKEKKKLGGGGGEEEEEEEEEE 289
           +  + E  +++E+ +EE ++ K KE            K KEKK       EEE++ E   
Sbjct: 111 EQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVR 170

Query: 290 EEEEEEEEE 298
            +   ++  
Sbjct: 171 AKSRPKKPP 179



 Score = 34.1 bits (78), Expect = 0.100
 Identities = 12/41 (29%), Positives = 26/41 (63%)

Query: 258 EEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
             +  A +TK  ++ K   G  EE+E+E+ +EE+++++E+ 
Sbjct: 84  GSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKP 124



 Score = 32.6 bits (74), Expect = 0.27
 Identities = 12/42 (28%), Positives = 24/42 (57%)

Query: 257 EEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
             +   AK    KE K   G   E+E+E+ +EE+++++E+ +
Sbjct: 84  GSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPK 125



 Score = 32.6 bits (74), Expect = 0.29
 Identities = 16/64 (25%), Positives = 31/64 (48%), Gaps = 2/64 (3%)

Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
             E +  K  E++V E  + EEE++ E    +++ K+  K      ++E  EEE++ +  
Sbjct: 143 PKEKE--KEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAA 200

Query: 294 EEEE 297
            E  
Sbjct: 201 REAV 204



 Score = 30.6 bits (69), Expect = 1.2
 Identities = 13/59 (22%), Positives = 22/59 (37%)

Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
               R+  EE++ E    +    K  K K   K      EE++ +   E  + + EE  
Sbjct: 154 VEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPD 212



 Score = 29.9 bits (67), Expect = 2.3
 Identities = 14/66 (21%), Positives = 26/66 (39%), Gaps = 10/66 (15%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGG----------GGEEEEEEEEEEEEEE 293
           E E   E    +   ++   K+   K+K+                G+ EE +  EE E+E
Sbjct: 161 EEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREKE 220

Query: 294 EEEEEI 299
           E++ + 
Sbjct: 221 EDDGKD 226



 Score = 28.3 bits (63), Expect = 6.0
 Identities = 11/48 (22%), Positives = 22/48 (45%)

Query: 251 EEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           E +   +E   + ++   K          E + E  +EEE+E+E+ +E
Sbjct: 68  ESKLSSDEAVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKE 115


>gnl|CDD|217450 pfam03247, Prothymosin, Prothymosin/parathymosin family.
           Prothymosin alpha and parathymosin are two ubiquitous
           small acidic nuclear proteins that are thought to be
           involved in cell cycle progression, proliferation, and
           cell differentiation.
          Length = 106

 Score = 36.1 bits (83), Expect = 0.004
 Identities = 25/66 (37%), Positives = 31/66 (46%), Gaps = 19/66 (28%)

Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGV 302
           G +E  +E EEEEE +E+                   +EEEE E EEEE EEEEE  G  
Sbjct: 43  GAQEGDDEMEEEEEVDED-------------------DEEEEGEGEEEEGEEEEETEGAT 83

Query: 303 VVSAVD 308
              A +
Sbjct: 84  GKRAAE 89



 Score = 34.6 bits (79), Expect = 0.012
 Identities = 20/49 (40%), Positives = 31/49 (63%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
           +E++E  EE+E  K       E ++ G   G++E EEEEE +E++EEEE
Sbjct: 17  KEKKEVVEEKENGKNAPANGNENEENGAQEGDDEMEEEEEVDEDDEEEE 65



 Score = 33.8 bits (77), Expect = 0.028
 Identities = 25/64 (39%), Positives = 33/64 (51%), Gaps = 22/64 (34%)

Query: 242 GGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEE-------EE 294
            G+ E+ EEEE +E++EEE                G GEEEE EEEEE E        E+
Sbjct: 46  EGDDEMEEEEEVDEDDEEEE---------------GEGEEEEGEEEEETEGATGKRAAED 90

Query: 295 EEEE 298
           EE++
Sbjct: 91  EEDD 94



 Score = 32.6 bits (74), Expect = 0.062
 Identities = 18/51 (35%), Positives = 28/51 (54%)

Query: 251 EEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
           +E++E  EE+   K       +    G  E ++E EEEEE +E++EEE G 
Sbjct: 17  KEKKEVVEEKENGKNAPANGNENEENGAQEGDDEMEEEEEVDEDDEEEEGE 67



 Score = 31.9 bits (72), Expect = 0.12
 Identities = 19/49 (38%), Positives = 26/49 (53%)

Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           E  EE+E  +   A   + +E     G    EEEEE +E++EEEE E E
Sbjct: 21  EVVEEKENGKNAPANGNENEENGAQEGDDEMEEEEEVDEDDEEEEGEGE 69


>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi.  This family includes
           the fifth essential DNA polymerase in yeast EC:2.7.7.7.
           Pol5p is localised exclusively to the nucleolus and
           binds near or at the enhancer region of rRNA-encoding
           DNA repeating units.
          Length = 784

 Score = 38.7 bits (90), Expect = 0.004
 Identities = 17/50 (34%), Positives = 28/50 (56%), Gaps = 5/50 (10%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           E EEE+E++ EET   E + +          E E E + E+ EE+E+E++
Sbjct: 646 EGEEEDEDDLEETDDDEDECEA-----IEDSESESESDGEDGEEDEQEDD 690



 Score = 36.8 bits (85), Expect = 0.016
 Identities = 14/47 (29%), Positives = 27/47 (57%)

Query: 252 EEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           E EEE+E++ +  +    E + +     E E + E+ EE+E+E++ E
Sbjct: 646 EGEEEDEDDLEETDDDEDECEAIEDSESESESDGEDGEEDEQEDDAE 692



 Score = 29.8 bits (67), Expect = 2.4
 Identities = 20/66 (30%), Positives = 34/66 (51%), Gaps = 3/66 (4%)

Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGV 302
           GE E  ++ EE +++E+E +A E    E +  G    + EE+E+E++ E  E    I   
Sbjct: 647 GEEEDEDDLEETDDDEDECEAIEDSESESESDGE---DGEEDEQEDDAEANEGVVPIDKA 703

Query: 303 VVSAVD 308
           V  A+ 
Sbjct: 704 VRRALP 709



 Score = 28.3 bits (63), Expect = 6.0
 Identities = 16/63 (25%), Positives = 26/63 (41%)

Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
           +SE+D   G E E  ++ E  E      KA    L +   L       + E+EE  ++E+
Sbjct: 674 ESESDGEDGEEDEQEDDAEANEGVVPIDKAVRRALPKVLNLPDALDGGDSEDEEGMDDEQ 733

Query: 294 EEE 296
              
Sbjct: 734 MMR 736



 Score = 28.3 bits (63), Expect = 6.1
 Identities = 13/36 (36%), Positives = 20/36 (55%), Gaps = 2/36 (5%)

Query: 267 KLKEKKK--LGGGGGEEEEEEEEEEEEEEEEEEEIG 300
           K  E K        GEEE+E++ EE +++E+E E  
Sbjct: 633 KADENKSRHQQLFEGEEEDEDDLEETDDDEDECEAI 668



 Score = 27.9 bits (62), Expect = 9.2
 Identities = 17/77 (22%), Positives = 32/77 (41%), Gaps = 13/77 (16%)

Query: 236 ENDYCKGGEREVHEEEE-----EEEEEEEETKAKETKL--------KEKKKLGGGGGEEE 282
           + D C+  E    E E      EE+E+E++ +A E  +           K L      + 
Sbjct: 661 DEDECEAIEDSESESESDGEDGEEDEQEDDAEANEGVVPIDKAVRRALPKVLNLPDALDG 720

Query: 283 EEEEEEEEEEEEEEEEI 299
            + E+EE  ++E+   +
Sbjct: 721 GDSEDEEGMDDEQMMRL 737


>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
           [Translation, ribosomal structure and biogenesis].
          Length = 109

 Score = 36.2 bits (84), Expect = 0.004
 Identities = 13/27 (48%), Positives = 15/27 (55%)

Query: 275 GGGGGEEEEEEEEEEEEEEEEEEEIGG 301
           G     E +E EEEE+EEE EEE    
Sbjct: 76  GAEAAAEADEAEEEEKEEEAEEESDDD 102



 Score = 35.5 bits (82), Expect = 0.007
 Identities = 12/33 (36%), Positives = 15/33 (45%)

Query: 269 KEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
                      E +E EEEE+EEE EEE +   
Sbjct: 71  AAAAAGAEAAAEADEAEEEEKEEEAEEESDDDM 103



 Score = 34.3 bits (79), Expect = 0.016
 Identities = 11/26 (42%), Positives = 17/26 (65%)

Query: 276 GGGGEEEEEEEEEEEEEEEEEEEIGG 301
               +E EEEE+EEE EEE ++++  
Sbjct: 80  AAEADEAEEEEKEEEAEEESDDDMLF 105



 Score = 34.3 bits (79), Expect = 0.017
 Identities = 12/27 (44%), Positives = 17/27 (62%)

Query: 275 GGGGGEEEEEEEEEEEEEEEEEEEIGG 301
                E EEEE+EEE EEE +++ + G
Sbjct: 80  AAEADEAEEEEKEEEAEEESDDDMLFG 106



 Score = 28.1 bits (63), Expect = 2.1
 Identities = 11/33 (33%), Positives = 16/33 (48%)

Query: 263 AKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
           A       +        EEEE+EEE EEE +++
Sbjct: 70  AAAAAAGAEAAAEADEAEEEEKEEEAEEESDDD 102


>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
           [Transcription / Chromatin structure and dynamics].
          Length = 279

 Score = 37.7 bits (87), Expect = 0.004
 Identities = 23/66 (34%), Positives = 28/66 (42%)

Query: 233 FQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEE 292
              E      GE      EEEEEE E     ++    E +++    GEEEE EEE     
Sbjct: 192 EDEEVGSDSYGEGNRELNEEEEEEAEGSDDGEDVVDYEGERIDKKQGEEEEMEEEVINLF 251

Query: 293 EEEEEE 298
           E E EE
Sbjct: 252 EIEWEE 257



 Score = 36.1 bits (83), Expect = 0.017
 Identities = 21/66 (31%), Positives = 32/66 (48%)

Query: 236 ENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
           +ND     + +V  EEEE  EE +  + +E +       G G  E  EEEEEE E  ++ 
Sbjct: 163 DNDEAPPAQPDVDNEEEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEEEAEGSDDG 222

Query: 296 EEEIGG 301
           E+ +  
Sbjct: 223 EDVVDY 228



 Score = 34.6 bits (79), Expect = 0.051
 Identities = 21/73 (28%), Positives = 33/73 (45%), Gaps = 13/73 (17%)

Query: 243 GEREVHEEEEEEEEEEEETKAKE--------TKLKEKKKLGGGGGEEEEEEEEEEE---- 290
            E  + E +  EEEE+EE  +           + +E++  G   GE+  + E E      
Sbjct: 178 EEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEEEAEGSDDGEDVVDYEGERIDKKQ 237

Query: 291 -EEEEEEEEIGGV 302
            EEEE EEE+  +
Sbjct: 238 GEEEEMEEEVINL 250



 Score = 31.5 bits (71), Expect = 0.47
 Identities = 13/33 (39%), Positives = 17/33 (51%)

Query: 276 GGGGEEEEEEEEEEEEEEEEEEEIGGVVVSAVD 308
               EEEE  EE +  EEEE+EE+G       +
Sbjct: 173 DVDNEEEERLEESDGREEEEDEEVGSDSYGEGN 205



 Score = 28.4 bits (63), Expect = 4.2
 Identities = 12/49 (24%), Positives = 18/49 (36%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
           E+             ++       +      EEE  EE +  EEEE+EE
Sbjct: 147 EKPRVTRFNIVWDNDEDNDEAPPAQPDVDNEEEERLEESDGREEEEDEE 195



 Score = 28.4 bits (63), Expect = 5.3
 Identities = 23/67 (34%), Positives = 30/67 (44%), Gaps = 11/67 (16%)

Query: 243 GEREVHEEEEEEEE-----------EEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEE 291
           G RE++EEEEEE E           E E    K+ + +E ++      E E EEE   EE
Sbjct: 204 GNRELNEEEEEEAEGSDDGEDVVDYEGERIDKKQGEEEEMEEEVINLFEIEWEEESPSEE 263

Query: 292 EEEEEEE 298
                EE
Sbjct: 264 VPRNNEE 270


>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457).  This is
           a family of uncharacterized proteins.
          Length = 449

 Score = 38.1 bits (88), Expect = 0.005
 Identities = 18/63 (28%), Positives = 33/63 (52%), Gaps = 12/63 (19%)

Query: 248 HEEEEEEEEEEE------ETKAKET------KLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
           HE    E E+++      + K K T      K    +KLG    EE  EEE+++EE++++
Sbjct: 3   HEFASSELEDDDWVRGSLDYKEKLTLNDTMKKENAIRKLGKEAEEEAMEEEDDDEEDDDD 62

Query: 296 EEE 298
           +++
Sbjct: 63  DDD 65



 Score = 33.4 bits (76), Expect = 0.16
 Identities = 14/72 (19%), Positives = 35/72 (48%), Gaps = 14/72 (19%)

Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
            G E E    EEE+++EE++    +              E+E++++++++E++E+E++  
Sbjct: 41  LGKEAEEEAMEEEDDDEEDDDDDDD--------------EDEDDDDDDDDEDDEDEDDDD 86

Query: 301 GVVVSAVDFGSG 312
             +        G
Sbjct: 87  STLHDDSSADDG 98



 Score = 33.4 bits (76), Expect = 0.16
 Identities = 15/74 (20%), Positives = 38/74 (51%), Gaps = 11/74 (14%)

Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVV 304
           R++ +E EEE  EEE+   ++              +++E+E++++++++E++E+      
Sbjct: 39  RKLGKEAEEEAMEEEDDDEED-----------DDDDDDEDEDDDDDDDDEDDEDEDDDDS 87

Query: 305 SAVDFGSGGWWFES 318
           +  D  S     E+
Sbjct: 88  TLHDDSSADDGNET 101


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 728

 Score = 38.1 bits (89), Expect = 0.005
 Identities = 18/50 (36%), Positives = 29/50 (58%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           E E EEE +EEE +    K   ++K G   GE+E E ++ ++E + E +E
Sbjct: 400 ELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKE 449



 Score = 38.1 bits (89), Expect = 0.006
 Identities = 20/72 (27%), Positives = 33/72 (45%), Gaps = 11/72 (15%)

Query: 242 GGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEE-----------EEE 290
             E E  + EEE+E+++E+    E     +KKLG     E++EE             E  
Sbjct: 326 EDEDEDSDSEEEDEDDDEDDDDGENPWMLRKKLGKLKEGEDDEENSGLLSMKFMQRAEAR 385

Query: 291 EEEEEEEEIGGV 302
           ++EE + EI  +
Sbjct: 386 KKEENDAEIEEL 397



 Score = 37.0 bits (86), Expect = 0.012
 Identities = 15/55 (27%), Positives = 28/55 (50%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
             E+  E E EEE +EE   + +K    ++  G    E+E E ++ ++E + E +
Sbjct: 394 IEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFK 448



 Score = 37.0 bits (86), Expect = 0.013
 Identities = 17/78 (21%), Positives = 37/78 (47%), Gaps = 11/78 (14%)

Query: 235 SENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEE---------- 284
           S ++  +  + +  EE+E+++E++++ +      K+  KL  G  +EE            
Sbjct: 322 SVSEEDEDEDSDSEEEDEDDDEDDDDGENPWMLRKKLGKLKEGEDDEENSGLLSMKFMQR 381

Query: 285 -EEEEEEEEEEEEEEIGG 301
            E  ++EE + E EE+  
Sbjct: 382 AEARKKEENDAEIEELRR 399



 Score = 35.8 bits (83), Expect = 0.030
 Identities = 19/47 (40%), Positives = 27/47 (57%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEE 290
           + +   +EEEE E+EEE K ++   K  K+      EEEEEE +EE 
Sbjct: 448 KEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDEEN 494



 Score = 34.6 bits (80), Expect = 0.076
 Identities = 24/74 (32%), Positives = 37/74 (50%), Gaps = 8/74 (10%)

Query: 243 GEREVHEEEEEEEEEEEETK--------AKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
           GE E  EEE EE  ++   +         KE + K+ KK      +E++E +EEEE E+E
Sbjct: 403 GEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDE 462

Query: 295 EEEEIGGVVVSAVD 308
           EE ++  V    + 
Sbjct: 463 EEAKVEKVANKLLK 476



 Score = 32.7 bits (75), Expect = 0.24
 Identities = 16/47 (34%), Positives = 29/47 (61%), Gaps = 1/47 (2%)

Query: 252 EEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           EE  +  EE   +  +L+ +K  G    EE+E+E+ + EEE+E+++E
Sbjct: 298 EEARKALEEQLRQGEELR-RKIEGKSVSEEDEDEDSDSEEEDEDDDE 343



 Score = 31.2 bits (71), Expect = 0.72
 Identities = 17/61 (27%), Positives = 24/61 (39%), Gaps = 4/61 (6%)

Query: 243 GEREVHEEEEEEEEEEEETK----AKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
             +    E++EE       K    A+  K +E          E E EEE +EEE EE  +
Sbjct: 358 LGKLKEGEDDEENSGLLSMKFMQRAEARKKEENDAEIEELRRELEGEEESDEEENEEPSK 417

Query: 299 I 299
            
Sbjct: 418 K 418



 Score = 30.8 bits (70), Expect = 1.1
 Identities = 16/50 (32%), Positives = 25/50 (50%), Gaps = 8/50 (16%)

Query: 251 EEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
           +  E  ++EE  A      E ++L      E EEE +EEE EE  ++ +G
Sbjct: 380 QRAEARKKEENDA------EIEELRREL--EGEEESDEEENEEPSKKNVG 421



 Score = 29.3 bits (66), Expect = 3.6
 Identities = 17/49 (34%), Positives = 28/49 (57%)

Query: 251 EEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           +EE+E++EE     + K   K+   G     E E+E++E  EEE+ +EI
Sbjct: 563 DEEDEDDEELPFLFKQKDLIKEAFAGDDVVAEFEKEKKEVIEEEDPKEI 611


>gnl|CDD|220688 pfam10317, 7TM_GPCR_Srd, Serpentine type 7TM GPCR chemoreceptor
           Srd.  Chemoreception is mediated in Caenorhabditis
           elegans by members of the seven-transmembrane
           G-protein-coupled receptor class (7TM GPCRs) of proteins
           which are of the serpentine type. Srd is part of the
           larger Str superfamily of chemoreceptors.
           Chemoperception is one of the central senses of soil
           nematodes like C. elegans which are otherwise 'blind'
           and 'deaf'.
          Length = 293

 Score = 37.5 bits (88), Expect = 0.005
 Identities = 29/151 (19%), Positives = 57/151 (37%), Gaps = 32/151 (21%)

Query: 61  VSIIVPIFFGIIIILGFFGNALVVVVVAFNQQMRSTTNL-----LIINLAIADLLFIVCC 115
            SI  PIFF    ILG   N L++ ++ F    +S  +L     L++N A+  ++  +  
Sbjct: 1   FSIYYPIFF----ILGLILNLLLLYLIIF----KSPKSLRTYRILLLNTALTQIISCILA 52

Query: 116 VPFTATDYVTTCWP-----------FGDVWCK-MYLIVVTAY-ASVYTLVLMSLDRFLAV 162
             FT    +                FG  +C   Y +++     S ++L+L    R+  +
Sbjct: 53  F-FTQQRIIPNGVSLALISYGPCKYFGPWFCYSGYSLLLHFLLHSGWSLLLTFYYRYYIL 111

Query: 163 VHPIASMSVRTERNALKAILITWIVIVLTAV 193
            H          +  +    + +I  +   +
Sbjct: 112 KHV-----DPKRKKLILIFFLHYIPSLSQLL 137


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 37.9 bits (88), Expect = 0.005
 Identities = 23/125 (18%), Positives = 55/125 (44%), Gaps = 15/125 (12%)

Query: 178 LKAILITWIVIVLTAVPVFNMHGEVSYTYASELHTACVFLDLDPVNRPDGYNKTIFQSEN 237
           L ++L+  +++ L  +       +       E+  A +   +DP       N+ I Q + 
Sbjct: 5   LLSLLLHILLLGLLILGSLYHSVKPEPGGGGEIIQAVL---VDPGAVAQQANR-IQQQKK 60

Query: 238 DYCKGGEREVHEEEEEEEEEEEETKAKE----TKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
                 E+E  ++ E++ EE E+ +A E     +L+++         E+  ++ E+  ++
Sbjct: 61  P-AAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAA------AEKAAKQAEQAAKQ 113

Query: 294 EEEEE 298
            EE++
Sbjct: 114 AEEKQ 118



 Score = 31.7 bits (72), Expect = 0.42
 Identities = 17/55 (30%), Positives = 27/55 (49%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           E E   +EE +++ EEE KAK     +KK        E E + + E + + + EE
Sbjct: 139 EAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEE 193



 Score = 30.6 bits (69), Expect = 1.2
 Identities = 17/58 (29%), Positives = 28/58 (48%), Gaps = 6/58 (10%)

Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           K  E +  + EE + ++  E KAK     EKK       +EE +++ EEE + +   E
Sbjct: 112 KQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKA------KEEAKKQAEEEAKAKAAAE 163



 Score = 30.2 bits (68), Expect = 1.4
 Identities = 14/53 (26%), Positives = 28/53 (52%)

Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
            +  EE++++ EE +  +A E K K + +      EE +++ EEE + +   E
Sbjct: 111 AKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAE 163


>gnl|CDD|239286 cd02988, Phd_like_VIAF, Phosducin (Phd)-like family, Viral
           inhibitor of apoptosis (IAP)-associated factor (VIAF)
           subfamily; VIAF is a Phd-like protein that functions in
           caspase activation during apoptosis. It was identified
           as an IAP binding protein through a screen of a human
           B-cell library using a prototype IAP. VIAF lacks a
           consensus IAP binding motif and while it does not
           function as an IAP antagonist, it still plays a
           regulatory role in the complete activation of caspases.
           VIAF itself is a substrate for IAP-mediated
           ubiquitination, suggesting that it may be a target of
           IAPs in the prevention of cell death. The similarity of
           VIAF to Phd points to a potential role distinct from
           apoptosis regulation. Phd functions as a cytosolic
           regulator of G protein by specifically binding to G
           protein betagamma (Gbg)-subunits. The C-terminal domain
           of Phd adopts a thioredoxin fold, but it does not
           contain a CXXC motif. Phd interacts with G protein beta
           mostly through the N-terminal helical domain.
          Length = 192

 Score = 36.9 bits (86), Expect = 0.006
 Identities = 17/51 (33%), Positives = 26/51 (50%), Gaps = 5/51 (9%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
             +EEEEE  E    +  +   +KKL      +E +EE +EEE++   EE 
Sbjct: 19  SPKEEEEEALELAIQEAHENALEKKL-----LDELDEELDEEEDDRFLEEY 64



 Score = 35.7 bits (83), Expect = 0.014
 Identities = 16/55 (29%), Positives = 24/55 (43%), Gaps = 7/55 (12%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
            +    +EEEEE  E   +       EKK         +E +EE +EEE++   E
Sbjct: 15  PKPPSPKEEEEEALELAIQEAHENALEKK-------LLDELDEELDEEEDDRFLE 62


>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
           Provisional.
          Length = 2849

 Score = 38.1 bits (88), Expect = 0.006
 Identities = 23/88 (26%), Positives = 43/88 (48%), Gaps = 24/88 (27%)

Query: 218 DLDPVNRPDGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGG 277
           DLD +       + +  +E D        + +++E+E+E++++                 
Sbjct: 126 DLDMIIIKRRRARHL--AEEDMSPRDNFVIDDDDEDEDEDDDD----------------- 166

Query: 278 GGEEEEEEEEEEEE----EEEEEEEIGG 301
             E++EEEEEEEEE    ++E+EE+ GG
Sbjct: 167 -EEDDEEEEEEEEEIKGFDDEDEEDEGG 193



 Score = 33.1 bits (75), Expect = 0.22
 Identities = 15/50 (30%), Positives = 30/50 (60%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
           E E  ++++EE++EEEE + +E K  + +     GGE+   E+ E ++ +
Sbjct: 158 EDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGEDFTYEKSEVDKTD 207



 Score = 31.2 bits (70), Expect = 0.97
 Identities = 18/55 (32%), Positives = 29/55 (52%), Gaps = 12/55 (21%)

Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
            + +  +EE++EEEEEEE + K            G  +E+EE+E  E+   E+ E
Sbjct: 160 EDEDDDDEEDDEEEEEEEEEIK------------GFDDEDEEDEGGEDFTYEKSE 202


>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa).  Members of this
           family of proteins are part of the yeast nuclear pore
           complex-associated pre-60S ribosomal subunit. The family
           functions as a highly conserved exonuclease that is
           required for the 5'-end maturation of 5.8S and 25S
           rRNAs, demonstrating that 5'-end processing also has a
           redundant pathway. Nop25 binds late pre-60S ribosomes,
           accompanying them from the nucleolus to the nuclear
           periphery; and there is evidence for both physical and
           functional links between late 60S subunit processing and
           export.
          Length = 134

 Score = 36.2 bits (84), Expect = 0.006
 Identities = 20/61 (32%), Positives = 34/61 (55%), Gaps = 7/61 (11%)

Query: 245 REVHEEEEEEEEEEEETKAKETK-------LKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
           +E  +E+E EE  EE  + +E +       LKE+K+      EE ++EE+ E E+ E+ E
Sbjct: 39  QEEAKEKEREERIEERKRIREERKQELEKQLKERKEALKLLEEENDDEEDAETEDTEDVE 98

Query: 298 E 298
           +
Sbjct: 99  D 99



 Score = 35.4 bits (82), Expect = 0.009
 Identities = 17/65 (26%), Positives = 29/65 (44%)

Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVV 304
           +E  E  +  EEE ++ +  ET+  E  +     G  E    + EEE  +E++    V V
Sbjct: 70  KERKEALKLLEEENDDEEDAETEDTEDVEDDEWEGFPEPTVTDYEEEYIDEDKYKTTVTV 129

Query: 305 SAVDF 309
             +D 
Sbjct: 130 EELDP 134



 Score = 30.8 bits (70), Expect = 0.39
 Identities = 14/69 (20%), Positives = 32/69 (46%), Gaps = 21/69 (30%)

Query: 253 EEEEEEEETKAKETKLKEKKKLG---------------------GGGGEEEEEEEEEEEE 291
           ++ +EE + K +E +++E+K++                          ++EE+ E E+ E
Sbjct: 36  KKAQEEAKEKEREERIEERKRIREERKQELEKQLKERKEALKLLEEENDDEEDAETEDTE 95

Query: 292 EEEEEEEIG 300
           + E++E  G
Sbjct: 96  DVEDDEWEG 104


>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
           Genome duplication is precisely regulated by
           cyclin-dependent kinases CDKs, which bring about the
           onset of S phase by activating replication origins and
           then prevent relicensing of origins until mitosis is
           completed. The optimum sequence motif for CDK
           phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
           to have at least 11 potential phosphorylation sites.
           Drc1 is required for DNA synthesis and S-M replication
           checkpoint control. Drc1 associates with Cdc2 and is
           phosphorylated at the onset of S phase when Cdc2 is
           activated. Thus Cdc2 promotes DNA replication by
           phosphorylating Drc1 and regulating its association with
           Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
           substrates required for DNA replication.
          Length = 397

 Score = 37.9 bits (88), Expect = 0.006
 Identities = 19/60 (31%), Positives = 30/60 (50%), Gaps = 6/60 (10%)

Query: 245 REVHEEEEEEE------EEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           R V  +  +E         EE  K  E  L E     GG  E++E+E++EE +EE E+++
Sbjct: 299 RPVRAKPSDEPSLPESDIHEEIPKLDEKSLSEFLGYMGGIDEDDEDEDDEESKEEVEKKQ 358



 Score = 28.2 bits (63), Expect = 5.7
 Identities = 13/37 (35%), Positives = 20/37 (54%)

Query: 237 NDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKK 273
             Y  G + +  +E++EE +EE E K K  K   K+K
Sbjct: 332 LGYMGGIDEDDEDEDDEESKEEVEKKQKVKKKPRKRK 368


>gnl|CDD|237063 PRK12329, nusA, transcription elongation factor NusA; Provisional.
          Length = 449

 Score = 37.8 bits (88), Expect = 0.006
 Identities = 21/55 (38%), Positives = 30/55 (54%), Gaps = 1/55 (1%)

Query: 244 EREVHEEEEEEEEEEEETKAKET-KLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
           E  +  E EE  E E+  +A+E  +L+E   L     E+E+E EE + EEEEE  
Sbjct: 395 EEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDELEEAQPEEEEEAR 449



 Score = 37.5 bits (87), Expect = 0.007
 Identities = 20/55 (36%), Positives = 28/55 (50%), Gaps = 5/55 (9%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           +RE  E  E E+ E  E  A+  +L    +      E E+E+E EE + EEEEE 
Sbjct: 399 QREAEERLEAEQAERAEEDARLRELYPLPE-----DEFEDEDELEEAQPEEEEEA 448



 Score = 35.9 bits (83), Expect = 0.025
 Identities = 16/55 (29%), Positives = 26/55 (47%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
            +   EE  + E EE     +  + +E  +L       E+E E+E+E EE + EE
Sbjct: 390 SQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDELEEAQPEE 444



 Score = 34.8 bits (80), Expect = 0.058
 Identities = 19/55 (34%), Positives = 28/55 (50%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           E E   + E EE  E E   +  +    ++L     +E E+E+E EE + EEEEE
Sbjct: 393 EEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDELEEAQPEEEEE 447



 Score = 34.8 bits (80), Expect = 0.058
 Identities = 17/65 (26%), Positives = 25/65 (38%)

Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
           Q   D          EEEE  + E EE    E   + ++           E+E E+E+E 
Sbjct: 378 QEAEDAKVAELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDEL 437

Query: 294 EEEEE 298
           EE + 
Sbjct: 438 EEAQP 442



 Score = 33.2 bits (76), Expect = 0.17
 Identities = 16/73 (21%), Positives = 30/73 (41%)

Query: 229 NKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEE 288
           +   +  E +  K  E     EEEE  + E E + +  + +  ++           E+E 
Sbjct: 372 DSAEYDQEAEDAKVAELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEF 431

Query: 289 EEEEEEEEEEIGG 301
           E+E+E EE +   
Sbjct: 432 EDEDELEEAQPEE 444


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 36.6 bits (85), Expect = 0.006
 Identities = 15/53 (28%), Positives = 34/53 (64%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
           E+   E EE+++ + ++ K+K+ K K+K K      ++ E+++E+E E++ E+
Sbjct: 72  EKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLED 124



 Score = 32.4 bits (74), Expect = 0.17
 Identities = 11/55 (20%), Positives = 36/55 (65%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           +++  E  EE E+ ++E + K+    +KKK      +++++++++++++ E+++E
Sbjct: 61  KKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDE 115



 Score = 27.7 bits (62), Expect = 6.6
 Identities = 8/54 (14%), Positives = 34/54 (62%), Gaps = 4/54 (7%)

Query: 249 EEEEEEEEEEEETKAKETKLK----EKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
            E  E +++++E   +  K+K    EK+K      + +++++++++++++++++
Sbjct: 55  AEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDD 108



 Score = 27.3 bits (61), Expect = 8.1
 Identities = 9/50 (18%), Positives = 28/50 (56%), Gaps = 6/50 (12%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           ++ +++++++++ K  +   K +KK      +E+E E++ E+  +   E 
Sbjct: 89  KKSKKKKDKDKDKKDDKKDDKSEKK------DEKEAEDKLEDLTKSYSET 132


>gnl|CDD|227701 COG5414, COG5414, TATA-binding protein-associated factor
           [Transcription].
          Length = 392

 Score = 37.8 bits (87), Expect = 0.006
 Identities = 13/46 (28%), Positives = 23/46 (50%)

Query: 253 EEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           E E +E     KE + +E +       E + +  +E  EE+EE++E
Sbjct: 283 EIENKEVSEGDKEQQQEEVENAEAHKEEVQSDRPDEIGEEKEEDDE 328



 Score = 35.4 bits (81), Expect = 0.029
 Identities = 13/48 (27%), Positives = 24/48 (50%)

Query: 251 EEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           E E +E  E  K ++ +  E  +      + +  +E  EE+EE++E E
Sbjct: 283 EIENKEVSEGDKEQQQEEVENAEAHKEEVQSDRPDEIGEEKEEDDENE 330



 Score = 33.5 bits (76), Expect = 0.12
 Identities = 14/51 (27%), Positives = 25/51 (49%), Gaps = 2/51 (3%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
            E +E  E ++E + +E +  E          +  +E  EE+EE++E EE 
Sbjct: 284 IENKEVSEGDKEQQQEEVENAEAH--KEEVQSDRPDEIGEEKEEDDENEEN 332



 Score = 32.4 bits (73), Expect = 0.29
 Identities = 17/61 (27%), Positives = 26/61 (42%), Gaps = 4/61 (6%)

Query: 241 KGGEREVHEEEEEEE----EEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
           +G E E  E   EE+      E E K      KE+++      E  +EE + +  +E  E
Sbjct: 262 QGAEEEGEEGMSEEDLDVGAAEIENKEVSEGDKEQQQEEVENAEAHKEEVQSDRPDEIGE 321

Query: 297 E 297
           E
Sbjct: 322 E 322



 Score = 29.3 bits (65), Expect = 3.2
 Identities = 14/55 (25%), Positives = 21/55 (38%), Gaps = 19/55 (34%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           E E  E  +EE + +                     +E  EE+EE++E EE E  
Sbjct: 300 EVENAEAHKEEVQSDRP-------------------DEIGEEKEEDDENEENERH 335



 Score = 28.9 bits (64), Expect = 3.4
 Identities = 18/63 (28%), Positives = 29/63 (46%), Gaps = 6/63 (9%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEE------EEEEEEEEE 297
           ++E    EEE EE   E        + + K    G +E+++EE E      EE + +  +
Sbjct: 258 KKEKQGAEEEGEEGMSEEDLDVGAAEIENKEVSEGDKEQQQEEVENAEAHKEEVQSDRPD 317

Query: 298 EIG 300
           EIG
Sbjct: 318 EIG 320



 Score = 28.5 bits (63), Expect = 4.8
 Identities = 12/56 (21%), Positives = 25/56 (44%)

Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
            E    E + +  +E  E K ++ + +E ++      +E  E E+  EE+  + E 
Sbjct: 304 AEAHKEEVQSDRPDEIGEEKEEDDENEENERHTELLADELNELEKGIEEKRRQMES 359



 Score = 28.1 bits (62), Expect = 6.3
 Identities = 9/32 (28%), Positives = 16/32 (50%)

Query: 236 ENDYCKGGEREVHEEEEEEEEEEEETKAKETK 267
           EN      E +    +E  EE+EE+ + +E +
Sbjct: 302 ENAEAHKEEVQSDRPDEIGEEKEEDDENEENE 333


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 37.5 bits (87), Expect = 0.007
 Identities = 16/51 (31%), Positives = 33/51 (64%), Gaps = 6/51 (11%)

Query: 251 EEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
           +E +  EE E KAKE KLK+++       E++++E+ +E++E+ +++E   
Sbjct: 390 DETDASEEAEAKAKEEKLKQEEN------EKKQKEQADEDKEKRQKDERKK 434



 Score = 36.7 bits (85), Expect = 0.014
 Identities = 13/53 (24%), Positives = 32/53 (60%), Gaps = 4/53 (7%)

Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           +  ++  E ++E + ++  E K KE+K       EE E++++E+ +E++E+ +
Sbjct: 380 DYTKKLGEVKDETDASEEAEAKAKEEK----LKQEENEKKQKEQADEDKEKRQ 428



 Score = 32.5 bits (74), Expect = 0.30
 Identities = 10/50 (20%), Positives = 27/50 (54%), Gaps = 6/50 (12%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           ++  ++  E ++      + + K K      EE+ ++EE E++++E+ +E
Sbjct: 379 QDYTKKLGEVKDETDASEEAEAKAK------EEKLKQEENEKKQKEQADE 422



 Score = 32.5 bits (74), Expect = 0.31
 Identities = 12/39 (30%), Positives = 25/39 (64%)

Query: 235 SENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKK 273
           SE    K  E ++ +EE E++++E+  + KE + K+++K
Sbjct: 395 SEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERK 433



 Score = 30.9 bits (70), Expect = 0.78
 Identities = 12/42 (28%), Positives = 22/42 (52%)

Query: 255 EEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
           ++ +E  K    KL++  K  G   +E +  EE E + +EE+
Sbjct: 365 DKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEK 406



 Score = 30.5 bits (69), Expect = 1.1
 Identities = 13/50 (26%), Positives = 26/50 (52%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           + +E  +E  ++ +    KL E K       E E + +EE+ ++EE E++
Sbjct: 366 KRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKK 415



 Score = 30.1 bits (68), Expect = 1.7
 Identities = 12/38 (31%), Positives = 23/38 (60%)

Query: 236 ENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKK 273
           E D  +  E +  EE+ ++EE E++ K +  + KEK++
Sbjct: 391 ETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQ 428



 Score = 29.4 bits (66), Expect = 2.8
 Identities = 13/71 (18%), Positives = 37/71 (52%), Gaps = 1/71 (1%)

Query: 228 YNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEE 287
             K   + +++    G++     +E  ++ ++ TK K  ++K++         + +EE+ 
Sbjct: 349 LTKLYEEVKSNTDLSGDKRQELLKEYNKKLQDYTK-KLGEVKDETDASEEAEAKAKEEKL 407

Query: 288 EEEEEEEEEEE 298
           ++EE E++++E
Sbjct: 408 KQEENEKKQKE 418


>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6.  The surfeit locus
           protein SURF-6 is shown to be a component of the
           nucleolar matrix and has a strong binding capacity for
           nucleic acids.
          Length = 206

 Score = 36.9 bits (86), Expect = 0.007
 Identities = 20/79 (25%), Positives = 36/79 (45%), Gaps = 15/79 (18%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI---- 299
            RE   E+   + E+ + + K+ + + KKK      E+ ++ E EE + EE + +     
Sbjct: 3   SREALLEQRRRKREQRKARKKQKRKEAKKK------EDAQKSEAEEVKNEENKSKKKAAP 56

Query: 300 -----GGVVVSAVDFGSGG 313
                G +V S V+F  G 
Sbjct: 57  IENAEGNIVFSKVEFADGE 75



 Score = 32.3 bits (74), Expect = 0.23
 Identities = 11/47 (23%), Positives = 24/47 (51%), Gaps = 6/47 (12%)

Query: 252 EEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
              E   E+ + K  + K +KK      ++ +E +++E+ ++ E EE
Sbjct: 2   SSREALLEQRRRKREQRKARKK------QKRKEAKKKEDAQKSEAEE 42


>gnl|CDD|220376 pfam09745, DUF2040, Coiled-coil domain-containing protein 55
           (DUF2040).  This entry is a conserved domain of
           approximately 130 residues of proteins conserved from
           fungi to humans. The proteins do contain a coiled-coil
           domain, but the function is unknown.
          Length = 128

 Score = 35.8 bits (83), Expect = 0.007
 Identities = 26/60 (43%), Positives = 35/60 (58%), Gaps = 10/60 (16%)

Query: 244 EREVHEE---EEEEEEEEEETKAKETKLKE--KKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           ERE+ EE   ++E E+E +E   KE  +    KK+L     EE  + EEEE+E EE EEE
Sbjct: 64  EREIAEERKLQKEREKEGDEFADKEKFVTSAYKKQL-----EENRKLEEEEKEREELEEE 118


>gnl|CDD|100109 cd05831, Ribosomal_P1, Ribosomal protein P1. This subfamily
           represents the eukaryotic large ribosomal protein P1.
           Eukaryotic P1 and P2 are functionally equivalent to the
           bacterial protein L7/L12, but are not homologous to
           L7/L12. P1 is located in the L12 stalk, with proteins
           P2, P0, L11, and 28S rRNA. P1 and P2 are the only
           proteins in the ribosome to occur as multimers, always
           appearing as sets of heterodimers. Recent data indicate
           that eukaryotes have four copies (two heterodimers),
           while most archaeal species contain six copies of L12p
           (three homodimers) and bacteria may have four or six
           copies (two or three homodimers), depending on the
           species. Experiments using S. cerevisiae P1 and P2
           indicate that P1 proteins are positioned more internally
           with limited reactivity in the C-terminal domains, while
           P2 proteins seem to be more externally located and are
           more likely to interact with other cellular components.
           In lower eukaryotes, P1 and P2 are further subdivided
           into P1A, P1B, P2A, and P2B, which form P1A/P2B and
           P1B/P2A heterodimers. Some plant species have a third
           P-protein, called P3, which is not homologous to P1 and
           P2. In humans, P1 and P2 are strongly autoimmunogenic.
           They play a significant role in the etiology and
           pathogenesis of systemic lupus erythema (SLE). In
           addition, the ribosome-inactivating protein
           trichosanthin (TCS) interacts with human P0, P1, and P2,
           with its primary binding site located in the C-terminal
           region of P2. TCS inactivates the ribosome by
           depurinating a specific adenine in the sarcin-ricin loop
           of 28S rRNA.
          Length = 103

 Score = 35.4 bits (82), Expect = 0.007
 Identities = 10/26 (38%), Positives = 18/26 (69%)

Query: 275 GGGGGEEEEEEEEEEEEEEEEEEEIG 300
                 E ++EE++EEEEEE ++++G
Sbjct: 74  AAAAAAEAKKEEKKEEEEEESDDDMG 99



 Score = 34.6 bits (80), Expect = 0.011
 Identities = 10/27 (37%), Positives = 16/27 (59%)

Query: 275 GGGGGEEEEEEEEEEEEEEEEEEEIGG 301
                  E ++EE++EEEEEE ++  G
Sbjct: 73  AAAAAAAEAKKEEKKEEEEEESDDDMG 99



 Score = 31.1 bits (71), Expect = 0.22
 Identities = 9/24 (37%), Positives = 14/24 (58%)

Query: 275 GGGGGEEEEEEEEEEEEEEEEEEE 298
                   E ++EE++EEEEEE +
Sbjct: 72  AAAAAAAAEAKKEEKKEEEEEESD 95



 Score = 29.6 bits (67), Expect = 0.63
 Identities = 13/38 (34%), Positives = 17/38 (44%), Gaps = 6/38 (15%)

Query: 275 GGGGGEEEEEEEEEEEEEEEEEEEIGGVVVSAVDFGSG 312
                     E ++EE++EEEEEE      S  D G G
Sbjct: 70  AAAAAAAAAAEAKKEEKKEEEEEE------SDDDMGFG 101


>gnl|CDD|217861 pfam04050, Upf2, Up-frameshift suppressor 2.  Transcripts
           harbouring premature signals for translation termination
           are recognised and rapidly degraded by eukaryotic cells
           through a pathway known as nonsense-mediated mRNA decay.
           In Saccharomyces cerevisiae, three trans-acting factors
           (Upf1 to Upf3) are required for nonsense-mediated mRNA
           decay.
          Length = 171

 Score = 36.2 bits (84), Expect = 0.008
 Identities = 16/51 (31%), Positives = 27/51 (52%)

Query: 248 HEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
            EE  EE+E++E +  +E  L + ++      EEE+     +EEE + E E
Sbjct: 13  DEELPEEDEDDESSDEEEVDLPDDEQDEESDSEEEQIFVTRQEEEVDPEAE 63



 Score = 30.4 bits (69), Expect = 0.80
 Identities = 11/20 (55%), Positives = 16/20 (80%)

Query: 279 GEEEEEEEEEEEEEEEEEEE 298
           GEE+EE  EE+E++E  +EE
Sbjct: 10  GEEDEELPEEDEDDESSDEE 29



 Score = 30.4 bits (69), Expect = 0.82
 Identities = 13/57 (22%), Positives = 27/57 (47%), Gaps = 11/57 (19%)

Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
              +  E+EE  EE+E++  + E              +  ++E++EE + EEE+  +
Sbjct: 6   ESDDGEEDEELPEEDEDDESSDE-----------EEVDLPDDEQDEESDSEEEQIFV 51



 Score = 30.0 bits (68), Expect = 1.0
 Identities = 17/51 (33%), Positives = 29/51 (56%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           ++ EE+EE  EE +  E+  +E+  L     +EE + EEE+     +EEE+
Sbjct: 8   DDGEEDEELPEEDEDDESSDEEEVDLPDDEQDEESDSEEEQIFVTRQEEEV 58



 Score = 29.3 bits (66), Expect = 1.8
 Identities = 13/69 (18%), Positives = 25/69 (36%), Gaps = 18/69 (26%)

Query: 235 SENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
             +D  +  E    +E++E  +EEE                      ++E++EE + EEE
Sbjct: 6   ESDDGEEDEELPEEDEDDESSDEEEVD------------------LPDDEQDEESDSEEE 47

Query: 295 EEEEIGGVV 303
           +        
Sbjct: 48  QIFVTRQEE 56


>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein.  This is a family of
           fungal and plant proteins and contains many hypothetical
           proteins. VID27 is a cytoplasmic protein that plays a
           potential role in vacuolar protein degradation.
          Length = 794

 Score = 37.4 bits (87), Expect = 0.008
 Identities = 21/54 (38%), Positives = 32/54 (59%), Gaps = 6/54 (11%)

Query: 253 EEEEEEEETKAKETKLK------EKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
           E   +++ TKAKET+           ++     E ++EEEE+EEEEEEE+E+ G
Sbjct: 354 ETLNKQKWTKAKETEQDYILDAFSALEIEDANTERDDEEEEDEEEEEEEDEDEG 407



 Score = 34.0 bits (78), Expect = 0.11
 Identities = 20/57 (35%), Positives = 29/57 (50%), Gaps = 13/57 (22%)

Query: 242 GGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
             ER+  EEE+EEEEEEE+                G  +E  ++EE EE++ E + E
Sbjct: 385 NTERDDEEEEDEEEEEEEDE-------------DEGPSKEHSDDEEFEEDDVESKYE 428



 Score = 32.0 bits (73), Expect = 0.45
 Identities = 12/20 (60%), Positives = 16/20 (80%)

Query: 280 EEEEEEEEEEEEEEEEEEEI 299
           EE+EEEEEEE+E+E   +E 
Sbjct: 393 EEDEEEEEEEDEDEGPSKEH 412



 Score = 32.0 bits (73), Expect = 0.50
 Identities = 11/22 (50%), Positives = 15/22 (68%)

Query: 279 GEEEEEEEEEEEEEEEEEEEIG 300
            E+EEEEEEE+E+E   +E   
Sbjct: 393 EEDEEEEEEEDEDEGPSKEHSD 414



 Score = 28.6 bits (64), Expect = 5.2
 Identities = 20/73 (27%), Positives = 32/73 (43%), Gaps = 17/73 (23%)

Query: 226 DGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEE 285
           D ++    +  N      E E  EEEEEE+E+E  +K                 E  ++E
Sbjct: 374 DAFSALEIEDANTERDDEEEEDEEEEEEEDEDEGPSK-----------------EHSDDE 416

Query: 286 EEEEEEEEEEEEE 298
           E EE++ E + E+
Sbjct: 417 EFEEDDVESKYED 429


>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein.  This family includes proteins
           related to Mpp10 (M phase phosphoprotein 10). The U3
           small nucleolar ribonucleoprotein (snoRNP) is required
           for three cleavage events that generate the mature 18S
           rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
           depletion of Mpp10, a U3 snoRNP-specific protein, halts
           18S rRNA production and impairs cleavage at the three U3
           snoRNP-dependent sites.
          Length = 613

 Score = 37.3 bits (86), Expect = 0.009
 Identities = 16/57 (28%), Positives = 29/57 (50%)

Query: 242 GGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
             E    +EEEEEE+E  E +  + + +           E+  ++E E++EE++ EE
Sbjct: 110 DSEDSADDEEEEEEDESLEDEMIDDEDEADLFNESESSLEDLSDDETEDDEEKKMEE 166



 Score = 36.5 bits (84), Expect = 0.019
 Identities = 17/53 (32%), Positives = 28/53 (52%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
           EEEEEE+E  E+    +    +         E+  ++E E++EE++ EEE  G
Sbjct: 118 EEEEEEDESLEDEMIDDEDEADLFNESESSLEDLSDDETEDDEEKKMEEEEAG 170



 Score = 35.7 bits (82), Expect = 0.033
 Identities = 16/56 (28%), Positives = 30/56 (53%)

Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           G     E+  ++EEEEEE ++ E ++ + +       E E   E+  ++E E++EE
Sbjct: 106 GSDMDSEDSADDEEEEEEDESLEDEMIDDEDEADLFNESESSLEDLSDDETEDDEE 161



 Score = 34.6 bits (79), Expect = 0.065
 Identities = 9/68 (13%), Positives = 25/68 (36%)

Query: 231 TIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEE 290
            +     D     +     + + E+  ++E + +E +  E + +      +   E E   
Sbjct: 89  RVLAKLQDSESHEDGSDGSDMDSEDSADDEEEEEEDESLEDEMIDDEDEADLFNESESSL 148

Query: 291 EEEEEEEE 298
           E+  ++E 
Sbjct: 149 EDLSDDET 156



 Score = 34.2 bits (78), Expect = 0.095
 Identities = 12/56 (21%), Positives = 22/56 (39%)

Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           G      + E+  ++EEE +  E+   E            E E   E+  ++E E+
Sbjct: 103 GSDGSDMDSEDSADDEEEEEEDESLEDEMIDDEDEADLFNESESSLEDLSDDETED 158



 Score = 34.2 bits (78), Expect = 0.100
 Identities = 13/49 (26%), Positives = 23/49 (46%)

Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
             E    +E      + +  E  + G  G + + E+  ++EEEEEE+E 
Sbjct: 78  YVEFLINKEHIRVLAKLQDSESHEDGSDGSDMDSEDSADDEEEEEEDES 126



 Score = 33.4 bits (76), Expect = 0.15
 Identities = 15/64 (23%), Positives = 33/64 (51%)

Query: 235 SENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
           S  D   G + +  +  ++EEEEEE+   ++  + ++ +       E   E+  ++E E+
Sbjct: 99  SHEDGSDGSDMDSEDSADDEEEEEEDESLEDEMIDDEDEADLFNESESSLEDLSDDETED 158

Query: 295 EEEE 298
           +EE+
Sbjct: 159 DEEK 162



 Score = 33.4 bits (76), Expect = 0.17
 Identities = 22/74 (29%), Positives = 32/74 (43%), Gaps = 4/74 (5%)

Query: 232 IFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEE----EEEEE 287
             +S  +     E E  EE++ EEEE  E K    +   +KK    G +++    +E  E
Sbjct: 143 ESESSLEDLSDDETEDDEEKKMEEEEAGEEKESVEQATREKKFDKSGVDDKFFKLDEMNE 202

Query: 288 EEEEEEEEEEEIGG 301
             E  E EEE   G
Sbjct: 203 FLEATEAEEEAALG 216



 Score = 32.7 bits (74), Expect = 0.29
 Identities = 14/56 (25%), Positives = 30/56 (53%)

Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
            + E   ++EEEEEE+E  + +    +++  L        E+  ++E E++EE++ 
Sbjct: 109 MDSEDSADDEEEEEEDESLEDEMIDDEDEADLFNESESSLEDLSDDETEDDEEKKM 164



 Score = 30.7 bits (69), Expect = 1.2
 Identities = 12/66 (18%), Positives = 29/66 (43%)

Query: 233 FQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEE 292
            + E+D       +   + + EEE+EE+ + ++ + +E+         + +E   E  + 
Sbjct: 273 AELEDDEPDKEAVKKEADSKPEEEDEEDDEQEDDQDEEEPPEAAMDKVKLDEPVLEGVDL 332

Query: 293 EEEEEE 298
           E  +E 
Sbjct: 333 ESPKEL 338



 Score = 29.6 bits (66), Expect = 2.3
 Identities = 18/67 (26%), Positives = 34/67 (50%), Gaps = 7/67 (10%)

Query: 247 VHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVSA 306
             + E E++E ++E   KE   K ++       E+EE++E+E++++EEE  E     V  
Sbjct: 270 GDDAELEDDEPDKEAVKKEADSKPEE-------EDEEDDEQEDDQDEEEPPEAAMDKVKL 322

Query: 307 VDFGSGG 313
            +    G
Sbjct: 323 DEPVLEG 329



 Score = 29.6 bits (66), Expect = 2.4
 Identities = 17/67 (25%), Positives = 27/67 (40%), Gaps = 14/67 (20%)

Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
           +S  D     E E     E E   E+ +  +               E++EE++ EEEE  
Sbjct: 125 ESLEDEMIDDEDEADLFNESESSLEDLSDDET--------------EDDEEKKMEEEEAG 170

Query: 294 EEEEEIG 300
           EE+E + 
Sbjct: 171 EEKESVE 177



 Score = 29.2 bits (65), Expect = 3.1
 Identities = 17/74 (22%), Positives = 36/74 (48%), Gaps = 10/74 (13%)

Query: 236 ENDYCKGGEREVHEEEEEEEEEEE----ETKAKETKLKEKKKLGGGGGEEE------EEE 285
           ++D   G   +  ++EE   E E+    + K K+    +  +L     ++E      + +
Sbjct: 233 KDDEDFGSGEDEEDDEEGNIEYEDFFDPKEKDKKKDAGDDAELEDDEPDKEAVKKEADSK 292

Query: 286 EEEEEEEEEEEEEI 299
            EEE+EE++E+E+ 
Sbjct: 293 PEEEDEEDDEQEDD 306



 Score = 29.2 bits (65), Expect = 3.4
 Identities = 15/55 (27%), Positives = 23/55 (41%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           E +   EEE+EE++E+E    E +  E         E   E  + E  +E    E
Sbjct: 288 EADSKPEEEDEEDDEQEDDQDEEEPPEAAMDKVKLDEPVLEGVDLESPKELSSFE 342



 Score = 28.8 bits (64), Expect = 3.9
 Identities = 13/58 (22%), Positives = 26/58 (44%), Gaps = 1/58 (1%)

Query: 244 EREVHEEEEEEEEEEEETKAK-ETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
           + E HE+  +  + + E  A  E + +E + L     ++E+E +   E E   E+   
Sbjct: 96  DSESHEDGSDGSDMDSEDSADDEEEEEEDESLEDEMIDDEDEADLFNESESSLEDLSD 153



 Score = 28.4 bits (63), Expect = 6.8
 Identities = 15/65 (23%), Positives = 35/65 (53%)

Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
           +  N   +       ++++++  ++ E +  E   +  KK      EEE+EE++E+E+++
Sbjct: 248 EEGNIEYEDFFDPKEKDKKKDAGDDAELEDDEPDKEAVKKEADSKPEEEDEEDDEQEDDQ 307

Query: 294 EEEEE 298
           +EEE 
Sbjct: 308 DEEEP 312



 Score = 28.0 bits (62), Expect = 7.6
 Identities = 12/57 (21%), Positives = 32/57 (56%), Gaps = 5/57 (8%)

Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
             +E  ++++  ++ E E    + +  +K+        + EEE+EE++E+E++++E 
Sbjct: 259 DPKEKDKKKDAGDDAELEDDEPDKEAVKKE-----ADSKPEEEDEEDDEQEDDQDEE 310


>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
           includes archaeal L12p, the protein that is functionally
           equivalent to L7/L12 in bacteria and the P1 and P2
           proteins in eukaryotes. L12p is homologous to P1 and P2
           but is not homologous to bacterial L7/L12. It is located
           in the L12 stalk, with proteins L10, L11, and 23S rRNA.
           L12p is the only protein in the ribosome to occur as
           multimers, always appearing as sets of dimers. Recent
           data indicate that most archaeal species contain six
           copies of L12p (three homodimers), while eukaryotes have
           four copies (two heterodimers), and bacteria may have
           four or six copies (two or three homodimers), depending
           on the species. The organization of proteins within the
           stalk has been characterized primarily in bacteria,
           where L7/L12 forms either two or three homodimers and
           each homodimer binds to the extended C-terminal helix of
           L10. L7/L12 is attached to the ribosome through L10 and
           is the only ribosomal protein that does not directly
           interact with rRNA. Archaeal L12p is believed to
           function in a similar fashion. However, hybrid ribosomes
           containing the large subunit from E. coli with an
           archaeal stalk are able to bind archaeal and eukaryotic
           elongation factors but not bacterial elongation factors.
           In several mesophilic and thermophilic archaeal species,
           the binding of 23S rRNA to protein L11 and to the
           L10/L12p pentameric complex was found to be
           temperature-dependent and cooperative.
          Length = 106

 Score = 34.8 bits (80), Expect = 0.010
 Identities = 14/23 (60%), Positives = 20/23 (86%)

Query: 279 GEEEEEEEEEEEEEEEEEEEIGG 301
            E+EEE+++EEE+EEEEEE + G
Sbjct: 79  EEKEEEKKKEEEKEEEEEEALAG 101



 Score = 32.5 bits (74), Expect = 0.069
 Identities = 13/19 (68%), Positives = 18/19 (94%)

Query: 280 EEEEEEEEEEEEEEEEEEE 298
            EE+EEE+++EEE+EEEEE
Sbjct: 78  AEEKEEEKKKEEEKEEEEE 96



 Score = 30.9 bits (70), Expect = 0.28
 Identities = 12/19 (63%), Positives = 18/19 (94%)

Query: 280 EEEEEEEEEEEEEEEEEEE 298
           + EE+EEE+++EEE+EEEE
Sbjct: 77  KAEEKEEEKKKEEEKEEEE 95



 Score = 30.5 bits (69), Expect = 0.31
 Identities = 12/20 (60%), Positives = 18/20 (90%)

Query: 279 GEEEEEEEEEEEEEEEEEEE 298
            EE+ EE+EEE+++EEE+EE
Sbjct: 74  AEEKAEEKEEEKKKEEEKEE 93



 Score = 30.2 bits (68), Expect = 0.43
 Identities = 12/19 (63%), Positives = 18/19 (94%)

Query: 280 EEEEEEEEEEEEEEEEEEE 298
           E+ EE+EEE+++EEE+EEE
Sbjct: 76  EKAEEKEEEKKKEEEKEEE 94



 Score = 29.4 bits (66), Expect = 0.73
 Identities = 17/39 (43%), Positives = 24/39 (61%), Gaps = 7/39 (17%)

Query: 263 AKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
           A E K +EK+       EE+++EEE+EEEEEE    +G 
Sbjct: 73  AAEEKAEEKE-------EEKKKEEEKEEEEEEALAGLGA 104



 Score = 27.8 bits (62), Expect = 3.1
 Identities = 10/19 (52%), Positives = 15/19 (78%)

Query: 280 EEEEEEEEEEEEEEEEEEE 298
              EE+ EE+EEE+++EEE
Sbjct: 72  AAAEEKAEEKEEEKKKEEE 90



 Score = 26.3 bits (58), Expect = 9.4
 Identities = 9/19 (47%), Positives = 14/19 (73%)

Query: 280 EEEEEEEEEEEEEEEEEEE 298
               EE+ EE+EEE+++EE
Sbjct: 71  AAAAEEKAEEKEEEKKKEE 89



 Score = 26.3 bits (58), Expect = 9.4
 Identities = 9/23 (39%), Positives = 17/23 (73%)

Query: 243 GEREVHEEEEEEEEEEEETKAKE 265
            E +  E+EEE+++EEE+ + +E
Sbjct: 74  AEEKAEEKEEEKKKEEEKEEEEE 96


>gnl|CDD|184885 PRK14891, PRK14891, 50S ribosomal protein L24e/unknown domain
           fusion protein; Provisional.
          Length = 131

 Score = 35.3 bits (81), Expect = 0.011
 Identities = 17/54 (31%), Positives = 27/54 (50%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
           E E  + +E+ +E  E   A E   +E+         +E + E EE +EEE+EE
Sbjct: 75  EAEAADADEDADEAAEADAADEADEEEETDEAVDETADEADAEAEEADEEEDEE 128



 Score = 34.9 bits (80), Expect = 0.013
 Identities = 13/58 (22%), Positives = 27/58 (46%)

Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
                E  E  + +E+ +E  +A      ++++      +E  +E + E EE +EEE+
Sbjct: 69  AAEAAEEAEAADADEDADEAAEADAADEADEEEETDEAVDETADEADAEAEEADEEED 126



 Score = 34.9 bits (80), Expect = 0.015
 Identities = 14/53 (26%), Positives = 26/53 (49%)

Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
               +E+ +E  E +   +  + +E  +      +E + E EE +EEE+EE E
Sbjct: 78  AADADEDADEAAEADAADEADEEEETDEAVDETADEADAEAEEADEEEDEEAE 130



 Score = 34.9 bits (80), Expect = 0.016
 Identities = 15/56 (26%), Positives = 27/56 (48%)

Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
            E     + +E+ +E  E  A +   +E++         +E + E EE +EEE+EE
Sbjct: 73  AEEAEAADADEDADEAAEADAADEADEEEETDEAVDETADEADAEAEEADEEEDEE 128



 Score = 32.2 bits (73), Expect = 0.13
 Identities = 14/52 (26%), Positives = 25/52 (48%)

Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
           +  E+ +E  E +   +A E +  ++         + E EE +EEE+EE E 
Sbjct: 80  DADEDADEAAEADAADEADEEEETDEAVDETADEADAEAEEADEEEDEEAEA 131



 Score = 31.5 bits (71), Expect = 0.21
 Identities = 17/65 (26%), Positives = 26/65 (40%), Gaps = 8/65 (12%)

Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
           ++E         E  E +  +E +EEE   +               E + E EE +EEE+
Sbjct: 75  EAEAADADEDADEAAEADAADEADEEEETDEAVDETAD--------EADAEAEEADEEED 126

Query: 294 EEEEE 298
           EE E 
Sbjct: 127 EEAEA 131



 Score = 28.8 bits (64), Expect = 1.9
 Identities = 16/69 (23%), Positives = 21/69 (30%), Gaps = 14/69 (20%)

Query: 242 GGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
            G         E  EE E   A E              + +E  E +  +E +EEEE   
Sbjct: 60  KGPAAAATAAAEAAEEAEAADADE--------------DADEAAEADAADEADEEEETDE 105

Query: 302 VVVSAVDFG 310
            V    D  
Sbjct: 106 AVDETADEA 114



 Score = 28.0 bits (62), Expect = 3.6
 Identities = 11/48 (22%), Positives = 18/48 (37%)

Query: 251 EEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
            E+         A+  +  E         E  E +  +E +EEEE +E
Sbjct: 58  AEKGPAAAATAAAEAAEEAEAADADEDADEAAEADAADEADEEEETDE 105



 Score = 28.0 bits (62), Expect = 3.8
 Identities = 14/66 (21%), Positives = 25/66 (37%)

Query: 236 ENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
           E +Y  G E    E  E    E+    A     +  ++      +E+ +E  E +  +E 
Sbjct: 38  EKNYDLGREARDLEWTEAGRAEKGPAAAATAAAEAAEEAEAADADEDADEAAEADAADEA 97

Query: 296 EEEIGG 301
           +EE   
Sbjct: 98  DEEEET 103


>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
          Length = 1066

 Score = 37.2 bits (86), Expect = 0.012
 Identities = 15/59 (25%), Positives = 33/59 (55%), Gaps = 4/59 (6%)

Query: 244 EREVHEEEEEEEEEEEETKAKE----TKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           ER+  +EE+ +E+E ++ KA +     KL+ ++   G    ++ E++  + + E+E  E
Sbjct: 19  ERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKKSRKRDVEDENPE 77


>gnl|CDD|227492 COG5163, NOP7, Protein required for biogenesis of the 60S ribosomal
           subunit [Translation, ribosomal structure and
           biogenesis].
          Length = 591

 Score = 37.0 bits (85), Expect = 0.013
 Identities = 19/93 (20%), Positives = 34/93 (36%), Gaps = 13/93 (13%)

Query: 226 DGYNKTIFQSENDYCKG-------------GEREVHEEEEEEEEEEEETKAKETKLKEKK 272
           D  NK       +YC G                +        EE +  ++       E  
Sbjct: 424 DSINKGKLACVENYCVGKRLPPHLSPFASVDSYDPRASLMTMEETQRHSEEDLVNRFEDV 483

Query: 273 KLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVS 305
           +     GEE+++++EE + ++E E E  G+  S
Sbjct: 484 RYEHVAGEEDDDDDEELQAQKELELEAQGIKYS 516



 Score = 33.9 bits (77), Expect = 0.12
 Identities = 15/65 (23%), Positives = 33/65 (50%)

Query: 232 IFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEE 291
            F+        GE +  ++EE + ++E E +A+  K  E  +      + + ++ + +EE
Sbjct: 479 RFEDVRYEHVAGEEDDDDDEELQAQKELELEAQGIKYSETSEADKDVNKSKNKKRKVDEE 538

Query: 292 EEEEE 296
           EEE++
Sbjct: 539 EEEKK 543


>gnl|CDD|220377 pfam09747, DUF2052, Coiled-coil domain containing protein
           (DUF2052).  This entry is of sequences of two conserved
           domains separated by a region of low complexity,
           spanning some 200 residues. The function is unknown.
          Length = 178

 Score = 35.9 bits (83), Expect = 0.013
 Identities = 21/55 (38%), Positives = 31/55 (56%)

Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
           E  EEEE  EEEEE+ + +  + +E+ + G  G EEEE+ +  +E    EE E  
Sbjct: 75  EQQEEEESAEEEEEDPEEENEEEEEEYQRGPFGEEEEEDGDSYDELPTPEEREEL 129



 Score = 30.1 bits (68), Expect = 1.1
 Identities = 17/66 (25%), Positives = 27/66 (40%), Gaps = 11/66 (16%)

Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
           ++  +  +  E E   EEEEE+ EEE  + +E                   EEEEE+ + 
Sbjct: 68  EARLELLEQQEEEESAEEEEEDPEEENEEEEEEY-----------QRGPFGEEEEEDGDS 116

Query: 294 EEEEEI 299
            +E   
Sbjct: 117 YDELPT 122



 Score = 28.5 bits (64), Expect = 3.7
 Identities = 21/66 (31%), Positives = 30/66 (45%), Gaps = 11/66 (16%)

Query: 244 EREVHEEEEEEEEEEEET--------KAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
           ERE     +   E             +A+   L+++++       EEEEE+ EEE EEEE
Sbjct: 42  EREARGRAKRWSEGLSGVLESSLDREEARLELLEQQEEEES---AEEEEEDPEEENEEEE 98

Query: 296 EEEIGG 301
           EE   G
Sbjct: 99  EEYQRG 104



 Score = 28.2 bits (63), Expect = 4.5
 Identities = 14/46 (30%), Positives = 21/46 (45%)

Query: 254 EEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           + EE      ++ + +E  +      EEE EEEEEE +     EE 
Sbjct: 65  DREEARLELLEQQEEEESAEEEEEDPEEENEEEEEEYQRGPFGEEE 110



 Score = 27.8 bits (62), Expect = 6.5
 Identities = 17/63 (26%), Positives = 28/63 (44%), Gaps = 6/63 (9%)

Query: 236 ENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
           E+   +   R    E++EEEE  EE +    +  E++       EE +     EEEEE+ 
Sbjct: 61  ESSLDREEARLELLEQQEEEESAEEEEEDPEEENEEE------EEEYQRGPFGEEEEEDG 114

Query: 296 EEE 298
           +  
Sbjct: 115 DSY 117


>gnl|CDD|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region.  The domain
           is found in the primary vegetative sigma factor. The
           function of this domain is unclear and can be removed
           without loss of function.
          Length = 211

 Score = 36.0 bits (84), Expect = 0.014
 Identities = 10/55 (18%), Positives = 23/55 (41%), Gaps = 14/55 (25%)

Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
                      E E  +                 E++++++E+E+E++EEE ++G
Sbjct: 34  AAAAAATAAAIESELDEEDL--------------EDDDDDDEDEDEDDEEEADLG 74



 Score = 34.5 bits (80), Expect = 0.042
 Identities = 10/57 (17%), Positives = 26/57 (45%), Gaps = 9/57 (15%)

Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           G  + +           E++  E  L++         +++++E+E+E++EEE +   
Sbjct: 28  GFIDPNAAAAAATAAAIESELDEEDLED---------DDDDDEDEDEDDEEEADLGP 75



 Score = 29.8 bits (68), Expect = 1.6
 Identities = 13/55 (23%), Positives = 23/55 (41%), Gaps = 14/55 (25%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
                 E E +EE+ E+    +              E+E+E++EEE +   + EE
Sbjct: 39  ATAAAIESELDEEDLEDDDDDD--------------EDEDEDDEEEADLGPDPEE 79



 Score = 29.5 bits (67), Expect = 1.7
 Identities = 10/51 (19%), Positives = 20/51 (39%), Gaps = 9/51 (17%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
                  E E + +  E    +         +E+E+E++EEE +   + E 
Sbjct: 38  AATAAAIESELDEEDLEDDDDD---------DEDEDEDDEEEADLGPDPEE 79


>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355).  This
           family of proteins is found in bacteria and viruses.
           Proteins in this family are typically between 180 and
           214 amino acids in length.
          Length = 125

 Score = 34.9 bits (81), Expect = 0.014
 Identities = 23/72 (31%), Positives = 33/72 (45%), Gaps = 8/72 (11%)

Query: 228 YNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEE 287
             KT    E D     E+   E+++EE++ E E  AK +  +EK        E E E+ E
Sbjct: 4   EEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSA-EEKA-------EYELEKLE 55

Query: 288 EEEEEEEEEEEI 299
           +E EE E E   
Sbjct: 56  KELEELEAELAR 67



 Score = 31.5 bits (72), Expect = 0.19
 Identities = 17/55 (30%), Positives = 26/55 (47%), Gaps = 5/55 (9%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
             +  EE++ E E+  +  A+E    E +KL     E+E EE E E    E + E
Sbjct: 24  WEKKQEEKKSEAEKLAKMSAEEKAEYELEKL-----EKELEELEAELARRELKAE 73


>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein.  The YqfQ-like protein family
           includes the B. subtilis YqfQ protein, also known as
           VrrA, which is functionally uncharacterized. This family
           of proteins is found in bacteria. Proteins in this
           family are typically between 146 and 237 amino acids in
           length. There are two conserved sequence motifs: QYGP
           and PKLY.
          Length = 155

 Score = 35.5 bits (82), Expect = 0.014
 Identities = 14/51 (27%), Positives = 25/51 (49%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
            ++EEEE EEE T   E +   + K      ++ E  + + E+E+ + E  
Sbjct: 94  SDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPK 144



 Score = 30.9 bits (70), Expect = 0.47
 Identities = 14/54 (25%), Positives = 24/54 (44%)

Query: 235 SENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEE 288
            E       E E  EE  +E E+E+  + K    ++KK+       E+E+ + E
Sbjct: 89  RELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTE 142



 Score = 29.3 bits (66), Expect = 1.5
 Identities = 11/27 (40%), Positives = 16/27 (59%)

Query: 272 KKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           ++L     EEEE EEE  +E E+E+  
Sbjct: 89  RELSSSDDEEEETEEESTDETEQEDPP 115


>gnl|CDD|191634 pfam06886, TPX2, Targeting protein for Xklp2 (TPX2).  This family
           represents a conserved region approximately 60 residues
           long within the eukaryotic targeting protein for Xklp2
           (TPX2). Xklp2 is a kinesin-like protein localised on
           centrosomes throughout the cell cycle and on spindle
           pole microtubules during metaphase. In Xenopus, it has
           been shown that Xklp2 protein is required for centrosome
           separation and maintenance of spindle bi-polarity. TPX2
           is a microtubule-associated protein that mediates the
           binding of the C-terminal domain of Xklp2 to
           microtubules. It is phosphorylated during mitosis in a
           microtubule-dependent way.
          Length = 57

 Score = 33.1 bits (76), Expect = 0.015
 Identities = 18/45 (40%), Positives = 26/45 (57%), Gaps = 5/45 (11%)

Query: 255 EEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           +E  E+    + KL+EK+K       E E+EE E  ++EEEEE I
Sbjct: 4   DERAEKRAEFDKKLEEKEKA-----LEAEKEEAEARQKEEEEEAI 43


>gnl|CDD|220972 pfam11081, DUF2890, Protein of unknown function (DUF2890).  This
           family is conserved in dsDNA adenoviruses of
           vertebrates. The function is not known.
          Length = 172

 Score = 35.3 bits (81), Expect = 0.017
 Identities = 16/56 (28%), Positives = 28/56 (50%), Gaps = 9/56 (16%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
                +EEE+ + + EE +  E ++++          E+  +EE+EE EE EEE  
Sbjct: 14  PPPTKDEEEDWDSQAEEVEEDEEEMED---------WEDSLDEEDEEAEEVEEETA 60



 Score = 31.0 bits (70), Expect = 0.42
 Identities = 19/65 (29%), Positives = 32/65 (49%), Gaps = 17/65 (26%)

Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEE-------EEEEEEE 293
           KG  +++       ++EEE+  ++  +++E          +EEE E       EE+EE E
Sbjct: 4   KGNAKKLKVRPPPTKDEEEDWDSQAEEVEE----------DEEEMEDWEDSLDEEDEEAE 53

Query: 294 EEEEE 298
           E EEE
Sbjct: 54  EVEEE 58



 Score = 28.3 bits (63), Expect = 3.7
 Identities = 11/17 (64%), Positives = 12/17 (70%)

Query: 249 EEEEEEEEEEEETKAKE 265
           EE+EE EE EEET A  
Sbjct: 47  EEDEEAEEVEEETAASS 63


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 36.5 bits (84), Expect = 0.019
 Identities = 13/54 (24%), Positives = 26/54 (48%)

Query: 244  EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
            E   +EE+   + + +E++A+E +             E E  + EEE ++ +EE
Sbjct: 3838 EELANEEDTANQSDLDESEARELESDMNGVTKDSVVSENENSDSEEENQDLDEE 3891



 Score = 31.5 bits (71), Expect = 0.73
 Identities = 16/63 (25%), Positives = 28/63 (44%), Gaps = 9/63 (14%)

Query: 243  GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE---EEEEEI 299
             E     +E+ ++++  +    + K+ E       G EE  +E EE  E+    +EE E 
Sbjct: 4037 LEENNTLDEDIQQDDFSDLAEDDEKMNED------GFEENVQENEESTEDGVKSDEELEQ 4090

Query: 300  GGV 302
            G V
Sbjct: 4091 GEV 4093



 Score = 30.7 bits (69), Expect = 1.2
 Identities = 14/60 (23%), Positives = 28/60 (46%), Gaps = 5/60 (8%)

Query: 245  REVHEEEEEEEEEEEETKAKETKLKEKKKL---GGGGGEEEEE--EEEEEEEEEEEEEEI 299
              ++E+++ EE   EE  A ++ L E +         G  ++    E E  + EEE +++
Sbjct: 3829 NTLNEDDDLEELANEEDTANQSDLDESEARELESDMNGVTKDSVVSENENSDSEEENQDL 3888



 Score = 30.7 bits (69), Expect = 1.4
 Identities = 12/48 (25%), Positives = 25/48 (52%)

Query: 249  EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
            E +EE + E++E    E  L+E   L     +++  +  E++E+  E+
Sbjct: 4018 ENKEEADAEKDEPMQDEDPLEENNTLDEDIQQDDFSDLAEDDEKMNED 4065



 Score = 29.6 bits (66), Expect = 3.3
 Identities = 16/86 (18%), Positives = 37/86 (43%), Gaps = 13/86 (15%)

Query: 221  PVNRPDGYNKTIFQSEN-DYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGG 279
            P +  +  N+ ++   N +     E++ +E+     E +  +K  + K  E         
Sbjct: 3896 PEDLSNSLNEKLWDEPNEEDLLETEQKSNEQSAANNESDLVSKEDDNKALED-------K 3948

Query: 280  EEEEEEEEEEEE-----EEEEEEEIG 300
            + +E+E+EEE       ++E + +I 
Sbjct: 3949 DRQEKEDEEEMSDDVGIDDEIQPDIQ 3974



 Score = 29.2 bits (65), Expect = 4.4
 Identities = 21/101 (20%), Positives = 39/101 (38%), Gaps = 8/101 (7%)

Query: 218  DLDPVNRPDGYNKTIFQSEN-DYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGG 276
            D DP+   +  ++ I Q +  D  +  E+   +  EE  +E EE+     K  E+ + G 
Sbjct: 4033 DEDPLEENNTLDEDIQQDDFSDLAEDDEKMNEDGFEENVQENEESTEDGVKSDEELEQGE 4092

Query: 277  -------GGGEEEEEEEEEEEEEEEEEEEIGGVVVSAVDFG 310
                       + + +      E +EE    G+V    + G
Sbjct: 4093 VPEDQAIDNHPKMDAKSTFASAEADEENTDKGIVGENEELG 4133



 Score = 28.4 bits (63), Expect = 6.3
 Identities = 13/50 (26%), Positives = 23/50 (46%)

Query: 251  EEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
            ++E+  EE  T  ++ +  +   L     +  E+  EE  +E EE  E G
Sbjct: 4032 QDEDPLEENNTLDEDIQQDDFSDLAEDDEKMNEDGFEENVQENEESTEDG 4081



 Score = 28.0 bits (62), Expect = 8.1
 Identities = 18/60 (30%), Positives = 28/60 (46%), Gaps = 2/60 (3%)

Query: 243  GEREVHEEEEEEEEEEEETKAKETKLKEK--KKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
            G   V E+ EEE+E+++E   +E    +    K      ++     E E EEE+ E E G
Sbjct: 2921 GNINVSEQIEEEDEKDDEYGEQEKVSDKNDYLKDFIDYDDDGSISTESEPEEEKRECESG 2980


>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT.  This family
           consists of several bacterial cobalamin biosynthesis
           (CobT) proteins. CobT is involved in the transformation
           of precorrin-3 into cobyrinic acid.
          Length = 282

 Score = 35.9 bits (83), Expect = 0.020
 Identities = 18/65 (27%), Positives = 24/65 (36%), Gaps = 5/65 (7%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEE-----EEEEEEEEEEEEEEEEIGGVV 303
           E  +E E  + E    E   KE +    G  EE        E+ +   EE E  E+    
Sbjct: 211 ELGDEPESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEMEAAE 270

Query: 304 VSAVD 308
            SA D
Sbjct: 271 ASADD 275



 Score = 35.6 bits (82), Expect = 0.024
 Identities = 12/55 (21%), Positives = 26/55 (47%)

Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
           G+     + E+ E+E++  + ++    E+++ G      E+ +   EE E  E E
Sbjct: 213 GDEPESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEME 267



 Score = 29.4 bits (66), Expect = 2.2
 Identities = 11/49 (22%), Positives = 19/49 (38%)

Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           +  EE  +E E+   E    E         ++ EEEE    +   E+ +
Sbjct: 207 DMAEELGDEPESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSD 255


>gnl|CDD|224530 COG1614, CdhC, CO dehydrogenase/acetyl-CoA synthase beta subunit
           [Energy production and conversion].
          Length = 470

 Score = 36.3 bits (84), Expect = 0.020
 Identities = 22/47 (46%), Positives = 26/47 (55%), Gaps = 4/47 (8%)

Query: 257 EEEETKAKETK--LKEKKK--LGGGGGEEEEEEEEEEEEEEEEEEEI 299
           EE+ T   E +  LKEK    +     EEEEEEEEEEEE  E E  +
Sbjct: 377 EEDATTIDELREFLKEKGHPVVERWAEEEEEEEEEEEEEAAEAEAPM 423



 Score = 33.6 bits (77), Expect = 0.12
 Identities = 15/34 (44%), Positives = 18/34 (52%)

Query: 280 EEEEEEEEEEEEEEEEEEEIGGVVVSAVDFGSGG 313
           EEEEEEEE  E E   EE + G  V  +   + G
Sbjct: 408 EEEEEEEEAAEAEAPMEEPVPGFEVPEMPMPAAG 441



 Score = 30.2 bits (68), Expect = 1.4
 Identities = 16/29 (55%), Positives = 18/29 (62%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKK 272
           ER   EEEEEEEEEEEE    E  ++E  
Sbjct: 399 ERWAEEEEEEEEEEEEEAAEAEAPMEEPV 427



 Score = 29.4 bits (66), Expect = 2.5
 Identities = 12/30 (40%), Positives = 15/30 (50%)

Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKE 270
           +    E  EEEEEEEEE  E +A   +   
Sbjct: 399 ERWAEEEEEEEEEEEEEAAEAEAPMEEPVP 428



 Score = 28.2 bits (63), Expect = 5.8
 Identities = 21/63 (33%), Positives = 24/63 (38%), Gaps = 19/63 (30%)

Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVSAVDF 309
            EEEEEEEEEE                   EEE  E E   EE     E+  + + A   
Sbjct: 402 AEEEEEEEEEE-------------------EEEAAEAEAPMEEPVPGFEVPEMPMPAAGG 442

Query: 310 GSG 312
           G G
Sbjct: 443 GGG 445


>gnl|CDD|237622 PRK14140, PRK14140, heat shock protein GrpE; Provisional.
          Length = 191

 Score = 35.4 bits (82), Expect = 0.020
 Identities = 19/56 (33%), Positives = 25/56 (44%), Gaps = 13/56 (23%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           + E  EEE EE E EE  + +               EE  EEE E E  +EE+ +I
Sbjct: 4   KNEQVEEEVEETEVEEAVEDEVE-------------EETVEEESEAELLDEEQAKI 46


>gnl|CDD|205206 pfam13025, DUF3886, Protein of unknown function (DUF3886).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are approximately 90 amino acids in length.
           There are two completely conserved L residues that may
           be functionally important.
          Length = 70

 Score = 33.1 bits (76), Expect = 0.020
 Identities = 15/30 (50%), Positives = 21/30 (70%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKK 273
           ++E+  EEE+ EEEEE  K +E K +EK K
Sbjct: 26  KKELKAEEEKREEEEEARKREERKEREKNK 55



 Score = 31.5 bits (72), Expect = 0.079
 Identities = 17/32 (53%), Positives = 20/32 (62%), Gaps = 6/32 (18%)

Query: 267 KLKEKKK-LGGGGGEEEEEEEEEEEEEEEEEE 297
           KLK KKK L     + EEE+ EEEEE  + EE
Sbjct: 21  KLKAKKKEL-----KAEEEKREEEEEARKREE 47



 Score = 28.8 bits (65), Expect = 0.70
 Identities = 15/36 (41%), Positives = 21/36 (58%), Gaps = 6/36 (16%)

Query: 262 KAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
           K KE K +E+K+      EEEEE  + EE +E E+ 
Sbjct: 25  KKKELKAEEEKR------EEEEEARKREERKEREKN 54


>gnl|CDD|218771 pfam05835, Synaphin, Synaphin protein.  This family consists of
           several eukaryotic synaphin 1 and 2 proteins.
           Synaphin/complexin is a cytosolic protein that
           preferentially binds to syntaxin within the SNARE
           complex. Synaphin promotes SNAREs to form precomplexes
           that oligomerise into higher order structures. A peptide
           from the central, syntaxin binding domain of synaphin
           competitively inhibits these two proteins from
           interacting and prevents SNARE complexes from
           oligomerising. It is thought that oligomerisation of
           SNARE complexes into a higher order structure creates a
           SNARE scaffold for efficient, regulated fusion of
           synaptic vesicles. Synaphin promotes neuronal exocytosis
           by promoting interaction between the complementary
           syntaxin and synaptobrevin transmembrane regions that
           reside in opposing membranes prior to fusion.
          Length = 139

 Score = 34.5 bits (79), Expect = 0.023
 Identities = 26/72 (36%), Positives = 41/72 (56%), Gaps = 11/72 (15%)

Query: 241 KGGEREVHEEEEEEE----EEEEETKAKETKLKEKKKLGGGG-------GEEEEEEEEEE 289
           +G E +  EE+EE +    E EEE KAK  K++E++++   G        ++EE+EEE +
Sbjct: 26  EGDESDAEEEDEEIQEALREAEEERKAKHRKMEEEREVMRQGIRDKYGIKKKEEDEEEPQ 85

Query: 290 EEEEEEEEEIGG 301
              EEEE  +G 
Sbjct: 86  AAAEEEEGRLGR 97



 Score = 30.6 bits (69), Expect = 0.48
 Identities = 13/30 (43%), Positives = 16/30 (53%)

Query: 268 LKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
           L  K+  G     EEE+EE +E   E EEE
Sbjct: 20  LGGKEDEGDESDAEEEDEEIQEALREAEEE 49



 Score = 29.5 bits (66), Expect = 1.1
 Identities = 18/42 (42%), Positives = 25/42 (59%)

Query: 251 EEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEE 292
           EE+EEE +   + +E +L  KKK       E  EE+EEEEE+
Sbjct: 78  EEDEEEPQAAAEEEEGRLGRKKKTPEELAAEAGEEDEEEEEK 119



 Score = 29.5 bits (66), Expect = 1.1
 Identities = 13/29 (44%), Positives = 20/29 (68%)

Query: 268 LKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
           LK+ K + GG  +E +E + EEE+EE +E
Sbjct: 13  LKDVKGMLGGKEDEGDESDAEEEDEEIQE 41



 Score = 27.9 bits (62), Expect = 3.5
 Identities = 10/29 (34%), Positives = 16/29 (55%)

Query: 270 EKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           ++ +      EEE+EE +E   E EEE +
Sbjct: 23  KEDEGDESDAEEEDEEIQEALREAEEERK 51



 Score = 27.5 bits (61), Expect = 5.4
 Identities = 11/30 (36%), Positives = 15/30 (50%)

Query: 269 KEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           KE +       EE+EE +E   E EEE + 
Sbjct: 23  KEDEGDESDAEEEDEEIQEALREAEEERKA 52


>gnl|CDD|217502 pfam03343, SART-1, SART-1 family.  SART-1 is a protein involved in
           cell cycle arrest and pre-mRNA splicing. It has been
           shown to be a component of U4/U6 x U5 tri-snRNP complex
           in human, Schizosaccharomyces pombe and Saccharomyces
           cerevisiae. SART-1 is a known tumour antigen in a range
           of cancers recognised by T cells.
          Length = 603

 Score = 35.9 bits (83), Expect = 0.024
 Identities = 18/77 (23%), Positives = 38/77 (49%), Gaps = 2/77 (2%)

Query: 238 DYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
           ++ +  ++E  EE+ E ++E  E  +   +  E ++   G G+ E    + +EE+EEE++
Sbjct: 398 EFVRSLQKEPLEEKPENKDESVEEISDAEEDDEDEEDEDGDGDVEMSAVDNDEEKEEEDK 457

Query: 298 EIG--GVVVSAVDFGSG 312
           E     ++      G G
Sbjct: 458 EAIPSTILEEEPTVGGG 474



 Score = 33.6 bits (77), Expect = 0.14
 Identities = 16/55 (29%), Positives = 33/55 (60%), Gaps = 2/55 (3%)

Query: 253 EEEEEEEETKA--KETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVS 305
           E+++++++TKA  K++K ++KKK          +E+E+E   E   E++ G+ V 
Sbjct: 83  EDDDDDDDTKAWLKKSKKRQKKKEAERKKALLLDEKEKERAAEYTSEDLAGLKVG 137



 Score = 33.6 bits (77), Expect = 0.15
 Identities = 15/57 (26%), Positives = 27/57 (47%)

Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVSA 306
            E++++ +    K +    K++KKL       +  EE  E+EEE  E    G+V+  
Sbjct: 339 VEDDDDLQASLAKQRRLAQKKRKKLRPEDIARQIAEERSEDEEEAAENNDNGIVIDE 395



 Score = 32.4 bits (74), Expect = 0.32
 Identities = 18/66 (27%), Positives = 29/66 (43%)

Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
           Q E    K   ++   EE  + EE++E +  E    + +       EE+EEE++E     
Sbjct: 404 QKEPLEEKPENKDESVEEISDAEEDDEDEEDEDGDGDVEMSAVDNDEEKEEEDKEAIPST 463

Query: 294 EEEEEI 299
             EEE 
Sbjct: 464 ILEEEP 469



 Score = 29.3 bits (66), Expect = 2.9
 Identities = 14/45 (31%), Positives = 25/45 (55%), Gaps = 4/45 (8%)

Query: 248 HEEEEEEEEEEEETKAKETKLKEK----KKLGGGGGEEEEEEEEE 288
            +EE E + + EE + K  K +EK     KLGG     E++++++
Sbjct: 45  RQEEAEAKRKREELREKIAKAREKRERNSKLGGIKTLGEDDDDDD 89



 Score = 28.6 bits (64), Expect = 5.9
 Identities = 16/54 (29%), Positives = 25/54 (46%)

Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           R++ EE  E+EEE  E       + E  +      +E  EE+ E ++E  EE  
Sbjct: 370 RQIAEERSEDEEEAAENNDNGIVIDETSEFVRSLQKEPLEEKPENKDESVEEIS 423


>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex.  This
           entry is characterized by proteins with alternating
           conserved and low-complexity regions. Bud13 together
           with Snu17p and a newly identified factor,
           Pml1p/Ylr016c, form a novel trimeric complex. called The
           RES complex, pre-mRNA retention and splicing complex.
           Subunits of this complex are not essential for viability
           of yeasts but they are required for efficient splicing
           in vitro and in vivo. Furthermore, inactivation of this
           complex causes pre-mRNA leakage from the nucleus. Bud13
           contains a unique, phylogenetically conserved C-terminal
           region of unknown function.
          Length = 141

 Score = 34.2 bits (79), Expect = 0.026
 Identities = 18/47 (38%), Positives = 31/47 (65%)

Query: 251 EEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
           EE+ EE+E E + KE K +++K+ G G  ++EE E+  EE E+ + +
Sbjct: 15  EEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNK 61



 Score = 33.8 bits (78), Expect = 0.035
 Identities = 19/50 (38%), Positives = 30/50 (60%), Gaps = 4/50 (8%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           EE+ EE+E E+E K +    KE+K+   G G  ++EE E+  EE E+ + 
Sbjct: 15  EEKREEKEREKEEKER----KEEKEKEWGKGLVQKEEREKRLEELEKAKN 60



 Score = 30.0 bits (68), Expect = 0.76
 Identities = 14/25 (56%), Positives = 21/25 (84%), Gaps = 1/25 (4%)

Query: 280 EEEEEEEEEEEEEEEEEEEIG-GVV 303
           EE+E E+EE+E +EE+E+E G G+V
Sbjct: 19  EEKEREKEEKERKEEKEKEWGKGLV 43



 Score = 28.4 bits (64), Expect = 2.9
 Identities = 11/20 (55%), Positives = 16/20 (80%)

Query: 280 EEEEEEEEEEEEEEEEEEEI 299
           EE+ EE+E E+EE+E +EE 
Sbjct: 15  EEKREEKEREKEEKERKEEK 34



 Score = 27.3 bits (61), Expect = 7.3
 Identities = 10/20 (50%), Positives = 16/20 (80%)

Query: 280 EEEEEEEEEEEEEEEEEEEI 299
           E+ EE+E E+EE+E +EE+ 
Sbjct: 16  EKREEKEREKEEKERKEEKE 35


>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
           chain; Provisional.
          Length = 1033

 Score = 35.9 bits (83), Expect = 0.030
 Identities = 17/57 (29%), Positives = 29/57 (50%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
           E +V+ +  EEE+EEE      +   +         +E+E+EE++EE E   + EI 
Sbjct: 2   EEQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKAEIS 58



 Score = 35.5 bits (82), Expect = 0.039
 Identities = 18/64 (28%), Positives = 31/64 (48%), Gaps = 7/64 (10%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVV 303
           E E    +  EEE+EEE +A            G   +++E   E+E+E+EE++EE     
Sbjct: 1   EEEQVNTQANEEEDEEELEAVARS-------AGSDSDDDEVPAEDEDEDEEDDEEAESPA 53

Query: 304 VSAV 307
            + +
Sbjct: 54  KAEI 57



 Score = 34.8 bits (80), Expect = 0.060
 Identities = 17/41 (41%), Positives = 23/41 (56%), Gaps = 2/41 (4%)

Query: 263 AKETKLKEKKKLGGGGGEEEEEEEEEEEEE--EEEEEEIGG 301
           AK  +    KK  G G    +  EEEE+EE  +EEE+ +GG
Sbjct: 113 AKGDQSASAKKAKGRGRHASKLTEEEEDEEYLKEEEDGLGG 153



 Score = 31.3 bits (71), Expect = 0.82
 Identities = 14/34 (41%), Positives = 18/34 (52%)

Query: 269 KEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGV 302
             KK  G G    +  EEEE+EE  +EEE+  G 
Sbjct: 120 SAKKAKGRGRHASKLTEEEEDEEYLKEEEDGLGG 153



 Score = 29.4 bits (66), Expect = 3.0
 Identities = 11/29 (37%), Positives = 15/29 (51%)

Query: 280 EEEEEEEEEEEEEEEEEEEIGGVVVSAVD 308
           EE+   +  EEE+EEE E +     S  D
Sbjct: 2   EEQVNTQANEEEDEEELEAVARSAGSDSD 30


>gnl|CDD|222648 pfam14283, DUF4366, Domain of unknown function (DUF4366).  This
           family of proteins is found in bacteria and eukaryotes.
           Proteins in this family are typically between 227 and
           387 amino acids in length.
          Length = 213

 Score = 35.0 bits (81), Expect = 0.031
 Identities = 17/71 (23%), Positives = 26/71 (36%), Gaps = 10/71 (14%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG---VVVS 305
           EEE+               + E           E E E E EEE E++  +G    V+  
Sbjct: 115 EEEDAAAPTCSCCPVCSVNMTECT-------GPEPEPEPEPEEEPEKKSGMGPLLLVLAV 167

Query: 306 AVDFGSGGWWF 316
           A+  G   ++F
Sbjct: 168 ALIGGGAYYYF 178



 Score = 32.3 bits (74), Expect = 0.19
 Identities = 16/83 (19%), Positives = 28/83 (33%), Gaps = 30/83 (36%)

Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKK-------------LGGGGG------------- 279
            V      E    E     E + + +KK             L GGG              
Sbjct: 128 PVCSVNMTECTGPEPEPEPEPEEEPEKKSGMGPLLLVLAVALIGGGAYYYFKFYKPKQQE 187

Query: 280 ----EEEEEEEEEEEEEEEEEEE 298
               +++ +E +  +E+EEE++E
Sbjct: 188 KGAPDDDLDEYDYGDEDEEEDDE 210


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 35.8 bits (82), Expect = 0.033
 Identities = 18/44 (40%), Positives = 25/44 (56%), Gaps = 6/44 (13%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEE 292
           EE  E+ + E E KA+E + +EK+K      E+E E E E E E
Sbjct: 582 EEAVEKAKREAEQKAREEREREKEK------EKEREREREREAE 619


>gnl|CDD|221490 pfam12253, CAF1A, Chromatin assembly factor 1 subunit A.  The CAF-1
           or chromatin assembly factor-1 consists of three
           subunits, and this is the first, or A. The A domain is
           uniquely required for the progression of S phase in
           mouse cells, independent of its ability to promote
           histone deposition but dependent on its ability to
           interact with HP1 - heterochromatin protein 1-rich
           heterochromatin domains next to centromeres that are
           crucial for chromosome segregation during mitosis. This
           HP1-CAF-1 interaction module functions as a built-in
           replication control for heterochromatin, which, like a
           control barrier, has an impact on S-phase progression in
           addition to DNA-based checkpoints.
          Length = 76

 Score = 32.6 bits (75), Expect = 0.034
 Identities = 15/53 (28%), Positives = 25/53 (47%), Gaps = 20/53 (37%)

Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           +   + E EEEEE                    GE+ E E+EE+EEE+++++ 
Sbjct: 41  DYDSDAEWEEEEE--------------------GEDLESEDEEDEEEDDDDDM 73



 Score = 28.0 bits (63), Expect = 1.8
 Identities = 13/46 (28%), Positives = 24/46 (52%), Gaps = 16/46 (34%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
           + E EEEEE E+ +                 E+EE+EEE+++++ +
Sbjct: 45  DAEWEEEEEGEDLE----------------SEDEEDEEEDDDDDMD 74


>gnl|CDD|146016 pfam03179, V-ATPase_G, Vacuolar (H+)-ATPase G subunit.  This family
           represents the eukaryotic vacuolar (H+)-ATPase
           (V-ATPase) G subunit. V-ATPases generate an acidic
           environment in several intracellular compartments.
           Correspondingly, they are found as membrane-attached
           proteins in several organelles. They are also found in
           the plasma membranes of some specialised cells.
           V-ATPases consist of peripheral (V1) and membrane
           integral (V0) heteromultimeric complexes. The G subunit
           is part of the V1 subunit, but is also thought to be
           strongly attached to the V0 complex. It may be involved
           in the coupling of ATP degradation to H+ translocation.
          Length = 105

 Score = 33.3 bits (77), Expect = 0.034
 Identities = 17/50 (34%), Positives = 27/50 (54%)

Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
           ++  EE E+E EE    +  E K  E +  G  G  E++ E+E EE+ +E
Sbjct: 32  KQAKEEAEKEIEEYRAQREAEFKEFEAEHSGSRGELEKKIEKETEEKIDE 81



 Score = 26.8 bits (60), Expect = 7.4
 Identities = 16/50 (32%), Positives = 22/50 (44%), Gaps = 9/50 (18%)

Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           E+E  E   E  K +  +LK+ K         EE E+E EE   + E E 
Sbjct: 13  EKEAAEIVNEARKRRAKRLKQAK---------EEAEKEIEEYRAQREAEF 53


>gnl|CDD|221821 pfam12871, PRP38_assoc, Pre-mRNA-splicing factor 38-associated
           hydrophilic C-term.  This domain is a hydrophilic region
           found at the C-terminus of plant and metazoan
           pre-mRNA-splicing factor 38 proteins. The function is
           not known.
          Length = 97

 Score = 33.2 bits (76), Expect = 0.034
 Identities = 10/55 (18%), Positives = 20/55 (36%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           E ++ EEEE EEEE++E   ++ +    +                +   +     
Sbjct: 7   EEDLDEEEESEEEEDDEEIRRKAERDVDRGRRSPRRRTRRRSRRRKRSRKRRRRR 61



 Score = 30.9 bits (70), Expect = 0.24
 Identities = 16/31 (51%), Positives = 20/31 (64%)

Query: 281 EEEEEEEEEEEEEEEEEEIGGVVVSAVDFGS 311
           EE+ +EEEE EEEE++EEI       VD G 
Sbjct: 7   EEDLDEEEESEEEEDDEEIRRKAERDVDRGR 37


>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
           subunit 1; Provisional.
          Length = 319

 Score = 35.0 bits (81), Expect = 0.035
 Identities = 23/59 (38%), Positives = 33/59 (55%), Gaps = 16/59 (27%)

Query: 242 GGEREVHEE--EEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           GG+ E  EE  E+ EEEEEE+                   E+E+EE+E+EEEEE+++E 
Sbjct: 273 GGDEEDLEELLEKAEEEEEEDD--------------YSESEDEDEEDEDEEEEEDDDEG 317



 Score = 28.9 bits (65), Expect = 3.3
 Identities = 15/50 (30%), Positives = 30/50 (60%), Gaps = 9/50 (18%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
             E+ E+ EEEEEE++ +++++              E+EEEEE+++E ++
Sbjct: 279 LEELLEKAEEEEEEDDYSESED---------EDEEDEDEEEEEDDDEGDK 319



 Score = 28.1 bits (63), Expect = 6.0
 Identities = 17/46 (36%), Positives = 28/46 (60%)

Query: 253 EEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           E  +E  + K  + K+K + ++ GG  E+ EE  E+ EEEEEE++ 
Sbjct: 250 EAIKEVIKKKGGDFKVKGEPEVVGGDEEDLEELLEKAEEEEEEDDY 295


>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32.  This family
           consists of several mammalian specific proacrosin
           binding protein sp32 sequences. sp32 is a sperm specific
           protein which is known to bind with with 55- and 53-kDa
           proacrosins and the 49-kDa acrosin intermediate. The
           exact function of sp32 is unclear, it is thought however
           that the binding of sp32 to proacrosin may be involved
           in packaging the acrosin zymogen into the acrosomal
           matrix.
          Length = 243

 Score = 35.0 bits (80), Expect = 0.036
 Identities = 21/80 (26%), Positives = 33/80 (41%), Gaps = 16/80 (20%)

Query: 235 SENDYCKGGEREVHEEEEE---------EEEEEEETKAKETKLKEKK-------KLGGGG 278
           +EN   +     +H   EE            + +  K K+ +L  K        K     
Sbjct: 160 TENQSFQPWPERLHNNVEELLQSSLSLGGSVQVKAPKPKQEQLLSKLQEYLQEHKTEEKQ 219

Query: 279 GEEEEEEEEEEEEEEEEEEE 298
            +EE+EEEE EEE ++EE +
Sbjct: 220 PQEEQEEEEVEEEAKQEEGQ 239



 Score = 33.5 bits (76), Expect = 0.11
 Identities = 20/64 (31%), Positives = 34/64 (53%), Gaps = 9/64 (14%)

Query: 239 YCKGGEREVHEEEEEEEEEEEETKA--KETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
              GG  +V   + ++E+   + +   +E K +EK+        +EE+EEEE EEE ++E
Sbjct: 184 LSLGGSVQVKAPKPKQEQLLSKLQEYLQEHKTEEKQ-------PQEEQEEEEVEEEAKQE 236

Query: 297 EEIG 300
           E  G
Sbjct: 237 EGQG 240



 Score = 32.7 bits (74), Expect = 0.18
 Identities = 14/51 (27%), Positives = 29/51 (56%), Gaps = 6/51 (11%)

Query: 248 HEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
            ++E+   + +E  +  +T+ K+ ++      E+EEEE EEE ++EE +  
Sbjct: 197 PKQEQLLSKLQEYLQEHKTEEKQPQE------EQEEEEVEEEAKQEEGQGT 241


>gnl|CDD|146486 pfam03879, Cgr1, Cgr1 family.  Members of this family are
           coiled-coil proteins that are involved in pre-rRNA
           processing.
          Length = 105

 Score = 33.1 bits (76), Expect = 0.039
 Identities = 16/57 (28%), Positives = 27/57 (47%), Gaps = 13/57 (22%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEE------EEEEEEEEEEEEEI 299
           E+  E+  E++  KA+E +LK++K       E E +       E    +EE+E  E 
Sbjct: 31  EKRMEKRLEQQAIKAREKELKDEK-------EAERQRRIQAIKERRAAKEEKERYEK 80


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
           found at the N terminus of SMC proteins. The SMC
           (structural maintenance of chromosomes) superfamily
           proteins have ATP-binding domains at the N- and
           C-termini, and two extended coiled-coil domains
           separated by a hinge in the middle. The eukaryotic SMC
           proteins form two kind of heterodimers: the SMC1/SMC3
           and the SMC2/SMC4 types. These heterodimers constitute
           an essential part of higher order complexes, which are
           involved in chromatin and DNA dynamics. This family also
           includes the RecF and RecN proteins that are involved in
           DNA metabolism and recombination.
          Length = 1162

 Score = 35.3 bits (81), Expect = 0.039
 Identities = 20/65 (30%), Positives = 34/65 (52%)

Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
           + E       E+    E+E ++ E+E  K KE   + +K+L     + E EEEEEE+ E+
Sbjct: 305 KLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEK 364

Query: 294 EEEEE 298
            +E+ 
Sbjct: 365 LQEKL 369



 Score = 34.9 bits (80), Expect = 0.058
 Identities = 22/56 (39%), Positives = 28/56 (50%), Gaps = 4/56 (7%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           E  + E+E  EEEE+ E    E + +EK K      EEE    EEE +EE E  E 
Sbjct: 769 ELSLKEKELAEEEEKTEKLKVEEEKEEKLKAQ----EEELRALEEELKEEAELLEE 820



 Score = 34.6 bits (79), Expect = 0.068
 Identities = 21/56 (37%), Positives = 37/56 (66%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           E+E+ + E+E ++E+EE +  E +LKE +       EEEE+ E+ +E+ E+ EEE+
Sbjct: 321 EKELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEEL 376



 Score = 34.2 bits (78), Expect = 0.10
 Identities = 16/50 (32%), Positives = 26/50 (52%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           EEE    EEE + +A+  + ++         +EEE EE   E +EE++ E
Sbjct: 801 EEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLE 850



 Score = 33.0 bits (75), Expect = 0.22
 Identities = 15/51 (29%), Positives = 22/51 (43%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
            EEEEEE+ E+  +  E   +E         E      + +EEE E + E 
Sbjct: 354 AEEEEEEQLEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELELKNEE 404



 Score = 32.6 bits (74), Expect = 0.34
 Identities = 17/54 (31%), Positives = 28/54 (51%)

Query: 246  EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
            E  +EE+ +EEEEE  K      +E   +      E EE+EE   ++E ++E +
Sbjct: 948  EKEKEEDNKEEEEERNKRLLLAKEELGNVNLMAIAEFEEKEERYNKDELKKERL 1001



 Score = 31.9 bits (72), Expect = 0.50
 Identities = 15/51 (29%), Positives = 25/51 (49%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           EEEEEE+ E+ + K ++ + +   K            + +EEE E + EE 
Sbjct: 355 EEEEEEQLEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELELKNEEE 405



 Score = 31.9 bits (72), Expect = 0.58
 Identities = 14/52 (26%), Positives = 24/52 (46%), Gaps = 5/52 (9%)

Query: 248 HEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
             EE  +  +E     +E     K++L     E+EEE   +  +E +EEE+ 
Sbjct: 235 LNEERIDLLQELLRDEQEEIESSKQEL-----EKEEEILAQVLKENKEEEKE 281



 Score = 31.9 bits (72), Expect = 0.59
 Identities = 21/69 (30%), Positives = 36/69 (52%)

Query: 230 KTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEE 289
                 E       E ++ E++ +E+EEEEE    + + +E++K      E+E  EEEE+
Sbjct: 724 LADKVQEAQDKINEELKLLEQKIKEKEEEEEKSRLKKEEEEEEKSELSLKEKELAEEEEK 783

Query: 290 EEEEEEEEE 298
            E+ + EEE
Sbjct: 784 TEKLKVEEE 792



 Score = 31.5 bits (71), Expect = 0.73
 Identities = 16/55 (29%), Positives = 27/55 (49%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           E E  E+ + +EEE    + +  +  E  +      E+EE+ +EEE EE   E +
Sbjct: 790 EEEKEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELK 844



 Score = 31.5 bits (71), Expect = 0.76
 Identities = 20/56 (35%), Positives = 30/56 (53%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           E    E E  EEE+    + ++ K +E ++L     EE++ E+  EEE E  EEEI
Sbjct: 809 EELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEEELERLEEEI 864



 Score = 31.1 bits (70), Expect = 0.87
 Identities = 14/63 (22%), Positives = 32/63 (50%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVV 303
           ++E+ +EEE   +  +E K +E + K +++      +EEEE + E  + E  + +    +
Sbjct: 258 KQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKL 317

Query: 304 VSA 306
             +
Sbjct: 318 KES 320



 Score = 31.1 bits (70), Expect = 1.0
 Identities = 16/50 (32%), Positives = 25/50 (50%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           +EEE    EEE  +  E   +E+  +      +EEE EE   E +EE++ 
Sbjct: 800 QEEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKL 849



 Score = 30.3 bits (68), Expect = 1.7
 Identities = 16/70 (22%), Positives = 25/70 (35%), Gaps = 5/70 (7%)

Query: 230  KTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEE 289
                  E    K  E +  EEEEE  +     K +   +           E EE+EE   
Sbjct: 938  PEELLLEEADEKEKEEDNKEEEEERNKRLLLAKEELGNVNLMAIA-----EFEEKEERYN 992

Query: 290  EEEEEEEEEI 299
            ++E ++E   
Sbjct: 993  KDELKKERLE 1002



 Score = 30.3 bits (68), Expect = 1.8
 Identities = 13/55 (23%), Positives = 27/55 (49%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           E +  + + +EEE     +  + + +  ++      +EE+ +EEE EE   E +E
Sbjct: 791 EEKEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKE 845



 Score = 29.6 bits (66), Expect = 2.9
 Identities = 21/79 (26%), Positives = 37/79 (46%), Gaps = 11/79 (13%)

Query: 232 IFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKE-----------KKKLGGGGGE 280
             + E +      +E  EEE+E++ +EEE K    + +E           K        E
Sbjct: 260 ELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKE 319

Query: 281 EEEEEEEEEEEEEEEEEEI 299
            E+E ++ E+E ++E+EEI
Sbjct: 320 SEKELKKLEKELKKEKEEI 338



 Score = 29.2 bits (65), Expect = 3.8
 Identities = 13/56 (23%), Positives = 26/56 (46%), Gaps = 4/56 (7%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           E+   +E  ++  EE E  A+     E+ KL     E + +E+ ++  E  + +E 
Sbjct: 168 EKRKKKERLKKLIEETENLAELIIDLEELKLQ----ELKLKEQAKKALEYYQLKEK 219



 Score = 29.2 bits (65), Expect = 4.0
 Identities = 18/65 (27%), Positives = 28/65 (43%), Gaps = 9/65 (13%)

Query: 244 EREVHEEEEEE---EEEEEETKAKETK------LKEKKKLGGGGGEEEEEEEEEEEEEEE 294
                EE   E   E+E+EE   +E +      L  K++LG        E EE+EE   +
Sbjct: 934 YESEPEELLLEEADEKEKEEDNKEEEEERNKRLLLAKEELGNVNLMAIAEFEEKEERYNK 993

Query: 295 EEEEI 299
           +E + 
Sbjct: 994 DELKK 998



 Score = 28.8 bits (64), Expect = 4.7
 Identities = 12/63 (19%), Positives = 25/63 (39%), Gaps = 5/63 (7%)

Query: 241 KGGEREVHEEE-----EEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
           K   R   EEE     E+ +++E   K  E      + +      + +E + +E+ ++  
Sbjct: 152 KPERRLEIEEEAAGSREKRKKKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKAL 211

Query: 296 EEE 298
           E  
Sbjct: 212 EYY 214



 Score = 28.4 bits (63), Expect = 6.1
 Identities = 16/55 (29%), Positives = 26/55 (47%), Gaps = 8/55 (14%)

Query: 249  EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVV 303
             E EE+EE   + + K+ +L+E+KK        E   E  EE  +  +E +   V
Sbjct: 982  AEFEEKEERYNKDELKKERLEEEKK--------ELLREIIEETCQRFKEFLELFV 1028


>gnl|CDD|178778 PLN03240, PLN03240, putative Low-temperature-induced protein;
           Provisional.
          Length = 626

 Score = 35.2 bits (80), Expect = 0.043
 Identities = 22/63 (34%), Positives = 33/63 (52%), Gaps = 2/63 (3%)

Query: 241 KGGEREVHEEE--EEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           +G ++ V  +E   E+    EE KA    + EK  LGGGGGE    ++    + EE E+ 
Sbjct: 471 QGQDKGVSAKEFLTEKLSPGEEDKALSEVVAEKLHLGGGGGETAPPKKGIVTQSEEVEKR 530

Query: 299 IGG 301
           +GG
Sbjct: 531 LGG 533


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 34.5 bits (80), Expect = 0.049
 Identities = 15/48 (31%), Positives = 28/48 (58%)

Query: 247 VHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
           +   EEE +EE +E K ++ K + + KL     EE+ + EE+E +++ 
Sbjct: 273 LKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQA 320



 Score = 31.1 bits (71), Expect = 0.67
 Identities = 15/52 (28%), Positives = 30/52 (57%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
           E ++ +  EEE +EE + K +E K +E++        EE+ + EE+E +++ 
Sbjct: 269 EEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQA 320



 Score = 30.7 bits (70), Expect = 0.97
 Identities = 15/50 (30%), Positives = 27/50 (54%), Gaps = 7/50 (14%)

Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           EEEEE+  +   + ++ + +EKK       EE+++EE E +  +   EE 
Sbjct: 266 EEEEEKILKAAEEERQEEAQEKK-------EEKKKEEREAKLAKLSPEEQ 308



 Score = 29.9 bits (68), Expect = 1.8
 Identities = 19/54 (35%), Positives = 30/54 (55%), Gaps = 10/54 (18%)

Query: 249 EEEEEE----EEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           EEEEE+     EEE + +A+E K ++KK+      E E +  +   EE+ + EE
Sbjct: 266 EEEEEKILKAAEEERQEEAQEKKEEKKKE------EREAKLAKLSPEEQRKLEE 313


>gnl|CDD|117592 pfam09026, Cenp-B_dimeris, Centromere protein B dimerisation
           domain.  The centromere protein B (CENP-B) dimerisation
           domain is composed of two alpha-helices, which are
           folded into an antiparallel configuration. Dimerisation
           of CENP-B is mediated by this domain, in which monomers
           dimerise to form a symmetrical, antiparallel, four-helix
           bundle structure with a large hydrophobic patch in which
           23 residues of one monomer form van der Waals contacts
           with the other monomer. This CENP-B dimer configuration
           may be suitable for capturing two distant CENP-B boxes
           during centromeric heterochromatin formation.
          Length = 101

 Score = 32.8 bits (74), Expect = 0.049
 Identities = 11/37 (29%), Positives = 24/37 (64%)

Query: 280 EEEEEEEEEEEEEEEEEEEIGGVVVSAVDFGSGGWWF 316
           +EEE++++E+EE+++E+++     V    FG    +F
Sbjct: 17  DEEEDDDDEDEEDDDEDDDEDDDEVPVPSFGEAMAYF 53



 Score = 30.1 bits (67), Expect = 0.50
 Identities = 8/21 (38%), Positives = 17/21 (80%)

Query: 278 GGEEEEEEEEEEEEEEEEEEE 298
           G E+ + + +EEE++++E+EE
Sbjct: 8   GEEDSDSDSDEEEDDDDEDEE 28



 Score = 29.0 bits (64), Expect = 1.2
 Identities = 11/55 (20%), Positives = 29/55 (52%), Gaps = 19/55 (34%)

Query: 239 YCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
           + +G E    + +EEE++++E+                   EE+++E+++E+++E
Sbjct: 5   FLEGEEDSDSDSDEEEDDDDED-------------------EEDDDEDDDEDDDE 40



 Score = 28.6 bits (63), Expect = 1.5
 Identities = 8/25 (32%), Positives = 18/25 (72%)

Query: 274 LGGGGGEEEEEEEEEEEEEEEEEEE 298
           L G    + + +EEE++++E+EE++
Sbjct: 6   LEGEEDSDSDSDEEEDDDDEDEEDD 30



 Score = 28.6 bits (63), Expect = 1.8
 Identities = 7/24 (29%), Positives = 18/24 (75%)

Query: 275 GGGGGEEEEEEEEEEEEEEEEEEE 298
           G    + + +EEE++++E+EE+++
Sbjct: 8   GEEDSDSDSDEEEDDDDEDEEDDD 31



 Score = 28.2 bits (62), Expect = 1.9
 Identities = 7/19 (36%), Positives = 18/19 (94%)

Query: 280 EEEEEEEEEEEEEEEEEEE 298
           + +EEE++++E+EE+++E+
Sbjct: 15  DSDEEEDDDDEDEEDDDED 33


>gnl|CDD|130712 TIGR01651, CobT, cobaltochelatase, CobT subunit.  This model
           describes Pseudomonas denitrificans CobT gene product,
           which is a cobalt chelatase subunit that functions in
           cobalamin biosynthesis. Cobalamin (vitamin B12) can be
           synthesized via several pathways, including an aerobic
           pathway (found in Pseudomonas denitrificans) and an
           anaerobic pathway (found in P. shermanii and Salmonella
           typhimurium). These pathways differ in the point of
           cobalt insertion during corrin ring formation. There are
           apparently a number of variations on these two pathways,
           where the major differences seem to be concerned with
           the process of ring contraction. Confusion regarding the
           functions of enzymes found in the aerobic vs. anaerobic
           pathways has arisen because nonhomologous genes in these
           different pathways were given the same gene symbols.
           Thus, cobT in the aerobic pathway (P. denitrificans) is
           not a homolog of cobT in the anaerobic pathway (S.
           typhimurium). It should be noted that E. coli
           synthesizes cobalamin only when it is supplied with the
           precursor cobinamide, which is a complex intermediate.
           Additionally, all E. coli cobalamin synthesis genes
           (cobU, cobS and cobT) were named after their Salmonella
           typhimurium homologs which function in the anaerobic
           cobalamin synthesis pathway. This model describes the
           aerobic cobalamin pathway Pseudomonas denitrificans CobT
           gene product, which is a cobalt chelatase subunit, with
           a MW ~70 kDa. The aerobic pathway cobalt chelatase is a
           heterotrimeric, ATP-dependent enzyme that catalyzes
           cobalt insertion during cobalamin biosynthesis. The
           other two subunits are the P. denitrificans CobS
           (TIGR01650) and CobN (pfam02514 CobN/Magnesium
           Chelatase) proteins. To avoid potential confusion with
           the nonhomologous Salmonella typhimurium/E.coli cobT
           gene product, the P. denitrificans gene symbol is not
           used in the name of this model [Biosynthesis of
           cofactors, prosthetic groups, and carriers, Heme,
           porphyrin, and cobalamin].
          Length = 600

 Score = 34.9 bits (80), Expect = 0.050
 Identities = 18/57 (31%), Positives = 30/57 (52%), Gaps = 2/57 (3%)

Query: 246 EVHEE--EEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
           E+ EE  ++ E E+EE+    +    E+++ G G GE +E    +E E  + E E G
Sbjct: 198 ELAEEMGDDTESEDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRESESG 254



 Score = 34.1 bits (78), Expect = 0.098
 Identities = 16/60 (26%), Positives = 28/60 (46%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVSAVD 308
           + E E+EE+ ++ +  E + +E+ +  G G E    +E E  + E E  E   V     D
Sbjct: 206 DTESEDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRESESGEEEMVQSDQDD 265



 Score = 33.4 bits (76), Expect = 0.19
 Identities = 13/67 (19%), Positives = 34/67 (50%)

Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
           + + D  +  E E  E+ E E E +E +  +E++  +++   G     + ++++  +E +
Sbjct: 212 EEDGDDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRESESGEEEMVQSDQDDLPDESD 271

Query: 294 EEEEEIG 300
           ++ E  G
Sbjct: 272 DDSETPG 278


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 34.8 bits (81), Expect = 0.054
 Identities = 19/53 (35%), Positives = 32/53 (60%), Gaps = 7/53 (13%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
           ERE+ ++ EE E   +E +  + +L+EKK       E+ +EEE++  EE E+E
Sbjct: 529 ERELEQKAEEAEALLKEAEKLKEELEEKK-------EKLQEEEDKLLEEAEKE 574



 Score = 28.6 bits (65), Expect = 4.9
 Identities = 21/49 (42%), Positives = 31/49 (63%), Gaps = 8/49 (16%)

Query: 253 EEEEEEEETKAKET--KLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           EE E E E KA+E    LKE +KL      +EE EE++E+ +EEE++ +
Sbjct: 526 EELERELEQKAEEAEALLKEAEKL------KEELEEKKEKLQEEEDKLL 568


>gnl|CDD|221581 pfam12446, DUF3682, Protein of unknown function (DUF3682).  This
           domain family is found in eukaryotes, and is typically
           between 125 and 136 amino acids in length.
          Length = 133

 Score = 33.2 bits (76), Expect = 0.055
 Identities = 12/24 (50%), Positives = 18/24 (75%)

Query: 275 GGGGGEEEEEEEEEEEEEEEEEEE 298
           G G   +EEEEEEEE E++++ +E
Sbjct: 90  GTGHTRQEEEEEEEENEKQQQSDE 113



 Score = 32.9 bits (75), Expect = 0.074
 Identities = 11/23 (47%), Positives = 17/23 (73%)

Query: 276 GGGGEEEEEEEEEEEEEEEEEEE 298
           G   +EEEEEEEE E++++ +E 
Sbjct: 92  GHTRQEEEEEEEENEKQQQSDEA 114



 Score = 30.9 bits (70), Expect = 0.36
 Identities = 11/25 (44%), Positives = 18/25 (72%)

Query: 275 GGGGGEEEEEEEEEEEEEEEEEEEI 299
           G    EEEEEEEE E++++ +E ++
Sbjct: 92  GHTRQEEEEEEEENEKQQQSDEAQV 116



 Score = 30.9 bits (70), Expect = 0.40
 Identities = 12/29 (41%), Positives = 21/29 (72%), Gaps = 2/29 (6%)

Query: 242 GGEREVHEEEEEEEEEEEETKAKETKLKE 270
           G  R+  EEEEEEEE E++ ++ E ++++
Sbjct: 92  GHTRQ--EEEEEEEENEKQQQSDEAQVQQ 118


>gnl|CDD|215541 PLN03020, PLN03020, low-temperature-induced protein; Provisional.
          Length = 556

 Score = 34.6 bits (79), Expect = 0.057
 Identities = 16/62 (25%), Positives = 30/62 (48%), Gaps = 9/62 (14%)

Query: 247 VHEEEEEEEEEEEET-----KAKETKLKEKKKLGGGGGEEEEEEE----EEEEEEEEEEE 297
           +H   E E+  E+++     K K  K+K+     G G   +E       +  EE+++E+E
Sbjct: 18  LHSATEGEDHHEKKSVLKKVKDKAKKIKDTITKHGHGHHHDEHRVPDDHDLYEEDDDEDE 77

Query: 298 EI 299
           E+
Sbjct: 78  EM 79


>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
           eukaryotic snRNP [Transcription].
          Length = 564

 Score = 34.7 bits (80), Expect = 0.057
 Identities = 20/113 (17%), Positives = 35/113 (30%), Gaps = 16/113 (14%)

Query: 196 FNMHGEVSYTYASELH------TACVFLDLDPVNRPDGYNKTIFQSENDYCK-----GGE 244
               GE+ Y     +           F   + +      +KT  ++   Y K      G 
Sbjct: 319 LRQKGELLYANLQLIEEGLKSVRLADFYGNEEIKIELDKSKTPSENAQRYFKKYKKLKGA 378

Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
           +   + +  E +E            EK +     G++  EE  EE  EE   +
Sbjct: 379 KVNLDRQLSELKEAIAYYESAKTALEKAE-----GKKAIEEIREELIEEGLLK 426



 Score = 30.4 bits (69), Expect = 1.4
 Identities = 20/85 (23%), Positives = 39/85 (45%), Gaps = 12/85 (14%)

Query: 216 FLDLDPVNRPDGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLG 275
           +LD+ P+       K   ++ ++  +  + +    E E++ E+E         K + KL 
Sbjct: 251 YLDVVPLKAYADLEKLFNEALDEKFERDKIKQLASELEKKLEKE-------LKKLENKL- 302

Query: 276 GGGGEEEEEEEEEEEEEEEEEEEIG 300
               E++E+E EE E+  EE  + G
Sbjct: 303 ----EKQEDELEELEKAAEELRQKG 323


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 34.4 bits (79), Expect = 0.060
 Identities = 12/55 (21%), Positives = 32/55 (58%), Gaps = 7/55 (12%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           +R+  E+++ EE ++++   +E +LK+ +K      E    +E++++ EE  ++ 
Sbjct: 80  QRKKKEQQQAEELQQKQAAEQE-RLKQLEK------ERLAAQEQKKQAEEAAKQA 127



 Score = 34.0 bits (78), Expect = 0.079
 Identities = 13/58 (22%), Positives = 32/58 (55%), Gaps = 6/58 (10%)

Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           K  +R   + +++E+++ EE + K+   +E+ K      + E+E    +E++++ EE 
Sbjct: 72  KSAKRAEEQRKKKEQQQAEELQQKQAAEQERLK------QLEKERLAAQEQKKQAEEA 123



 Score = 33.2 bits (76), Expect = 0.17
 Identities = 10/55 (18%), Positives = 28/55 (50%), Gaps = 6/55 (10%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
             E+ +++  E+E  ++ + +    +E+KK      + EE  ++   ++++ EE 
Sbjct: 89  AEELQQKQAAEQERLKQLEKERLAAQEQKK------QAEEAAKQAALKQKQAEEA 137



 Score = 32.5 bits (74), Expect = 0.30
 Identities = 9/50 (18%), Positives = 28/50 (56%), Gaps = 5/50 (10%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           +++ EE ++++  + +  K  EK++L       +E++++ EE  ++   +
Sbjct: 86  QQQAEELQQKQAAEQERLKQLEKERL-----AAQEQKKQAEEAAKQAALK 130



 Score = 29.8 bits (67), Expect = 2.2
 Identities = 12/48 (25%), Positives = 27/48 (56%), Gaps = 9/48 (18%)

Query: 252 EEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           EE+ +++E+ +A+E  L++K+         E+E  ++ E+E    +E 
Sbjct: 78  EEQRKKKEQQQAEE--LQQKQ-------AAEQERLKQLEKERLAAQEQ 116


>gnl|CDD|221250 pfam11831, Myb_Cef, pre-mRNA splicing factor component.  This
           family is a region of the Myb-Related Cdc5p/Cef1
           proteins, in fungi, and is part of the pre-mRNA splicing
           factor complex.
          Length = 363

 Score = 34.6 bits (80), Expect = 0.061
 Identities = 17/62 (27%), Positives = 28/62 (45%), Gaps = 17/62 (27%)

Query: 236 ENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
           +N++          E E  EEEEEE +  E +L+E          + +  +   EE +E+
Sbjct: 142 KNEF----------ELELPEEEEEEPEEMEEELEEDA-------ADRDARKRAAEEAKEQ 184

Query: 296 EE 297
           EE
Sbjct: 185 EE 186



 Score = 28.5 bits (64), Expect = 5.1
 Identities = 14/19 (73%), Positives = 14/19 (73%)

Query: 280 EEEEEEEEEEEEEEEEEEE 298
           E  EEEEEE EE EEE EE
Sbjct: 148 ELPEEEEEEPEEMEEELEE 166



 Score = 28.1 bits (63), Expect = 7.0
 Identities = 12/20 (60%), Positives = 14/20 (70%)

Query: 280 EEEEEEEEEEEEEEEEEEEI 299
           EEEEE EE EEE EE+  + 
Sbjct: 152 EEEEEPEEMEEELEEDAADR 171


>gnl|CDD|219882 pfam08524, rRNA_processing, rRNA processing.  This is a family of
           proteins that are involved in rRNA processing. In a
           localisation study they were found to localise to the
           nucleus and nucleolus. The family also includes other
           metazoa members from plants to mammals where the protein
           has been named BR22 and is associated with TTF-1,
           thyroid transcription factor 1. In the lungs, the family
           binds TTF-1 to form a complex which influences the
           expression of the key lung surfactant protein-B (SP-B)
           and -C (SP-C), the small hydrophobic surfactant proteins
           that maintain surface tension in alveoli.
          Length = 150

 Score = 33.3 bits (76), Expect = 0.063
 Identities = 18/60 (30%), Positives = 33/60 (55%), Gaps = 4/60 (6%)

Query: 239 YCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           Y K  E+E +   E+E  E++   +KE +  EKKK      +E++E  ++ + E+ E+E 
Sbjct: 38  YLKLLEKEGYAVPEKESAEKQVKSSKEDRKFEKKK----KLDEKKEIAKQRKREQREKEL 93


>gnl|CDD|178593 PLN03021, PLN03021, Low-temperature-induced protein; Provisional.
          Length = 619

 Score = 34.6 bits (78), Expect = 0.065
 Identities = 23/74 (31%), Positives = 37/74 (50%), Gaps = 7/74 (9%)

Query: 247 VHEEEEEEEEEEEETKA----KETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGV 302
           +H  EE+E  E+  +K     KE   K K  L   G   + + E++++E +E++ E+ G 
Sbjct: 20  IHHPEEDEHHEKGASKVLKKVKEKAKKIKNSLTKHGNGHDHDVEDDDDEYDEQDPEVHGA 79

Query: 303 VV---SAVDFGSGG 313
            V   SAV  G  G
Sbjct: 80  PVYESSAVRGGVTG 93


>gnl|CDD|240576 cd12932, RRP7_like, RRP7 domain ribosomal RNA-processing protein 7
           (Rrp7p), ribosomal RNA-processing protein 7 homolog A
           (Rrp7A), and similar proteins.  This CD corresponds to
           the RRP7 domain of Rrp7p and Rrp7A. Rrp7p is encoded by
           YCL031C gene from Saccharomyces cerevisiae. It is an
           essential yeast protein involved in pre-rRNA processing
           and ribosome assembly, and is speculated to be required
           for correct assembly of rpS27 into the pre-ribosomal
           particle. Rrp7A, also termed gastric cancer antigen
           Zg14, is the Rrp7p homolog mainly found in Metazoans.
           The cellular function of Rrp7A remains unclear
           currently. Both Rrp7p and Rrp7A harbor an N-terminal RNA
           recognition motif (RRM), also termed RBD (RNA binding
           domain) or RNP (ribonucleoprotein domain), and a
           C-terminal RRP7 domain.
          Length = 118

 Score = 32.6 bits (75), Expect = 0.066
 Identities = 15/60 (25%), Positives = 30/60 (50%), Gaps = 9/60 (15%)

Query: 246 EVHEEEEEEEEEEEETKAKE---------TKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
           E  ++ EEEE+EE +    E         T+   K K       E + +E+E+++++++E
Sbjct: 28  EEFDKREEEEKEEAKEARNEPDEDGFVTVTRGGRKGKTAREEAVEAKAKEKEKKKKKKKE 87



 Score = 26.8 bits (60), Expect = 6.5
 Identities = 12/26 (46%), Positives = 17/26 (65%)

Query: 280 EEEEEEEEEEEEEEEEEEEIGGVVVS 305
           + EEEE+EE +E   E +E G V V+
Sbjct: 32  KREEEEKEEAKEARNEPDEDGFVTVT 57


>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein.  The proteins in this family
           are designated YL1. These proteins have been shown to be
           DNA-binding and may be a transcription factor.
          Length = 238

 Score = 33.9 bits (78), Expect = 0.066
 Identities = 21/47 (44%), Positives = 24/47 (51%), Gaps = 12/47 (25%)

Query: 252 EEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           EEE EE+E                    EEEE++EE E EEEEEEEE
Sbjct: 19  EEELEEDEFFW------------TYLLFEEEEDDEEFEIEEEEEEEE 53



 Score = 33.5 bits (77), Expect = 0.11
 Identities = 17/48 (35%), Positives = 28/48 (58%)

Query: 251 EEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           EEEE++EE E + +E + +          +E E ++EEE E+E + EE
Sbjct: 35  EEEEDDEEFEIEEEEEEEEVDSDFDDSEDDEPESDDEEEGEKELQREE 82



 Score = 30.8 bits (70), Expect = 0.67
 Identities = 20/68 (29%), Positives = 32/68 (47%), Gaps = 22/68 (32%)

Query: 232 IFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEE 291
           +F+ E D     + E   EEEEEEEE +                    ++ E++E E ++
Sbjct: 33  LFEEEED-----DEEFEIEEEEEEEEVDS-----------------DFDDSEDDEPESDD 70

Query: 292 EEEEEEEI 299
           EEE E+E+
Sbjct: 71  EEEGEKEL 78



 Score = 30.8 bits (70), Expect = 0.77
 Identities = 18/54 (33%), Positives = 30/54 (55%), Gaps = 5/54 (9%)

Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           E  ++EE E EEEEE +  ++   + +       E E ++EEE E+E + EE +
Sbjct: 36  EEEDDEEFEIEEEEEEEEVDSDFDDSE-----DDEPESDDEEEGEKELQREERL 84



 Score = 30.0 bits (68), Expect = 1.5
 Identities = 18/52 (34%), Positives = 27/52 (51%), Gaps = 9/52 (17%)

Query: 248 HEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
            E EEEEEEEE ++   +++  E          E ++EEE E+E + EE   
Sbjct: 43  FEIEEEEEEEEVDSDFDDSEDDE---------PESDDEEEGEKELQREERLK 85



 Score = 29.7 bits (67), Expect = 1.6
 Identities = 12/28 (42%), Positives = 19/28 (67%)

Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKK 273
           E  E E ++EEE E+   +E +LK+KK+
Sbjct: 62  EDDEPESDDEEEGEKELQREERLKKKKR 89



 Score = 27.3 bits (61), Expect = 9.9
 Identities = 9/27 (33%), Positives = 16/27 (59%)

Query: 248 HEEEEEEEEEEEETKAKETKLKEKKKL 274
            +E E ++EEE E + +  +  +KKK 
Sbjct: 63  DDEPESDDEEEGEKELQREERLKKKKR 89


>gnl|CDD|223649 COG0576, GrpE, Molecular chaperone GrpE (heat shock protein)
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 193

 Score = 33.8 bits (78), Expect = 0.071
 Identities = 18/56 (32%), Positives = 32/56 (57%)

Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
            ++E   EE + EE EE  K++E + +E++       EEE++E  E E + EE ++
Sbjct: 2   SDKEQKTEEPDAEETEEAEKSEEEEAEEEEPEEENELEEEQQEIAELEAQLEELKD 57



 Score = 30.0 bits (68), Expect = 1.0
 Identities = 20/56 (35%), Positives = 30/56 (53%), Gaps = 8/56 (14%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
             E   EE EE E+ EE +A+E + +E+         E EEE++E  E E + EE+
Sbjct: 8   TEEPDAEETEEAEKSEEEEAEEEEPEEE--------NELEEEQQEIAELEAQLEEL 55


>gnl|CDD|114603 pfam05887, Trypan_PARP, Procyclic acidic repetitive protein (PARP).
            This family consists of several Trypanosoma brucei
           procyclic acidic repetitive protein (PARP) like
           sequences. The procyclic acidic repetitive protein
           (parp) genes of Trypanosoma brucei encode a small family
           of abundant surface proteins whose expression is
           restricted to the procyclic form of the parasite. They
           are found at two unlinked loci, parpA and parpB;
           transcription of both loci is developmentally regulated.
          Length = 145

 Score = 33.0 bits (74), Expect = 0.082
 Identities = 26/66 (39%), Positives = 32/66 (48%), Gaps = 6/66 (9%)

Query: 242 GGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
             E E  EE E EEE EEE + +ET  +E +       E E E E E E E E E E G 
Sbjct: 71  EPEEEGEEEPEPEEEGEEEPEPEETGEEEPEP------EPEPEPEPEPEPEPEPEPEPGA 124

Query: 302 VVVSAV 307
             + +V
Sbjct: 125 ATLKSV 130



 Score = 32.2 bits (72), Expect = 0.15
 Identities = 25/65 (38%), Positives = 31/65 (47%), Gaps = 4/65 (6%)

Query: 238 DYCKGGEREVHEEEEEE----EEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
           D   G + +   EEEEE    EE EEE + +E   +E +    G  E E E E E E E 
Sbjct: 53  DDTNGTDPDDEPEEEEEPEPEEEGEEEPEPEEEGEEEPEPEETGEEEPEPEPEPEPEPEP 112

Query: 294 EEEEE 298
           E E E
Sbjct: 113 EPEPE 117



 Score = 32.2 bits (72), Expect = 0.18
 Identities = 22/65 (33%), Positives = 31/65 (47%)

Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
           + E     G +     + ++E EEEEE + +E   +E +    G  E E EE  EEE E 
Sbjct: 43  KGEKGTKVGADDTNGTDPDDEPEEEEEPEPEEEGEEEPEPEEEGEEEPEPEETGEEEPEP 102

Query: 294 EEEEE 298
           E E E
Sbjct: 103 EPEPE 107


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 34.1 bits (79), Expect = 0.085
 Identities = 15/50 (30%), Positives = 29/50 (58%), Gaps = 6/50 (12%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           E EEEE E    + A+  KL++K++      + E++ E+EE E+   +++
Sbjct: 395 EGEEEEGENGNLSPAERKKLRKKQR------KAEKKAEKEEAEKAAAKKK 438



 Score = 31.8 bits (73), Expect = 0.43
 Identities = 14/54 (25%), Positives = 23/54 (42%), Gaps = 5/54 (9%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
           E E  E E       E  K ++ + K +KK      E+EE E+   +++ E   
Sbjct: 395 EGEEEEGENGNLSPAERKKLRKKQRKAEKKA-----EKEEAEKAAAKKKAEAAA 443



 Score = 31.4 bits (72), Expect = 0.61
 Identities = 10/39 (25%), Positives = 19/39 (48%)

Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEE 288
            E++ E+EE E  A + K +   K   G   E ++ + +
Sbjct: 421 AEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDPD 459



 Score = 29.5 bits (67), Expect = 2.9
 Identities = 10/39 (25%), Positives = 18/39 (46%)

Query: 251 EEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEE 289
            E++ E+EE  KA   K  E       G + E ++ + +
Sbjct: 421 AEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDPD 459



 Score = 28.7 bits (65), Expect = 3.9
 Identities = 12/39 (30%), Positives = 21/39 (53%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEE 287
            E++ E+EE E+  AK+      KK  G  GE ++ + +
Sbjct: 421 AEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDPD 459


>gnl|CDD|219069 pfam06512, Na_trans_assoc, Sodium ion transport-associated.
           Members of this family contain a region found
           exclusively in eukaryotic sodium channels or their
           subunits, many of which are voltage-gated. Members very
           often also contain between one and four copies of
           pfam00520 and, less often, one copy of pfam00612.
          Length = 230

 Score = 33.5 bits (77), Expect = 0.094
 Identities = 22/67 (32%), Positives = 30/67 (44%), Gaps = 10/67 (14%)

Query: 243 GEREVHEEEEEEEEEEEETKAKE-------TKLKEKKKLGG---GGGEEEEEEEEEEEEE 292
           GE ++    +E+   E     KE         LK ++K      G   + E  EEEEEE 
Sbjct: 110 GESDLENLNDEDTSSESSYGFKEESKKGSAETLKLEEKDSSSSEGSTVDLEPPEEEEEEI 169

Query: 293 EEEEEEI 299
            EEEEE+
Sbjct: 170 AEEEEEV 176


>gnl|CDD|234352 TIGR03779, Bac_Flav_CT_M, Bacteroides conjugative transposon TraM
           protein.  Members of this protein family are designated
           TraM and are found in a proposed transfer region of a
           class of conjugative transposon found in the Bacteroides
           lineage [Cellular processes, DNA transformation].
          Length = 410

 Score = 33.9 bits (78), Expect = 0.095
 Identities = 14/76 (18%), Positives = 35/76 (46%)

Query: 226 DGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEE 285
            G+N  I Q +++   G + + +E+E  E++++++ +  +      +K      +    +
Sbjct: 40  KGFNADIPQPKDEGIIGDKMKAYEQEALEQKQKDKKRTLQDLADSFEKSDKESQDYSSSD 99

Query: 286 EEEEEEEEEEEEEIGG 301
              +EE +E  E  G 
Sbjct: 100 SGFDEEPDEPAETAGS 115



 Score = 28.2 bits (63), Expect = 6.8
 Identities = 12/58 (20%), Positives = 20/58 (34%), Gaps = 13/58 (22%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
            RE+    E  + +EE+        +  +++     EE E     E     E EE   
Sbjct: 130 NRELGSFYEYPKTDEEK--------ELLREV-----EELESRLATEPSPAPELEEQLA 174


>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
           (nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
           phosphoribosyltransferase) [Coenzyme metabolism].
          Length = 620

 Score = 34.0 bits (78), Expect = 0.10
 Identities = 18/76 (23%), Positives = 30/76 (39%), Gaps = 2/76 (2%)

Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
             ++D     E    E   EE E  +E++  E +  + +   G     E  E+ E +E +
Sbjct: 235 DGDDDQPDNNEDS--EAGREESEGSDESEEDEAEATDGEGEEGEMDAAEASEDSESDESD 292

Query: 294 EEEEEIGGVVVSAVDF 309
           E+ E  G     A  F
Sbjct: 293 EDTETPGEDARPATPF 308



 Score = 32.9 bits (75), Expect = 0.22
 Identities = 16/55 (29%), Positives = 26/55 (47%)

Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
           E  ++  EE+ +EE+    +    E  + G    E  +E EE+E E  + E E G
Sbjct: 221 ETGDDGIEEDADEEDGDDDQPDNNEDSEAGREESEGSDESEEDEAEATDGEGEEG 275



 Score = 32.1 bits (73), Expect = 0.43
 Identities = 16/67 (23%), Positives = 29/67 (43%)

Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
           + + D   G + +    E+ E   EE   + E++  E +   G G E E +  E  E+ E
Sbjct: 228 EEDADEEDGDDDQPDNNEDSEAGREESEGSDESEEDEAEATDGEGEEGEMDAAEASEDSE 287

Query: 294 EEEEEIG 300
            +E +  
Sbjct: 288 SDESDED 294



 Score = 31.4 bits (71), Expect = 0.74
 Identities = 16/57 (28%), Positives = 28/57 (49%)

Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVSA 306
           EE+ +EE+ ++ +    +  E  +    G +E EE+E E  + E EE E+     S 
Sbjct: 228 EEDADEEDGDDDQPDNNEDSEAGREESEGSDESEEDEAEATDGEGEEGEMDAAEASE 284



 Score = 30.2 bits (68), Expect = 1.5
 Identities = 12/56 (21%), Positives = 29/56 (51%)

Query: 242 GGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
           G +    + +EE+ ++++    ++++   ++  G    EE+E E  + E EE E +
Sbjct: 223 GDDGIEEDADEEDGDDDQPDNNEDSEAGREESEGSDESEEDEAEATDGEGEEGEMD 278



 Score = 27.9 bits (62), Expect = 9.1
 Identities = 14/56 (25%), Positives = 25/56 (44%), Gaps = 12/56 (21%)

Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           G+  + E+ +EE+ ++++    E              E   EE E  +E EE+E E
Sbjct: 223 GDDGIEEDADEEDGDDDQPDNNEDS------------EAGREESEGSDESEEDEAE 266


>gnl|CDD|216546 pfam01516, Orbi_VP6, Orbivirus helicase VP6.  The VP6 protein a
           minor protein in the core of the virion is probably the
           viral helicase.
          Length = 322

 Score = 33.9 bits (77), Expect = 0.10
 Identities = 16/61 (26%), Positives = 33/61 (54%), Gaps = 2/61 (3%)

Query: 254 EEEEEEETKAKETKLKEK--KKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVSAVDFGS 311
           +  E  E+K KE K ++   +++  G GE++++  +EE ++E E+  +     + V  GS
Sbjct: 26  DWFESGESKEKEPKDEDGQEQRISDGEGEQKQKGGKEESDKETEDASVDRRTHTEVGSGS 85

Query: 312 G 312
            
Sbjct: 86  S 86


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 33.9 bits (78), Expect = 0.11
 Identities = 13/63 (20%), Positives = 31/63 (49%)

Query: 238 DYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
            Y       + +E      EEE   A  +  K+KK+      +++ E+++++++E++E +
Sbjct: 25  SYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPK 84

Query: 298 EIG 300
             G
Sbjct: 85  SEG 87



 Score = 30.5 bits (69), Expect = 1.2
 Identities = 19/91 (20%), Positives = 37/91 (40%), Gaps = 6/91 (6%)

Query: 229 NKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKL------KEKKKLGGGGGEEE 282
             T    + D     +++  +++++++E++E     ETKL      K KK        + 
Sbjct: 52  TSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKP 111

Query: 283 EEEEEEEEEEEEEEEEIGGVVVSAVDFGSGG 313
            E+ +    +  E  E   V + A    SGG
Sbjct: 112 NEDVDNAFNKIAELAEKSNVYIGAHVSASGG 142


>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein. 
          Length = 198

 Score = 33.2 bits (76), Expect = 0.11
 Identities = 13/48 (27%), Positives = 21/48 (43%)

Query: 261 TKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVSAVD 308
           T  K    K+ KK       EEE++ E ++++ EE E        + D
Sbjct: 58  TPRKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENENDPDNNEESGD 105



 Score = 27.4 bits (61), Expect = 7.7
 Identities = 10/60 (16%), Positives = 26/60 (43%), Gaps = 4/60 (6%)

Query: 240 CKGGEREVHEEEEEEEEEEEETKAKE---TKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
            K  E+   EEE++ E ++++T+  E      +E              + + E++ ++ +
Sbjct: 70  KKDKEKLT-EEEKKPESDDDKTEENENDPDNNEESGDSQESASANSLSDIDNEDDMDDSD 128


>gnl|CDD|219405 pfam07418, PCEMA1, Acidic phosphoprotein precursor PCEMA1.  This
           family consists of several acidic phosphoprotein
           precursor PCEMA1 sequences which appear to be found
           exclusively in Plasmodium chabaudi. PCEMA1 is an antigen
           that is associated with the membrane of the infected
           erythrocyte throughout the entire intraerythrocytic
           cycle. The exact function of this family is unclear.
          Length = 286

 Score = 33.3 bits (76), Expect = 0.11
 Identities = 19/63 (30%), Positives = 31/63 (49%), Gaps = 5/63 (7%)

Query: 243 GEREVHEEEEEEE-----EEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
           GE E +++E E E     ++ E     + K++ + +   G      E E E+E E+E EE
Sbjct: 213 GENEENDDELEAEVISYLKDGENDNEVKEKIRREYREWKGDKANTNETEIEDESEDEYEE 272

Query: 298 EIG 300
           E G
Sbjct: 273 EAG 275


>gnl|CDD|223731 COG0658, ComEC, Predicted membrane metal-binding protein [General
           function prediction only].
          Length = 453

 Score = 33.6 bits (77), Expect = 0.11
 Identities = 13/66 (19%), Positives = 24/66 (36%), Gaps = 5/66 (7%)

Query: 63  IIVPIFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTATD 122
           +IV     +I + G   N L+  +V+F         L+++ L +A LL            
Sbjct: 393 LIVSPLISLISLPGLLANFLLSPLVSF-----LIVPLILLGLLLAPLLIGAALARVRPGV 447

Query: 123 YVTTCW 128
           +     
Sbjct: 448 FGLIQL 453


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 33.9 bits (78), Expect = 0.11
 Identities = 20/55 (36%), Positives = 25/55 (45%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           ER    E E EE EE   + KE     K++L       EE E+   E EE +EE 
Sbjct: 309 ERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEEL 363



 Score = 33.1 bits (76), Expect = 0.24
 Identities = 20/56 (35%), Positives = 32/56 (57%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           ER++ E + E    EEE +  +++L+E ++      EE EE +E  EE EEE E +
Sbjct: 715 ERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESL 770



 Score = 31.6 bits (72), Expect = 0.70
 Identities = 17/55 (30%), Positives = 23/55 (41%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           + E+ E E   EE E+     E   +E ++      EE EE  E   EE  E E 
Sbjct: 336 KEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEA 390



 Score = 31.2 bits (71), Expect = 0.74
 Identities = 20/53 (37%), Positives = 26/53 (49%), Gaps = 2/53 (3%)

Query: 249 EEEEEEEEEEEETKAKETKLKEK--KKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           EE E   EE E+  A+  + KE+  +KL     E EE  E   EE  E E E+
Sbjct: 340 EERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAEL 392



 Score = 31.2 bits (71), Expect = 0.91
 Identities = 17/66 (25%), Positives = 30/66 (45%)

Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
           + EN+  +  ER    +E+ E  +EE  + +    + ++ L      +EE EE+     E
Sbjct: 313 ELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLE 372

Query: 294 EEEEEI 299
           E EE  
Sbjct: 373 ELEELF 378



 Score = 30.8 bits (70), Expect = 1.1
 Identities = 15/52 (28%), Positives = 21/52 (40%)

Query: 248 HEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
            + E  +EE EE     E   +   +L     E EE+     EE EE  E +
Sbjct: 330 EKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEAL 381



 Score = 30.5 bits (69), Expect = 1.4
 Identities = 18/50 (36%), Positives = 27/50 (54%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
            E    EEE E+ +++  +L+E+ +      EE +E  EE EEE E  EE
Sbjct: 723 RELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEE 772



 Score = 30.5 bits (69), Expect = 1.4
 Identities = 23/54 (42%), Positives = 35/54 (64%), Gaps = 6/54 (11%)

Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           E+ +E EE +EE EE +A++ +L+++ K      E EEE+EE EEE  E E E+
Sbjct: 853 ELEKELEELKEELEELEAEKEELEDELK------ELEEEKEELEEELRELESEL 900



 Score = 29.3 bits (66), Expect = 3.7
 Identities = 23/68 (33%), Positives = 36/68 (52%), Gaps = 1/68 (1%)

Query: 233 FQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEK-KKLGGGGGEEEEEEEEEEEE 291
            ++E    +    E+  + EE E + EE K +   L+E+ ++L     E EEE EE EEE
Sbjct: 693 LKNELRSLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEE 752

Query: 292 EEEEEEEI 299
            EE +E +
Sbjct: 753 LEELQERL 760



 Score = 29.3 bits (66), Expect = 3.8
 Identities = 22/52 (42%), Positives = 28/52 (53%), Gaps = 1/52 (1%)

Query: 249 EEEEEEEEEEEETKAKETKLKEK-KKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           E  EE EEE E  +    KLKE+ ++L       +EE EE EEE EE E  +
Sbjct: 758 ERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRL 809



 Score = 28.9 bits (65), Expect = 4.0
 Identities = 16/53 (30%), Positives = 21/53 (39%)

Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           E     EE E+   E +  + +L+EK        EE  E   EE  E E E  
Sbjct: 341 ERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELA 393



 Score = 28.5 bits (64), Expect = 5.4
 Identities = 26/58 (44%), Positives = 37/58 (63%), Gaps = 5/58 (8%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKE-KKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
           E+E+ E EEE EE EE+    E +L+E +K+L     E +EE EE E E+EE E+E+ 
Sbjct: 827 EQEIEELEEEIEELEEKLDELEEELEELEKELE----ELKEELEELEAEKEELEDELK 880



 Score = 27.8 bits (62), Expect = 9.2
 Identities = 19/52 (36%), Positives = 28/52 (53%), Gaps = 3/52 (5%)

Query: 251 EEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGV 302
           EEE    EEE +  + +L+E +K      E + E EE  EE EE +EE+  +
Sbjct: 245 EEELSRLEEELEELQEELEEAEK---EIEELKSELEELREELEELQEELLEL 293



 Score = 27.8 bits (62), Expect = 9.7
 Identities = 19/53 (35%), Positives = 28/53 (52%)

Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
           RE+  E  E +EE E+ + +  +L+ K +       E EEE EEE E+  E E
Sbjct: 894 RELESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEYEDTLETE 946


>gnl|CDD|216868 pfam02084, Bindin, Bindin. 
          Length = 239

 Score = 33.3 bits (76), Expect = 0.12
 Identities = 16/45 (35%), Positives = 21/45 (46%), Gaps = 7/45 (15%)

Query: 243 GEREVHE-------EEEEEEEEEEETKAKETKLKEKKKLGGGGGE 280
           G+ EV E       EEEEEEE++     ++  L       GGGG 
Sbjct: 146 GDPEVREQVLSAMQEEEEEEEQDAANGVRDNVLNNLNNAPGGGGY 190


>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen.  This
           family consists of several Theileria P67 surface
           antigens. A stage specific surface antigen of Theileria
           parva, p67, is the basis for the development of an
           anti-sporozoite vaccine for the control of East Coast
           fever (ECF) in cattle. The antigen has been shown to
           contain five distinct linear peptide sequences
           recognised by sporozoite-neutralising murine monoclonal
           antibodies.
          Length = 727

 Score = 33.9 bits (77), Expect = 0.13
 Identities = 11/52 (21%), Positives = 24/52 (46%), Gaps = 9/52 (17%)

Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
           R   E   +E E ++ T+  +            G + + EE++++ EEE+ +
Sbjct: 87  RSFQEPVSQESEVQDNTEQNQ---------DTKGSKTDSEEDDDDSEEEDNK 129


>gnl|CDD|191249 pfam05279, Asp-B-Hydro_N, Aspartyl beta-hydroxylase N-terminal
           region.  This family includes the N-terminal regions of
           the junctin, junctate and aspartyl beta-hydroxylase
           proteins. Junctate is an integral ER/SR membrane calcium
           binding protein, which comes from an alternatively
           spliced form of the same gene that generates aspartyl
           beta-hydroxylase and junctin. Aspartyl beta-hydroxylase
           catalyzes the post-translational hydroxylation of
           aspartic acid or asparagine residues contained within
           epidermal growth factor (EGF) domains of proteins.
          Length = 240

 Score = 33.0 bits (75), Expect = 0.13
 Identities = 14/65 (21%), Positives = 30/65 (46%), Gaps = 1/65 (1%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVV 303
           E E H EEE +    +  +  E ++KE+ +         ++EE+   +E + +E+   ++
Sbjct: 81  EAEPHAEEEGQLAVRKTKQKVEEEVKEQLQSLLEKIVVSKQEEDGPGKEPQLDED-KFLL 139

Query: 304 VSAVD 308
               D
Sbjct: 140 AEDSD 144



 Score = 30.3 bits (68), Expect = 1.2
 Identities = 14/55 (25%), Positives = 27/55 (49%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
             + +E+ +E  E+ E TKA+   + E+         E+ +  +EE  +E  EE+
Sbjct: 182 SEQENEDSKEPVEKAERTKAETDDVTEEDYDEEDNPVEDSKAIKEELAKEPVEEQ 236



 Score = 29.5 bits (66), Expect = 1.8
 Identities = 12/53 (22%), Positives = 23/53 (43%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
           ++E     +E + +E++    E     ++ L  G   EE E+    EE   E+
Sbjct: 120 KQEEDGPGKEPQLDEDKFLLAEDSDDRQETLEAGKVHEETEDSYHVEETASEQ 172



 Score = 29.5 bits (66), Expect = 2.1
 Identities = 18/60 (30%), Positives = 28/60 (46%), Gaps = 6/60 (10%)

Query: 244 EREVHEEEEEEEEE------EEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
           E EV E+ +   E+      EE+   KE +L E K L     ++ +E  E  +  EE E+
Sbjct: 102 EEEVKEQLQSLLEKIVVSKQEEDGPGKEPQLDEDKFLLAEDSDDRQETLEAGKVHEETED 161



 Score = 28.4 bits (63), Expect = 4.3
 Identities = 13/59 (22%), Positives = 25/59 (42%), Gaps = 5/59 (8%)

Query: 244 EREVHEEEEE-----EEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
           E+ V  ++EE     E + +E+         ++++    G   EE E+    EE   E+
Sbjct: 114 EKIVVSKQEEDGPGKEPQLDEDKFLLAEDSDDRQETLEAGKVHEETEDSYHVEETASEQ 172



 Score = 27.6 bits (61), Expect = 7.5
 Identities = 14/58 (24%), Positives = 28/58 (48%), Gaps = 6/58 (10%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEE--EEEEEEEEEEIGGVVV 304
           EEE +E+ +    K   +K +E     G G E + +E++    E+ ++ +E +    V
Sbjct: 102 EEEVKEQLQSLLEKIVVSKQEE----DGPGKEPQLDEDKFLLAEDSDDRQETLEAGKV 155



 Score = 27.6 bits (61), Expect = 7.6
 Identities = 17/76 (22%), Positives = 34/76 (44%)

Query: 224 RPDGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEE 283
           + D     + +  +D  +  E     EE E+    EET +++ K   K+K      E+ +
Sbjct: 131 QLDEDKFLLAEDSDDRQETLEAGKVHEETEDSYHVEETASEQYKQDMKEKASEQENEDSK 190

Query: 284 EEEEEEEEEEEEEEEI 299
           E  E+ E  + E +++
Sbjct: 191 EPVEKAERTKAETDDV 206


>gnl|CDD|185582 PTZ00373, PTZ00373, 60S Acidic ribosomal protein P2; Provisional.
          Length = 112

 Score = 31.8 bits (72), Expect = 0.13
 Identities = 16/47 (34%), Positives = 28/47 (59%), Gaps = 14/47 (29%)

Query: 268 LKEKKKLGGG--------------GGEEEEEEEEEEEEEEEEEEEIG 300
           +K+ + +GGG              G + E ++EE++EEEEEEE+++G
Sbjct: 61  MKKLQNIGGGVAAAAAPAAGAATAGAKAEAKKEEKKEEEEEEEDDLG 107


>gnl|CDD|218734 pfam05758, Ycf1, Ycf1.  The chloroplast genomes of most higher
           plants contain two giant open reading frames designated
           ycf1 and ycf2. Although the function of Ycf1 is unknown,
           it is known to be an essential gene.
          Length = 832

 Score = 33.8 bits (78), Expect = 0.13
 Identities = 22/75 (29%), Positives = 34/75 (45%), Gaps = 9/75 (12%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEE------EEEEEEEEEEEIGGV 302
           +E  E EE EEET  +     E K   G   E+E   EE      EE+E+ ++ E++  +
Sbjct: 227 KETSETEEREEETDVEIETTSETK---GTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKL 283

Query: 303 VVSAVDFGSGGWWFE 317
            +         +WFE
Sbjct: 284 EILKEKKDEELFWFE 298


>gnl|CDD|100111 cd05833, Ribosomal_P2, Ribosomal protein P2. This subfamily
           represents the eukaryotic large ribosomal protein P2.
           Eukaryotic P1 and P2 are functionally equivalent to the
           bacterial protein L7/L12, but are not homologous to
           L7/L12. P2 is located in the L12 stalk, with proteins
           P1, P0, L11, and 28S rRNA. P1 and P2 are the only
           proteins in the ribosome to occur as multimers, always
           appearing as sets of heterodimers. Recent data indicate
           that eukaryotes have four copies (two heterodimers),
           while most archaeal species contain six copies of L12p
           (three homodimers). Bacteria may have four or six copies
           of L7/L12 (two or three homodimers) depending on the
           species. Experiments using S. cerevisiae P1 and P2
           indicate that P1 proteins are positioned more internally
           with limited reactivity in the C-terminal domains, while
           P2 proteins seem to be more externally located and are
           more likely to interact with other cellular components.
           In lower eukaryotes, P1 and P2 are further subdivided
           into P1A, P1B, P2A, and P2B, which form P1A/P2B and
           P1B/P2A heterodimers. Some plants have a third
           P-protein, called P3, which is not homologous to P1 and
           P2. In humans, P1 and P2 are strongly autoimmunogenic.
           They play a significant role in the etiology and
           pathogenesis of systemic lupus erythema (SLE). In
           addition, the ribosome-inactivating protein
           trichosanthin (TCS) interacts with human P0, P1, and P2,
           with its primary binding site in the C-terminal region
           of P2. TCS inactivates the ribosome by depurinating a
           specific adenine in the sarcin-ricin loop of 28S rRNA.
          Length = 109

 Score = 31.9 bits (73), Expect = 0.14
 Identities = 8/26 (30%), Positives = 16/26 (61%)

Query: 275 GGGGGEEEEEEEEEEEEEEEEEEEIG 300
                   ++EE++EE EEE ++++G
Sbjct: 79  AAAAAAAAKKEEKKEESEEESDDDMG 104



 Score = 31.5 bits (72), Expect = 0.16
 Identities = 7/24 (29%), Positives = 12/24 (50%)

Query: 275 GGGGGEEEEEEEEEEEEEEEEEEE 298
                     ++EE++EE EEE +
Sbjct: 77  AAAAAAAAAAKKEEKKEESEEESD 100



 Score = 30.7 bits (70), Expect = 0.30
 Identities = 7/24 (29%), Positives = 13/24 (54%)

Query: 275 GGGGGEEEEEEEEEEEEEEEEEEE 298
                    ++EE++EE EEE ++
Sbjct: 78  AAAAAAAAAKKEEKKEESEEESDD 101



 Score = 30.7 bits (70), Expect = 0.31
 Identities = 7/24 (29%), Positives = 11/24 (45%)

Query: 275 GGGGGEEEEEEEEEEEEEEEEEEE 298
                      ++EE++EE EEE 
Sbjct: 76  AAAAAAAAAAAKKEEKKEESEEES 99



 Score = 28.4 bits (64), Expect = 1.9
 Identities = 7/24 (29%), Positives = 11/24 (45%)

Query: 275 GGGGGEEEEEEEEEEEEEEEEEEE 298
                       ++EE++EE EEE
Sbjct: 75  AAAAAAAAAAAAKKEEKKEESEEE 98


>gnl|CDD|240285 PTZ00135, PTZ00135, 60S acidic ribosomal protein P0; Provisional.
          Length = 310

 Score = 33.1 bits (76), Expect = 0.15
 Identities = 8/22 (36%), Positives = 11/22 (50%)

Query: 280 EEEEEEEEEEEEEEEEEEEIGG 301
                     EEEEEEE+++G 
Sbjct: 285 AAAAAAAAPAEEEEEEEDDMGF 306



 Score = 33.1 bits (76), Expect = 0.16
 Identities = 8/22 (36%), Positives = 9/22 (40%)

Query: 280 EEEEEEEEEEEEEEEEEEEIGG 301
                      EEEEEEE+  G
Sbjct: 284 AAAAAAAAAPAEEEEEEEDDMG 305



 Score = 30.8 bits (70), Expect = 0.87
 Identities = 8/22 (36%), Positives = 10/22 (45%)

Query: 280 EEEEEEEEEEEEEEEEEEEIGG 301
                    EEEEEEE++   G
Sbjct: 286 AAAAAAAPAEEEEEEEDDMGFG 307



 Score = 30.0 bits (68), Expect = 1.6
 Identities = 7/20 (35%), Positives = 7/20 (35%)

Query: 280 EEEEEEEEEEEEEEEEEEEI 299
                       EEEEEEE 
Sbjct: 283 AAAAAAAAAAPAEEEEEEED 302


>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
           (TAF4) is one of several TAFs that bind TBP and is
           involved in forming Transcription Factor IID (TFIID)
           complex.  The TATA Binding Protein (TBP) Associated
           Factor 4 (TAF4) is one of several TAFs that bind TBP and
           are involved in forming the Transcription Factor IID
           (TFIID) complex. TFIID is one of seven General
           Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
           TFIIF, and TFIID) that are involved in accurate
           initiation of transcription by RNA polymerase II in
           eukaryote. TFIID plays an important role in the
           recognition of promoter DNA and assembly of the
           pre-initiation complex. TFIID complex is composed of the
           TBP and at least 13 TAFs. TAFs from various species were
           originally named by their predicted molecular weight or
           their electrophoretic mobility in polyacrylamide gels. A
           new, unified nomenclature for the pol II TAFs has been
           suggested to show the relationship between TAF orthologs
           and paralogs. Several hypotheses are proposed for TAFs
           functions such as serving as activator-binding sites,
           core-promoter recognition or a role in essential
           catalytic activity. Each TAF, with the help of a
           specific activator, is required only for the expression
           of subset of genes and is not universally involved for
           transcription as are GTFs. In yeast and human cells,
           TAFs have been found as components of other complexes
           besides TFIID.   Several TAFs interact via histone-fold
           (HFD) motifs; HFD is the interaction motif involved in
           heterodimerization of the core histones and their
           assembly into nucleosome octamers. The minimal HFD
           contains three alpha-helices linked by two loops and is
           found in core histones, TAFS and many other
           transcription factors. TFIID has a histone octamer-like
           substructure. TAF4 domain interacts with TAF12 and makes
           a novel histone-like heterodimer that binds DNA and has
           a core promoter function of a subset of genes.
          Length = 212

 Score = 32.7 bits (75), Expect = 0.15
 Identities = 14/46 (30%), Positives = 27/46 (58%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
           E+ E EEEE+ + + +E  L+  K        +++ +E ++EE+EE
Sbjct: 120 EQLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEE 165



 Score = 30.4 bits (69), Expect = 1.1
 Identities = 12/46 (26%), Positives = 24/46 (52%)

Query: 251 EEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
           E+ E EEEE+   +E +   +           +++ +E ++EE+EE
Sbjct: 120 EQLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEE 165



 Score = 30.0 bits (68), Expect = 1.1
 Identities = 15/45 (33%), Positives = 23/45 (51%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEE 288
           + E  EEE+ +EEE E          E+ +L     E ++EE+EE
Sbjct: 121 QLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEE 165



 Score = 30.0 bits (68), Expect = 1.2
 Identities = 15/46 (32%), Positives = 26/46 (56%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEE 289
           E+   EEEE+ +EEE E   +  K + ++       +E ++EE+EE
Sbjct: 120 EQLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEE 165


>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
           RPA34.5.  This is a family of proteins conserved from
           yeasts to human. Subunit A34.5 of RNA polymerase I is a
           non-essential subunit which is thought to help Pol I
           overcome topological constraints imposed on ribosomal
           DNA during the process of transcription.
          Length = 193

 Score = 32.8 bits (75), Expect = 0.15
 Identities = 15/46 (32%), Positives = 27/46 (58%), Gaps = 5/46 (10%)

Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
            E+E E EEEE K K+ K + KK+      +E+++++E+  E +  
Sbjct: 143 VEKEAEVEEEEKKEKKKKKEVKKEK-----KEKKDKKEKMVEPKGS 183



 Score = 31.2 bits (71), Expect = 0.49
 Identities = 18/64 (28%), Positives = 32/64 (50%), Gaps = 12/64 (18%)

Query: 243 GEREVHEEEEEEEEE------EEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
                 E E  E+E       E E + +E K K+KKK      E ++E++E+++++E+  
Sbjct: 125 PSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKK------EVKKEKKEKKDKKEKMV 178

Query: 297 EEIG 300
           E  G
Sbjct: 179 EPKG 182



 Score = 30.8 bits (70), Expect = 0.54
 Identities = 17/49 (34%), Positives = 28/49 (57%), Gaps = 5/49 (10%)

Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
               E   E ET  KET  K +K+      E EEEE++E+++++E ++E
Sbjct: 123 GPPSELGSESETSEKETTAKVEKEA-----EVEEEEKKEKKKKKEVKKE 166


>gnl|CDD|218303 pfam04874, Mak16, Mak16 protein C-terminal region.  The precise
           function of this eukaryotic protein family is unknown.
           The yeast orthologues have been implicated in cell cycle
           progression and biogenesis of 60S ribosomal subunits.
           The Schistosoma mansoni Mak16 has been shown to target
           protein transport to the nucleolus.
          Length = 97

 Score = 31.4 bits (71), Expect = 0.15
 Identities = 23/58 (39%), Positives = 28/58 (48%), Gaps = 14/58 (24%)

Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           K  E E  EE +EEEEEEEE +                GE E   ++EE EEE E+ E
Sbjct: 53  KALEAEESEENDEEEEEEEEEED--------------EGEIEYVSDDEELEEEIEDLE 96



 Score = 31.0 bits (70), Expect = 0.24
 Identities = 17/30 (56%), Positives = 19/30 (63%)

Query: 280 EEEEEEEEEEEEEEEEEEEIGGVVVSAVDF 309
           E EE EE +EEEEEEEEEE  G +    D 
Sbjct: 56  EAEESEENDEEEEEEEEEEDEGEIEYVSDD 85



 Score = 30.6 bits (69), Expect = 0.28
 Identities = 17/29 (58%), Positives = 20/29 (68%)

Query: 280 EEEEEEEEEEEEEEEEEEEIGGVVVSAVD 308
           EE EE +EEEEEEEEEE+E     VS  +
Sbjct: 58  EESEENDEEEEEEEEEEDEGEIEYVSDDE 86



 Score = 30.2 bits (68), Expect = 0.38
 Identities = 16/32 (50%), Positives = 20/32 (62%)

Query: 281 EEEEEEEEEEEEEEEEEEIGGVVVSAVDFGSG 312
           EE EE +EEEEEEEEEE+ G +   + D    
Sbjct: 58  EESEENDEEEEEEEEEEDEGEIEYVSDDEELE 89



 Score = 28.3 bits (63), Expect = 1.9
 Identities = 17/28 (60%), Positives = 19/28 (67%)

Query: 271 KKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           KK L     EE +EEEEEEEEEE+E E 
Sbjct: 52  KKALEAEESEENDEEEEEEEEEEDEGEI 79



 Score = 27.5 bits (61), Expect = 3.1
 Identities = 13/29 (44%), Positives = 17/29 (58%)

Query: 280 EEEEEEEEEEEEEEEEEEEIGGVVVSAVD 308
            +EEEEEEEEEE+E E E +        +
Sbjct: 63  NDEEEEEEEEEEDEGEIEYVSDDEELEEE 91



 Score = 27.1 bits (60), Expect = 5.0
 Identities = 13/29 (44%), Positives = 17/29 (58%)

Query: 280 EEEEEEEEEEEEEEEEEEEIGGVVVSAVD 308
           EEEEEEEEEE+E E E       +   ++
Sbjct: 65  EEEEEEEEEEDEGEIEYVSDDEELEEEIE 93


>gnl|CDD|222792 PHA00435, PHA00435, capsid assembly protein.
          Length = 306

 Score = 33.3 bits (76), Expect = 0.16
 Identities = 16/50 (32%), Positives = 27/50 (54%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
            E  ++     +   ++ + + + ++   G EEE EE EE+EEEE EEE 
Sbjct: 50  PETSDDPYGNPDPFGEDDEGRIEVRISEDGEEEEVEEGEEDEEEEGEEES 99



 Score = 32.1 bits (73), Expect = 0.35
 Identities = 20/53 (37%), Positives = 26/53 (49%), Gaps = 7/53 (13%)

Query: 251 EEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVV 303
             E++E   E +  E   +E+        EE EE+EEEE EEE EE E  G  
Sbjct: 63  FGEDDEGRIEVRISEDGEEEE-------VEEGEEDEEEEGEEESEEFEPLGDT 108



 Score = 30.2 bits (68), Expect = 1.5
 Identities = 26/74 (35%), Positives = 29/74 (39%), Gaps = 18/74 (24%)

Query: 220 DPVNRPDGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGG 279
           +P    D Y       E+D    G  EV   E+ EEEE EE               G   
Sbjct: 49  EPETSDDPYGNPDPFGEDD---EGRIEVRISEDGEEEEVEE---------------GEED 90

Query: 280 EEEEEEEEEEEEEE 293
           EEEE EEE EE E 
Sbjct: 91  EEEEGEEESEEFEP 104



 Score = 28.6 bits (64), Expect = 4.4
 Identities = 27/69 (39%), Positives = 33/69 (47%), Gaps = 10/69 (14%)

Query: 242 GGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
           G E EV E EE+EEEE EE   +   L          G+  EE  E  E+ EE EE    
Sbjct: 79  GEEEEVEEGEEDEEEEGEEESEEFEPL----------GDTPEELTEASEQLEEHEEGFQA 128

Query: 302 VVVSAVDFG 310
           +V  AV+ G
Sbjct: 129 MVEQAVERG 137


>gnl|CDD|236081 PRK07735, PRK07735, NADH dehydrogenase subunit C; Validated.
          Length = 430

 Score = 33.0 bits (75), Expect = 0.17
 Identities = 17/54 (31%), Positives = 24/54 (44%)

Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
            +   E  EE  EEE+  AK       K       +++ E  EE  EEEEE ++
Sbjct: 109 AKQKREGTEEVTEEEKAAAKAKAAAAAKAKAAALAKQKREGTEEVTEEEEETDK 162



 Score = 31.1 bits (70), Expect = 0.89
 Identities = 20/53 (37%), Positives = 25/53 (47%)

Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
           RE  EE  EEEEE ++ KAK       K       +++  E  E  EE  EEE
Sbjct: 147 REGTEEVTEEEEETDKEKAKAKAAAAAKAKAAALAKQKAAEAGEGTEEVTEEE 199



 Score = 30.3 bits (68), Expect = 1.5
 Identities = 19/51 (37%), Positives = 24/51 (47%)

Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
           RE  EE  EEE+   + KA      +   L     E  EE  EEEEE ++E
Sbjct: 113 REGTEEVTEEEKAAAKAKAAAAAKAKAAALAKQKREGTEEVTEEEEETDKE 163



 Score = 28.8 bits (64), Expect = 4.4
 Identities = 16/47 (34%), Positives = 21/47 (44%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
            E  EE  EEE+ KAK       K       +++ E  EE  EEE+ 
Sbjct: 79  REGTEEVTEEEKAKAKAKAAAAAKAKAAALAKQKREGTEEVTEEEKA 125



 Score = 28.4 bits (63), Expect = 6.1
 Identities = 16/56 (28%), Positives = 24/56 (42%), Gaps = 6/56 (10%)

Query: 245 REVHEEEEEEEEEE------EETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
           RE  EE  EEE+ +         KAK   L ++K+ G     EEE+   + +    
Sbjct: 79  REGTEEVTEEEKAKAKAKAAAAAKAKAAALAKQKREGTEEVTEEEKAAAKAKAAAA 134



 Score = 28.4 bits (63), Expect = 6.2
 Identities = 13/48 (27%), Positives = 26/48 (54%)

Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
           + E++ E+ ++  A+  K + +K+L    G E  + EEE  E+E+   
Sbjct: 2   DPEKDLEDLKKEAARRAKEEARKRLVAKHGAEISKLEEENREKEKALP 49


>gnl|CDD|129416 TIGR00316, cdhC, CO dehydrogenase/CO-methylating acetyl-CoA
           synthase complex, beta subunit.  Nomenclature follows
           the description for Methanosarcina thermophila. The
           CO-methylating acetyl-CoA synthase is considered the
           defining enzyme of the Wood-Ljungdahl pathway, used for
           acetate catabolism by sulfate reducing bacteria but for
           acetate biosynthesis by acetogenic bacteria such as
           oorella thermoacetica (f. Clostridium thermoaceticum)
           [Energy metabolism, Chemoautotrophy].
          Length = 458

 Score = 33.3 bits (76), Expect = 0.17
 Identities = 18/48 (37%), Positives = 23/48 (47%), Gaps = 2/48 (4%)

Query: 255 EEEEEETKAKETKLKEKKK--LGGGGGEEEEEEEEEEEEEEEEEEEIG 300
           EE+ + T      LKEK    +     E +EEE EEEEE  + EE   
Sbjct: 375 EEDAKTTDELRKFLKEKGHPVVKRVVREVDEEEIEEEEEAMQPEEMEM 422


>gnl|CDD|130217 TIGR01147, V_ATP_synt_G, vacuolar ATP synthase, subunit G.  This
           model describes the vacuolar ATP synthase G subunit in
           eukaryotes and includes members from diverse groups
           e.g., fungi, plants, parasites etc. V-ATPases are
           multi-subunit enzymes composed of two functional
           domains: A transmembrane Vo domain and a peripheral
           catalytic domain V1. The G subunit is one of the
           subunits of the catalytic domain. V-ATPases are
           responsible for the acidification of endosomes and
           lysosomes, which are part of the central vacuolar system
           [Energy metabolism, ATP-proton motive force
           interconversion].
          Length = 113

 Score = 31.3 bits (71), Expect = 0.18
 Identities = 18/50 (36%), Positives = 29/50 (58%)

Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
           ++  EE ++E E+ ++ + KE K  E K LGG G  EE+ E E + +  E
Sbjct: 34  KQAKEEAQKEVEKYKQQREKEFKEFEAKHLGGNGAAEEKAEAETQAKIRE 83


>gnl|CDD|184536 PRK14145, PRK14145, heat shock protein GrpE; Provisional.
          Length = 196

 Score = 32.6 bits (74), Expect = 0.18
 Identities = 13/51 (25%), Positives = 28/51 (54%), Gaps = 2/51 (3%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           EE E+E  +EEE       L   +++ G   +E+ ++ + +++  +E EE+
Sbjct: 2   EEVEKEINKEEEKDVNN--LSSNEQMEGPPEDEQAQQNQPQQQTVDEIEEL 50


>gnl|CDD|111875 pfam03032, Brevenin, Brevenin/esculentin/gaegurin/rugosin family.
           This family contains a number of defence peptides
           secreted from the skin of amphibians, including the
           opiate-like dermorphins and deltorphins, and the
           antimicrobial dermoseptins and temporins. The alignment
           for this family includes the signal peptide.
          Length = 46

 Score = 29.6 bits (67), Expect = 0.19
 Identities = 13/19 (68%), Positives = 16/19 (84%)

Query: 280 EEEEEEEEEEEEEEEEEEE 298
           EEE+ E+EEE E+EEE EE
Sbjct: 22  EEEKREDEEENEDEEEGEE 40



 Score = 29.2 bits (66), Expect = 0.32
 Identities = 11/19 (57%), Positives = 15/19 (78%)

Query: 280 EEEEEEEEEEEEEEEEEEE 298
           E+EEE E+EEE EE+ E +
Sbjct: 27  EDEEENEDEEEGEEQSEVK 45



 Score = 28.5 bits (64), Expect = 0.50
 Identities = 11/19 (57%), Positives = 14/19 (73%)

Query: 280 EEEEEEEEEEEEEEEEEEE 298
            E+EEE E+EEE EE+ E 
Sbjct: 26  REDEEENEDEEEGEEQSEV 44



 Score = 28.5 bits (64), Expect = 0.53
 Identities = 12/20 (60%), Positives = 15/20 (75%)

Query: 281 EEEEEEEEEEEEEEEEEEIG 300
           EEE+ E+EEE E+EEE E  
Sbjct: 22  EEEKREDEEENEDEEEGEEQ 41



 Score = 28.5 bits (64), Expect = 0.56
 Identities = 12/19 (63%), Positives = 16/19 (84%)

Query: 280 EEEEEEEEEEEEEEEEEEE 298
           EE+ E+EEE E+EEE EE+
Sbjct: 23  EEKREDEEENEDEEEGEEQ 41



 Score = 26.9 bits (60), Expect = 1.6
 Identities = 11/19 (57%), Positives = 15/19 (78%)

Query: 280 EEEEEEEEEEEEEEEEEEE 298
           E+ E+EEE E+EEE EE+ 
Sbjct: 24  EKREDEEENEDEEEGEEQS 42



 Score = 26.5 bits (59), Expect = 2.9
 Identities = 9/24 (37%), Positives = 16/24 (66%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKK 272
           EE+ E+EEE E+ +  E + + K+
Sbjct: 23  EEKREDEEENEDEEEGEEQSEVKR 46


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 33.1 bits (76), Expect = 0.20
 Identities = 27/53 (50%), Positives = 35/53 (66%), Gaps = 2/53 (3%)

Query: 249 EEEEEEEEEEEETKAKETKLKE-KKKLGGGGGEEEEEEEEEEEEEEEE-EEEI 299
           EE EE  E  EE KAK+ +L+  KK+L G   E+ E+E EE E+ +EE EEEI
Sbjct: 355 EELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEI 407



 Score = 32.7 bits (75), Expect = 0.24
 Identities = 25/57 (43%), Positives = 33/57 (57%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
           E+EV E EE +EE EE  K  E+    K+KL     E EE  EE ++E EE EE++ 
Sbjct: 227 EKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVK 283



 Score = 32.0 bits (73), Expect = 0.51
 Identities = 23/56 (41%), Positives = 33/56 (58%), Gaps = 1/56 (1%)

Query: 245 REVHEEEEEEEEEEEETKAKETKL-KEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           RE  E+ E+E +E EE K +  +L KE + L G   + EE+  E EE  EE ++EI
Sbjct: 220 REELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEI 275



 Score = 31.6 bits (72), Expect = 0.65
 Identities = 19/58 (32%), Positives = 30/58 (51%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
            RE++E   E  E  EE +  E ++KE ++L     E E+E E  E  + + EE+I  
Sbjct: 206 LREINEISSELPELREELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRE 263



 Score = 31.2 bits (71), Expect = 0.89
 Identities = 21/58 (36%), Positives = 30/58 (51%)

Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGV 302
            E+ +E EE EE+ +E K  + K +E  KL     E  +E  E E+     EEEI G+
Sbjct: 269 EELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGI 326



 Score = 30.4 bits (69), Expect = 1.6
 Identities = 22/67 (32%), Positives = 29/67 (43%), Gaps = 11/67 (16%)

Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKK-----------LGGGGGEEEEEEEEEE 289
           K  E+E+  EE+E ++ EEE      +L E +K           L     EEE EE  EE
Sbjct: 608 KDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEEYEELREE 667

Query: 290 EEEEEEE 296
             E   E
Sbjct: 668 YLELSRE 674



 Score = 28.9 bits (65), Expect = 4.3
 Identities = 20/75 (26%), Positives = 36/75 (48%), Gaps = 17/75 (22%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKE-----------------KKKLGGGGGEEEEEEE 286
           E+E+ E EE+E +  +E +  E  LK+                 ++KL     EE E++ 
Sbjct: 465 EKELKEIEEKERKLRKELRELEKVLKKESELIKLKELAEQLKELEEKLKKYNLEELEKKA 524

Query: 287 EEEEEEEEEEEEIGG 301
           EE E+ +E+  ++ G
Sbjct: 525 EEYEKLKEKLIKLKG 539



 Score = 28.9 bits (65), Expect = 4.6
 Identities = 18/59 (30%), Positives = 31/59 (52%), Gaps = 6/59 (10%)

Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEE-EEEEEEEEEIGGVV 303
           E+ E E+E E  E   +  E K++E ++       EE ++E EE EE+ +E +E+    
Sbjct: 239 EIEELEKELESLEGSKRKLEEKIRELEE-----RIEELKKEIEELEEKVKELKELKEKA 292



 Score = 28.1 bits (63), Expect = 8.0
 Identities = 16/59 (27%), Positives = 27/59 (45%)

Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           K  ++E+ + EE +++  E  K  +   +E  +L     E   E  EE EE  +E E  
Sbjct: 542 KSLKKELEKLEELKKKLAELEKKLDELEEELAELLKELEELGFESVEELEERLKELEPF 600


>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
          Length = 158

 Score = 32.0 bits (73), Expect = 0.20
 Identities = 13/48 (27%), Positives = 29/48 (60%), Gaps = 9/48 (18%)

Query: 252 EEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
            EE E+E++  +KET+ KE           + ++E++E++E++  E++
Sbjct: 92  PEESEKEQKEVSKETEEKEA---------IKAKKEKKEKKEKKVAEKL 130



 Score = 29.0 bits (65), Expect = 2.1
 Identities = 11/42 (26%), Positives = 25/42 (59%), Gaps = 6/42 (14%)

Query: 247 VHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEE 288
           +  EE E+E++E   + +E +  + KK      E++E++E++
Sbjct: 90  LFPEESEKEQKEVSKETEEKEAIKAKK------EKKEKKEKK 125



 Score = 27.8 bits (62), Expect = 5.5
 Identities = 15/49 (30%), Positives = 26/49 (53%), Gaps = 7/49 (14%)

Query: 225 PDGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKK 273
            +GY+  +F  E         +  +E  +E EE+E  KAK+ K ++K+K
Sbjct: 83  AEGYSINLFPEE-------SEKEQKEVSKETEEKEAIKAKKEKKEKKEK 124



 Score = 27.0 bits (60), Expect = 9.6
 Identities = 10/31 (32%), Positives = 19/31 (61%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKL 274
             E  +E++E  +E EE +A + K ++K+K 
Sbjct: 92  PEESEKEQKEVSKETEEKEAIKAKKEKKEKK 122


>gnl|CDD|234055 TIGR02907, spore_VI_D, stage VI sporulation protein D.  SpoVID, the
           stage VI sporulation protein D, is restricted to
           endospore-forming members of the bacteria, all of which
           are found among the Firmicutes. It is widely distributed
           but not quite universal in this group. Between
           well-conserved N-terminal and C-terminal domains is a
           poorly conserved, low-complexity region of variable
           length, rich enough in glutamic acid to cause spurious
           BLAST search results unless a filter is used. The seed
           alignment for this model was trimmed, in effect, by
           choosing member sequences in which these regions are
           relatively short. SpoVID is involved in spore coat
           assembly by the mother cell compartment late in the
           process of sporulation [Cellular processes, Sporulation
           and germination].
          Length = 338

 Score = 32.9 bits (75), Expect = 0.20
 Identities = 21/66 (31%), Positives = 33/66 (50%), Gaps = 5/66 (7%)

Query: 233 FQSENDYCKGGEREVHEEEEEE-----EEEEEETKAKETKLKEKKKLGGGGGEEEEEEEE 287
           FQ E++     E E +EE E+E     E+EE+    +    +++  L G   +  EEEEE
Sbjct: 208 FQLESEVEASPEEENYEEYEDETELEVEDEEKALDEQTEDPQQEDALAGDAKKALEEEEE 267

Query: 288 EEEEEE 293
           + E  E
Sbjct: 268 KGERPE 273


>gnl|CDD|218328 pfam04921, XAP5, XAP5, circadian clock regulator.  This protein is
           found in a wide range of eukaryotes. It is a nuclear
           protein and is suggested to be DNA binding. In plants,
           this family is essential for correct circadian clock
           functioning by acting as a light-quality regulator
           coordinating the activities of blue and red light
           signalling pathways during plant growth - inhibiting
           growth in red light but promoting growth in blue light.
          Length = 233

 Score = 32.3 bits (74), Expect = 0.21
 Identities = 23/79 (29%), Positives = 34/79 (43%), Gaps = 28/79 (35%)

Query: 249 EEEEEEEEEEEETKAKETK----------LKEKKKLGGG-----------GGEEEEEEEE 287
           +++EEE+E+E E + K  K             KKK+G               EE+E E  
Sbjct: 15  DDDEEEDEDEGEDEKKVPKESSEPDEANVNPNKKKIGKNPSVDTSFLPDKAREEKEAELR 74

Query: 288 EE-------EEEEEEEEEI 299
           EE       ++E  +EEEI
Sbjct: 75  EELREEFLKKQEAVKEEEI 93



 Score = 31.6 bits (72), Expect = 0.40
 Identities = 13/32 (40%), Positives = 23/32 (71%)

Query: 267 KLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           K K+KKK     G+++EEE+E+E E+E++  +
Sbjct: 2   KKKKKKKSKLSFGDDDEEEDEDEGEDEKKVPK 33



 Score = 31.6 bits (72), Expect = 0.41
 Identities = 15/35 (42%), Positives = 24/35 (68%)

Query: 264 KETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           K+ K K+K KL  G  +EEE+E+E E+E++  +E 
Sbjct: 1   KKKKKKKKSKLSFGDDDEEEDEDEGEDEKKVPKES 35


>gnl|CDD|214818 smart00784, SPT2, SPT2 chromatin protein.  This entry includes the
           Saccharomyces cerevisiae protein SPT2 which is a
           chromatin protein involved in transcriptional
           regulation.
          Length = 106

 Score = 31.2 bits (71), Expect = 0.21
 Identities = 15/72 (20%), Positives = 25/72 (34%), Gaps = 15/72 (20%)

Query: 227 GYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEE 286
              +  +   +D     + E    + +EEE      A+                E+ EEE
Sbjct: 46  NKGRKRYAYRDDDDDDDDMEAGGADIQEEERRSARLARL---------------EDREEE 90

Query: 287 EEEEEEEEEEEE 298
             E+EEE E+  
Sbjct: 91  RLEKEEEREKRA 102



 Score = 27.7 bits (62), Expect = 3.6
 Identities = 13/38 (34%), Positives = 19/38 (50%)

Query: 230 KTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETK 267
             I + E    +    E  EEE  E+EEE E +A++ K
Sbjct: 69  ADIQEEERRSARLARLEDREEERLEKEEEREKRARKRK 106



 Score = 26.6 bits (59), Expect = 8.2
 Identities = 13/48 (27%), Positives = 22/48 (45%), Gaps = 9/48 (18%)

Query: 235 SENDYCKGGEREVHEEE---------EEEEEEEEETKAKETKLKEKKK 273
            ++D  + G  ++ EEE         E+ EEE  E + +  K   K+K
Sbjct: 59  DDDDDMEAGGADIQEEERRSARLARLEDREEERLEKEEEREKRARKRK 106


>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
           [Transcription / DNA replication, recombination, and
           repair / Chromatin structure and dynamics].
          Length = 1001

 Score = 33.1 bits (75), Expect = 0.22
 Identities = 15/56 (26%), Positives = 27/56 (48%)

Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           G  +  +E EEE  E E +   E+   ++ +      E+  E+E E +  +EE+ E
Sbjct: 926 GSDDESDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEEDGE 981



 Score = 31.5 bits (71), Expect = 0.67
 Identities = 16/55 (29%), Positives = 27/55 (49%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           E EV E E   ++E +ET   E   +  + L     E +  +EE+ E+ +E E +
Sbjct: 935 EEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEEDGEDWDELESK 989



 Score = 30.8 bits (69), Expect = 1.1
 Identities = 17/59 (28%), Positives = 28/59 (47%), Gaps = 2/59 (3%)

Query: 242 GGEREVHEEEEE--EEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           G + E  E EEE  E E   + ++ ET   E+         E+E E +  +EE+ E+ +
Sbjct: 926 GSDDESDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEEDGEDWD 984



 Score = 30.4 bits (68), Expect = 1.7
 Identities = 15/52 (28%), Positives = 24/52 (46%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
           +E +E EEE  E +A      ++        E  E+  E+E E +  +EE G
Sbjct: 929 DESDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEEDG 980


>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740).  This
           family consists of several uncharacterized plant
           proteins of unknown function.
          Length = 565

 Score = 32.7 bits (74), Expect = 0.23
 Identities = 21/46 (45%), Positives = 26/46 (56%), Gaps = 4/46 (8%)

Query: 247 VHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEE 292
           V EEEEE E EE+E    E   KE  K+     EEEEEEE +  ++
Sbjct: 139 VLEEEEEVEMEEDE----EYYEKEPGKVVDEKSEEEEEEELKTMKD 180



 Score = 32.3 bits (73), Expect = 0.35
 Identities = 21/51 (41%), Positives = 27/51 (52%), Gaps = 10/51 (19%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           EEEE E EE+EE   KE           G   +E+ EEEEEEE +  ++ I
Sbjct: 142 EEEEVEMEEDEEYYEKE----------PGKVVDEKSEEEEEEELKTMKDFI 182



 Score = 30.8 bits (69), Expect = 1.2
 Identities = 15/24 (62%), Positives = 17/24 (70%)

Query: 280 EEEEEEEEEEEEEEEEEEEIGGVV 303
            EEEEE E EE+EE  E+E G VV
Sbjct: 140 LEEEEEVEMEEDEEYYEKEPGKVV 163



 Score = 30.4 bits (68), Expect = 1.3
 Identities = 15/50 (30%), Positives = 23/50 (46%), Gaps = 7/50 (14%)

Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKE-------TKLKEKKKLGG 276
           + + +Y +    +V +E+ EEEEEEE    K+       TK    K  G 
Sbjct: 149 EEDEEYYEKEPGKVVDEKSEEEEEEELKTMKDFIDLESQTKKPSVKDNGK 198



 Score = 29.6 bits (66), Expect = 2.5
 Identities = 14/24 (58%), Positives = 16/24 (66%)

Query: 281 EEEEEEEEEEEEEEEEEEIGGVVV 304
            EEEEE E EE+EE  E+  G VV
Sbjct: 140 LEEEEEVEMEEDEEYYEKEPGKVV 163


>gnl|CDD|222117 pfam13422, DUF4110, Domain of unknown function (DUF4110).  This is
           a family that is found predominantly at the C-terminus
           of Kelch-containing proteins. However, the exact
           function of this region is not known.
          Length = 99

 Score = 30.8 bits (70), Expect = 0.24
 Identities = 22/63 (34%), Positives = 26/63 (41%), Gaps = 4/63 (6%)

Query: 255 EEEEEETKAKETKLKEKKKLGGGGGEEE----EEEEEEEEEEEEEEEEIGGVVVSAVDFG 310
           E   E  +  E   KE KK      EE      EE    E+EEEE+  I  VV  A +  
Sbjct: 31  EIAMESLRDAELSGKELKKDAFDLAEERYWELREELRALEDEEEEQAGIEEVVSLAKEKD 90

Query: 311 SGG 313
            GG
Sbjct: 91  WGG 93



 Score = 28.9 bits (65), Expect = 1.3
 Identities = 13/34 (38%), Positives = 17/34 (50%)

Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGG 278
           RE     E+EEEE+   +   +  KEK   GG G
Sbjct: 63  REELRALEDEEEEQAGIEEVVSLAKEKDWGGGEG 96


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 32.6 bits (75), Expect = 0.24
 Identities = 20/64 (31%), Positives = 29/64 (45%), Gaps = 8/64 (12%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEE--------EEEEEEE 295
           ERE+ E   E +  E     +E  L  K +      E  E++E+E        +E+EEE 
Sbjct: 68  ERELKERRNELQRLERRLLQREETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEEL 127

Query: 296 EEEI 299
           EE I
Sbjct: 128 EELI 131


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 32.8 bits (75), Expect = 0.25
 Identities = 24/84 (28%), Positives = 39/84 (46%), Gaps = 9/84 (10%)

Query: 224 RPDGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGE--- 280
             +   + + + E +  +  ERE+ E EEE ++ EE+    E+K     +L G G +   
Sbjct: 404 ALEEIQEELEELEKELEEL-ERELEELEEEIKKLEEQINQLESKELMIAELAGAGEKCPV 462

Query: 281 -----EEEEEEEEEEEEEEEEEEI 299
                 EE E+E  E  E E EE+
Sbjct: 463 CGQELPEEHEKELLELYELELEEL 486



 Score = 32.8 bits (75), Expect = 0.30
 Identities = 21/51 (41%), Positives = 28/51 (54%)

Query: 248 HEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
            +  E  E+ E+  +A E +LKE KKL     E+EEEE E+E E  EE   
Sbjct: 192 GQLSELLEDIEDLLEALEEELKELKKLEEIQEEQEEEELEQEIEALEERLA 242



 Score = 31.3 bits (71), Expect = 0.89
 Identities = 18/56 (32%), Positives = 28/56 (50%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           E  + EE EE+ E+ E    +  +LKEK +L     E  + E+  +E +E  EE  
Sbjct: 524 EEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEELR 579



 Score = 29.0 bits (65), Expect = 3.8
 Identities = 17/65 (26%), Positives = 28/65 (43%)

Query: 235 SENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
            E    +  E+    E   EE EE + K +  +LKE+ +      +E +E  EE      
Sbjct: 524 EEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRLLRT 583

Query: 295 EEEEI 299
            +EE+
Sbjct: 584 RKEEL 588



 Score = 29.0 bits (65), Expect = 4.4
 Identities = 20/52 (38%), Positives = 27/52 (51%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
             E+ E E E EE EEE +++  KL  + +L        EE EE+ EE E E
Sbjct: 618 SLELSEAENELEEAEEELESELEKLNLQAELEELLQAALEELEEKVEELEAE 669



 Score = 28.6 bits (64), Expect = 5.0
 Identities = 17/54 (31%), Positives = 26/54 (48%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
             E  E+  E  EEE +   K  +++E+++      E E  EE   E EEE+E 
Sbjct: 197 LLEDIEDLLEALEEELKELKKLEEIQEEQEEEELEQEIEALEERLAELEEEKER 250



 Score = 28.6 bits (64), Expect = 5.3
 Identities = 21/56 (37%), Positives = 29/56 (51%), Gaps = 4/56 (7%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           ERE+ E EEE E      +  E  L++ K L     E  E+ EE+ E+ E E EE+
Sbjct: 297 EREIEELEEELEGLRALLEELEELLEKLKSL----EERLEKLEEKLEKLESELEEL 348



 Score = 28.6 bits (64), Expect = 6.0
 Identities = 17/55 (30%), Positives = 28/55 (50%)

Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           RE+    EE EE+ E  +  E +++E ++   G     EE EE  E+ +  EE +
Sbjct: 277 RELERLLEELEEKIERLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERL 331



 Score = 28.2 bits (63), Expect = 7.3
 Identities = 18/56 (32%), Positives = 28/56 (50%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           E      E  EEE +E  K +E + +++++      E  EE   E EEE+E  EE+
Sbjct: 199 EDIEDLLEALEEELKELKKLEEIQEEQEEEELEQEIEALEERLAELEEEKERLEEL 254



 Score = 27.8 bits (62), Expect = 9.2
 Identities = 19/56 (33%), Positives = 29/56 (51%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           E+E+ +  E  ++ EE  +  + +L E         EE EE E+E EE E E EE+
Sbjct: 374 EKELEKALERLKQLEEAIQELKEELAELSAALEEIQEELEELEKELEELERELEEL 429


>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987).  A
           family of uncharacterized proteins found by clustering
           human gut metagenomic sequences.
          Length = 379

 Score = 32.2 bits (74), Expect = 0.28
 Identities = 23/60 (38%), Positives = 26/60 (43%), Gaps = 8/60 (13%)

Query: 244 EREVHEEEEEEEEE--------EEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
           E E+ EE EEE +E        E E K  E K K+  K G       EE  E E EE E 
Sbjct: 67  EEELREEYEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEELLELEAEEPEP 126


>gnl|CDD|218544 pfam05296, TAS2R, Mammalian taste receptor protein (TAS2R).  This
           family consists of several forms of mammalian taste
           receptor proteins (TAS2Rs). TAS2Rs are G protein-coupled
           receptors expressed in subsets of taste receptor cells
           of the tongue and palate epithelia and are organised in
           the genome in clusters. The proteins are genetically
           linked to loci that influence bitter perception in mice
           and humans.
          Length = 303

 Score = 32.2 bits (74), Expect = 0.29
 Identities = 22/98 (22%), Positives = 43/98 (43%), Gaps = 23/98 (23%)

Query: 60  MVSIIVPIFFGIII---ILGFFGNALVVVVVA---FNQQMRSTTNLLIINLAIADLL--- 110
           M+S +  IF  + +   +LG  GN  +V+V       ++  S+ +L++ +LAI+ +    
Sbjct: 1   MLSALEGIFLIVAVVEFLLGILGNGFIVLVNCIDWVKKRKLSSIDLILTSLAISRICLQW 60

Query: 111 ------FIVCCVPFT-----ATDYVTTCWPFGD---VW 134
                 FI+   P T         ++  W F +   +W
Sbjct: 61  LILLDNFIIVFFPNTYLSGKLLRIISLFWTFLNHLSLW 98


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
           the C subunit of DNA polymerase delta. It carries the
           essential residues for binding to the Pol1 subunit of
           polymerase alpha, from residues 293-332, which are
           characterized by the motif D--G--VT, referred to as the
           DPIM motif. The first 160 residues of the protein form
           the minimal domain for binding to the B subunit, Cdc1,
           of polymerase delta, the final 10 C-terminal residues,
           362-372, being the DNA sliding clamp, PCNA, binding
           motif.
          Length = 427

 Score = 32.5 bits (74), Expect = 0.30
 Identities = 17/56 (30%), Positives = 30/56 (53%), Gaps = 5/56 (8%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEE-----EEEEEEEEEEI 299
            EE+E+E+ +   K  E + ++++         EEEE EE       ++EEE+EE+
Sbjct: 287 TEEKEKEKRKRLKKMMEDEDEDEEMEIVPESPVEEEESEEPEPPPLPKKEEEKEEV 342



 Score = 32.1 bits (73), Expect = 0.41
 Identities = 14/69 (20%), Positives = 32/69 (46%), Gaps = 4/69 (5%)

Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEE--- 290
           + E++   G    + E+E  E    +E + ++      ++       EE+E+E+ +    
Sbjct: 241 KEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKRKRLKK 300

Query: 291 -EEEEEEEE 298
             E+E+E+E
Sbjct: 301 MMEDEDEDE 309



 Score = 31.7 bits (72), Expect = 0.55
 Identities = 18/65 (27%), Positives = 31/65 (47%), Gaps = 14/65 (21%)

Query: 244 EREVHEEEEEEEEEEEETKAKE-------TKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
           E       +E+E+E+E   + E       T+ KEK+K        +  ++  E+E+E+EE
Sbjct: 258 ESAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEK-------RKRLKKMMEDEDEDEE 310

Query: 297 EEIGG 301
            EI  
Sbjct: 311 MEIVP 315



 Score = 31.0 bits (70), Expect = 0.94
 Identities = 14/56 (25%), Positives = 25/56 (44%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           E E    + +   E+E  +       E +      GE  + EEE EE+E+E+ + +
Sbjct: 243 ESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKRKRL 298



 Score = 30.6 bits (69), Expect = 1.3
 Identities = 20/52 (38%), Positives = 26/52 (50%), Gaps = 13/52 (25%)

Query: 254 EEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEE-----EEEEEEEIG 300
           EEE EE+ K K  +LK+          E+E+E+EE E       EEEE E  
Sbjct: 284 EEETEEKEKEKRKRLKKM--------MEDEDEDEEMEIVPESPVEEEESEEP 327



 Score = 28.3 bits (63), Expect = 6.1
 Identities = 20/76 (26%), Positives = 28/76 (36%), Gaps = 19/76 (25%)

Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGE--------------EEE-----EEE 286
           E   EEEE EE E     K+ + KE+  +   GG               +EE     ++ 
Sbjct: 316 ESPVEEEESEEPEPPPLPKKEEEKEEVTVSPDGGRRRGRRRVMKKKTFKDEEGYLVTKKV 375

Query: 287 EEEEEEEEEEEEIGGV 302
            E E   E+E E    
Sbjct: 376 YEWESFSEDEAEPPPT 391


>gnl|CDD|100108 cd04411, Ribosomal_P1_P2_L12p, Ribosomal protein P1, P2, and L12p.
           Ribosomal proteins P1 and P2 are the eukaryotic proteins
           that are functionally equivalent to bacterial L7/L12.
           L12p is the archaeal homolog. Unlike other ribosomal
           proteins, the archaeal L12p and eukaryotic P1 and P2 do
           not share sequence similarity with their bacterial
           counterparts. They are part of the ribosomal stalk
           (called the L7/L12 stalk in bacteria), along with 28S
           rRNA and the proteins L11 and P0 in eukaryotes (23S
           rRNA, L11, and L10e in archaea). In bacterial ribosomes,
           L7/L12 homodimers bind the extended C-terminal helix of
           L10 to anchor the L7/L12 molecules to the ribosome.
           Eukaryotic P1/P2 heterodimers and archaeal L12p
           homodimers are believed to bind the L10 equivalent
           proteins, eukaryotic P0 and archaeal L10e, in a similar
           fashion. P1 and P2 (L12p, L7/L12) are the only proteins
           in the ribosome to occur as multimers, always appearing
           as sets of dimers. Recent data indicate that most
           archaeal species contain six copies of L12p (three
           homodimers), while eukaryotes have two copies each of P1
           and P2 (two heterodimers). Bacteria may have four or six
           copies (two or three homodimers), depending on the
           species. As in bacteria, the stalk is crucial for
           binding of initiation, elongation, and release factors
           in eukaryotes and archaea.
          Length = 105

 Score = 30.7 bits (69), Expect = 0.30
 Identities = 12/24 (50%), Positives = 16/24 (66%)

Query: 277 GGGEEEEEEEEEEEEEEEEEEEIG 300
                E+ EE +EEEEEEE+E+ G
Sbjct: 78  TAEPAEKAEEAKEEEEEEEDEDFG 101



 Score = 30.7 bits (69), Expect = 0.35
 Identities = 12/26 (46%), Positives = 16/26 (61%)

Query: 275 GGGGGEEEEEEEEEEEEEEEEEEEIG 300
                E+ EE +EEEEEEE+E+   G
Sbjct: 78  TAEPAEKAEEAKEEEEEEEDEDFGFG 103



 Score = 30.3 bits (68), Expect = 0.38
 Identities = 13/25 (52%), Positives = 16/25 (64%)

Query: 278 GGEEEEEEEEEEEEEEEEEEEIGGV 302
             E  E+ EE +EEEEEEE+E  G 
Sbjct: 78  TAEPAEKAEEAKEEEEEEEDEDFGF 102



 Score = 28.8 bits (64), Expect = 1.6
 Identities = 11/24 (45%), Positives = 14/24 (58%)

Query: 275 GGGGGEEEEEEEEEEEEEEEEEEE 298
                 E  E+ EE +EEEEEEE+
Sbjct: 74  TAAATAEPAEKAEEAKEEEEEEED 97


>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family.  This family of proteins includes
           MND1 from S. cerevisiae. The mnd1 protein forms a
           complex with hop2 to promote homologous chromosome
           pairing and meiotic double-strand break repair.
          Length = 188

 Score = 31.4 bits (72), Expect = 0.33
 Identities = 18/55 (32%), Positives = 29/55 (52%), Gaps = 3/55 (5%)

Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           +   E+ ++E EE ++  A+     EK K    G EE EE  E  EE ++ E+E+
Sbjct: 68  KTRLEKLKKELEELKQRIAELQAQIEKLK---KGREETEERTELLEELKQLEKEL 119


>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6.  In yeast, 15 Apg proteins
           coordinate the formation of autophagosomes. Autophagy is
           a bulk degradation process induced by starvation in
           eukaryotic cells. Apg6/Vps30p has two distinct functions
           in the autophagic process, either associated with the
           membrane or in a retrieval step of the carboxypeptidase
           Y sorting pathway.
          Length = 356

 Score = 32.2 bits (73), Expect = 0.36
 Identities = 16/45 (35%), Positives = 26/45 (57%), Gaps = 1/45 (2%)

Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
           +E ++EEE    + +E + KE   L G   E +EE+E+ E EE +
Sbjct: 73  DELKKEEERLLDELEELE-KEDDDLDGELVELQEEKEQLENEELQ 116



 Score = 27.9 bits (62), Expect = 7.9
 Identities = 17/56 (30%), Positives = 30/56 (53%), Gaps = 7/56 (12%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVV 304
            + E +  E    +A +++L E KK       EEE   +E EE E+E++++ G +V
Sbjct: 53  SKLESQNVEISNYEALDSELDELKK-------EEERLLDELEELEKEDDDLDGELV 101


>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207).  This
           family is found in eukaryotes; it has several conserved
           tryptophan residues. The function is not known.
          Length = 261

 Score = 31.6 bits (72), Expect = 0.37
 Identities = 19/64 (29%), Positives = 34/64 (53%), Gaps = 11/64 (17%)

Query: 235 SENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
           S        ER V +EE ++  +E E K    KLK+++       ++ EEE  ++ ++++
Sbjct: 163 SSLSGSAKPERNVSQEEAKKRLQEWELK----KLKQQQ-------QKREEERRKQRKKQQ 211

Query: 295 EEEE 298
           EEEE
Sbjct: 212 EEEE 215



 Score = 30.9 bits (70), Expect = 0.71
 Identities = 17/54 (31%), Positives = 29/54 (53%), Gaps = 2/54 (3%)

Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           EV  E + +E  E    AK+   + +KKL     E++++E E+E EE E  + +
Sbjct: 78  EVKLERQAQEAYENWLSAKQ--AQRQKKLQKLLEEKQKQEREKEREEAELRQRL 129



 Score = 28.5 bits (64), Expect = 3.7
 Identities = 7/30 (23%), Positives = 20/30 (66%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKK 273
           E +  ++++++ EEE   + K+ + +E++K
Sbjct: 188 ELKKLKQQQQKREEERRKQRKKQQEEEERK 217


>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family.  This family of
           proteins contain a band 4.1 domain (pfam00373), at their
           amino terminus. This family represents the rest of these
           proteins.
          Length = 244

 Score = 31.6 bits (72), Expect = 0.39
 Identities = 20/60 (33%), Positives = 29/60 (48%), Gaps = 4/60 (6%)

Query: 244 EREVHEEEEE---EEEEEEETKAKETKLKE-KKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           E    E EE+   EEEE +  + K  +L+E  ++L       EEE E  E E +E   E+
Sbjct: 32  EETALELEEKLKQEEEEAQLLEKKADELEEENRRLEEEAAASEEERERLEAEVDEATAEV 91



 Score = 27.8 bits (62), Expect = 7.6
 Identities = 20/49 (40%), Positives = 23/49 (46%), Gaps = 1/49 (2%)

Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           EE EE   E E K K+ +  E + L     E EEE    EEE    EEE
Sbjct: 29  EEYEETALELEEKLKQEEE-EAQLLEKKADELEEENRRLEEEAAASEEE 76



 Score = 27.4 bits (61), Expect = 8.7
 Identities = 15/48 (31%), Positives = 24/48 (50%), Gaps = 7/48 (14%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEE 291
             EV + EEE E++E ET+  + +L+E +       E  E   +E  E
Sbjct: 88  TAEVAKLEEEREKKEAETRQLQQELREAQ-------EAHERARQELLE 128


>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
           subunit G; Reviewed.
          Length = 197

 Score = 31.6 bits (71), Expect = 0.40
 Identities = 19/52 (36%), Positives = 29/52 (55%), Gaps = 8/52 (15%)

Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
           E   +E EE EE  E + +E K+ E+        E++EE+E E EE+  EE+
Sbjct: 129 EKARQEREELEERMEWERREEKIDER--------EDQEEQEREREEQTIEEQ 172



 Score = 30.0 bits (67), Expect = 1.0
 Identities = 17/54 (31%), Positives = 30/54 (55%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
           ERE  EE  E E  EE+   +E + +++++      EE+ ++ E E  E++E E
Sbjct: 134 EREELEERMEWERREEKIDEREDQEEQEREREEQTIEEQSDDSEHEIIEQDESE 187



 Score = 28.1 bits (62), Expect = 5.7
 Identities = 13/55 (23%), Positives = 30/55 (54%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           E  +  E  EE+ +E E + ++ + +E++ +     + E E  E++E E E +++
Sbjct: 139 EERMEWERREEKIDEREDQEEQEREREEQTIEEQSDDSEHEIIEQDESETESDDD 193


>gnl|CDD|148679 pfam07218, RAP1, Rhoptry-associated protein 1 (RAP-1).  This family
           consists of several rhoptry-associated protein 1 (RAP-1)
           sequences which appear to be specific to Plasmodium
           falciparum.
          Length = 790

 Score = 32.0 bits (72), Expect = 0.42
 Identities = 17/39 (43%), Positives = 26/39 (66%)

Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEE 288
           EEEE E +EE  +A+E K +  +K+G  G EE+ + +EE
Sbjct: 242 EEEEFELDEEHEEAEEDKKEALEKIGAEGDEEKFKFDEE 280


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 32.3 bits (74), Expect = 0.43
 Identities = 9/79 (11%), Positives = 35/79 (44%)

Query: 220  DPVNRPDGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGG 279
             P  +          +     K  E++   +++ +   ++ + ++ ++L  + +      
Sbjct: 1304 SPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQSSRLLRRPRKKKSDS 1363

Query: 280  EEEEEEEEEEEEEEEEEEE 298
              E++++ E ++ E+E++E
Sbjct: 1364 SSEDDDDSEVDDSEDEDDE 1382


>gnl|CDD|217830 pfam03986, Autophagy_N, Autophagocytosis associated protein (Atg3),
           N-terminal domain.  Autophagocytosis is a
           starvation-induced process responsible for transport of
           cytoplasmic proteins to the lysosome/vacuole. Atg3 is a
           ubiquitin like modifier that is topologically similar to
           the canonical E2 enzyme. It catalyzes the conjugation of
           Atg8 and phosphatidylethanolamine.
          Length = 146

 Score = 30.8 bits (70), Expect = 0.43
 Identities = 12/67 (17%), Positives = 32/67 (47%), Gaps = 7/67 (10%)

Query: 240 CKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGG-------GGEEEEEEEEEEEEE 292
            K  E     EE  E+E+E++        ++K+K            G+++++  +  + +
Sbjct: 79  AKQMEYGDGAEEIVEDEDEDDGWVTTHGNRDKQKDDIADEEDIPEIGDDDDDVVDSSDAD 138

Query: 293 EEEEEEI 299
           E+++++I
Sbjct: 139 EDDDDDI 145


>gnl|CDD|237047 PRK12298, obgE, GTPase CgtA; Reviewed.
          Length = 390

 Score = 31.8 bits (73), Expect = 0.44
 Identities = 12/57 (21%), Positives = 30/57 (52%), Gaps = 2/57 (3%)

Query: 245 REVHEEEEEEEEEEEETKAKETKLKE--KKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
             + E   EE EE E  +  E    +  +++L     E++++ +++ +E+++E  E+
Sbjct: 330 TFIEENPREEAEEAEAPEKVEFMWDDYHREQLEEVEEEDDDDWDDDWDEDDDEGVEV 386


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
           consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 31.9 bits (72), Expect = 0.44
 Identities = 14/52 (26%), Positives = 28/52 (53%), Gaps = 3/52 (5%)

Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
           + E ++ +EE  KAK+ K  + K+        E+E E++E E +++ E +  
Sbjct: 287 QIEIKKNDEEALKAKDHKAFDLKQ---ESKASEKEAEDKELEAQKKREPVAE 335



 Score = 30.4 bits (68), Expect = 1.3
 Identities = 11/53 (20%), Positives = 26/53 (49%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
           ++    ++ E E+ + E K  + +  + K       ++E +  E+E E++E E
Sbjct: 273 KQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKELE 325



 Score = 30.0 bits (67), Expect = 1.8
 Identities = 10/46 (21%), Positives = 23/46 (50%)

Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
            ++++E +     A  +  KE K++      E E+ + E ++ +EE
Sbjct: 251 RQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEE 296



 Score = 28.8 bits (64), Expect = 3.9
 Identities = 12/57 (21%), Positives = 24/57 (42%), Gaps = 2/57 (3%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLK--EKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
             +  ++  E ++ E E    E K    E  K       + ++E +  E+E E++E 
Sbjct: 268 SPKEDKQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKEL 324



 Score = 28.0 bits (62), Expect = 7.7
 Identities = 16/57 (28%), Positives = 29/57 (50%), Gaps = 5/57 (8%)

Query: 244 EREVHEEEEEEEE--EEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           +++V E ++ E E  + E  K  E  LK K        + ++E +  E+E E++E E
Sbjct: 272 DKQVAENQKREIEKAQIEIKKNDEEALKAKDH---KAFDLKQESKASEKEAEDKELE 325


>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain.  Axonemal
           dynein light chain proteins play a dynamic role in
           flagellar and cilia motility. Eukaryotic cilia and
           flagella are complex organelles consisting of a core
           structure, the axoneme, which is composed of nine
           microtubule doublets forming a cylinder that surrounds a
           pair of central singlet microtubules. This
           ultra-structural arrangement seems to be one of the most
           stable micro-tubular assemblies known and is responsible
           for the flagellar and ciliary movement of a large number
           of organisms ranging from protozoan to mammals. This
           light chain interacts directly with the N-terminal half
           of the heavy chains.
          Length = 189

 Score = 31.0 bits (71), Expect = 0.45
 Identities = 19/57 (33%), Positives = 34/57 (59%), Gaps = 5/57 (8%)

Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           G+ E+ +E ++ EEE+EE + +  +L+ K +       E+ EEEE + EE+   +EI
Sbjct: 121 GKSELEQEIKKLEEEKEELEKRVAELEAKLE-----AIEKREEEERQIEEKRHADEI 172


>gnl|CDD|225053 COG2142, SdhD, Succinate dehydrogenase, hydrophobic anchor subunit
           [Energy production and conversion].
          Length = 117

 Score = 30.3 bits (69), Expect = 0.45
 Identities = 15/76 (19%), Positives = 26/76 (34%), Gaps = 13/76 (17%)

Query: 123 YVTTCWPFGDVWCKMYLIVVTAYASVYTLVLMSLDRFLAVVHPIASMSV--------RTE 174
           Y    W         Y   V   A+ +  V + L    A++H    + V           
Sbjct: 36  YFLLTWLNAT-----YAAWVAFLANPFWKVFLLLLLVAALIHAWNGLRVIIEDYIKPEKL 90

Query: 175 RNALKAILITWIVIVL 190
           R AL+ +L+  +V+  
Sbjct: 91  RLALQILLVLALVLTG 106


>gnl|CDD|203444 pfam06424, PRP1_N, PRP1 splicing factor, N-terminal.  This domain
           is specific to the N-terminal part of the prp1 splicing
           factor, which is involved in mRNA splicing (and possibly
           also poly(A)+ RNA nuclear export and cell cycle
           progression). This domain is specific to the N terminus
           of the RNA splicing factor encoded by prp1. It is
           involved in mRNA splicing and possibly also poly(A)and
           RNA nuclear export and cell cycle progression.
          Length = 131

 Score = 30.7 bits (70), Expect = 0.45
 Identities = 16/82 (19%), Positives = 37/82 (45%), Gaps = 9/82 (10%)

Query: 218 DLDPVNRPDGYNKTIFQSENDYCKGGEREV---HEEEEEEEEEEEETKAKETKLKEKKKL 274
           D+ P       +    +    Y  G    +    + ++E+EE +   ++ + ++ E++K 
Sbjct: 21  DIGPARDGVDIDDEEDEDPKRYQDGDNEGLFSDGKYDDEDEEADRIYESIDERMDERRK- 79

Query: 275 GGGGGEEEEEEEEEEEEEEEEE 296
                +  E++E+EE E+  EE
Sbjct: 80  -----KRREQKEKEEIEKYREE 96


>gnl|CDD|223068 PHA03384, PHA03384, early DNA-binding protein E2A; Provisional.
          Length = 445

 Score = 32.0 bits (73), Expect = 0.46
 Identities = 19/58 (32%), Positives = 26/58 (44%)

Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           K G R V   EEEEEEEE E  A        +   G  G+      +EE++ E++   
Sbjct: 29  KKGRRRVSPVEEEEEEEEAEVVAVGFSYPPVRISRGKDGKRPVRPLKEEKDSEKKAST 86


>gnl|CDD|221586 pfam12457, TIP_N, Tuftelin interacting protein N terminal.  This
           domain family is found in eukaryotes, and is typically
           between 99 and 114 amino acids in length. The family is
           found in association with pfam08697, pfam01585. There
           are two completely conserved residues (G and F) that may
           be functionally important. TIP is involved in enamel
           assembly by interacting with one of the major proteins
           responsible for biomineralisation of enamel - tuftelin.
          Length = 106

 Score = 30.0 bits (68), Expect = 0.47
 Identities = 12/62 (19%), Positives = 32/62 (51%), Gaps = 14/62 (22%)

Query: 251 EEEEEEEEEETKAKETKLKEKKKL--------------GGGGGEEEEEEEEEEEEEEEEE 296
           ++++++E E  + +  +  +KKK                    EE+++E+E E+++E +E
Sbjct: 41  DDDDDDEGERRRGRGRRRSKKKKDYSKPVAFVSGGIKQAAKEPEEDDKEDESEDDDESDE 100

Query: 297 EE 298
           ++
Sbjct: 101 DD 102


>gnl|CDD|237744 PRK14521, rpsP, 30S ribosomal protein S16; Provisional.
          Length = 186

 Score = 31.3 bits (71), Expect = 0.48
 Identities = 11/52 (21%), Positives = 20/52 (38%), Gaps = 3/52 (5%)

Query: 250 EEEEEEEEEEETKAK---ETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           +++  + ++   KA    E K+ E +       +  E      EE    EEE
Sbjct: 118 KDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAAAVAAEEAAAAEEE 169



 Score = 30.5 bits (69), Expect = 0.72
 Identities = 15/44 (34%), Positives = 20/44 (45%)

Query: 255 EEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           E E++  +A+   + EKK         EE    EEEE EE   E
Sbjct: 134 EAEKKVNEARAEAVAEKKAAEAAAVAAEEAAAAEEEEAEEAPAE 177



 Score = 30.5 bits (69), Expect = 0.83
 Identities = 17/57 (29%), Positives = 24/57 (42%), Gaps = 8/57 (14%)

Query: 244 EREVHEEEEEE--EEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           E++V+E   E   E++  E  A   +     +      EEE EE   EE   EE  E
Sbjct: 136 EKKVNEARAEAVAEKKAAEAAAVAAEEAAAAE------EEEAEEAPAEEAPAEESAE 186


>gnl|CDD|235033 PRK02363, PRK02363, DNA-directed RNA polymerase subunit delta;
           Reviewed.
          Length = 129

 Score = 30.4 bits (69), Expect = 0.48
 Identities = 11/43 (25%), Positives = 24/43 (55%)

Query: 255 EEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
           +E +EE    E K  +KKK    G ++  +++   +++ +EE+
Sbjct: 74  DEIDEEIIPLEEKFDKKKKKFMDGDDDIIDDDILPDDDFDEED 116



 Score = 29.6 bits (67), Expect = 0.94
 Identities = 10/48 (20%), Positives = 26/48 (54%), Gaps = 4/48 (8%)

Query: 252 EEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           +E +EE    + K     +KKK     G+++  +++   +++ +EE++
Sbjct: 74  DEIDEEIIPLEEKF----DKKKKKFMDGDDDIIDDDILPDDDFDEEDL 117



 Score = 27.3 bits (61), Expect = 5.2
 Identities = 15/53 (28%), Positives = 27/53 (50%), Gaps = 6/53 (11%)

Query: 249 EEEEEEEEEEEETKAK------ETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
           +EE    EE+ + K K      +  + +         EE+ +EE++E+EE+EE
Sbjct: 77  DEEIIPLEEKFDKKKKKFMDGDDDIIDDDILPDDDFDEEDLDEEDDEDEEDEE 129


>gnl|CDD|187756 cd09299, TDT, The Tellurite-resistance/Dicarboxylate Transporter
           (TDT) family.  The Tellurite-resistance/Dicarboxylate
           Transporter (TDT) family includes members from all three
           kingdoms, but only three members of the family have been
           functionally characterized: the TehA protein of E. coli
           functioning as a tellurite-resistance uptake permease,
           the Mae1 protein of S. pombe functioning in the uptake
           of malate and other dicarboxylates, and the sulfite
           efflux pump (SSU1) of Saccharomyces cerevisiae. In
           plants, the plasma membrane protein SLAC1 (Slow Anion
           Channel-Associated 1), which is preferentially expressed
           in guard cells, encodes a distant homolog of fungal and
           bacterial dicarboxylate/malic acid transport proteins.
           SLAC1 is essential  in mediating stomatal responses to
           physiological and stress stimuli. Members of the TDT
           family exhibit 10 putative transmembrane alpha-helical
           spanners (TMSs).
          Length = 326

 Score = 31.5 bits (72), Expect = 0.48
 Identities = 19/93 (20%), Positives = 31/93 (33%), Gaps = 14/93 (15%)

Query: 61  VSIIVPIFFGIIIILGFFGN---------ALVVVVVAFNQQMRSTTNLLIINLAIADLLF 111
           +   + IF G II      N          L++V+    Q   S   LL++      + F
Sbjct: 179 IGFFLFIFIGAIIFSRLVINKLPPGQLAPTLMIVLGPMGQ---SIFALLLLGKNARLIFF 235

Query: 112 -IVCCVPFTATDYVTTCWPFGDVWCKMYLIVVT 143
                +   A       W FG +W  +  +V  
Sbjct: 236 LYASFLQALAVFASIILWGFG-LWWLLLALVSI 267


>gnl|CDD|233042 TIGR00598, rad14, DNA repair protein.  All proteins in this family
           for which functions are known are used for the
           recognition of DNA damage as part of nucleotide excision
           repair. This family is based on the phylogenomic
           analysis of JA Eisen (1999, Ph.D. Thesis, Stanford
           University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 172

 Score = 30.9 bits (70), Expect = 0.49
 Identities = 22/65 (33%), Positives = 28/65 (43%), Gaps = 7/65 (10%)

Query: 242 GGEREVHEEEEEEEEEEEETKAK--ETKLKEKKK-----LGGGGGEEEEEEEEEEEEEEE 294
           G E  + EE+E  EE +EE K K  E KLKE ++           E    E E   E   
Sbjct: 91  GSEEALDEEKERREESKEEMKEKKFEKKLKELRRAVRSSEYTNKKEGRVHEHEFGPETNG 150

Query: 295 EEEEI 299
            EE+ 
Sbjct: 151 VEEDT 155


>gnl|CDD|219227 pfam06912, DUF1275, Protein of unknown function (DUF1275).  This
           family consists of several hypothetical bacterial
           proteins of around 200 residues in length. The function
           of this family is unknown although a few members are
           thought to be membrane proteins.
          Length = 206

 Score = 31.0 bits (71), Expect = 0.51
 Identities = 12/63 (19%), Positives = 24/63 (38%), Gaps = 1/63 (1%)

Query: 66  PIFFGIIIILGF-FGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTATDYV 124
                ++ +L F  G AL  +++   ++ R    L ++ L  A LL     +P      +
Sbjct: 48  LALHKLLALLAFVLGAALAGLLIRLLRRRRLRRWLALLLLLEAVLLLAAALLPLALPPLI 107

Query: 125 TTC 127
              
Sbjct: 108 AIA 110


>gnl|CDD|185618 PTZ00438, PTZ00438, gamete antigen 27/25-like protein; Provisional.
          Length = 374

 Score = 31.6 bits (71), Expect = 0.52
 Identities = 19/58 (32%), Positives = 36/58 (62%), Gaps = 3/58 (5%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
           ER   +EEEE+E+ EE  + +E ++ E++       E+ E+++E+E + E +E E+ G
Sbjct: 102 ERGTQKEEEEDEDVEEIEEVEEVEVVEEEY---DDDEDSEKDDEKESDAEGDENELAG 156



 Score = 28.9 bits (64), Expect = 3.3
 Identities = 19/67 (28%), Positives = 38/67 (56%), Gaps = 3/67 (4%)

Query: 233 FQSENDY-CKGGEREVHEEEEE--EEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEE 289
             +END   +G    V  EEE   ++EEEE+   +E +  E+ ++     +++E+ E+++
Sbjct: 82  SDNENDVELEGLNIIVKNEEERGTQKEEEEDEDVEEIEEVEEVEVVEEEYDDDEDSEKDD 141

Query: 290 EEEEEEE 296
           E+E + E
Sbjct: 142 EKESDAE 148


>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
          Length = 880

 Score = 31.9 bits (73), Expect = 0.53
 Identities = 16/50 (32%), Positives = 22/50 (44%)

Query: 251 EEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
            E  E  EE+ +  E   +   +L     E+ E   E EEE EE  EE+ 
Sbjct: 526 AERRETIEEKRERAEELRERAAELEAEAEEKREAAAEAEEEAEEAREEVA 575



 Score = 27.7 bits (62), Expect = 9.5
 Identities = 19/50 (38%), Positives = 28/50 (56%), Gaps = 5/50 (10%)

Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           +EE E  EE+  +A+ET+ +  + L     EE EE  EE E  E E E++
Sbjct: 219 DEEIERYEEQREQARETRDEADEVL-----EEHEERREELETLEAEIEDL 263


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 31.7 bits (73), Expect = 0.56
 Identities = 18/56 (32%), Positives = 33/56 (58%), Gaps = 3/56 (5%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           E+E+ E   E ++ E+    KE  L  K +L     + EEE E++E+E E++++E+
Sbjct: 74  EKELRERRNELQKLEKRLLQKEENLDRKLEL---LEKREEELEKKEKELEQKQQEL 126



 Score = 29.4 bits (67), Expect = 3.0
 Identities = 21/58 (36%), Positives = 34/58 (58%), Gaps = 3/58 (5%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
           E  +  + E  E+ EEE + KE +L++K++      ++EEE EE  EE+ +E E I G
Sbjct: 95  EENLDRKLELLEKREEELEKKEKELEQKQQ---ELEKKEEELEELIEEQLQELERISG 149



 Score = 27.8 bits (63), Expect = 8.1
 Identities = 17/54 (31%), Positives = 35/54 (64%), Gaps = 6/54 (11%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
           E+ + ++EE  + + E  + +E +L++K+K      E E++++E E++EEE EE
Sbjct: 88  EKRLLQKEENLDRKLELLEKREEELEKKEK------ELEQKQQELEKKEEELEE 135


>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
           envelope biogenesis, outer membrane].
          Length = 387

 Score = 31.5 bits (71), Expect = 0.56
 Identities = 15/65 (23%), Positives = 35/65 (53%), Gaps = 3/65 (4%)

Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKA-KETKLKEKKKLGGGGGEEEEEEEEEEEEE 292
           + E    K  E+   E + ++  E+E  K  ++ +LK +++      EE E++ + E+++
Sbjct: 76  KGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQ--QKQAEEAEKQAQLEQKQ 133

Query: 293 EEEEE 297
           +EE+ 
Sbjct: 134 QEEQA 138



 Score = 30.7 bits (69), Expect = 0.99
 Identities = 33/138 (23%), Positives = 70/138 (50%), Gaps = 11/138 (7%)

Query: 169 MSVRTERNA--LKAILITWIV-IVLTAVPV---FNMHGEVSYTYA-SELHTACVFLDLDP 221
           MS  TE+N    +AI+I+ ++ I+L A+ +    +   E S       +  A +   +DP
Sbjct: 1   MSKATEQNDKLKRAIIISAVLHIILFALLIWGSLDETIEASGGGGGGSVIDAVM---VDP 57

Query: 222 VNRPDGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEE 281
                 Y + I   ++   KG ++   +EE+  EE + +  A++ +LK+ +K      E+
Sbjct: 58  GAVVQQYGR-IQSQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQ 116

Query: 282 EEEEEEEEEEEEEEEEEI 299
           +++ EE E++ + E+++ 
Sbjct: 117 QKQAEEAEKQAQLEQKQQ 134



 Score = 28.4 bits (63), Expect = 5.8
 Identities = 12/65 (18%), Positives = 33/65 (50%)

Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
            +E +  K  E+E  + +E++++ EE  K  + + K++++       E++++ E  + + 
Sbjct: 97  AAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKA 156

Query: 294 EEEEE 298
             E  
Sbjct: 157 AAEAA 161


>gnl|CDD|219521 pfam07695, 7TMR-DISM_7TM, 7TM diverse intracellular signalling.
           This entry represents the transmembrane region of the
           7TM-DISM (7TM Receptors with Diverse Intracellular
           Signalling Modules).
          Length = 207

 Score = 31.1 bits (71), Expect = 0.56
 Identities = 24/159 (15%), Positives = 54/159 (33%), Gaps = 39/159 (24%)

Query: 67  IFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTATD---- 122
           +F+GI++ L  + N  +   +      R  + L  +       L+I+  + +  +     
Sbjct: 7   LFYGILLALALY-NLFLFFSL------RDRSYLYYV-------LYILSFLLYQLSLNGLG 52

Query: 123 --YVTTCWPFGDVWCKMYLIVVTAYASVYTLVLMSLDRFLAVVHPIASMSVRTERNALKA 180
             Y+   WP    W    L+ +     V    L+    FL             ++   + 
Sbjct: 53  FQYL---WPNAPPWLNNKLLYLFLALLVIFFALLFARSFL-----------ELKKYLPRL 98

Query: 181 --ILITWIVIVLTAVPVFNMHGEVSYTYASELHTACVFL 217
             +L+   +++L  + +  +     YT +  L      L
Sbjct: 99  DRLLLGLALLLLLLLLLAPL---FPYTLSLRLAQLLALL 134


>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
          Length = 619

 Score = 31.7 bits (73), Expect = 0.58
 Identities = 14/42 (33%), Positives = 22/42 (52%), Gaps = 5/42 (11%)

Query: 235 SENDYCKGGEREVHEEEEEEEEEEEET-----KAKETKLKEK 271
           +E D    G      +++E+EEEEE+       A E++L EK
Sbjct: 178 AEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADESELPEK 219



 Score = 31.3 bits (72), Expect = 0.73
 Identities = 13/55 (23%), Positives = 24/55 (43%), Gaps = 7/55 (12%)

Query: 259 EETKAKETKLKEKKKL-------GGGGGEEEEEEEEEEEEEEEEEEEIGGVVVSA 306
           +  +  E +L+E                   E EE +++E+EEEEE+     ++A
Sbjct: 157 DRLENGERRLRELIDGFVDPNAEEDPAHVGSELEELDDDEDEEEEEDENDDSLAA 211



 Score = 29.0 bits (66), Expect = 3.8
 Identities = 16/70 (22%), Positives = 28/70 (40%), Gaps = 10/70 (14%)

Query: 243 GEREVHE----EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           GER + E      +   EE+      E +  +         +EEEEE+E ++    +E E
Sbjct: 162 GERRLRELIDGFVDPNAEEDPAHVGSELEELD------DDEDEEEEEDENDDSLAADESE 215

Query: 299 IGGVVVSAVD 308
           +   V+    
Sbjct: 216 LPEKVLEKFK 225


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 31.5 bits (72), Expect = 0.63
 Identities = 6/54 (11%), Positives = 33/54 (61%)

Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           +V  + +++++++++    +  + +         ++++++ ++E+EE++E +E+
Sbjct: 125 DVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKEL 178



 Score = 31.5 bits (72), Expect = 0.68
 Identities = 6/50 (12%), Positives = 30/50 (60%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
            +++++++++++    +    +  +      ++++ ++E+EE++E +E E
Sbjct: 130 ADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELE 179



 Score = 30.0 bits (68), Expect = 1.6
 Identities = 9/61 (14%), Positives = 29/61 (47%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVSAVD 308
           +++++++++ ++    +    E         + ++E+EE++E +E E+       V   D
Sbjct: 133 DDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDED 192

Query: 309 F 309
            
Sbjct: 193 D 193



 Score = 29.6 bits (67), Expect = 2.2
 Identities = 9/90 (10%), Positives = 35/90 (38%), Gaps = 9/90 (10%)

Query: 220 DPVNRPDGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGG 279
              +    Y K I           + +  + ++++ +++++ +  +    +         
Sbjct: 112 LDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDD--------- 162

Query: 280 EEEEEEEEEEEEEEEEEEEIGGVVVSAVDF 309
           ++ ++E+EE++E +E E+          + 
Sbjct: 163 DDVDDEDEEKKEAKELEKLSDDDDFVWDED 192


>gnl|CDD|218134 pfam04535, DUF588, Domain of unknown function (DUF588).  This
           family of plant proteins contains a domain that may have
           a catalytic activity. It has a conserved arginine and
           aspartate that could form an active site. These proteins
           are predicted to contain 3 or 4 transmembrane helices.
          Length = 150

 Score = 30.4 bits (69), Expect = 0.63
 Identities = 18/101 (17%), Positives = 35/101 (34%), Gaps = 26/101 (25%)

Query: 104 LAIADLLFIVCCVPFT-------ATDYVTTCWPFGDVWCK-------MYLIVVTAYASVY 149
           L +A+L+  +              T+  T  + F              YL+V  A A+ Y
Sbjct: 4   LRLAELVLRLAAFVLALAAAVVMGTNKQTKPFFFIQKKASFSDLPAFRYLVVANAIAAGY 63

Query: 150 TLVLMSLDRFLAVVHPIASMSVRTERNALKAILITWIVIVL 190
           +L+ +     L+V            R   +   + W++ +L
Sbjct: 64  SLLQL----VLSVYLLS--------RKKPRTKGLAWLLFIL 92


>gnl|CDD|173502 PTZ00266, PTZ00266, NIMA-related protein kinase; Provisional.
          Length = 1021

 Score = 31.6 bits (71), Expect = 0.65
 Identities = 20/52 (38%), Positives = 24/52 (46%), Gaps = 6/52 (11%)

Query: 248 HEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           H E    E+E    KA E K+ EKK++      E  E EE E  E E  E I
Sbjct: 434 HAERARIEKENAHRKALEMKILEKKRI------ERLEREERERLERERMERI 479


>gnl|CDD|224217 COG1298, FlhA, Flagellar biosynthesis pathway, component FlhA [Cell
           motility and secretion / Intracellular trafficking and
           secretion].
          Length = 696

 Score = 31.5 bits (72), Expect = 0.67
 Identities = 16/72 (22%), Positives = 29/72 (40%), Gaps = 10/72 (13%)

Query: 242 GGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
                   +E++ EE+E+  +  +              EEEEE  ++    +  E E+G 
Sbjct: 319 LAYLLSKRKEQQAEEKEKPAEEAKK----------EEEEEEEESVDDVLLIDPIELELGY 368

Query: 302 VVVSAVDFGSGG 313
            ++  VD   GG
Sbjct: 369 GLIPLVDEQQGG 380


>gnl|CDD|225153 COG2244, RfbX, Membrane protein involved in the export of O-antigen
           and teichoic acid [General function prediction only].
          Length = 480

 Score = 31.3 bits (71), Expect = 0.72
 Identities = 23/179 (12%), Positives = 58/179 (32%), Gaps = 33/179 (18%)

Query: 36  PAICRNFSHEYINDNDVDAEAMERMVSIIVPIFFGIIIILGFFG---------------- 79
            AI R  +        +    +  ++ +++ +   ++++L  +                 
Sbjct: 65  AAITREIAEYREKGEYLLLILLSVLLLLLLALILLLLLLLIAYLLAPIDPVLALLLRILS 124

Query: 80  -----NALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTATDYVTTCWPFGDVW 134
                  L  V+    Q       L +  ++   LL  V  + F A              
Sbjct: 125 LALLLLPLSSVLRGLFQGFGRFGPLALSIVSSIFLLAAVFALLFAALGLA---------- 174

Query: 135 CKMYLIVVTAYASVYTLVLMSLDRFLAVVHPIASMSVRTERNALKAILITWIVIVLTAV 193
             ++ +V+ A  S+  L+++   +   +  PI   S+   +  L+  L   +  +L  +
Sbjct: 175 --VWALVLGAVVSLLVLLILLGKKKRGLKRPILRFSLALLKELLRFGLPLLLSSLLNFL 231


>gnl|CDD|240578 cd12951, RRP7_Rrp7A, RRP7 domain ribosomal RNA-processing protein 7
           homolog A (Rrp7A) and similar proteins.  The family
           corresponds to the RRP7 domain of Rrp7A, also termed
           gastric cancer antigen Zg14, and similar proteins which
           are yeast ribosomal RNA-processing protein 7 (Rrp7p)
           homologs mainly found in Metazoans. The cellular
           function of Rrp7A remains unclear currently. Rrp7A
           harbors an N-terminal RNA recognition motif (RRM), also
           termed RBD (RNA binding domain) or RNP
           (ribonucleoprotein domain), and a C-terminal RRP7
           domain.
          Length = 129

 Score = 29.9 bits (68), Expect = 0.74
 Identities = 14/27 (51%), Positives = 21/27 (77%)

Query: 280 EEEEEEEEEEEEEEEEEEEIGGVVVSA 306
           ++EEEEE+EE+E+E E +E G V V+ 
Sbjct: 31  DKEEEEEKEEKEKEAEPDEDGWVTVTK 57



 Score = 27.2 bits (61), Expect = 5.3
 Identities = 16/59 (27%), Positives = 33/59 (55%), Gaps = 8/59 (13%)

Query: 246 EVHEEEEEEEEEEEETKAKE--------TKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
           E +++EEEEE+EE+E +A+         TK   + K         +  E+E+++++++E
Sbjct: 28  EEYDKEEEEEKEEKEKEAEPDEDGWVTVTKKGRRPKTARKESVAAKAAEKEKKKKKKKE 86


>gnl|CDD|225887 COG3351, FlaD, Putative archaeal flagellar protein D/E [Cell
           motility and secretion].
          Length = 214

 Score = 30.6 bits (69), Expect = 0.76
 Identities = 12/49 (24%), Positives = 23/49 (46%)

Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           +E+ EE   +  K+ +  ++E  +      E    E+ +  EEE  E+E
Sbjct: 24  KEKIEELPIQAKKSDDELVEELPERYEQTKENSLIEKVDSIEEEISEKE 72



 Score = 28.3 bits (63), Expect = 4.9
 Identities = 13/53 (24%), Positives = 24/53 (45%)

Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
            + E +E+ EE   + K  + +L E+        +E    E+ +  EEE  E+
Sbjct: 19  ALEELKEKIEELPIQAKKSDDELVEELPERYEQTKENSLIEKVDSIEEEISEK 71



 Score = 27.9 bits (62), Expect = 6.1
 Identities = 18/62 (29%), Positives = 28/62 (45%), Gaps = 1/62 (1%)

Query: 244 EREVHEEEEEEEEE-EEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGV 302
           E+       E+ +  EEE   KE  + EK K         EEEE++ + EE  E+ +  +
Sbjct: 50  EQTKENSLIEKVDSIEEEISEKEKVMSEKLKEPAQMSSTSEEEEKKAKLEEIPEDVVSIM 109

Query: 303 VV 304
           V 
Sbjct: 110 VA 111


>gnl|CDD|216683 pfam01758, SBF, Sodium Bile acid symporter family.  This family
           consists of Na+/bile acid co-transporters. These
           transmembrane proteins function in the liver in the
           uptake of bile acids from portal blood plasma a process
           mediated by the co-transport of Na+. Also in the family
           is ARC3 from S. cerevisiae, this is a putative
           transmembrane protein involved in resistance to arsenic
           compounds.
          Length = 188

 Score = 30.3 bits (69), Expect = 0.78
 Identities = 23/121 (19%), Positives = 45/121 (37%), Gaps = 14/121 (11%)

Query: 72  IIILGFFGNALV--VVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTATDYVTTCWP 129
            ++LG     ++  +++      +      L + L       +V C P  A   V T   
Sbjct: 29  ALLLGLLLQWVIMPLLMFILAWFLLRLPPELAVGL------ILVGCAPGGAMSNVWTYLA 82

Query: 130 FGDVWCKMYLIVVTAYASVYTLVLMSLDRFLAVVHPIASMSVRTERNALKAILITWIVIV 189
            GDV      +V+ A +++   ++  L  FL     +    V T  + L  I    I ++
Sbjct: 83  KGDV---ELSVVMVALSTLTAPLVTPLLSFLLSGLLLD---VGTAVSPLSLIKSVLIYVI 136

Query: 190 L 190
           +
Sbjct: 137 I 137


>gnl|CDD|218576 pfam05395, DARPP-32, Protein phosphatase inhibitor 1/DARPP-32.
           This family consists of several mammalian protein
           phosphatase inhibitor 1 (IPP-1) and dopamine- and
           cAMP-regulated neuronal phosphoprotein (DARPP-32)
           proteins. Protein phosphatase inhibitor-1 is involved in
           signal transduction and is an endogenous inhibitor of
           protein phosphatase-1. It has been demonstrated that
           DARPP-32, if phosphorylated, can inhibit
           protein-phosphatase-1. DARPP-32 has a key role in many
           neurotransmitter pathways throughout the brain and has
           been shown to be involved in controlling receptors, ion
           channels and other physiological factors including the
           brain's response to drugs of abuse, such as cocaine,
           opiates and nicotine. DARPP-32 is reciprocally regulated
           by the two neurotransmitters that are most often
           implicated in schizophrenia - dopamine and glutamate.
           Dopamine activates DARPP-32 through the D1 receptor
           pathway and disables DARPP-32 through the D2 receptor.
           Glutamate, acting through the N-methyl-d-aspartate
           receptor, renders DARPP-32 inactive. A mutant form of
           DARPP-32 has been linked with gastric cancers.
          Length = 170

 Score = 30.3 bits (68), Expect = 0.81
 Identities = 12/51 (23%), Positives = 21/51 (41%)

Query: 248 HEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
              + +   EE +   +E +L    +LG   GE E  E ++ EE      +
Sbjct: 89  SHLQSQGSLEENQASEEEDELDSPDELGYPTGETESTEAQDSEEPGSCPTQ 139



 Score = 29.5 bits (66), Expect = 1.7
 Identities = 12/61 (19%), Positives = 19/61 (31%), Gaps = 4/61 (6%)

Query: 255 EEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVSAVDFGSGGW 314
              EE   ++E    +     G    E E  E ++ EE            +  + G  G 
Sbjct: 95  GSLEENQASEEEDELDSPDELGYPTGETESTEAQDSEEPGSCPTQD----AQSEVGLEGP 150

Query: 315 W 315
           W
Sbjct: 151 W 151


>gnl|CDD|237631 PRK14162, PRK14162, heat shock protein GrpE; Provisional.
          Length = 194

 Score = 30.6 bits (69), Expect = 0.83
 Identities = 14/48 (29%), Positives = 25/48 (52%), Gaps = 4/48 (8%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
           EE   E++  +E    E   KE K+      +EE++E++   E+ E+E
Sbjct: 4   EEFPSEKDLPQEETTDEAPKKEAKE----APKEEDQEKQNPVEDLEKE 47


>gnl|CDD|225029 COG2118, COG2118, DNA-binding protein [General function prediction
           only].
          Length = 116

 Score = 29.6 bits (67), Expect = 0.84
 Identities = 14/47 (29%), Positives = 27/47 (57%), Gaps = 2/47 (4%)

Query: 253 EEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
            ++EE EE + +  KL E ++      + E +EE+  +EEE +++ I
Sbjct: 1   MDDEELEEIRRR--KLAELQRQAKLEEQREAQEEQARQEEEAQKQAI 45



 Score = 28.1 bits (63), Expect = 2.7
 Identities = 16/49 (32%), Positives = 27/49 (55%), Gaps = 6/49 (12%)

Query: 252 EEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
            ++EE EE  + K  +L+ + KL      EE+ E +EE+  +EEE +  
Sbjct: 1   MDDEELEEIRRRKLAELQRQAKL------EEQREAQEEQARQEEEAQKQ 43


>gnl|CDD|219921 pfam08592, DUF1772, Domain of unknown function (DUF1772).  This
           domain is of unknown function.
          Length = 137

 Score = 30.0 bits (68), Expect = 0.84
 Identities = 16/69 (23%), Positives = 26/69 (37%), Gaps = 7/69 (10%)

Query: 53  DAEAMERMVSI--IVPIFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLL 110
             +A+     I    P F  +         A  + V+A   +    + LL   LA A  L
Sbjct: 19  PDQALAAWQRINRRGPAFMPLFAGTSALLFA-YLAVLALRGRGGPGSRLL---LAAA-AL 73

Query: 111 FIVCCVPFT 119
           +++  VP T
Sbjct: 74  YLLGVVPVT 82


>gnl|CDD|219761 pfam08243, SPT2, SPT2 chromatin protein.  This family includes the
           Saccharomyces cerevisiae protein SPT2 which is a
           chromatin protein involved in transcriptional
           regulation.
          Length = 116

 Score = 29.5 bits (66), Expect = 0.84
 Identities = 13/44 (29%), Positives = 20/44 (45%), Gaps = 7/44 (15%)

Query: 238 DYCKGGEREVHEEEE-------EEEEEEEETKAKETKLKEKKKL 274
           D  +    E+ +EE         E+E E   + +E K K+KKK 
Sbjct: 72  DNMEATFMEIQKEERRSARMARLEDERELAREEEEEKRKKKKKN 115



 Score = 28.7 bits (64), Expect = 1.7
 Identities = 14/65 (21%), Positives = 21/65 (32%), Gaps = 18/65 (27%)

Query: 233 FQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEE 292
              E+D     E       E ++EE    +                   E+E E   EEE
Sbjct: 64  RYDEDDALDNMEAT---FMEIQKEERRSARM---------------ARLEDERELAREEE 105

Query: 293 EEEEE 297
           EE+ +
Sbjct: 106 EEKRK 110



 Score = 28.3 bits (63), Expect = 2.2
 Identities = 14/71 (19%), Positives = 25/71 (35%), Gaps = 14/71 (19%)

Query: 225 PDGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEE 284
             G  ++ +   ++       E    E ++EE      A+               E+E E
Sbjct: 54  GKGRKRSYYDRYDEDDALDNMEATFMEIQKEERRSARMARL--------------EDERE 99

Query: 285 EEEEEEEEEEE 295
              EEEEE+ +
Sbjct: 100 LAREEEEEKRK 110



 Score = 27.9 bits (62), Expect = 2.8
 Identities = 14/50 (28%), Positives = 21/50 (42%), Gaps = 14/50 (28%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           E    E ++EE   A+  +L              E+E E   EEEEE+ +
Sbjct: 75  EATFMEIQKEERRSARMARL--------------EDERELAREEEEEKRK 110


>gnl|CDD|218218 pfam04702, Vicilin_N, Vicilin N terminal region.  This region is
           found in plant seed storage proteins, N-terminal to the
           Cupin domain (pfam00190). In Macadamia integrifolia,
           this region is processed into peptides of approximately
           50 amino acids containing a C-X-X-X-C-(10-12)X-C-X-X-X-C
           motif. These peptides exhibit antimicrobial activity in
           vitro.
          Length = 147

 Score = 30.1 bits (67), Expect = 0.88
 Identities = 11/44 (25%), Positives = 22/44 (50%)

Query: 252 EEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
           +++E+     +  + + +E+ +  G G  E   +E EEE  EE 
Sbjct: 104 QQQEQRPRRQQQCQRECREQYQEHGRGEGENINQEGEEERSEEG 147



 Score = 30.1 bits (67), Expect = 0.92
 Identities = 15/69 (21%), Positives = 30/69 (43%), Gaps = 6/69 (8%)

Query: 228 YNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEE 287
           Y K   Q +  Y +  ++   ++E+    +++  +    + +E  +     GE E   +E
Sbjct: 85  YEKEQQQQQRQY-QECQQRCQQQEQRPRRQQQCQRECREQYQEHGR-----GEGENINQE 138

Query: 288 EEEEEEEEE 296
            EEE  EE 
Sbjct: 139 GEEERSEEG 147



 Score = 28.1 bits (62), Expect = 3.5
 Identities = 8/59 (13%), Positives = 28/59 (47%)

Query: 240 CKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
            +  +R   +E   + E+++  +  + + KE+++      E+ +   E+ ++  ++ E 
Sbjct: 13  YEQCQRRCEQETRGQREQQQCERRCKEQYKEEQQQQRQREEDPQRRYEQCQQRCQQHEP 71


>gnl|CDD|220838 pfam10659, Trypan_glycop_C, Trypanosome variant surface
           glycoprotein C-terminal domain.  The trypanosome
           parasite expresses these proteins to evade the immune
           response.
          Length = 98

 Score = 28.9 bits (65), Expect = 0.93
 Identities = 8/44 (18%), Positives = 23/44 (52%)

Query: 253 EEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
           +++E++ + K KE K K+           E   E+ + ++++++
Sbjct: 19  DKKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKKD 62


>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein.  CDC45 is an essential gene
           required for initiation of DNA replication in S.
           cerevisiae, forming a complex with MCM5/CDC46.
           Homologues of CDC45 have been identified in human, mouse
           and smut fungus among others.
          Length = 583

 Score = 31.1 bits (71), Expect = 0.94
 Identities = 14/50 (28%), Positives = 25/50 (50%), Gaps = 5/50 (10%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
            + EEE ++E         L+E         +EE +EE+EE  + E++E+
Sbjct: 104 GDIEEELQDEPRYDDAYRDLEE-----DDDDDEESDEEDEESSKSEDDED 148



 Score = 28.0 bits (63), Expect = 8.8
 Identities = 10/56 (17%), Positives = 30/56 (53%), Gaps = 19/56 (33%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           + E  ++++EE +EE+E                   E  + E++E++++++++++I
Sbjct: 122 DLEEDDDDDEESDEEDE-------------------ESSKSEDDEDDDDDDDDDDI 158


>gnl|CDD|215212 PLN02372, PLN02372, violaxanthin de-epoxidase.
          Length = 455

 Score = 31.0 bits (70), Expect = 0.98
 Identities = 19/56 (33%), Positives = 32/56 (57%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
            +E  + EEE E+E E+   +E  L ++  L  G  E E++EE   +E  +EE+E+
Sbjct: 378 VKEARQIEEELEKEVEKLGKEEESLFKRVALEEGLKELEQDEENFLKELSKEEKEL 433



 Score = 30.2 bits (68), Expect = 1.7
 Identities = 20/66 (30%), Positives = 30/66 (45%), Gaps = 8/66 (12%)

Query: 244 EREVHEEEEEEEEEEE--------ETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
           E E+ +E E+  +EEE        E   KE +  E+  L     EE+E  E+ + E  E 
Sbjct: 385 EEELEKEVEKLGKEEESLFKRVALEEGLKELEQDEENFLKELSKEEKELLEKLKMEASEV 444

Query: 296 EEEIGG 301
           E+  G 
Sbjct: 445 EKLFGR 450


>gnl|CDD|217783 pfam03896, TRAP_alpha, Translocon-associated protein (TRAP), alpha
           subunit.  The alpha-subunit of the TRAP complex (TRAP
           alpha) is a single-spanning membrane protein of the
           endoplasmic reticulum (ER) which is found in proximity
           of nascent polypeptide chains translocating across the
           membrane.
          Length = 281

 Score = 30.5 bits (69), Expect = 0.98
 Identities = 16/57 (28%), Positives = 25/57 (43%), Gaps = 19/57 (33%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
           + E  ++  +E+EE+E                      EE+E E  EEEE+EE E+ 
Sbjct: 37  DEEAEDDVVDEDEEDE-------------------AVVEEDENELTEEEEDEEGEVK 74



 Score = 28.6 bits (64), Expect = 4.5
 Identities = 17/60 (28%), Positives = 27/60 (45%), Gaps = 18/60 (30%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVSAVD 308
           +EE E++  +E+                   E+E   EE+E E  EEEE+  G V ++ D
Sbjct: 37  DEEAEDDVVDEDE------------------EDEAVVEEDENELTEEEEDEEGEVKASPD 78


>gnl|CDD|221203 pfam11748, DUF3306, Protein of unknown function (DUF3306).  This
           family of proteobacterial species proteins has no known
           function.
          Length = 115

 Score = 29.2 bits (66), Expect = 1.0
 Identities = 16/55 (29%), Positives = 19/55 (34%), Gaps = 14/55 (25%)

Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
             V  EE  E  E  E +A                       EEEEE E E+EE+
Sbjct: 11  LAVRAEEPAEPAETAEEEAA--------------AAAPAPAPEEEEEAELEDEEL 51



 Score = 26.9 bits (60), Expect = 6.5
 Identities = 18/51 (35%), Positives = 21/51 (41%), Gaps = 9/51 (17%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           EE  E  E  EE  A                 EEEEE E E+EE  EE ++
Sbjct: 16  EEPAEPAETAEEEAAAAAPA---------PAPEEEEEAELEDEELLEELDL 57


>gnl|CDD|234366 TIGR03825, FliH_bacil, flagellar assembly protein FliH.  This
           bacillus clade of FliH proteins is not found by the Pfam
           FliH model pfam02108, but is closely related to the
           sequences identified by that model. Sequences identified
           by this model are observed in flagellar operons in an
           analogous position relative to other flagellar operon
           genes.
          Length = 255

 Score = 30.5 bits (69), Expect = 1.1
 Identities = 18/74 (24%), Positives = 34/74 (45%)

Query: 222 VNRPDGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEE 281
              P      I   +       E+E+ +EE+E E+  E+ +A+  ++ E+ +       E
Sbjct: 10  SVIPAQERGIIPLRQVTDVPETEQELADEEQEFEQILEKAEAEAAQIIEQAEAQAAAIRE 69

Query: 282 EEEEEEEEEEEEEE 295
           + E+E  + EEE E
Sbjct: 70  QIEQERAQWEEERE 83


>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
          Length = 191

 Score = 30.2 bits (68), Expect = 1.1
 Identities = 13/38 (34%), Positives = 17/38 (44%)

Query: 261 TKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
            + K+T  K KKK        E   EE  + EE EE +
Sbjct: 58  PRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEEND 95



 Score = 28.6 bits (64), Expect = 3.0
 Identities = 14/37 (37%), Positives = 18/37 (48%)

Query: 262 KAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           K K TK  +KKK       E   EE  + EE EE ++
Sbjct: 60  KKKTTKKTKKKKKEKEEVPELAAEELSDSEENEENDK 96



 Score = 27.9 bits (62), Expect = 6.7
 Identities = 13/38 (34%), Positives = 16/38 (42%)

Query: 261 TKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           T  K+   K+ KK      E  E   EE  + EE EE 
Sbjct: 57  TPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEEN 94


>gnl|CDD|184732 PRK14539, PRK14539, 50S ribosomal protein L11/unknown domain fusion
           protein; Provisional.
          Length = 196

 Score = 29.9 bits (67), Expect = 1.1
 Identities = 16/50 (32%), Positives = 24/50 (48%), Gaps = 1/50 (2%)

Query: 249 EEEEEEEEEEEETKAKETKLKE-KKKLGGGGGEEEEEEEEEEEEEEEEEE 297
           EE +   +     KAKE  LK  + +L    G+  E     EE+E E++E
Sbjct: 147 EEAKAAAKAAALAKAKEASLKSAEAELKASKGKSIEVNVIGEEDEGEKDE 196


>gnl|CDD|114629 pfam05917, DUF874, Helicobacter pylori protein of unknown function
           (DUF874).  This family consists of several hypothetical
           proteins specific to Helicobacter pylori. The function
           of this family is unknown.
          Length = 417

 Score = 30.6 bits (68), Expect = 1.2
 Identities = 17/65 (26%), Positives = 39/65 (60%), Gaps = 4/65 (6%)

Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
           ++EN   +  +  +  E+EE++ E+E+ K +    KE  +L     + E+E+++ E+E++
Sbjct: 136 EAENARDRANKSGIELEQEEQKTEQEKQKTE----KEGIELANSQIKAEQEKQKTEQEKQ 191

Query: 294 EEEEE 298
           + E+E
Sbjct: 192 KTEQE 196



 Score = 28.7 bits (63), Expect = 4.3
 Identities = 14/49 (28%), Positives = 30/49 (61%)

Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           E E+EE++ E+ K K  K   +        E+E+++ E+E+++ E+E++
Sbjct: 150 ELEQEEQKTEQEKQKTEKEGIELANSQIKAEQEKQKTEQEKQKTEQEKQ 198


>gnl|CDD|240539 cd13134, MATE_like_8, Uncharacterized subfamily of the multidrug
           and toxic compound extrusion (MATE) proteins.  The
           integral membrane proteins from the MATE family are
           involved in exporting metabolites across the cell
           membrane and are responsible for multidrug resistance
           (MDR) in many bacteria and animals. A number of family
           members are involved in the synthesis of peptidoglycan
           components in bacteria.
          Length = 438

 Score = 30.5 bits (70), Expect = 1.2
 Identities = 27/156 (17%), Positives = 57/156 (36%), Gaps = 28/156 (17%)

Query: 46  YINDNDVDAEAMERMVSIIVPIFFGIIIILGFFGNALVVVVVAF-----NQQMRSTTNL- 99
             +DN V A      V +   I F +I+I  F      +++  +      ++ R    + 
Sbjct: 32  RYSDNAVAA------VGVANQILFLLILIFSFISTGTSILIAQYLGAKDKKKARQIAGVA 85

Query: 100 LIINLAIADLLFIVCCVPFTATDYVTTCWPFGDVWCKMYLI--VVTAYASVYTLVLMSLD 157
           L+ NL +  +  ++                FG    ++  +   + A  S Y L ++   
Sbjct: 86  LLFNLLLGLVFSLLLLF-------------FGAELLRLMGLPPELLALGSTY-LRIVGGS 131

Query: 158 RFLAVVHPIASMSVRTERNALKAILITWIVIVLTAV 193
            FL  +    S  +R+       + ++ I+ +L  V
Sbjct: 132 LFLQALSLTLSAILRSHGRTKIPMYVSLIMNILNIV 167


>gnl|CDD|223079 PHA03419, PHA03419, E4 protein; Provisional.
          Length = 200

 Score = 29.9 bits (67), Expect = 1.2
 Identities = 10/36 (27%), Positives = 18/36 (50%)

Query: 262 KAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
           K ++ K + +K   GG   ++  E + E E  E E+
Sbjct: 94  KKEKKKKETEKPAQGGEKPDQGPEAKGEGEGHEPED 129



 Score = 29.9 bits (67), Expect = 1.2
 Identities = 14/37 (37%), Positives = 20/37 (54%)

Query: 262 KAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           K KE K KE +K   GG + ++  E + E E  E E+
Sbjct: 93  KKKEKKKKETEKPAQGGEKPDQGPEAKGEGEGHEPED 129


>gnl|CDD|221857 pfam12923, RRP7, Ribosomal RNA-processing protein 7 (RRP7).  RRP7
           is an essential protein in yeast that is involved in
           pre-rRNA processing and ribosome assembly. It is
           speculated to be required for correct assembly of rpS27
           into the pre-ribosomal particle.
          Length = 131

 Score = 29.5 bits (67), Expect = 1.2
 Identities = 14/61 (22%), Positives = 34/61 (55%), Gaps = 8/61 (13%)

Query: 246 EVHEEEEEEEEEEEETKAKE--------TKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
           E +++ EEE +EE + +++         T+   K+K G    +  EE  + +E+++++++
Sbjct: 28  EKYDKREEEAKEEAKARSEPDEDGWTTVTRGGRKRKAGASRNKAAEERRKLKEKKKKKKK 87

Query: 298 E 298
           E
Sbjct: 88  E 88



 Score = 27.2 bits (61), Expect = 5.6
 Identities = 9/25 (36%), Positives = 14/25 (56%)

Query: 280 EEEEEEEEEEEEEEEEEEEIGGVVV 304
           ++ EEE +EE +   E +E G   V
Sbjct: 31  DKREEEAKEEAKARSEPDEDGWTTV 55


>gnl|CDD|183538 PRK12460, PRK12460, 2-keto-3-deoxygluconate permease; Provisional.
          Length = 312

 Score = 30.3 bits (69), Expect = 1.3
 Identities = 8/46 (17%), Positives = 21/46 (45%), Gaps = 6/46 (13%)

Query: 60  MVSIIVPIFFGIII------ILGFFGNALVVVVVAFNQQMRSTTNL 99
           +V+ ++P+  G+I+      +  F      +++  F   + +  NL
Sbjct: 166 LVAALLPLVLGMILGNLDPDMRKFLTKGGPLLIPFFAFALGAGINL 211


>gnl|CDD|218270 pfam04795, PAPA-1, PAPA-1-like conserved region.  Family of
           proteins with a conserved region found in PAPA-1, a
           PAP-1 binding protein.
          Length = 89

 Score = 28.5 bits (64), Expect = 1.3
 Identities = 12/46 (26%), Positives = 25/46 (54%)

Query: 251 EEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
           E+  EEE+ ET  K  K + ++K      ++ +E++ +E+    E+
Sbjct: 18  EKRAEEEKMETINKLLKKQARRKKREDKIKKGDEKKAQEKAARAEK 63


>gnl|CDD|220112 pfam09110, HAND, HAND.  The HAND domain adopts a secondary
           structure consisting of four alpha helices, three of
           which (H2, H3, H4) form an L-like configuration. Helix
           H2 runs antiparallel to helices H3 and H4, packing
           closely against helix H4, whilst helix H1 reposes in the
           concave surface formed by these three helices and runs
           perpendicular to them. The domain confers DNA and
           nucleosome binding properties to the protein.
          Length = 109

 Score = 28.8 bits (65), Expect = 1.3
 Identities = 10/23 (43%), Positives = 16/23 (69%)

Query: 277 GGGEEEEEEEEEEEEEEEEEEEI 299
            G  +EE+EEE E + + E+E+I
Sbjct: 66  VGDGDEEDEEEREAKRKLEQEKI 88



 Score = 28.8 bits (65), Expect = 1.4
 Identities = 18/52 (34%), Positives = 27/52 (51%), Gaps = 6/52 (11%)

Query: 247 VHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           + +  + +EE+EEE +AK    +EK          E   EEEEEE++   EE
Sbjct: 63  LDDVGDGDEEDEEEREAKRKLEQEKID------NAEPLTEEEEEEKQRLLEE 108



 Score = 28.4 bits (64), Expect = 1.9
 Identities = 12/34 (35%), Positives = 18/34 (52%)

Query: 263 AKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
            K+   K      G G EE+EEE E + + E+E+
Sbjct: 54  KKKIGYKVPLDDVGDGDEEDEEEREAKRKLEQEK 87



 Score = 26.9 bits (60), Expect = 6.0
 Identities = 14/34 (41%), Positives = 21/34 (61%), Gaps = 7/34 (20%)

Query: 271 KKKLG-------GGGGEEEEEEEEEEEEEEEEEE 297
           KKK+G        G G+EE+EEE E + + E+E+
Sbjct: 54  KKKIGYKVPLDDVGDGDEEDEEEREAKRKLEQEK 87


>gnl|CDD|223608 COG0534, NorM, Na+-driven multidrug efflux pump [Defense
           mechanisms].
          Length = 455

 Score = 30.3 bits (69), Expect = 1.3
 Identities = 32/159 (20%), Positives = 53/159 (33%), Gaps = 40/159 (25%)

Query: 61  VSIIVPIFFGIIIILGFFGNALVVVVVA-------FNQQMRSTTNLLIINLAIADLLFIV 113
           V +  PIFF II I    G     V+VA         +  R     L++ L +  LL I+
Sbjct: 55  VGLANPIFFLIIAIFIGLGTG-TTVLVAQAIGAGDRKKAKRVLGQGLLLALLLGLLLAIL 113

Query: 114 CCVPFTATDYVTTCWPFGDVWCKMYLI----VVTAYASVYTLVLMSLDRFLAVVHPIASM 169
                           F +    + L+     V   A+ Y  ++      L     + S 
Sbjct: 114 -------------LLFFAEPL--LRLLGAPAEVLELAAEYLRII-----LLGAPFALLSF 153

Query: 170 S----VRTERNALKAILITWIV----IVLTAVPVFNMHG 200
                +R   +    + I  +     IVL  + +F + G
Sbjct: 154 VLSGILRGLGDTKTPMYILLLGNLLNIVLNYLLIFGLFG 192


>gnl|CDD|232958 TIGR00399, metG_C_term, methionyl-tRNA synthetase C-terminal
           region/beta chain.  The methionyl-tRNA synthetase (metG)
           is a class I amino acyl-tRNA ligase. This model
           describes a region of the methionyl-tRNA synthetase that
           is present at the C-terminus of MetG in some species (E.
           coli, B. subtilis, Thermotoga maritima, Methanobacterium
           thermoautotrophicum), and as a separate beta chain in
           Aquifex aeolicus. It is absent in a number of other
           species (e.g. Mycoplasma genitalium, Mycobacterium
           tuberculosis), while Pyrococcus horikoshii has both a
           full length MetG and a second protein homologous to the
           beta chain only. Proteins hit by This model should
           called methionyl-tRNA synthetase beta chain if and only
           if the model metG hits a separate protein not also hit
           by This model [Protein synthesis, tRNA aminoacylation].
          Length = 137

 Score = 29.3 bits (66), Expect = 1.3
 Identities = 12/31 (38%), Positives = 17/31 (54%)

Query: 255 EEEEEETKAKETKLKEKKKLGGGGGEEEEEE 285
           +++ EE K K  K KEKK  G    E ++E 
Sbjct: 1   DKKIEELKLKGAKKKEKKDEGEKALEPQKET 31


>gnl|CDD|221179 pfam11711, Tim54, Inner membrane protein import complex subunit
           Tim54.  Mitochondrial function depends on the import of
           hundreds of different proteins synthesised in the
           cytosol. Protein import is a multi-step pathway which
           includes the binding of precursor proteins to surface
           receptors, translocation of the precursor across one or
           both mitochondrial membranes, and folding and assembly
           of the imported protein inside the mitochondrion. Most
           precursor proteins carry amino-terminal targeting
           signals, called pre-sequences, and are imported into
           mitochondria via import complexes located in both the
           outer and the inner membrane (IM). The IM complex, TIM,
           is made up of at least two proteins which mediate
           translocation of proteins into the matrix by removing
           their signal peptide and another pair of proteins, Tim54
           and Tim22, that insert the polytopic proteins, that
           carry internal targetting information, into the inner
           membrane.
          Length = 377

 Score = 30.5 bits (69), Expect = 1.3
 Identities = 12/51 (23%), Positives = 17/51 (33%)

Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
           E  E   +E   E E +A        +          EE E+  EEE  + 
Sbjct: 199 EPPEPTVDEAAPETEVEATPAAESPAEPAEETAETTPEETEDAPEEENNKP 249



 Score = 28.9 bits (65), Expect = 3.7
 Identities = 14/55 (25%), Positives = 20/55 (36%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVV 303
             E   +E   ET+ + T   E          E   EE E+  EEE  + +   V
Sbjct: 200 PPEPTVDEAAPETEVEATPAAESPAEPAEETAETTPEETEDAPEEENNKPVKPPV 254



 Score = 28.2 bits (63), Expect = 5.9
 Identities = 15/60 (25%), Positives = 24/60 (40%), Gaps = 13/60 (21%)

Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVV 304
           R+  EE EEE   EEE   ++     ++ LG                + ++    GGV+V
Sbjct: 128 RKAGEEPEEELPLEEEATKEDDLADPRQVLG-------------VYYKAKDPPGAGGVIV 174


>gnl|CDD|223605 COG0531, PotE, Amino acid transporters [Amino acid transport and
           metabolism].
          Length = 466

 Score = 30.5 bits (69), Expect = 1.4
 Identities = 17/133 (12%), Positives = 45/133 (33%), Gaps = 16/133 (12%)

Query: 61  VSIIVPIFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTA 120
           + +I      ++ +L   G      + +          + I+ + I  L+FI+  +    
Sbjct: 134 IILIALALIALLTLLNLRGIKASAKINSI---------ITILKIIIL-LIFIILGLFAFG 183

Query: 121 TDYVTTCWPFGDVWCKMYLIVVTAYASVYTLVLMSLDRFLAVVHPIASMSVRTERNALKA 180
                   PF                +   L   +   F A+   +A      +R   +A
Sbjct: 184 FSNGNLFAPFNPGG-----GSFGGILAAILLAFFAFTGFEAIAT-LAEEVKNPKRTIPRA 237

Query: 181 ILITWIVIVLTAV 193
           I+++ +++++  +
Sbjct: 238 IILSLLIVLILYI 250


>gnl|CDD|221668 pfam12619, MCM2_N, Mini-chromosome maintenance protein 2.  This
           domain family is found in eukaryotes, and is typically
           between 138 and 153 amino acids in length. The family is
           found in association with pfam00493. Mini-chromosome
           maintenance (MCM) proteins are essential for DNA
           replication. These proteins use ATPase activity to
           perform this function.
          Length = 145

 Score = 29.2 bits (66), Expect = 1.4
 Identities = 15/50 (30%), Positives = 25/50 (50%), Gaps = 18/50 (36%)

Query: 252 EEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
           E+E +EEEE                   E++ ++ +EE EEEE+ E++ G
Sbjct: 1   EDEPDEEEEL------------------EDDIDDLDEEAEEEEDGEDLFG 32



 Score = 26.9 bits (60), Expect = 8.6
 Identities = 11/26 (42%), Positives = 17/26 (65%)

Query: 279 GEEEEEEEEEEEEEEEEEEEIGGVVV 304
            EEEE E++ ++ +EE EEE  G  +
Sbjct: 5   DEEEELEDDIDDLDEEAEEEEDGEDL 30


>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
           [Function unknown].
          Length = 294

 Score = 30.0 bits (68), Expect = 1.4
 Identities = 20/93 (21%), Positives = 39/93 (41%), Gaps = 14/93 (15%)

Query: 226 DGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETK----AKETKLKEK-----KKLGG 276
             Y +   +       G   +  E E E  E++++T      +E +L +K     K+L  
Sbjct: 90  KEYRELKEKRNEFNLGGRSIKSLEREIERLEKKQQTSVLTPEEERELVQKIKELRKEL-- 147

Query: 277 GGGEEEEEEEEEEEEEEEEEEEIGGVVVSAVDF 309
              E+ ++  EE E+ +E + EI  +   A + 
Sbjct: 148 ---EDAKKALEENEKLKELKAEIDELKKKAREI 177


>gnl|CDD|233194 TIGR00930, 2a30, K-Cl cotransporter.  [Transport and binding
           proteins, Other].
          Length = 953

 Score = 30.5 bits (69), Expect = 1.4
 Identities = 15/60 (25%), Positives = 21/60 (35%), Gaps = 7/60 (11%)

Query: 256 EEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVSAVDFGSGGWW 315
           EE  +   K   L          G+  +E EE E+ E  +E+   G V         GW 
Sbjct: 29  EELHDLLDKVVSLLGPLADYTNNGQGMKEHEEAEDAEGTKEKPPAGAV-------KFGWV 81


>gnl|CDD|222948 PHA02941, PHA02941, hypothetical protein; Provisional.
          Length = 356

 Score = 30.3 bits (68), Expect = 1.5
 Identities = 17/43 (39%), Positives = 25/43 (58%), Gaps = 3/43 (6%)

Query: 264 KETKLKEKKKLGGGGGEEEE---EEEEEEEEEEEEEEEIGGVV 303
           KE K + +++L      +EE   E +E++EEEE EEEE    V
Sbjct: 311 KEPKQESQEQLFNPFAIDEEMLEETQEQQEEEENEEEEENDTV 353



 Score = 29.6 bits (66), Expect = 2.6
 Identities = 17/42 (40%), Positives = 25/42 (59%), Gaps = 1/42 (2%)

Query: 256 EEEEETKAK-ETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
            EE+E K + + +L     +     EE +E++EEEE EEEEE
Sbjct: 308 GEEKEPKQESQEQLFNPFAIDEEMLEETQEQQEEEENEEEEE 349


>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
           This is a family of fungal proteins whose function is
           unknown.
          Length = 130

 Score = 29.2 bits (66), Expect = 1.5
 Identities = 9/30 (30%), Positives = 13/30 (43%)

Query: 253 EEEEEEEETKAKETKLKEKKKLGGGGGEEE 282
            E  +E   K K T+   +KK      E+E
Sbjct: 100 RERTKERAEKEKRTRKNREKKFKRRQKEKE 129


>gnl|CDD|237284 PRK13108, PRK13108, prolipoprotein diacylglyceryl transferase;
           Reviewed.
          Length = 460

 Score = 30.3 bits (68), Expect = 1.5
 Identities = 13/87 (14%), Positives = 24/87 (27%)

Query: 212 TACVFLDLDPVNRPDGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEK 271
              V  ++  V         +  ++ D       E   E + E E+  +   +     + 
Sbjct: 330 AEAVKAEVAEVTDEVAAESVVQVADRDGESTPAVEETSEADIEREQPGDLAGQAPAAHQV 389

Query: 272 KKLGGGGGEEEEEEEEEEEEEEEEEEE 298
                    EE      E  +E E E 
Sbjct: 390 DAEAASAAPEEPAALASEAHDETEPEV 416


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 30.4 bits (69), Expect = 1.6
 Identities = 21/74 (28%), Positives = 29/74 (39%), Gaps = 8/74 (10%)

Query: 234 QSENDYCKGGEREVHEEEE-------EEEEEEEETKAKETKL-KEKKKLGGGGGEEEEEE 285
           Q +    K   + + +E E       E EEE EE +A    L      L     E E + 
Sbjct: 839 QEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQL 898

Query: 286 EEEEEEEEEEEEEI 299
            E E + EE E +I
Sbjct: 899 RELERKIEELEAQI 912



 Score = 29.3 bits (66), Expect = 3.5
 Identities = 20/64 (31%), Positives = 29/64 (45%), Gaps = 8/64 (12%)

Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKK--------LGGGGGEEEEEEEEEEEEEEEEEE 297
            +  E+E  E+E +E + +   LKE+ K        L G   E EEE EE E    + E 
Sbjct: 823 RLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLES 882

Query: 298 EIGG 301
            +G 
Sbjct: 883 RLGD 886



 Score = 28.9 bits (65), Expect = 4.6
 Identities = 18/69 (26%), Positives = 30/69 (43%), Gaps = 11/69 (15%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLK-----------EKKKLGGGGGEEEEEEEEEEEEE 292
           E+E+ + E+EEE+ +E  +  E  L            E K+L     E EE+  + EE  
Sbjct: 722 EKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEAL 781

Query: 293 EEEEEEIGG 301
            + E  +  
Sbjct: 782 NDLEARLSH 790


>gnl|CDD|220098 pfam09057, Smac_DIABLO, Second Mitochondria-derived Activator of
           Caspases.  Second Mitochondria-derived Activator of
           Caspases promotes apoptosis by activating caspases in
           the cytochrome c/Apaf-1/caspase-9 pathway, and by
           opposing the inhibitory activity of inhibitor of
           apoptosis proteins (XIAP-BIR3). The protein assumes an
           elongated three-helix bundle structure, and forms a
           dimer in solution.
          Length = 234

 Score = 29.9 bits (67), Expect = 1.6
 Identities = 11/42 (26%), Positives = 20/42 (47%)

Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEE 291
           EE  +  +E E K  E+K +E +++       +  E E+  E
Sbjct: 187 EEVRQLSKEAEKKLAESKAEEIQRMAEYASSIDLSELEDIPE 228


>gnl|CDD|221756 pfam12757, DUF3812, Protein of unknown function (DUF3812).  This is
           a family of fungal proteins whose function is not known.
          Length = 126

 Score = 28.8 bits (65), Expect = 1.6
 Identities = 14/54 (25%), Positives = 21/54 (38%), Gaps = 17/54 (31%)

Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           E+ E  E +   +EE K                  +EEE + + EE +E E E 
Sbjct: 88  EIDERAEAQRARDEEKK-----------------LDEEEAKRQHEEAKEREREK 124


>gnl|CDD|218597 pfam05466, BASP1, Brain acid soluble protein 1 (BASP1 protein).
           This family consists of several brain acid soluble
           protein 1 (BASP1) or neuronal axonal membrane protein
           NAP-22. The BASP1 is a neuron enriched Ca(2+)-dependent
           calmodulin-binding protein of unknown function.
          Length = 233

 Score = 29.8 bits (66), Expect = 1.6
 Identities = 18/55 (32%), Positives = 26/55 (47%), Gaps = 1/55 (1%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           E    +E EE +   E T+ KE K +EK          + EE+E E+E    +EE
Sbjct: 32  EEGTPKENEEAQAAAETTEVKEAK-EEKPDKDAQDTANKTEEKEGEKEAAAAKEE 85


>gnl|CDD|143643 cd07883, RHD-n_NFkB, N-terminal sub-domain of the Rel homology
           domain (RHD) of nuclear factor of kappa light
           polypeptide gene enhancer in B-cells (NF-kappa B).
           Proteins containing the Rel homology domain (RHD) are
           metazoan transcription factors. The RHD is composed of
           two structural sub-domains; this model characterizes the
           N-terminal RHD sub-domain of the NF-kappa B1 and B2
           families of transcription factors, also referred to as
           class I members of the NF-kappa B family. In class I
           NF-kappa Bs, the RHD domain co-occurs with C-terminal
           ankyrin repeats. Family members include NF-kappa B1 and
           NF-kappa B2. NF-kappa B1 is commonly referred to as p105
           or p50 (proteolytically processed form), while NF-kappa
           B2 is called p100 or p52 (proteolytically processed
           form). NF-kappa B proteins are part of a protein complex
           that acts as a transcription factor, which is
           responsible for regulating a host of cellular responses
           to a variety of stimuli. This complex tightly regulates
           the expression of a large number of genes, and is
           involved in processes such as adaptive and innate
           immunity, stress response, inflammation, cell adhesion,
           proliferation and apoptosis. The cytosolic NF-kappa B
           complex is activated via phosphorylation of the
           ankyrin-repeat containing inhibitory protein I-kappa B,
           which dissociates from the complex and exposes the
           nuclear localization signal of the heterodimer (NF-kappa
           B and REL). p105 and p100 may also act as I-kappa Bs due
           to their C-terminal ankyrin repeats.
          Length = 197

 Score = 29.4 bits (66), Expect = 1.6
 Identities = 10/41 (24%), Positives = 19/41 (46%)

Query: 228 YNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKL 268
                     D   GG+R++ +EE+ E  ++ + +AK   L
Sbjct: 123 GYNPGDLVHVDAEGGGDRQLTDEEQAEIRQKAKQQAKSMDL 163


>gnl|CDD|225880 COG3343, RpoE, DNA-directed RNA polymerase, delta subunit
           [Transcription].
          Length = 175

 Score = 29.3 bits (66), Expect = 1.7
 Identities = 17/64 (26%), Positives = 36/64 (56%), Gaps = 6/64 (9%)

Query: 236 ENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
           E DY +  E E  +E +  ++E ++    + ++ E         E++E +E+E+++E+EE
Sbjct: 117 ELDYDEDKEEEEDDEVDSLDDENDDEDEDDDEIVE------ILIEDDEVDEDEDDDEDEE 170

Query: 296 EEEI 299
           +EE 
Sbjct: 171 DEED 174



 Score = 29.3 bits (66), Expect = 1.7
 Identities = 17/79 (21%), Positives = 38/79 (48%)

Query: 221 PVNRPDGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGE 280
           P++  D   + + + ++   K  E +  EE +E+E + +E K +E   +          E
Sbjct: 83  PLDEIDEEIQAMTEKKDIKAKDKEVDAFEEGDEDELDYDEDKEEEEDDEVDSLDDENDDE 142

Query: 281 EEEEEEEEEEEEEEEEEEI 299
           +E+++E  E   E++E + 
Sbjct: 143 DEDDDEIVEILIEDDEVDE 161


>gnl|CDD|237799 PRK14715, PRK14715, DNA polymerase II large subunit; Provisional.
          Length = 1627

 Score = 30.2 bits (68), Expect = 1.7
 Identities = 17/38 (44%), Positives = 23/38 (60%), Gaps = 5/38 (13%)

Query: 264 KETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
           KE K K+++K      EE+ EE + EE +EE EEE  G
Sbjct: 277 KELKEKKEEK-----DEEKSEEVKTEEVDEEFEEEEKG 309



 Score = 29.8 bits (67), Expect = 2.7
 Identities = 18/55 (32%), Positives = 27/55 (49%), Gaps = 12/55 (21%)

Query: 253 EEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVSAV 307
           E++EE++E K++E K             EE +EE EEEE+    E    V + A 
Sbjct: 281 EKKEEKDEEKSEEVK------------TEEVDEEFEEEEKGFYYELYEKVNIEAN 323


>gnl|CDD|184287 PRK13735, PRK13735, conjugal transfer mating pair stabilization
           protein TraG; Provisional.
          Length = 942

 Score = 30.1 bits (68), Expect = 1.7
 Identities = 11/59 (18%), Positives = 23/59 (38%), Gaps = 3/59 (5%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEE---EEEEEEEI 299
           E  + + +     EE   K + ++L+   K        +  EE+  +E     +  EE+
Sbjct: 871 EGNIGDTQNSIRGEENTVKGQYSELQNHHKTEALSQNNKYNEEKSAQERMPGADSPEEL 929


>gnl|CDD|219621 pfam07890, Rrp15p, Rrp15p.  Rrp15p is required for the formation of
           60S ribosomal subunits.
          Length = 132

 Score = 28.9 bits (65), Expect = 1.7
 Identities = 7/29 (24%), Positives = 18/29 (62%)

Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKK 273
           +++ + +++ + E+ E KAK     EK++
Sbjct: 28  KKLLKAKKKLKSEKLEKKAKRQLRAEKRQ 56


>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
           [Intracellular trafficking and secretion].
          Length = 776

 Score = 30.3 bits (68), Expect = 1.7
 Identities = 16/61 (26%), Positives = 26/61 (42%), Gaps = 11/61 (18%)

Query: 244 EREVHEEEEEEE------EEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
           E+  +EE E ++          E +  +    EK ++     E    EE E EEE E+ E
Sbjct: 339 EKWGNEEAERKDYILDSSSVPLEKQFDDILYFEKMEI-----ENRNPEESEHEEEVEDYE 393

Query: 298 E 298
           +
Sbjct: 394 D 394



 Score = 29.2 bits (65), Expect = 3.2
 Identities = 11/47 (23%), Positives = 21/47 (44%), Gaps = 8/47 (17%)

Query: 248 HEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
               E++ ++    +  E  ++ +        EE E EEE E+ E+E
Sbjct: 357 SVPLEKQFDDILYFEKME--IENRNP------EESEHEEEVEDYEDE 395



 Score = 28.8 bits (64), Expect = 4.3
 Identities = 14/54 (25%), Positives = 22/54 (40%)

Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           R ++ E+   EE E +    ++     +K        E+ E E    EE E EE
Sbjct: 334 RNLNNEKWGNEEAERKDYILDSSSVPLEKQFDDILYFEKMEIENRNPEESEHEE 387


>gnl|CDD|180207 PRK05696, fliL, flagellar basal body-associated protein FliL;
           Reviewed.
          Length = 170

 Score = 29.2 bits (66), Expect = 1.8
 Identities = 18/70 (25%), Positives = 22/70 (31%), Gaps = 26/70 (37%)

Query: 250 EEEEEEEEEEETKAKETKLKEKKKL------------GGGGG-------EEEEEEEEEEE 290
           EEE  E E+E  K+       KKKL            GGGG         ++      E 
Sbjct: 3   EEEALELEDEAPKS-------KKKLIIIIVIGVLLALGGGGAAWFFMGSSDKAAAAAAEA 55

Query: 291 EEEEEEEEIG 300
             E      G
Sbjct: 56  AAEAAAAVKG 65


>gnl|CDD|222300 pfam13664, DUF4149, Domain of unknown function (DUF4149). 
          Length = 98

 Score = 28.3 bits (64), Expect = 1.8
 Identities = 8/49 (16%), Positives = 20/49 (40%), Gaps = 2/49 (4%)

Query: 59  RMVSIIVPIFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIA 107
            +   + P++F + + L      L+  ++   +  R    LL + L + 
Sbjct: 32  ALQGKLFPVYFLLGLALAVL--LLLTELLLGGKAERWQLLLLAVLLLLT 78


>gnl|CDD|240331 PTZ00254, PTZ00254, 40S ribosomal protein SA; Provisional.
          Length = 249

 Score = 29.6 bits (67), Expect = 1.9
 Identities = 12/52 (23%), Positives = 14/52 (26%), Gaps = 8/52 (15%)

Query: 247 VHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
              + EE EE+EE                        E EE EE       E
Sbjct: 203 FWRDPEEAEEKEEAAAETAGV--------EDAAAAAAEAEEGEEWVAAANNE 246



 Score = 28.5 bits (64), Expect = 4.3
 Identities = 10/34 (29%), Positives = 15/34 (44%)

Query: 282 EEEEEEEEEEEEEEEEEIGGVVVSAVDFGSGGWW 315
           + EE EE+EE   E   +     +A +   G  W
Sbjct: 206 DPEEAEEKEEAAAETAGVEDAAAAAAEAEEGEEW 239


>gnl|CDD|235548 PRK05657, PRK05657, RNA polymerase sigma factor RpoS; Validated.
          Length = 325

 Score = 29.5 bits (67), Expect = 1.9
 Identities = 11/55 (20%), Positives = 21/55 (38%), Gaps = 9/55 (16%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
             +VH+  E+ + +E   +  + K            EEE  + +  EEE   +  
Sbjct: 2   TLKVHDLNEDADFDENGVEVFDEKALV---------EEEPSDNDLAEEELLSQGA 47


>gnl|CDD|152814 pfam12379, DUF3655, Protein of unknown function (DUF3655).  This
           domain family is found in viruses, and is approximately
           70 amino acids in length. The family is found in
           association with pfam08716, pfam01661, pfam05409,
           pfam06471, pfam08717, pfam06478, pfam09401, pfam06460,
           pfam08715, pfam08710.
          Length = 70

 Score = 27.7 bits (61), Expect = 2.0
 Identities = 23/56 (41%), Positives = 29/56 (51%), Gaps = 3/56 (5%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKK---LGGGGGEEEEEEEEEEEEEEEEE 296
           E E  EE E EEEE EET   E   ++  K   L  G   E  + EEEEEE+  ++
Sbjct: 10  EEEDCEEYECEEEEIEETCEHEYGTEDDYKGLPLEFGASTEIVQVEEEEEEDWLDD 65


>gnl|CDD|221333 pfam11942, Spt5_N, Spt5 transcription elongation factor, acidic
           N-terminal.  This is the very acidic N-terminal region
           of the early transcription elongation factor Spt5. The
           Spt5-Spt4 complex regulates early transcription
           elongation by RNA polymerase II and has an imputed role
           in pre-mRNA processing via its physical association with
           mRNA capping enzymes. The actual function of this
           N-terminal domain is not known although it is
           dispensable for binding to Spt4.
          Length = 92

 Score = 28.2 bits (63), Expect = 2.0
 Identities = 11/24 (45%), Positives = 17/24 (70%)

Query: 279 GEEEEEEEEEEEEEEEEEEEIGGV 302
            +EEEEEEEEE++ E+  +E   +
Sbjct: 9   DDEEEEEEEEEDDLEDLSDEDEFI 32



 Score = 28.2 bits (63), Expect = 2.0
 Identities = 11/20 (55%), Positives = 16/20 (80%)

Query: 280 EEEEEEEEEEEEEEEEEEEI 299
           E ++EEEEEEEEE++ E+  
Sbjct: 7   EVDDEEEEEEEEEDDLEDLS 26



 Score = 27.4 bits (61), Expect = 2.9
 Identities = 11/22 (50%), Positives = 17/22 (77%)

Query: 281 EEEEEEEEEEEEEEEEEEIGGV 302
           E ++EEEEEEEEE++ E++   
Sbjct: 7   EVDDEEEEEEEEEDDLEDLSDE 28



 Score = 27.4 bits (61), Expect = 3.8
 Identities = 18/55 (32%), Positives = 28/55 (50%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           E EV +EEEEEEEEE++ +    + +   +            +   E+EEEE+ E
Sbjct: 5   EAEVDDEEEEEEEEEDDLEDLSDEDEFIDEAEAEDDRRHRRLDRRREKEEEEDAE 59



 Score = 26.6 bits (59), Expect = 5.4
 Identities = 13/28 (46%), Positives = 17/28 (60%)

Query: 282 EEEEEEEEEEEEEEEEEIGGVVVSAVDF 309
           + E E ++EEEEEEEEE     +S  D 
Sbjct: 3   DTEAEVDDEEEEEEEEEDDLEDLSDEDE 30



 Score = 26.6 bits (59), Expect = 6.1
 Identities = 11/19 (57%), Positives = 16/19 (84%)

Query: 280 EEEEEEEEEEEEEEEEEEE 298
           + E E ++EEEEEEEEE++
Sbjct: 3   DTEAEVDDEEEEEEEEEDD 21



 Score = 26.6 bits (59), Expect = 6.9
 Identities = 11/19 (57%), Positives = 15/19 (78%)

Query: 280 EEEEEEEEEEEEEEEEEEE 298
            E E ++EEEEEEEEE++ 
Sbjct: 4   TEAEVDDEEEEEEEEEDDL 22



 Score = 26.3 bits (58), Expect = 7.7
 Identities = 11/31 (35%), Positives = 17/31 (54%)

Query: 281 EEEEEEEEEEEEEEEEEEIGGVVVSAVDFGS 311
           +EEEEEEEEE++ E+  +    +  A     
Sbjct: 10  DEEEEEEEEEDDLEDLSDEDEFIDEAEAEDD 40


>gnl|CDD|216828 pfam01984, dsDNA_bind, Double-stranded DNA-binding domain.  This
           domain is believed to bind double-stranded DNA of 20
           bases length.
          Length = 105

 Score = 28.3 bits (64), Expect = 2.0
 Identities = 9/30 (30%), Positives = 21/30 (70%)

Query: 264 KETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
           +  +L E ++  GG  E++++EE++++EE 
Sbjct: 2   RRARLAELQQQQGGLEEQQQQEEQQQQEEA 31


>gnl|CDD|225871 COG3334, COG3334, Uncharacterized conserved protein [Function
           unknown].
          Length = 192

 Score = 29.4 bits (66), Expect = 2.0
 Identities = 18/62 (29%), Positives = 27/62 (43%), Gaps = 10/62 (16%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVV 303
           ++E+ E+ ++  E  E  KA E K  E K L          EEE E     ++ E G +V
Sbjct: 76  QKELLEKLKDLAEVNERLKALEKKKAELKDL----------EEEREGILRSKQAEDGKLV 125

Query: 304 VS 305
             
Sbjct: 126 KI 127


>gnl|CDD|218482 pfam05178, Kri1, KRI1-like family.  The yeast member of this family
           (Kri1p) is found to be required for 40S ribosome
           biogenesis in the nucleolus.
          Length = 99

 Score = 28.0 bits (63), Expect = 2.0
 Identities = 16/57 (28%), Positives = 30/57 (52%), Gaps = 4/57 (7%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEK----KKLGGGGGEEEEEEEEEEEEEEEEE 296
           ER+  E+ + EEE +     K  +++EK    KK+ G  G +  EE+  + + + E+
Sbjct: 2   ERKEEEKAQREEELKRLKNLKREEIEEKLEKIKKVAGLRGADLSEEDLADGDFDPEK 58


>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
           biogenesis [Translation, ribosomal structure and
           biogenesis].
          Length = 1077

 Score = 30.1 bits (67), Expect = 2.0
 Identities = 15/55 (27%), Positives = 28/55 (50%)

Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
           GE E+ +++E+   E+ E +   +  + ++  G     E EE  +E + E E EE
Sbjct: 591 GEEELIQDDEKGNFEDLEDEENSSDNEMEESRGSSVTAENEESADEVDYETEREE 645



 Score = 28.6 bits (63), Expect = 5.8
 Identities = 19/84 (22%), Positives = 37/84 (44%), Gaps = 1/84 (1%)

Query: 215 VFLDLDPVNRPDGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKL 274
           +  D +  N  D  ++            G     E EE  +E + ET+ +E   ++K++L
Sbjct: 595 LIQDDEKGNFEDLEDEENSSDNEMEESRGSSVTAENEESADEVDYETE-REENARKKEEL 653

Query: 275 GGGGGEEEEEEEEEEEEEEEEEEE 298
            G    EE  + E+++ +   EE+
Sbjct: 654 RGNFELEERGDPEKKDVDWYTEEK 677


>gnl|CDD|220672 pfam10277, Frag1, Frag1/DRAM/Sfk1 family.  This family includes
           Frag1, DRAM and Sfk1 proteins. Frag1 (FGF receptor
           activating protein 1) is a protein that is conserved
           from fungi to humans. There are four potential
           iso-prenylation sites throughout the peptide, viz CILW,
           CIIW and CIGL. Frag1 is a membrane-spanning protein that
           is ubiquitously expressed in adult tissues suggesting an
           important cellular function. Dram is a family of
           proteins conserved from nematodes to humans with six
           hydrophobic transmembrane regions and an Endoplasmic
           Reticulum signal peptide. It is a lysosomal protein that
           induces macro-autophagy as an effector of p53-mediated
           death, where p53 is the tumour-suppressor gene that is
           frequently mutated in cancer. Expression of Dram is
           stress-induced. This region is also part of a family of
           small plasma membrane proteins, referred to as Sfk1,
           that may act together with or upstream of Stt4p to
           generate normal levels of the essential phospholipid
           PI4P, thus allowing proper localisation of Stt4p to the
           actin cytoskeleton.
          Length = 216

 Score = 29.1 bits (66), Expect = 2.1
 Identities = 6/57 (10%), Positives = 16/57 (28%), Gaps = 1/57 (1%)

Query: 67  IFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTATDY 123
           I   +  + G     L   +               + L ++ ++  +  +PF     
Sbjct: 122 IGAILFFVGGLLYMLLQTALSYRLGPTPVLRKSFRLKLILS-IIAFISAIPFGVFFI 177



 Score = 29.1 bits (66), Expect = 2.1
 Identities = 11/86 (12%), Positives = 30/86 (34%), Gaps = 8/86 (9%)

Query: 71  IIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTATDYVTT-CWP 129
           + ++ G  G   +  V  F      +T    ++   A +LF V  + +       +    
Sbjct: 93  LALVFGIIGALGLGGVANF-----QSTEDHDVHDIGA-ILFFVGGLLYMLLQTALSYRLG 146

Query: 130 FGDVWCKMYLI-VVTAYASVYTLVLM 154
              V  K + + ++ +  +  + +  
Sbjct: 147 PTPVLRKSFRLKLILSIIAFISAIPF 172


>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6.  RPL6
           contains KOW motif that has an extra ribosomal role as
           an oncogenic. KOW domain is known as an RNA-binding
           motif that is shared so far among some families of
           ribosomal proteins, the essential bacterial
           transcriptional elongation factor NusG, the eukaryotic
           chromatin elongation factor Spt5, the higher eukaryotic
           KIN17 proteins and Mtr4. .
          Length = 152

 Score = 28.7 bits (65), Expect = 2.1
 Identities = 13/44 (29%), Positives = 27/44 (61%)

Query: 264 KETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVSAV 307
           K  K K+KKK  G   EE++++    EE +E+++ +   +++A+
Sbjct: 82  KRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKAVDAALLAAI 125


>gnl|CDD|224035 COG1110, COG1110, Reverse gyrase [DNA replication, recombination,
           and repair].
          Length = 1187

 Score = 30.0 bits (68), Expect = 2.1
 Identities = 16/51 (31%), Positives = 24/51 (47%)

Query: 255 EEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVS 305
            EE  E+  +  KL+ K           EE  E E E E++  ++G +VVS
Sbjct: 232 SEEVIESAYELIKLRRKLYGEKRAERVREELREVEREREKKRRKLGILVVS 282


>gnl|CDD|218301 pfam04871, Uso1_p115_C, Uso1 / p115 like vesicle tethering protein,
           C terminal region.  Also known as General vesicular
           transport factor, Transcytosis associate protein (TAP)
           and Vesicle docking protein, this myosin-shaped molecule
           consists of an N-terminal globular head region, a
           coiled-coil tail which mediates dimerisation, and a
           short C-terminal acidic region. p115 tethers COP1
           vesicles to the Golgi by binding the coiled coil
           proteins giantin (on the vesicles) and GM130 (on the
           Golgi), via its C-terminal acidic region. It is required
           for intercisternal transport in the golgi stack. This
           family consists of the acidic C-terminus, which binds to
           the golgins giantin and GM130. p115 is thought to
           juxtapose two membranes by binding giantin with one
           acidic region, and GM130 with another.
          Length = 136

 Score = 28.8 bits (64), Expect = 2.2
 Identities = 21/62 (33%), Positives = 33/62 (53%), Gaps = 9/62 (14%)

Query: 246 EVHEEEEEEEEEEEE--------TKAKETKLKEK-KKLGGGGGEEEEEEEEEEEEEEEEE 296
           ++ EE  +E ++E++         + K TK KEK K LG     +E  E   +E E+EEE
Sbjct: 73  KLDEEALKESQKEQDDLLVLLADQEEKVTKYKEKLKDLGVPVSNDEPLEAIGDESEDEEE 132

Query: 297 EE 298
           +E
Sbjct: 133 DE 134


>gnl|CDD|227504 COG5177, COG5177, Uncharacterized conserved protein [Function
           unknown].
          Length = 769

 Score = 29.7 bits (66), Expect = 2.2
 Identities = 19/67 (28%), Positives = 28/67 (41%), Gaps = 8/67 (11%)

Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKL--------KEKKKLGGGGGEEEEEEEEEEEEE 292
           KG      +  E+EEEE+ +   +E+ +        KE       G EE   ++ E  EE
Sbjct: 383 KGTSFYQAKWAEDEEEEDGQCNDEESTMSAIDDDDPKENDNEEVAGDEESAIDDNEGFEE 442

Query: 293 EEEEEEI 299
              EEE 
Sbjct: 443 LSPEEEE 449


>gnl|CDD|224535 COG1620, LldP, L-lactate permease [Energy production and
           conversion].
          Length = 522

 Score = 29.5 bits (67), Expect = 2.2
 Identities = 11/54 (20%), Positives = 22/54 (40%), Gaps = 4/54 (7%)

Query: 154 MSLDRFLAVVHPIASMSVR-TERN--ALKAILITWIVIVLTAVPVFNMHGEVSY 204
           + L   +A++ PI    +           A  IT ++ +L A+  F M   ++ 
Sbjct: 2   LWLSALVALL-PIVLFFLLLVVLKLKGYVAAPITLLLTLLIALFYFKMPVVMAS 54


>gnl|CDD|188087 TIGR00841, bass, bile acid transporter.  The Bile Acid:Na+
           Symporter (BASS) Family (TC 2.A.28) Functionally
           characterized members of the BASS family catalyze
           Na+:bile acid symport. These systems have been
           identified in intestinal, liver and kidney tissues of
           animals. These symporters exhibit broad specificity,
           taking up a variety of non bile organic compounds as
           well as taurocholate and other bile salts. Functionally
           uncharacterised homologues are found in plants, yeast,
           archaea and bacteria [Transport and binding proteins,
           Carbohydrates, organic alcohols, and acids].
          Length = 286

 Score = 29.6 bits (67), Expect = 2.2
 Identities = 17/85 (20%), Positives = 32/85 (37%), Gaps = 8/85 (9%)

Query: 72  IIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTATDYVTTCWPFG 131
            +I+G      ++ +  F         +  +   +A  + IV C P      V T    G
Sbjct: 39  GVIIGLLAQYGIMPLTGFLLAK-----VFKLPPELAVGVLIVGCCPGGTASNVFTYLLKG 93

Query: 132 DVWCKMYLIVVTAYASVYTLVLMSL 156
           D+      I +T  +++  L +M L
Sbjct: 94  DM---ALSISMTTCSTLLALGMMPL 115


>gnl|CDD|224401 COG1484, DnaC, DNA replication protein [DNA replication,
           recombination, and repair].
          Length = 254

 Score = 29.3 bits (66), Expect = 2.3
 Identities = 10/38 (26%), Positives = 16/38 (42%)

Query: 233 FQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKE 270
            +   D  + G  E  E   EEE+   E +  E +L+ 
Sbjct: 30  LEQAADCKEWGYAEFLEYLLEEEKLAREARKIERRLRS 67


>gnl|CDD|227563 COG5238, RNA1, Ran GTPase-activating protein (RanGAP) involved in
           mRNA processing and transport [Signal transduction
           mechanisms / RNA processing and modification].
          Length = 388

 Score = 29.5 bits (66), Expect = 2.4
 Identities = 10/30 (33%), Positives = 15/30 (50%)

Query: 270 EKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           E  +   G    EE  +EE E+E E +E +
Sbjct: 333 EVVEKQEGDVVTEESTDEESEDEVEIDESV 362



 Score = 28.7 bits (64), Expect = 3.8
 Identities = 15/59 (25%), Positives = 21/59 (35%), Gaps = 12/59 (20%)

Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
                EV E++E +   EE T  +               E E +E   EE  E E  E+
Sbjct: 328 IFEVVEVVEKQEGDVVTEESTDEESED------------EVEIDESVIEEVAEMELLEV 374



 Score = 28.7 bits (64), Expect = 4.2
 Identities = 15/65 (23%), Positives = 26/65 (40%), Gaps = 10/65 (15%)

Query: 236 ENDYCKGGEREVHE-EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
           E    + G+    E  +EE E+E E  ++   ++ E          E  E + ++  E  
Sbjct: 333 EVVEKQEGDVVTEESTDEESEDEVEIDESVIEEVAEM---------ELLEVQVDDLAERL 383

Query: 295 EEEEI 299
            E EI
Sbjct: 384 AETEI 388


>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
           unknown].
          Length = 652

 Score = 29.7 bits (67), Expect = 2.4
 Identities = 20/58 (34%), Positives = 26/58 (44%), Gaps = 3/58 (5%)

Query: 245 REVHEEEEEEEEEEEETKAKE---TKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
            +V EEE   E+E  E + +       K  KKL       EEE  E + E EE + EI
Sbjct: 395 SKVKEEERPREKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELEELKREI 452


>gnl|CDD|240413 PTZ00423, PTZ00423, glideosome-associated protein 45; Provisional.
          Length = 193

 Score = 28.9 bits (64), Expect = 2.4
 Identities = 19/54 (35%), Positives = 31/54 (57%), Gaps = 4/54 (7%)

Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           E+ E E+ ++E EE  + K   + E+ +       EE  E+EEE EE++ +EEI
Sbjct: 20  ELAEREKLKKEVEEIPEQKPEDIVEELE----DQPEEPPEQEEENEEQKPKEEI 69



 Score = 27.8 bits (61), Expect = 5.9
 Identities = 18/55 (32%), Positives = 29/55 (52%), Gaps = 7/55 (12%)

Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           R++ E  E E+ ++E  +  E K ++         EE E++ EE  E+EEE EE 
Sbjct: 16  RDIDELAEREKLKKEVEEIPEQKPEDIV-------EELEDQPEEPPEQEEENEEQ 63


>gnl|CDD|218806 pfam05911, DUF869, Plant protein of unknown function (DUF869).
           This family consists of a number of sequences found in
           Arabidopsis thaliana, Oryza sativa and Lycopersicon
           esculentum (Tomato). The function of this family is
           unknown.
          Length = 767

 Score = 29.5 bits (66), Expect = 2.4
 Identities = 12/51 (23%), Positives = 26/51 (50%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           + E E +  +E+  + E +L+++++       +  E EE+ E  E+EE   
Sbjct: 668 DLEAEHKSLQEKISSLEDELEKERQNHQELIAKCRELEEKIERAEQEENMQ 718


>gnl|CDD|224251 COG1332, COG1332, CRISPR system related protein, RAMP superfamily
           [Defense mechanisms].
          Length = 369

 Score = 29.3 bits (66), Expect = 2.6
 Identities = 25/134 (18%), Positives = 37/134 (27%), Gaps = 25/134 (18%)

Query: 172 RTERNALKAILITWIVIVLTAVPVF-NMHGEVSYTYASELHTACVF-----------LDL 219
           R  R +    L T +           N+H +        +     F           ++ 
Sbjct: 182 RAIRVSDSPALRTSVSDKRIEYKKLMNVHEKGIPLDNLPIEREFFFNVVISIRTRVEVED 241

Query: 220 DPVNRPDGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKK------ 273
               R       +F   +   K    E  E+  EE  E    KA E  L + +K      
Sbjct: 242 PETKREILAIVNVFNKLD--IKIFSGERFEKTLEENFESYNQKAYEEILAKLEKVDAING 299

Query: 274 -----LGGGGGEEE 282
                LG GGG   
Sbjct: 300 NIELHLGFGGGWRS 313


>gnl|CDD|218391 pfam05029, TIMELESS_C, Timeless protein C terminal region.  The
           timeless (tim) gene is essential for circadian function
           in Drosophila. Putative homologues of Drosophila tim
           have been identified in both mice and humans (mTim and
           hTIM, respectively). Mammalian TIM is not the true
           orthologue of Drosophila TIM, but is the likely
           orthologue of a fly gene, timeout (also called tim-2).
           mTim has been shown to be essential for embryonic
           development, but does not have substantiated circadian
           function. Some family members contain a SANT domain in
           this region.
          Length = 507

 Score = 29.7 bits (66), Expect = 2.6
 Identities = 15/53 (28%), Positives = 28/53 (52%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
           EEE   EEE++    K+   ++ K+  G G   +EE ++   +E+ +  + GG
Sbjct: 423 EEEALGEEEQKAPPKKKQLNQKNKQQTGSGTNSDEERDDTSLDEDRDLADDGG 475



 Score = 29.3 bits (65), Expect = 3.4
 Identities = 17/60 (28%), Positives = 30/60 (50%)

Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
           K G     EE   EEE++   K K+   K K++ G G   +EE ++   +E+ +  ++ G
Sbjct: 416 KAGLASPEEEALGEEEQKAPPKKKQLNQKNKQQTGSGTNSDEERDDTSLDEDRDLADDGG 475


>gnl|CDD|227621 COG5305, COG5305, Predicted membrane protein [Function unknown].
          Length = 552

 Score = 29.3 bits (66), Expect = 2.7
 Identities = 30/140 (21%), Positives = 43/140 (30%), Gaps = 19/140 (13%)

Query: 67  IFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLF--IVCCVPFTATDYV 124
           I++ +    G     L  ++V F    R     LI    I   L   I    P  A   +
Sbjct: 243 IWWLLATAAGL-LAFLPWLLVYFANSFRVGIPDLITLDGIVGQLINPI---YPLIAAWLL 298

Query: 125 TTCWPFGDVWCKMYLIVVTAYASVYTLVLMSLDRFLAVVHPIASMSV--RTERNALKAIL 182
                F D+     L           L  + L   + ++   A  ++   TER  L  IL
Sbjct: 299 AFSLLFFDL--LHGLD-FPINGVARFLYPIWL--AIILI-LGALYTLCRATERKILLFIL 352

Query: 183 ITWIVIVLTA-----VPVFN 197
              I   L       V VF 
Sbjct: 353 TLIIAPALPLILSDLVSVFI 372


>gnl|CDD|198139 smart01071, CDC37_N, Cdc37 N terminal kinase binding.  Cdc37 is a
           molecular chaperone required for the activity of
           numerous eukaryotic protein kinases. This domain
           corresponds to the N terminal domain which binds
           predominantly to protein kinases.and is found N terminal
           to the Hsp (Heat shocked protein) 90-binding domain.
           Expression of a construct consisting of only the
           N-terminal domain of Saccharomyces pombe Cdc37 results
           in cellular viability. This indicates that interactions
           with the cochaperone Hsp90 may not be essential for
           Cdc37 function.
          Length = 154

 Score = 28.5 bits (64), Expect = 2.7
 Identities = 16/56 (28%), Positives = 29/56 (51%), Gaps = 9/56 (16%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           ++E+ E   + E   EE K    KLK          E++E  ++ +E E+EE+++I
Sbjct: 96  KKELEEANGDSEGLLEELKKHRDKLK---------KEQKELRKKLDELEKEEKKKI 142


>gnl|CDD|198151 smart01083, Cir_N, N-terminal domain of CBF1 interacting
           co-repressor CIR.  This is a 45 residue conserved region
           at the N-terminal end of a family of proteins referred
           to as CIRs (CBF1-interacting co-repressors). CBF1
           (centromere-binding factor 1) acts as a transcription
           factor that causes repression by binding specifically to
           GTGGGAA motifs in responsive promoters, and it requires
           CIR as a co-repressor. CIR binds to histone deacetylase
           and to SAP30 and serves as a linker between CBF1 and the
           histone deacetylase complex.
          Length = 37

 Score = 26.3 bits (59), Expect = 2.7
 Identities = 8/20 (40%), Positives = 15/20 (75%)

Query: 280 EEEEEEEEEEEEEEEEEEEI 299
           + E++ EEE+++ EE  +EI
Sbjct: 15  KAEQKAEEEKKKIEERRKEI 34


>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
           partitioning / Cytoskeleton].
          Length = 373

 Score = 29.2 bits (66), Expect = 2.7
 Identities = 19/75 (25%), Positives = 32/75 (42%), Gaps = 7/75 (9%)

Query: 226 DGYNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEE 285
           + Y         +  +   +E+HE    EEE E + K    K++EK+K        EE E
Sbjct: 285 ENYRTEKLSGLKNSGEPSLKEIHEARLNEEERELKKK-FTEKIREKEK------RLEELE 337

Query: 286 EEEEEEEEEEEEEIG 300
           +   EE +E   ++ 
Sbjct: 338 QNLIEERKELNSKLE 352


>gnl|CDD|131972 TIGR02926, AhaH, ATP synthase archaeal, H subunit.  he A1/A0 ATP
           synthase is homologous to the V-type (V1/V0, vacuolar)
           ATPase, but functions in the ATP synthetic direction as
           does the F1/F0 ATPase of bacteria. The hydrophilic A1
           "stalk" complex (AhaABCDEFG) is the site of ATP
           generation and is coupled to the membrane-embedded
           proton translocating A0 complex. It is unclear precisely
           where AhaH fits into these complexes.
          Length = 85

 Score = 27.5 bits (61), Expect = 2.8
 Identities = 26/77 (33%), Positives = 40/77 (51%), Gaps = 11/77 (14%)

Query: 230 KTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEK-----KKLGGGGGEEEEE 284
           + I ++E D     E  + E EEE ++   E + +  +L E+      KLG    +E EE
Sbjct: 2   EEIKKAEED----AEELIEEAEEERKQRIAEAREEARELLEEAEEEASKLGEEIIKEAEE 57

Query: 285 EEEEEEEE--EEEEEEI 299
           E E+E E+  EE E+EI
Sbjct: 58  EIEKEAEKIREEGEKEI 74


>gnl|CDD|218215 pfam04696, Pinin_SDK_memA, pinin/SDK/memA/ protein conserved
           region.  Members of this family have very varied
           localisations within the eukaryotic cell. pinin is known
           to localise at the desmosomes and is implicated in
           anchoring intermediate filaments to the desmosomal
           plaque. SDK2/3 is a dynamically localised nuclear
           protein thought to be involved in modulation of
           alternative pre-mRNA splicing. memA is a tumour marker
           preferentially expressed in human melanoma cell lines. A
           common feature of the members of this family is that
           they may all participate in regulating protein-protein
           interactions.
          Length = 131

 Score = 28.2 bits (63), Expect = 2.9
 Identities = 16/56 (28%), Positives = 30/56 (53%), Gaps = 10/56 (17%)

Query: 244 EREVHEEEEEEEEEEEETKA-----KETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
           E+++ E+E++E EE  + K      +  K  E +KL     E++ E+E+ +E   E
Sbjct: 41  EQKLEEQEKQEREELRKEKRELFEERRRKQLELRKL-----EQKMEDEKLQETWHE 91


>gnl|CDD|222800 PHA00458, PHA00458, single-stranded DNA-binding protein.
          Length = 233

 Score = 29.0 bits (65), Expect = 3.0
 Identities = 19/57 (33%), Positives = 31/57 (54%), Gaps = 11/57 (19%)

Query: 242 GGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           GGE +  +E+EE+     E KA++              EEE +E++EEE EE+E+ +
Sbjct: 187 GGEDDWADEDEEDGYVASEAKARK-----------PQDEEEWDEDDEEESEEDEDGD 232


>gnl|CDD|177433 PHA02608, 67, prohead core protein; Provisional.
          Length = 80

 Score = 27.4 bits (61), Expect = 3.0
 Identities = 8/41 (19%), Positives = 23/41 (56%), Gaps = 6/41 (14%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEE 289
           E EE E+++++E    +    +K        ++++++E++E
Sbjct: 46  EGEEPEDDDDDEDDDDDDDKDDKDD------DDDDDDEDDE 80



 Score = 25.9 bits (57), Expect = 10.0
 Identities = 5/20 (25%), Positives = 19/20 (95%)

Query: 279 GEEEEEEEEEEEEEEEEEEE 298
           GEE E+++++E+++++++++
Sbjct: 47  GEEPEDDDDDEDDDDDDDKD 66


>gnl|CDD|189762 pfam00901, Orbi_VP5, Orbivirus outer capsid protein VP5.
           cryoelectron microscopy indicates that VP5 is a trimer
           implying that there are 360 copies of VP5 per virion.
          Length = 507

 Score = 29.2 bits (66), Expect = 3.1
 Identities = 20/59 (33%), Positives = 29/59 (49%), Gaps = 9/59 (15%)

Query: 243 GERE----VHEEEEEEEEEEEETK-AKETKLKEKKKLGG----GGGEEEEEEEEEEEEE 292
           GE+     + E E E++EEE   K  K+   K  + L        GE + EEEEE++ E
Sbjct: 85  GEQGLQRKLKELEREQKEEEVREKHNKKIIEKFGEDLEEVYKFMKGEAKVEEEEEKQME 143


>gnl|CDD|218598 pfam05470, eIF-3c_N, Eukaryotic translation initiation factor 3
           subunit 8 N-terminus.  The largest of the mammalian
           translation initiation factors, eIF3, consists of at
           least eight subunits ranging in mass from 35 to 170 kDa.
           eIF3 binds to the 40 S ribosome in an early step of
           translation initiation and promotes the binding of
           methionyl-tRNAi and mRNA.
          Length = 593

 Score = 29.4 bits (66), Expect = 3.1
 Identities = 15/76 (19%), Positives = 31/76 (40%)

Query: 228 YNKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEE 287
            NK        Y +  E E  EEEE+E+++++ +  ++                E   + 
Sbjct: 120 NNKQFEDDITRYREDPESEDEEEEEDEDDDDDGSDDEDEDEDGVGATEEVAASSESGVDR 179

Query: 288 EEEEEEEEEEEIGGVV 303
            +E++EE+E+      
Sbjct: 180 VKEDDEEDEDADLSKK 195



 Score = 27.8 bits (62), Expect = 8.9
 Identities = 12/32 (37%), Positives = 20/32 (62%)

Query: 281 EEEEEEEEEEEEEEEEEEIGGVVVSAVDFGSG 312
           E+ E E+EEEEE+E++++ G       + G G
Sbjct: 133 EDPESEDEEEEEDEDDDDDGSDDEDEDEDGVG 164


>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
           Mitofilin controls mitochondrial cristae morphology.
           Mitofilin is enriched in the narrow space between the
           inner boundary and the outer membranes, where it forms a
           homotypic interaction and assembles into a large
           multimeric protein complex. The first 78 amino acids
           contain a typical amino-terminal-cleavable mitochondrial
           presequence rich in positive-charged and hydroxylated
           residues and a membrane anchor domain. In addition, it
           has three centrally located coiled coil domains.
          Length = 493

 Score = 29.2 bits (66), Expect = 3.1
 Identities = 15/55 (27%), Positives = 27/55 (49%), Gaps = 6/55 (10%)

Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           +E  EE   + EEE   + +  +   +K+L        E E E+EE  ++ EE++
Sbjct: 195 KEKREELLSKLEEELLARLESKEAALEKQL------RLEFEREKEELRKKYEEKL 243


>gnl|CDD|226341 COG3820, COG3820, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 230

 Score = 28.7 bits (64), Expect = 3.2
 Identities = 24/75 (32%), Positives = 32/75 (42%), Gaps = 6/75 (8%)

Query: 214 CVFLDLDP-VNRPDGYNKTIFQSENDYCKGGEREVHE-EEEEEEEEEEETKAKETKLKEK 271
           C  +DLD  V R      T  +         E E  + EEE EEE E +  A   KL   
Sbjct: 160 CSQIDLDKEVERASRGRPTAKELGPTLLPASETEQLDYEEEREEERELDADAVFAKLSSL 219

Query: 272 KKLGGGGGEEEEEEE 286
           K     G ++EE+E+
Sbjct: 220 K----SGNKDEEDED 230


>gnl|CDD|237875 PRK14974, PRK14974, cell division protein FtsY; Provisional.
          Length = 336

 Score = 29.2 bits (66), Expect = 3.3
 Identities = 20/56 (35%), Positives = 34/56 (60%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           E ++ EEEEEE  E EE + +E + ++K+K G     +  E +E++ E+  EE E+
Sbjct: 17  EEKIEEEEEEEAPEAEEEEEEEDEEEKKEKPGFFDKAKITEIKEKDIEDLLEELEL 72


>gnl|CDD|235396 PRK05299, rpsB, 30S ribosomal protein S2; Provisional.
          Length = 258

 Score = 29.0 bits (66), Expect = 3.4
 Identities = 17/34 (50%), Positives = 19/34 (55%)

Query: 265 ETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           E +     +      EE EEEEEEEEEEE EE E
Sbjct: 223 EGRQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAE 256



 Score = 28.6 bits (65), Expect = 3.8
 Identities = 23/53 (43%), Positives = 24/53 (45%), Gaps = 19/53 (35%)

Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
           R+    E  EEEEEE                    EEEEEEEEEEE EE E E
Sbjct: 225 RQGRLAEAAEEEEEEA-------------------EEEEEEEEEEEAEEAEAE 258



 Score = 27.8 bits (63), Expect = 6.3
 Identities = 21/51 (41%), Positives = 21/51 (41%), Gaps = 19/51 (37%)

Query: 248 HEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
                E  EEEEE                   E EEEEEEEEEEE EE E 
Sbjct: 226 QGRLAEAAEEEEE-------------------EAEEEEEEEEEEEAEEAEA 257



 Score = 27.8 bits (63), Expect = 6.3
 Identities = 22/53 (41%), Positives = 23/53 (43%), Gaps = 19/53 (35%)

Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
           + G      EEEEEE EEEE                   EEEEEEE EE E E
Sbjct: 225 RQGRLAEAAEEEEEEAEEEE-------------------EEEEEEEAEEAEAE 258



 Score = 27.4 bits (62), Expect = 8.6
 Identities = 22/53 (41%), Positives = 22/53 (41%), Gaps = 20/53 (37%)

Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
             R     EEEEEE EEE                    EEEEEEEE EE E E
Sbjct: 226 QGRLAEAAEEEEEEAEEE--------------------EEEEEEEEAEEAEAE 258


>gnl|CDD|115196 pfam06524, NOA36, NOA36 protein.  This family consists of several
           NOA36 proteins which contain 29 highly conserved
           cysteine residues. The function of this protein is
           unknown.
          Length = 314

 Score = 28.8 bits (64), Expect = 3.4
 Identities = 10/54 (18%), Positives = 22/54 (40%)

Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           R+      ++++     K   +  K     G   G E +++E     + +EEE+
Sbjct: 236 RQTRGFRSDDDQGYGRYKQNLSSNKYGDFKGVNYGYESDDDEGSSSNDYDEEED 289


>gnl|CDD|218358 pfam04979, IPP-2, Protein phosphatase inhibitor 2 (IPP-2).  Protein
           phosphotase inhibitor 2 (IPP-2) is a phosphoprotein
           conserved among all eukaryotes, and it appears in both
           the nucleus and cytoplasm of tissue culture cells.
          Length = 123

 Score = 27.8 bits (62), Expect = 3.4
 Identities = 14/51 (27%), Positives = 19/51 (37%), Gaps = 2/51 (3%)

Query: 249 EEEEEEEEEEEETKAKE--TKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
           E+ + E E  E    +    KL   +        EE E    E+EE   EE
Sbjct: 40  EDSDSESEGNESLTPESLAEKLAAAESSDPSFSIEESESSSSEDEEFSPEE 90



 Score = 27.0 bits (60), Expect = 7.0
 Identities = 12/56 (21%), Positives = 22/56 (39%), Gaps = 4/56 (7%)

Query: 247 VHEEEEEEEEEEEETKAKETKLKEKK----KLGGGGGEEEEEEEEEEEEEEEEEEE 298
              ++E+ + E E  ++   +   +K    +        EE E    E+EE   EE
Sbjct: 35  HRMDDEDSDSESEGNESLTPESLAEKLAAAESSDPSFSIEESESSSSEDEEFSPEE 90


>gnl|CDD|234397 TIGR03920, T7SS_EccD, type VII secretion integral membrane protein
           EccD.  Members of this family are EccD, a component of
           actinobacterial type VII secretion systems (T7SS) with
           ten to eleven predicted transmembrane helix regions
           [Protein fate, Protein and peptide secretion and
           trafficking].
          Length = 431

 Score = 28.9 bits (65), Expect = 3.6
 Identities = 28/145 (19%), Positives = 51/145 (35%), Gaps = 23/145 (15%)

Query: 62  SIIVPIFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTAT 121
              +  ++G  I  G     L+V+  A   +      +  ++L  A L+ +       A 
Sbjct: 135 GRGLGGWWGAPIAAG--VIGLLVLAAAVVARTLRDRAVAALSLLGAALVLV-----AVAG 187

Query: 122 DYVTTCWPFGDVWCKMYLIVVTAYASVYTLVLMSLDRFLAVVHPIASMSV--------RT 173
                    G +     L+  T  A++  L+L    R LAV   + +++V          
Sbjct: 188 ----ALAVPGALGVPHVLLAGTVLAALAVLLLRLTGRGLAVFTAVVTVAVILALAAAVGM 243

Query: 174 ER----NALKAILITWIVIVLTAVP 194
                   + A LI   + VL+A P
Sbjct: 244 LWQTPVPGIAAGLILLGLAVLSAAP 268


>gnl|CDD|215521 PLN02967, PLN02967, kinase.
          Length = 581

 Score = 29.2 bits (65), Expect = 3.6
 Identities = 16/54 (29%), Positives = 25/54 (46%), Gaps = 4/54 (7%)

Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEE----EEEEEEEEEEEEEEEEI 299
            + EEE+ E++ + +    K  + +   G E E    EE E     E E EEE+
Sbjct: 126 SDVEEEKTEKKVRKRRKVKKMDEDVEDQGSESEVSDVEESEFVTSLENESEEEL 179


>gnl|CDD|218637 pfam05558, DREPP, DREPP plasma membrane polypeptide.  This family
           contains several plant plasma membrane proteins termed
           DREPPs as they are developmentally regulated plasma
           membrane polypeptides.
          Length = 201

 Score = 28.5 bits (63), Expect = 3.7
 Identities = 16/47 (34%), Positives = 24/47 (51%), Gaps = 11/47 (23%)

Query: 253 EEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           EE++ EEE  A ET+            +   +E+E   EEE++EEE 
Sbjct: 132 EEKKPEEEAPAAETE-----------KKPAVKEKEIVVEEEKKEEEA 167


>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1.  All
           proteins in this family for which functions are known
           are cyclin dependent protein kinases that are components
           of TFIIH, a complex that is involved in nucleotide
           excision repair and transcription initiation. Also known
           as MAT1 (menage a trois 1). This family is based on the
           phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
           Stanford University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 309

 Score = 29.0 bits (65), Expect = 3.7
 Identities = 12/46 (26%), Positives = 27/46 (58%), Gaps = 5/46 (10%)

Query: 254 EEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           E+EE EE    E + +E+++L       ++EEEE++  + + ++ +
Sbjct: 143 EQEELEEALEFEKEEEEQRRL-----LLQKEEEEQQMNKRKNKQAL 183


>gnl|CDD|217525 pfam03383, Serpentine_r_xa, Caenorhabditis serpentine receptor-like
           protein, class xa.  This family contains various
           Caenorhabditis proteins, some of which are annotated as
           being serpentine receptors, mainly of the xa class.
          Length = 153

 Score = 28.1 bits (63), Expect = 3.7
 Identities = 13/52 (25%), Positives = 23/52 (44%), Gaps = 1/52 (1%)

Query: 146 ASVYTLVLMSLDRFLAVVHPIASMSVRTERN-ALKAILITWIVIVLTAVPVF 196
             +   VLM+L+R   V+ P +     T+R        I  + ++L  +P F
Sbjct: 57  HPLCLSVLMTLNRISIVISPFSQPIWFTDRRLFGYCAGIAILSLILLLIPYF 108


>gnl|CDD|221473 pfam12230, PRP21_like_P, Pre-mRNA splicing factor PRP21 like
           protein.  This domain family is found in eukaryotes, and
           is typically between 212 and 238 amino acids in length.
           The family is found in association with pfam01805. There
           are two completely conserved residues (W and H) that may
           be functionally important. PRP21 is required for
           assembly of the prespliceosome and it interacts with U2
           snRNP and/or pre-mRNA in the prespliceosome. This family
           also contains proteins similar to PRP21, such as the
           mammalian SF3a. SF3a also interacts with U2 snRNP from
           the prespliceosome, converting it to its active form.
          Length = 230

 Score = 28.6 bits (64), Expect = 3.7
 Identities = 14/30 (46%), Positives = 19/30 (63%)

Query: 268 LKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
           L+E         EEE EEEEE+ +EE++EE
Sbjct: 96  LEEPDAAAAPEEEEEMEEEEEDIDEEKKEE 125



 Score = 27.8 bits (62), Expect = 6.8
 Identities = 13/39 (33%), Positives = 19/39 (48%)

Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGG 279
           +  E E  EE+ +EE++EE  K     +KE K    G  
Sbjct: 107 EEEEMEEEEEDIDEEKKEESAKKSTPPIKEMKIPAAGES 145



 Score = 27.8 bits (62), Expect = 7.1
 Identities = 13/36 (36%), Positives = 20/36 (55%)

Query: 260 ETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
           E K+K  +  +         E EEEEE+ +EE++EE
Sbjct: 90  EQKSKALEEPDAAAAPEEEEEMEEEEEDIDEEKKEE 125


>gnl|CDD|218555 pfam05320, Pox_RNA_Pol_19, Poxvirus DNA-directed RNA polymerase 19
           kDa subunit.  This family contains several DNA-directed
           RNA polymerase 19 kDa polypeptides. The Poxvirus
           DNA-directed RNA polymerase (EC: 2.7.7.6) catalyzes
           DNA-template-directed extension of the 3'-end of an RNA
           strand by one nucleotide at a time.
          Length = 167

 Score = 28.1 bits (63), Expect = 3.8
 Identities = 12/29 (41%), Positives = 19/29 (65%)

Query: 280 EEEEEEEEEEEEEEEEEEEIGGVVVSAVD 308
           E ++++ EE EEEEE+EE+   +  S V 
Sbjct: 11  ESDDDDSEEYEEEEEDEEDAESLESSDVS 39


>gnl|CDD|220172 pfam09309, FCP1_C, FCP1, C-terminal.  The C-terminal domain of
           FCP-1 is required for interaction with the carboxy
           terminal domain of RAP74. Interaction relies extensively
           on van der Waals contacts between hydrophobic residues
           situated within alpha-helices in both domains.
          Length = 263

 Score = 28.8 bits (64), Expect = 3.8
 Identities = 20/89 (22%), Positives = 33/89 (37%), Gaps = 19/89 (21%)

Query: 232 IFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLG----------GGGG-- 279
           I    +D     +++    EEE+E+  +  K +    + +   G          GG G  
Sbjct: 155 ILGEGSDDSDSEKKKPPGNEEEQEQAPQPRKQQPPGPRREPDEGTPSSSERSNIGGRGPR 214

Query: 280 -------EEEEEEEEEEEEEEEEEEEIGG 301
                  EE+EE+ E E   E   E+  G
Sbjct: 215 GHKRKLNEEDEEDAESESSFESSNEDEEG 243


>gnl|CDD|221753 pfam12753, Nro1, Nuclear pore complex subunit Nro1.  In fission
           yeast, this protein is a positive regulator of the
           stability of Sre1N, the sterol regulatory
           element-binding protein which is an ER membrane-bound
           transcription factor that controls adaptation to low
           oxygen-growth. In addition, the fission yeast Nro1 is a
           direct inhibitor of a protein that inhibits SreN1
           degradation, Ofd1 (an oxoglutamate deoxygenase). The
           outcome of this reactivity is that Ofd1 acts as an
           oxygen sensor that regulates the binding of Nro1 to Ofd1
           to control the stability of Sre1N. Solution of the
           structure of Nro1 reveals it to be made up of a number
           of TPR coils.
          Length = 401

 Score = 28.7 bits (64), Expect = 3.9
 Identities = 11/21 (52%), Positives = 16/21 (76%)

Query: 279 GEEEEEEEEEEEEEEEEEEEI 299
           G+E E +E  + +EEEEEEE+
Sbjct: 219 GKENEIDEGLDSDEEEEEEEV 239


>gnl|CDD|224667 COG1753, COG1753, Predicted antotoxin, copG family [Signal
           transduction mechanisms].
          Length = 74

 Score = 26.7 bits (59), Expect = 3.9
 Identities = 17/59 (28%), Positives = 31/59 (52%), Gaps = 5/59 (8%)

Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKK-----LGGGGGEEEEEEEEEEEEEEEEEEEI 299
           +V+E+  + +  +E       +L EKK+     L    G   EEE E+ ++EE+E E++
Sbjct: 11  DVYEKLVKMKRGKESFSDVIRELIEKKRGNLEVLMRAFGTLSEEEVEKIKKEEKEAEKM 69


>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
          Length = 1832

 Score = 29.3 bits (66), Expect = 3.9
 Identities = 21/70 (30%), Positives = 32/70 (45%), Gaps = 22/70 (31%)

Query: 230 KTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEE 289
           K IF+  N   K  E ++ +  EE+           TK K+KK         E+++EEE 
Sbjct: 722 KKIFR--NWNGKDAEFKISDSVEEK-----------TKKKKKK---------EKKKEEEY 759

Query: 290 EEEEEEEEEI 299
           + EE+   EI
Sbjct: 760 KREEKARIEI 769


>gnl|CDD|227061 COG4717, COG4717, Uncharacterized conserved protein [Function
           unknown].
          Length = 984

 Score = 29.0 bits (65), Expect = 3.9
 Identities = 20/87 (22%), Positives = 39/87 (44%), Gaps = 7/87 (8%)

Query: 219 LDPVNRPDGYNKTIFQSENDYCKGGEREVHEEEEEEEE-------EEEETKAKETKLKEK 271
           L+ + R     + + Q+E  YCK  + +   E+E EEE       +EEE +A +   +EK
Sbjct: 368 LECMLRYQSSQRELKQTEAAYCKRLDEKRLFEDEAEEEARQRLADDEEEVRAGDEAREEK 427

Query: 272 KKLGGGGGEEEEEEEEEEEEEEEEEEE 298
                   ++EE     +  +   +++
Sbjct: 428 IAANSQVIDKEEVCNLYDRRDTAWQKQ 454


>gnl|CDD|221733 pfam12720, DUF3807, Protein of unknown function (DUF3807).  This is
           a family of conserved fungal proteins of unknown
           function.
          Length = 169

 Score = 28.2 bits (63), Expect = 3.9
 Identities = 13/59 (22%), Positives = 22/59 (37%), Gaps = 8/59 (13%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGV 302
           ERE+ EE E EEE E +                 G    + +E E++   ++ +     
Sbjct: 68  ERELKEEAEAEEEGEVDA--------SPDAGAVAGESSADRKEAEQQGAAQKRKSCRDK 118


>gnl|CDD|184416 PRK13955, mscL, large-conductance mechanosensitive channel;
           Provisional.
          Length = 130

 Score = 27.9 bits (62), Expect = 4.0
 Identities = 11/26 (42%), Positives = 16/26 (61%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKL 274
           ++EEE+EEE  E   +E  L E + L
Sbjct: 96  KKEEEKEEEIPEPTKEEELLGEIRDL 121



 Score = 27.5 bits (61), Expect = 4.7
 Identities = 15/33 (45%), Positives = 18/33 (54%), Gaps = 6/33 (18%)

Query: 267 KLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           KL  KK+      EE+EEE  E  +EEE   EI
Sbjct: 92  KLTSKKE------EEKEEEIPEPTKEEELLGEI 118


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 28.9 bits (65), Expect = 4.0
 Identities = 19/57 (33%), Positives = 21/57 (36%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
           E    E EE EE  EE     E  L E+  L         E EE  EE  E E +  
Sbjct: 855 ESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRS 911


>gnl|CDD|235246 PRK04184, PRK04184, DNA topoisomerase VI subunit B; Validated.
          Length = 535

 Score = 29.1 bits (66), Expect = 4.0
 Identities = 15/45 (33%), Positives = 20/45 (44%)

Query: 255 EEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           E+EE + +    KL  KK       EEEEEE    E  +   E+ 
Sbjct: 491 EKEEPDIEEVLAKLIGKKLEEEEVEEEEEEEAVVVESAKNYTEKD 535


>gnl|CDD|178669 PLN03122, PLN03122, Poly [ADP-ribose] polymerase; Provisional.
          Length = 815

 Score = 29.0 bits (65), Expect = 4.0
 Identities = 20/51 (39%), Positives = 28/51 (54%), Gaps = 2/51 (3%)

Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEE 291
           KGG R+   E +E E E+   KAK  K K++   G G G+  E+  +E EE
Sbjct: 18  KGGTRKQKAENKEHEGEQSPKKAK--KEKKQDDSGNGNGKSAEDAVKEFEE 66


>gnl|CDD|216887 pfam02124, Marek_A, Marek's disease glycoprotein A. 
          Length = 211

 Score = 28.5 bits (64), Expect = 4.1
 Identities = 11/54 (20%), Positives = 18/54 (33%), Gaps = 5/54 (9%)

Query: 36  PAICRNFSHEYINDNDVDAEAMERMVSIIVPIFFGIIIILGFFGNALVVVVVAF 89
           P     F      D    A     ++S+I     G+I  L   G  +V+  +  
Sbjct: 161 PDGLPVFEDTGTYDASPSAVTGPMVISVI-----GVICGLVALGLLVVLAALCL 209


>gnl|CDD|235370 PRK05244, PRK05244, Der GTPase activator; Provisional.
          Length = 177

 Score = 28.3 bits (64), Expect = 4.2
 Identities = 7/18 (38%), Positives = 15/18 (83%)

Query: 282 EEEEEEEEEEEEEEEEEI 299
            ++++EEE EEEE ++++
Sbjct: 147 SDDDDEEESEEEESDDDL 164



 Score = 28.0 bits (63), Expect = 5.3
 Identities = 8/19 (42%), Positives = 15/19 (78%)

Query: 279 GEEEEEEEEEEEEEEEEEE 297
           G  ++++EEE EEEE +++
Sbjct: 145 GISDDDDEEESEEEESDDD 163



 Score = 27.6 bits (62), Expect = 6.4
 Identities = 8/19 (42%), Positives = 15/19 (78%)

Query: 278 GGEEEEEEEEEEEEEEEEE 296
           G  ++++EEE EEEE +++
Sbjct: 145 GISDDDDEEESEEEESDDD 163


>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812).  This
           family consists of several eukaryotic proteins of
           unknown function.
          Length = 536

 Score = 29.0 bits (65), Expect = 4.3
 Identities = 21/65 (32%), Positives = 28/65 (43%), Gaps = 3/65 (4%)

Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
            +E D  K  E E+   +E+ +E E      E  L E K L     +  EE     +E E
Sbjct: 259 DTEEDRTKEREAELEALQEQIDELESSI---EEVLSEIKALASKIKQVNEELTTVRQENE 315

Query: 294 EEEEE 298
           E EEE
Sbjct: 316 ELEEE 320



 Score = 28.2 bits (63), Expect = 7.4
 Identities = 18/56 (32%), Positives = 26/56 (46%), Gaps = 8/56 (14%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           + ++ E E   EE   E KA  +K+K+          EE     +E EE EEE +I
Sbjct: 276 QEQIDELESSIEEVLSEIKALASKIKQV--------NEELTTVRQENEELEEEYKI 323


>gnl|CDD|227447 COG5117, NOC3, Protein involved in the nuclear export of
           pre-ribosomes [Translation, ribosomal structure and
           biogenesis / Intracellular trafficking and secretion].
          Length = 657

 Score = 28.8 bits (64), Expect = 4.3
 Identities = 12/57 (21%), Positives = 27/57 (47%), Gaps = 9/57 (15%)

Query: 244 EREVHEEEEE-EEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
             +V+  EE   E E E +   ++ +K++K        ++  EE++   E   +++I
Sbjct: 85  VADVNNGEEFLSESESEASLEIDSDIKDEK--------QKSLEEQKIAPEIPVKQQI 133


>gnl|CDD|179798 PRK04239, PRK04239, hypothetical protein; Provisional.
          Length = 110

 Score = 27.2 bits (61), Expect = 4.5
 Identities = 14/44 (31%), Positives = 26/44 (59%), Gaps = 2/44 (4%)

Query: 256 EEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           EE EE + +  KL+E +K      + +EE+EE + + E +++ I
Sbjct: 1   EELEEIRRR--KLEELQKQAQEQQQAQEEQEEAQAQAEAQKQAI 42


>gnl|CDD|227880 COG5593, COG5593, Nucleic-acid-binding protein possibly involved in
           ribosomal biogenesis [Translation, ribosomal structure
           and biogenesis].
          Length = 821

 Score = 28.9 bits (64), Expect = 4.5
 Identities = 18/51 (35%), Positives = 26/51 (50%), Gaps = 8/51 (15%)

Query: 226 DGYNKTI--FQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKL 274
           DG       F   ++     E E  EEE+EEEE +E + AK  K K++K +
Sbjct: 758 DGEQDNSDSFAESSE-----EDESSEEEKEEEENKEVS-AKRAKKKQRKNM 802


>gnl|CDD|226122 COG3594, NolL, Fucose 4-O-acetylase and related acetyltransferases
           [Carbohydrate transport and metabolism].
          Length = 343

 Score = 28.5 bits (64), Expect = 4.5
 Identities = 29/153 (18%), Positives = 48/153 (31%), Gaps = 18/153 (11%)

Query: 58  ERMVSIIVP--IFFGIIIILGFF-GNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVC 114
           ++  ++IVP   FF I  IL F        + + F         L+I+      LLFI  
Sbjct: 69  KKARTLIVPYIFFFLIYSILYFLLRKFNSELTLNFILPGWELWFLVILFFWFLPLLFITK 128

Query: 115 CVPFTATDYVTTCWPFGDVWCKM------YLIVVTAYASVYTLV--LMSLDRFL--AVVH 164
            +       +   W    V   +      + I ++  +   TLV     L   L   +  
Sbjct: 129 LIYN-----LIAVWLAVSVSLIILILVGFFFIPLSLGSFSRTLVFFPFFLLGTLFAKLHF 183

Query: 165 PIASMSVRTERNALKAILITWIVIVLTAVPVFN 197
               +    +   +        VI L     FN
Sbjct: 184 WFGGLLRTYKGLRVNIKSFLAAVIFLAVGLAFN 216


>gnl|CDD|237258 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
          Length = 925

 Score = 28.9 bits (65), Expect = 4.5
 Identities = 20/105 (19%), Positives = 35/105 (33%), Gaps = 4/105 (3%)

Query: 193 VPVFNMHGEVSYTYASELHTACVFLDLDPVNRPDGYNKTIFQSENDYCKGGEREVHEEEE 252
           V +FN          +E+ +  +          +  ++ I   E        +       
Sbjct: 773 VSLFNNPNAEKILIITEILSDGINNSDINDRPQELIDQIIESEEERLKALRIQREEMLMR 832

Query: 253 EEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
            EE E    + K  K + K +L     E +E EEE +   E + E
Sbjct: 833 PEELELINEEQKNLKQEIKLEL----SEIQEAEEEIQNINENKNE 873


>gnl|CDD|240577 cd12950, RRP7_Rrp7p, RRP7 domain ribosomal RNA-processing protein 7
           (Rrp7p) and similar proteins.  This CD corresponds to
           the RRP7 domain of Rrp7p. Rrp7p is encoded by YCL031C
           gene from Saccharomyces cerevisiae. It is an essential
           yeast protein involved in pre-rRNA processing and
           ribosome assembly. Rrp7p contains an N-terminal RNA
           recognition motif (RRM), also termed RBD (RNA binding
           domain) or RNP (ribonucleoprotein domain), and a
           C-terminal RRP7 domain.
          Length = 128

 Score = 27.6 bits (62), Expect = 4.7
 Identities = 15/38 (39%), Positives = 23/38 (60%), Gaps = 6/38 (15%)

Query: 242 GGEREVHEEEEEEEEEEEETKAKETK------LKEKKK 273
           G E     EEE++E+E+++ K KE +      L+EKKK
Sbjct: 67  GEEAGKAAEEEKKEKEKKKKKKKELEDFYRFQLREKKK 104


>gnl|CDD|183731 PRK12766, PRK12766, 50S ribosomal protein L32e; Provisional.
          Length = 232

 Score = 28.3 bits (63), Expect = 4.9
 Identities = 16/32 (50%), Positives = 18/32 (56%)

Query: 271 KKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGV 302
           K  +GG    EE E E E+E  EEEEEE   V
Sbjct: 53  KADVGGLEVSEETEAEVEDEGGEEEEEEDADV 84



 Score = 28.3 bits (63), Expect = 5.5
 Identities = 16/50 (32%), Positives = 23/50 (46%)

Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEE 292
           G  EV EE E E E+E   + +E     + +L   G  E+  E  +EE  
Sbjct: 57  GGLEVSEETEAEVEDEGGEEEEEEDADVETELRPRGLTEKTPELSDEEAR 106


>gnl|CDD|227225 COG4888, COG4888, Uncharacterized Zn ribbon-containing protein
           [General function prediction only].
          Length = 104

 Score = 27.1 bits (60), Expect = 5.1
 Identities = 10/23 (43%), Positives = 13/23 (56%)

Query: 275 GGGGGEEEEEEEEEEEEEEEEEE 297
           G G GE   +EE ++E E   EE
Sbjct: 80  GRGCGESGTDEENDQEIESPGEE 102



 Score = 26.7 bits (59), Expect = 6.8
 Identities = 9/24 (37%), Positives = 13/24 (54%)

Query: 276 GGGGEEEEEEEEEEEEEEEEEEEI 299
           G G  E   +EE ++E E   EE+
Sbjct: 80  GRGCGESGTDEENDQEIESPGEEV 103


>gnl|CDD|173534 PTZ00341, PTZ00341, Ring-infected erythrocyte surface antigen;
            Provisional.
          Length = 1136

 Score = 28.6 bits (63), Expect = 5.4
 Identities = 22/71 (30%), Positives = 30/71 (42%)

Query: 229  NKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEE 288
            N   +  EN        E ++EE  EE EE   +  E  ++E  +       EE EE  E
Sbjct: 1018 NVEEYDEENVEEVEENVEEYDEENVEEIEENAEENVEENIEENIEEYDEENVEEIEENIE 1077

Query: 289  EEEEEEEEEEI 299
            E  EE  EE +
Sbjct: 1078 ENIEENVEENV 1088


>gnl|CDD|225803 COG3264, COG3264, Small-conductance mechanosensitive channel [Cell
           envelope biogenesis, outer membrane].
          Length = 835

 Score = 28.5 bits (64), Expect = 5.4
 Identities = 20/105 (19%), Positives = 40/105 (38%), Gaps = 10/105 (9%)

Query: 62  SIIVPIFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTAT 121
             +     G + I+       ++V    +   R+   LL+I +  A  L ++  V    T
Sbjct: 387 DGLASDVIGQLGIIAALLAPALLVARLKDAAWRAYPRLLLITVLSAAPLALI--VA-ALT 443

Query: 122 DYVTTCWPFGDVWCKMYLIVVTAYASVYTLVLMSLDRFLAVVHPI 166
            Y+ T          + L +V   A ++T  ++ + R L+V    
Sbjct: 444 GYLYT-------ALFLALRLVVTVAILFTWYVLYILRGLSVSARR 481


>gnl|CDD|148790 pfam07376, Prosystemin, Prosystemin.  This family consists of
           several plant specific prosystemin proteins. Prosystemin
           is the precursor protein of the 18 amino acid wound
           signal systemin which activates systemic defence in
           plant leaves against insect herbivores.
          Length = 193

 Score = 28.1 bits (61), Expect = 5.5
 Identities = 21/67 (31%), Positives = 33/67 (49%)

Query: 230 KTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEE 289
           +T  Q   +     +     E EE  +E+E+   KET  K   K+ G   +E+ + E EE
Sbjct: 37  ETPSQDIKNKDDDAQEIPKVEHEEGGDEKEKIIEKETISKCIIKIEGDDAQEKIKVEYEE 96

Query: 290 EEEEEEE 296
           EE E+E+
Sbjct: 97  EECEKEK 103


>gnl|CDD|215153 PLN02271, PLN02271, serine hydroxymethyltransferase.
          Length = 586

 Score = 28.6 bits (64), Expect = 5.5
 Identities = 11/27 (40%), Positives = 19/27 (70%)

Query: 268 LKEKKKLGGGGGEEEEEEEEEEEEEEE 294
           L++K++     GEE +E+EEE+ E+E 
Sbjct: 52  LEQKEEKEEDAGEEGDEDEEEQGEDEH 78



 Score = 27.8 bits (62), Expect = 8.7
 Identities = 11/29 (37%), Positives = 18/29 (62%)

Query: 267 KLKEKKKLGGGGGEEEEEEEEEEEEEEEE 295
           +L E+K+       EE +E+EEE+ E+E 
Sbjct: 50  QLLEQKEEKEEDAGEEGDEDEEEQGEDEH 78


>gnl|CDD|218652 pfam05602, CLPTM1, Cleft lip and palate transmembrane protein 1
           (CLPTM1).  This family consists of several eukaryotic
           cleft lip and palate transmembrane protein 1 sequences.
           Cleft lip with or without cleft palate is a common birth
           defect that is genetically complex. The nonsyndromic
           forms have been studied genetically using linkage and
           candidate-gene association studies with only partial
           success in defining the loci responsible for orofacial
           clefting. CLPTM1 encodes a transmembrane protein and has
           strong homology to two Caenorhabditis elegans genes,
           suggesting that CLPTM1 may belong to a new gene family.
           This family also contains the human cisplatin resistance
           related protein CRR9p which is associated with
           CDDP-induced apoptosis.
          Length = 437

 Score = 28.4 bits (64), Expect = 5.6
 Identities = 13/34 (38%), Positives = 18/34 (52%)

Query: 261 TKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
           T     K  +KK L GG  E+EE EEE+    ++
Sbjct: 145 TTYLPKKKVKKKNLLGGKSEKEEPEEEKTPAPDK 178


>gnl|CDD|216292 pfam01086, Clathrin_lg_ch, Clathrin light chain. 
          Length = 225

 Score = 28.2 bits (63), Expect = 5.6
 Identities = 19/81 (23%), Positives = 34/81 (41%), Gaps = 15/81 (18%)

Query: 220 DPVNRPDGY-NKTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGG 278
           +    P G  N   +Q  +D   G    + + +  E EE E  +    K +E++ L    
Sbjct: 73  NEAVAPGGTINGDGYQEPDDPTSGYA-SISQADRVEGEEPESIR----KWRERRDL---- 123

Query: 279 GEEEEEEEEEEEEEEEEEEEI 299
                  EE +E  E+++EE+
Sbjct: 124 -----RIEERDEASEKKKEEL 139


>gnl|CDD|169428 PRK08404, PRK08404, V-type ATP synthase subunit H; Validated.
          Length = 103

 Score = 27.1 bits (60), Expect = 5.7
 Identities = 22/65 (33%), Positives = 33/65 (50%)

Query: 230 KTIFQSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEE 289
           K I +   +  K  E E+ ++ EEE ++  E K KE + + KK L  G  E EE + + E
Sbjct: 27  KKIIRKAKEEAKKIEEEIIKKAEEEAQKLIEKKKKEGEEEAKKILEEGEKEIEELKVKAE 86

Query: 290 EEEEE 294
           E  E 
Sbjct: 87  ENFET 91


>gnl|CDD|227635 COG5325, COG5325, t-SNARE complex subunit, syntaxin [Intracellular
           trafficking and secretion].
          Length = 283

 Score = 28.3 bits (63), Expect = 5.7
 Identities = 15/55 (27%), Positives = 25/55 (45%), Gaps = 7/55 (12%)

Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
           + H  EEEE+EE   +   +  L+++           EE E ++    E +EEI 
Sbjct: 151 DQHPLEEEEDEESLSSLGSQQTLQQQGLS-------NEELEYQQILITERDEEIK 198


>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
           [General function prediction only].
          Length = 239

 Score = 28.1 bits (63), Expect = 5.8
 Identities = 16/53 (30%), Positives = 22/53 (41%)

Query: 247 VHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
               EE E+ E+E    KE   + +K L       EEE  E  EE +E   + 
Sbjct: 113 AELMEEIEKLEKEIEDLKERLERLEKNLAEAEARLEEEVAEIREEGQELSSKR 165


>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR).  This
           family consists of several bovine specific leukaemia
           virus receptors which are thought to function as
           transmembrane proteins, although their exact function is
           unknown.
          Length = 561

 Score = 28.5 bits (63), Expect = 5.8
 Identities = 13/43 (30%), Positives = 19/43 (44%)

Query: 247 VHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEE 289
           V  EEE    +  E   +E K +EK+K G         E +E+
Sbjct: 82  VKLEEERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTESDED 124


>gnl|CDD|235850 PRK06669, fliH, flagellar assembly protein H; Validated.
          Length = 281

 Score = 28.1 bits (63), Expect = 5.8
 Identities = 19/56 (33%), Positives = 29/56 (51%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
            +E+ EE EE+  E  E   +E K +  KK        E+ + + E E+EE EEE+
Sbjct: 57  AKEIIEEAEEDAFEIVEAAEEEAKEELLKKTDEASSIIEKLQMQIEREQEEWEEEL 112


>gnl|CDD|191716 pfam07263, DMP1, Dentin matrix protein 1 (DMP1).  This family
           consists of several mammalian dentin matrix protein 1
           (DMP1) sequences. The dentin matrix acidic
           phosphoprotein 1 (DMP1) gene has been mapped to human
           chromosome 4q21. DMP1 is a bone and teeth specific
           protein initially identified from mineralised dentin.
           DMP1 is primarily localised in the nuclear compartment
           of undifferentiated osteoblasts. In the nucleus, DMP1
           acts as a transcriptional component for activation of
           osteoblast-specific genes like osteocalcin. During the
           early phase of osteoblast maturation, Ca(2+) surges into
           the nucleus from the cytoplasm, triggering the
           phosphorylation of DMP1 by a nuclear isoform of casein
           kinase II. This phosphorylated DMP1 is then exported out
           into the extracellular matrix, where it regulates
           nucleation of hydroxyapatite. DMP1 is a unique molecule
           that initiates osteoblast differentiation by
           transcription in the nucleus and orchestrates
           mineralised matrix formation extracellularly, at later
           stages of osteoblast maturation. The DMP1 gene has been
           found to be ectopically expressed in lung cancer
           although the reason for this is unknown.
          Length = 514

 Score = 28.5 bits (63), Expect = 5.8
 Identities = 13/54 (24%), Positives = 23/54 (42%)

Query: 245 REVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           R+    EE+   E +++   E K    +  G     E    E +E+ EE + +E
Sbjct: 270 RKSRISEEDGRGELDDSNTMEVKSDSTENAGLSQSREHSRSESQEDSEENQSQE 323


>gnl|CDD|219956 pfam08658, Rad54_N, Rad54 N terminal.  This is the N terminal of
           the DNA repair protein Rad54.
          Length = 191

 Score = 27.7 bits (62), Expect = 5.9
 Identities = 8/20 (40%), Positives = 14/20 (70%)

Query: 279 GEEEEEEEEEEEEEEEEEEE 298
            + + EEE+ E+++E EE E
Sbjct: 143 DKPKIEEEKAEKDQEPEESE 162



 Score = 27.7 bits (62), Expect = 5.9
 Identities = 8/24 (33%), Positives = 15/24 (62%)

Query: 280 EEEEEEEEEEEEEEEEEEEIGGVV 303
           + EEE+ E+++E EE E ++    
Sbjct: 146 KIEEEKAEKDQEPEESETKLSNGP 169


>gnl|CDD|233199 TIGR00939, 2a57, Equilibrative Nucleoside Transporter (ENT).
           [Transport and binding proteins, Nucleosides, purines
           and pyrimidines].
          Length = 437

 Score = 28.1 bits (63), Expect = 5.9
 Identities = 14/83 (16%), Positives = 30/83 (36%), Gaps = 7/83 (8%)

Query: 118 FTATDYVTTCWPF-GDVW------CKMYLIVVTAYASVYTLVLMSLDRFLAVVHPIASMS 170
            TA  Y    + +  ++        KM+    T Y     L  +  +     +     ++
Sbjct: 16  ITAPQYFIEYYKYAQNIPEAIPSSSKMWKHFNTYYTLASQLPSLLFNSLNLFLIFRIPVT 75

Query: 171 VRTERNALKAILITWIVIVLTAV 193
           VR     +  +++  +V+VL  V
Sbjct: 76  VRLLGGLVILLVVVILVMVLVKV 98


>gnl|CDD|224143 COG1222, RPT1, ATP-dependent 26S proteasome regulatory subunit
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 406

 Score = 28.4 bits (64), Expect = 5.9
 Identities = 12/56 (21%), Positives = 21/56 (37%), Gaps = 5/56 (8%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
             E  E   + E+ + +   KE +L   ++        E E    + E +   EEI
Sbjct: 15  SYEPQEYLNKLEDTKLKLLEKEKRLLLLEEQ-----RLEAEGLRLKREVDRLREEI 65


>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200).  This
           family is found in eukaryotes. It is a coiled-coil
           domain of unknwon function.
          Length = 126

 Score = 27.2 bits (61), Expect = 6.1
 Identities = 20/69 (28%), Positives = 33/69 (47%), Gaps = 20/69 (28%)

Query: 245 REVHEEEEEEEEEEEETKAKETKLKE--------------KKKLGGGGGEEEEEEEEEEE 290
            E    EE  ++ EEE + KE +L+E              K++        E++ EEE++
Sbjct: 21  EEFERREELLKQREEELEKKEEELQESLIKFDKFLKENEAKRR------RAEKKAEEEKK 74

Query: 291 EEEEEEEEI 299
             +E+EEEI
Sbjct: 75  LRKEKEEEI 83


>gnl|CDD|222440 pfam13897, GOLD_2, Golgi-dynamics membrane-trafficking.  Sec14-like
           Golgi-trafficking domain The GOLD domain is always found
           combined with lipid- or membrane-association domains.
          Length = 136

 Score = 27.3 bits (61), Expect = 6.1
 Identities = 13/24 (54%), Positives = 15/24 (62%)

Query: 279 GEEEEEEEEEEEEEEEEEEEIGGV 302
            +EEEEEE EEEE E  + E G  
Sbjct: 55  SDEEEEEEAEEEEAETGDVEAGSK 78


>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
          Length = 508

 Score = 28.1 bits (63), Expect = 6.1
 Identities = 18/128 (14%), Positives = 36/128 (28%), Gaps = 14/128 (10%)

Query: 184 TWIVIVLTAVPVFNMHGEVSYTYASELHTACVFLDLDPVNRPDGYNKTIFQSENDYCKGG 243
           + ++++L  +    +   V      +         L    +           E       
Sbjct: 4   SILLVILLLLIGLLLGVLVVLLKKRQRLAKEAERILQEAQKEAEEKLEAALLEAKELLLR 63

Query: 244 EREVHEEEEEEEEEEE--------------ETKAKETKLKEKKKLGGGGGEEEEEEEEEE 289
           ER    +E   E EE               + +A++    E +           E E EE
Sbjct: 64  ERNQQRQEARREREELQREEERLVQKEEQLDARAEKLDNLENQLEEREKALSARELELEE 123

Query: 290 EEEEEEEE 297
            E++ + E
Sbjct: 124 LEKQLDNE 131


>gnl|CDD|233044 TIGR00600, rad2, DNA excision repair protein (rad2).  All proteins
           in this family for which functions are known are flap
           endonucleases that generate the 3' incision next to DNA
           damage as part of nucleotide excision repair. This
           family is related to many other flap endonuclease
           families including the fen1 family. This family is based
           on the phylogenomic analysis of JA Eisen (1999, Ph.D.
           Thesis, Stanford University) [DNA metabolism, DNA
           replication, recombination, and repair].
          Length = 1034

 Score = 28.3 bits (63), Expect = 6.2
 Identities = 14/63 (22%), Positives = 26/63 (41%), Gaps = 2/63 (3%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEE--EEEEEEEEEEEEEEEIGGVVVSA 306
           +      E +  +K++ T   E+         E E  +E E+   +E EE+ I G++   
Sbjct: 664 DSVSSTLELQVPSKSQPTDESEENAENKVASIEGEHRKEIEDLLFDESEEDNIVGMIEEE 723

Query: 307 VDF 309
            D 
Sbjct: 724 KDA 726



 Score = 28.3 bits (63), Expect = 6.7
 Identities = 12/67 (17%), Positives = 25/67 (37%)

Query: 234 QSENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
           +         E +      E +   +     E++   + K+    GE  +E E+   +E 
Sbjct: 652 EESESDGSFIEVDSVSSTLELQVPSKSQPTDESEENAENKVASIEGEHRKEIEDLLFDES 711

Query: 294 EEEEEIG 300
           EE+  +G
Sbjct: 712 EEDNIVG 718


>gnl|CDD|219905 pfam08564, CDC37_C, Cdc37 C terminal domain.  Cdc37 is a protein
           required for the activity of numerous eukaryotic protein
           kinases. This domains corresponds to the C terminal
           domain whose function is unclear. It is found C terminal
           to the Hsp90 chaperone (Heat shocked protein 90) binding
           domain pfam08565 and the N terminal kinase binding
           domain of Cdc37 pfam03234.
          Length = 89

 Score = 26.5 bits (59), Expect = 6.8
 Identities = 14/32 (43%), Positives = 16/32 (50%)

Query: 270 EKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGG 301
             K  G    +E EEE  EEE E+EE EE   
Sbjct: 58  NAKIEGEKEFKELEEEYNEEEAEKEEIEEEDE 89


>gnl|CDD|217956 pfam04194, PDCD2_C, Programmed cell death protein 2, C-terminal
           putative domain. 
          Length = 165

 Score = 27.4 bits (61), Expect = 7.0
 Identities = 15/53 (28%), Positives = 27/53 (50%), Gaps = 3/53 (5%)

Query: 247 VHEEEEEEEE---EEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
           V  EEEE E         + +E ++   ++L   GG  +E+EE +E E+ + +
Sbjct: 6   VFPEEEEYETLHAPSLPEEPEEREIDAFEELEKEGGGGKEDEETDESEKAKSD 58


>gnl|CDD|220600 pfam10147, CR6_interact, Growth arrest and DNA-damage-inducible
           proteins-interacting protein 1.  Members of this family
           of proteins act as negative regulators of G1 to S cell
           cycle phase progression by inhibiting cyclin-dependent
           kinases. Inhibitory effects are additive with GADD45
           proteins but occur also in the absence of GADD45
           proteins. Furthermore, they act as a repressor of the
           orphan nuclear receptor NR4A1 by inhibiting AB
           domain-mediated transcriptional activity.
          Length = 217

 Score = 27.9 bits (62), Expect = 7.0
 Identities = 13/49 (26%), Positives = 24/49 (48%), Gaps = 7/49 (14%)

Query: 251 EEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           E+ EE EEEE +   +  +  ++         E+++E+E   +  E EI
Sbjct: 85  EKLEELEEEEREWYPSLNQMLEEN-------REQQKEKEARRQAREAEI 126


>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
           Transcription of the anti-viral guanylate-binding
           protein (GBP) is induced by interferon-gamma during
           macrophage induction. This family contains GBP1 and
           GPB2, both GTPases capable of binding GTP, GDP and GMP.
          Length = 297

 Score = 28.0 bits (63), Expect = 7.2
 Identities = 18/60 (30%), Positives = 29/60 (48%), Gaps = 5/60 (8%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEE-----EEEEEEEEE 298
           E E     E+++EEE+  +A+E   +E  K      E E E+   E     E + +E+EE
Sbjct: 217 EAEQELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEAEREKLLAEQERMLEHKLQEQEE 276



 Score = 27.6 bits (62), Expect = 8.1
 Identities = 15/52 (28%), Positives = 24/52 (46%), Gaps = 10/52 (19%)

Query: 243 GEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294
            E+ +  E  + E  E E +     L+EK+K      EEE+  E +E   +E
Sbjct: 202 KEKAIEAERAKAEAAEAEQEL----LREKQK------EEEQMMEAQERSYQE 243


>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota).  DNA
           Topoisomerase I (eukaryota), DNA topoisomerase V,
           Vaccina virus topoisomerase, Variola virus
           topoisomerase, Shope fibroma virus topoisomeras.
          Length = 391

 Score = 28.1 bits (63), Expect = 7.2
 Identities = 11/50 (22%), Positives = 28/50 (56%), Gaps = 6/50 (12%)

Query: 250 EEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
             E   + + + K+K  +  EK         E +E+++E+++EE+++++I
Sbjct: 302 LFEMISDLKRKLKSKFERDNEKLDA------EVKEKKKEKKKEEKKKKQI 345


>gnl|CDD|147845 pfam05914, RIB43A, RIB43A.  This family consists of several
           RIB43A-like eukaryotic proteins. Ciliary and flagellar
           microtubules contain a specialised set of
           protofilaments, termed ribbons, that are composed of
           tubulin and several associated proteins. RIB43A was
           first characterized in the unicellular biflagellate,
           Chlamydomonas reinhardtii although highly related
           sequences are present in several higher eukaryotes
           including humans. The function of this protein is
           unknown although the structure of RIB43A and its
           association with the specialised protofilament ribbons
           and with basal bodies is relevant to the proposed role
           of ribbons in forming and stabilising doublet and
           triplet microtubules and in organising their
           three-dimensional structure. Human RIB43A homologues
           could represent a structural requirement in centriole
           replication in dividing cells.
          Length = 379

 Score = 28.1 bits (63), Expect = 7.2
 Identities = 12/60 (20%), Positives = 26/60 (43%), Gaps = 6/60 (10%)

Query: 241 KGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIG 300
           KG E+++ E+E   EEE+   K  + +   + +           E +E    +E  +++ 
Sbjct: 287 KGQEQQLQEKERRREEEQLREKEWDRQAINQARAA------VLLERQERRLRKELRKQLD 340


>gnl|CDD|114015 pfam05266, DUF724, Protein of unknown function (DUF724).  This
           family contains several uncharacterized proteins found
           in Arabidopsis thaliana and other plants. This region is
           often found associated with Agenet domains and may
           contain coiled-coil.
          Length = 190

 Score = 27.4 bits (61), Expect = 7.3
 Identities = 17/62 (27%), Positives = 32/62 (51%)

Query: 238 DYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
           +  KG E+E+ E+E   +E + E    E K+ E ++      E++E E++E    + E  
Sbjct: 110 EERKGLEKEIAEKEISRQELDSEIAELERKILELQRQAALLKEKKEAEDKEIARLKSEAS 169

Query: 298 EI 299
           +I
Sbjct: 170 KI 171


>gnl|CDD|240533 cd13128, MATE_Wzx_like, Wzx, a subfamily of the multidrug and toxic
           compound extrusion (MATE)-like proteins.  Escherichia
           coli Wzx and related proteins from other gram-negative
           bacteria are thought to act as flippases, assisting in
           the membrane translocation of lipopolysaccharides
           including those containing O-antigens. Proteins from the
           MATE family are involved in exporting metabolites across
           the cell membrane and are often responsible for
           multidrug resistance (MDR).
          Length = 402

 Score = 27.9 bits (63), Expect = 7.5
 Identities = 16/87 (18%), Positives = 34/87 (39%), Gaps = 8/87 (9%)

Query: 35  IPAICRNFSHEYINDNDVDAEAMERMVSIIVPIFFGIIIILGFFGNALVVVVVAFNQQMR 94
            P I   + ++      +  ++ + +  I +PI  G+      F   ++ ++  F ++  
Sbjct: 267 FPRISELYKNDKDELKKLLKKSFKYLFLISLPIAIGLF----LFAEPIITLL--FGEEYL 320

Query: 95  STTNLLIINLAIADLLFIVCCVPFTAT 121
               +L I LA    LFI     F + 
Sbjct: 321 PAALVLQI-LAWG-FLFIFLNGIFGSI 345


>gnl|CDD|218316 pfam04893, Yip1, Yip1 domain.  The Yip1 integral membrane domain
           contains four transmembrane alpha helices. The domain is
           characterized by the motifs DLYGP and GY. The Yip1
           protein is a golgi protein involved in vesicular
           transport that interacts with GTPases.
          Length = 171

 Score = 27.4 bits (61), Expect = 7.6
 Identities = 21/144 (14%), Positives = 56/144 (38%), Gaps = 21/144 (14%)

Query: 50  NDVDAEAMERMVSIIVPIFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADL 109
           +    + +    ++   I   + ++LG F  AL++ ++A  +      +     L++   
Sbjct: 49  SGETIQGLTGGSALGSIIGGYLGLLLGLFLLALLLHLIA--KLFGGDGSFKQT-LSLVGY 105

Query: 110 LFIVCCVPFTATDYVTTCWPFGDVWCKMYLIVVTAYASVYTLVLMSLDRFLAVVHPIASM 169
             +   +       ++       +     L++V   A +++L L+ L   L   H +   
Sbjct: 106 ALLPLILGGLIALLLS-------LLLGALLLLVGLLALIWSLYLLYL--GLKAAHGL--- 153

Query: 170 SVRTERNALKAILITWIVIVLTAV 193
                 +  KA+L+  ++++L  V
Sbjct: 154 ------SKKKALLVALLLLLLLIV 171


>gnl|CDD|220284 pfam09538, FYDLN_acid, Protein of unknown function (FYDLN_acid).
           Members of this family are bacterial proteins with a
           conserved motif [KR]FYDLN, sometimes flanked by a pair
           of CXXC motifs, followed by a long region of low
           complexity sequence in which roughly half the residues
           are Asp and Glu, including multiple runs of five or more
           acidic residues. The function of members of this family
           is unknown.
          Length = 104

 Score = 26.5 bits (59), Expect = 7.6
 Identities = 6/46 (13%), Positives = 25/46 (54%)

Query: 252 EEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEE 297
           EE   E  +++A     ++  K      +E++   +++++++++++
Sbjct: 34  EEVPPEVAKSRAPAADAEDAAKKDEDEEDEDDVVLDDDDDDDDDDD 79


>gnl|CDD|236476 PRK09354, recA, recombinase A; Provisional.
          Length = 349

 Score = 27.8 bits (63), Expect = 7.6
 Identities = 20/39 (51%), Positives = 24/39 (61%)

Query: 255 EEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEE 293
           +E  E     E K++EK  L     EEEEEE+EEEEEEE
Sbjct: 311 KENPELADEIEKKIREKLGLSAAAAEEEEEEDEEEEEEE 349


>gnl|CDD|221739 pfam12730, ABC2_membrane_4, ABC-2 family transporter protein.  This
           family is related to the ABC-2 membrane transporter
           family pfam01061.
          Length = 230

 Score = 27.7 bits (62), Expect = 7.6
 Identities = 19/145 (13%), Positives = 57/145 (39%), Gaps = 15/145 (10%)

Query: 61  VSIIVPIFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTA 120
           + +I+P+   ++ +L F  N         +      ++   ++     LL  +      +
Sbjct: 15  ILLILPLLLALLALLLFGVNGD----TGLSPGSLLLSSFSFLSALFLPLLIAILASLLVS 70

Query: 121 TDY--------VTTCWPFGDVW-CKMYLIVVTAYASVYTLVLMSLDRFLAVVHPIASMSV 171
            ++        ++     G ++  K  L+V+   + + +L+L+ L     ++   +  S+
Sbjct: 71  REFKNGTIKLLLSLPISRGKIFLAK--LLVLLILSLLASLLLLLLSLIAGLLLGGSGFSL 128

Query: 172 RTERNALKAILITWIVIVLTAVPVF 196
                AL  +L+  ++ +L  + + 
Sbjct: 129 SLLLGALLLLLLLSLLALLLILLIL 153


>gnl|CDD|236332 PRK08668, PRK08668, NADH dehydrogenase subunit M; Validated.
          Length = 610

 Score = 28.0 bits (63), Expect = 7.8
 Identities = 13/52 (25%), Positives = 19/52 (36%), Gaps = 14/52 (26%)

Query: 63  IIVPIFFGIIIILGFFG--NALVVVVVAFNQQMRSTTNLLIINLAIADLLFI 112
            I PI +    I   FG  NAL V              L ++   IA +++ 
Sbjct: 481 GIGPIDYEGWEIKTGFGSWNALTV------------FILFVVGFIIAAIIYF 520


>gnl|CDD|234456 TIGR04074, bacter_Hen1, 3' terminal RNA ribose
           2'-O-methyltransferase Hen1.  Members of this protein
           family are bacterial Hen1, a 3' terminal RNA ribose
           2'-O-methyltransferase that acts in bacterial RNA
           repair. All members of the seed alignment belong to a
           cassette with the RNA repair enzyme polynucleotide
           kinase-phosphatase (Pnkp). Chemically similar Hen1 in
           eukaryotes acts instead on small regulatory RNAs
           [Transcription, RNA processing, Protein synthesis, tRNA
           and rRNA base modification].
          Length = 462

 Score = 28.0 bits (63), Expect = 7.9
 Identities = 16/54 (29%), Positives = 20/54 (37%), Gaps = 15/54 (27%)

Query: 274 LGGGGGEEEEEEEEEEEEEEEEEEEIG------GVVVSAV---------DFGSG 312
           L      E EE E EE +EE  E+           VV+A+         D G G
Sbjct: 239 LAEADEAEPEEAETEEAQEEAAEKPPSLNRQRLEAVVAALRESGARSVLDLGCG 292


>gnl|CDD|223751 COG0679, COG0679, Predicted permeases [General function prediction
           only].
          Length = 311

 Score = 27.6 bits (62), Expect = 8.0
 Identities = 9/41 (21%), Positives = 14/41 (34%), Gaps = 4/41 (9%)

Query: 67  IFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIA 107
            F G+ + L  FG   +   V F          L+  L + 
Sbjct: 109 GFLGLPVALSLFGEKGLAYAVIFLI----IGLFLMFTLGVI 145


>gnl|CDD|236410 PRK09202, nusA, transcription elongation factor NusA; Validated.
          Length = 470

 Score = 27.9 bits (63), Expect = 8.1
 Identities = 14/57 (24%), Positives = 25/57 (43%), Gaps = 7/57 (12%)

Query: 246 EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGV 302
            V  EE  E E  +E   +E + + K+ L       E E   +EE+  ++   + G+
Sbjct: 385 YVPVEELLEIEGFDEETVEELRERAKEAL-------ETEALAQEEKLADDLLSLEGL 434



 Score = 27.9 bits (63), Expect = 8.5
 Identities = 12/59 (20%), Positives = 23/59 (38%), Gaps = 4/59 (6%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEE----EEEEEEEEEEE 298
             E     +EE+  ++    +    +   KL   G +  E+  E    E  + E +EE+
Sbjct: 412 ALETEALAQEEKLADDLLSLEGLDRELAFKLAEKGIKTLEDLAEQAVDELIDIEGDEEK 470


>gnl|CDD|191647 pfam06947, DUF1290, Protein of unknown function (DUF1290).  This
          family consists of several bacterial small basic
          proteins of around 100 residues in length. The function
          of this family is unknown.
          Length = 88

 Score = 26.4 bits (59), Expect = 8.1
 Identities = 7/18 (38%), Positives = 12/18 (66%)

Query: 72 IIILGFFGNALVVVVVAF 89
          + I GFFGN L+  ++ +
Sbjct: 37 VFISGFFGNVLLAALLVY 54


>gnl|CDD|145556 pfam02478, Pneumo_phosprot, Pneumovirus phosphoprotein.  This
           family represents the phosphoprotein of Paramyxoviridae,
           a putative RNA polymerase alpha subunit that may
           function in template binding.
          Length = 266

 Score = 27.5 bits (61), Expect = 8.1
 Identities = 20/57 (35%), Positives = 28/57 (49%), Gaps = 3/57 (5%)

Query: 246 EVHEEEEEEEEEEEETKAKETKLKEK-KKLGG--GGGEEEEEEEEEEEEEEEEEEEI 299
           +  E   EEE +  +      KL EK K+L           E EEEEE+E+ EE++I
Sbjct: 210 KAAEMLREEESQRAKIGDGSVKLTEKAKELNKILEDESSSGESEEEEEDEDLEEDDI 266


>gnl|CDD|151173 pfam10669, Phage_Gp23, Protein gp23 (Bacteriophage A118).  This is
           the highly conserved family of the major tail subunit
           protein.
          Length = 121

 Score = 27.0 bits (59), Expect = 8.1
 Identities = 12/51 (23%), Positives = 29/51 (56%), Gaps = 1/51 (1%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
           +EE ++ E E E + KE+K +E+ K      E++   +++ +   ++++ I
Sbjct: 49  KEERDKMETEREKRDKESK-EERDKFISTMNEQQRLMDKQNDMMGQQQQSI 98


>gnl|CDD|165442 PHA03171, PHA03171, UL37 tegument protein; Provisional.
          Length = 499

 Score = 28.1 bits (62), Expect = 8.2
 Identities = 16/32 (50%), Positives = 19/32 (59%), Gaps = 6/32 (18%)

Query: 274 LGGGGGEEEEEEEEE------EEEEEEEEEEI 299
           L    GEE EEE+ +      E EEE+EEEEI
Sbjct: 78  LFAEAGEEAEEEDNDRECPDTEAEEEDEEEEI 109


>gnl|CDD|227931 COG5644, COG5644, Uncharacterized conserved protein [Function
           unknown].
          Length = 869

 Score = 28.1 bits (62), Expect = 8.4
 Identities = 10/50 (20%), Positives = 19/50 (38%)

Query: 249 EEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           + E+ E    +    KE  L+           EE + E E E  + + ++
Sbjct: 135 QPEKNESGNNDHATDKENLLESDASSSNDSESEESDSESEIESSDSDHDD 184


>gnl|CDD|227499 COG5171, YRB1, Ran GTPase-activating protein (Ran-binding protein)
           [Intracellular trafficking and secretion].
          Length = 211

 Score = 27.3 bits (60), Expect = 8.4
 Identities = 11/54 (20%), Positives = 19/54 (35%), Gaps = 10/54 (18%)

Query: 256 EEEEETKAKETKLKEKKKLGGGGGEEEE----------EEEEEEEEEEEEEEEI 299
              EE K +++   E     G  G+  E          E +    +  EE+E +
Sbjct: 38  AGGEEKKVQQSPFLENAVPEGDEGKGPESPNIHFEPVVELQRVHLKTNEEDETV 91


>gnl|CDD|225368 COG2811, NtpF, Archaeal/vacuolar-type H+-ATPase subunit H [Energy
           production and conversion].
          Length = 108

 Score = 26.6 bits (59), Expect = 8.5
 Identities = 19/54 (35%), Positives = 26/54 (48%), Gaps = 9/54 (16%)

Query: 249 EEEEEEEEEEEETKAKETK---LKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEI 299
            EE  E  EE E +A++     L+E ++      E EEE EE   E E+E   I
Sbjct: 38  REEAREIIEEAEEEAEKLAQEILEEARE------EAEEEAEEILAEAEKEASAI 85



 Score = 26.6 bits (59), Expect = 8.6
 Identities = 20/55 (36%), Positives = 26/55 (47%), Gaps = 6/55 (10%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           E+ + E  EE  E  EE + +  KL ++        EE  EE EEE EE   E E
Sbjct: 31  EQIIKEAREEAREIIEEAEEEAEKLAQEIL------EEAREEAEEEAEEILAEAE 79


>gnl|CDD|239657 cd03685, ClC_6_like, ClC-6-like chloride channel proteins. This CD
           includes ClC-6, ClC-7 and ClC-B, C, D in plants.
           Proteins in this family are ubiquitous in eukarotes and
           their functions are unclear. They are expressed in
           intracellular organelles membranes.  This family belongs
           to the ClC superfamily of chloride ion channels, which
           share the unique double-barreled architecture and
           voltage-dependent gating mechanism. The gating is
           conferred by the permeating anion itself, acting as the
           gating charge. ClC chloride ion channel superfamily
           perform a variety of functions including cellular
           excitability regulation, cell volume regulation,
           membrane potential stabilization, acidification of
           intracellular organelles, signal transduction, and
           transepithelial transport in animals.
          Length = 466

 Score = 28.0 bits (63), Expect = 8.6
 Identities = 22/102 (21%), Positives = 36/102 (35%), Gaps = 8/102 (7%)

Query: 38  ICRNFSHEYINDNDVDAEAME-RMVSIIVPIFFGIII-ILGFFGNALVVVVVAFNQQMRS 95
            C  F    +   D  +         +I  +  G+I  +LG   N L   V  F +++  
Sbjct: 251 KCGLFGPGGLIMFDGSSTKYLYTYFELIPFMLIGVIGGLLGALFNHLNHKVTRFRKRINH 310

Query: 96  TTNLL--IINLAIADLLFIVCCVP----FTATDYVTTCWPFG 131
              LL  +  L ++ +  +V        F    Y   CW FG
Sbjct: 311 KGKLLKVLEALLVSLVTSVVAFPQTLLIFFVLYYFLACWTFG 352


>gnl|CDD|148072 pfam06244, DUF1014, Protein of unknown function (DUF1014).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 122

 Score = 26.6 bits (59), Expect = 8.6
 Identities = 12/45 (26%), Positives = 16/45 (35%)

Query: 254 EEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           EE  E+E +  E   ++ KK        EE         EE   E
Sbjct: 6   EERLEKEQRKLEAPEEKSKKNAAREVPLEENLNVNRNRLEEGTVE 50


>gnl|CDD|225381 COG2825, HlpA, Outer membrane protein [Cell envelope biogenesis,
           outer membrane].
          Length = 170

 Score = 27.0 bits (60), Expect = 9.2
 Identities = 12/46 (26%), Positives = 25/46 (54%)

Query: 251 EEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEE 296
           ++E ++ ++E KAKE KL++  K+       + E E ++E+     
Sbjct: 57  QKELQKMQKELKAKEAKLQDDGKMEALSDRAKAEAEIKKEKLVNAF 102


>gnl|CDD|217401 pfam03169, OPT, OPT oligopeptide transporter protein.  The OPT
           family of oligopeptide transporters is distinct from the
           ABC pfam00005 and PTR pfam00854 transporter families.
           OPT transporters were first recognised in fungi (Candida
           albicans and Schizosaccharomyces pombe), but this
           alignment also includes orthologues from Arabidopsis
           thaliana. OPT transporters are thought to have 12-14
           transmembrane domains and contain the following motif:
           SPYxEVRxxVxxxDDP.
          Length = 619

 Score = 27.7 bits (62), Expect = 9.6
 Identities = 9/51 (17%), Positives = 22/51 (43%), Gaps = 2/51 (3%)

Query: 65  VPIFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCC 115
           VP+++ +  ++      L + +VA     +     L++ L +A +  I   
Sbjct: 349 VPMWWYLAGLVLSLV--LGIALVAALFPTQLPVWGLLLALLLAFVFAIPSA 397


>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
          Length = 1068

 Score = 27.7 bits (62), Expect = 9.7
 Identities = 13/62 (20%), Positives = 21/62 (33%)

Query: 246  EVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVS 305
             V E ++E  + EE  +  E  + E + +         E   E E     E E+    V 
Sbjct: 948  PVVEPQDETADIEEAAETAEVVVAEPEVVAQPAAPVVAEVAAEVETVTAVEPEVAPAQVP 1007

Query: 306  AV 307
              
Sbjct: 1008 EA 1009


>gnl|CDD|219922 pfam08595, RXT2_N, RXT2-like, N-terminal.  The family represents
           the N-terminal region of RXT2-like proteins. In S.
           cerevisiae, RXT2 has been demonstrated to be involved in
           conjugation with cellular fusion (mating) and invasive
           growth. A high throughput localisation study has
           localised RXT2 to the nucleus.
          Length = 141

 Score = 27.0 bits (60), Expect = 9.8
 Identities = 9/35 (25%), Positives = 22/35 (62%)

Query: 264 KETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEE 298
           K  K     ++   GG+ ++++E++E++EE + E+
Sbjct: 37  KRIKFNNPPRIDEDGGDIDDDDEDDEDDEEADAED 71


>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
          Length = 651

 Score = 27.7 bits (62), Expect = 9.8
 Identities = 11/31 (35%), Positives = 18/31 (58%)

Query: 244 EREVHEEEEEEEEEEEETKAKETKLKEKKKL 274
           E+E  E  +E++   +  K +E K KE +KL
Sbjct: 556 EKEEKEALKEQKRLRKLKKQEEKKKKELEKL 586


>gnl|CDD|237867 PRK14953, PRK14953, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 486

 Score = 27.5 bits (61), Expect = 9.9
 Identities = 16/48 (33%), Positives = 25/48 (52%), Gaps = 4/48 (8%)

Query: 255 EEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGV 302
           E+ EE+T   E K  +K         E EE ++E+E+E+E  E +  V
Sbjct: 424 EKSEEDTLDLEIKSIKKYF----PFIEFEEVKKEKEKEKERPEFVEKV 467


>gnl|CDD|150531 pfam09871, DUF2098, Uncharacterized protein conserved in archaea
           (DUF2098).  This domain, found in various hypothetical
           prokaryotic proteins, has no known function.
          Length = 91

 Score = 26.1 bits (58), Expect = 9.9
 Identities = 18/47 (38%), Positives = 26/47 (55%), Gaps = 6/47 (12%)

Query: 267 KLKEKKKLGGGGGEEEEEEEEEEEEEEEEEEEIGGVVVSAVDFGSGG 313
           K+KEKK+      E EE++EE  E  ++EEE    V + +   G GG
Sbjct: 51  KVKEKKE------EREEDKEELIERIKKEEETFEDVDLGSAGCGGGG 91


>gnl|CDD|223724 COG0651, HyfB, Formate hydrogenlyase subunit 3/Multisubunit Na+/H+
           antiporter, MnhD subunit [Energy production and
           conversion / Inorganic ion transport and metabolism].
          Length = 504

 Score = 27.7 bits (62), Expect = 9.9
 Identities = 22/148 (14%), Positives = 47/148 (31%), Gaps = 22/148 (14%)

Query: 56  AMERMVSIIVPIFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCC 115
               ++  I+       ++L F G   +           +    L + L        V  
Sbjct: 1   MNNLLLLPIILPLLAAALLLLFLGLRRLKAKRFL-----ALLGALGLLLLSILGFLEV-- 53

Query: 116 VPFTATDYVTTCW--PFGDVWCKMYLIVVTAYASVYTLVLMSLDRFLAVVHPIASMSVRT 173
                       W  PFG       ++     ++++ L++  +   LA ++ I  M    
Sbjct: 54  ---GGPVGGLGGWLAPFG------IVLAADPLSAIFLLIIALV-GLLASLYSIGYMRHEK 103

Query: 174 ERN---ALKAILITWIVIVLTAVPVFNM 198
           ER     L  +L+  ++       +FN+
Sbjct: 104 ERAYYYFLLLLLLAGMLGAFLTGDLFNL 131


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.318    0.136    0.404 

Gapped
Lambda     K      H
   0.267   0.0812    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 17,111,907
Number of extensions: 1766906
Number of successful extensions: 18951
Number of sequences better than 10.0: 1
Number of HSP's gapped: 12019
Number of HSP's successfully gapped: 1615
Length of query: 318
Length of database: 10,937,602
Length adjustment: 97
Effective length of query: 221
Effective length of database: 6,635,264
Effective search space: 1466393344
Effective search space used: 1466393344
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 59 (26.3 bits)