RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy16246
         (118 letters)



>gnl|CDD|217293 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand
           binding domain.  This family is the extracellular ligand
           binding domain of these ion channels. This domain forms
           a pentameric arrangement in the known structure.
          Length = 215

 Score =  110 bits (278), Expect = 1e-31
 Identities = 36/63 (57%), Positives = 44/63 (69%), Gaps = 1/63 (1%)

Query: 50  NADGNFEVTLA-TKATIYYQGLVEWKPPAIYKSSCEIDVEYFPFDEQTCVLKFGSWTYDG 108
            ADG  ++T   T   +Y  G V W PPAIYKSSC +D+ YFPFD+Q C LKFGSWTY+G
Sbjct: 92  KADGIHDITTPNTNVRVYPDGTVLWSPPAIYKSSCPMDLTYFPFDQQNCSLKFGSWTYNG 151

Query: 109 FKV 111
            +V
Sbjct: 152 EEV 154


>gnl|CDD|233155 TIGR00860, LIC, Cation transporter family protein.  The
           Ligand-gated Ion Channel (LIC) Family of
           Neurotransmitter Receptors TC 1.A.9)Members of the LIC
           family of ionotropic neurotransmitter receptors are
           found only in vertebrate and invertebrate animals. They
           exhibit receptor specificity for (1)acetylcholine, (2)
           serotonin, (3) glycine, (4) glutamate and (5)
           g-aminobutyric acid (GABA). All of these receptor
           channels are probably hetero- orhomopentameric. The best
           characterized are the nicotinic acetyl-choline receptors
           which are pentameric channels of a2bgd subunit
           composition. All subunits arehomologous. The three
           dimensional structures of the protein complex in both
           the open and closed configurations have been solved at
           0.9 nm resolution.The channel protein complexes of the
           LIC family preferentially transport cations or anions
           depending on the channel (e.g., the acetylcholine
           receptors are cationselective while glycine receptors
           are anion selective) [Transport and binding proteins,
           Cations and iron carrying compounds].
          Length = 459

 Score = 77.1 bits (190), Expect = 7e-18
 Identities = 20/62 (32%), Positives = 26/62 (41%), Gaps = 2/62 (3%)

Query: 49  ENADGNFEVTLATKATI--YYQGLVEWKPPAIYKSSCEIDVEYFPFDEQTCVLKFGSWTY 106
              D  F     T   +  +  G V + P      +C +D+  FPFD Q C LKF SW Y
Sbjct: 120 NEKDARFHGITMTNVLVRIHPNGSVLYSPRITLTLACPMDLRNFPFDVQNCSLKFESWGY 179

Query: 107 DG 108
             
Sbjct: 180 TT 181


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 39.9 bits (94), Expect = 8e-05
 Identities = 21/48 (43%), Positives = 38/48 (79%)

Query: 2   KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKE 49
           K+ +K ++K E+E+K++++K   G+KKE EEEEEK+KK+ E+++  +E
Sbjct: 414 KIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEE 461



 Score = 39.5 bits (93), Expect = 1e-04
 Identities = 24/51 (47%), Positives = 36/51 (70%)

Query: 2   KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENAD 52
           K   ++ EK+ +E KKE++KK    KK+ EEEEE+K+KK EEK+  +E A+
Sbjct: 413 KKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAE 463



 Score = 39.5 bits (93), Expect = 1e-04
 Identities = 14/46 (30%), Positives = 34/46 (73%)

Query: 1   MKMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKN 46
             ++K  K++EE++++K+++     +K+E EEEE++KK++ +E++ 
Sbjct: 414 KIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEE 459



 Score = 39.1 bits (92), Expect = 1e-04
 Identities = 18/63 (28%), Positives = 38/63 (60%), Gaps = 1/63 (1%)

Query: 2   KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGNFEVTLAT 61
           K  KK  EK EK+R++E+++K+  +   G+++EE+++++ E+K+  KE  +   E     
Sbjct: 410 KKIKKIVEKAEKKREEEKKEKK-KKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEE 468

Query: 62  KAT 64
           +  
Sbjct: 469 EEE 471



 Score = 33.7 bits (78), Expect = 0.011
 Identities = 13/48 (27%), Positives = 29/48 (60%)

Query: 4   KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENA 51
           KK  K+ ++   K E++++E  ++K+ +    KKK++ EE++  K+  
Sbjct: 406 KKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEE 453


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 39.7 bits (92), Expect = 1e-04
 Identities = 17/44 (38%), Positives = 32/44 (72%), Gaps = 2/44 (4%)

Query: 2   KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKK 45
           K+ KKR+E  EK +++ EQK    E++E E+E+EK++++  E++
Sbjct: 576 KLAKKREEAVEKAKREAEQK--AREEREREKEKEKERERERERE 617



 Score = 32.0 bits (72), Expect = 0.060
 Identities = 12/35 (34%), Positives = 25/35 (71%)

Query: 4   KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKK 38
           +K ++E E+K R++ E++KE  +++E E E E ++
Sbjct: 586 EKAKREAEQKAREEREREKEKEKEREREREREAER 620



 Score = 28.5 bits (63), Expect = 0.99
 Identities = 14/38 (36%), Positives = 26/38 (68%), Gaps = 3/38 (7%)

Query: 2   KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKK 39
           K + ++K +EE+ER+KE++K+    ++E E E E+  K
Sbjct: 589 KREAEQKAREEREREKEKEKE---REREREREAERAAK 623


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 36.4 bits (85), Expect = 0.002
 Identities = 18/45 (40%), Positives = 22/45 (48%), Gaps = 2/45 (4%)

Query: 12  EKERKKEEQKKEVGEKKEGEEEEEK--KKKKVEEKKNNKENADGN 54
             ERKK  +K+   EKK  +EE EK   KKK E      +  DG 
Sbjct: 408 PAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGE 452



 Score = 34.1 bits (79), Expect = 0.010
 Identities = 16/44 (36%), Positives = 26/44 (59%), Gaps = 1/44 (2%)

Query: 2   KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKK 45
           K++KK++ K EK+ +KEE +K   +KK     ++ K    E KK
Sbjct: 413 KLRKKQR-KAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKK 455



 Score = 33.4 bits (77), Expect = 0.014
 Identities = 15/59 (25%), Positives = 28/59 (47%)

Query: 4   KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGNFEVTLATK 62
           +KK ++K+ K  KK E+++      + + E   KK K  + +  K + D   E    T+
Sbjct: 411 RKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDPDPLGEKLARTE 469



 Score = 33.4 bits (77), Expect = 0.017
 Identities = 13/39 (33%), Positives = 20/39 (51%)

Query: 1   MKMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKK 39
            +  +K+ EKEE E+   ++K E   KK    + E KK 
Sbjct: 418 QRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKV 456



 Score = 27.2 bits (61), Expect = 2.2
 Identities = 12/41 (29%), Positives = 21/41 (51%), Gaps = 2/41 (4%)

Query: 2   KMKKKRKEKEEKE--RKKEEQKKEVGEKKEGEEEEEKKKKK 40
           K ++K ++K EKE   K   +KK     K+ +  + + KK 
Sbjct: 416 KKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKV 456


>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 926

 Score = 35.7 bits (82), Expect = 0.003
 Identities = 13/51 (25%), Positives = 20/51 (39%)

Query: 4   KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGN 54
           K   + K++ + K    K+   E    E E E+   K  E+K     A  N
Sbjct: 792 KAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAPAILN 842



 Score = 31.5 bits (71), Expect = 0.092
 Identities = 9/42 (21%), Positives = 16/42 (38%)

Query: 4   KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKK 45
           K+ + E    E + E    +  E+K  +       KK +  K
Sbjct: 809 KEIKIESPSVETEGERCTIKQREEKGIDAPAILNVKKKKPYK 850


>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
          Length = 434

 Score = 34.9 bits (81), Expect = 0.004
 Identities = 14/43 (32%), Positives = 27/43 (62%)

Query: 8   KEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKEN 50
           K K   E+K  +  K+V  K+  ++E+EK+K KV+++  + +N
Sbjct: 379 KTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKN 421



 Score = 31.1 bits (71), Expect = 0.11
 Identities = 12/46 (26%), Positives = 21/46 (45%)

Query: 3   MKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNK 48
           ++ K K   EK+  K  +K      ++ E+E+EK K K   +    
Sbjct: 376 LRPKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKN 421



 Score = 29.5 bits (67), Expect = 0.33
 Identities = 11/51 (21%), Positives = 22/51 (43%), Gaps = 2/51 (3%)

Query: 4   KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGN 54
           +KK  +  +K   K  +KKE  ++K   ++  +  K +   K  K +    
Sbjct: 385 EKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNI--GKRRKPSGTSE 433



 Score = 28.0 bits (63), Expect = 1.2
 Identities = 11/44 (25%), Positives = 22/44 (50%)

Query: 5   KKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNK 48
           K   EK+  +  K+   K   +K++ +E+ + KK+  + K   K
Sbjct: 381 KAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGK 424



 Score = 27.6 bits (62), Expect = 1.7
 Identities = 10/48 (20%), Positives = 17/48 (35%), Gaps = 7/48 (14%)

Query: 4   KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENA 51
           K   K  E+KE++KE+ K +   +         K      K +     
Sbjct: 394 KVLAKRAEKKEKEKEKPKVKKRHRD-------TKNIGKRRKPSGTSEE 434


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 34.0 bits (79), Expect = 0.011
 Identities = 16/48 (33%), Positives = 30/48 (62%)

Query: 1   MKMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNK 48
            ++++K +E E   ++ E+ K+E+ EKKE  +EEE K  +  EK+  +
Sbjct: 530 RELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQ 577



 Score = 28.6 bits (65), Expect = 0.64
 Identities = 14/36 (38%), Positives = 22/36 (61%), Gaps = 3/36 (8%)

Query: 11  EEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKN 46
           EE ER+ E++ +E     +   E EK K+++EEKK 
Sbjct: 526 EELERELEQKAEEAEALLK---EAEKLKEELEEKKE 558


>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
          Length = 651

 Score = 33.1 bits (76), Expect = 0.020
 Identities = 11/38 (28%), Positives = 26/38 (68%)

Query: 4   KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKV 41
           +++++EKE  + +K  +K +  E+K+ +E E+ +K K+
Sbjct: 554 QREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAKI 591



 Score = 31.2 bits (71), Expect = 0.095
 Identities = 14/40 (35%), Positives = 28/40 (70%)

Query: 6   KRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKK 45
           K  +KEE +R+KEE++    +K+  + +++++KKK E +K
Sbjct: 546 KLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEK 585



 Score = 30.4 bits (69), Expect = 0.16
 Identities = 13/34 (38%), Positives = 22/34 (64%)

Query: 17  KEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKEN 50
           KEE ++E  EK+  +E++  +K K +E+K  KE 
Sbjct: 550 KEELQREKEEKEALKEQKRLRKLKKQEEKKKKEL 583



 Score = 29.6 bits (67), Expect = 0.30
 Identities = 11/41 (26%), Positives = 31/41 (75%)

Query: 5   KKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKK 45
           K+  ++E++E++  +++K + + K+ EE+++K+ +K+E+ K
Sbjct: 550 KEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAK 590



 Score = 27.3 bits (61), Expect = 2.2
 Identities = 11/37 (29%), Positives = 26/37 (70%)

Query: 1   MKMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKK 37
            + K++++  +E++R ++ +K+E  +KKE E+ E+ K
Sbjct: 554 QREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAK 590


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 32.4 bits (74), Expect = 0.033
 Identities = 18/45 (40%), Positives = 34/45 (75%), Gaps = 1/45 (2%)

Query: 5  KKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKE 49
          ++++EK+E+E K  ++++EV E++E +EE++KK KKV+E     E
Sbjct: 42 EEKEEKKEEEEKTTDKEEEVDEEEE-KEEKKKKTKKVKETTTEWE 85



 Score = 25.9 bits (57), Expect = 5.7
 Identities = 16/41 (39%), Positives = 27/41 (65%)

Query: 11 EEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENA 51
           EKE   EE+++E  EKKE EE+   K+++V+E++  +E  
Sbjct: 31 VEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKK 71



 Score = 25.9 bits (57), Expect = 6.9
 Identities = 16/41 (39%), Positives = 26/41 (63%)

Query: 12 EKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENAD 52
          EKE +KE   +E  E+KE ++EEE+K    EE+ + +E  +
Sbjct: 28 EKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKE 68


>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
          Length = 1036

 Score = 32.5 bits (74), Expect = 0.034
 Identities = 16/50 (32%), Positives = 31/50 (62%)

Query: 2   KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENA 51
           +++K  KE+ E+ER+ EEQ++   EK   E +  + K +VE+++   +N 
Sbjct: 257 ELEKLAKEEAERERQAEEQRRREEEKAAMEADRAQAKAEVEKRREKLQNL 306



 Score = 26.4 bits (58), Expect = 4.6
 Identities = 10/38 (26%), Positives = 21/38 (55%), Gaps = 4/38 (10%)

Query: 2   KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKK 39
           K++++R+ +EE  R K E+      + + E +E+  K 
Sbjct: 438 KLQEERRLREEAIRAKAEKTA----RMKAEMKEKTLKM 471


>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain.  Axonemal
           dynein light chain proteins play a dynamic role in
           flagellar and cilia motility. Eukaryotic cilia and
           flagella are complex organelles consisting of a core
           structure, the axoneme, which is composed of nine
           microtubule doublets forming a cylinder that surrounds a
           pair of central singlet microtubules. This
           ultra-structural arrangement seems to be one of the most
           stable micro-tubular assemblies known and is responsible
           for the flagellar and ciliary movement of a large number
           of organisms ranging from protozoan to mammals. This
           light chain interacts directly with the N-terminal half
           of the heavy chains.
          Length = 189

 Score = 31.8 bits (73), Expect = 0.039
 Identities = 14/49 (28%), Positives = 34/49 (69%)

Query: 3   MKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENA 51
           ++++ K+ EE++ + E++  E+  K E  E+ E++++++EEK++  E A
Sbjct: 125 LEQEIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHADEIA 173



 Score = 29.9 bits (68), Expect = 0.20
 Identities = 18/53 (33%), Positives = 31/53 (58%), Gaps = 6/53 (11%)

Query: 2   KMKKKRKEKEEKERKKEE-QKKEVGEKKEGEEE-EEKKKKKVEE----KKNNK 48
           ++KK  +EKEE E++  E + K    +K  EEE + ++K+  +E    KK N+
Sbjct: 128 EIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHADEIAFLKKQNQ 180


>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.
          This domain is found to the N-terminus of bacterial
          signal peptidases of the S49 family (pfam01343).
          Length = 154

 Score = 31.7 bits (73), Expect = 0.040
 Identities = 14/36 (38%), Positives = 21/36 (58%)

Query: 3  MKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKK 38
          + KK  +  EK  KK E+ K   EKK+ ++EE K +
Sbjct: 63 LDKKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKPR 98



 Score = 26.7 bits (60), Expect = 2.2
 Identities = 9/21 (42%), Positives = 14/21 (66%)

Query: 2  KMKKKRKEKEEKERKKEEQKK 22
          K  +K K K EK++ K+E+ K
Sbjct: 76 KKAEKAKAKAEKKKAKKEEPK 96



 Score = 25.2 bits (56), Expect = 7.4
 Identities = 13/39 (33%), Positives = 19/39 (48%), Gaps = 1/39 (2%)

Query: 7  RKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKK 45
               +K+  K  +K E   +K  + + EKKK K EE K
Sbjct: 59 EAALLDKKELKAWEKAEKKAEKA-KAKAEKKKAKKEEPK 96


>gnl|CDD|218734 pfam05758, Ycf1, Ycf1.  The chloroplast genomes of most higher
           plants contain two giant open reading frames designated
           ycf1 and ycf2. Although the function of Ycf1 is unknown,
           it is known to be an essential gene.
          Length = 832

 Score = 32.3 bits (74), Expect = 0.043
 Identities = 14/62 (22%), Positives = 26/62 (41%), Gaps = 6/62 (9%)

Query: 1   MKMKKKRK-EKEEKERKKE-----EQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGN 54
            K+K+  + E+ E+E   E     E K    E++   EE+     + +E  +  E+ D  
Sbjct: 224 KKLKETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKL 283

Query: 55  FE 56
             
Sbjct: 284 EI 285



 Score = 30.4 bits (69), Expect = 0.22
 Identities = 12/50 (24%), Positives = 20/50 (40%)

Query: 1   MKMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKEN 50
               + +  K+E+E   EE      E+KE  ++ E   K    K+   E 
Sbjct: 244 ETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLEILKEKKDEE 293



 Score = 27.3 bits (61), Expect = 2.0
 Identities = 10/48 (20%), Positives = 17/48 (35%), Gaps = 1/48 (2%)

Query: 4   KKKRKEK-EEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKEN 50
                E   E +  K+EQ+    E      EE++   K E+    +  
Sbjct: 239 TDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLEIL 286



 Score = 27.3 bits (61), Expect = 2.3
 Identities = 15/45 (33%), Positives = 24/45 (53%)

Query: 7   RKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENA 51
            K+KE K     E+K +  EKKE  + EEK++ ++ E  +    A
Sbjct: 648 EKKKEFKILDYTEEKTKKEEKKEKNKREEKERIEIAETWDTIPLA 692



 Score = 26.1 bits (58), Expect = 5.7
 Identities = 14/43 (32%), Positives = 27/43 (62%)

Query: 2   KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEK 44
            MK K+  +   E+KKE +  +  E+K  +EE+++K K+ E++
Sbjct: 636 LMKIKQIFRNWIEKKKEFKILDYTEEKTKKEEKKEKNKREEKE 678


>gnl|CDD|233119 TIGR00763, lon, ATP-dependent protease La.  This protein is induced
           by heat shock and other stresses in E. coli, B.
           subtilis, and other species. The yeast member,
           designated PIM1, is located in the mitochondrial matrix,
           required for mitochondrial function, and also induced by
           heat shock [Protein fate, Degradation of proteins,
           peptides, and glycopeptides].
          Length = 775

 Score = 31.9 bits (73), Expect = 0.050
 Identities = 22/51 (43%), Positives = 34/51 (66%), Gaps = 7/51 (13%)

Query: 2   KMKKKRKEKEEKERKK---EEQ----KKEVGEKKEGEEEEEKKKKKVEEKK 45
           K+ KK +EK EK +++    EQ    KKE+G +K+ ++E EK K+K+EE K
Sbjct: 210 KITKKVEEKMEKTQREYYLREQLKAIKKELGIEKDDKDELEKLKEKLEELK 260


>gnl|CDD|182730 PRK10787, PRK10787, DNA-binding ATP-dependent protease La;
           Provisional.
          Length = 784

 Score = 31.8 bits (72), Expect = 0.064
 Identities = 18/49 (36%), Positives = 28/49 (57%), Gaps = 4/49 (8%)

Query: 5   KKRKEKEEKERKKEEQ----KKEVGEKKEGEEEEEKKKKKVEEKKNNKE 49
           KK+ EK ++E    EQ    +KE+GE  +  +E E  K+K++  K  KE
Sbjct: 218 KKQMEKSQREYYLNEQMKAIQKELGEMDDAPDENEALKRKIDAAKMPKE 266


>gnl|CDD|223542 COG0466, Lon, ATP-dependent Lon protease, bacterial type
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 782

 Score = 31.8 bits (73), Expect = 0.064
 Identities = 19/56 (33%), Positives = 36/56 (64%), Gaps = 7/56 (12%)

Query: 1   MKMKKKRKEKEEKERKK---EEQ----KKEVGEKKEGEEEEEKKKKKVEEKKNNKE 49
            +++KK KE+ EK +++    EQ    +KE+GE  + ++E E+ ++K+E+ K  KE
Sbjct: 212 KRIRKKVKEQMEKSQREYYLREQLKAIQKELGEDDDDKDEVEELREKIEKLKLPKE 267


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 31.5 bits (72), Expect = 0.081
 Identities = 12/49 (24%), Positives = 34/49 (69%), Gaps = 2/49 (4%)

Query: 1   MKMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKE 49
           +K  ++ +++E +E+K+E++K+E   + +  +   ++++K+EEK+  K+
Sbjct: 273 LKAAEEERQEEAQEKKEEKKKEE--REAKLAKLSPEEQRKLEEKERKKQ 319



 Score = 29.9 bits (68), Expect = 0.22
 Identities = 16/47 (34%), Positives = 32/47 (68%), Gaps = 2/47 (4%)

Query: 1   MKMKKKRKEKEEKERKKEEQK--KEVGEKKEGEEEEEKKKKKVEEKK 45
           +K+  +   K +K R++EE+K  K   E+++ E +E+K++KK EE++
Sbjct: 251 LKLSPEVLRKVDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEERE 297


>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
            biogenesis [Translation, ribosomal structure and
            biogenesis].
          Length = 1077

 Score = 31.2 bits (70), Expect = 0.086
 Identities = 12/45 (26%), Positives = 26/45 (57%)

Query: 5    KKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKE 49
            K ++EKE  E  +  +++E+G+K++  E+  +K      K+  K+
Sbjct: 1027 KDQEEKERMESLQRAKEEEIGKKEKEREQRIRKTIHDNYKEMAKK 1071



 Score = 27.0 bits (59), Expect = 2.5
 Identities = 12/44 (27%), Positives = 29/44 (65%)

Query: 2    KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKK 45
            +M+  ++ KEE+  KKE+++++   K   +  +E  KK++++K+
Sbjct: 1034 RMESLQRAKEEEIGKKEKEREQRIRKTIHDNYKEMAKKRLKKKR 1077


>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
           subunit G; Reviewed.
          Length = 197

 Score = 30.8 bits (69), Expect = 0.11
 Identities = 16/52 (30%), Positives = 39/52 (75%), Gaps = 3/52 (5%)

Query: 2   KMKKKRKEKEEKERKKEEQKKE--VGEKKEGEEEE-EKKKKKVEEKKNNKEN 50
           +++K R+E+EE E + E +++E  + E+++ EE+E E++++ +EE+ ++ E+
Sbjct: 127 QIEKARQEREELEERMEWERREEKIDEREDQEEQEREREEQTIEEQSDDSEH 178



 Score = 25.4 bits (55), Expect = 9.1
 Identities = 11/51 (21%), Positives = 33/51 (64%)

Query: 2   KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENAD 52
           +M+ +R+E++  ER+ +E+++   E++  EE+ +  + ++ E+  ++  +D
Sbjct: 141 RMEWERREEKIDEREDQEEQEREREEQTIEEQSDDSEHEIIEQDESETESD 191



 Score = 25.0 bits (54), Expect = 10.0
 Identities = 13/52 (25%), Positives = 32/52 (61%)

Query: 5   KKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGNFE 56
           ++R+EK ++   +EEQ++E  E+   E+ ++ + + +E+ ++  E+ D   E
Sbjct: 145 ERREEKIDEREDQEEQEREREEQTIEEQSDDSEHEIIEQDESETESDDDKTE 196


>gnl|CDD|206063 pfam13892, DBINO, DNA-binding domain.  DBINO is a DNA-binding
           domain found on global transcription activator SNF2L1
           proteins and chromatin re-modelling proteins.
          Length = 140

 Score = 30.3 bits (69), Expect = 0.11
 Identities = 13/35 (37%), Positives = 24/35 (68%)

Query: 6   KRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKK 40
           K+ EKEE+E +K  +K+ + + K+ EE  E K+++
Sbjct: 79  KKNEKEERELRKRAEKEALEQAKKEEELREAKRQQ 113


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 30.4 bits (69), Expect = 0.14
 Identities = 15/48 (31%), Positives = 32/48 (66%)

Query: 7   RKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGN 54
           +K+K+E   + E+ KKE  EK++ + +++K KKK ++ K+ K++   +
Sbjct: 61  KKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDD 108



 Score = 30.4 bits (69), Expect = 0.14
 Identities = 14/42 (33%), Positives = 30/42 (71%)

Query: 4   KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKK 45
           K+K K K++K +KK+++ K+  + K+ ++ E+K +K+ E+K 
Sbjct: 81  KQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKL 122



 Score = 30.0 bits (68), Expect = 0.21
 Identities = 20/63 (31%), Positives = 39/63 (61%)

Query: 2   KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGNFEVTLAT 61
           +++K +KE EEK++ K ++KK   +K + +++++ KK    EKK+ KE  D   ++T + 
Sbjct: 70  EIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSY 129

Query: 62  KAT 64
             T
Sbjct: 130 SET 132



 Score = 29.3 bits (66), Expect = 0.36
 Identities = 13/49 (26%), Positives = 29/49 (59%)

Query: 4   KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENAD 52
            KK+K K++K++ K+++  +  +K E ++E+E + K  +  K+  E   
Sbjct: 86  WKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLS 134



 Score = 27.3 bits (61), Expect = 1.7
 Identities = 14/48 (29%), Positives = 28/48 (58%)

Query: 2   KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKE 49
           K KK +K+K++ + KK+++K +  EKK+ +E E+K +   +       
Sbjct: 87  KKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLS 134



 Score = 27.0 bits (60), Expect = 2.0
 Identities = 17/47 (36%), Positives = 28/47 (59%), Gaps = 1/47 (2%)

Query: 9   EKEEKERKKEEQKKEV-GEKKEGEEEEEKKKKKVEEKKNNKENADGN 54
           E  E ++KK+E  +E+   KKE EE+++ K KK + KK   ++ D  
Sbjct: 56  EYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKK 102


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 30.5 bits (69), Expect = 0.14
 Identities = 15/50 (30%), Positives = 29/50 (58%)

Query: 3   MKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENAD 52
            KK  + K+E +  +E + K   EK + EE E+K+K++ +E K  ++  +
Sbjct: 382 TKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDE 431



 Score = 28.2 bits (63), Expect = 0.95
 Identities = 17/51 (33%), Positives = 29/51 (56%), Gaps = 1/51 (1%)

Query: 3   MKKKRKEKEEKERKKEEQKKEVGEKKEGEEE-EEKKKKKVEEKKNNKENAD 52
           +K+  K+ ++  +K  E K E    +E E + +E+K K+ E +K  KE AD
Sbjct: 371 LKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQAD 421



 Score = 27.8 bits (62), Expect = 1.3
 Identities = 15/51 (29%), Positives = 26/51 (50%)

Query: 2   KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENAD 52
              KK  E +++    EE + +  E+K  +EE EKK+K+  ++   K   D
Sbjct: 380 DYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKD 430



 Score = 27.5 bits (61), Expect = 1.7
 Identities = 10/51 (19%), Positives = 29/51 (56%)

Query: 2   KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENAD 52
           K+ + + E +  E  + + K+E  +++E E++++++  + +EK+   E   
Sbjct: 384 KLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434


>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
           component YidC; Validated.
          Length = 429

 Score = 30.6 bits (69), Expect = 0.16
 Identities = 13/47 (27%), Positives = 23/47 (48%), Gaps = 3/47 (6%)

Query: 2   KMKKKRKE--KEEKERKKEE-QKKEVGEKKEGEEEEEKKKKKVEEKK 45
           ++KK R     E K RKKE  QK+   E++   E  +++   +   +
Sbjct: 331 EIKKTRTAEKNEAKARKKEIAQKRRAAEREINREARQERAAAMARAR 377


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 30.2 bits (68), Expect = 0.18
 Identities = 16/48 (33%), Positives = 27/48 (56%)

Query: 4   KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENA 51
           +K  K+ E+  ++ EE++K+  E K  +  E K K + E +K  KE A
Sbjct: 101 EKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEA 148



 Score = 27.9 bits (62), Expect = 1.2
 Identities = 17/48 (35%), Positives = 26/48 (54%)

Query: 4   KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENA 51
            K + E E +++ KEE KK+  E+ + +   E KKK  E KK  +  A
Sbjct: 132 AKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEA 179



 Score = 26.7 bits (59), Expect = 3.2
 Identities = 13/42 (30%), Positives = 23/42 (54%)

Query: 4   KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKK 45
           + +++   EK  K+ EQ  +  E+K+ + EE K K+  E K 
Sbjct: 93  ELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKA 134



 Score = 26.3 bits (58), Expect = 4.8
 Identities = 14/50 (28%), Positives = 25/50 (50%), Gaps = 2/50 (4%)

Query: 2   KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENA 51
           K  ++R   E+  ++ E+  K+  EK+  ++ EE K K+  E K   E  
Sbjct: 92  KELEQRAAAEKAAKQAEQAAKQAEEKQ--KQAEEAKAKQAAEAKAKAEAE 139



 Score = 26.3 bits (58), Expect = 4.9
 Identities = 16/48 (33%), Positives = 24/48 (50%)

Query: 5   KKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENAD 52
           KK+ E+E K +   E KK+  E K+  E E K K + + K   +E   
Sbjct: 149 KKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKA 196



 Score = 26.0 bits (57), Expect = 5.5
 Identities = 20/60 (33%), Positives = 26/60 (43%), Gaps = 1/60 (1%)

Query: 4   KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGNFEVTLATKA 63
           + K K   E ++K  E KK+  E +   + E K K K EE K   E A        A KA
Sbjct: 155 EAKAKAAAEAKKKAAEAKKK-AEAEAKAKAEAKAKAKAEEAKAKAEAAKAKAAAEAAAKA 213



 Score = 25.6 bits (56), Expect = 8.7
 Identities = 18/42 (42%), Positives = 25/42 (59%), Gaps = 1/42 (2%)

Query: 5   KKRKEKEEKERKKEEQK-KEVGEKKEGEEEEEKKKKKVEEKK 45
           +  K+ EEK+++ EE K K+  E K   E E +KK K E KK
Sbjct: 109 QAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKK 150


>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
           complex aNOP56 subunit; Provisional.
          Length = 414

 Score = 30.3 bits (69), Expect = 0.18
 Identities = 12/35 (34%), Positives = 26/35 (74%), Gaps = 3/35 (8%)

Query: 5   KKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKK 39
            K+K +E+K +K++++KK    KK+G++ ++K +K
Sbjct: 383 PKKKREEKKPQKRKKKKK---RKKKGKKRKKKGRK 414



 Score = 26.5 bits (59), Expect = 3.7
 Identities = 14/43 (32%), Positives = 25/43 (58%), Gaps = 4/43 (9%)

Query: 6   KRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNK 48
           K K  +  ++K+EE+K     +K  ++++ KKK K  +KK  K
Sbjct: 376 KEKYPKPPKKKREEKKP----QKRKKKKKRKKKGKKRKKKGRK 414


>gnl|CDD|218517 pfam05236, TAF4, Transcription initiation factor TFIID component
           TAF4 family.  This region of similarity is found in
           Transcription initiation factor TFIID component TAF4.
          Length = 255

 Score = 30.1 bits (68), Expect = 0.19
 Identities = 13/45 (28%), Positives = 25/45 (55%), Gaps = 6/45 (13%)

Query: 8   KEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENAD 52
           K+KEE+ER+ E        ++ G E+ E+ + K + K+  K  ++
Sbjct: 121 KQKEEEERRVER------RRELGLEDPEQLRLKQKAKEEQKAESE 159



 Score = 27.0 bits (60), Expect = 2.7
 Identities = 8/42 (19%), Positives = 22/42 (52%), Gaps = 3/42 (7%)

Query: 2   KMKKKRKEKEEKER---KKEEQKKEVGEKKEGEEEEEKKKKK 40
           K +++R+ +  +E      E+ + +   K+E + E E+ + +
Sbjct: 123 KEEEERRVERRRELGLEDPEQLRLKQKAKEEQKAESEETRHR 164


>gnl|CDD|236277 PRK08493, PRK08493, NADH dehydrogenase subunit G; Validated.
          Length = 819

 Score = 30.4 bits (69), Expect = 0.20
 Identities = 13/40 (32%), Positives = 20/40 (50%)

Query: 10  KEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKE 49
           KE      EE +++V E+ E   E +++ KK   KK  K 
Sbjct: 461 KEAALSILEEIREKVLEQAEQGCENQEEVKKEVPKKVKKI 500



 Score = 26.2 bits (58), Expect = 5.0
 Identities = 9/40 (22%), Positives = 18/40 (45%)

Query: 5   KKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEK 44
            K       E  +E+  ++  +  E +EE +K+  K  +K
Sbjct: 460 VKEAALSILEEIREKVLEQAEQGCENQEEVKKEVPKKVKK 499



 Score = 25.8 bits (57), Expect = 7.5
 Identities = 11/43 (25%), Positives = 22/43 (51%)

Query: 2   KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEK 44
           K       +E +E+  E+ ++    ++E ++E  KK KK+ E 
Sbjct: 461 KEAALSILEEIREKVLEQAEQGCENQEEVKKEVPKKVKKIPEV 503


>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota).  DNA
           Topoisomerase I (eukaryota), DNA topoisomerase V,
           Vaccina virus topoisomerase, Variola virus
           topoisomerase, Shope fibroma virus topoisomeras.
          Length = 391

 Score = 30.0 bits (68), Expect = 0.25
 Identities = 20/43 (46%), Positives = 28/43 (65%)

Query: 1   MKMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEE 43
           M    KRK K + ER  E+   EV EKK+ +++EEKKKK++E 
Sbjct: 305 MISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIER 347


>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3.  This family of
           proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). SH3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of SH3, AAPs are retained in the ER.
          Length = 196

 Score = 29.6 bits (67), Expect = 0.25
 Identities = 10/37 (27%), Positives = 18/37 (48%)

Query: 9   EKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKK 45
           E ++ +  +E   +E       +EEE  + +K E KK
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKK 194



 Score = 28.5 bits (64), Expect = 0.65
 Identities = 12/35 (34%), Positives = 17/35 (48%)

Query: 5   KKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKK 39
            K  E+   E  +     +  E  EGE++E KKKK
Sbjct: 162 AKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196



 Score = 26.5 bits (59), Expect = 3.3
 Identities = 9/36 (25%), Positives = 17/36 (47%)

Query: 10  KEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKK 45
           K+ K  ++   ++        EEE  + +KK  +KK
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKK 195


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 30.0 bits (67), Expect = 0.28
 Identities = 15/43 (34%), Positives = 30/43 (69%)

Query: 9   EKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENA 51
           E+EE+E ++EE+++E  E++E EEEE ++   +E  +  ++ A
Sbjct: 864 EEEEEEEEEEEEEEEEEEEEEEEEEENEEPLSLEWPETRQKQA 906



 Score = 29.6 bits (66), Expect = 0.39
 Identities = 13/31 (41%), Positives = 25/31 (80%)

Query: 9   EKEEKERKKEEQKKEVGEKKEGEEEEEKKKK 39
           + EE+E ++EE+++E  E++E EEEEE+ ++
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892



 Score = 29.2 bits (65), Expect = 0.52
 Identities = 13/29 (44%), Positives = 24/29 (82%)

Query: 8   KEKEEKERKKEEQKKEVGEKKEGEEEEEK 36
           +E+EE+E ++EE+++E  E++E EEE E+
Sbjct: 864 EEEEEEEEEEEEEEEEEEEEEEEEEENEE 892



 Score = 26.9 bits (59), Expect = 3.6
 Identities = 12/31 (38%), Positives = 24/31 (77%)

Query: 12  EKERKKEEQKKEVGEKKEGEEEEEKKKKKVE 42
           + E ++EE+++E  E++E EEEEE++++  E
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892



 Score = 26.5 bits (58), Expect = 4.4
 Identities = 12/31 (38%), Positives = 25/31 (80%)

Query: 19  EQKKEVGEKKEGEEEEEKKKKKVEEKKNNKE 49
           + ++E  E++E EEEEE+++++ EE++ N+E
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892



 Score = 26.1 bits (57), Expect = 5.8
 Identities = 11/31 (35%), Positives = 25/31 (80%)

Query: 14  ERKKEEQKKEVGEKKEGEEEEEKKKKKVEEK 44
           + ++EE+++E  E++E EEEEE+++++  E+
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892



 Score = 25.3 bits (55), Expect = 9.5
 Identities = 13/47 (27%), Positives = 29/47 (61%)

Query: 10  KEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGNFE 56
           K++++        + G+ +E EEEEE+++++ EE++  +E  + N E
Sbjct: 846 KQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892


>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
          Length = 1066

 Score = 29.5 bits (66), Expect = 0.34
 Identities = 14/54 (25%), Positives = 31/54 (57%), Gaps = 6/54 (11%)

Query: 5  KKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKK------KVEEKKNNKENAD 52
          +++K+KEEK ++KE +K +  +K+   + + ++        K  EKK+ K + +
Sbjct: 19 ERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKKSRKRDVE 72



 Score = 29.5 bits (66), Expect = 0.45
 Identities = 15/55 (27%), Positives = 28/55 (50%)

Query: 2  KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGNFE 56
          K ++K KEKE K+ K  +++ +   + +   +     KK E+K   ++  D N E
Sbjct: 23 KKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKKSRKRDVEDENPE 77



 Score = 26.4 bits (58), Expect = 4.4
 Identities = 14/27 (51%), Positives = 16/27 (59%)

Query: 23 EVGEKKEGEEEEEKKKKKVEEKKNNKE 49
             EKK   EEE ++KKK EEK   KE
Sbjct: 6  SEAEKKILTEEELERKKKKEEKAKEKE 32



 Score = 25.6 bits (56), Expect = 7.4
 Identities = 14/62 (22%), Positives = 31/62 (50%)

Query: 3  MKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGNFEVTLATK 62
            +   E E+K   +EE +++  ++++ +E+E KK K  +++   K  A    + T   K
Sbjct: 1  GSRTESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPK 60

Query: 63 AT 64
           +
Sbjct: 61 KS 62


>gnl|CDD|185768 cd09245, BRO1_UmRIM23-like, Protein-interacting, Bro1-like domain
           of Ustilago maydis Rim23 (PalC), and related domains.
           This family contains the Bro1-like domain of Ustilago
           maydis Rim23 (also known as PalC), and related proteins.
           It belongs to the BRO1_Alix_like superfamily which
           includes the Bro1-like domains of mammalian Alix
           (apoptosis-linked gene-2 interacting protein X),
           His-Domain type N23 protein tyrosine phosphatase
           (HD-PTP, also known as PTPN23), RhoA-binding proteins
           Rhophilin-1 and -2, Brox, Bro1 and Rim20 (also known as
           PalA) from Saccharomyces cerevisiae, and related
           domains. Alix, HD-PTP, Brox, Bro1, Rim20, and Rim23
           interact with the ESCRT (Endosomal Sorting Complexes
           Required for Transport) system. Rim20 and Rim23
           participate in the response to the external pH via the
           Rim101 pathway. Through its Bro1-like domain, Rim23
           allows the interaction between the endosomal and plasma
           membrane complexes. Bro1-like domains are
           boomerang-shape, and part of the domain is a
           tetratricopeptide repeat (TPR)-like structure.
           Intermediates in the Rim101 pathway may play roles in
           the pathogenesis of fungal corneal infection during
           Candida albicans keratitis. This family lacks the
           V-shaped (V) domain found in many members of the
           BRO1_Alix_like superfamily.
          Length = 413

 Score = 29.3 bits (66), Expect = 0.39
 Identities = 16/54 (29%), Positives = 26/54 (48%), Gaps = 4/54 (7%)

Query: 4   KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGNFEV 57
             K K K+  + K+E++K E G       EEE +  ++  KK  K N   +F+ 
Sbjct: 336 ASKAKLKKSWKEKREDRKVEKGAGV----EEELRTLEMLLKKYKKMNDTVSFQP 385


>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein.  The YqfQ-like protein family
           includes the B. subtilis YqfQ protein, also known as
           VrrA, which is functionally uncharacterized. This family
           of proteins is found in bacteria. Proteins in this
           family are typically between 146 and 237 amino acids in
           length. There are two conserved sequence motifs: QYGP
           and PKLY.
          Length = 155

 Score = 29.0 bits (65), Expect = 0.40
 Identities = 10/37 (27%), Positives = 18/37 (48%)

Query: 4   KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKK 40
           +   + K E + KK+ +  +   +KE  + E KK K 
Sbjct: 112 EDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKP 148


>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6.  RPL6
           contains KOW motif that has an extra ribosomal role as
           an oncogenic. KOW domain is known as an RNA-binding
           motif that is shared so far among some families of
           ribosomal proteins, the essential bacterial
           transcriptional elongation factor NusG, the eukaryotic
           chromatin elongation factor Spt5, the higher eukaryotic
           KIN17 proteins and Mtr4. .
          Length = 152

 Score = 29.1 bits (66), Expect = 0.41
 Identities = 12/37 (32%), Positives = 25/37 (67%)

Query: 16  KKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENAD 52
           K++++KK+  ++ E  EE++KK    EE+K +++  D
Sbjct: 82  KRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKAVD 118


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 29.3 bits (66), Expect = 0.41
 Identities = 15/64 (23%), Positives = 27/64 (42%), Gaps = 2/64 (3%)

Query: 2   KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGNFEVTLAT 61
           + + K      K+ KKE++  E   KK+ E++++KKK+K E K   +             
Sbjct: 44  EEENKVATTSTKKDKKEDKNNE--SKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKK 101

Query: 62  KATI 65
               
Sbjct: 102 TKKK 105



 Score = 27.4 bits (61), Expect = 1.7
 Identities = 14/56 (25%), Positives = 22/56 (39%), Gaps = 1/56 (1%)

Query: 10 KEEKERKKEEQKKEVGEKK-EGEEEEEKKKKKVEEKKNNKENADGNFEVTLATKAT 64
             +E  K        +KK +   E +KK +K ++KK  K+      E  L  K  
Sbjct: 41 TFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTP 96



 Score = 27.4 bits (61), Expect = 1.7
 Identities = 15/49 (30%), Positives = 26/49 (53%)

Query: 2   KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKEN 50
           K  K  + K++ E+KK+++K++   K EGE +   K  K  +K   K  
Sbjct: 59  KEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPP 107



 Score = 25.8 bits (57), Expect = 5.8
 Identities = 14/48 (29%), Positives = 30/48 (62%)

Query: 2   KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKE 49
           K K ++K+K++KE+K+ + + E     +  ++ +K KKK  + K N++
Sbjct: 67  KKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNED 114



 Score = 25.8 bits (57), Expect = 6.5
 Identities = 17/49 (34%), Positives = 30/49 (61%)

Query: 4   KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENAD 52
            KK+ EK++K++K++++ K  GE K G +  +K KK  ++    K N D
Sbjct: 66  SKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNED 114



 Score = 25.8 bits (57), Expect = 7.0
 Identities = 16/51 (31%), Positives = 30/51 (58%)

Query: 1   MKMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENA 51
            K +KK+K+K+EK+  K E + ++G K   + ++ KKK    +   + +NA
Sbjct: 68  KKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNA 118


>gnl|CDD|232965 TIGR00414, serS, seryl-tRNA synthetase.  This model represents the
           seryl-tRNA synthetase found in most organisms. This
           protein is a class II tRNA synthetase, and is recognized
           by the pfam model tRNA-synt_2b. The seryl-tRNA
           synthetases of two archaeal species, Methanococcus
           jannaschii and Methanobacterium thermoautotrophicum,
           differ considerably and are included in a different
           model [Protein synthesis, tRNA aminoacylation].
          Length = 418

 Score = 29.3 bits (66), Expect = 0.46
 Identities = 19/66 (28%), Positives = 32/66 (48%), Gaps = 4/66 (6%)

Query: 2   KMKKKRKEKEEKERKKEEQKKEVGEKK----EGEEEEEKKKKKVEEKKNNKENADGNFEV 57
           + KK   E EE + K+ E  K++G+ K    +  EE +K+ K+++E+      A    E 
Sbjct: 38  ERKKLLSEIEELQAKRNELSKQIGKAKGQKKDKIEEIKKELKELKEELTELSAALKALEA 97

Query: 58  TLATKA 63
            L  K 
Sbjct: 98  ELQDKL 103


>gnl|CDD|235229 PRK04156, gltX, glutamyl-tRNA synthetase; Provisional.
          Length = 567

 Score = 29.0 bits (66), Expect = 0.47
 Identities = 11/28 (39%), Positives = 19/28 (67%), Gaps = 1/28 (3%)

Query: 18 EEQKKEVGEKKEGEEEEEKKKKKVEEKK 45
          EEQ++ + E+   E  EE+++KK E+K 
Sbjct: 64 EEQRERL-EELAPELLEEEEEKKEEKKG 90


>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
          Length = 1832

 Score = 28.9 bits (65), Expect = 0.63
 Identities = 16/64 (25%), Positives = 25/64 (39%), Gaps = 10/64 (15%)

Query: 3    MKKKRKEKEEKERKKEEQKKEV----------GEKKEGEEEEEKKKKKVEEKKNNKENAD 52
              + ++EKE   + + E  KE            EK   E+  E   KK + KK  K N +
Sbjct: 1521 ENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAESDIKKRKNKKQYKSNTE 1580

Query: 53   GNFE 56
               +
Sbjct: 1581 AELD 1584



 Score = 25.4 bits (56), Expect = 9.7
 Identities = 11/25 (44%), Positives = 19/25 (76%)

Query: 2   KMKKKRKEKEEKERKKEEQKKEVGE 26
           K KKK K+KEE+ +++E+ + E+ E
Sbjct: 747 KKKKKEKKKEEEYKREEKARIEIAE 771


>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
           partitioning / Cytoskeleton].
          Length = 373

 Score = 28.8 bits (65), Expect = 0.65
 Identities = 15/54 (27%), Positives = 31/54 (57%)

Query: 1   MKMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGN 54
           +K K   K +E+++R +E ++  + E+KE   + E+ +KK+E+ +   E    N
Sbjct: 318 LKKKFTEKIREKEKRLEELEQNLIEERKELNSKLEEIQKKLEDLEKRLEKLKSN 371


>gnl|CDD|240341 PTZ00272, PTZ00272, heat shock protein 83 kDa (Hsp83); Provisional.
          Length = 701

 Score = 28.9 bits (64), Expect = 0.73
 Identities = 14/35 (40%), Positives = 26/35 (74%)

Query: 9   EKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEE 43
           E++ K+  ++ ++ +V E KEG+E ++KK KKV+E
Sbjct: 220 EEDTKKADEDGEEPKVEEVKEGDEGKKKKTKKVKE 254


>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex.  This
          entry is characterized by proteins with alternating
          conserved and low-complexity regions. Bud13 together
          with Snu17p and a newly identified factor,
          Pml1p/Ylr016c, form a novel trimeric complex. called
          The RES complex, pre-mRNA retention and splicing
          complex. Subunits of this complex are not essential for
          viability of yeasts but they are required for efficient
          splicing in vitro and in vivo. Furthermore,
          inactivation of this complex causes pre-mRNA leakage
          from the nucleus. Bud13 contains a unique,
          phylogenetically conserved C-terminal region of unknown
          function.
          Length = 141

 Score = 28.0 bits (63), Expect = 0.73
 Identities = 19/56 (33%), Positives = 35/56 (62%), Gaps = 4/56 (7%)

Query: 1  MKMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGNFE 56
           K ++K +EKEEKERK+E++K    E  +G  ++E+++K++EE +  K      + 
Sbjct: 16 EKREEKEREKEEKERKEEKEK----EWGKGLVQKEEREKRLEELEKAKNKPLARYA 67


>gnl|CDD|153340 cd07656, F-BAR_srGAP, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Slit-Robo GTPase
           Activating Proteins.  F-BAR domains are dimerization
           modules that bind and bend membranes and are found in
           proteins involved in membrane dynamics and actin
           reorganization. Slit-Robo GTPase Activating Proteins
           (srGAPs) are Rho GAPs that interact with Robo1, the
           transmembrane receptor of Slit proteins. Slit proteins
           are secreted proteins that control axon guidance and the
           migration of neurons and leukocytes. Vertebrates contain
           three isoforms of srGAPs, all of which are expressed
           during embryonic and early development in the nervous
           system but with different localization and timing.
           srGAPs contain an N-terminal F-BAR domain, a Rho GAP
           domain, and a C-terminal SH3 domain. F-BAR domains form
           banana-shaped dimers with a positively-charged concave
           surface that binds to negatively-charged lipid
           membranes. They can induce membrane deformation in the
           form of long tubules.
          Length = 241

 Score = 28.5 bits (64), Expect = 0.76
 Identities = 16/45 (35%), Positives = 28/45 (62%), Gaps = 4/45 (8%)

Query: 5   KKRKEKEEKERKKEEQKKEVGEK----KEGEEEEEKKKKKVEEKK 45
           +K KE E++E K+E+  ++  E+    K+ E+E EK++ K  E K
Sbjct: 152 RKLKEAEKQEEKQEQSPEKKLERSRSSKKIEKEVEKRQAKYSEAK 196



 Score = 25.4 bits (56), Expect = 8.9
 Identities = 13/44 (29%), Positives = 24/44 (54%), Gaps = 3/44 (6%)

Query: 8   KEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVE---EKKNNK 48
           K  E K ++ E+Q+++  +  E + E  +  KK+E   EK+  K
Sbjct: 148 KSAERKLKEAEKQEEKQEQSPEKKLERSRSSKKIEKEVEKRQAK 191


>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3.  This family
           of proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). Shr3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of Shr3, AAPs are retained in the ER.
          Length = 196

 Score = 28.1 bits (63), Expect = 0.89
 Identities = 12/38 (31%), Positives = 19/38 (50%), Gaps = 2/38 (5%)

Query: 4   KKKRKEKEEKERKKEEQKKEVGEKKEGEEE--EEKKKK 39
           +K  K+KEE   ++ ++      KK    +  E KKKK
Sbjct: 159 RKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196



 Score = 26.2 bits (58), Expect = 4.7
 Identities = 10/39 (25%), Positives = 19/39 (48%), Gaps = 2/39 (5%)

Query: 7   RKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKK 45
           RK+ ++KE    E++KE        ++    +K   +KK
Sbjct: 159 RKDAKQKEEFAAEERKE--ALAAAAKKSATPQKVETKKK 195



 Score = 25.4 bits (56), Expect = 8.8
 Identities = 8/37 (21%), Positives = 18/37 (48%)

Query: 12  EKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNK 48
           E++  K++++    E+KE      KK    ++ +  K
Sbjct: 158 ERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKK 194


>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
           Mitofilin controls mitochondrial cristae morphology.
           Mitofilin is enriched in the narrow space between the
           inner boundary and the outer membranes, where it forms a
           homotypic interaction and assembles into a large
           multimeric protein complex. The first 78 amino acids
           contain a typical amino-terminal-cleavable mitochondrial
           presequence rich in positive-charged and hydroxylated
           residues and a membrane anchor domain. In addition, it
           has three centrally located coiled coil domains.
          Length = 493

 Score = 28.5 bits (64), Expect = 0.90
 Identities = 11/50 (22%), Positives = 25/50 (50%), Gaps = 3/50 (6%)

Query: 2   KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEE---EKKKKKVEEKKNNK 48
            +K+KR+E   K  ++   + E  E    ++     E++K+++ +K   K
Sbjct: 193 ALKEKREELLSKLEEELLARLESKEAALEKQLRLEFEREKEELRKKYEEK 242


>gnl|CDD|221268 pfam11853, DUF3373, Protein of unknown function (DUF3373).  This
           family of proteins are functionally uncharacterized.
           This protein is found in bacteria. Proteins in this
           family are typically between 472 to 574 amino acids in
           length.
          Length = 485

 Score = 28.3 bits (63), Expect = 0.91
 Identities = 18/75 (24%), Positives = 28/75 (37%), Gaps = 4/75 (5%)

Query: 12  EKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKE---NADGNFEV-TLATKATIYY 67
           +  +K E  KKE+ E K   ++ +K+  K E+K          D  F V +L  K     
Sbjct: 28  DLLQKIEALKKELAELKAQLKDLQKRVDKTEKKSAGDRLTFGGDFRFRVDSLNYKTAALG 87

Query: 68  QGLVEWKPPAIYKSS 82
           Q +         K  
Sbjct: 88  QLMGFNPMVPGQKYK 102


>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
           (TAF4) is one of several TAFs that bind TBP and is
           involved in forming Transcription Factor IID (TFIID)
           complex.  The TATA Binding Protein (TBP) Associated
           Factor 4 (TAF4) is one of several TAFs that bind TBP and
           are involved in forming the Transcription Factor IID
           (TFIID) complex. TFIID is one of seven General
           Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
           TFIIF, and TFIID) that are involved in accurate
           initiation of transcription by RNA polymerase II in
           eukaryote. TFIID plays an important role in the
           recognition of promoter DNA and assembly of the
           pre-initiation complex. TFIID complex is composed of the
           TBP and at least 13 TAFs. TAFs from various species were
           originally named by their predicted molecular weight or
           their electrophoretic mobility in polyacrylamide gels. A
           new, unified nomenclature for the pol II TAFs has been
           suggested to show the relationship between TAF orthologs
           and paralogs. Several hypotheses are proposed for TAFs
           functions such as serving as activator-binding sites,
           core-promoter recognition or a role in essential
           catalytic activity. Each TAF, with the help of a
           specific activator, is required only for the expression
           of subset of genes and is not universally involved for
           transcription as are GTFs. In yeast and human cells,
           TAFs have been found as components of other complexes
           besides TFIID.   Several TAFs interact via histone-fold
           (HFD) motifs; HFD is the interaction motif involved in
           heterodimerization of the core histones and their
           assembly into nucleosome octamers. The minimal HFD
           contains three alpha-helices linked by two loops and is
           found in core histones, TAFS and many other
           transcription factors. TFIID has a histone octamer-like
           substructure. TAF4 domain interacts with TAF12 and makes
           a novel histone-like heterodimer that binds DNA and has
           a core promoter function of a subset of genes.
          Length = 212

 Score = 28.1 bits (63), Expect = 1.0
 Identities = 9/46 (19%), Positives = 27/46 (58%), Gaps = 1/46 (2%)

Query: 7   RKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENAD 52
           + E+EE+E+ ++E+++E   +      E+ + K+  ++   +E+ +
Sbjct: 121 QLEREEEEK-RDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEE 165


>gnl|CDD|221049 pfam11262, Tho2, Transcription factor/nuclear export subunit
          protein 2.  THO and TREX form a eukaryotic complex
          which functions in messenger ribonucleoprotein
          metabolism and plays a role in preventing the
          transcription-associated genetic instability. Tho2,
          along with four other subunits forms THO.
          Length = 296

 Score = 28.1 bits (63), Expect = 1.1
 Identities = 17/47 (36%), Positives = 26/47 (55%)

Query: 1  MKMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNN 47
                 K+K+EK+R K   KK   E K+  E  EK KK++ E+K++
Sbjct: 45 SSSSGIDKKKKEKKRLKSLIKKLEEELKKHIEHNEKTKKRLSEEKSS 91


>gnl|CDD|199838 cd10280, PQQ_mGDH, Membrane-bound PQQ-dependent glucose
           dehydrogenase.  This bacterial subfamily of enzymes
           belongs to the dehydrogenase family with
           pyrroloquinoline quinone (PQQ) as cofactor, and is the
           only subfamily that is bound to the membrane. Glucose
           dehydrogenase converts D-glucose to
           D-glucono-1,5-lactone in a reaction that is coupled with
           the respiratory chain in the periplasmic oxidation of
           sugars and alcohols in gram-negative bacteria.
           Ubiquinone functions as the electron acceptor. The
           alignment model contains an 8-bladed beta-propeller.
          Length = 616

 Score = 28.3 bits (64), Expect = 1.1
 Identities = 11/30 (36%), Positives = 17/30 (56%)

Query: 89  YFPFDEQTCVLKFGSWTYDGFKVPGSVNGA 118
             PFD+  C ++F S  Y+G   P S++G 
Sbjct: 378 ATPFDQLACRIQFRSLRYEGLFTPPSLDGT 407


>gnl|CDD|235461 PRK05431, PRK05431, seryl-tRNA synthetase; Provisional.
          Length = 425

 Score = 28.1 bits (64), Expect = 1.1
 Identities = 13/52 (25%), Positives = 27/52 (51%), Gaps = 3/52 (5%)

Query: 4  KKKRKEKEEKERKKEEQK---KEVGEKKEGEEEEEKKKKKVEEKKNNKENAD 52
          +++R+ + E E  + E+    KE+G+ K   E+ E    +V+E K   +  +
Sbjct: 35 EERRELQTELEELQAERNALSKEIGQAKRKGEDAEALIAEVKELKEEIKALE 86



 Score = 25.8 bits (58), Expect = 5.5
 Identities = 11/43 (25%), Positives = 25/43 (58%), Gaps = 2/43 (4%)

Query: 2  KMKKKRKEKEEKERKKEEQKKEVGEK-KEGEEEEEKKKKKVEE 43
          ++ + +++ E+ E    E  KE+ E+ K  E E ++ + ++EE
Sbjct: 57 EIGQAKRKGEDAEALIAEV-KELKEEIKALEAELDELEAELEE 98


>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
          Length = 1465

 Score = 28.3 bits (63), Expect = 1.1
 Identities = 11/43 (25%), Positives = 21/43 (48%)

Query: 10   KEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENAD 52
            K E+ R+K ++    GE    ++   +  KK   KK  K+ ++
Sbjct: 1174 KAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASE 1216


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 27.9 bits (62), Expect = 1.2
 Identities = 20/45 (44%), Positives = 34/45 (75%)

Query: 2   KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKN 46
           + KKK+KEK ++E K  + K+E  EK+  +E+E++K+KKVEE ++
Sbjct: 115 EEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRD 159



 Score = 27.5 bits (61), Expect = 1.9
 Identities = 15/51 (29%), Positives = 31/51 (60%)

Query: 4   KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGN 54
            +  KE+E+++ + +E+KK+  EK + E ++ K K++ +EK+  KE     
Sbjct: 100 NESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEK 150



 Score = 27.2 bits (60), Expect = 2.1
 Identities = 20/46 (43%), Positives = 33/46 (71%)

Query: 4   KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKE 49
           K+K + KEEK++KKE+ K+E  ++K  EE +EK+  K +EK+  K+
Sbjct: 108 KEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKK 153



 Score = 27.2 bits (60), Expect = 2.5
 Identities = 13/48 (27%), Positives = 33/48 (68%)

Query: 2   KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKE 49
           ++K+++K+K+EK +++ + +K   E KE    +EK+K+K ++ +  ++
Sbjct: 112 QVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRD 159



 Score = 27.2 bits (60), Expect = 2.5
 Identities = 17/41 (41%), Positives = 27/41 (65%)

Query: 5   KKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKK 45
            K +  +E+E++KE+ K+E  +KKE  +EE K +K  EE K
Sbjct: 98  PKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAK 138



 Score = 25.6 bits (56), Expect = 7.3
 Identities = 17/48 (35%), Positives = 33/48 (68%)

Query: 2   KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKE 49
           K +KK+K+++ KE  K+ + KE  ++K   +E+EK+K+K  E+  ++E
Sbjct: 114 KEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDRE 161



 Score = 25.6 bits (56), Expect = 8.7
 Identities = 16/45 (35%), Positives = 23/45 (51%)

Query: 5   KKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKE 49
           KKR+    K R K+  KK+   KK+   EEEK+++   E    K 
Sbjct: 164 KKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKP 208



 Score = 25.6 bits (56), Expect = 8.9
 Identities = 13/46 (28%), Positives = 23/46 (50%)

Query: 4   KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKE 49
           K+K KEK+ +E +  E++K+    +     ++  KKK   KK    
Sbjct: 146 KEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPP 191


>gnl|CDD|220428 pfam09826, Beta_propel, Beta propeller domain.  Members of this
           family comprise secreted bacterial proteins containing
           C-terminal beta-propeller domain distantly related to
           WD-40 repeats.
          Length = 521

 Score = 28.0 bits (63), Expect = 1.2
 Identities = 19/80 (23%), Positives = 27/80 (33%), Gaps = 11/80 (13%)

Query: 48  KENADGNFEVTLATKATIY------YQGLVEWKPPAIYKSSCEIDVEYFPFDEQTCVLKF 101
                 N    L + +T+Y      Y     W P          + E     E+T + KF
Sbjct: 195 TSGEIVNSTSLLGSGSTVYMSENNLYVASSHWYPERALPLRLRAEAEPQEETEKTEIYKF 254

Query: 102 ----GSWTYDGF-KVPGSVN 116
               G  TY G  +VPG + 
Sbjct: 255 SLDDGKVTYVGSGEVPGRLL 274


>gnl|CDD|220615 pfam10175, MPP6, M-phase phosphoprotein 6.  This is a family of
          M-phase phosphoprotein 6s which is necessary for
          generation of the 3' end of the 5.8S rRNA precursor. It
          preferentially binds to poly(C) and poly(U).
          Length = 104

 Score = 27.1 bits (60), Expect = 1.3
 Identities = 13/45 (28%), Positives = 26/45 (57%)

Query: 1  MKMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKK 45
          MK  ++ K+KEEK+   EE++  + E+ + +  +E +   +EE  
Sbjct: 8  MKFMQRTKDKEEKKADDEEERALISEEIDLKMLKETENFIIEEPS 52


>gnl|CDD|218652 pfam05602, CLPTM1, Cleft lip and palate transmembrane protein 1
           (CLPTM1).  This family consists of several eukaryotic
           cleft lip and palate transmembrane protein 1 sequences.
           Cleft lip with or without cleft palate is a common birth
           defect that is genetically complex. The nonsyndromic
           forms have been studied genetically using linkage and
           candidate-gene association studies with only partial
           success in defining the loci responsible for orofacial
           clefting. CLPTM1 encodes a transmembrane protein and has
           strong homology to two Caenorhabditis elegans genes,
           suggesting that CLPTM1 may belong to a new gene family.
           This family also contains the human cisplatin resistance
           related protein CRR9p which is associated with
           CDDP-induced apoptosis.
          Length = 437

 Score = 28.0 bits (63), Expect = 1.3
 Identities = 17/82 (20%), Positives = 33/82 (40%), Gaps = 14/82 (17%)

Query: 13  KERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGNFEVTLATKATIYYQGLVE 72
             +KK ++K  +G K E EE EE+K    ++  ++      N  +++     ++    + 
Sbjct: 148 LPKKKVKKKNLLGGKSEKEEPEEEKTPAPDKIVSHWHP---NLTLSVVVDFGVFSLNQI- 203

Query: 73  WKPPAIYKSSCEIDVEYFPFDE 94
             PPAI          Y   + 
Sbjct: 204 --PPAI--------ARYLRVEP 215


>gnl|CDD|236995 PRK11824, PRK11824, polynucleotide phosphorylase/polyadenylase;
           Provisional.
          Length = 693

 Score = 27.7 bits (63), Expect = 1.4
 Identities = 14/36 (38%), Positives = 24/36 (66%)

Query: 8   KEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEE 43
            +K+E+E   +  K+EV E    EEEEE+ +K+++E
Sbjct: 260 TDKQEREAALDAIKEEVLEALAAEEEEEEDEKEIKE 295


>gnl|CDD|234052 TIGR02900, spore_V_B, stage V sporulation protein B.  SpoVB is the
           stage V sporulation protein B of the bacterial endopore
           formation program in Bacillus subtilis and various other
           Firmcutes. It is nearly universal among
           endospore-formers. Paralogs with rather high sequence
           similarity to SpoVB exist, including YkvU in B. subtilis
           and a number of proteins in the genus Clostridium.
           Member sequences for the seed alignment were chosen to
           select those proteins, no more than one to a genome,
           closest to B. subtilis SpoVB in a neighbor joining tree
           [Cellular processes, Sporulation and germination].
          Length = 488

 Score = 27.6 bits (62), Expect = 1.4
 Identities = 7/13 (53%), Positives = 9/13 (69%)

Query: 66  YYQGLVEWKPPAI 78
           Y+QG+   KPPA 
Sbjct: 138 YFQGISNMKPPAY 150


>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
           RPA34.5.  This is a family of proteins conserved from
           yeasts to human. Subunit A34.5 of RNA polymerase I is a
           non-essential subunit which is thought to help Pol I
           overcome topological constraints imposed on ribosomal
           DNA during the process of transcription.
          Length = 193

 Score = 27.4 bits (61), Expect = 1.4
 Identities = 21/47 (44%), Positives = 34/47 (72%), Gaps = 2/47 (4%)

Query: 2   KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNK 48
              +K  E EE+E+K++++KKEV  KKE +E+++KK+K VE K + K
Sbjct: 141 AKVEKEAEVEEEEKKEKKKKKEV--KKEKKEKKDKKEKMVEPKGSKK 185



 Score = 27.0 bits (60), Expect = 2.1
 Identities = 15/36 (41%), Positives = 28/36 (77%)

Query: 4   KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKK 39
           KKK++ K+EK+ KK++++K V  K   +++++KKKK
Sbjct: 158 KKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKKK 193


>gnl|CDD|223095 COG0016, PheS, Phenylalanyl-tRNA synthetase alpha subunit
           [Translation, ribosomal structure and biogenesis].
          Length = 335

 Score = 27.6 bits (62), Expect = 1.5
 Identities = 19/68 (27%), Positives = 29/68 (42%), Gaps = 5/68 (7%)

Query: 3   MKKKRKEKEEKERKKEEQKKEVGE-KKEGEEEEEKKKKKVEEKKNNKENADGNFEVTLAT 61
           +KK  K    +ERK  E    + E KKE E+   +   ++E     +  A    +VTL  
Sbjct: 45  LKKLGKLSPLEERK--EVGALINELKKEVEDAITELTPELEAAGLWERLAFEKIDVTLPG 102

Query: 62  KATIYYQG 69
           +    Y G
Sbjct: 103 RR--IYPG 108


>gnl|CDD|184287 PRK13735, PRK13735, conjugal transfer mating pair stabilization
           protein TraG; Provisional.
          Length = 942

 Score = 27.8 bits (62), Expect = 1.5
 Identities = 9/46 (19%), Positives = 21/46 (45%), Gaps = 2/46 (4%)

Query: 1   MKMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKN 46
           ++   K +   +  +  EE  K   E+  G +  E+  K+ +E ++
Sbjct: 895 LQNHHKTEALSQNNKYNEE--KSAQERMPGADSPEELMKRAKEYQD 938


>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207).  This
           family is found in eukaryotes; it has several conserved
           tryptophan residues. The function is not known.
          Length = 261

 Score = 27.4 bits (61), Expect = 1.6
 Identities = 12/45 (26%), Positives = 30/45 (66%), Gaps = 7/45 (15%)

Query: 3   MKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNN 47
           +KK +++++++E ++ +Q+K+       ++EEE++K+K EE    
Sbjct: 189 LKKLKQQQQKREEERRKQRKK-------QQEEEERKQKAEEAWQK 226


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
           Fas-binding protein) is thought to play a role in
           apoptosis, but precise role played by Daxx remains to be
           determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 27.6 bits (61), Expect = 1.7
 Identities = 15/46 (32%), Positives = 32/46 (69%)

Query: 8   KEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADG 53
           +  EE+E ++EE+++E  E +E E E+E+++++VE    ++E  +G
Sbjct: 443 ESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEG 488


>gnl|CDD|223771 COG0699, COG0699, Predicted GTPases (dynamin-related) [General
           function prediction only].
          Length = 546

 Score = 27.6 bits (61), Expect = 1.8
 Identities = 10/43 (23%), Positives = 22/43 (51%)

Query: 2   KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEK 44
            +K KRK   +K+R K    +++ ++     EE  + + + E+
Sbjct: 490 LLKNKRKRLAQKQRLKRLYLEQLEDELLRTAEEILELRLLLEQ 532


>gnl|CDD|131369 TIGR02316, propion_prpE, propionate--CoA ligase.  This family
           contains one of three readily separable clades of
           proteins in the group of acetate and propionate--CoA
           ligases. Characterized members of this family act on
           propionate. From propionyl-CoA, there is a cyclic
           degradation pathway: it is ligated by PrpC to the TCA
           cycle intermediate oxaloacetate, acted upon further by
           PrpD and an aconitase, then cleaved by PrpB to pyruvate
           and the TCA cycle intermediate succinate.
          Length = 628

 Score = 27.6 bits (61), Expect = 1.9
 Identities = 10/26 (38%), Positives = 14/26 (53%)

Query: 59  LATKATIYYQGLVEWKPPAIYKSSCE 84
           LA  AT+ Y+GL     P ++ S  E
Sbjct: 298 LAGAATVLYEGLPTNPDPGVWWSIVE 323


>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
           Provisional.
          Length = 695

 Score = 27.6 bits (62), Expect = 1.9
 Identities = 11/54 (20%), Positives = 26/54 (48%), Gaps = 7/54 (12%)

Query: 4   KKKRKEKEEKERKKE------EQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENA 51
           K + +  E++++K E      E ++   E+++    E + KK  E +    ++A
Sbjct: 435 KAEIRAIEQEKKKAEEAKARFEARQARLEREK-AAREARHKKAAEARAAKDKDA 487



 Score = 26.4 bits (59), Expect = 4.3
 Identities = 8/48 (16%), Positives = 19/48 (39%)

Query: 4   KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENA 51
           ++ R E+E+  R+   +K       + ++       +V+ KK      
Sbjct: 458 RQARLEREKAAREARHKKAAEARAAKDKDAVAAALARVKAKKAAATQP 505


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 27.4 bits (60), Expect = 1.9
 Identities = 19/49 (38%), Positives = 33/49 (67%)

Query: 1    MKMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKE 49
            +K ++ +KE EE ++K EE KK+  EKK+    +++++KK EE +  KE
Sbjct: 1730 IKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKE 1778



 Score = 27.0 bits (59), Expect = 3.1
 Identities = 18/50 (36%), Positives = 25/50 (50%), Gaps = 2/50 (4%)

Query: 2    KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEE--EKKKKKVEEKKNNKE 49
            +  K   E    E +  E+K E  EKK+ E ++  +  KKK EEKK   E
Sbjct: 1346 EAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADE 1395



 Score = 27.0 bits (59), Expect = 3.4
 Identities = 18/51 (35%), Positives = 33/51 (64%)

Query: 2    KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENAD 52
            K K++ K+K +  +KK E+KK+  E K+  EE++KK  ++++    K+ AD
Sbjct: 1371 KKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKAD 1421



 Score = 26.6 bits (58), Expect = 4.0
 Identities = 20/49 (40%), Positives = 26/49 (53%), Gaps = 3/49 (6%)

Query: 6    KRKEKEEKERKKEEQKKEVGEKKEGEE---EEEKKKKKVEEKKNNKENA 51
            K+ E+E K +  EE KK   +KK+ EE    EE +KK  E  K   E A
Sbjct: 1653 KKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEA 1701



 Score = 26.6 bits (58), Expect = 4.1
 Identities = 21/53 (39%), Positives = 32/53 (60%), Gaps = 4/53 (7%)

Query: 2    KMKKKRKEKEEKERKKEEQKKEVGEKKEGEE----EEEKKKKKVEEKKNNKEN 50
            K ++ +K +E++++  E  KKE  E K+ EE    E E+KKK  E KK  +EN
Sbjct: 1676 KAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEEN 1728



 Score = 25.9 bits (56), Expect = 6.7
 Identities = 16/52 (30%), Positives = 29/52 (55%)

Query: 1    MKMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENAD 52
             K K +  +K ++ +KK E+ K+  E K+  EE +KK  + ++    K+ AD
Sbjct: 1462 AKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKAD 1513



 Score = 25.9 bits (56), Expect = 6.7
 Identities = 14/48 (29%), Positives = 21/48 (43%)

Query: 2    KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKE 49
            K     K+K E+ +K  E  K   E    E E  ++K +  EKK  + 
Sbjct: 1329 KKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEA 1376



 Score = 25.9 bits (56), Expect = 7.0
 Identities = 19/52 (36%), Positives = 28/52 (53%), Gaps = 4/52 (7%)

Query: 2    KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEE----EEKKKKKVEEKKNNKE 49
            + +   ++ E  E+KKEE KK+    K+  EE    +E KKK  E+KK   E
Sbjct: 1358 EAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADE 1409



 Score = 25.9 bits (56), Expect = 7.3
 Identities = 13/44 (29%), Positives = 27/44 (61%)

Query: 9    EKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENAD 52
            +  E+ +K EE KK+  E K+ EE+E+K  + ++++    + A+
Sbjct: 1662 KAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAE 1705



 Score = 25.5 bits (55), Expect = 8.5
 Identities = 17/52 (32%), Positives = 29/52 (55%), Gaps = 3/52 (5%)

Query: 1    MKMKKKRKEKEEKERKKEEQKKEVGEKKEGEEE---EEKKKKKVEEKKNNKE 49
            +K  ++ K+ EE ++K EE KK   ++K+  E    E ++ KK EE K  + 
Sbjct: 1661 IKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEA 1712


>gnl|CDD|233224 TIGR00993, 3a0901s04IAP86, chloroplast protein import component
           Toc86/159, G and M domains.  The long precursor of the
           86K protein originally described is proposed to have
           three domains. The N-terminal A-domain is acidic,
           repetitive, weakly conserved, readily removed by
           proteolysis during chloroplast isolation, and not
           required for protein translocation. The other domains
           are designated G (GTPase) and M (membrane anchor); this
           family includes most of the G domain and all of M
           [Transport and binding proteins, Amino acids, peptides
           and amines].
          Length = 763

 Score = 27.6 bits (61), Expect = 1.9
 Identities = 14/36 (38%), Positives = 23/36 (63%), Gaps = 2/36 (5%)

Query: 2   KMKKKRKEKEEKERKKEEQK--KEVGEKKEGEEEEE 35
           K+ +K++ +EE +R K  +K  KE+GE  +G  EE 
Sbjct: 441 KLLQKKQWREELKRMKMMKKFGKEIGELPDGYSEEV 476


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 27.4 bits (62), Expect = 2.0
 Identities = 13/51 (25%), Positives = 27/51 (52%)

Query: 1   MKMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENA 51
           +K +   + KEE  + + E +KE+ E++   ++ EK+  + EE  + K   
Sbjct: 54  IKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLEL 104



 Score = 27.0 bits (61), Expect = 2.1
 Identities = 17/50 (34%), Positives = 33/50 (66%), Gaps = 1/50 (2%)

Query: 2   KMKKKRKEKEEK-ERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKEN 50
           K++K+  +KEE  +RK E  +K   E ++ E+E E+K++++E+K+   E 
Sbjct: 86  KLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEE 135



 Score = 27.0 bits (61), Expect = 2.6
 Identities = 13/42 (30%), Positives = 27/42 (64%), Gaps = 2/42 (4%)

Query: 5   KKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKN 46
            ++ E  EK  ++ E+K++  E+K+  +E EKK++++EE   
Sbjct: 99  DRKLELLEKREEELEKKEKELEQKQ--QELEKKEEELEELIE 138



 Score = 26.7 bits (60), Expect = 3.5
 Identities = 13/42 (30%), Positives = 32/42 (76%), Gaps = 2/42 (4%)

Query: 2   KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEE 43
            ++K+ +E E+KE++ E++++E+ +K+  EE EE  +++++E
Sbjct: 104 LLEKREEELEKKEKELEQKQQELEKKE--EELEELIEEQLQE 143



 Score = 26.3 bits (59), Expect = 4.7
 Identities = 16/52 (30%), Positives = 26/52 (50%), Gaps = 5/52 (9%)

Query: 4  KKKRKEKEEKERKKEEQKKEVGEKK-----EGEEEEEKKKKKVEEKKNNKEN 50
           K ++ +EE +R  EE KKE    K     E +EE  K + + E++   + N
Sbjct: 31 AKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRN 82



 Score = 25.5 bits (57), Expect = 9.7
 Identities = 18/59 (30%), Positives = 32/59 (54%), Gaps = 12/59 (20%)

Query: 2   KMKKKRK--EKEEKERKKE--------EQKKEVGEKKEGEEEEEKKKKKVEEKKNNKEN 50
           ++ K R   EKE +ER+ E         QK+E  ++K   E  EK+++++E+K+   E 
Sbjct: 65  EIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRK--LELLEKREEELEKKEKELEQ 121


>gnl|CDD|221177 pfam11708, Slu7, Pre-mRNA splicing Prp18-interacting factor.  The
           spliceosome, an assembly of snRNAs (U1, U2, U4/U6, and
           U5) and proteins, catalyzes the excision of introns from
           pre-mRNAs in two successive trans-esterification
           reactions. Step 2 depends upon integral spliceosome
           constituents such as U5 snRNA and Prp8 and
           non-spliceosomal proteins Prp16, Slu7, Prp18, and Prp22.
           ATP hydrolysis by the DEAH-box enzyme Prp16 promotes a
           conformational change in the spliceosome that leads to
           protection of the 3'ss from targeted RNase H cleavage.
           This change, which probably reflects binding of the 3'ss
           PyAG in the catalytic centre of the spliceosome,
           requires the ordered recruitment of Slu7, Prp18, and
           Prp22 to the spliceosome. There is a close functional
           relationship between Prp8, Prp18, and Slu7, and Prp18
           interacts with Slu7, so that together they recruit Prp22
           to the spliceosome. Most members of the family carry a
           zinc-finger of the CCHC-type upstream of this domain.
          Length = 236

 Score = 27.0 bits (60), Expect = 2.1
 Identities = 15/40 (37%), Positives = 26/40 (65%)

Query: 3   MKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVE 42
              ++KEKE+KE+ K ++K+ + EK  GEE  +K  K++ 
Sbjct: 159 ELLRKKEKEKKEQLKIQKKQSLLEKYGGEEHLDKPPKELL 198


>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
           complex component).  RNases H are enzymes that
           specifically hydrolyse RNA when annealed to a
           complementary DNA and are present in all living
           organisms. In yeast RNase H2 is composed of a complex of
           three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
           family represents the homologues of Ydr279p. It is not
           known whether non yeast proteins in this family fulfil
           the same function.
          Length = 287

 Score = 27.3 bits (61), Expect = 2.1
 Identities = 18/47 (38%), Positives = 25/47 (53%), Gaps = 3/47 (6%)

Query: 2   KMKKKRKEKEEKERKKE---EQKKEVGEKKEGEEEEEKKKKKVEEKK 45
            + K  KE ++K+R+ E   E  +   EKK   +EE KKKK  E K 
Sbjct: 222 PLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKG 268



 Score = 25.4 bits (56), Expect = 9.9
 Identities = 13/37 (35%), Positives = 20/37 (54%)

Query: 8   KEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEK 44
           +  E +  KK + K+E+ +KK  E +  K  KKV  K
Sbjct: 242 EAAESRAEKKRKSKEEIKKKKPKESKGVKALKKVVAK 278


>gnl|CDD|236278 PRK08506, PRK08506, replicative DNA helicase; Provisional.
          Length = 472

 Score = 27.3 bits (61), Expect = 2.2
 Identities = 15/38 (39%), Positives = 22/38 (57%), Gaps = 8/38 (21%)

Query: 8   KEKEEKERKKEEQKKEVGEKKEGEEEEEK-KKKKVEEK 44
           KE+EEKE++K+        KKEG+EE     + K  E+
Sbjct: 391 KEREEKEKEKKA-------KKEGKEERRIHFQNKSIEE 421



 Score = 26.9 bits (60), Expect = 2.5
 Identities = 11/31 (35%), Positives = 21/31 (67%)

Query: 5   KKRKEKEEKERKKEEQKKEVGEKKEGEEEEE 35
           K+R+EKE++++ K+E K+E     + +  EE
Sbjct: 391 KEREEKEKEKKAKKEGKEERRIHFQNKSIEE 421



 Score = 26.1 bits (58), Expect = 5.5
 Identities = 11/33 (33%), Positives = 18/33 (54%)

Query: 1   MKMKKKRKEKEEKERKKEEQKKEVGEKKEGEEE 33
            + ++K KEK+ K+  KEE++     K   E E
Sbjct: 391 KEREEKEKEKKAKKEGKEERRIHFQNKSIEEAE 423


>gnl|CDD|223250 COG0172, SerS, Seryl-tRNA synthetase [Translation, ribosomal
          structure and biogenesis].
          Length = 429

 Score = 27.2 bits (61), Expect = 2.3
 Identities = 15/53 (28%), Positives = 29/53 (54%), Gaps = 1/53 (1%)

Query: 1  MKMKKKRKEKEEKERKKEEQKKEVGE-KKEGEEEEEKKKKKVEEKKNNKENAD 52
           + +K  +E EE + ++ E  KE+G   K GE++ E+   +V+E K   +  +
Sbjct: 36 EERRKLLRELEELQAERNELSKEIGRALKRGEDDAEELIAEVKELKEKLKELE 88



 Score = 25.6 bits (57), Expect = 6.6
 Identities = 7/39 (17%), Positives = 17/39 (43%)

Query: 2  KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKK 40
          ++ +  K  E+   +   + KE+ EK +  E    + + 
Sbjct: 58 EIGRALKRGEDDAEELIAEVKELKEKLKELEAALDELEA 96


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 27.1 bits (60), Expect = 2.3
 Identities = 12/46 (26%), Positives = 29/46 (63%)

Query: 6   KRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENA 51
           ++K+  E+ER K+ +K+ +  +++ ++ EE  K+   ++K  +E A
Sbjct: 93  QQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAA 138



 Score = 26.7 bits (59), Expect = 3.5
 Identities = 9/42 (21%), Positives = 21/42 (50%)

Query: 4   KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKK 45
           ++++K+ EE  ++   ++K+  E         K K + E K+
Sbjct: 114 QEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKR 155


>gnl|CDD|182896 PRK11007, PRK11007, PTS system trehalose(maltose)-specific
          transporter subunits IIBC; Provisional.
          Length = 473

 Score = 27.3 bits (61), Expect = 2.3
 Identities = 12/44 (27%), Positives = 21/44 (47%), Gaps = 4/44 (9%)

Query: 32 EEEEKKKKKVEE---KKNNKENADGNFEVTLATKATIYYQGLVE 72
          +      K++E+    K    NA G F+V + T+   YY+ L+ 
Sbjct: 39 DPANADPKEIEQLPMVKGCFTNA-GQFQVVIGTEVGDYYKALIA 81


>gnl|CDD|218747 pfam05786, Cnd2, Condensin complex subunit 2.  This family consists
           of several Barren protein homologues from several
           eukaryotic organisms. In Drosophila Barren (barr) is
           required for sister-chromatid segregation in mitosis.
           barr encodes a novel protein that is present in
           proliferating cells and has homologues in yeast and
           human. Mitotic defects in barr embryos become apparent
           during cycle 16, resulting in a loss of PNS and CNS
           neurons. Centromeres move apart at the
           metaphase-anaphase transition and Cyclin B is degraded,
           but sister chromatids remain connected, resulting in
           chromatin bridging. Barren protein localises to
           chromatin throughout mitosis. Colocalisation and
           biochemical experiments indicate that Barren associates
           with Topoisomerase II throughout mitosis and alters the
           activity of Topoisomerase II. It has been suggested that
           this association is required for proper chromosomal
           segregation by facilitating the decatenation of
           chromatids at anaphase. This family forms one of the
           three non-structural maintenance of chromosomes (SMC)
           subunits of the mitotic condensation complex along with
           Cnd1 and Cnd3.
          Length = 719

 Score = 27.0 bits (60), Expect = 2.3
 Identities = 15/53 (28%), Positives = 27/53 (50%)

Query: 4   KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGNFE 56
               KE  E  R+ ++ + +  E  +GEE+  KKK K + ++  +   + NFE
Sbjct: 151 DSDNKEAPETGREGDDGEDDDEEGSDGEEDGAKKKAKKKRQRKPESTLEKNFE 203


>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain.  This is a family of proteins of
          approximately 300 residues, found in plants and
          vertebrates. They contain a highly conserved DDRGK
          motif.
          Length = 189

 Score = 27.0 bits (60), Expect = 2.4
 Identities = 17/39 (43%), Positives = 30/39 (76%)

Query: 2  KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKK 40
          + +KK +EK E ERK+EE+ +E  EKK+ EEE ++++++
Sbjct: 29 EERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQ 67



 Score = 26.2 bits (58), Expect = 4.1
 Identities = 21/41 (51%), Positives = 31/41 (75%)

Query: 9  EKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKE 49
          E+E +ERKK E+K+E   K+E E EEE++KKK EE++  +E
Sbjct: 25 EEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKERE 65


>gnl|CDD|237147 PRK12586, PRK12586, putative monovalent cation/H+ antiporter
           subunit G; Reviewed.
          Length = 145

 Score = 26.7 bits (59), Expect = 2.4
 Identities = 5/44 (11%), Positives = 14/44 (31%), Gaps = 1/44 (2%)

Query: 2   KMKKKRKEKEEKERKKEEQKKE-VGEKKEGEEEEEKKKKKVEEK 44
            M +K              ++      +   ++ E+ +KK  + 
Sbjct: 101 YMYRKNDAHTHASILLSSNEQNSTEALQLRAKKREEHRKKWYQN 144


>gnl|CDD|129200 TIGR00092, TIGR00092, GTP-binding protein YchF.  This predicted
           GTP-binding protein is found in a single copy in every
           complete bacterial genome, and is found in Eukaryotes. A
           more distantly related protein, separated from this
           model, is found in the archaea. It is known to bind GTP
           and double-stranded nucleic acid. It is suggested to
           belong to a nucleoprotein complex and act as a
           translation factor [Unknown function, General].
          Length = 368

 Score = 27.0 bits (60), Expect = 2.4
 Identities = 13/71 (18%), Positives = 22/71 (30%), Gaps = 7/71 (9%)

Query: 9   EKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKK-------NNKENADGNFEVTLAT 61
              EK   + ++  E G+ K+ E    +    +            +KE       + L T
Sbjct: 139 FLVEKRIGRSKKSAEGGKDKKEELLLLEIILPLLNGGQMARHVDLSKEELILIKSLNLLT 198

Query: 62  KATIYYQGLVE 72
           K  I     V 
Sbjct: 199 KKPIILIANVS 209


>gnl|CDD|221283 pfam11875, DUF3395, Domain of unknown function (DUF3395).  This
          domain is functionally uncharacterized. This domain is
          found in eukaryotes. This presumed domain is typically
          between 147 to 176 amino acids in length. This domain
          is found associated with pfam00226.
          Length = 144

 Score = 26.5 bits (59), Expect = 2.5
 Identities = 15/47 (31%), Positives = 27/47 (57%)

Query: 4  KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKEN 50
            K ++K+E+ER++E+ K E+ +KK+  E        V E+K  +E 
Sbjct: 2  FNKAQKKKEQERQREKNKTEIAKKKQEAESAVSLMGDVVERKLTREE 48


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 27.3 bits (61), Expect = 2.5
 Identities = 15/66 (22%), Positives = 28/66 (42%)

Query: 1    MKMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGNFEVTLA 60
             ++K K K K  K RK + +KKE  +KK   ++ +K       K+ + +      +    
Sbjct: 1156 QRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDN 1215

Query: 61   TKATIY 66
             K+   
Sbjct: 1216 KKSNSS 1221


>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
          Length = 211

 Score = 26.6 bits (59), Expect = 2.6
 Identities = 11/56 (19%), Positives = 31/56 (55%)

Query: 1  MKMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGNFE 56
           K  K    +E+  ++ E ++++ G++++ E EE +K++ +E+ + + E      +
Sbjct: 5  CKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELK 60


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
          RNase Y, an endoribonuclease. The member from Bacillus
          subtilis, YmdA, has been shown to be involved in
          turnover of yitJ riboswitch [Transcription, Degradation
          of RNA].
          Length = 514

 Score = 26.8 bits (60), Expect = 2.6
 Identities = 12/50 (24%), Positives = 24/50 (48%), Gaps = 3/50 (6%)

Query: 4  KKKRKE---KEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKEN 50
           KK      KEE  + + E ++E+ E++   +  E++  + EE  + K  
Sbjct: 48 LKKEALLEAKEEVHKLRAELERELKERRNELQRLERRLLQREETLDRKME 97



 Score = 26.4 bits (59), Expect = 3.9
 Identities = 12/42 (28%), Positives = 26/42 (61%)

Query: 3   MKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEK 44
           + KK +  E+KE++   ++K + EK+E  EE   ++++  E+
Sbjct: 99  LDKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQREELER 140


>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
           binding fragment.  Topoisomerase I promotes the
           relaxation of DNA superhelical tension by introducing a
           transient single-stranded break in duplex DNA and are
           vital for the processes of replication, transcription,
           and recombination. This family may be more than one
           structural domain.
          Length = 215

 Score = 26.8 bits (60), Expect = 2.7
 Identities = 11/28 (39%), Positives = 18/28 (64%)

Query: 7   RKEKEEKERKKEEQKKEVGEKKEGEEEE 34
             EKE+K+   +E+KK + E+K+  EE 
Sbjct: 96  EAEKEKKKAMSKEEKKAIKEEKDKLEEP 123


>gnl|CDD|221756 pfam12757, DUF3812, Protein of unknown function (DUF3812).  This is
           a family of fungal proteins whose function is not known.
          Length = 126

 Score = 26.5 bits (59), Expect = 2.7
 Identities = 13/34 (38%), Positives = 20/34 (58%), Gaps = 2/34 (5%)

Query: 5   KKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKK 38
            +R   EEK+  +EE K++   ++  E E EKKK
Sbjct: 95  AQRARDEEKKLDEEEAKRQ--HEEAKEREREKKK 126


>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein. 
          Length = 198

 Score = 26.6 bits (59), Expect = 2.8
 Identities = 19/53 (35%), Positives = 25/53 (47%), Gaps = 6/53 (11%)

Query: 2   KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGN 54
           K KKK KEK  +E KK E         +  EE E      EE  +++E+A  N
Sbjct: 67  KSKKKDKEKLTEEEKKPE------SDDDKTEENENDPDNNEESGDSQESASAN 113


>gnl|CDD|177954 PLN02320, PLN02320, seryl-tRNA synthetase.
          Length = 502

 Score = 26.8 bits (59), Expect = 2.8
 Identities = 15/40 (37%), Positives = 24/40 (60%), Gaps = 1/40 (2%)

Query: 10  KEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKE 49
           ++E ER + E +  V  K +G+ E  +++  VEE KN KE
Sbjct: 106 QKEVERLRAE-RNAVANKMKGKLEPSERQALVEEGKNLKE 144


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 26.9 bits (60), Expect = 2.9
 Identities = 11/48 (22%), Positives = 24/48 (50%)

Query: 4   KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENA 51
           + K  E  E+  + EE+ +E+ E+ E  E E ++ +   E+  ++   
Sbjct: 329 ESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEE 376


>gnl|CDD|215590 PLN03123, PLN03123, poly [ADP-ribose] polymerase; Provisional.
          Length = 981

 Score = 27.1 bits (60), Expect = 2.9
 Identities = 12/42 (28%), Positives = 21/42 (50%)

Query: 11  EEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENAD 52
           E+ E +K+E KK    KK+   +++ KK K +   +    A 
Sbjct: 193 EKAEERKQESKKGAKRKKDASGDDKSKKAKTDRDVSTSTAAS 234


>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein.  Rotavirus particles
          consist of three concentric proteinaceous capsid
          layers. The innermost capsid (core) is made of VP2. The
          genomic RNA and the two minor proteins VP1 and VP3 are
          encapsidated within this layer. The N-terminus of
          rotavirus VP2 is necessary for the encapsidation of VP1
          and VP3.
          Length = 887

 Score = 26.8 bits (59), Expect = 3.3
 Identities = 7/38 (18%), Positives = 22/38 (57%)

Query: 13 KERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKEN 50
            R+      +  ++K+ E++++K + +++EK  +K+ 
Sbjct: 5  NRREANINNNDRMQEKDDEKQDQKNRMELKEKVLDKKE 42


>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2.  It is vital for
           effective cell-replication that replication is not
           stalled at any point by, for instance, damaged bases.
           Rtf2 stabilizes the replication fork stalled at the
           site-specific replication barrier RTS1 by preventing
           replication restart until completion of DNA synthesis by
           a converging replication fork initiated at a flanking
           origin. The RTS1 element terminates replication forks
           that are moving in the cen2-distal direction while
           allowing forks moving in the cen2-proximal direction to
           pass through the region. Rtf2 contains a C2HC2 motif
           related to the C3HC4 RING-finger motif, and would appear
           to fold up, creating a RING finger-like structure but
           forming only one functional Zn2+ ion-binding site.
          Length = 254

 Score = 26.5 bits (59), Expect = 3.6
 Identities = 11/51 (21%), Positives = 23/51 (45%)

Query: 3   MKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADG 53
           +K + +E+  K++KK+++KK       G   E      V  + ++     G
Sbjct: 172 LKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVG 222


>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
           homolog [Translation, ribosomal structure and
           biogenesis].
          Length = 395

 Score = 26.6 bits (59), Expect = 3.7
 Identities = 11/42 (26%), Positives = 21/42 (50%)

Query: 5   KKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKN 46
           K++  K   + K E  KKE   +   +++E+K K +    +N
Sbjct: 354 KEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQN 395


>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
          Length = 1437

 Score = 26.7 bits (60), Expect = 3.7
 Identities = 10/48 (20%), Positives = 24/48 (50%)

Query: 4   KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENA 51
            K+  EK E ++++E++K      +  ++ E +KKK+ +     +   
Sbjct: 168 SKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPV 215


>gnl|CDD|217127 pfam02590, SPOUT_MTase, Predicted SPOUT methyltransferase.  This
          family of proteins are predicted to be SPOUT
          methyltransferases.
          Length = 154

 Score = 26.1 bits (58), Expect = 3.8
 Identities = 14/68 (20%), Positives = 28/68 (41%), Gaps = 10/68 (14%)

Query: 6  KRKEKEEKERKKEEQKK----------EVGEKKEGEEEEEKKKKKVEEKKNNKENADGNF 55
          K KEK  K+   E  K+          E+ ++K G   + +  K+ E ++       G++
Sbjct: 9  KLKEKYVKDGIAEYLKRLSRYCKLELIELPDEKRGSAADIEAIKEKEGERILAAIPPGSY 68

Query: 56 EVTLATKA 63
           + L  + 
Sbjct: 69 VIALDERG 76


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
           family consists of several hypothetical bacterial
           proteins of around 200 residues in length. The function
           of this family is unknown.
          Length = 214

 Score = 26.2 bits (58), Expect = 3.8
 Identities = 13/55 (23%), Positives = 30/55 (54%)

Query: 4   KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGNFEVT 58
           K   +E  E E  KEE+K+    + + ++ + +K+ +  E++N +E+ + + E  
Sbjct: 53  KSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENE 107



 Score = 25.5 bits (56), Expect = 8.3
 Identities = 13/43 (30%), Positives = 27/43 (62%), Gaps = 2/43 (4%)

Query: 4  KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKN 46
          +++ K+ +++E  + E+ KE  E+KE    E+K+ K   EK++
Sbjct: 48 EQEAKKSDDQETAEIEEVKE--EEKEAANSEDKEDKGDAEKED 88


>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
          Length = 330

 Score = 26.4 bits (59), Expect = 3.8
 Identities = 10/34 (29%), Positives = 19/34 (55%)

Query: 10  KEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEE 43
            EE +     Q +    ++E EEEEE+++++  E
Sbjct: 285 DEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSE 318


>gnl|CDD|234101 TIGR03074, PQQ_membr_DH, membrane-bound PQQ-dependent
           dehydrogenase, glucose/quinate/shikimate family.  This
           protein family has a phylogenetic distribution very
           similar to that coenzyme PQQ biosynthesis enzymes, as
           shown by partial phylogenetic profiling. Members of this
           family have several predicted transmembrane helices in
           the N-terminal region, and include the quinoprotein
           glucose dehydrogenase (EC 1.1.5.2) of Escherichia coli
           and the quinate/shikimate dehydrogenase of Acinetobacter
           sp. ADP1 (EC 1.1.99.25). Sequences closely related
           except for the absense of the N-terminal hydrophobic
           region, scoring in the gray zone between the trusted and
           noise cutoffs, include PQQ-dependent glycerol (EC
           1.1.99.22) and and other polyol (sugar alcohol)
           dehydrogenases.
          Length = 764

 Score = 26.5 bits (59), Expect = 3.9
 Identities = 11/28 (39%), Positives = 16/28 (57%)

Query: 91  PFDEQTCVLKFGSWTYDGFKVPGSVNGA 118
           PFD+  C ++F S  Y+G   P S  G+
Sbjct: 526 PFDQLACRIQFKSLRYEGLYTPPSEQGS 553


>gnl|CDD|235316 PRK04863, mukB, cell division protein MukB; Provisional.
          Length = 1486

 Score = 26.5 bits (59), Expect = 3.9
 Identities = 15/42 (35%), Positives = 25/42 (59%)

Query: 2   KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEE 43
           K+++ + + EE E + EEQ + V E  E +EE E + +  EE
Sbjct: 349 KIERYQADLEELEERLEEQNEVVEEADEQQEENEARAEAAEE 390


>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin.
          Trichoplein or mitostatin, was first defined as a
          meiosis-specific nuclear structural protein. It has
          since been linked with mitochondrial movement. It is
          associated with the mitochondrial outer membrane, and
          over-expression leads to reduction in mitochondrial
          motility whereas lack of it enhances mitochondrial
          movement. The activity appears to be mediated through
          binding the mitochondria to the actin intermediate
          filaments (IFs).
          Length = 349

 Score = 26.4 bits (59), Expect = 4.2
 Identities = 14/49 (28%), Positives = 31/49 (63%)

Query: 1  MKMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKE 49
          ++ KK+ K +E++E ++ ++  E    K   EEEE+++K+ EE++  + 
Sbjct: 26 IEEKKRIKAEEKEEERRIDEMMEEERLKALAEEEERERKRKEERREGRA 74


>gnl|CDD|237028 PRK12267, PRK12267, methionyl-tRNA synthetase; Reviewed.
          Length = 648

 Score = 26.3 bits (59), Expect = 4.2
 Identities = 9/36 (25%), Positives = 20/36 (55%)

Query: 7   RKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVE 42
           R + EE+    +EQ +    K+  E+E++ +K ++ 
Sbjct: 509 RIDVEEEIAYIKEQMEGSAPKEPEEKEKKPEKPEIT 544


>gnl|CDD|237875 PRK14974, PRK14974, cell division protein FtsY; Provisional.
          Length = 336

 Score = 26.5 bits (59), Expect = 4.2
 Identities = 16/40 (40%), Positives = 28/40 (70%)

Query: 2  KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKV 41
          K+ K  ++ EEK  ++EE++    E++E EE+EE+KK+K 
Sbjct: 8  KLSKFVEKVEEKIEEEEEEEAPEAEEEEEEEDEEEKKEKP 47


>gnl|CDD|217020 pfam02403, Seryl_tRNA_N, Seryl-tRNA synthetase N-terminal domain.
           This domain is found associated with the Pfam tRNA
           synthetase class II domain (pfam00587) and represents
           the N-terminal domain of seryl-tRNA synthetase.
          Length = 108

 Score = 25.6 bits (57), Expect = 4.2
 Identities = 11/43 (25%), Positives = 23/43 (53%)

Query: 2   KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEK 44
           ++ K +K+KE+ E    E K+   E K  E E  + + ++++ 
Sbjct: 58  EIGKAKKKKEDAEALIAEVKELKDELKALEAELRELEAELDKL 100


>gnl|CDD|200581 cd10958, CE4_NodB_like_2, Catalytic NodB homology domain of
           uncharacterized chitin deacetylases and hypothetical
           proteins.  This family includes some uncharacterized
           chitin deacetylases and hypothetical proteins, mainly
           from eukaryotes. Although their biological function is
           unknown, members in this family show high sequence
           homology to the catalytic NodB homology domain of
           Colletotrichum lindemuthianum chitin deacetylase
           (endo-chitin de-N-acetylase, ClCDA, EC 3.5.1.41), which
           catalyzes the hydrolysis of N-acetamido groups of
           N-acetyl-D-glucosamine residues in chitin, converting it
           to chitosan in fungal cell walls. Like ClCDA, this
           family is a member the carbohydrate esterase 4 (CE4)
           superfamily.
          Length = 190

 Score = 26.1 bits (58), Expect = 4.4
 Identities = 9/26 (34%), Positives = 12/26 (46%)

Query: 90  FPFDEQTCVLKFGSWTYDGFKVPGSV 115
           +PFD Q     F S+       PGS+
Sbjct: 128 YPFDPQIPSPWFNSFFLRRRVSPGSI 153


>gnl|CDD|214697 smart00498, FH2, Formin Homology 2 Domain.  FH proteins control
           rearrangements of the actin cytoskeleton, especially in
           the context of cytokinesis and cell polarisation.
           Members of this family have been found to interact with
           Rho-GTPases, profilin and other actin-assoziated
           proteins. These interactions are mediated by the
           proline-rich FH1 domain, usually located in front of FH2
           (but not listed in SMART). Despite this cytosolic
           function, vertebrate formins have been assigned
           functions within the nucleus. A set of Formin-Binding
           Proteins (FBPs) has been shown to bind FH1 with their WW
           domain.
          Length = 392

 Score = 26.2 bits (58), Expect = 4.5
 Identities = 13/42 (30%), Positives = 24/42 (57%), Gaps = 2/42 (4%)

Query: 1   MKMKKKRKEKEEKERKKEEQ--KKEVGEKKEGEEEEEKKKKK 40
           +K   K  E+  K+ ++EE+  KK V E  E E+   ++K++
Sbjct: 327 LKEFSKAAEENIKKEEEEEERRKKLVKETTEYEQSSSRQKER 368


>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992).  This
           bacterial family of proteins has no known function.
           However, the cis-regulatory yjdF motif, just upstream
           from the gene encoding the proteins for this family, is
           a small non-coding RNA, Rfam:RF01764. The yjdF motif is
           found in many Firmicutes, including Bacillus subtilis.
           In most cases, it resides in potential 5' UTRs of
           homologues of the yjdF gene whose function is unknown.
           However, in Streptococcus thermophilus, a yjdF RNA motif
           is associated with an operon whose protein products
           synthesise nicotinamide adenine dinucleotide (NAD+).
           Also, the S. thermophilus yjdF RNA lacks typical yjdF
           motif consensus features downstream of and including the
           P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
           S. thermophilus RNAs might sense a distinct compound
           that structurally resembles the ligand bound by other
           yjdF RNAs. On the ohter hand, perhaps these RNAs have an
           alternative solution forming a similar binding site, as
           is observed with some SAM riboswitches.
          Length = 132

 Score = 25.7 bits (57), Expect = 4.7
 Identities = 19/36 (52%), Positives = 26/36 (72%), Gaps = 1/36 (2%)

Query: 10  KEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKK 45
           K E ER K+E+KK   EKKE EE+E K++ K ++KK
Sbjct: 92  KLEHERNKQEKKKRSKEKKE-EEKERKRQLKQQKKK 126


>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
          Length = 470

 Score = 26.2 bits (58), Expect = 4.8
 Identities = 17/57 (29%), Positives = 30/57 (52%), Gaps = 6/57 (10%)

Query: 4  KKKRKEKEEKERKKEEQK------KEVGEKKEGEEEEEKKKKKVEEKKNNKENADGN 54
          K+K  +K+ K+ KK+         + + E +E E ++  KKKK ++KK  K+N    
Sbjct: 18 KQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKKKKKNLGEA 74


>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase;
          Provisional.
          Length = 330

 Score = 25.9 bits (58), Expect = 4.8
 Identities = 8/27 (29%), Positives = 15/27 (55%)

Query: 2  KMKKKRKEKEEKERKKEEQKKEVGEKK 28
          K + K   K +K+++K+E K    + K
Sbjct: 63 KKELKAWHKAQKKKEKQEAKAAKAKSK 89



 Score = 25.9 bits (58), Expect = 6.0
 Identities = 7/30 (23%), Positives = 16/30 (53%)

Query: 2  KMKKKRKEKEEKERKKEEQKKEVGEKKEGE 31
           + KK  +   K +KK+E+++    K + +
Sbjct: 60 LLDKKELKAWHKAQKKKEKQEAKAAKAKSK 89



 Score = 25.2 bits (56), Expect = 10.0
 Identities = 8/29 (27%), Positives = 15/29 (51%)

Query: 17 KEEQKKEVGEKKEGEEEEEKKKKKVEEKK 45
          K+E K     +K+ E++E K  K   + +
Sbjct: 63 KKELKAWHKAQKKKEKQEAKAAKAKSKPR 91


>gnl|CDD|216807 pfam01956, DUF106, Integral membrane protein DUF106.  This
          archaebacterial protein family has no known function.
          Members are predicted to be integral membrane proteins.
          Length = 168

 Score = 25.7 bits (57), Expect = 4.9
 Identities = 15/45 (33%), Positives = 28/45 (62%), Gaps = 2/45 (4%)

Query: 7  RKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEE--KKNNKE 49
          RK ++ ++R+KE QK+    +K G++   KK +K +E   ++ KE
Sbjct: 40 RKMEKYQKREKEIQKRARELRKNGDKLSPKKFEKRQEELMEDQKE 84


>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987).  A
           family of uncharacterized proteins found by clustering
           human gut metagenomic sequences.
          Length = 379

 Score = 26.1 bits (58), Expect = 4.9
 Identities = 10/47 (21%), Positives = 26/47 (55%)

Query: 2   KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNK 48
           ++K+   EKE  E +K+  +K+  +  +  ++EE   +++ E +  +
Sbjct: 77  ELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEELLELEAEE 123


>gnl|CDD|233223 TIGR00990, 3a0801s09, mitochondrial precursor proteins import
           receptor (72 kDa mitochondrial outermembrane protein)
           (mitochondrial import receptor for the ADP/ATP carrier)
           (translocase of outermembrane tom70).  [Transport and
           binding proteins, Amino acids, peptides and amines].
          Length = 615

 Score = 26.1 bits (57), Expect = 4.9
 Identities = 20/60 (33%), Positives = 32/60 (53%), Gaps = 3/60 (5%)

Query: 7   RKEKEEKERKKEEQKKEVGEKKEGEEEEEKK---KKKVEEKKNNKENADGNFEVTLATKA 63
           RK ++++E K +  KKE  ++K+ E+E E K   KK    K    E AD   E+  ++ A
Sbjct: 57  RKGQQQRESKPKISKKERRKRKQAEKETEGKTEEKKSTAPKNAPVEPADELPEIDESSVA 116


>gnl|CDD|217360 pfam03087, DUF241, Arabidopsis protein of unknown function.  This
           family represents a number of Arabidopsis proteins.
           Their functions are unknown.
          Length = 230

 Score = 25.8 bits (57), Expect = 5.0
 Identities = 14/49 (28%), Positives = 22/49 (44%), Gaps = 2/49 (4%)

Query: 1   MKMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKE 49
            K KK   +     R   E  +   EK+   EE +   K++EE +N+ E
Sbjct: 161 KKSKKGGSQSSLSIR--SELSRLDEEKRRDNEEVKDILKRLEELENSIE 207


>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR).  This
           family consists of several bovine specific leukaemia
           virus receptors which are thought to function as
           transmembrane proteins, although their exact function is
           unknown.
          Length = 561

 Score = 26.2 bits (57), Expect = 5.0
 Identities = 11/26 (42%), Positives = 20/26 (76%)

Query: 4   KKKRKEKEEKERKKEEQKKEVGEKKE 29
           K K+KEK+EKE+++++ KK+  E  +
Sbjct: 201 KPKKKEKKEKEKERDKDKKKEVEGFK 226



 Score = 25.8 bits (56), Expect = 7.8
 Identities = 16/42 (38%), Positives = 22/42 (52%)

Query: 2   KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEE 43
           K KK+RKEKEEK++KK+           G E+  +     EE
Sbjct: 281 KKKKQRKEKEEKKKKKKHHHHRCHHSDGGAEQPVQNGAVEEE 322


>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
          Length = 135

 Score = 25.8 bits (57), Expect = 5.2
 Identities = 16/40 (40%), Positives = 28/40 (70%), Gaps = 2/40 (5%)

Query: 17  KEEQK--KEVGEKKEGEEEEEKKKKKVEEKKNNKENADGN 54
            EEQK  K+V ++K  +++++KKKKK ++KK +K+ A   
Sbjct: 95  VEEQKIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKK 134


>gnl|CDD|179798 PRK04239, PRK04239, hypothetical protein; Provisional.
          Length = 110

 Score = 25.6 bits (57), Expect = 5.2
 Identities = 12/35 (34%), Positives = 24/35 (68%)

Query: 11 EEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKK 45
          E + RK EE +K+  E+++ +EE+E+ + + E +K
Sbjct: 5  EIRRRKLEELQKQAQEQQQAQEEQEEAQAQAEAQK 39


>gnl|CDD|217787 pfam03910, Adeno_PV, Adenovirus minor core protein PV. 
          Length = 336

 Score = 25.9 bits (57), Expect = 5.2
 Identities = 12/26 (46%), Positives = 17/26 (65%)

Query: 4  KKKRKEKEEKERKKEEQKKEVGEKKE 29
           K RK K  K+RKK E+K E+ ++ E
Sbjct: 28 AKPRKIKRVKKRKKREEKDELDDEVE 53



 Score = 25.6 bits (56), Expect = 8.0
 Identities = 16/58 (27%), Positives = 21/58 (36%), Gaps = 18/58 (31%)

Query: 6  KRKEKEE-----------KERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENAD 52
          KRK KEE               K+E K           +  KK+KK EEK    +  +
Sbjct: 3  KRKIKEEMLQILAPEIYGPRPVKDEAKPRK-------IKRVKKRKKREEKDELDDEVE 53


>gnl|CDD|219947 pfam08639, SLD3, DNA replication regulator SLD3.  The SLD3 DNA
           replication regulator is required for loading and
           maintenance of Cdc45 on chromatin during DNA
           replication.
          Length = 437

 Score = 25.9 bits (57), Expect = 5.2
 Identities = 11/41 (26%), Positives = 18/41 (43%)

Query: 6   KRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKN 46
            R  K EKE KK+  K+     +    +  K+KK  ++   
Sbjct: 121 DRFTKFEKEYKKKLLKRSQNLDRSKRRKRRKRKKNKKQDLP 161


>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT.  This family
           consists of several bacterial cobalamin biosynthesis
           (CobT) proteins. CobT is involved in the transformation
           of precorrin-3 into cobyrinic acid.
          Length = 282

 Score = 25.9 bits (57), Expect = 5.4
 Identities = 8/34 (23%), Positives = 19/34 (55%)

Query: 2   KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEE 35
           ++  + +  + ++ + E+  KE  +  +GEEEE 
Sbjct: 211 ELGDEPESADSEDNEDEDDPKEDEDDDQGEEEES 244


>gnl|CDD|241262 cd01228, PH_BCR-related, Breakpoint Cluster Region-related
           pleckstrin homology (PH) domain.  The BCR gene is one of
           the two genes in the BCR-ABL complex, which is
           associated with the Philadelphia chromosome, a product
           of a reciprocal translocation between chromosomes 22 and
           9. BCR is a GTPase-activating protein (GAP) for RAC1
           (primarily) and CDC42. The Dbl region of BCR has the
           most RhoGEF activity for Cdc42, and less activity
           towards Rac and Rho. Since BCR possesses both GAP and
           GEF activities, it may function to temporally regulate
           the activity of these GTPases. It also displays
           serine/threonine kinase activity. The BCR protein
           contains multiple domains including an N-terminal kinase
           domain, a RhoGEF domain, a PH domain, a C1 domain, a C2
           domain, and a C-terminal RhoGAP domain. ABR, a related
           smaller protein, is structurally similar to BCR, but
           lacks the N-terminal kinase domain and has GAP activity
           for both Rac and Cdc42. PH domains have diverse
           functions, but in general are involved in targeting
           proteins to the appropriate cellular location or in the
           interaction with a binding partner. They share little
           sequence conservation, but all have a common fold, which
           is electrostatically polarized. Less than 10% of PH
           domains bind phosphoinositide phosphates (PIPs) with
           high affinity and specificity. PH domains are
           distinguished from other PIP-binding domains by their
           specific high-affinity binding to PIPs with two vicinal
           phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
           PtdIns(3,4,5)P3 which results in targeting some PH
           domain proteins to the plasma membrane. A few display
           strong specificity in lipid binding. Any specificity is
           usually determined by loop regions or insertions in the
           N-terminus of the domain, which are not conserved across
           all PH domains. PH domains are found in cellular
           signaling proteins such as serine/threonine kinase,
           tyrosine kinases, regulators of G-proteins, endocytotic
           GTPases, adaptors, as well as cytoskeletal associated
           molecules and in lipid associated enzymes.
          Length = 179

 Score = 25.8 bits (57), Expect = 5.6
 Identities = 9/42 (21%), Positives = 18/42 (42%)

Query: 2   KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEE 43
            ++ K+     K++ K +  +       G    EK +KK+ E
Sbjct: 91  LVQMKKAISALKQKIKRDSAERKKAASSGSRAIEKLRKKLAE 132


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
           consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 26.1 bits (57), Expect = 5.6
 Identities = 11/51 (21%), Positives = 24/51 (47%)

Query: 1   MKMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENA 51
            +  K   +  +    KE+++    +K+E E+ + + KK  EE    K++ 
Sbjct: 254 QQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEEALKAKDHK 304



 Score = 25.7 bits (56), Expect = 6.4
 Identities = 12/50 (24%), Positives = 25/50 (50%), Gaps = 2/50 (4%)

Query: 5   KKRKEKEEK--ERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENAD 52
           K+R E  +K  E K   +  +    KE ++  E +K+++E+ +   +  D
Sbjct: 245 KQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKND 294


>gnl|CDD|202558 pfam03159, XRN_N, XRN 5'-3' exonuclease N-terminus.  This family
           aligns residues towards the N-terminus of several
           proteins with multiple functions. The members of this
           family all appear to possess 5'-3' exonuclease activity
           EC:3.1.11.-. Thus, the aligned region may be necessary
           for 5' to 3' exonuclease function. The family also
           contains several Xrn1 and Xrn2 proteins. The 5'-3'
           exoribonucleases Xrn1p and Xrn2p/Rat1p function in the
           degradation and processing of several classes of RNA in
           Saccharomyces cerevisiae. Xrn1p is the main enzyme
           catalyzing cytoplasmic mRNA degradation in multiple
           decay pathways, whereas Xrn2p/Rat1p functions in the
           processing of rRNAs and small nucleolar RNAs (snoRNAs)
           in the nucleus.
          Length = 237

 Score = 25.8 bits (57), Expect = 6.0
 Identities = 13/45 (28%), Positives = 25/45 (55%), Gaps = 7/45 (15%)

Query: 2   KMKKKR-------KEKEEKERKKEEQKKEVGEKKEGEEEEEKKKK 39
           KM ++R       K+ +EKE + EE ++E+  +     E+ +K+K
Sbjct: 92  KMNQQRSRRFRAAKDAKEKEAEAEENREELETEGIKLPEKVEKEK 136


>gnl|CDD|233830 TIGR02350, prok_dnaK, chaperone protein DnaK.  Members of this
           family are the chaperone DnaK, of the DnaK-DnaJ-GrpE
           chaperone system. All members of the seed alignment were
           taken from completely sequenced bacterial or archaeal
           genomes and (except for Mycoplasma sequence) found
           clustered with other genes of this systems. This model
           excludes DnaK homologs that are not DnaK itself, such as
           the heat shock cognate protein HscA (TIGR01991).
           However, it is not designed to distinguish among DnaK
           paralogs in eukaryotes. Note that a number of dnaK genes
           have shadow ORFs in the same reverse (relative to dnaK)
           reading frame, a few of which have been assigned
           glutamate dehydrogenase activity. The significance of
           this observation is unclear; lengths of such shadow ORFs
           are highly variable as if the presumptive protein
           product is not conserved [Protein fate, Protein folding
           and stabilization].
          Length = 595

 Score = 26.1 bits (58), Expect = 6.1
 Identities = 14/44 (31%), Positives = 27/44 (61%), Gaps = 1/44 (2%)

Query: 17  KEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGNFEVTLA 60
           +EE ++ V E +   EE++K+K+++ E +NN ++     E TL 
Sbjct: 500 EEEIERMVKEAEANAEEDKKRKEEI-EARNNADSLAYQAEKTLK 542


>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
           fragment found in eukaryotic DNA topoisomerase (topo) IB
           proteins similar to the monomeric yeast and human topo I
           and heterodimeric topo I from Leishmania donvanni. Topo
           I enzymes are divided into:  topo type IA (bacterial)
           and type IB (eukaryotic). Topo I relaxes superhelical
           tension in duplex DNA by creating a single-strand nick,
           the broken strand can then rotate around the unbroken
           strand to remove DNA supercoils and, the nick is
           religated, liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit re-ligation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  In addition to differences in
           structure and some biochemical properties,
           Trypanosomatid parasite topo I differ from human topo I
           in their sensitivity to CPTs and other classical topo I
           inhibitors. Trypanosomatid topos I play putative roles
           in organizing the kinetoplast DNA network unique to
           these parasites.  This family may represent more than
           one structural domain.
          Length = 215

 Score = 25.7 bits (57), Expect = 6.3
 Identities = 12/23 (52%), Positives = 15/23 (65%)

Query: 1   MKMKKKRKEKEEKERKKEEQKKE 23
            K KKK   KEEK+  KEE++K 
Sbjct: 97  EKEKKKAMSKEEKKAIKEEKEKL 119


>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
           This family represents Cwf15/Cwc15 (from
           Schizosaccharomyces pombe and Saccharomyces cerevisiae
           respectively) and their homologues. The function of
           these proteins is unknown, but they form part of the
           spliceosome and are thus thought to be involved in mRNA
           splicing.
          Length = 241

 Score = 25.8 bits (57), Expect = 6.4
 Identities = 11/28 (39%), Positives = 18/28 (64%)

Query: 5   KKRKEKEEKERKKEEQKKEVGEKKEGEE 32
           K +KE+ E++ ++EE+K    EK   EE
Sbjct: 153 KIKKERAEEKEREEEEKAAEEEKAREEE 180


>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
          Length = 213

 Score = 25.7 bits (57), Expect = 6.4
 Identities = 8/39 (20%), Positives = 21/39 (53%)

Query: 10  KEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNK 48
           +E K + + ++ ++  +K+E   E+EK  ++  + K   
Sbjct: 104 EEAKAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKA 142


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
           found at the N terminus of SMC proteins. The SMC
           (structural maintenance of chromosomes) superfamily
           proteins have ATP-binding domains at the N- and
           C-termini, and two extended coiled-coil domains
           separated by a hinge in the middle. The eukaryotic SMC
           proteins form two kind of heterodimers: the SMC1/SMC3
           and the SMC2/SMC4 types. These heterodimers constitute
           an essential part of higher order complexes, which are
           involved in chromatin and DNA dynamics. This family also
           includes the RecF and RecN proteins that are involved in
           DNA metabolism and recombination.
          Length = 1162

 Score = 26.1 bits (57), Expect = 6.6
 Identities = 16/71 (22%), Positives = 30/71 (42%)

Query: 2   KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGNFEVTLAT 61
           + K+K+KE+ +K  ++ E   E+    E  + +E K K+  +K           E+    
Sbjct: 167 REKRKKKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEEN 226

Query: 62  KATIYYQGLVE 72
              + Y  L E
Sbjct: 227 LLYLDYLKLNE 237


>gnl|CDD|237496 PRK13766, PRK13766, Hef nuclease; Provisional.
          Length = 773

 Score = 26.0 bits (58), Expect = 6.8
 Identities = 19/84 (22%), Positives = 34/84 (40%), Gaps = 3/84 (3%)

Query: 6   KRKEKEEKE---RKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGNFEVTLATK 62
           +RKEK+ KE     K    K++ E  E ++ EE++K +     +  ++     E     +
Sbjct: 490 RRKEKKMKEELKNLKGILNKKLQELDEEQKGEEEEKDEQLSLDDFVKSKGKEEEEEEEKE 549

Query: 63  ATIYYQGLVEWKPPAIYKSSCEID 86
                    E + P I   S E+ 
Sbjct: 550 EKDKETEEDEPEGPKIIVDSRELR 573


>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
           and chromosome partitioning].
          Length = 420

 Score = 25.8 bits (57), Expect = 7.0
 Identities = 7/43 (16%), Positives = 18/43 (41%), Gaps = 3/43 (6%)

Query: 7   RKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKE 49
              + E   ++ E    + E++    ++ K  + +EE+K    
Sbjct: 174 AAVRAEIAAEQAELTTLLSEQRA---QQAKLAQLLEERKKTLA 213



 Score = 25.8 bits (57), Expect = 7.2
 Identities = 15/55 (27%), Positives = 29/55 (52%), Gaps = 1/55 (1%)

Query: 2   KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGNFE 56
            ++KK +E++++  K E+Q K + E +    E +  +   + KK  K+ AD N  
Sbjct: 49  ALEKKIREQQDQRAKLEKQLKSL-ETEIASLEAQLIETADDLKKLRKQIADLNAR 102


>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
          Length = 259

 Score = 25.5 bits (56), Expect = 7.1
 Identities = 12/41 (29%), Positives = 22/41 (53%)

Query: 8   KEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNK 48
               E E+ K++++ E  +KKE EE +  +KK  +  K  +
Sbjct: 82  ASSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKEKQ 122


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 25.7 bits (56), Expect = 7.4
 Identities = 15/45 (33%), Positives = 27/45 (60%)

Query: 8   KEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENAD 52
           K + E++   EE ++E  E++ G  ++ KK KK++ KKN  +  D
Sbjct: 311 KPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDD 355


>gnl|CDD|234306 TIGR03674, fen_arch, flap structure-specific endonuclease.
           Endonuclease that cleaves the 5'-overhanging flap
           structure that is generated by displacement synthesis
           when DNA polymerase encounters the 5'-end of a
           downstream Okazaki fragment. Has 5'-endo-/exonuclease
           and 5'-pseudo-Y-endonuclease activities. Cleaves the
           junction between single and double-stranded regions of
           flap DNA.
          Length = 338

 Score = 25.7 bits (57), Expect = 7.6
 Identities = 10/33 (30%), Positives = 20/33 (60%), Gaps = 1/33 (3%)

Query: 6   KRKEKEEKERKKEEQKKEVGE-KKEGEEEEEKK 37
           K +  EE+   +EE +++  E  ++G+ EE +K
Sbjct: 87  KAETLEERREIREEAEEKWEEALEKGDLEEARK 119


>gnl|CDD|215278 PLN02502, PLN02502, lysyl-tRNA synthetase.
          Length = 553

 Score = 25.7 bits (57), Expect = 7.8
 Identities = 12/61 (19%), Positives = 21/61 (34%), Gaps = 12/61 (19%)

Query: 2  KMKKKRKE---KEEKERKKEEQKKEVGEKKEGEEEEEK---------KKKKVEEKKNNKE 49
          K K+  +E   KEE +        +   +K    ++E          + KKVE  +    
Sbjct: 18 KAKQAEEEKAAKEEAKAAAAAAAAKGRSRKSAAADDETMDPTQYRANRLKKVEALRAKGV 77

Query: 50 N 50
           
Sbjct: 78 E 78


>gnl|CDD|235040 PRK02463, PRK02463, OxaA-like protein precursor; Provisional.
          Length = 307

 Score = 25.5 bits (56), Expect = 7.9
 Identities = 11/46 (23%), Positives = 19/46 (41%), Gaps = 6/46 (13%)

Query: 1   MKMKKKRKEKEE------KERKKEEQKKEVGEKKEGEEEEEKKKKK 40
           +K + +++  EE      K  K    +K+V   +        KKKK
Sbjct: 251 LKPRLRKQIAEEFAKNPPKAYKSPSGRKDVTPSQNQAIASLPKKKK 296


>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740).  This
           family consists of several uncharacterized plant
           proteins of unknown function.
          Length = 565

 Score = 25.8 bits (56), Expect = 7.9
 Identities = 19/67 (28%), Positives = 38/67 (56%), Gaps = 2/67 (2%)

Query: 10  KEEKERKKEEQKKEVGEKKEGEEEEEKKKKK--VEEKKNNKENADGNFEVTLATKATIYY 67
           +E++E  ++E  K V EK E EEEEE K  K  ++ +   K+ +  +   +  + A+++ 
Sbjct: 149 EEDEEYYEKEPGKVVDEKSEEEEEEELKTMKDFIDLESQTKKPSVKDNGKSFWSAASVFS 208

Query: 68  QGLVEWK 74
           + L +W+
Sbjct: 209 KKLQKWR 215


>gnl|CDD|215521 PLN02967, PLN02967, kinase.
          Length = 581

 Score = 25.8 bits (56), Expect = 8.0
 Identities = 9/44 (20%), Positives = 22/44 (50%)

Query: 9   EKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENAD 52
             +++ +K   + +        + EEEK +KKV +++  K+  +
Sbjct: 105 ALDKESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKMDE 148


>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62. 
          Length = 217

 Score = 25.5 bits (56), Expect = 8.1
 Identities = 10/41 (24%), Positives = 22/41 (53%)

Query: 5  KKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKK 45
            + +  EK +   + +     KK   +E++++KKK ++KK
Sbjct: 37 NSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKKKK 77


>gnl|CDD|189762 pfam00901, Orbi_VP5, Orbivirus outer capsid protein VP5.
           cryoelectron microscopy indicates that VP5 is a trimer
           implying that there are 360 copies of VP5 per virion.
          Length = 507

 Score = 25.4 bits (56), Expect = 8.1
 Identities = 21/73 (28%), Positives = 37/73 (50%), Gaps = 5/73 (6%)

Query: 1   MKMKKKRKEKEEK-ERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGNFEVTL 59
           ++ K K  E+E+K E  +E+  K++  +K GE+ EE  K    E K   E  +   ++ +
Sbjct: 89  LQRKLKELEREQKEEEVREKHNKKI-IEKFGEDLEEVYKFMKGEAK--VEEEEEK-QMEI 144

Query: 60  ATKATIYYQGLVE 72
             KA   Y  +V+
Sbjct: 145 LEKALKSYLKIVK 157


>gnl|CDD|218010 pfam04293, SpoVR, SpoVR like protein.  Bacillus subtilis stage V
           sporulation protein R is involved in spore cortex
           formation. Little is known about cortex biosynthesis,
           except that it depends on several sigma E controlled
           genes, including spoVR.
          Length = 427

 Score = 25.6 bits (57), Expect = 8.3
 Identities = 12/49 (24%), Positives = 24/49 (48%), Gaps = 10/49 (20%)

Query: 2   KMKKKRKEKEEKERKKEEQK----------KEVGEKKEGEEEEEKKKKK 40
           K   K   +EEK R++E ++          + +  ++E  EEE  ++K+
Sbjct: 170 KRPIKLSLEEEKARQEEREEYLQSQVNDLWRTIPPQREKLEEERARRKR 218


>gnl|CDD|220265 pfam09491, RE_AlwI, AlwI restriction endonuclease.  This family
           includes the AlwI (recognises GGATC), Bsp6I (recognises
           GC^NGC), BstNBI (recognises GASTC), PleI(recognises
           GAGTC) and MlyI (recognises GAGTC) restriction
           endonucleases.
          Length = 429

 Score = 25.4 bits (56), Expect = 8.3
 Identities = 17/55 (30%), Positives = 29/55 (52%), Gaps = 1/55 (1%)

Query: 2   KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNN-KENADGNF 55
           ++ K RKE+ + +   ++  KE  EK+     +   KKK E  ++N K+ AD  F
Sbjct: 101 RILKFRKERLKNKDNIKKLSKEYFEKECYSISDSSSKKKFETSQSNLKDYADTFF 155


>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein.  This entry is a highly
           conserved protein present in eukaryotes.
          Length = 680

 Score = 25.7 bits (56), Expect = 8.4
 Identities = 14/47 (29%), Positives = 24/47 (51%), Gaps = 2/47 (4%)

Query: 3   MKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKE 49
           M+K+ K + +     E+Q  E  EKK  +EEEE   +   +   ++E
Sbjct: 481 MEKRLKSEADSRVNAEKQLAE--EKKRKKEEEETAARAAAQAAASRE 525


>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
           unknown].
          Length = 652

 Score = 25.4 bits (56), Expect = 8.8
 Identities = 7/40 (17%), Positives = 22/40 (55%)

Query: 7   RKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKN 46
           ++E EE +R+ E+ + E+   +    ++ +K +++  +  
Sbjct: 442 KRELEELKREIEKLESELERFRREVRDKVRKDREIRARDR 481


>gnl|CDD|218311 pfam04888, SseC, Secretion system effector C (SseC) like family. 
          SseC is a secreted protein that forms a complex
          together with SecB and SecD on the surface of
          Salmonella. All these proteins are secreted by the type
          III secretion system. Many mucosal pathogens use type
          III secretion systems for the injection of effector
          proteins into target cells. SecB, SseC and SecD are
          inserted into the target cell membrane. where they form
          a small pore or translocon. In addition to SseC, this
          family includes the bacterial secreted proteins PopB,
          PepB, YopB and EspD which are thought to be directly
          involved in pore formation, and type III secretion
          system translocon.
          Length = 303

 Score = 25.5 bits (56), Expect = 8.8
 Identities = 8/39 (20%), Positives = 25/39 (64%)

Query: 4  KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVE 42
          K+ + + ++ ER +++Q+K+  E +E  ++  +K ++ +
Sbjct: 15 KQAKSKLQQLERARDKQEKKAEEYQEQIKKAIEKAEEAK 53


>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
          Length = 880

 Score = 25.4 bits (56), Expect = 9.0
 Identities = 15/47 (31%), Positives = 26/47 (55%), Gaps = 1/47 (2%)

Query: 6   KRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENAD 52
           KR+   E   ++ E+  E  E+K  E E E  + ++EE + +KE A+
Sbjct: 614 KREALAELNDERRERLAEKRERKR-ELEAEFDEARIEEAREDKERAE 659


>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6.  In yeast, 15 Apg proteins
           coordinate the formation of autophagosomes. Autophagy is
           a bulk degradation process induced by starvation in
           eukaryotic cells. Apg6/Vps30p has two distinct functions
           in the autophagic process, either associated with the
           membrane or in a retrieval step of the carboxypeptidase
           Y sorting pathway.
          Length = 356

 Score = 25.2 bits (55), Expect = 9.0
 Identities = 10/50 (20%), Positives = 26/50 (52%)

Query: 1   MKMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKEN 50
            ++ + +KE+E    + EE +KE  +      E +++K+++E ++     
Sbjct: 70  SELDELKKEEERLLDELEELEKEDDDLDGELVELQEEKEQLENEELQYLR 119


>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein.  This is a family of
           fungal and plant proteins and contains many hypothetical
           proteins. VID27 is a cytoplasmic protein that plays a
           potential role in vacuolar protein degradation.
          Length = 794

 Score = 25.5 bits (56), Expect = 9.3
 Identities = 19/55 (34%), Positives = 33/55 (60%), Gaps = 3/55 (5%)

Query: 8   KEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGNFEVTLATK 62
            E+EE E ++EE+ ++ G  KE  ++EE ++  VE K    E++DGN  + +  K
Sbjct: 390 DEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVESK---YEDSDGNSSLAVGYK 441


>gnl|CDD|235665 PRK05996, motB, flagellar motor protein MotB; Validated.
          Length = 423

 Score = 25.4 bits (56), Expect = 9.3
 Identities = 7/32 (21%), Positives = 15/32 (46%)

Query: 22  KEVGEKKEGEEEEEKKKKKVEEKKNNKENADG 53
           K+  +  EGE++  K K + +++        G
Sbjct: 84  KDPVDGAEGEQKPGKSKFEEDQRVEGSSAVTG 115


>gnl|CDD|234352 TIGR03779, Bac_Flav_CT_M, Bacteroides conjugative transposon TraM
           protein.  Members of this protein family are designated
           TraM and are found in a proposed transfer region of a
           class of conjugative transposon found in the Bacteroides
           lineage [Cellular processes, DNA transformation].
          Length = 410

 Score = 25.5 bits (56), Expect = 9.4
 Identities = 9/46 (19%), Positives = 24/46 (52%), Gaps = 2/46 (4%)

Query: 4   KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKE 49
           K K  E+E  E+K++++K+ + +    +  E+  K+  +   ++  
Sbjct: 58  KMKAYEQEALEQKQKDKKRTLQDL--ADSFEKSDKESQDYSSSDSG 101


>gnl|CDD|220654 pfam10254, Pacs-1, PACS-1 cytosolic sorting protein.  PACS-1 is a
           cytosolic sorting protein that directs the localisation
           of membrane proteins in the trans-Golgi network
           (TGN)/endosomal system. PACS-1 connects the clathrin
           adaptor AP-1 to acidic cluster sorting motifs contained
           in the cytoplasmic domain of cargo proteins such as
           furin, the cation-independent mannose-6-phosphate
           receptor and in viral proteins such as human
           immunodeficiency virus type 1 Nef.
          Length = 413

 Score = 25.2 bits (55), Expect = 9.6
 Identities = 11/30 (36%), Positives = 17/30 (56%), Gaps = 1/30 (3%)

Query: 13  KERKKEEQKKEVGEKKEGEEEEEKKKKKVE 42
           KE+ K+     +G KK  E+E E K + +E
Sbjct: 332 KEKNKKVPVMFLG-KKPKEKEVESKSQCIE 360


>gnl|CDD|235185 PRK03980, PRK03980, flap endonuclease-1; Provisional.
          Length = 292

 Score = 25.2 bits (56), Expect = 9.6
 Identities = 13/33 (39%), Positives = 22/33 (66%), Gaps = 1/33 (3%)

Query: 6  KRKEKEEKERKKEEQKKEVGE-KKEGEEEEEKK 37
          K +E EE+   +EE +++  E K+EG+ EE +K
Sbjct: 40 KAEEIEERREVREEAEEKYEEAKEEGDLEEARK 72


>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572).  Family of
           eukaryotic proteins with undetermined function.
          Length = 321

 Score = 25.1 bits (55), Expect = 9.6
 Identities = 14/60 (23%), Positives = 29/60 (48%)

Query: 4   KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGNFEVTLATKA 63
              R+EK+E+E ++EE +  +     G E EE +++  +E   + E  + N     +  +
Sbjct: 175 ALFRREKKEEEEEEEEDEALIKSLSFGPETEEDRRRADDEDSEDDEEDNDNTPSPKSGSS 234


>gnl|CDD|178354 PLN02753, PLN02753, triacylglycerol lipase.
          Length = 531

 Score = 25.4 bits (55), Expect = 9.7
 Identities = 12/41 (29%), Positives = 23/41 (56%), Gaps = 2/41 (4%)

Query: 7   RKEKEEKERKKE--EQKKEVGEKKEGEEEEEKKKKKVEEKK 45
           R E+E +ER+    ++ +  GE     E+  ++ KK EE++
Sbjct: 61  RLERERRERQGLLIDEAEGAGELWLTAEDIRRRDKKTEEER 101


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.305    0.127    0.357 

Gapped
Lambda     K      H
   0.267   0.0677    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 5,894,001
Number of extensions: 521446
Number of successful extensions: 8199
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5567
Number of HSP's successfully gapped: 1891
Length of query: 118
Length of database: 10,937,602
Length adjustment: 81
Effective length of query: 37
Effective length of database: 7,344,928
Effective search space: 271762336
Effective search space used: 271762336
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 43 (21.9 bits)
S2: 53 (24.3 bits)