RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy16486
         (150 letters)



>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 42.0 bits (98), Expect = 4e-05
 Identities = 15/43 (34%), Positives = 28/43 (65%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
           +KR+   +K   E E++ +++ E EK+K++E+ERE E+  E  
Sbjct: 579 KKREEAVEKAKREAEQKAREEREREKEKEKEREREREREAERA 621



 Score = 40.1 bits (93), Expect = 2e-04
 Identities = 19/37 (51%), Positives = 24/37 (64%), Gaps = 2/37 (5%)

Query: 113 KKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVE 149
           KK EE  E  K K E E+K +EE+ERE+EK +E   E
Sbjct: 579 KKREEAVE--KAKREAEQKAREEREREKEKEKERERE 613



 Score = 37.4 bits (86), Expect = 0.002
 Identities = 17/42 (40%), Positives = 30/42 (71%), Gaps = 1/42 (2%)

Query: 97  KKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKER 138
           K+E  +   KR+ E+K + EE+E EK+K++E E++++ E ER
Sbjct: 580 KREEAVEKAKREAEQKAR-EEREREKEKEKEREREREREAER 620



 Score = 33.1 bits (75), Expect = 0.046
 Identities = 15/31 (48%), Positives = 22/31 (70%), Gaps = 2/31 (6%)

Query: 122 KKKKEEEEKKKK--EEKEREEEKREEENVEE 150
           KK++E  EK K+  E+K REE +RE+E  +E
Sbjct: 579 KKREEAVEKAKREAEQKAREEREREKEKEKE 609


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 39.9 bits (94), Expect = 2e-04
 Identities = 23/46 (50%), Positives = 35/46 (76%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
            K++ E+KK+ ++K    KKKEEEE+++KE+KE E+E+ EEE  EE
Sbjct: 420 EKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEE 465



 Score = 39.5 bits (93), Expect = 2e-04
 Identities = 24/45 (53%), Positives = 33/45 (73%)

Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           K   + +KK EE+++EKKKK    KKK+EE+E E+EK+EEE  EE
Sbjct: 414 KIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEE 458



 Score = 39.1 bits (92), Expect = 3e-04
 Identities = 24/61 (39%), Positives = 40/61 (65%)

Query: 90  LEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVE 149
           +EF + +KK      +  +  +KK+ EEK+E+KKK    +KK++EE+E +E+K EE+  E
Sbjct: 399 IEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEE 458

Query: 150 E 150
           E
Sbjct: 459 E 459



 Score = 39.1 bits (92), Expect = 3e-04
 Identities = 18/59 (30%), Positives = 36/59 (61%)

Query: 92  FRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
                K    +     + EKK++ E+KE++KK    ++K+++EE+E+E+++ E+E  EE
Sbjct: 402 LTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEE 460



 Score = 39.1 bits (92), Expect = 4e-04
 Identities = 19/57 (33%), Positives = 39/57 (68%)

Query: 94  SETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           +  K +  +   +++ E++KK ++K+    KK+EEE+++++EK+ EE++ EEE  EE
Sbjct: 408 ATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEE 464



 Score = 38.4 bits (90), Expect = 6e-04
 Identities = 22/51 (43%), Positives = 35/51 (68%)

Query: 97  KKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
           KK       K++     K +E+EEE++K+++EE+K++EE+E EEEK EEE 
Sbjct: 421 KKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEE 471



 Score = 38.0 bits (89), Expect = 8e-04
 Identities = 21/46 (45%), Positives = 34/46 (73%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           R+ + ++KKK     ++K+++EEEEK+KKEE++ EEE+  EE  EE
Sbjct: 423 REEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEE 468



 Score = 36.8 bits (86), Expect = 0.002
 Identities = 22/41 (53%), Positives = 33/41 (80%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           EKKKK    +++++++EEE++KK+EEKE EEE+ EEE  EE
Sbjct: 429 EKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEE 469



 Score = 36.4 bits (85), Expect = 0.003
 Identities = 20/46 (43%), Positives = 35/46 (76%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           +KR+ EKK+K ++    KKK+EEEE++K++++E +EE+ EE   E+
Sbjct: 421 KKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEK 466



 Score = 35.7 bits (83), Expect = 0.005
 Identities = 19/46 (41%), Positives = 35/46 (76%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           ++ + +K+KK +    +KK++EEEE+K+K+E+E+EEE+ E E  +E
Sbjct: 422 KREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKE 467



 Score = 35.3 bits (82), Expect = 0.007
 Identities = 16/46 (34%), Positives = 35/46 (76%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           +K + +K    ++KEEE+++++E+++++KEE+E E E+ +EE  E+
Sbjct: 427 KKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEK 472



 Score = 35.3 bits (82), Expect = 0.008
 Identities = 16/51 (31%), Positives = 32/51 (62%)

Query: 96  TKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
           T++E       ++  KK K   ++ EKK++EE+++KKK+    ++++ EEE
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEE 445



 Score = 34.5 bits (80), Expect = 0.012
 Identities = 19/62 (30%), Positives = 39/62 (62%)

Query: 89  LLEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENV 148
           L   +  TKK   +  +  +  +++K E+K++    K++EE++++E++++EEEK EEE  
Sbjct: 402 LTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEE 461

Query: 149 EE 150
            E
Sbjct: 462 AE 463



 Score = 34.1 bits (79), Expect = 0.018
 Identities = 19/45 (42%), Positives = 35/45 (77%)

Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           +++ +KKK    K++E++++EE+EKK++E++E EEE  EE+  EE
Sbjct: 426 EKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEE 470



 Score = 33.0 bits (76), Expect = 0.043
 Identities = 17/43 (39%), Positives = 36/43 (83%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
           +   G+KK++ EE+E+EKK++E+EE++++ E+E+EEE+ +++ 
Sbjct: 433 KAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 39.6 bits (92), Expect = 3e-04
 Identities = 20/52 (38%), Positives = 34/52 (65%)

Query: 98  KEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVE 149
           K+   G     G     +EE+EEE++++EEEE++++EE+E EEE  E  ++E
Sbjct: 846 KQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEEPLSLE 897



 Score = 38.4 bits (89), Expect = 6e-04
 Identities = 19/56 (33%), Positives = 33/56 (58%)

Query: 91  EFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
           E  +E + E     +   G       + EEE++++EEEE++++EE+E EEE+ E E
Sbjct: 836 ELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENE 891



 Score = 37.7 bits (87), Expect = 0.001
 Identities = 18/44 (40%), Positives = 27/44 (61%)

Query: 107 RQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           +Q EK        +    +EEEE++++EE+E EEE+ EEE  EE
Sbjct: 846 KQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEE 889



 Score = 33.8 bits (77), Expect = 0.029
 Identities = 18/49 (36%), Positives = 29/49 (59%), Gaps = 6/49 (12%)

Query: 108 QGEKKKKNEEKEEE------KKKKEEEEKKKKEEKEREEEKREEENVEE 150
           QGE K+  +  +          ++EEEE++++EE+E EEE+ EEE  E 
Sbjct: 842 QGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEEN 890



 Score = 27.7 bits (61), Expect = 2.9
 Identities = 12/36 (33%), Positives = 27/36 (75%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
           E++++ EE+EEE++++EEEE ++    E  E ++++
Sbjct: 870 EEEEEEEEEEEEEEEEEEEENEEPLSLEWPETRQKQ 905



 Score = 26.9 bits (59), Expect = 6.0
 Identities = 13/42 (30%), Positives = 21/42 (50%)

Query: 109 GEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           GE+     E+E E + K E E + +   ER+ E+  E  +E 
Sbjct: 660 GEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEA 701



 Score = 26.5 bits (58), Expect = 6.9
 Identities = 17/60 (28%), Positives = 30/60 (50%), Gaps = 1/60 (1%)

Query: 91  EFRSETKKEGNL-GFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVE 149
           E + E + EG +   RK + E + + E KE + K + E E+ + E +   E   +E  +E
Sbjct: 674 ETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIE 733


>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
          Length = 651

 Score = 38.9 bits (91), Expect = 4e-04
 Identities = 14/37 (37%), Positives = 27/37 (72%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
           E +++ EEKE  K++K   + KK+EEK+++E ++ E+
Sbjct: 552 ELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588



 Score = 38.1 bits (89), Expect = 9e-04
 Identities = 14/35 (40%), Positives = 27/35 (77%)

Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREE 140
           +R+ E+K+  +E++  +K K++EEKKKKE ++ E+
Sbjct: 554 QREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588



 Score = 32.7 bits (75), Expect = 0.062
 Identities = 20/53 (37%), Positives = 35/53 (66%), Gaps = 3/53 (5%)

Query: 101 NLGFR---KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           NLG R   K  G    K ++KEE +++KEE+E  K++++ R+ +K+EE+  +E
Sbjct: 530 NLGIRIEDKPDGPSVWKLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKE 582



 Score = 31.9 bits (73), Expect = 0.095
 Identities = 15/58 (25%), Positives = 34/58 (58%)

Query: 90  LEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
           L  R E K +G   ++    E+ ++ +E++E  K+++   K KK+E+++++E  + E 
Sbjct: 531 LGIRIEDKPDGPSVWKLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588



 Score = 30.8 bits (70), Expect = 0.23
 Identities = 13/39 (33%), Positives = 28/39 (71%), Gaps = 2/39 (5%)

Query: 114 KNEEKEEEKKKKEEEEKK--KKEEKEREEEKREEENVEE 150
           K E + E+++K+  +E+K  +K +K+ E++K+E E +E+
Sbjct: 550 KEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 35.7 bits (83), Expect = 0.005
 Identities = 22/72 (30%), Positives = 37/72 (51%), Gaps = 19/72 (26%)

Query: 90  LEFRSETKKEGNLGFRKRQGEKKKKN----------EEKEEEKKKKEE--------EEKK 131
           L+   E  ++ +   R+ + EK  K           E+KEE+KK++ E        EE++
Sbjct: 251 LKLSPEVLRKVD-KTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQR 309

Query: 132 KKEEKEREEEKR 143
           K EEKER+++ R
Sbjct: 310 KLEEKERKKQAR 321



 Score = 34.9 bits (81), Expect = 0.008
 Identities = 19/47 (40%), Positives = 29/47 (61%)

Query: 100 GNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
            NL        K  K  E+EEEK  K  EE++++E +E++EEK++EE
Sbjct: 249 ANLKLSPEVLRKVDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEE 295



 Score = 31.1 bits (71), Expect = 0.17
 Identities = 15/46 (32%), Positives = 26/46 (56%), Gaps = 5/46 (10%)

Query: 91  EFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEK 136
           E   E K+E     +K++  + K  +   EE++K EE+E+KK+  K
Sbjct: 282 EEAQEKKEE-----KKKEEREAKLAKLSPEEQRKLEEKERKKQARK 322


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 35.2 bits (81), Expect = 0.008
 Identities = 15/49 (30%), Positives = 29/49 (59%), Gaps = 3/49 (6%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEE---EKKKKEEKEREEEKREEENVEE 150
           +K    K + +  +E E K KEE+   E+ +K++KE+ +E +E+   +E
Sbjct: 383 KKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDE 431



 Score = 33.2 bits (76), Expect = 0.040
 Identities = 17/62 (27%), Positives = 31/62 (50%)

Query: 89  LLEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENV 148
           L +   E K   +L   KRQ   K+ N++ ++  KK  E + +    +E E + +EE+  
Sbjct: 349 LTKLYEEVKSNTDLSGDKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLK 408

Query: 149 EE 150
           +E
Sbjct: 409 QE 410



 Score = 29.0 bits (65), Expect = 0.98
 Identities = 10/26 (38%), Positives = 20/26 (76%)

Query: 108 QGEKKKKNEEKEEEKKKKEEEEKKKK 133
           Q E +KK +E+ +E K+K +++++KK
Sbjct: 409 QEENEKKQKEQADEDKEKRQKDERKK 434



 Score = 29.0 bits (65), Expect = 0.99
 Identities = 12/29 (41%), Positives = 24/29 (82%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKER 138
           E+K K EE E+++K++ +E+K+K+++ ER
Sbjct: 404 EEKLKQEENEKKQKEQADEDKEKRQKDER 432



 Score = 26.3 bits (58), Expect = 6.9
 Identities = 12/55 (21%), Positives = 33/55 (60%), Gaps = 3/55 (5%)

Query: 89  LLEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKR 143
             +   E K E +      + E K K E+ ++E+ +K+++E+  +++++R++++R
Sbjct: 381 YTKKLGEVKDETD---ASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDER 432


>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
           subunit G; Reviewed.
          Length = 197

 Score = 34.7 bits (79), Expect = 0.008
 Identities = 21/60 (35%), Positives = 39/60 (65%), Gaps = 2/60 (3%)

Query: 93  RSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKRE--EENVEE 150
           + + KK+ +L  R+ Q EK ++  E+ EE+ + E  E+K  E +++EE++RE  E+ +EE
Sbjct: 112 KDDIKKKKSLIIRQEQIEKARQEREELEERMEWERREEKIDEREDQEEQEREREEQTIEE 171


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 35.2 bits (81), Expect = 0.009
 Identities = 25/64 (39%), Positives = 40/64 (62%), Gaps = 4/64 (6%)

Query: 91  EFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKK----KKEEEEKKKKEEKEREEEKREEE 146
           E ++E+ KE      + + EKKKK E+ +EE K    K+E +EK+  +EKE+E+EK+ EE
Sbjct: 97  EPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEE 156

Query: 147 NVEE 150
             + 
Sbjct: 157 PRDR 160



 Score = 33.3 bits (76), Expect = 0.032
 Identities = 14/57 (24%), Positives = 34/57 (59%)

Query: 94  SETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
             +K         ++ + +   EE++E+++ KEE++KKK++ KE  ++++ +E  +E
Sbjct: 83  GGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKE 139



 Score = 29.9 bits (67), Expect = 0.45
 Identities = 17/57 (29%), Positives = 34/57 (59%)

Query: 94  SETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
            E +KE     +K++ EK K+  +  + K++ +E+   K++EKE+E++  E  + EE
Sbjct: 106 EEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREE 162



 Score = 29.9 bits (67), Expect = 0.54
 Identities = 18/58 (31%), Positives = 35/58 (60%)

Query: 93  RSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           + + K+E      K + ++K+  +EKE+EK+KK EE + ++EEK+RE  + +    + 
Sbjct: 121 KEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKP 178



 Score = 29.5 bits (66), Expect = 0.64
 Identities = 18/57 (31%), Positives = 32/57 (56%)

Query: 90  LEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
           L      K+    G +    + K   E K E  K++E+E+++ KEEK++++EK +EE
Sbjct: 71  LSSDEAVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEE 127



 Score = 28.3 bits (63), Expect = 1.9
 Identities = 14/54 (25%), Positives = 29/54 (53%)

Query: 93  RSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
             + +K       KR  ++K+K +EK+ E+ +  EEEKK++  + +   K+  +
Sbjct: 127 EPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPK 180



 Score = 26.4 bits (58), Expect = 7.8
 Identities = 14/53 (26%), Positives = 25/53 (47%)

Query: 93  RSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
             E  KE      K + ++KK  E ++ E++KK E  + K   K+  ++K   
Sbjct: 133 PKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPN 185


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 34.9 bits (81), Expect = 0.009
 Identities = 10/41 (24%), Positives = 24/41 (58%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
            KK + ++++ EKK ++EE +K   +K+ E   ++ +  + 
Sbjct: 411 RKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDG 451



 Score = 32.6 bits (75), Expect = 0.060
 Identities = 15/43 (34%), Positives = 23/43 (53%), Gaps = 2/43 (4%)

Query: 110 EKKK--KNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           E+KK  K + K E+K +KEE EK   ++K     K+ +    E
Sbjct: 410 ERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGE 452



 Score = 32.2 bits (74), Expect = 0.075
 Identities = 14/41 (34%), Positives = 22/41 (53%)

Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
           +R+ EKK + EE E+   KK+ E   KK +    E K+ + 
Sbjct: 418 QRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDP 458



 Score = 32.2 bits (74), Expect = 0.081
 Identities = 12/36 (33%), Positives = 21/36 (58%)

Query: 114 KNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVE 149
             E K+  KK+++ E+K +KEE E+   K++ E   
Sbjct: 408 PAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAA 443



 Score = 29.9 bits (68), Expect = 0.45
 Identities = 10/46 (21%), Positives = 24/46 (52%), Gaps = 2/46 (4%)

Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEK--EREEEKREEENVE 149
             +GE+++         ++K+  +K++K EK  E+EE ++     +
Sbjct: 393 LAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKK 438



 Score = 29.5 bits (67), Expect = 0.70
 Identities = 12/46 (26%), Positives = 24/46 (52%), Gaps = 1/46 (2%)

Query: 105 RKRQGEKKKKNEEKEE-EKKKKEEEEKKKKEEKEREEEKREEENVE 149
           RK+  +K++K E+K E E+ +K   +KK +   ++ +    E    
Sbjct: 411 RKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKV 456



 Score = 28.4 bits (64), Expect = 1.7
 Identities = 8/35 (22%), Positives = 20/35 (57%)

Query: 113 KKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
               +K  +K++K E++ +K+E ++   +K+ E  
Sbjct: 408 PAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAA 442



 Score = 28.0 bits (63), Expect = 2.5
 Identities = 13/39 (33%), Positives = 21/39 (53%)

Query: 112 KKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
             + EE+E E       E+KK  +K+R+ EK+ E+   E
Sbjct: 393 LAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAE 431


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 34.7 bits (80), Expect = 0.009
 Identities = 16/46 (34%), Positives = 33/46 (71%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           +K   EK+K   +K++ KKKK++++ KK ++K+ + EK++E+  E+
Sbjct: 75  KKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAED 120



 Score = 33.9 bits (78), Expect = 0.016
 Identities = 11/44 (25%), Positives = 32/44 (72%)

Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVE 149
           K++ + KKK  +K+++K K ++++KK  + ++++E++ E++  +
Sbjct: 81  KQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLED 124



 Score = 31.6 bits (72), Expect = 0.088
 Identities = 14/45 (31%), Positives = 32/45 (71%)

Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           ++Q  K KK + K+++ K K++++ KK ++ E+++EK  E+ +E+
Sbjct: 80  EKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLED 124



 Score = 28.9 bits (65), Expect = 0.73
 Identities = 12/52 (23%), Positives = 29/52 (55%)

Query: 95  ETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
           E K++     +K + +K K  ++K+++K  K E++ +K+ E + E+  +   
Sbjct: 79  EEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYS 130



 Score = 28.5 bits (64), Expect = 0.91
 Identities = 11/39 (28%), Positives = 27/39 (69%)

Query: 109 GEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
            +KK K+++K+++KK  + E+K +KE +++ E+  +  +
Sbjct: 92  KKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYS 130



 Score = 27.3 bits (61), Expect = 2.7
 Identities = 16/41 (39%), Positives = 31/41 (75%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           +KKKK   +E EK KKE EEK+K + K+++ +K+++++ ++
Sbjct: 61  KKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDK 101



 Score = 27.3 bits (61), Expect = 2.8
 Identities = 11/33 (33%), Positives = 18/33 (54%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEK 142
           +K  K ++K E+K +KE E+K +   K   E  
Sbjct: 101 KKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETL 133



 Score = 26.2 bits (58), Expect = 7.2
 Identities = 9/51 (17%), Positives = 29/51 (56%)

Query: 95  ETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
           +  K      +K++ + K K ++K+++K +K++E++ + + ++  +   E 
Sbjct: 82  QKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSET 132



 Score = 25.8 bits (57), Expect = 8.7
 Identities = 11/36 (30%), Positives = 26/36 (72%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
           E +K  +E EE++K K +++K KK++ + +++K ++
Sbjct: 70  EIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDK 105



 Score = 25.8 bits (57), Expect = 9.8
 Identities = 12/45 (26%), Positives = 31/45 (68%)

Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
             + EK KK  E++++ K K+++ KKKK++ + +++ ++++  E+
Sbjct: 68  AEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEK 112


>gnl|CDD|212071 cd11502, SLC6sbd_NTT5, Neurotransmitter transporter 5;
           solute-binding domain.  Human NTT5 is encoded by the
           SLC6A16 gene. NTT5 is expressed in testis, pancreas, and
           prostate; its expression is predominantly intracellular,
           indicative of a vesicular location. Its substrates are
           unknown. This subgroup belongs to the solute carrier 6
           (SLC6) transporter family.
          Length = 543

 Score = 34.7 bits (80), Expect = 0.011
 Identities = 10/34 (29%), Positives = 18/34 (52%), Gaps = 4/34 (11%)

Query: 53  YFIGDSFETSVPWDKTVLLCINVKKRLTRECTVS 86
           +++G SF+  +PW++    C  VK     EC  +
Sbjct: 107 FYLGHSFQFPLPWEQ----CPLVKNSSDPECART 136


>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
           component YidC; Validated.
          Length = 429

 Score = 34.8 bits (80), Expect = 0.012
 Identities = 15/37 (40%), Positives = 22/37 (59%), Gaps = 1/37 (2%)

Query: 110 EKKKKNEEKEEEK-KKKEEEEKKKKEEKEREEEKREE 145
            KK +  EK E K +KKE  +K++  E+E   E R+E
Sbjct: 332 IKKTRTAEKNEAKARKKEIAQKRRAAEREINREARQE 368



 Score = 29.4 bits (66), Expect = 0.74
 Identities = 10/34 (29%), Positives = 16/34 (47%)

Query: 116 EEKEEEKKKKEEEEKKKKEEKEREEEKREEENVE 149
           E K+    +K E + +KKE  ++      E N E
Sbjct: 331 EIKKTRTAEKNEAKARKKEIAQKRRAAEREINRE 364


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 34.3 bits (79), Expect = 0.015
 Identities = 17/37 (45%), Positives = 25/37 (67%)

Query: 111 KKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
           KK K E+K  E KKK E++KKKK+EK+  + + E + 
Sbjct: 55  KKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKL 91



 Score = 31.6 bits (72), Expect = 0.11
 Identities = 16/38 (42%), Positives = 24/38 (63%)

Query: 111 KKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENV 148
           K KK ++  E KKK E+++KKKKE+KE + E   +   
Sbjct: 56  KDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGF 93



 Score = 31.6 bits (72), Expect = 0.11
 Identities = 11/35 (31%), Positives = 24/35 (68%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKRE 144
           E KKK+E+K+++KK+K+E + + + +   +  K+ 
Sbjct: 65  ESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKS 99



 Score = 27.4 bits (61), Expect = 3.2
 Identities = 14/58 (24%), Positives = 32/58 (55%)

Query: 93  RSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
            SE + +      K+  ++ K NE K++ +KKK+++++KK+ + E E +   +   + 
Sbjct: 42  FSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKS 99



 Score = 27.0 bits (60), Expect = 4.5
 Identities = 16/66 (24%), Positives = 28/66 (42%)

Query: 82  ECTVSNGLLEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEE 141
                  L  F  E  K      +K + E K    +K+ EKKKK+++EKK+ + +   + 
Sbjct: 32  LVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKL 91

Query: 142 KREEEN 147
             +   
Sbjct: 92  GFKTPK 97



 Score = 27.0 bits (60), Expect = 5.0
 Identities = 11/59 (18%), Positives = 27/59 (45%)

Query: 90  LEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENV 148
            E  ++         +K     + K + ++++KKKKE++E K + E +   +  ++   
Sbjct: 43  SEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKK 101



 Score = 26.2 bits (58), Expect = 7.8
 Identities = 12/42 (28%), Positives = 20/42 (47%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
           +K + E K + E K   K  K+ ++ KKK  K +  E  +  
Sbjct: 77  KKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNA 118


>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
          Length = 1832

 Score = 34.3 bits (79), Expect = 0.020
 Identities = 17/38 (44%), Positives = 28/38 (73%)

Query: 108 QGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
           +  + K ++  EE+ KKK+++EKKK+EE +REE+ R E
Sbjct: 731 KDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKARIE 768



 Score = 32.0 bits (73), Expect = 0.11
 Identities = 13/34 (38%), Positives = 22/34 (64%)

Query: 117 EKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
            K+ E K  +  E+K K++K++E++K EE   EE
Sbjct: 730 GKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREE 763



 Score = 31.6 bits (72), Expect = 0.13
 Identities = 15/34 (44%), Positives = 24/34 (70%)

Query: 113 KKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
           K  E K  +  +++ ++KKKKE+K+ EE KREE+
Sbjct: 731 KDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEK 764



 Score = 30.8 bits (70), Expect = 0.23
 Identities = 16/42 (38%), Positives = 26/42 (61%)

Query: 108 QGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVE 149
             + + K  +  EEK KK+++++KKKEE+ + EEK   E  E
Sbjct: 730 GKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKARIEIAE 771



 Score = 30.5 bits (69), Expect = 0.33
 Identities = 15/65 (23%), Positives = 30/65 (46%), Gaps = 3/65 (4%)

Query: 77   KRLTRECTVSNGLLEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEK 136
            +   +E     G  E  S+ +K+GNL         ++KN E++  +   ++ + KK+ + 
Sbjct: 1521 ENRNQEEKEPAGQGELESDKEKKGNLESVL---SNQEKNIEEDYAESDIKKRKNKKQYKS 1577

Query: 137  EREEE 141
              E E
Sbjct: 1578 NTEAE 1582



 Score = 28.1 bits (63), Expect = 2.2
 Identities = 15/56 (26%), Positives = 27/56 (48%)

Query: 95   ETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
            E K+    G  +   EKK   E     ++K  EE+  + + K+R+ +K+ + N E 
Sbjct: 1526 EEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAESDIKKRKNKKQYKSNTEA 1581



 Score = 28.1 bits (63), Expect = 2.3
 Identities = 18/49 (36%), Positives = 26/49 (53%), Gaps = 5/49 (10%)

Query: 92  FRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREE 140
           FR+   K+          EK KK ++KE     K++EE+ K+EEK R E
Sbjct: 725 FRNWNGKDAEFKISDSVEEKTKKKKKKE-----KKKEEEYKREEKARIE 768



 Score = 27.8 bits (62), Expect = 3.1
 Identities = 15/61 (24%), Positives = 29/61 (47%)

Query: 89   LLEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENV 148
            LL       +  N    +++      NE+K  E + + +EEK+   + E E +K ++ N+
Sbjct: 1487 LLLLNLNGNENVNKKINQKKKGFIPSNEKKSIEIENRNQEEKEPAGQGELESDKEKKGNL 1546

Query: 149  E 149
            E
Sbjct: 1547 E 1547


>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
          Length = 1066

 Score = 34.1 bits (78), Expect = 0.021
 Identities = 16/44 (36%), Positives = 27/44 (61%)

Query: 107 RQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
            + EKK   EE+ E KKKKEE+ K+K+ +K +  +K  +  ++ 
Sbjct: 6   SEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQA 49



 Score = 32.2 bits (73), Expect = 0.10
 Identities = 17/44 (38%), Positives = 28/44 (63%)

Query: 103 GFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
           G R     +KK   E+E E+KKK+EE+ K+KE K+ +  ++E +
Sbjct: 1   GSRTESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAK 44



 Score = 30.6 bits (69), Expect = 0.35
 Identities = 12/43 (27%), Positives = 28/43 (65%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
           +K   E++ + ++K+EEK K++E +K K  +KE + + + ++ 
Sbjct: 10  KKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQA 52



 Score = 27.9 bits (62), Expect = 2.4
 Identities = 21/72 (29%), Positives = 36/72 (50%), Gaps = 14/72 (19%)

Query: 91  EFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEK------------KKKEEKER 138
           E +  T++E  L  +K++ EK K+ E K+ +  +KE + K             KK EK+ 
Sbjct: 9   EKKILTEEE--LERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKKS 66

Query: 139 EEEKREEENVEE 150
            +   E+EN E+
Sbjct: 67  RKRDVEDENPED 78


>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain.  Axonemal
           dynein light chain proteins play a dynamic role in
           flagellar and cilia motility. Eukaryotic cilia and
           flagella are complex organelles consisting of a core
           structure, the axoneme, which is composed of nine
           microtubule doublets forming a cylinder that surrounds a
           pair of central singlet microtubules. This
           ultra-structural arrangement seems to be one of the most
           stable micro-tubular assemblies known and is responsible
           for the flagellar and ciliary movement of a large number
           of organisms ranging from protozoan to mammals. This
           light chain interacts directly with the N-terminal half
           of the heavy chains.
          Length = 189

 Score = 33.0 bits (76), Expect = 0.031
 Identities = 18/38 (47%), Positives = 25/38 (65%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
           E KK  EEKEE +K+  E E K +  ++REEE+R+ E 
Sbjct: 128 EIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEE 165



 Score = 30.3 bits (69), Expect = 0.31
 Identities = 12/47 (25%), Positives = 27/47 (57%), Gaps = 4/47 (8%)

Query: 104 FRKRQGEKKKKNEEKEEE----KKKKEEEEKKKKEEKEREEEKREEE 146
             +   + +++ EE E+     + K E  EK+++EE++ EE++  +E
Sbjct: 125 LEQEIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHADE 171


>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.  This
           domain is found to the N-terminus of bacterial signal
           peptidases of the S49 family (pfam01343).
          Length = 154

 Score = 32.9 bits (76), Expect = 0.031
 Identities = 12/35 (34%), Positives = 22/35 (62%)

Query: 109 GEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKR 143
            +K+ K  EK E+K +K + + +KK+ K+ E + R
Sbjct: 64  DKKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKPR 98



 Score = 28.3 bits (64), Expect = 1.2
 Identities = 10/28 (35%), Positives = 18/28 (64%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKK 132
            ++  +K +K + K E+KK K+EE K +
Sbjct: 71  WEKAEKKAEKAKAKAEKKKAKKEEPKPR 98



 Score = 27.9 bits (63), Expect = 1.4
 Identities = 11/32 (34%), Positives = 20/32 (62%)

Query: 114 KNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
           K E K  EK +K+ E+ K K EK++ +++  +
Sbjct: 65  KKELKAWEKAEKKAEKAKAKAEKKKAKKEEPK 96



 Score = 27.1 bits (61), Expect = 2.7
 Identities = 9/32 (28%), Positives = 20/32 (62%)

Query: 119 EEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           ++E K  E+ EKK ++ K + E+K+ ++   +
Sbjct: 65  KKELKAWEKAEKKAEKAKAKAEKKKAKKEEPK 96



 Score = 26.3 bits (59), Expect = 5.7
 Identities = 13/32 (40%), Positives = 22/32 (68%)

Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKE 137
           K++ +  +K E+K E+ K K E++K KKEE +
Sbjct: 65  KKELKAWEKAEKKAEKAKAKAEKKKAKKEEPK 96


>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
           binding fragment.  Topoisomerase I promotes the
           relaxation of DNA superhelical tension by introducing a
           transient single-stranded break in duplex DNA and are
           vital for the processes of replication, transcription,
           and recombination. This family may be more than one
           structural domain.
          Length = 215

 Score = 32.6 bits (75), Expect = 0.041
 Identities = 13/27 (48%), Positives = 19/27 (70%)

Query: 116 EEKEEEKKKKEEEEKKKKEEKEREEEK 142
            EKE++K   +EE+K  KEEK++ EE 
Sbjct: 97  AEKEKKKAMSKEEKKAIKEEKDKLEEP 123



 Score = 27.5 bits (62), Expect = 2.6
 Identities = 14/25 (56%), Positives = 15/25 (60%), Gaps = 2/25 (8%)

Query: 112 KKKNEEKEEEKKKKEEEEKKKKEEK 136
           KKK   KEE  KK  +EEK K EE 
Sbjct: 101 KKKAMSKEE--KKAIKEEKDKLEEP 123



 Score = 26.0 bits (58), Expect = 7.0
 Identities = 10/28 (35%), Positives = 19/28 (67%)

Query: 117 EKEEEKKKKEEEEKKKKEEKEREEEKRE 144
           E E+EKKK   +E+KK  ++E+++ +  
Sbjct: 96  EAEKEKKKAMSKEEKKAIKEEKDKLEEP 123



 Score = 26.0 bits (58), Expect = 8.1
 Identities = 10/28 (35%), Positives = 18/28 (64%), Gaps = 2/28 (7%)

Query: 106 KRQGEKKK--KNEEKEEEKKKKEEEEKK 131
           + + EKKK    EEK+  K++K++ E+ 
Sbjct: 96  EAEKEKKKAMSKEEKKAIKEEKDKLEEP 123


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 32.9 bits (75), Expect = 0.050
 Identities = 15/48 (31%), Positives = 33/48 (68%), Gaps = 2/48 (4%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKE--EEEKKKKEEKEREEEKREEENVEE 150
           ++ + ++KKK +++ EE ++K+  E+E+ K+ EKER   + +++  EE
Sbjct: 75  KRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEE 122



 Score = 29.4 bits (66), Expect = 0.71
 Identities = 11/56 (19%), Positives = 30/56 (53%)

Query: 91  EFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
           + ++E  ++     ++R  + +K+    +E+KK+ EE  K+   ++++ EE   + 
Sbjct: 86  QQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKA 141



 Score = 28.6 bits (64), Expect = 1.2
 Identities = 10/44 (22%), Positives = 26/44 (59%)

Query: 107 RQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           +Q +  ++   K+ EK++   +E+KK+ E+  ++   +++  EE
Sbjct: 93  QQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEE 136



 Score = 27.5 bits (61), Expect = 3.2
 Identities = 11/36 (30%), Positives = 27/36 (75%)

Query: 115 NEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           N +++++K  K  EE++KK+E+++ EE ++++  E+
Sbjct: 65  NRQQQQQKSAKRAEEQRKKKEQQQAEELQQKQAAEQ 100



 Score = 27.1 bits (60), Expect = 4.5
 Identities = 10/41 (24%), Positives = 18/41 (43%)

Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
            ++   + K + + E  KK   E KKK E +   +   E +
Sbjct: 160 AKKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAK 200



 Score = 26.3 bits (58), Expect = 8.5
 Identities = 15/53 (28%), Positives = 22/53 (41%)

Query: 94  SETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
           +E KK+       +   + KK  E E +KK   E +KK   E +    K   E
Sbjct: 181 AEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAE 233


>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein.  The YqfQ-like protein family
           includes the B. subtilis YqfQ protein, also known as
           VrrA, which is functionally uncharacterized. This family
           of proteins is found in bacteria. Proteins in this
           family are typically between 146 and 237 amino acids in
           length. There are two conserved sequence motifs: QYGP
           and PKLY.
          Length = 155

 Score = 32.0 bits (73), Expect = 0.053
 Identities = 11/37 (29%), Positives = 22/37 (59%)

Query: 108 QGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKRE 144
           + E   + + + +EKKK+E  + K ++EK + E K+ 
Sbjct: 110 EQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKP 146



 Score = 30.5 bits (69), Expect = 0.23
 Identities = 12/48 (25%), Positives = 24/48 (50%), Gaps = 7/48 (14%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKERE-------EEKREEENVEE 150
           E+ ++    E E++   E + + KE+K+RE       +EK + E  + 
Sbjct: 99  EETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKP 146



 Score = 29.3 bits (66), Expect = 0.49
 Identities = 9/40 (22%), Positives = 23/40 (57%)

Query: 111 KKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
               +EE+E E++  +E E++   E + E +++++  V +
Sbjct: 92  SSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPK 131



 Score = 27.8 bits (62), Expect = 1.5
 Identities = 9/38 (23%), Positives = 20/38 (52%)

Query: 108 QGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
             ++     + E ++KKK E  K K E+++ + E ++ 
Sbjct: 109 TEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKP 146



 Score = 25.5 bits (56), Expect = 9.0
 Identities = 14/39 (35%), Positives = 20/39 (51%), Gaps = 1/39 (2%)

Query: 94  SETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKK 132
            ETK E     +KR+  K K  +EK + + KK +  K K
Sbjct: 115 PETKTESKEK-KKREVPKPKTEKEKPKTEPKKPKPSKPK 152


>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
           fragment found in eukaryotic DNA topoisomerase (topo) IB
           proteins similar to the monomeric yeast and human topo I
           and heterodimeric topo I from Leishmania donvanni. Topo
           I enzymes are divided into:  topo type IA (bacterial)
           and type IB (eukaryotic). Topo I relaxes superhelical
           tension in duplex DNA by creating a single-strand nick,
           the broken strand can then rotate around the unbroken
           strand to remove DNA supercoils and, the nick is
           religated, liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit re-ligation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  In addition to differences in
           structure and some biochemical properties,
           Trypanosomatid parasite topo I differ from human topo I
           in their sensitivity to CPTs and other classical topo I
           inhibitors. Trypanosomatid topos I play putative roles
           in organizing the kinetoplast DNA network unique to
           these parasites.  This family may represent more than
           one structural domain.
          Length = 215

 Score = 32.2 bits (74), Expect = 0.057
 Identities = 15/27 (55%), Positives = 20/27 (74%)

Query: 116 EEKEEEKKKKEEEEKKKKEEKEREEEK 142
           EEKE++K   +EE+K  KEEKE+ EE 
Sbjct: 96  EEKEKKKAMSKEEKKAIKEEKEKLEEP 122



 Score = 28.0 bits (63), Expect = 1.4
 Identities = 11/28 (39%), Positives = 18/28 (64%), Gaps = 2/28 (7%)

Query: 106 KRQGEKKKKN--EEKEEEKKKKEEEEKK 131
           + + EKKK    EEK+  K++KE+ E+ 
Sbjct: 95  EEEKEKKKAMSKEEKKAIKEEKEKLEEP 122



 Score = 26.8 bits (60), Expect = 4.4
 Identities = 13/28 (46%), Positives = 21/28 (75%), Gaps = 3/28 (10%)

Query: 117 EKEEEKKK---KEEEEKKKKEEKEREEE 141
           E+E+EKKK   KEE++  K+E+++ EE 
Sbjct: 95  EEEKEKKKAMSKEEKKAIKEEKEKLEEP 122


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
           Rpc31.  RNA polymerase III contains seventeen subunits
           in yeasts and in human cells. Twelve of these are akin
           to RNA polymerase I or II and the other five are RNA pol
           III-specific, and form the functionally distinct groups
           (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
           Rpc34 and Rpc82 form a cluster of enzyme-specific
           subunits that contribute to transcription initiation in
           S.cerevisiae and H.sapiens. There is evidence that these
           subunits are anchored at or near the N-terminal Zn-fold
           of Rpc1, itself prolonged by a highly conserved but RNA
           polymerase III-specific domain.
          Length = 221

 Score = 32.4 bits (74), Expect = 0.062
 Identities = 16/54 (29%), Positives = 28/54 (51%)

Query: 97  KKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           K +  +G    + E   +     E+K K+ E E   +E+++ EEE+ EEE  +E
Sbjct: 134 KFKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDE 187



 Score = 30.5 bits (69), Expect = 0.29
 Identities = 14/41 (34%), Positives = 26/41 (63%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           +  +K    E++ K+ E E+  +++EK+ EEE+ EEE  E+
Sbjct: 148 DIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDED 188



 Score = 27.4 bits (61), Expect = 3.1
 Identities = 12/59 (20%), Positives = 28/59 (47%)

Query: 92  FRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
             ++   +     + ++       EE++ ++K    E+K K+ E E  +E+ E++  EE
Sbjct: 121 GINKKAGKKLALSKFKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEE 179



 Score = 26.3 bits (58), Expect = 6.9
 Identities = 12/41 (29%), Positives = 30/41 (73%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           EKK K  E E+  ++ E++E++++EE+E +E+  ++++ ++
Sbjct: 157 EKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDDD 197


>gnl|CDD|221756 pfam12757, DUF3812, Protein of unknown function (DUF3812).  This is
           a family of fungal proteins whose function is not known.
          Length = 126

 Score = 31.5 bits (72), Expect = 0.063
 Identities = 12/32 (37%), Positives = 21/32 (65%)

Query: 113 KKNEEKEEEKKKKEEEEKKKKEEKEREEEKRE 144
            +    EE+K  +EE +++ +E KERE EK++
Sbjct: 95  AQRARDEEKKLDEEEAKRQHEEAKEREREKKK 126



 Score = 27.6 bits (62), Expect = 1.4
 Identities = 11/35 (31%), Positives = 24/35 (68%)

Query: 116 EEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           +E+ E ++ ++EE+K  +EE +R+ E+ +E   E+
Sbjct: 90  DERAEAQRARDEEKKLDEEEAKRQHEEAKEREREK 124



 Score = 25.7 bits (57), Expect = 7.3
 Identities = 10/32 (31%), Positives = 20/32 (62%)

Query: 115 NEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
           +E  E ++ + EE++  ++E K + EE +E E
Sbjct: 90  DERAEAQRARDEEKKLDEEEAKRQHEEAKERE 121



 Score = 25.7 bits (57), Expect = 8.0
 Identities = 11/30 (36%), Positives = 19/30 (63%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKE 134
           R R  EKK   EE + + ++ +E E++KK+
Sbjct: 97  RARDEEKKLDEEEAKRQHEEAKEREREKKK 126


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 32.5 bits (74), Expect = 0.068
 Identities = 15/45 (33%), Positives = 25/45 (55%)

Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           K+Q E++ K +   E KKK  E +KK + E + + E + +   EE
Sbjct: 149 KKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEE 193



 Score = 32.1 bits (73), Expect = 0.090
 Identities = 16/44 (36%), Positives = 24/44 (54%), Gaps = 4/44 (9%)

Query: 106 KRQGEKKKKNEEKE----EEKKKKEEEEKKKKEEKEREEEKREE 145
            +Q E+  K  E++    EE K K+  E K K E E E++ +EE
Sbjct: 104 AKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEE 147



 Score = 31.7 bits (72), Expect = 0.11
 Identities = 7/45 (15%), Positives = 25/45 (55%)

Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           ++Q E+ +K    E+ ++K+ E+    ++  ++ E+  ++   ++
Sbjct: 74  EQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQ 118



 Score = 30.6 bits (69), Expect = 0.24
 Identities = 16/44 (36%), Positives = 26/44 (59%), Gaps = 2/44 (4%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEE--KEREEEKREEE 146
           +++Q E+ K  +  E + K + E EKK KEE  K+ EEE + + 
Sbjct: 117 KQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKA 160



 Score = 29.8 bits (67), Expect = 0.49
 Identities = 17/58 (29%), Positives = 28/58 (48%), Gaps = 5/58 (8%)

Query: 95  ETKKEGNLGFRKRQGEKKKKNEE--KEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           E  ++  L  R    +  K+ E+  K+ E+K+K+ EE K K+  E    K + E   E
Sbjct: 87  EQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEA---KAKAEAEAE 141



 Score = 29.8 bits (67), Expect = 0.53
 Identities = 8/42 (19%), Positives = 26/42 (61%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
           ++RQ + +++ EE E+++  ++  +K+ ++    E+  ++ E
Sbjct: 67  QERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAE 108



 Score = 29.0 bits (65), Expect = 1.0
 Identities = 12/40 (30%), Positives = 22/40 (55%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKRE 144
            K +   + K + + E +KK +EE KK+ EE+ + +   E
Sbjct: 124 AKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAE 163



 Score = 28.6 bits (64), Expect = 1.2
 Identities = 15/46 (32%), Positives = 22/46 (47%), Gaps = 5/46 (10%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEE-----EEKKKKEEKEREEEKREE 145
           + +Q  + K   E E EKK KEE     EE+ K +     ++K  E
Sbjct: 125 KAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAE 170



 Score = 27.5 bits (61), Expect = 2.9
 Identities = 15/46 (32%), Positives = 24/46 (52%), Gaps = 3/46 (6%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           +K + E KK+    EEE K K   E KKK  + +++ + E +   E
Sbjct: 142 KKAKEEAKKQ---AEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAE 184



 Score = 27.1 bits (60), Expect = 3.5
 Identities = 13/47 (27%), Positives = 28/47 (59%), Gaps = 5/47 (10%)

Query: 105 RKRQGEKKKKNEE---KEEEKKKKEEEEKKKKEE--KEREEEKREEE 146
           +  + EK++  E+   KE E++   E+  K+ E+  K+ EE++++ E
Sbjct: 76  QAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAE 122



 Score = 27.1 bits (60), Expect = 4.5
 Identities = 15/56 (26%), Positives = 28/56 (50%)

Query: 95  ETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           E   +     +K+  E K K   + + K + E E+K K+E K++ EE+ + +   E
Sbjct: 108 EQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAE 163



 Score = 26.7 bits (59), Expect = 5.8
 Identities = 17/50 (34%), Positives = 20/50 (40%), Gaps = 9/50 (18%)

Query: 106 KRQGEKKKKNEEK----EEEKKKKEEEEKKKKEEK-----EREEEKREEE 146
           K   E KKK  E     E E K K E + K K E+     E  + K   E
Sbjct: 159 KAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKAKAEAAKAKAAAE 208



 Score = 26.3 bits (58), Expect = 6.3
 Identities = 10/41 (24%), Positives = 27/41 (65%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
           ++++   KK+ E +++ +++ EE EK++  E+ R++E  + 
Sbjct: 57  QQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQR 97


>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase; Provisional.
          Length = 330

 Score = 32.1 bits (74), Expect = 0.087
 Identities = 10/30 (33%), Positives = 18/30 (60%)

Query: 109 GEKKKKNEEKEEEKKKKEEEEKKKKEEKER 138
            +K+ K   K ++KK+K+E +  K + K R
Sbjct: 62  DKKELKAWHKAQKKKEKQEAKAAKAKSKPR 91



 Score = 27.9 bits (63), Expect = 1.8
 Identities = 5/30 (16%), Positives = 18/30 (60%)

Query: 119 EEEKKKKEEEEKKKKEEKEREEEKREEENV 148
           ++E K   + +KKK++++ +  + + +  +
Sbjct: 63  KKELKAWHKAQKKKEKQEAKAAKAKSKPRL 92



 Score = 27.9 bits (63), Expect = 2.3
 Identities = 7/26 (26%), Positives = 14/26 (53%)

Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKK 131
            +   K +K +EK+E K  K + + +
Sbjct: 66  LKAWHKAQKKKEKQEAKAAKAKSKPR 91



 Score = 27.1 bits (61), Expect = 3.8
 Identities = 8/27 (29%), Positives = 16/27 (59%)

Query: 114 KNEEKEEEKKKKEEEEKKKKEEKEREE 140
           K E K   K +K++E+++ K  K + +
Sbjct: 63  KKELKAWHKAQKKKEKQEAKAAKAKSK 89



 Score = 26.3 bits (59), Expect = 6.0
 Identities = 7/36 (19%), Positives = 19/36 (52%)

Query: 111 KKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
           K++      ++K+ K   + +KK+EK+  +  + + 
Sbjct: 53  KEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAKS 88


>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
           includes archaeal L12p, the protein that is functionally
           equivalent to L7/L12 in bacteria and the P1 and P2
           proteins in eukaryotes. L12p is homologous to P1 and P2
           but is not homologous to bacterial L7/L12. It is located
           in the L12 stalk, with proteins L10, L11, and 23S rRNA.
           L12p is the only protein in the ribosome to occur as
           multimers, always appearing as sets of dimers. Recent
           data indicate that most archaeal species contain six
           copies of L12p (three homodimers), while eukaryotes have
           four copies (two heterodimers), and bacteria may have
           four or six copies (two or three homodimers), depending
           on the species. The organization of proteins within the
           stalk has been characterized primarily in bacteria,
           where L7/L12 forms either two or three homodimers and
           each homodimer binds to the extended C-terminal helix of
           L10. L7/L12 is attached to the ribosome through L10 and
           is the only ribosomal protein that does not directly
           interact with rRNA. Archaeal L12p is believed to
           function in a similar fashion. However, hybrid ribosomes
           containing the large subunit from E. coli with an
           archaeal stalk are able to bind archaeal and eukaryotic
           elongation factors but not bacterial elongation factors.
           In several mesophilic and thermophilic archaeal species,
           the binding of 23S rRNA to protein L11 and to the
           L10/L12p pentameric complex was found to be
           temperature-dependent and cooperative.
          Length = 106

 Score = 30.9 bits (70), Expect = 0.089
 Identities = 16/26 (61%), Positives = 20/26 (76%)

Query: 116 EEKEEEKKKKEEEEKKKKEEKEREEE 141
               EEK +++EEEKKK+EEKE EEE
Sbjct: 71  AAAAEEKAEEKEEEKKKEEEKEEEEE 96



 Score = 28.2 bits (63), Expect = 0.70
 Identities = 13/26 (50%), Positives = 21/26 (80%)

Query: 116 EEKEEEKKKKEEEEKKKKEEKEREEE 141
              EE+ ++KEEE+KK++E++E EEE
Sbjct: 72  AAAEEKAEEKEEEKKKEEEKEEEEEE 97



 Score = 27.1 bits (60), Expect = 1.9
 Identities = 14/25 (56%), Positives = 22/25 (88%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKE 134
             ++K EEKEEEKKK+EE+E++++E
Sbjct: 73  AAEEKAEEKEEEKKKEEEKEEEEEE 97



 Score = 27.1 bits (60), Expect = 2.3
 Identities = 10/24 (41%), Positives = 19/24 (79%)

Query: 123 KKKEEEEKKKKEEKEREEEKREEE 146
             +E+ E+K++E+K+ EE++ EEE
Sbjct: 73  AAEEKAEEKEEEKKKEEEKEEEEE 96



 Score = 26.3 bits (58), Expect = 3.8
 Identities = 15/31 (48%), Positives = 22/31 (70%), Gaps = 3/31 (9%)

Query: 119 EEEKKKKEEEEKKKKEEKEREEEKREEENVE 149
             E+K +E+EE+KKKEE   E+E+ EEE + 
Sbjct: 73  AAEEKAEEKEEEKKKEE---EKEEEEEEALA 100



 Score = 25.2 bits (55), Expect = 9.6
 Identities = 10/23 (43%), Positives = 20/23 (86%)

Query: 106 KRQGEKKKKNEEKEEEKKKKEEE 128
           + + E+K++ ++KEEEK+++EEE
Sbjct: 75  EEKAEEKEEEKKKEEEKEEEEEE 97


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 32.0 bits (73), Expect = 0.099
 Identities = 17/37 (45%), Positives = 30/37 (81%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
           EK+  +EE+EEEK++K+EEE+K  +++E  +E+ E+E
Sbjct: 32  EKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKE 68



 Score = 30.5 bits (69), Expect = 0.30
 Identities = 16/41 (39%), Positives = 34/41 (82%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           E++++ EEK+EE++K  ++E++  EE+E+EE+K++ + V+E
Sbjct: 39  EEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKE 79



 Score = 30.1 bits (68), Expect = 0.37
 Identities = 14/41 (34%), Positives = 28/41 (68%)

Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
           +++ E++K  +++EE  +++E+EEKKKK +K +E     E 
Sbjct: 46  EKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWEL 86



 Score = 30.1 bits (68), Expect = 0.43
 Identities = 17/37 (45%), Positives = 27/37 (72%)

Query: 114 KNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           K  EKE   +++EEE+++KKEE+E+  +K EE + EE
Sbjct: 29  KEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEE 65



 Score = 29.7 bits (67), Expect = 0.48
 Identities = 17/34 (50%), Positives = 25/34 (73%)

Query: 117 EKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           EKE EK+  +EEE+++KEEK+ EEEK  ++  E 
Sbjct: 28  EKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEV 61



 Score = 29.7 bits (67), Expect = 0.54
 Identities = 14/41 (34%), Positives = 30/41 (73%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           +++KK EE++   K++E +E+++KEEK+++ +K +E   E 
Sbjct: 44  KEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEW 84



 Score = 29.4 bits (66), Expect = 0.67
 Identities = 13/41 (31%), Positives = 31/41 (75%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           EK++K EE+E+   K+EE ++++++E+++++ K+ +E   E
Sbjct: 43  EKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTE 83



 Score = 28.2 bits (63), Expect = 1.8
 Identities = 14/37 (37%), Positives = 29/37 (78%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
            +K+  +E+EEE+K++++EE++K  +KE E ++ EE+
Sbjct: 31  VEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEK 67



 Score = 27.4 bits (61), Expect = 2.9
 Identities = 15/40 (37%), Positives = 28/40 (70%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVE 149
           E +K+  ++EEE++K+E++E+++K   + EE   EEE  E
Sbjct: 30  EVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEE 69



 Score = 27.0 bits (60), Expect = 4.2
 Identities = 14/41 (34%), Positives = 30/41 (73%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           E+KK+ EEK  +K+++ +EE++K+E+K++ ++ +E     E
Sbjct: 45  EEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWE 85



 Score = 27.0 bits (60), Expect = 4.9
 Identities = 14/41 (34%), Positives = 31/41 (75%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           EK+ + E  +EE+++++EE+K+++E+   +EE+ +EE  +E
Sbjct: 28  EKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKE 68


>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
          Length = 259

 Score = 31.6 bits (72), Expect = 0.11
 Identities = 16/64 (25%), Positives = 35/64 (54%), Gaps = 10/64 (15%)

Query: 87  NGLLEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
            G+ +F S+ K E          ++K+ +    E ++ K++EE + K++KE EE K  ++
Sbjct: 63  TGVKDFLSDQKPE----------DEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQK 112

Query: 147 NVEE 150
            +++
Sbjct: 113 KIDQ 116



 Score = 29.7 bits (67), Expect = 0.55
 Identities = 10/51 (19%), Positives = 24/51 (47%), Gaps = 1/51 (1%)

Query: 92  FRSE-TKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEE 141
           F SE     G   F   Q  + +K       + ++ +++++ + +K++E E
Sbjct: 55  FNSEFAGGTGVKDFLSDQKPEDEKELSASSLEAEQAKKKEEAEAKKKKEME 105


>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
          Length = 746

 Score = 31.7 bits (73), Expect = 0.12
 Identities = 16/50 (32%), Positives = 26/50 (52%), Gaps = 1/50 (2%)

Query: 102 LGFRKRQGEKKKKNEEK-EEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           L   K  G + K +    EEE+ +KEE +++ +EE + E E+      EE
Sbjct: 21  LEKLKELGIEVKSHSSTVEEEEARKEEAKREAEEEAKAEAEEAAAAEAEE 70



 Score = 27.9 bits (63), Expect = 2.6
 Identities = 16/57 (28%), Positives = 26/57 (45%)

Query: 94  SETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
            E  KE  +  ++   + K+   E +      EEEE +K+E K   EE+ + E  E 
Sbjct: 7   YELAKELGVSSKELLEKLKELGIEVKSHSSTVEEEEARKEEAKREAEEEAKAEAEEA 63



 Score = 26.4 bits (59), Expect = 8.9
 Identities = 12/52 (23%), Positives = 24/52 (46%)

Query: 93  RSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKRE 144
           +    K+     +  +G K+ K  +    ++ +    KKKK++K  E+  RE
Sbjct: 124 KGPKPKKKKPKRKAARGGKRGKGGKGRRRRRGRRRRRKKKKKQKPTEKIPRE 175


>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex.  This
           entry is characterized by proteins with alternating
           conserved and low-complexity regions. Bud13 together
           with Snu17p and a newly identified factor,
           Pml1p/Ylr016c, form a novel trimeric complex. called The
           RES complex, pre-mRNA retention and splicing complex.
           Subunits of this complex are not essential for viability
           of yeasts but they are required for efficient splicing
           in vitro and in vivo. Furthermore, inactivation of this
           complex causes pre-mRNA leakage from the nucleus. Bud13
           contains a unique, phylogenetically conserved C-terminal
           region of unknown function.
          Length = 141

 Score = 30.7 bits (70), Expect = 0.13
 Identities = 19/55 (34%), Positives = 33/55 (60%), Gaps = 8/55 (14%)

Query: 104 FRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKE--------REEEKREEENVEE 150
           +R + G      E++EE++++KEE+E+K+++EKE         E EKR EE  + 
Sbjct: 4   YRDKSGRIIDIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKA 58



 Score = 25.7 bits (57), Expect = 7.3
 Identities = 15/58 (25%), Positives = 37/58 (63%), Gaps = 7/58 (12%)

Query: 92  FRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKK-------EEEEKKKKEEKEREEEK 142
           +R ++ +  ++  ++ + E++K+ +E++EEK+K+       +EE +K+ EE E+ + K
Sbjct: 4   YRDKSGRIIDIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNK 61



 Score = 25.7 bits (57), Expect = 9.0
 Identities = 13/43 (30%), Positives = 28/43 (65%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
           R+ +  +K++ E KEE++K+  +   +K+E ++R EE  + +N
Sbjct: 18  REEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKN 60


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 31.6 bits (71), Expect = 0.13
 Identities = 17/56 (30%), Positives = 32/56 (57%), Gaps = 2/56 (3%)

Query: 97   KKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEE--KEREEEKREEENVEE 150
            K E     +  +  KK + ++K+ E+ KK EE++KK  E  K+  EE ++ E +++
Sbjct: 1654 KAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKK 1709



 Score = 30.5 bits (68), Expect = 0.33
 Identities = 21/48 (43%), Positives = 30/48 (62%), Gaps = 2/48 (4%)

Query: 105  RKRQGEKKKKNEE--KEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
            +K+  E KKK EE  K +E KKK EE KK  E K++ EE ++ E  ++
Sbjct: 1417 KKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKK 1464



 Score = 30.5 bits (68), Expect = 0.40
 Identities = 15/54 (27%), Positives = 28/54 (51%)

Query: 93   RSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
            ++E  K+     +K+  E KK  E  + E +   +E +  +E+ E  E+K+EE 
Sbjct: 1323 KAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEA 1376



 Score = 29.7 bits (66), Expect = 0.64
 Identities = 17/54 (31%), Positives = 33/54 (61%)

Query: 93   RSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
            ++E  K+ +   +K +  KKK +E K+  + KK+ +E KK EE ++ +E ++ E
Sbjct: 1478 KAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAE 1531



 Score = 29.7 bits (66), Expect = 0.70
 Identities = 18/44 (40%), Positives = 28/44 (63%), Gaps = 3/44 (6%)

Query: 106  KRQGEKKKKNEE---KEEEKKKKEEEEKKKKEEKEREEEKREEE 146
            K++ E+ KK +E   K EE KKK +E KK  E K++ +E ++ E
Sbjct: 1476 KKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAE 1519



 Score = 29.3 bits (65), Expect = 0.89
 Identities = 16/40 (40%), Positives = 27/40 (67%)

Query: 106  KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
            +   EK +  E+K+EE KKK +  KKK EEK++ +E +++
Sbjct: 1360 EAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKK 1399



 Score = 29.0 bits (64), Expect = 1.0
 Identities = 19/56 (33%), Positives = 31/56 (55%)

Query: 95   ETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
            E KK      +  +  KK+  E K+ E+ KK+E E+KKK E+ ++ E+  +   EE
Sbjct: 1679 EAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEE 1734



 Score = 29.0 bits (64), Expect = 1.0
 Identities = 23/59 (38%), Positives = 33/59 (55%), Gaps = 3/59 (5%)

Query: 95   ETKKEGNLGFRKRQGEKKKKNE---EKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
            E KK+     +K   EKKK  E    +EE K K  EE KK +E+K++ EE ++ E  E+
Sbjct: 1630 EEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEK 1688



 Score = 29.0 bits (64), Expect = 1.1
 Identities = 19/56 (33%), Positives = 32/56 (57%)

Query: 95   ETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
            E KK+     +  + +KK +  +K EE KKK EE KK  E K++ EE ++ +  ++
Sbjct: 1435 EAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKK 1490



 Score = 29.0 bits (64), Expect = 1.2
 Identities = 16/53 (30%), Positives = 33/53 (62%)

Query: 93   RSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
            ++E  K+ +   +K +  KK +  +K+ E+ KK +E KKK EE ++ +E +++
Sbjct: 1439 KAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKK 1491



 Score = 28.6 bits (63), Expect = 1.4
 Identities = 15/56 (26%), Positives = 32/56 (57%)

Query: 95   ETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
            E KK+     +  + +KK +  +K+ ++ KK  E KKK +E ++ EE ++ +  ++
Sbjct: 1474 EAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKK 1529



 Score = 28.6 bits (63), Expect = 1.5
 Identities = 14/46 (30%), Positives = 25/46 (54%)

Query: 105  RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
                 E +   E+ E  +KKKEE +KK    K++ EEK++ +  ++
Sbjct: 1353 EAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKK 1398



 Score = 28.6 bits (63), Expect = 1.5
 Identities = 19/60 (31%), Positives = 35/60 (58%), Gaps = 4/60 (6%)

Query: 95   ETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEE----EEKKKKEEKEREEEKREEENVEE 150
            E KK+ +   +  + +KK    +K EE KK +E    EE KK +E ++ EEK++ + +++
Sbjct: 1494 EAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKK 1553



 Score = 28.6 bits (63), Expect = 1.6
 Identities = 16/41 (39%), Positives = 28/41 (68%)

Query: 106  KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
             ++ ++ KK EEK++  + K+ EE KK EEK++ EE ++ E
Sbjct: 1533 AKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAE 1573



 Score = 28.6 bits (63), Expect = 1.6
 Identities = 18/40 (45%), Positives = 29/40 (72%)

Query: 106  KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
            K+  EKKK +E K+ E+KKK +E KKK EE ++ +E +++
Sbjct: 1284 KKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKK 1323



 Score = 28.2 bits (62), Expect = 1.9
 Identities = 18/56 (32%), Positives = 33/56 (58%)

Query: 95   ETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
            E KK+     +  + +KK +  +K +E KKK EE KK +E K++ EE ++ +  ++
Sbjct: 1422 EAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKK 1477



 Score = 28.2 bits (62), Expect = 2.0
 Identities = 17/51 (33%), Positives = 28/51 (54%)

Query: 95   ETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
            E KK      +  + +KK    +K    KKK +E KKK EEK++ +E +++
Sbjct: 1389 EKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKK 1439



 Score = 28.2 bits (62), Expect = 2.2
 Identities = 18/41 (43%), Positives = 28/41 (68%)

Query: 106  KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
            K+  E KK +E K+ E+ KK +E KK +E+K+ +E K+ EE
Sbjct: 1516 KKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEE 1556



 Score = 28.2 bits (62), Expect = 2.3
 Identities = 20/54 (37%), Positives = 34/54 (62%)

Query: 97   KKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
            K E     +  + +K+ + ++K+ E+ KK+EEEKKK    ++EEEK+ EE  +E
Sbjct: 1723 KAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKE 1776



 Score = 28.2 bits (62), Expect = 2.4
 Identities = 18/49 (36%), Positives = 29/49 (59%), Gaps = 3/49 (6%)

Query: 105  RKRQGEKKKKNEE---KEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
            +K+  E KKK EE   K +E KK    +KK  E K++ EEK++ +  ++
Sbjct: 1390 KKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKK 1438



 Score = 27.8 bits (61), Expect = 2.7
 Identities = 22/61 (36%), Positives = 37/61 (60%), Gaps = 3/61 (4%)

Query: 93   RSETKKEGNLGFRKRQGEKKKKNEE---KEEEKKKKEEEEKKKKEEKEREEEKREEENVE 149
            ++E +K+     +K++ E+KKK EE    EEE K K  EE KK EE +++ E+ ++   +
Sbjct: 1627 KAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEED 1686

Query: 150  E 150
            E
Sbjct: 1687 E 1687



 Score = 27.8 bits (61), Expect = 2.7
 Identities = 21/63 (33%), Positives = 37/63 (58%), Gaps = 10/63 (15%)

Query: 93   RSETKKEGNLGFRKRQGEKKKKNEE----------KEEEKKKKEEEEKKKKEEKEREEEK 142
            +   + +     +K++ E+KKK EE          K EE KK+ EE+KKK EE +++EE+
Sbjct: 1696 KEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEE 1755

Query: 143  REE 145
            +++
Sbjct: 1756 KKK 1758



 Score = 27.8 bits (61), Expect = 2.8
 Identities = 17/48 (35%), Positives = 29/48 (60%), Gaps = 2/48 (4%)

Query: 105  RKRQGEKKKKNEEKE--EEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
            +K+  E KK    K+  +E KKK EE+KK  E K++ EE ++ +  ++
Sbjct: 1404 KKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKK 1451



 Score = 27.8 bits (61), Expect = 2.8
 Identities = 16/55 (29%), Positives = 25/55 (45%)

Query: 95   ETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVE 149
            E KK      +  + +KK    +K+ E+ KK  E  K + E   +E +  EE  E
Sbjct: 1313 EAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAE 1367



 Score = 27.8 bits (61), Expect = 2.9
 Identities = 18/44 (40%), Positives = 26/44 (59%), Gaps = 3/44 (6%)

Query: 106  KRQGEKKKKNEE---KEEEKKKKEEEEKKKKEEKEREEEKREEE 146
            K++ E+ KK +E   K EE KKK +  KKK EE ++  E  + E
Sbjct: 1308 KKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAE 1351



 Score = 27.8 bits (61), Expect = 2.9
 Identities = 19/56 (33%), Positives = 32/56 (57%)

Query: 95   ETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
            E KK      +  Q +KK+  E+K+ E+ KK EEE K K  +E ++ + +++  EE
Sbjct: 1624 ELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEE 1679



 Score = 27.8 bits (61), Expect = 3.1
 Identities = 21/56 (37%), Positives = 34/56 (60%)

Query: 95   ETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
            E KK      +  + +K ++ ++K E+ KKKE EEKKK EE ++ EE+ + +  EE
Sbjct: 1611 EAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEE 1666



 Score = 27.8 bits (61), Expect = 3.1
 Identities = 25/76 (32%), Positives = 44/76 (57%)

Query: 75   VKKRLTRECTVSNGLLEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKE 134
            +KK+   E   +  L +   E K +     ++ + +KKK  E K++E++KK+    KK+E
Sbjct: 1707 LKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEE 1766

Query: 135  EKEREEEKREEENVEE 150
            EK+ EE ++E+E V E
Sbjct: 1767 EKKAEEIRKEKEAVIE 1782



 Score = 27.4 bits (60), Expect = 3.6
 Identities = 20/57 (35%), Positives = 33/57 (57%), Gaps = 2/57 (3%)

Query: 91   EFRSETKKEGNLGFRKRQGEKKKKNE--EKEEEKKKKEEEEKKKKEEKEREEEKREE 145
            E + E  K+     +K+  EKKK +E  +K EE KKK +E KK    K++ +E +++
Sbjct: 1370 EKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKK 1426



 Score = 27.4 bits (60), Expect = 4.2
 Identities = 17/51 (33%), Positives = 30/51 (58%)

Query: 95   ETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
            E KK+     +  + +KK +  +K +E KKK EE KKK +E ++  E +++
Sbjct: 1461 EAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKK 1511



 Score = 27.0 bits (59), Expect = 4.6
 Identities = 19/54 (35%), Positives = 33/54 (61%)

Query: 97   KKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
            KKE     +  + +KK+  E+K+ E+ KK EEE K K E+ ++E + +++  EE
Sbjct: 1695 KKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEE 1748



 Score = 27.0 bits (59), Expect = 5.0
 Identities = 18/56 (32%), Positives = 34/56 (60%)

Query: 90   LEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
            L+  +  KK+ +   +K + +KK    +K+ E+ KK +E KKK EE ++ EE +++
Sbjct: 1410 LKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKK 1465



 Score = 27.0 bits (59), Expect = 5.2
 Identities = 19/48 (39%), Positives = 31/48 (64%), Gaps = 3/48 (6%)

Query: 106  KRQGEKKKKNEE---KEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
            K++ E+ KK EE   K EE KK +E +KK +E K+ +E K++ E  ++
Sbjct: 1450 KKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKK 1497



 Score = 27.0 bits (59), Expect = 5.4
 Identities = 18/53 (33%), Positives = 33/53 (62%)

Query: 93   RSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
            ++E KK+ +   +K +  KK    +K+ E+ KK EE KKK EE ++ +E +++
Sbjct: 1426 KAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKK 1478



 Score = 27.0 bits (59), Expect = 5.5
 Identities = 14/45 (31%), Positives = 29/45 (64%)

Query: 106  KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
             ++ ++ KK EE ++  + K+ EEKKK +E ++ EE ++ E  ++
Sbjct: 1521 AKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKK 1565



 Score = 27.0 bits (59), Expect = 5.8
 Identities = 18/51 (35%), Positives = 30/51 (58%)

Query: 95   ETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
            E KK+     +  + +KK +  +K +E KKK EE KK  E K++ EE +++
Sbjct: 1448 EAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKK 1498



 Score = 26.6 bits (58), Expect = 6.7
 Identities = 19/46 (41%), Positives = 32/46 (69%), Gaps = 2/46 (4%)

Query: 107  RQGEKKKKNEE--KEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
            ++ E+KKK +E  K EEKKK +E +KK +E K+ +E K++ E  ++
Sbjct: 1284 KKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKK 1329



 Score = 26.3 bits (57), Expect = 8.0
 Identities = 19/42 (45%), Positives = 26/42 (61%)

Query: 105  RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
            RK +  KK +  +K EE KK  EE KK +EE+  EE ++ EE
Sbjct: 1218 RKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEE 1259



 Score = 26.3 bits (57), Expect = 8.3
 Identities = 13/39 (33%), Positives = 22/39 (56%)

Query: 110  EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENV 148
            E   K EE ++ +  K EE +K +E K++ E+ R+ E  
Sbjct: 1095 EAFGKAEEAKKTETGKAEEARKAEEAKKKAEDARKAEEA 1133



 Score = 26.3 bits (57), Expect = 8.3
 Identities = 17/48 (35%), Positives = 32/48 (66%), Gaps = 2/48 (4%)

Query: 105  RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEE--KEREEEKREEENVEE 150
            +    EKKK+  +K+ +  KK+ EEKKK +E  K+ EE+K++ + +++
Sbjct: 1365 KAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKK 1412



 Score = 26.3 bits (57), Expect = 8.3
 Identities = 18/57 (31%), Positives = 25/57 (43%), Gaps = 1/57 (1%)

Query: 95   ETKKEGNLGFRKRQGEKKKKNEEKE-EEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
            E KK+     +K    KKK  E K+  E  K E E    + E   E+ +  E+  EE
Sbjct: 1319 EAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEE 1375



 Score = 26.3 bits (57), Expect = 9.2
 Identities = 16/45 (35%), Positives = 26/45 (57%)

Query: 106  KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
            ++  E +K  + K+ E  KK EE KK  EE ++ EE+R  E + +
Sbjct: 1212 RKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRK 1256



 Score = 26.3 bits (57), Expect = 9.4
 Identities = 19/57 (33%), Positives = 33/57 (57%), Gaps = 3/57 (5%)

Query: 93   RSETKKEGNLGFRKRQGEKKKKNEE---KEEEKKKKEEEEKKKKEEKEREEEKREEE 146
             +E  +E      K++ E KKK +    K EEKKK +E +KK +E+K++ +E ++  
Sbjct: 1358 EAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAA 1414


>gnl|CDD|239570 cd03488, Topoisomer_IB_N_htopoI_like, Topoisomer_IB_N_htopoI_like :
           N-terminal DNA binding fragment found in eukaryotic DNA
           topoisomerase (topo) IB proteins similar to the
           monomeric yeast and human topo I.  Topo I enzymes are
           divided into:  topo type IA (bacterial) and type IB
           (eukaryotic). Topo I relaxes superhelical tension in
           duplex DNA by creating a single-strand nick, the broken
           strand can then rotate around the unbroken strand to
           remove DNA supercoils and, the nick is religated,
           liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit religation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  This family may represent more than
           one structural domain.
          Length = 215

 Score = 31.2 bits (71), Expect = 0.14
 Identities = 13/27 (48%), Positives = 19/27 (70%)

Query: 116 EEKEEEKKKKEEEEKKKKEEKEREEEK 142
            +KEE+K   +EE+K  K EKE+ EE+
Sbjct: 96  AQKEEKKAMSKEEKKAIKAEKEKLEEE 122


>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34.  This family represents
           herpes virus protein U79 and cytomegalovirus early
           phosphoprotein P34 (UL112).
          Length = 238

 Score = 31.4 bits (71), Expect = 0.14
 Identities = 13/39 (33%), Positives = 29/39 (74%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKR 143
           RK+  ++ +K   K++EK++ E+ +K K++ ++++EEKR
Sbjct: 152 RKKSDDEHRKRSGKQKEKRRVEDSQKHKEDRRKKQEEKR 190



 Score = 26.8 bits (59), Expect = 4.8
 Identities = 18/48 (37%), Positives = 30/48 (62%), Gaps = 3/48 (6%)

Query: 94  SETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEE 141
              KK  +   RKR G++K+K   + E+ +K +E+ +KK+EEK R +E
Sbjct: 150 LSRKKSDDEH-RKRSGKQKEK--RRVEDSQKHKEDRRKKQEEKRRNDE 194


>gnl|CDD|176990 CHL00049, ndhB, NADH dehydrogenase subunit 2.
          Length = 494

 Score = 31.5 bits (72), Expect = 0.14
 Identities = 15/33 (45%), Positives = 19/33 (57%), Gaps = 8/33 (24%)

Query: 24  GIPAGESNGRRGY--MLTYIIAYI----RDFAC 50
           GI  G+SN   GY  M+TY++ YI      FAC
Sbjct: 322 GIIVGDSND--GYASMITYMLFYIFMNLGTFAC 352


>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3.  This family of
           proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). SH3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of SH3, AAPs are retained in the ER.
          Length = 196

 Score = 31.1 bits (71), Expect = 0.14
 Identities = 10/34 (29%), Positives = 17/34 (50%)

Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKERE 139
           K   E   +  E     K++E  E +KKE K+++
Sbjct: 163 KLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196



 Score = 28.1 bits (63), Expect = 1.4
 Identities = 11/33 (33%), Positives = 16/33 (48%)

Query: 113 KKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
           K  +  EE   ++ E     KEE+  E EK+E 
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKES 192



 Score = 27.7 bits (62), Expect = 2.2
 Identities = 9/29 (31%), Positives = 16/29 (55%)

Query: 104 FRKRQGEKKKKNEEKEEEKKKKEEEEKKK 132
           F   + E     +E+E  + +K+E +KKK
Sbjct: 168 FAAEEAEAAAAAKEEESAEGEKKESKKKK 196


>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR).  This
           family consists of several bovine specific leukaemia
           virus receptors which are thought to function as
           transmembrane proteins, although their exact function is
           unknown.
          Length = 561

 Score = 31.2 bits (70), Expect = 0.17
 Identities = 13/25 (52%), Positives = 21/25 (84%)

Query: 119 EEEKKKKEEEEKKKKEEKEREEEKR 143
           EEE++ ++  EK K+E+K+RE+EKR
Sbjct: 85  EEERRHRQRLEKDKREKKKREKEKR 109



 Score = 28.9 bits (64), Expect = 0.98
 Identities = 14/32 (43%), Positives = 23/32 (71%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEE 141
           EKK K  +K+E+K+K++E +K KK+E E  + 
Sbjct: 196 EKKSKKPKKKEKKEKEKERDKDKKKEVEGFKS 227



 Score = 28.9 bits (64), Expect = 1.1
 Identities = 11/33 (33%), Positives = 26/33 (78%)

Query: 116 EEKEEEKKKKEEEEKKKKEEKEREEEKREEENV 148
           E+K ++ KKKE++EK+K+ +K++++E    +++
Sbjct: 196 EKKSKKPKKKEKKEKEKERDKDKKKEVEGFKSL 228



 Score = 28.5 bits (63), Expect = 1.4
 Identities = 14/43 (32%), Positives = 27/43 (62%)

Query: 107 RQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVE 149
           R  E  K  E+ +    +K+ ++ KKKE+KE+E+E+ +++  E
Sbjct: 179 RNAETSKSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKE 221



 Score = 27.7 bits (61), Expect = 2.5
 Identities = 10/30 (33%), Positives = 23/30 (76%)

Query: 108 QGEKKKKNEEKEEEKKKKEEEEKKKKEEKE 137
           +  KK K +EK+E++K++++++KK+ E  +
Sbjct: 197 KKSKKPKKKEKKEKEKERDKDKKKEVEGFK 226



 Score = 27.0 bits (59), Expect = 4.4
 Identities = 11/37 (29%), Positives = 26/37 (70%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
           EK      +++ KK K++E+K+K++E++++++K  E 
Sbjct: 188 EKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEG 224



 Score = 27.0 bits (59), Expect = 5.6
 Identities = 13/36 (36%), Positives = 23/36 (63%)

Query: 108 QGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKR 143
              +  + ++ E E+ KK  + KKKK+ KE+EE+K+
Sbjct: 259 PDSEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKK 294



 Score = 26.6 bits (58), Expect = 7.5
 Identities = 16/47 (34%), Positives = 28/47 (59%)

Query: 94  SETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREE 140
           +ET K    G      +K KK ++KE+++K+KE ++ KKKE +  + 
Sbjct: 181 AETSKSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFKS 227



 Score = 26.2 bits (57), Expect = 9.9
 Identities = 12/33 (36%), Positives = 22/33 (66%)

Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKER 138
           + +  KK    +K++++K+KEE++KKKK    R
Sbjct: 270 EAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHR 302


>gnl|CDD|237799 PRK14715, PRK14715, DNA polymerase II large subunit; Provisional.
          Length = 1627

 Score = 31.4 bits (71), Expect = 0.17
 Identities = 17/39 (43%), Positives = 27/39 (69%)

Query: 111 KKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVE 149
           K++K+EEK EE K +E +E+ ++EEK    E  E+ N+E
Sbjct: 283 KEEKDEEKSEEVKTEEVDEEFEEEEKGFYYELYEKVNIE 321



 Score = 27.5 bits (61), Expect = 3.3
 Identities = 14/32 (43%), Positives = 22/32 (68%)

Query: 119 EEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           +E K+KKEE++++K EE + EE   E E  E+
Sbjct: 277 KELKEKKEEKDEEKSEEVKTEEVDEEFEEEEK 308



 Score = 27.1 bits (60), Expect = 4.5
 Identities = 11/30 (36%), Positives = 20/30 (66%)

Query: 121 EKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           EKK++++EEK ++ + E  +E+ EEE    
Sbjct: 281 EKKEEKDEEKSEEVKTEEVDEEFEEEEKGF 310



 Score = 26.7 bits (59), Expect = 7.1
 Identities = 12/31 (38%), Positives = 22/31 (70%)

Query: 116 EEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
           E KE++++K EE+ ++ K E+  EE + EE+
Sbjct: 278 ELKEKKEEKDEEKSEEVKTEEVDEEFEEEEK 308



 Score = 26.3 bits (58), Expect = 8.4
 Identities = 11/39 (28%), Positives = 23/39 (58%)

Query: 111 KKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVE 149
           +KK+ +++E+ ++ K EE  ++ EE+E+       E V 
Sbjct: 281 EKKEEKDEEKSEEVKTEEVDEEFEEEEKGFYYELYEKVN 319


>gnl|CDD|218896 pfam06098, Radial_spoke_3, Radial spoke protein 3.  This family
           consists of several radial spoke protein 3 (RSP3)
           sequences. Eukaryotic cilia and flagella present in
           diverse types of cells perform motile, sensory, and
           developmental functions in organisms from protists to
           humans. They are centred by precisely organised,
           microtubule-based structures, the axonemes. The axoneme
           consists of two central singlet microtubules, called the
           central pair, and nine outer doublet microtubules. These
           structures are well-conserved during evolution. The
           outer doublet microtubules, each composed of A and B
           sub-fibres, are connected to each other by nexin links,
           while the central pair is held at the centre of the
           axoneme by radial spokes. The radial spokes are T-shaped
           structures extending from the A-tubule of each outer
           doublet microtubule to the centre of the axoneme. Radial
           spoke protein 3 (RSP3), is present at the proximal end
           of the spoke stalk and helps in anchoring the radial
           spoke to the outer doublet. It is thought that radial
           spokes regulate the activity of inner arm dynein through
           protein phosphorylation and dephosphorylation.
          Length = 288

 Score = 31.2 bits (71), Expect = 0.17
 Identities = 13/39 (33%), Positives = 29/39 (74%)

Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKRE 144
           +R  E  +    +E E++++EE+E++KK++KER++ ++E
Sbjct: 167 RRNAELAETQRLEEAERRRREEKERRKKQDKERKQREKE 205


>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
           RPA34.5.  This is a family of proteins conserved from
           yeasts to human. Subunit A34.5 of RNA polymerase I is a
           non-essential subunit which is thought to help Pol I
           overcome topological constraints imposed on ribosomal
           DNA during the process of transcription.
          Length = 193

 Score = 30.8 bits (70), Expect = 0.18
 Identities = 18/54 (33%), Positives = 35/54 (64%)

Query: 93  RSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
              ++ E +      + EK+ + EE+E+++KKK++E KK+K+EK+ ++EK  E 
Sbjct: 127 ELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEP 180



 Score = 29.3 bits (66), Expect = 0.60
 Identities = 14/37 (37%), Positives = 30/37 (81%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
           EKK+K ++KE +K+KKE+++KK+K  + +  +K++++
Sbjct: 153 EKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKK 189



 Score = 27.8 bits (62), Expect = 1.9
 Identities = 14/52 (26%), Positives = 30/52 (57%)

Query: 93  RSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKRE 144
            SE +    +       E++KK ++K++E KK+++E+K KKE+    +  ++
Sbjct: 134 TSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKK 185



 Score = 27.8 bits (62), Expect = 2.0
 Identities = 13/38 (34%), Positives = 29/38 (76%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
           E+KK+ ++K+E KK+K+E++ KK++  E +  K++++ 
Sbjct: 152 EEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKK 189



 Score = 27.0 bits (60), Expect = 3.7
 Identities = 12/39 (30%), Positives = 31/39 (79%)

Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKRE 144
           K++ +KKK+ +++++EKK K+E+  + K  K+++++K++
Sbjct: 154 KKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKK 192



 Score = 26.6 bits (59), Expect = 5.1
 Identities = 13/50 (26%), Positives = 31/50 (62%)

Query: 91  EFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREE 140
             +    +E     +K++ E KK+ +EK+++K+K  E +  KK++K++++
Sbjct: 143 VEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKK 192



 Score = 26.2 bits (58), Expect = 6.6
 Identities = 13/39 (33%), Positives = 29/39 (74%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKR 143
           +K + +KK+  +EK+E+K KKE+  + K  +K+++++K+
Sbjct: 154 KKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKK 192



 Score = 25.8 bits (57), Expect = 9.3
 Identities = 16/42 (38%), Positives = 29/42 (69%), Gaps = 4/42 (9%)

Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
           +   + +K+ E +EEEKK+K    KKKKE K+ ++EK++++ 
Sbjct: 138 ETTAKVEKEAEVEEEEKKEK----KKKKEVKKEKKEKKDKKE 175


>gnl|CDD|184287 PRK13735, PRK13735, conjugal transfer mating pair stabilization
           protein TraG; Provisional.
          Length = 942

 Score = 31.3 bits (71), Expect = 0.18
 Identities = 15/65 (23%), Positives = 26/65 (40%), Gaps = 11/65 (16%)

Query: 82  ECTVSNGLLEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKK----KEEEEKKKKEEKE 137
           E TV     E ++  K E          +  K NEEK  +++       EE  K+ +E +
Sbjct: 885 ENTVKGQYSELQNHHKTEAL-------SQNNKYNEEKSAQERMPGADSPEELMKRAKEYQ 937

Query: 138 REEEK 142
            + + 
Sbjct: 938 DKHKG 942


>gnl|CDD|216337 pfam01159, Ribosomal_L6e, Ribosomal protein L6e. 
          Length = 108

 Score = 29.9 bits (68), Expect = 0.20
 Identities = 14/36 (38%), Positives = 26/36 (72%), Gaps = 2/36 (5%)

Query: 104 FRKRQGEKKKKNEEK--EEEKKKKEEEEKKKKEEKE 137
           F++ + +KKKK+E +   E+K+KKE  E++K ++K 
Sbjct: 37  FKREKKKKKKKSEGEFFAEKKEKKEVSEQRKADQKA 72


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
           family consists of several hypothetical bacterial
           proteins of around 200 residues in length. The function
           of this family is unknown.
          Length = 214

 Score = 30.9 bits (70), Expect = 0.20
 Identities = 15/41 (36%), Positives = 28/41 (68%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           +K    +E EE +++ EEE+++  +E E+E E++ E NVE+
Sbjct: 80  DKGDAEKEDEESEEENEEEDEESSDENEKETEEKTESNVEK 120



 Score = 29.7 bits (67), Expect = 0.50
 Identities = 19/46 (41%), Positives = 28/46 (60%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           +    E  +  E KEEEK+    E+K+ K + E+E+E+ EEEN EE
Sbjct: 53  KSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEE 98



 Score = 28.9 bits (65), Expect = 0.91
 Identities = 17/57 (29%), Positives = 32/57 (56%)

Query: 94  SETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
               +E       ++ EK+  N E +E+K   E+E+++ +EE E E+E+  +EN +E
Sbjct: 53  KSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKE 109



 Score = 28.5 bits (64), Expect = 1.1
 Identities = 13/42 (30%), Positives = 25/42 (59%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
            K   EK+ +  E+E E++ +E  ++ +KE +E+ E   E+E
Sbjct: 80  DKGDAEKEDEESEEENEEEDEESSDENEKETEEKTESNVEKE 121



 Score = 28.2 bits (63), Expect = 1.5
 Identities = 16/61 (26%), Positives = 33/61 (54%), Gaps = 4/61 (6%)

Query: 94  SETKKEGNLGFRKRQGEKKKKNEEKEE----EKKKKEEEEKKKKEEKEREEEKREEENVE 149
           +  ++E      +   E ++  EE++E    E K+ + + +K+ EE E E E+ +EE+ +
Sbjct: 45  AADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSD 104

Query: 150 E 150
           E
Sbjct: 105 E 105


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
           the C subunit of DNA polymerase delta. It carries the
           essential residues for binding to the Pol1 subunit of
           polymerase alpha, from residues 293-332, which are
           characterized by the motif D--G--VT, referred to as the
           DPIM motif. The first 160 residues of the protein form
           the minimal domain for binding to the B subunit, Cdc1,
           of polymerase delta, the final 10 C-terminal residues,
           362-372, being the DNA sliding clamp, PCNA, binding
           motif.
          Length = 427

 Score = 31.0 bits (70), Expect = 0.22
 Identities = 15/47 (31%), Positives = 27/47 (57%)

Query: 104 FRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           F K++ ++KK+ +E  E   K+E EE+  K +   E+E  E   ++E
Sbjct: 221 FFKKKTKEKKEKKEASESTVKEESEEESGKRDVILEDESAEPTGLDE 267



 Score = 28.6 bits (64), Expect = 1.3
 Identities = 13/35 (37%), Positives = 23/35 (65%)

Query: 111 KKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
           ++  +EE+ EEK+K++ +  KK  E E E+E+ E 
Sbjct: 279 ERSDSEEETEEKEKEKRKRLKKMMEDEDEDEEMEI 313



 Score = 26.3 bits (58), Expect = 6.5
 Identities = 14/55 (25%), Positives = 27/55 (49%)

Query: 90  LEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKRE 144
               +   ++ +    K  GE+    EE EE++K+K +  KK  E+++ +EE   
Sbjct: 259 SAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKRKRLKKMMEDEDEDEEMEI 313


>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97 family.
            This model family represents the major capsid protein
           component of the heads (capsids) of bacteriophage HK97,
           phi-105, P27, and related phage. This model represents
           one of several analogous families lacking detectable
           sequence similarity. The gene encoding this component is
           typically located in an operon encoding the small and
           large terminase subunits, the portal protein and the
           prohead or maturation protease [Mobile and
           extrachromosomal element functions, Prophage functions].
          Length = 384

 Score = 30.8 bits (70), Expect = 0.23
 Identities = 11/41 (26%), Positives = 23/41 (56%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           E ++   E  +E +K  E+EK  + E+E+ E    +E +++
Sbjct: 3   ELREALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDK 43



 Score = 29.6 bits (67), Expect = 0.57
 Identities = 18/63 (28%), Positives = 28/63 (44%), Gaps = 5/63 (7%)

Query: 89  LLEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKK-KEEKEREEEKREEEN 147
           L E  +E  KE     RK   ++K    E+E+ +    +EE  K   E +R EE  +E  
Sbjct: 4   LREALAELAKE----LRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELE 59

Query: 148 VEE 150
            + 
Sbjct: 60  AKP 62


>gnl|CDD|215278 PLN02502, PLN02502, lysyl-tRNA synthetase.
          Length = 553

 Score = 30.7 bits (70), Expect = 0.23
 Identities = 14/36 (38%), Positives = 17/36 (47%)

Query: 108 QGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKR 143
            GE   KN  K+  K K+ EEEK  KEE +      
Sbjct: 4   NGEPLSKNALKKRLKAKQAEEEKAAKEEAKAAAAAA 39



 Score = 27.3 bits (61), Expect = 4.0
 Identities = 11/52 (21%), Positives = 20/52 (38%)

Query: 99  EGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
                  K   +K+ K ++ EEEK  KEE +        +   ++     +E
Sbjct: 3   SNGEPLSKNALKKRLKAKQAEEEKAAKEEAKAAAAAAAAKGRSRKSAAADDE 54



 Score = 26.5 bits (59), Expect = 7.7
 Identities = 12/61 (19%), Positives = 22/61 (36%), Gaps = 3/61 (4%)

Query: 86  SNGLLEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
           SNG    ++  KK       K+  E+K   EE +        + + +K     +E     
Sbjct: 3   SNGEPLSKNALKKRL---KAKQAEEEKAAKEEAKAAAAAAAAKGRSRKSAAADDETMDPT 59

Query: 146 E 146
           +
Sbjct: 60  Q 60


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
           Fas-binding protein) is thought to play a role in
           apoptosis, but precise role played by Daxx remains to be
           determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 31.0 bits (70), Expect = 0.25
 Identities = 20/56 (35%), Positives = 33/56 (58%)

Query: 91  EFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
              S T  E      +   E++   EE+EEE++++EEE++ ++EE E EEE+ E E
Sbjct: 422 SKASSTSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVE 477



 Score = 29.9 bits (67), Expect = 0.46
 Identities = 16/41 (39%), Positives = 30/41 (73%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           E  ++ EE+EEE++++E+E ++++ E E EEE+ E +N  E
Sbjct: 443 ESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSE 483



 Score = 29.1 bits (65), Expect = 0.87
 Identities = 16/38 (42%), Positives = 27/38 (71%)

Query: 113 KKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
             ++E EEE+  +EEEE++++EE+E +E + EE   EE
Sbjct: 434 MASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEE 471



 Score = 29.1 bits (65), Expect = 1.00
 Identities = 14/42 (33%), Positives = 29/42 (69%)

Query: 109 GEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
            E+  + EE+EEE++++EE+E +++E ++ EEE+  E +   
Sbjct: 441 EEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGS 482



 Score = 28.7 bits (64), Expect = 1.2
 Identities = 16/37 (43%), Positives = 27/37 (72%)

Query: 114 KNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           + EE  EE++++EEEE+++++E E EE + EEE  E 
Sbjct: 440 EEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEV 476



 Score = 28.7 bits (64), Expect = 1.5
 Identities = 17/46 (36%), Positives = 29/46 (63%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
              +       E +EEE  ++EEEE++++EE+E+E E+ E E+ EE
Sbjct: 427 TSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEE 472



 Score = 28.3 bits (63), Expect = 1.6
 Identities = 16/42 (38%), Positives = 29/42 (69%)

Query: 109 GEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
            ++ ++ E  EEE++++EEEE++++E +E E E  EEE   E
Sbjct: 436 SQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVE 477



 Score = 28.3 bits (63), Expect = 1.8
 Identities = 21/74 (28%), Positives = 40/74 (54%)

Query: 76  KKRLTRECTVSNGLLEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEE 135
           +KR  RE   ++      S+              E +++   +EEE++++EEEE++++ E
Sbjct: 404 RKRQERERQGTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESE 463

Query: 136 KEREEEKREEENVE 149
           +E  E++ EEE VE
Sbjct: 464 EEEGEDEEEEEEVE 477



 Score = 27.6 bits (61), Expect = 3.0
 Identities = 16/40 (40%), Positives = 30/40 (75%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVE 149
           E++++ EE+EEE++ +EEE + ++EE+E E +   EE +E
Sbjct: 448 EEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEME 487



 Score = 27.2 bits (60), Expect = 4.1
 Identities = 14/40 (35%), Positives = 27/40 (67%)

Query: 111 KKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
              +  E+EE  +++EEEE++++EE++  EE+  E+  EE
Sbjct: 434 MASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEE 473



 Score = 27.2 bits (60), Expect = 4.5
 Identities = 17/46 (36%), Positives = 28/46 (60%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
               GE      ++ EE++  EEEE++++EE+E E+E  EEE  +E
Sbjct: 425 SSTSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDE 470


>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
           envelope biogenesis, outer membrane].
          Length = 387

 Score = 30.7 bits (69), Expect = 0.25
 Identities = 18/48 (37%), Positives = 32/48 (66%), Gaps = 2/48 (4%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKE--EEEKKKKEEKEREEEKREEENVEE 150
           +K + ++KKK E+  EE K K+  E+E+ K+ EKER + + +++  EE
Sbjct: 75  KKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEE 122



 Score = 28.0 bits (62), Expect = 2.3
 Identities = 12/45 (26%), Positives = 29/45 (64%)

Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           ++  E+ K  +  E+E+ K+ E+E+ K +E++++ E+ E++   E
Sbjct: 86  EQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLE 130



 Score = 26.8 bits (59), Expect = 5.7
 Identities = 13/46 (28%), Positives = 19/46 (41%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
            K   E K K E    +KK + E +   ++ K   E K + E   E
Sbjct: 177 AKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKAE 222


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
           subunit.  This is a family of proteins which are
           subunits of the eukaryotic translation initiation factor
           3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
           cerevisiae protein eIF3j (HCR1) has been shown to be
           required for processing of 20S pre-rRNA and binds to 18S
           rRNA and eIF3 subunits Rpg1p and Prt1p.
          Length = 242

 Score = 30.4 bits (69), Expect = 0.25
 Identities = 14/41 (34%), Positives = 26/41 (63%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           E+K++ + K   K K ++  K K EEKE+ + ++EE+ + E
Sbjct: 43  EEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRE 83



 Score = 30.4 bits (69), Expect = 0.30
 Identities = 16/41 (39%), Positives = 25/41 (60%)

Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
           K +   K K EEKE+ K++KEE+  ++ EE   E+E  E+ 
Sbjct: 57  KAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKL 97



 Score = 29.2 bits (66), Expect = 0.77
 Identities = 15/43 (34%), Positives = 26/43 (60%)

Query: 108 QGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
              K KK  + + E+K+K + EK++K  +E EE+  E+E  E+
Sbjct: 54  AKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEK 96



 Score = 28.1 bits (63), Expect = 1.8
 Identities = 14/41 (34%), Positives = 25/41 (60%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           +   K + K+  K K EE+EK K+E++E+   + EE+  E+
Sbjct: 51  KVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPED 91



 Score = 27.3 bits (61), Expect = 2.9
 Identities = 13/41 (31%), Positives = 24/41 (58%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           E ++K EEK +   K + ++  K + +E+E+ KRE+E    
Sbjct: 41  EDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGL 81



 Score = 26.5 bits (59), Expect = 4.8
 Identities = 10/46 (21%), Positives = 24/46 (52%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
            ++     K   +K  + K +E+E+ K+++E++   E  E+   +E
Sbjct: 47  EEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDE 92


>gnl|CDD|217283 pfam02913, FAD-oxidase_C, FAD linked oxidases, C-terminal domain.
           This domain has a ferredoxin-like fold.
          Length = 247

 Score = 30.4 bits (69), Expect = 0.26
 Identities = 13/69 (18%), Positives = 23/69 (33%), Gaps = 2/69 (2%)

Query: 17  SIAKRYGGIPAGESNGRRGYMLTYIIAYIRDFACDYYFIGDSFETSVPWDKTVLLCINVK 76
                       E   R      Y +  +RD       +  + + SVPW +   L  ++K
Sbjct: 90  GAGDVVVAEDEAERE-RLWRARKYAMP-LRDALGGAGPLVFTEDVSVPWSRLPDLVADIK 147

Query: 77  KRLTRECTV 85
           + L +   V
Sbjct: 148 ELLAKYGLV 156


>gnl|CDD|217451 pfam03249, TSA, Type specific antigen.  There are several antigenic
           variants in Rickettsia tsutsugamushi, and a
           type-specific antigen (TSA) of 56-kilodaltons located on
           the rickettsial surface is responsible for the
           variation. TSA proteins are probably integral membrane
           proteins.
          Length = 502

 Score = 30.6 bits (69), Expect = 0.27
 Identities = 15/51 (29%), Positives = 24/51 (47%), Gaps = 6/51 (11%)

Query: 97  KKEGNLGFRKR------QGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEE 141
           K + + G +K       Q E+   N+   + KKK+   EK K+  K +E E
Sbjct: 352 KLQRHAGIKKAMEKLAAQQEEDAGNQGGGDCKKKQGASEKSKEGGKGKETE 402


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 30.5 bits (70), Expect = 0.29
 Identities = 20/66 (30%), Positives = 41/66 (62%), Gaps = 9/66 (13%)

Query: 90  LEFRSETKKEGNLGFRKRQGE---KKKKNEEKEEE-KKKKEEEEKKKKE-EKEREEEKRE 144
            + R+E +KE     R+R+ E    +K+  +KEE   +K E  EK+++E EK+ +E +++
Sbjct: 67  HKLRNEFEKE----LRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQK 122

Query: 145 EENVEE 150
           ++ +E+
Sbjct: 123 QQELEK 128



 Score = 29.7 bits (68), Expect = 0.61
 Identities = 14/47 (29%), Positives = 23/47 (48%), Gaps = 6/47 (12%)

Query: 106 KRQGEKKKKNEEKEEEKKKKE------EEEKKKKEEKEREEEKREEE 146
           + + ++  +  +KE E  KKE      EE  K + E E+E  +R  E
Sbjct: 37  EEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNE 83



 Score = 28.2 bits (64), Expect = 1.7
 Identities = 14/44 (31%), Positives = 30/44 (68%)

Query: 97  KKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREE 140
           +K   L  R+ + EKK+K  E+++++ +K+EEE ++  E++ +E
Sbjct: 100 RKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQE 143



 Score = 27.8 bits (63), Expect = 2.4
 Identities = 21/54 (38%), Positives = 37/54 (68%), Gaps = 3/54 (5%)

Query: 97  KKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           +KE NL    R+ E  +K EE+ E+K+K+ E+++++ E+KE E E+  EE ++E
Sbjct: 93  QKEENL---DRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQE 143



 Score = 27.0 bits (61), Expect = 3.7
 Identities = 13/35 (37%), Positives = 20/35 (57%), Gaps = 1/35 (2%)

Query: 111 KKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
           K+   E KEE  K + E EK+ +E +  E +K E+
Sbjct: 56  KEALLEAKEEIHKLRNEFEKELRE-RRNELQKLEK 89


>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
          Length = 434

 Score = 30.3 bits (69), Expect = 0.31
 Identities = 17/46 (36%), Positives = 24/46 (52%), Gaps = 4/46 (8%)

Query: 91  EFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEK 136
           E R +TK        K+ G+  KK   K  EKK+KE+E+ K K+  
Sbjct: 375 ELRPKTKAPS----EKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRH 416



 Score = 26.8 bits (60), Expect = 5.8
 Identities = 14/52 (26%), Positives = 22/52 (42%), Gaps = 4/52 (7%)

Query: 95  ETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
           +T K       KR  +K+K    ++E+ K K+     K   K R+     EE
Sbjct: 387 KTGKPSKKVLAKRAEKKEK----EKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434


>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain.  This is a family of proteins of
           approximately 300 residues, found in plants and
           vertebrates. They contain a highly conserved DDRGK
           motif.
          Length = 189

 Score = 30.1 bits (68), Expect = 0.32
 Identities = 17/52 (32%), Positives = 35/52 (67%)

Query: 95  ETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
           + +++      + + E+KK  E++E E+K++EE E++++++KE EE K  EE
Sbjct: 15  QARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREE 66



 Score = 29.3 bits (66), Expect = 0.51
 Identities = 20/37 (54%), Positives = 33/37 (89%)

Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEK 142
           KR+GE+K++ E +EE +KKKEEEE+K++EE+ R+E++
Sbjct: 37  KREGERKEEEELEEEREKKKEEEERKEREEQARKEQE 73



 Score = 27.0 bits (60), Expect = 3.4
 Identities = 19/56 (33%), Positives = 37/56 (66%)

Query: 95  ETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
             KK   L  ++ + ++++  EE+ EE+KK EE+ + +++E+E  EE+RE++  EE
Sbjct: 4   GAKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEE 59



 Score = 27.0 bits (60), Expect = 3.9
 Identities = 17/45 (37%), Positives = 33/45 (73%)

Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           K+ G KK+   E+++ ++++ E E++++EE+++ EEKRE E  EE
Sbjct: 1   KKIGAKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEE 45


>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
           partitioning / Cytoskeleton].
          Length = 373

 Score = 30.4 bits (69), Expect = 0.32
 Identities = 20/68 (29%), Positives = 31/68 (45%), Gaps = 6/68 (8%)

Query: 89  LLEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEK------KKKEEEEKKKKEEKEREEEK 142
           L E +  T+      +R  +    K + E   ++       ++E E KKK  EK RE+EK
Sbjct: 272 LQELKETTENLLYENYRTEKLSGLKNSGEPSLKEIHEARLNEEERELKKKFTEKIREKEK 331

Query: 143 REEENVEE 150
           R EE  + 
Sbjct: 332 RLEELEQN 339



 Score = 28.4 bits (64), Expect = 1.6
 Identities = 15/54 (27%), Positives = 30/54 (55%)

Query: 93  RSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
            +E ++E    F ++  EK+K+ EE E+   ++ +E   K EE +++ E  E+ 
Sbjct: 311 LNEEERELKKKFTEKIREKEKRLEELEQNLIEERKELNSKLEEIQKKLEDLEKR 364


>gnl|CDD|206063 pfam13892, DBINO, DNA-binding domain.  DBINO is a DNA-binding
           domain found on global transcription activator SNF2L1
           proteins and chromatin re-modelling proteins.
          Length = 140

 Score = 29.5 bits (67), Expect = 0.32
 Identities = 15/32 (46%), Positives = 21/32 (65%), Gaps = 1/32 (3%)

Query: 111 KKKKNEEKEEEKKK-KEEEEKKKKEEKEREEE 141
           KK + EE+E  K+  KE  E+ KKEE+ RE +
Sbjct: 79  KKNEKEERELRKRAEKEALEQAKKEEELREAK 110



 Score = 28.8 bits (65), Expect = 0.69
 Identities = 13/29 (44%), Positives = 20/29 (68%)

Query: 122 KKKKEEEEKKKKEEKEREEEKREEENVEE 150
           K +KEE E +K+ EKE  E+ ++EE + E
Sbjct: 80  KNEKEERELRKRAEKEALEQAKKEEELRE 108



 Score = 26.1 bits (58), Expect = 6.5
 Identities = 11/33 (33%), Positives = 22/33 (66%)

Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKER 138
           K + E +K+ E++  E+ KKEEE ++ K ++ +
Sbjct: 83  KEERELRKRAEKEALEQAKKEEELREAKRQQRK 115


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
           consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 30.4 bits (68), Expect = 0.33
 Identities = 15/41 (36%), Positives = 23/41 (56%)

Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
           K Q E KK +EE  + K  K  + K++ +  E+E E +E E
Sbjct: 285 KAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKELE 325



 Score = 28.8 bits (64), Expect = 1.2
 Identities = 11/45 (24%), Positives = 24/45 (53%)

Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           K+  E+  K ++ +    K+E +  +K+ E +  E +++ E V E
Sbjct: 291 KKNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKREPVAE 335



 Score = 27.7 bits (61), Expect = 2.7
 Identities = 17/53 (32%), Positives = 25/53 (47%), Gaps = 8/53 (15%)

Query: 106 KRQGEKKKKNEEK-EEEKKKKEEEEKKKKEEKE-------REEEKREEENVEE 150
           K+  E +K+  EK + E KK +EE  K K+ K        +  EK  E+   E
Sbjct: 273 KQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKELE 325



 Score = 26.9 bits (59), Expect = 4.8
 Identities = 13/52 (25%), Positives = 27/52 (51%), Gaps = 6/52 (11%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEE------KEREEEKREEENVEE 150
           ++    +K++ E+ + E KK +EE  K K+       +E +  ++E E+ E 
Sbjct: 273 KQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKEL 324


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 30.6 bits (70), Expect = 0.35
 Identities = 15/45 (33%), Positives = 25/45 (55%), Gaps = 2/45 (4%)

Query: 105 RKRQGEKKKK--NEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
            +   +K ++     KE EK K+E EEKK+K ++E ++   E E 
Sbjct: 529 ERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEK 573



 Score = 28.3 bits (64), Expect = 1.5
 Identities = 15/37 (40%), Positives = 25/37 (67%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
           E +++ E+K EE +   +E +K KEE E ++EK +EE
Sbjct: 527 ELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEE 563



 Score = 26.7 bits (60), Expect = 5.6
 Identities = 13/36 (36%), Positives = 23/36 (63%)

Query: 115 NEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
             E+E E+K +E E   K+ EK +EE + ++E ++E
Sbjct: 527 ELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQE 562



 Score = 26.3 bits (59), Expect = 8.0
 Identities = 23/57 (40%), Positives = 32/57 (56%), Gaps = 3/57 (5%)

Query: 90  LEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
           LE   E K E       ++ EK K  EE EE+K+K +EEE K  EE E+E ++  +E
Sbjct: 528 LERELEQKAE-EAEALLKEAEKLK--EELEEKKEKLQEEEDKLLEEAEKEAQQAIKE 581


>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
          Length = 106

 Score = 29.2 bits (66), Expect = 0.35
 Identities = 12/24 (50%), Positives = 17/24 (70%)

Query: 116 EEKEEEKKKKEEEEKKKKEEKERE 139
                E+KK+EEEE+++KEE E E
Sbjct: 73  AAAAAEEKKEEEEEEEEKEESEEE 96



 Score = 29.2 bits (66), Expect = 0.41
 Identities = 11/22 (50%), Positives = 17/22 (77%)

Query: 108 QGEKKKKNEEKEEEKKKKEEEE 129
              ++KK EE+EEE+K++ EEE
Sbjct: 75  AAAEEKKEEEEEEEEKEESEEE 96



 Score = 28.4 bits (64), Expect = 0.66
 Identities = 12/25 (48%), Positives = 17/25 (68%)

Query: 125 KEEEEKKKKEEKEREEEKREEENVE 149
              EEKK++EE+E E+E+ EEE   
Sbjct: 75  AAAEEKKEEEEEEEEKEESEEEAAA 99



 Score = 27.6 bits (62), Expect = 1.5
 Identities = 10/25 (40%), Positives = 15/25 (60%)

Query: 126 EEEEKKKKEEKEREEEKREEENVEE 150
              E+KK+EE+E EE++  EE    
Sbjct: 75  AAAEEKKEEEEEEEEKEESEEEAAA 99



 Score = 27.2 bits (61), Expect = 1.5
 Identities = 9/23 (39%), Positives = 18/23 (78%)

Query: 119 EEEKKKKEEEEKKKKEEKEREEE 141
               ++K+EEE++++E++E EEE
Sbjct: 74  AAAAEEKKEEEEEEEEKEESEEE 96



 Score = 27.2 bits (61), Expect = 1.7
 Identities = 9/25 (36%), Positives = 17/25 (68%)

Query: 116 EEKEEEKKKKEEEEKKKKEEKEREE 140
                 ++KKEEEE+++++E+  EE
Sbjct: 72  AAAAAAEEKKEEEEEEEEKEESEEE 96



 Score = 27.2 bits (61), Expect = 1.7
 Identities = 8/23 (34%), Positives = 16/23 (69%)

Query: 119 EEEKKKKEEEEKKKKEEKEREEE 141
                ++++EE++++EEKE  EE
Sbjct: 73  AAAAAEEKKEEEEEEEEKEESEE 95



 Score = 26.1 bits (58), Expect = 3.9
 Identities = 10/22 (45%), Positives = 16/22 (72%)

Query: 124 KKEEEEKKKKEEKEREEEKREE 145
              EE+K+++EE+E +EE  EE
Sbjct: 75  AAAEEKKEEEEEEEEKEESEEE 96



 Score = 26.1 bits (58), Expect = 4.7
 Identities = 6/23 (26%), Positives = 15/23 (65%)

Query: 119 EEEKKKKEEEEKKKKEEKEREEE 141
                 +E++E++++EE++ E E
Sbjct: 72  AAAAAAEEKKEEEEEEEEKEESE 94



 Score = 25.7 bits (57), Expect = 5.5
 Identities = 11/21 (52%), Positives = 14/21 (66%)

Query: 126 EEEEKKKKEEKEREEEKREEE 146
               ++KKEE+E EEEK E E
Sbjct: 74  AAAAEEKKEEEEEEEEKEESE 94


>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
          Length = 1036

 Score = 30.2 bits (68), Expect = 0.37
 Identities = 14/44 (31%), Positives = 27/44 (61%), Gaps = 4/44 (9%)

Query: 110 EKKKKNEEK----EEEKKKKEEEEKKKKEEKEREEEKREEENVE 149
           E+K++  EK    E E++++ EE+++++EEK   E  R +   E
Sbjct: 252 EEKRRELEKLAKEEAERERQAEEQRRREEEKAAMEADRAQAKAE 295



 Score = 27.9 bits (62), Expect = 2.4
 Identities = 17/48 (35%), Positives = 26/48 (54%), Gaps = 3/48 (6%)

Query: 106 KRQGEKKKKNEEKEEEKKKKEE---EEKKKKEEKEREEEKREEENVEE 150
           KR+  +K   EE E E++ +E+   EE+K   E +R + K E E   E
Sbjct: 254 KRRELEKLAKEEAERERQAEEQRRREEEKAAMEADRAQAKAEVEKRRE 301



 Score = 27.9 bits (62), Expect = 2.8
 Identities = 13/29 (44%), Positives = 18/29 (62%)

Query: 122 KKKKEEEEKKKKEEKEREEEKREEENVEE 150
           ++K+ E EK  KEE ERE +  E+   EE
Sbjct: 252 EEKRRELEKLAKEEAERERQAEEQRRREE 280



 Score = 26.8 bits (59), Expect = 6.0
 Identities = 11/35 (31%), Positives = 19/35 (54%), Gaps = 3/35 (8%)

Query: 104 FRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKER 138
           +RK Q E++     +E  + K E+  + K E KE+
Sbjct: 436 YRKLQEERR---LREEAIRAKAEKTARMKAEMKEK 467


>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19. 
           Med19 represents a family of conserved proteins which
           are members of the multi-protein co-activator Mediator
           complex. Mediator is required for activation of RNA
           polymerase II transcription by DNA binding
           transactivators.
          Length = 178

 Score = 29.8 bits (67), Expect = 0.39
 Identities = 14/31 (45%), Positives = 21/31 (67%)

Query: 111 KKKKNEEKEEEKKKKEEEEKKKKEEKEREEE 141
            KKK  E ++E+KKK++E+KKKK+    E  
Sbjct: 144 HKKKKHEDDKERKKKKKEKKKKKKRHSPEHP 174



 Score = 27.1 bits (60), Expect = 3.1
 Identities = 14/36 (38%), Positives = 25/36 (69%), Gaps = 4/36 (11%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
           EKK K ++ E++K++K    KKKKE+K++++    E
Sbjct: 141 EKKHKKKKHEDDKERK----KKKKEKKKKKKRHSPE 172



 Score = 27.1 bits (60), Expect = 3.5
 Identities = 9/34 (26%), Positives = 24/34 (70%)

Query: 114 KNEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
           +  +  E+K KK++ E  K+ +K+++E+K++++ 
Sbjct: 135 EGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKKR 168


>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 926

 Score = 30.3 bits (68), Expect = 0.39
 Identities = 10/41 (24%), Positives = 20/41 (48%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           +   + ++K +EK   ++E K +    E E E+   +  EE
Sbjct: 792 KAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREE 832



 Score = 30.3 bits (68), Expect = 0.40
 Identities = 6/45 (13%), Positives = 20/45 (44%)

Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
             + ++K K +   +++ K E    + + E+   +++ E+     
Sbjct: 794 AARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAP 838



 Score = 29.5 bits (66), Expect = 0.64
 Identities = 10/42 (23%), Positives = 21/42 (50%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
           R +Q  K+K   +KE + +    E + ++   ++ EEK  + 
Sbjct: 796 RAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDA 837



 Score = 28.8 bits (64), Expect = 1.3
 Identities = 12/56 (21%), Positives = 24/56 (42%), Gaps = 10/56 (17%)

Query: 104 FRKRQGEKKKKNEEKEEEKKKKEEE----EKKKKEEKEREEE------KREEENVE 149
            +++  EK   ++E + E    E E      K++EEK  +        K++   V+
Sbjct: 797 AKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAPAILNVKKKKPYKVD 852



 Score = 28.4 bits (63), Expect = 1.8
 Identities = 10/46 (21%), Positives = 22/46 (47%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           +     K+K  E+   +K+ K E    + E +    ++REE+ ++ 
Sbjct: 792 KAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDA 837


>gnl|CDD|218734 pfam05758, Ycf1, Ycf1.  The chloroplast genomes of most higher
           plants contain two giant open reading frames designated
           ycf1 and ycf2. Although the function of Ycf1 is unknown,
           it is known to be an essential gene.
          Length = 832

 Score = 30.4 bits (69), Expect = 0.41
 Identities = 17/42 (40%), Positives = 25/42 (59%), Gaps = 3/42 (7%)

Query: 104 FRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
           FR    +KK+       E+K K+EE   KKE+ +REE++R E
Sbjct: 643 FRNWIEKKKEFKILDYTEEKTKKEE---KKEKNKREEKERIE 681



 Score = 30.0 bits (68), Expect = 0.47
 Identities = 14/40 (35%), Positives = 20/40 (50%), Gaps = 4/40 (10%)

Query: 111 KKKKN----EEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
           KK K     EE+EEE   + E   + K  K+ +E   EE+
Sbjct: 224 KKLKETSETEEREEETDVEIETTSETKGTKQEQEGSTEED 263



 Score = 29.2 bits (66), Expect = 0.89
 Identities = 11/42 (26%), Positives = 20/42 (47%)

Query: 104 FRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
           F K+  E  +  E +EE   + E   + K  ++E+E    E+
Sbjct: 222 FTKKLKETSETEEREEETDVEIETTSETKGTKQEQEGSTEED 263



 Score = 28.8 bits (65), Expect = 1.1
 Identities = 15/34 (44%), Positives = 23/34 (67%)

Query: 116 EEKEEEKKKKEEEEKKKKEEKEREEEKREEENVE 149
           E+K+E K     EEK KKEEK+ + ++ E+E +E
Sbjct: 648 EKKKEFKILDYTEEKTKKEEKKEKNKREEKERIE 681



 Score = 28.8 bits (65), Expect = 1.2
 Identities = 11/38 (28%), Positives = 22/38 (57%)

Query: 113 KKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           KK +E  E ++++EE + + +   E +  K+E+E   E
Sbjct: 224 KKLKETSETEEREEETDVEIETTSETKGTKQEQEGSTE 261



 Score = 28.8 bits (65), Expect = 1.3
 Identities = 9/39 (23%), Positives = 22/39 (56%)

Query: 112 KKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           KK  E  E E++++E + + +   + +  ++ +E + EE
Sbjct: 224 KKLKETSETEEREEETDVEIETTSETKGTKQEQEGSTEE 262



 Score = 28.0 bits (63), Expect = 2.4
 Identities = 13/29 (44%), Positives = 19/29 (65%)

Query: 122 KKKKEEEEKKKKEEKEREEEKREEENVEE 150
           +KKKE +     EEK ++EEK+E+   EE
Sbjct: 648 EKKKEFKILDYTEEKTKKEEKKEKNKREE 676


>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein.  Rotavirus particles
           consist of three concentric proteinaceous capsid layers.
           The innermost capsid (core) is made of VP2. The genomic
           RNA and the two minor proteins VP1 and VP3 are
           encapsidated within this layer. The N-terminus of
           rotavirus VP2 is necessary for the encapsidation of VP1
           and VP3.
          Length = 887

 Score = 30.3 bits (68), Expect = 0.43
 Identities = 13/44 (29%), Positives = 28/44 (63%), Gaps = 1/44 (2%)

Query: 102 LGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
           + +R R+ E    N ++ +EK  +++++K + E KE+  +K+EE
Sbjct: 1   MAYRNRR-EANINNNDRMQEKDDEKQDQKNRMELKEKVLDKKEE 43



 Score = 28.0 bits (62), Expect = 2.4
 Identities = 9/45 (20%), Positives = 24/45 (53%)

Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
                 ++K++EK+++K + E +EK   +++E   +  +    E+
Sbjct: 12  NNNDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQ 56


>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P.  This model
           represents the L12P protein of the large (50S) subunit
           of the archaeal ribosome.
          Length = 105

 Score = 28.9 bits (65), Expect = 0.45
 Identities = 9/22 (40%), Positives = 19/22 (86%)

Query: 116 EEKEEEKKKKEEEEKKKKEEKE 137
             +EEE++++EEEE++++ E+E
Sbjct: 74  AAEEEEEEEEEEEEEEEESEEE 95



 Score = 28.1 bits (63), Expect = 0.81
 Identities = 9/20 (45%), Positives = 18/20 (90%)

Query: 110 EKKKKNEEKEEEKKKKEEEE 129
           E++++ EE+EEE++++ EEE
Sbjct: 76  EEEEEEEEEEEEEEEESEEE 95



 Score = 28.1 bits (63), Expect = 0.83
 Identities = 9/25 (36%), Positives = 18/25 (72%)

Query: 116 EEKEEEKKKKEEEEKKKKEEKEREE 140
                E++++EEEE++++EE+  EE
Sbjct: 71  AAAAAEEEEEEEEEEEEEEEESEEE 95



 Score = 28.1 bits (63), Expect = 0.95
 Identities = 10/23 (43%), Positives = 19/23 (82%)

Query: 119 EEEKKKKEEEEKKKKEEKEREEE 141
              ++++EEEE++++EE+E EEE
Sbjct: 73  AAAEEEEEEEEEEEEEEEESEEE 95



 Score = 27.7 bits (62), Expect = 1.0
 Identities = 10/23 (43%), Positives = 16/23 (69%)

Query: 125 KEEEEKKKKEEKEREEEKREEEN 147
              EE++++EE+E EEE+  EE 
Sbjct: 73  AAAEEEEEEEEEEEEEEEESEEE 95



 Score = 27.7 bits (62), Expect = 1.0
 Identities = 9/23 (39%), Positives = 17/23 (73%)

Query: 119 EEEKKKKEEEEKKKKEEKEREEE 141
               +++EEEE++++EE+E  EE
Sbjct: 72  AAAAEEEEEEEEEEEEEEEESEE 94



 Score = 27.7 bits (62), Expect = 1.3
 Identities = 12/26 (46%), Positives = 19/26 (73%)

Query: 124 KKEEEEKKKKEEKEREEEKREEENVE 149
              EEE++++EE+E EEE+ EEE + 
Sbjct: 73  AAAEEEEEEEEEEEEEEEESEEEAMA 98



 Score = 27.7 bits (62), Expect = 1.3
 Identities = 8/20 (40%), Positives = 17/20 (85%)

Query: 110 EKKKKNEEKEEEKKKKEEEE 129
            ++++ EE+EEE++++E EE
Sbjct: 75  AEEEEEEEEEEEEEEEESEE 94



 Score = 26.2 bits (58), Expect = 4.4
 Identities = 9/23 (39%), Positives = 16/23 (69%)

Query: 119 EEEKKKKEEEEKKKKEEKEREEE 141
                ++EEEE++++EE+E E E
Sbjct: 71  AAAAAEEEEEEEEEEEEEEEESE 93



 Score = 25.8 bits (57), Expect = 5.0
 Identities = 8/21 (38%), Positives = 14/21 (66%)

Query: 126 EEEEKKKKEEKEREEEKREEE 146
               ++++EE+E EEE+ EE 
Sbjct: 72  AAAAEEEEEEEEEEEEEEEES 92


>gnl|CDD|235672 PRK06012, flhA, flagellar biosynthesis protein FlhA; Validated.
          Length = 697

 Score = 30.1 bits (69), Expect = 0.47
 Identities = 9/29 (31%), Positives = 18/29 (62%)

Query: 122 KKKKEEEEKKKKEEKEREEEKREEENVEE 150
           K++K+  E   +E +E E  + EEE+ ++
Sbjct: 328 KREKKAAELAAEEAEEEEAAEPEEESWDD 356



 Score = 27.4 bits (62), Expect = 3.8
 Identities = 10/34 (29%), Positives = 19/34 (55%)

Query: 102 LGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEE 135
           LGF   +  K++K   +   ++ +EEE  + +EE
Sbjct: 319 LGFLAYRLRKREKKAAELAAEEAEEEEAAEPEEE 352


>gnl|CDD|205206 pfam13025, DUF3886, Protein of unknown function (DUF3886).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are approximately 90 amino acids in length.
           There are two completely conserved L residues that may
           be functionally important.
          Length = 70

 Score = 28.1 bits (63), Expect = 0.47
 Identities = 20/39 (51%), Positives = 29/39 (74%)

Query: 112 KKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           KKK  + EEEK+++EEE +K++E KERE+ K  EE + E
Sbjct: 25  KKKELKAEEEKREEEEEARKREERKEREKNKSFEELLNE 63


>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
          Length = 330

 Score = 29.8 bits (68), Expect = 0.48
 Identities = 9/20 (45%), Positives = 16/20 (80%)

Query: 116 EEKEEEKKKKEEEEKKKKEE 135
             +EEE++++EEEE++  EE
Sbjct: 300 AAEEEEEEEEEEEEEEPSEE 319



 Score = 29.1 bits (66), Expect = 0.74
 Identities = 14/40 (35%), Positives = 22/40 (55%)

Query: 111 KKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
            K   +E+ +E    + +    +EE+E EEE+ EEE  EE
Sbjct: 280 DKDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEE 319



 Score = 28.7 bits (65), Expect = 1.1
 Identities = 11/22 (50%), Positives = 16/22 (72%)

Query: 125 KEEEEKKKKEEKEREEEKREEE 146
              EE++++EE+E EEE  EEE
Sbjct: 299 AAAEEEEEEEEEEEEEEPSEEE 320



 Score = 28.7 bits (65), Expect = 1.2
 Identities = 8/20 (40%), Positives = 16/20 (80%)

Query: 110 EKKKKNEEKEEEKKKKEEEE 129
            ++++ EE+EEE+++  EEE
Sbjct: 301 AEEEEEEEEEEEEEEPSEEE 320



 Score = 26.4 bits (59), Expect = 6.1
 Identities = 7/22 (31%), Positives = 15/22 (68%)

Query: 108 QGEKKKKNEEKEEEKKKKEEEE 129
               +++ EE+EEE++++  EE
Sbjct: 298 AAAAEEEEEEEEEEEEEEPSEE 319



 Score = 26.0 bits (58), Expect = 8.5
 Identities = 14/42 (33%), Positives = 24/42 (57%)

Query: 109 GEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
            +K   +EE +E    + +    ++EE+E EEE+ EE + EE
Sbjct: 279 ADKDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEEE 320


>gnl|CDD|216317 pfam01131, Topoisom_bac, DNA topoisomerase.  This subfamily of
           topoisomerase is divided on the basis that these enzymes
           preferentially relax negatively supercoiled DNA, from a
           5' phospho- tyrosine linkage in the enzyme-DNA covalent
           intermediate and has high affinity for single stranded
           DNA.
          Length = 365

 Score = 29.9 bits (68), Expect = 0.51
 Identities = 16/63 (25%), Positives = 31/63 (49%)

Query: 77  KRLTRECTVSNGLLEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEK 136
                +  +  G  EF++  K     G++K   ++K ++EE   E K+ EE + K  E +
Sbjct: 208 VYEETKVELEVGGEEFKASGKVLLEPGWKKVYPDEKDEDEEPLPELKEGEELDVKDVELE 267

Query: 137 ERE 139
           E++
Sbjct: 268 EKK 270


>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32.  This family
           consists of several mammalian specific proacrosin
           binding protein sp32 sequences. sp32 is a sperm specific
           protein which is known to bind with with 55- and 53-kDa
           proacrosins and the 49-kDa acrosin intermediate. The
           exact function of sp32 is unclear, it is thought however
           that the binding of sp32 to proacrosin may be involved
           in packaging the acrosin zymogen into the acrosomal
           matrix.
          Length = 243

 Score = 29.6 bits (66), Expect = 0.54
 Identities = 15/40 (37%), Positives = 25/40 (62%)

Query: 111 KKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           K  K ++++   K +E  ++ K EEK+ +EE+ EEE  EE
Sbjct: 193 KAPKPKQEQLLSKLQEYLQEHKTEEKQPQEEQEEEEVEEE 232



 Score = 28.5 bits (63), Expect = 1.3
 Identities = 14/42 (33%), Positives = 29/42 (69%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
           + +Q +   K +E  +E K +E++ ++++EE+E EEE ++EE
Sbjct: 196 KPKQEQLLSKLQEYLQEHKTEEKQPQEEQEEEEVEEEAKQEE 237



 Score = 26.5 bits (58), Expect = 6.3
 Identities = 13/32 (40%), Positives = 21/32 (65%), Gaps = 4/32 (12%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEE 141
           E+K+  EE+EEE    E EE+ K+EE +  ++
Sbjct: 216 EEKQPQEEQEEE----EVEEEAKQEEGQGTDD 243


>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
           Transcription of the anti-viral guanylate-binding
           protein (GBP) is induced by interferon-gamma during
           macrophage induction. This family contains GBP1 and
           GPB2, both GTPases capable of binding GTP, GDP and GMP.
          Length = 297

 Score = 29.6 bits (67), Expect = 0.55
 Identities = 14/44 (31%), Positives = 24/44 (54%), Gaps = 1/44 (2%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEK-EREEEKREEEN 147
           ++   EK+K+ E+  E +++  +E  K+  EK E E EK   E 
Sbjct: 220 QELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEAEREKLLAEQ 263


>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207).  This
           family is found in eukaryotes; it has several conserved
           tryptophan residues. The function is not known.
          Length = 261

 Score = 29.7 bits (67), Expect = 0.56
 Identities = 9/34 (26%), Positives = 27/34 (79%)

Query: 112 KKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
            KK ++++++++++  +++KK++E+E  ++K EE
Sbjct: 189 LKKLKQQQQKREEERRKQRKKQQEEEERKQKAEE 222



 Score = 29.3 bits (66), Expect = 0.68
 Identities = 12/32 (37%), Positives = 26/32 (81%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEK 136
           + +Q ++K++ E +++ KK++EEEE+K+K E+
Sbjct: 191 KLKQQQQKREEERRKQRKKQQEEEERKQKAEE 222



 Score = 28.9 bits (65), Expect = 0.88
 Identities = 13/39 (33%), Positives = 30/39 (76%)

Query: 108 QGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
           Q E KK+ +E E +K K+++++++++  K+R++++ EEE
Sbjct: 177 QEEAKKRLQEWELKKLKQQQQKREEERRKQRKKQQEEEE 215



 Score = 28.5 bits (64), Expect = 1.1
 Identities = 13/42 (30%), Positives = 30/42 (71%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
           R    E+ KK  ++ E KK K++++K+++E +++ ++++EEE
Sbjct: 173 RNVSQEEAKKRLQEWELKKLKQQQQKREEERRKQRKKQQEEE 214



 Score = 28.2 bits (63), Expect = 1.7
 Identities = 11/39 (28%), Positives = 25/39 (64%)

Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKRE 144
             +  +++K  +K  E+K+K+E EK+++E + R+   +E
Sbjct: 94  SAKQAQRQKKLQKLLEEKQKQEREKEREEAELRQRLAKE 132



 Score = 28.2 bits (63), Expect = 1.8
 Identities = 9/31 (29%), Positives = 25/31 (80%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREE 140
           E KK  +++++ ++++ ++ KK++EE+ER++
Sbjct: 188 ELKKLKQQQQKREEERRKQRKKQQEEEERKQ 218



 Score = 26.6 bits (59), Expect = 5.4
 Identities = 12/32 (37%), Positives = 23/32 (71%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEK 136
            K+Q +K+++   K+ +K+++EEE K+K EE 
Sbjct: 192 LKQQQQKREEERRKQRKKQQEEEERKQKAEEA 223



 Score = 26.6 bits (59), Expect = 5.5
 Identities = 12/37 (32%), Positives = 27/37 (72%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEE 141
           R+++ +K  + ++K+E +K++EE E +++  KE+ EE
Sbjct: 100 RQKKLQKLLEEKQKQEREKEREEAELRQRLAKEKYEE 136



 Score = 26.6 bits (59), Expect = 5.9
 Identities = 11/45 (24%), Positives = 30/45 (66%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVE 149
            +R   +++  +  +E + KK +++++K+EE+ R++ K+++E  E
Sbjct: 171 PERNVSQEEAKKRLQEWELKKLKQQQQKREEERRKQRKKQQEEEE 215



 Score = 26.2 bits (58), Expect = 7.3
 Identities = 12/42 (28%), Positives = 28/42 (66%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
            K+   +KK  +  EE++K++ E+E+++ E ++R  +++ EE
Sbjct: 95  AKQAQRQKKLQKLLEEKQKQEREKEREEAELRQRLAKEKYEE 136



 Score = 25.8 bits (57), Expect = 8.6
 Identities = 15/42 (35%), Positives = 29/42 (69%), Gaps = 2/42 (4%)

Query: 111 KKKKNEEKEEEKKKKEEEEKKKKE--EKEREEEKREEENVEE 150
           K ++N  +EE KK+ +E E KK +  +++REEE+R++   ++
Sbjct: 170 KPERNVSQEEAKKRLQEWELKKLKQQQQKREEERRKQRKKQQ 211


>gnl|CDD|240370 PTZ00342, PTZ00342, acyl-CoA synthetase; Provisional.
          Length = 746

 Score = 29.7 bits (67), Expect = 0.59
 Identities = 15/63 (23%), Positives = 24/63 (38%)

Query: 88  GLLEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
           GLLE ++E      L       + K+ N  KEE+           K  KE ++       
Sbjct: 190 GLLERKNELPHLKKLIILDTLIKSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNE 249

Query: 148 VEE 150
           +E+
Sbjct: 250 LED 252



 Score = 26.6 bits (59), Expect = 6.2
 Identities = 12/44 (27%), Positives = 19/44 (43%), Gaps = 1/44 (2%)

Query: 97  KKEGNLGFR-KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKERE 139
           K+E N G      G K  K E+K  +   + E+      E ++E
Sbjct: 220 KEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEYDKE 263


>gnl|CDD|224217 COG1298, FlhA, Flagellar biosynthesis pathway, component FlhA [Cell
           motility and secretion / Intracellular trafficking and
           secretion].
          Length = 696

 Score = 29.5 bits (67), Expect = 0.59
 Identities = 13/30 (43%), Positives = 24/30 (80%)

Query: 121 EKKKKEEEEKKKKEEKEREEEKREEENVEE 150
            K+++ EE++K  EE ++EEE+ EEE+V++
Sbjct: 326 RKEQQAEEKEKPAEEAKKEEEEEEEESVDD 355


>gnl|CDD|148679 pfam07218, RAP1, Rhoptry-associated protein 1 (RAP-1).  This family
           consists of several rhoptry-associated protein 1 (RAP-1)
           sequences which appear to be specific to Plasmodium
           falciparum.
          Length = 790

 Score = 29.7 bits (66), Expect = 0.61
 Identities = 16/51 (31%), Positives = 27/51 (52%)

Query: 97  KKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
           K E  +   K  G+K  + EE E +++ +E EE KK+  ++   E  EE+ 
Sbjct: 225 KLEEEIDILKNDGDKVAEEEEFELDEEHEEAEEDKKEALEKIGAEGDEEKF 275


>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
            biogenesis [Translation, ribosomal structure and
            biogenesis].
          Length = 1077

 Score = 29.7 bits (66), Expect = 0.61
 Identities = 14/45 (31%), Positives = 27/45 (60%)

Query: 106  KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
            K + + +++ E  E  ++ KEEE  KK++E+E+   K   +N +E
Sbjct: 1023 KERIKDQEEKERMESLQRAKEEEIGKKEKEREQRIRKTIHDNYKE 1067



 Score = 27.0 bits (59), Expect = 5.0
 Identities = 9/46 (19%), Positives = 29/46 (63%)

Query: 105  RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
            + R  +++ K++E++E  +  +  ++++  +KE+E E+R  + + +
Sbjct: 1018 KDRIVKERIKDQEEKERMESLQRAKEEEIGKKEKEREQRIRKTIHD 1063


>gnl|CDD|219621 pfam07890, Rrp15p, Rrp15p.  Rrp15p is required for the formation of
           60S ribosomal subunits.
          Length = 132

 Score = 28.9 bits (65), Expect = 0.61
 Identities = 9/34 (26%), Positives = 21/34 (61%)

Query: 114 KNEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
           ++++  + KKK + E+ +KK +++   EKR+   
Sbjct: 26  RSKKLLKAKKKLKSEKLEKKAKRQLRAEKRQALE 59



 Score = 25.8 bits (57), Expect = 7.4
 Identities = 11/33 (33%), Positives = 21/33 (63%)

Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKER 138
             + +KK K+E+ E++ K++   EK++  EK R
Sbjct: 30  LLKAKKKLKSEKLEKKAKRQLRAEKRQALEKGR 62


>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
           complex aNOP56 subunit; Provisional.
          Length = 414

 Score = 29.6 bits (67), Expect = 0.68
 Identities = 13/32 (40%), Positives = 25/32 (78%)

Query: 111 KKKKNEEKEEEKKKKEEEEKKKKEEKEREEEK 142
            KKK EEK+ +K+KK+++ KKK ++++++  K
Sbjct: 383 PKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 28.4 bits (64), Expect = 1.5
 Identities = 11/39 (28%), Positives = 29/39 (74%)

Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKRE 144
           K +  K  K + +E++ +K+++++K+KK+ K+R+++ R+
Sbjct: 376 KEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 28.0 bits (63), Expect = 2.2
 Identities = 13/46 (28%), Positives = 29/46 (63%), Gaps = 5/46 (10%)

Query: 105 RKRQGEKKKKNEE-----KEEEKKKKEEEEKKKKEEKEREEEKREE 145
            KR  E K+K  +     +EE+K +K +++KK+K++ ++ ++K  +
Sbjct: 369 NKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414


>gnl|CDD|237744 PRK14521, rpsP, 30S ribosomal protein S16; Provisional.
          Length = 186

 Score = 29.0 bits (65), Expect = 0.69
 Identities = 14/48 (29%), Positives = 21/48 (43%), Gaps = 3/48 (6%)

Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKK---KKEEKEREEEKREEENVEE 150
           K   +K K ++ K+  KK   E EKK    + E   E++  E   V  
Sbjct: 113 KVNAKKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAAAVAA 160



 Score = 27.0 bits (60), Expect = 3.5
 Identities = 12/55 (21%), Positives = 20/55 (36%)

Query: 96  TKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
             K+  L   K+  +K     EK+  + + E   +KK  E      +      EE
Sbjct: 115 NAKKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAAAVAAEEAAAAEEE 169



 Score = 26.3 bits (58), Expect = 5.9
 Identities = 10/34 (29%), Positives = 15/34 (44%)

Query: 112 KKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
           KK NE + E   +K+  E      +E    + EE
Sbjct: 137 KKVNEARAEAVAEKKAAEAAAVAAEEAAAAEEEE 170


>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
           (TAF4) is one of several TAFs that bind TBP and is
           involved in forming Transcription Factor IID (TFIID)
           complex.  The TATA Binding Protein (TBP) Associated
           Factor 4 (TAF4) is one of several TAFs that bind TBP and
           are involved in forming the Transcription Factor IID
           (TFIID) complex. TFIID is one of seven General
           Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
           TFIIF, and TFIID) that are involved in accurate
           initiation of transcription by RNA polymerase II in
           eukaryote. TFIID plays an important role in the
           recognition of promoter DNA and assembly of the
           pre-initiation complex. TFIID complex is composed of the
           TBP and at least 13 TAFs. TAFs from various species were
           originally named by their predicted molecular weight or
           their electrophoretic mobility in polyacrylamide gels. A
           new, unified nomenclature for the pol II TAFs has been
           suggested to show the relationship between TAF orthologs
           and paralogs. Several hypotheses are proposed for TAFs
           functions such as serving as activator-binding sites,
           core-promoter recognition or a role in essential
           catalytic activity. Each TAF, with the help of a
           specific activator, is required only for the expression
           of subset of genes and is not universally involved for
           transcription as are GTFs. In yeast and human cells,
           TAFs have been found as components of other complexes
           besides TFIID.   Several TAFs interact via histone-fold
           (HFD) motifs; HFD is the interaction motif involved in
           heterodimerization of the core histones and their
           assembly into nucleosome octamers. The minimal HFD
           contains three alpha-helices linked by two loops and is
           found in core histones, TAFS and many other
           transcription factors. TFIID has a histone octamer-like
           substructure. TAF4 domain interacts with TAF12 and makes
           a novel histone-like heterodimer that binds DNA and has
           a core promoter function of a subset of genes.
          Length = 212

 Score = 29.2 bits (66), Expect = 0.70
 Identities = 17/58 (29%), Positives = 33/58 (56%), Gaps = 2/58 (3%)

Query: 91  EFRSETKKE-GNLGFRKRQGEKKKKNEEKEEEKKK-KEEEEKKKKEEKEREEEKREEE 146
           E  S+ +K+   L   +R+ E+K+  EE+E   +  K   E+ + ++K +E +K E+E
Sbjct: 107 EITSDVRKQLRFLEQLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDE 164



 Score = 26.1 bits (58), Expect = 7.8
 Identities = 10/27 (37%), Positives = 18/27 (66%)

Query: 124 KKEEEEKKKKEEKEREEEKREEENVEE 150
           + E EE++K++E+ERE   R  ++  E
Sbjct: 121 QLEREEEEKRDEEERERLLRAAKSRSE 147


>gnl|CDD|240253 PTZ00068, PTZ00068, 60S ribosomal protein L13a; Provisional.
          Length = 202

 Score = 28.9 bits (65), Expect = 0.78
 Identities = 13/43 (30%), Positives = 20/43 (46%), Gaps = 5/43 (11%)

Query: 113 KKNEEKEEEK-----KKKEEEEKKKKEEKEREEEKREEENVEE 150
            K EEK +E+     KKK +  K  KE +++   K  +  V  
Sbjct: 153 AKLEEKRKERAAAYYKKKVKLRKAWKEARKKALAKLPKAIVAV 195


>gnl|CDD|184724 PRK14520, rpsP, 30S ribosomal protein S16; Provisional.
          Length = 155

 Score = 28.9 bits (65), Expect = 0.79
 Identities = 10/37 (27%), Positives = 10/37 (27%)

Query: 114 KNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
              E    KKKK   E    E      E       EE
Sbjct: 117 PTAEATTPKKKKAAAEAAAAEAAAPAAEAAAAAAAEE 153


>gnl|CDD|216381 pfam01237, Oxysterol_BP, Oxysterol-binding protein. 
          Length = 335

 Score = 29.1 bits (66), Expect = 0.90
 Identities = 16/32 (50%), Positives = 19/32 (59%)

Query: 119 EEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           EE    + EEEK + EEK+RE  K  EE  EE
Sbjct: 276 EEGDYDEAEEEKLRLEEKQRERRKEREEKGEE 307


>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
           complex component).  RNases H are enzymes that
           specifically hydrolyse RNA when annealed to a
           complementary DNA and are present in all living
           organisms. In yeast RNase H2 is composed of a complex of
           three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
           family represents the homologues of Ydr279p. It is not
           known whether non yeast proteins in this family fulfil
           the same function.
          Length = 287

 Score = 28.8 bits (65), Expect = 0.91
 Identities = 13/25 (52%), Positives = 16/25 (64%)

Query: 112 KKKNEEKEEEKKKKEEEEKKKKEEK 136
           +KK + KEE KKKK +E K  K  K
Sbjct: 249 EKKRKSKEEIKKKKPKESKGVKALK 273


>gnl|CDD|221952 pfam13166, AAA_13, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           conjugative transfer proteins. This family includes the
           PrrC protein that is thought to be the active component
           of the anticodon nuclease.
          Length = 713

 Score = 29.2 bits (66), Expect = 0.93
 Identities = 13/37 (35%), Positives = 25/37 (67%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
           E KK+ ++ EE+ ++ E E +KK+EE E+ + K  ++
Sbjct: 99  ELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDK 135



 Score = 27.3 bits (61), Expect = 3.9
 Identities = 16/39 (41%), Positives = 26/39 (66%)

Query: 109 GEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
           GE+  + E + EE KK+ ++ ++K E+ E E EK+EEE 
Sbjct: 87  GEENIEIEAQIEELKKELKKLEEKIEQLEAEIEKKEEEL 125


>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 156 to 174 amino acids in length. This domain is
           found associated with pfam07780, pfam01728.
          Length = 154

 Score = 28.4 bits (64), Expect = 0.94
 Identities = 15/41 (36%), Positives = 25/41 (60%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           EK+++ EE+ E ++  EEE+  +  EKE  + KRE+    E
Sbjct: 100 EKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENE 140


>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6.  In yeast, 15 Apg proteins
           coordinate the formation of autophagosomes. Autophagy is
           a bulk degradation process induced by starvation in
           eukaryotic cells. Apg6/Vps30p has two distinct functions
           in the autophagic process, either associated with the
           membrane or in a retrieval step of the carboxypeptidase
           Y sorting pathway.
          Length = 356

 Score = 29.1 bits (65), Expect = 0.95
 Identities = 12/41 (29%), Positives = 22/41 (53%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           E KK+ E   +E ++ E+E+     E    +E++E+   EE
Sbjct: 74  ELKKEEERLLDELEELEKEDDDLDGELVELQEEKEQLENEE 114


>gnl|CDD|222613 pfam14235, DUF4337, Domain of unknown function (DUF4337).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 187 and 201 amino
           acids in length. There is a single completely conserved
           residue Q that may be functionally important.
          Length = 158

 Score = 28.3 bits (64), Expect = 0.98
 Identities = 9/34 (26%), Positives = 17/34 (50%), Gaps = 1/34 (2%)

Query: 108 QGEKKKKNEEKEEEKKKKEE-EEKKKKEEKEREE 140
           Q +  +  +EK   + + +E E K K+ E E + 
Sbjct: 73  QAKIARYKKEKARYRSEAKELEAKAKEAEAESDH 106



 Score = 27.5 bits (62), Expect = 1.7
 Identities = 6/36 (16%), Positives = 16/36 (44%)

Query: 112 KKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
           + K    ++EK +   E K+ + + +  E + +   
Sbjct: 73  QAKIARYKKEKARYRSEAKELEAKAKEAEAESDHAL 108


>gnl|CDD|239571 cd03489, Topoisomer_IB_N_LdtopoI_like,
           Topoisomer_IB_N_LdtopoI_like: N-terminal DNA binding
           fragment found in eukaryotic DNA topoisomerase (topo) IB
           proteins similar to the heterodimeric topo I from
           Leishmania donvanni. Topo I enzymes are divided into:
           topo type IA (bacterial) and type IB (eukaryotic). Topo
           I relaxes superhelical tension in duplex DNA by creating
           a single-strand nick, the broken strand can then rotate
           around the unbroken strand to remove DNA supercoils and,
           the nick is religated, liberating topo I. These enzymes
           regulate the topological changes that accompany DNA
           replication, transcription and other nuclear processes. 
           Human topo I is the target of a diverse set of
           anticancer drugs including camptothecins (CPTs). CPTs
           bind to the topo I-DNA complex and inhibit re-ligation
           of the single-strand nick, resulting in the accumulation
           of topo I-DNA adducts. In addition to differences in
           structure and some biochemical properties,
           Trypanosomatid parasite topo I differ from human topo I
           in their sensitivity to CPTs and other classical topo I
           inhibitors. Trypanosomatid topo I play putative roles in
           organizing the kinetoplast DNA network unique to these
           parasites.  This family may represent more than one
           structural domain.
          Length = 212

 Score = 28.7 bits (64), Expect = 1.0
 Identities = 13/27 (48%), Positives = 20/27 (74%)

Query: 116 EEKEEEKKKKEEEEKKKKEEKEREEEK 142
            EKE++K + +EE+K  KEEK++E E 
Sbjct: 93  REKEKKKSRTKEEKKALKEEKDKEAEP 119


>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
          Length = 213

 Score = 28.7 bits (65), Expect = 1.0
 Identities = 10/45 (22%), Positives = 22/45 (48%)

Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           +R  ++ KK E   E++K    E K K +   ++ + R ++   +
Sbjct: 113 QRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQ 157



 Score = 26.0 bits (58), Expect = 6.7
 Identities = 12/47 (25%), Positives = 23/47 (48%)

Query: 104 FRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
            + ++ E++ K  E   EK+K    E+K K +  R++ K   +  E 
Sbjct: 110 VQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEP 156



 Score = 26.0 bits (58), Expect = 8.0
 Identities = 8/46 (17%), Positives = 24/46 (52%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           ++ + + KK+    E+EK  + E + K K  +++ + + ++   + 
Sbjct: 113 QRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQH 158


>gnl|CDD|185768 cd09245, BRO1_UmRIM23-like, Protein-interacting, Bro1-like domain
           of Ustilago maydis Rim23 (PalC), and related domains.
           This family contains the Bro1-like domain of Ustilago
           maydis Rim23 (also known as PalC), and related proteins.
           It belongs to the BRO1_Alix_like superfamily which
           includes the Bro1-like domains of mammalian Alix
           (apoptosis-linked gene-2 interacting protein X),
           His-Domain type N23 protein tyrosine phosphatase
           (HD-PTP, also known as PTPN23), RhoA-binding proteins
           Rhophilin-1 and -2, Brox, Bro1 and Rim20 (also known as
           PalA) from Saccharomyces cerevisiae, and related
           domains. Alix, HD-PTP, Brox, Bro1, Rim20, and Rim23
           interact with the ESCRT (Endosomal Sorting Complexes
           Required for Transport) system. Rim20 and Rim23
           participate in the response to the external pH via the
           Rim101 pathway. Through its Bro1-like domain, Rim23
           allows the interaction between the endosomal and plasma
           membrane complexes. Bro1-like domains are
           boomerang-shape, and part of the domain is a
           tetratricopeptide repeat (TPR)-like structure.
           Intermediates in the Rim101 pathway may play roles in
           the pathogenesis of fungal corneal infection during
           Candida albicans keratitis. This family lacks the
           V-shaped (V) domain found in many members of the
           BRO1_Alix_like superfamily.
          Length = 413

 Score = 28.9 bits (65), Expect = 1.1
 Identities = 15/47 (31%), Positives = 23/47 (48%), Gaps = 4/47 (8%)

Query: 97  KKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKR 143
           KKE  L   K       K + K+  K+K+  E++K ++    EEE R
Sbjct: 323 KKE--LEDLKSPSGVASKAKLKKSWKEKR--EDRKVEKGAGVEEELR 365


>gnl|CDD|218517 pfam05236, TAF4, Transcription initiation factor TFIID component
           TAF4 family.  This region of similarity is found in
           Transcription initiation factor TFIID component TAF4.
          Length = 255

 Score = 28.5 bits (64), Expect = 1.1
 Identities = 16/61 (26%), Positives = 35/61 (57%), Gaps = 2/61 (3%)

Query: 86  SNGLLEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
           S+   E RS+ +++      ++Q E++++  E+  E    E+ E+ + ++K +EE+K E 
Sbjct: 101 SDPNYEIRSDVRRQLR-FLAQKQKEEEERRVERRRE-LGLEDPEQLRLKQKAKEEQKAES 158

Query: 146 E 146
           E
Sbjct: 159 E 159



 Score = 26.6 bits (59), Expect = 5.3
 Identities = 11/37 (29%), Positives = 21/37 (56%)

Query: 114 KNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           K +E+EE + ++  E   +  E+ R ++K +EE   E
Sbjct: 121 KQKEEEERRVERRRELGLEDPEQLRLKQKAKEEQKAE 157


>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein. 
          Length = 198

 Score = 28.6 bits (64), Expect = 1.1
 Identities = 13/45 (28%), Positives = 26/45 (57%)

Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
            R+    KK+++K++EK  +EE++ +  ++K  E E   + N E 
Sbjct: 59  PRKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENENDPDNNEES 103



 Score = 28.6 bits (64), Expect = 1.2
 Identities = 12/46 (26%), Positives = 26/46 (56%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           +KR    +K    K+ +KK KE+  +++K+ +  +++  E EN  +
Sbjct: 53  KKRPTTPRKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENENDPD 98



 Score = 26.6 bits (59), Expect = 4.1
 Identities = 17/44 (38%), Positives = 23/44 (52%), Gaps = 2/44 (4%)

Query: 105 RKRQGEKKKKN--EEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
           RK    KK K   +EK  E++KK E +  K EE E + +  EE 
Sbjct: 60  RKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENENDPDNNEES 103



 Score = 26.6 bits (59), Expect = 5.4
 Identities = 10/45 (22%), Positives = 26/45 (57%)

Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
            +Q +KK+    ++    KK +++ K+K  +E ++ + +++  EE
Sbjct: 48  PKQPKKKRPTTPRKPATTKKSKKKDKEKLTEEEKKPESDDDKTEE 92


>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
           Provisional.
          Length = 695

 Score = 28.8 bits (65), Expect = 1.1
 Identities = 13/37 (35%), Positives = 20/37 (54%), Gaps = 5/37 (13%)

Query: 114 KNEEKEEEKKKKEEEEKK-----KKEEKEREEEKREE 145
           K E +  E++KK+ EE K     ++   ERE+  RE 
Sbjct: 435 KAEIRAIEQEKKKAEEAKARFEARQARLEREKAAREA 471


>gnl|CDD|233459 TIGR01541, tape_meas_lam_C, phage tail tape measure protein, lambda
           family.  This model represents a relatively
           well-conserved region near the C-terminus of the tape
           measure protein of a lambda and related phage. This
           protein, which controls phage tail length, is typically
           about 1000 residues in length. Both low-complexity
           sequence and insertion/deletion events appear common in
           this family. Mutational studies suggest a ruler or
           template role in the determination of phage tail length.
           Similar behavior is attributed to proteins from
           distantly related or unrelated families in other phage
           [Mobile and extrachromosomal element functions, Prophage
           functions].
          Length = 332

 Score = 28.6 bits (64), Expect = 1.1
 Identities = 10/62 (16%), Positives = 24/62 (38%)

Query: 89  LLEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENV 148
           LL  +   +K   L     +  + + +E     K +K  EE ++K  +  ++      ++
Sbjct: 5   LLTQQIADRKLKKLNTADEKSLQSRSDEIIALIKLEKLLEEAERKALEALKKLAEATASI 64

Query: 149 EE 150
             
Sbjct: 65  RA 66


>gnl|CDD|216259 pfam01039, Carboxyl_trans, Carboxyl transferase domain.  All of the
           members in this family are biotin dependent
           carboxylases. The carboxyl transferase domain carries
           out the following reaction; transcarboxylation from
           biotin to an acceptor molecule. There are two recognised
           types of carboxyl transferase. One of them uses acyl-CoA
           and the other uses 2-oxoacid as the acceptor molecule of
           carbon dioxide. All of the members in this family
           utilise acyl-CoA as the acceptor molecule.
          Length = 487

 Score = 28.7 bits (65), Expect = 1.1
 Identities = 6/29 (20%), Positives = 13/29 (44%)

Query: 114 KNEEKEEEKKKKEEEEKKKKEEKEREEEK 142
           + E      + K+ E  + ++  E EE+ 
Sbjct: 416 RKELAAATMRYKDLEALRAQKIAEYEEQL 444


>gnl|CDD|162338 TIGR01398, FlhA, flagellar biosynthesis protein FlhA.  This model
           describes flagellar biosynthesis protein FlhA, one of a
           large number of genes associated with the biosynthesis
           of functional bacterial flagella. Homologs of many such
           proteins, including FlhA, function in type III protein
           secretion systems. A separate model describes InvA
           (Salmonella enterica), LcrD (Yersinia enterocolitica),
           HrcV (Xanthomonas), etc., all of which score below the
           noise cutoff for this model [Cellular processes,
           Chemotaxis and motility].
          Length = 678

 Score = 28.8 bits (65), Expect = 1.2
 Identities = 8/29 (27%), Positives = 17/29 (58%)

Query: 122 KKKKEEEEKKKKEEKEREEEKREEENVEE 150
           + K+EEE   +  + + E  + EEE++ +
Sbjct: 312 RSKQEEEAAAEAAKAQEEAAEEEEESIND 340



 Score = 28.4 bits (64), Expect = 1.4
 Identities = 10/26 (38%), Positives = 15/26 (57%)

Query: 122 KKKKEEEEKKKKEEKEREEEKREEEN 147
           ++ K+EEE   +  K +EE   EEE 
Sbjct: 311 RRSKQEEEAAAEAAKAQEEAAEEEEE 336



 Score = 26.8 bits (60), Expect = 4.7
 Identities = 9/26 (34%), Positives = 14/26 (53%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEK 130
           R +Q E+      K +E+  +EEEE 
Sbjct: 312 RSKQEEEAAAEAAKAQEEAAEEEEES 337


>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
           This is a family of fungal proteins whose function is
           unknown.
          Length = 130

 Score = 28.0 bits (63), Expect = 1.2
 Identities = 10/35 (28%), Positives = 21/35 (60%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKERE 139
             R   ++ K   ++E++ +K  E+K K+ +KE+E
Sbjct: 95  ALRLRRERTKERAEKEKRTRKNREKKFKRRQKEKE 129


>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
           This family represents Cwf15/Cwc15 (from
           Schizosaccharomyces pombe and Saccharomyces cerevisiae
           respectively) and their homologues. The function of
           these proteins is unknown, but they form part of the
           spliceosome and are thus thought to be involved in mRNA
           splicing.
          Length = 241

 Score = 28.5 bits (64), Expect = 1.2
 Identities = 17/33 (51%), Positives = 24/33 (72%), Gaps = 2/33 (6%)

Query: 118 KEEEKKKKEEEEKKKKEEKER--EEEKREEENV 148
           +E EK KKE  E+K++EE+E+  EEEK  EE +
Sbjct: 149 RELEKIKKERAEEKEREEEEKAAEEEKAREEEI 181



 Score = 27.4 bits (61), Expect = 3.1
 Identities = 18/32 (56%), Positives = 24/32 (75%), Gaps = 1/32 (3%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEE 141
           E +K  +E+ EEK ++EEEEK  +EEK REEE
Sbjct: 150 ELEKIKKERAEEK-EREEEEKAAEEEKAREEE 180


>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
          Length = 1437

 Score = 28.7 bits (65), Expect = 1.3
 Identities = 15/45 (33%), Positives = 26/45 (57%)

Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           K + EK +  +E+E+EK  KE  E  KK E E++++ +  +  E 
Sbjct: 169 KEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEG 213


>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
          Length = 1094

 Score = 28.7 bits (64), Expect = 1.3
 Identities = 17/40 (42%), Positives = 23/40 (57%)

Query: 111 KKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           KKKK ++ +E+KKKK  E  K+    E  EE  E  N E+
Sbjct: 685 KKKKKKKSKEKKKKKNREASKQIHALENIEEGIEGSNNEK 724


>gnl|CDD|235245 PRK04183, PRK04183, glutamyl-tRNA(Gln) amidotransferase subunit D;
           Validated.
          Length = 419

 Score = 28.7 bits (65), Expect = 1.3
 Identities = 7/38 (18%), Positives = 17/38 (44%)

Query: 101 NLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKER 138
           N+G    +  + +  E+ E+ K++   +E +K      
Sbjct: 40  NIGIDIDKIAEIELLEKGEKPKQEPPPKEIEKDPGLPN 77


>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein.  This
           family includes the radial spoke head proteins RSP4 and
           RSP6 from Chlamydomonas reinhardtii, and several
           eukaryotic homologues, including mammalian RSHL1, the
           protein product of a familial ciliary dyskinesia
           candidate gene.
          Length = 481

 Score = 28.5 bits (64), Expect = 1.4
 Identities = 13/38 (34%), Positives = 26/38 (68%)

Query: 108 QGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
           QG     N E+++E++++E+EE++++EE+  E E  E 
Sbjct: 336 QGRCTWVNPEQKDEEEEQEDEEEEEEEEEPEEPEPEEG 373



 Score = 26.6 bits (59), Expect = 7.0
 Identities = 14/28 (50%), Positives = 22/28 (78%)

Query: 123 KKKEEEEKKKKEEKEREEEKREEENVEE 150
           ++K+EEE+++ EE+E EEE+ EE   EE
Sbjct: 345 EQKDEEEEQEDEEEEEEEEEPEEPEPEE 372



 Score = 26.2 bits (58), Expect = 9.3
 Identities = 13/31 (41%), Positives = 20/31 (64%), Gaps = 2/31 (6%)

Query: 120 EEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
             ++K EEEE  +++E+E EEE+  EE   E
Sbjct: 343 NPEQKDEEEE--QEDEEEEEEEEEPEEPEPE 371


>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
          Length = 246

 Score = 28.1 bits (63), Expect = 1.4
 Identities = 12/38 (31%), Positives = 17/38 (44%)

Query: 113 KKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
            K E K + K K + +  KK EE+ + E K  E     
Sbjct: 93  PKPEPKPKPKPKPKPKPVKKVEEQPKREVKPVEPRPAS 130



 Score = 28.1 bits (63), Expect = 1.6
 Identities = 12/31 (38%), Positives = 17/31 (54%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREE 140
           E K K + K + K  K+ EE+ K+E K  E 
Sbjct: 96  EPKPKPKPKPKPKPVKKVEEQPKREVKPVEP 126


>gnl|CDD|178635 PLN03086, PLN03086, PRLI-interacting factor K; Provisional.
          Length = 567

 Score = 28.3 bits (63), Expect = 1.5
 Identities = 15/43 (34%), Positives = 24/43 (55%), Gaps = 2/43 (4%)

Query: 104 FRKRQGEKKKKNEEKE--EEKKKKEEEEKKKKEEKEREEEKRE 144
           F  R+  +K + E++E  +  K K E E+K KEE  ++ E  E
Sbjct: 3   FELRRAREKLEREQRERKQRAKLKLERERKAKEEAAKQREAIE 45


>gnl|CDD|218311 pfam04888, SseC, Secretion system effector C (SseC) like family.
           SseC is a secreted protein that forms a complex together
           with SecB and SecD on the surface of Salmonella. All
           these proteins are secreted by the type III secretion
           system. Many mucosal pathogens use type III secretion
           systems for the injection of effector proteins into
           target cells. SecB, SseC and SecD are inserted into the
           target cell membrane. where they form a small pore or
           translocon. In addition to SseC, this family includes
           the bacterial secreted proteins PopB, PepB, YopB and
           EspD which are thought to be directly involved in pore
           formation, and type III secretion system translocon.
          Length = 303

 Score = 28.2 bits (63), Expect = 1.5
 Identities = 7/32 (21%), Positives = 20/32 (62%)

Query: 111 KKKKNEEKEEEKKKKEEEEKKKKEEKEREEEK 142
           K ++ E   ++++KK EE +++ ++   + E+
Sbjct: 20  KLQQLERARDKQEKKAEEYQEQIKKAIEKAEE 51


>gnl|CDD|151656 pfam11214, Med2, Mediator complex subunit 2.  This family of
           mediator complex subunit 2 proteins is conserved in
           fungi. Cyclin-dependent kinase CDK8 or Srb10 interacts
           with and phosphorylates Med2. Post-translational
           modifications of Mediator subunits are important for
           regulation of gene expression.
          Length = 99

 Score = 27.4 bits (61), Expect = 1.5
 Identities = 13/20 (65%), Positives = 18/20 (90%)

Query: 112 KKKNEEKEEEKKKKEEEEKK 131
           + K +++EEE+KKKEEEEKK
Sbjct: 80  ENKKKQEEEERKKKEEEEKK 99



 Score = 27.0 bits (60), Expect = 2.2
 Identities = 14/19 (73%), Positives = 18/19 (94%)

Query: 120 EEKKKKEEEEKKKKEEKER 138
           E KKK+EEEE+KKKEE+E+
Sbjct: 80  ENKKKQEEEERKKKEEEEK 98


>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740).  This
           family consists of several uncharacterized plant
           proteins of unknown function.
          Length = 565

 Score = 28.5 bits (63), Expect = 1.5
 Identities = 16/35 (45%), Positives = 23/35 (65%)

Query: 116 EEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
            E+EEE + +E+EE  +KE  +  +EK EEE  EE
Sbjct: 140 LEEEEEVEMEEDEEYYEKEPGKVVDEKSEEEEEEE 174


>gnl|CDD|191675 pfam07047, OPA3, Optic atrophy 3 protein (OPA3).  This family
           consists of several optic atrophy 3 (OPA3) proteins.
           OPA3 deficiency causes type III 3-methylglutaconic
           aciduria (MGA) in humans. This disease manifests with
           early bilateral optic atrophy, spasticity,
           extrapyramidal dysfunction, ataxia, and cognitive
           deficits, but normal longevity.
          Length = 134

 Score = 27.6 bits (62), Expect = 1.6
 Identities = 11/34 (32%), Positives = 21/34 (61%)

Query: 113 KKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
           +K  +KEEE +++ EE + +  E E E E+++  
Sbjct: 101 RKEAKKEEELQQELEELEARVGELELEIERQKAR 134


>gnl|CDD|233752 TIGR02153, gatD_arch, glutamyl-tRNA(Gln) amidotransferase, subunit
           D.  This peptide is found only in the Archaea. It is
           part of a heterodimer, with GatE (TIGR00134), that acts
           as an amidotransferase on misacylated Glu-tRNA(Gln) to
           produce Gln-tRNA(Gln). The analogous amidotransferase
           found in bacteria is the GatABC system, although GatABC
           homologs in the Archaea appear to act instead on
           Asp-tRNA(Asn) [Protein synthesis, tRNA aminoacylation].
          Length = 404

 Score = 28.5 bits (64), Expect = 1.6
 Identities = 9/38 (23%), Positives = 17/38 (44%)

Query: 101 NLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKER 138
           N+GF+  +    +  EE  E ++     E +KK    +
Sbjct: 27  NIGFKISEIRNIEVLEEGSEPREVPPPAEIEKKPGLPK 64


>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
          Length = 1068

 Score = 28.5 bits (64), Expect = 1.7
 Identities = 8/25 (32%), Positives = 16/25 (64%)

Query: 119 EEEKKKKEEEEKKKKEEKEREEEKR 143
            EE K +E+   K + + ER++++R
Sbjct: 583 GEETKPQEQPAPKAEAKPERQQDRR 607



 Score = 27.3 bits (61), Expect = 3.5
 Identities = 10/46 (21%), Positives = 28/46 (60%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
            ++Q  ++++   + +EK++ ++E K    E++  +E  +EE V++
Sbjct: 670 EQQQAPRRERQRRRNDEKRQAQQEAKALNVEEQSVQETEQEERVQQ 715


>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62. 
          Length = 217

 Score = 27.8 bits (62), Expect = 1.7
 Identities = 12/66 (18%), Positives = 27/66 (40%), Gaps = 9/66 (13%)

Query: 93  RSETKKEGNLGFRKRQGEKKKKNEEKEEEKKK---------KEEEEKKKKEEKEREEEKR 143
           R+   ++      K   E   K   +++  +K          E  +K   +EK+ E++K 
Sbjct: 14  RALESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKP 73

Query: 144 EEENVE 149
           +++ V 
Sbjct: 74  KKKKVP 79


>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein.  Function of MutS2 is
           unknown. It should not be considered a DNA mismatch
           repair protein. It is likely a DNA mismatch binding
           protein of unknown cellular function [DNA metabolism,
           Other].
          Length = 771

 Score = 28.2 bits (63), Expect = 1.7
 Identities = 17/70 (24%), Positives = 35/70 (50%), Gaps = 1/70 (1%)

Query: 76  KKRLTRECTVSNGLLEFRSETKKEGNLGFRKRQGEKKKKNEEKEE-EKKKKEEEEKKKKE 134
           K          N L+E  S  +KE        +   K++ + K+E E++ +E +E+++ +
Sbjct: 503 KTFYGEFKEEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNK 562

Query: 135 EKEREEEKRE 144
           + E E+E +E
Sbjct: 563 KLELEKEAQE 572



 Score = 27.5 bits (61), Expect = 3.0
 Identities = 10/39 (25%), Positives = 21/39 (53%)

Query: 108 QGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
             E+     EK    +K+ E++ +  E+  +E+EK ++E
Sbjct: 509 FKEEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKE 547



 Score = 27.5 bits (61), Expect = 3.6
 Identities = 22/61 (36%), Positives = 33/61 (54%), Gaps = 1/61 (1%)

Query: 91  EFRSETKKEGNLGFRKRQGEKKKKNEEKEE-EKKKKEEEEKKKKEEKEREEEKREEENVE 149
            F  E K+E N+   K    +K+  ++ E  EK  KE+E+ KK+ E+E EE K  E N +
Sbjct: 504 TFYGEFKEEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKK 563

Query: 150 E 150
            
Sbjct: 564 L 564



 Score = 26.7 bits (59), Expect = 5.7
 Identities = 13/41 (31%), Positives = 23/41 (56%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
           R+R  + + + E +E  K  K+E E   +E KE++  K +E
Sbjct: 558 RERNKKLELEKEAQEALKALKKEVESIIRELKEKKIHKAKE 598



 Score = 26.7 bits (59), Expect = 6.6
 Identities = 15/42 (35%), Positives = 26/42 (61%), Gaps = 1/42 (2%)

Query: 105 RKRQGEKKKKNEEK-EEEKKKKEEEEKKKKEEKEREEEKREE 145
            K   +K +  E+  +E++K K+E E++ +E KERE  K+ E
Sbjct: 524 EKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLE 565


>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa).  Members of this
           family of proteins are part of the yeast nuclear pore
           complex-associated pre-60S ribosomal subunit. The family
           functions as a highly conserved exonuclease that is
           required for the 5'-end maturation of 5.8S and 25S
           rRNAs, demonstrating that 5'-end processing also has a
           redundant pathway. Nop25 binds late pre-60S ribosomes,
           accompanying them from the nucleolus to the nuclear
           periphery; and there is evidence for both physical and
           functional links between late 60S subunit processing and
           export.
          Length = 134

 Score = 27.7 bits (62), Expect = 1.7
 Identities = 19/46 (41%), Positives = 36/46 (78%), Gaps = 2/46 (4%)

Query: 103 GFRKRQGEKKKKNEE--KEEEKKKKEEEEKKKKEEKEREEEKREEE 146
           GF KR+ +++KK +E  KE+E++++ EE K+ +EE+++E EK+ +E
Sbjct: 26  GFHKRKQQRRKKAQEEAKEKEREERIEERKRIREERKQELEKQLKE 71


>gnl|CDD|214697 smart00498, FH2, Formin Homology 2 Domain.  FH proteins control
           rearrangements of the actin cytoskeleton, especially in
           the context of cytokinesis and cell polarisation.
           Members of this family have been found to interact with
           Rho-GTPases, profilin and other actin-assoziated
           proteins. These interactions are mediated by the
           proline-rich FH1 domain, usually located in front of FH2
           (but not listed in SMART). Despite this cytosolic
           function, vertebrate formins have been assigned
           functions within the nucleus. A set of Formin-Binding
           Proteins (FBPs) has been shown to bind FH1 with their WW
           domain.
          Length = 392

 Score = 28.1 bits (63), Expect = 1.8
 Identities = 12/44 (27%), Positives = 27/44 (61%)

Query: 104 FRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
           F K   +  ++N +KEEE++++ ++  K+  E E+   +++E N
Sbjct: 326 FLKEFSKAAEENIKKEEEEEERRKKLVKETTEYEQSSSRQKERN 369


>gnl|CDD|223250 COG0172, SerS, Seryl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 429

 Score = 28.3 bits (64), Expect = 1.8
 Identities = 13/41 (31%), Positives = 21/41 (51%)

Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
           K  G   K+ E+  EE   + +E K+K +E E   ++ E E
Sbjct: 57  KEIGRALKRGEDDAEELIAEVKELKEKLKELEAALDELEAE 97


>gnl|CDD|223079 PHA03419, PHA03419, E4 protein; Provisional.
          Length = 200

 Score = 28.0 bits (62), Expect = 1.8
 Identities = 16/49 (32%), Positives = 26/49 (53%)

Query: 99  EGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
           E +L  +   G KKK+ ++KE EK  +  E+  +  E + E E  E E+
Sbjct: 81  EKDLALQPPPGGKKKEKKKKETEKPAQGGEKPDQGPEAKGEGEGHEPED 129



 Score = 27.2 bits (60), Expect = 2.9
 Identities = 10/44 (22%), Positives = 22/44 (50%)

Query: 103 GFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
           G +K++ +KK+  +  +  +K  +  E K + E    E+   E+
Sbjct: 91  GGKKKEKKKKETEKPAQGGEKPDQGPEAKGEGEGHEPEDPPPED 134


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 728

 Score = 28.1 bits (63), Expect = 1.9
 Identities = 17/57 (29%), Positives = 32/57 (56%)

Query: 94  SETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
            + KKE    F++++   +++  E EEE K ++   K  K  ++ ++E+ EEE  EE
Sbjct: 437 KKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDEE 493


>gnl|CDD|119356 cd02877, GH18_hevamine_XipI_class_III, This conserved domain family
           includes xylanase inhibitor Xip-I, and the class III
           plant chitinases such as hevamine, concanavalin B, and
           PPL2, all of which have a glycosyl hydrolase family 18
           (GH18) domain. Hevamine is a class III endochitinase
           that hydrolyzes the linear polysaccharide chains of
           chitin and peptidoglycan and is important for defense
           against pathogenic bacteria and fungi.  PPL2 (Parkia
           platycephala lectin 2) is a class III chitinase from
           Parkia platycephala seeds that hydrolyzes beta(1-4)
           glycosidic bonds linking
           2-acetoamido-2-deoxy-beta-D-glucopyranose units in
           chitin.
          Length = 280

 Score = 28.0 bits (63), Expect = 1.9
 Identities = 9/40 (22%), Positives = 19/40 (47%), Gaps = 4/40 (10%)

Query: 17  SIAKRYGGIPAGESNGRRGYM----LTYIIAYIRDFACDY 52
           S AK + G+PA       GY+    L  ++  ++  + ++
Sbjct: 218 SNAKVFLGLPASPEAAGSGYVDPSELASLVLPVKQKSPNF 257


>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota).  DNA
           Topoisomerase I (eukaryota), DNA topoisomerase V,
           Vaccina virus topoisomerase, Variola virus
           topoisomerase, Shope fibroma virus topoisomeras.
          Length = 391

 Score = 28.1 bits (63), Expect = 2.1
 Identities = 15/36 (41%), Positives = 26/36 (72%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREE 140
            K + + +K + E +E+KK+K++EEKKKK+ +  EE
Sbjct: 315 SKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEE 350


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 28.0 bits (62), Expect = 2.1
 Identities = 13/36 (36%), Positives = 20/36 (55%)

Query: 115 NEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           N+ +E E K   E   K + E++ + E+ EEE  EE
Sbjct: 295 NDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEE 330


>gnl|CDD|215349 PLN02647, PLN02647, acyl-CoA thioesterase.
          Length = 437

 Score = 27.8 bits (62), Expect = 2.1
 Identities = 18/34 (52%), Positives = 22/34 (64%), Gaps = 1/34 (2%)

Query: 117 EKEEEKKKKEE-EEKKKKEEKEREEEKREEENVE 149
           E EEEK   EE E + K  +K+R E+KRE EN E
Sbjct: 225 ETEEEKLLFEEAEARNKLRKKKRGEQKREFENGE 258


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 28.0 bits (62), Expect = 2.1
 Identities = 10/40 (25%), Positives = 20/40 (50%)

Query: 110  EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVE 149
            + +K NE+  EE  ++ EE  +   + + E E+ E    +
Sbjct: 4058 DDEKMNEDGFEENVQENEESTEDGVKSDEELEQGEVPEDQ 4097


>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
           (DUF2058).  This domain, found in various prokaryotic
           proteins, has no known function.
          Length = 177

 Score = 27.6 bits (62), Expect = 2.1
 Identities = 14/43 (32%), Positives = 28/43 (65%), Gaps = 1/43 (2%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKERE-EEKREEE 146
           RK++ + +K  ++ ++E K+  EE K +K E++RE   +R+ E
Sbjct: 25  RKQRKQARKGADDGDDELKQAAEEAKAEKAERDRELNRQRQAE 67


>gnl|CDD|221429 pfam12118, SprA-related, SprA-related family.  This protein is
           found in bacteria. Proteins in this family are typically
           between 234 to 465 amino acids in length. There is a
           conserved GEV sequence motif.Most members are annotated
           as being SprA-related.
          Length = 261

 Score = 27.8 bits (62), Expect = 2.2
 Identities = 5/37 (13%), Positives = 25/37 (67%)

Query: 108 QGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKRE 144
             +++   EE+ E+ + +  ++  +K+++++++E+++
Sbjct: 76  SNDEEVVGEEEPEKDQAEAGKQLSQKQQEQQQQEEKQ 112


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 28.0 bits (63), Expect = 2.2
 Identities = 16/55 (29%), Positives = 33/55 (60%), Gaps = 3/55 (5%)

Query: 97  KKEGNLGFRKRQGEKKKKNEEKEEE---KKKKEEEEKKKKEEKEREEEKREEENV 148
           ++E  L  +    +KK++N EK+E+    K+K  +EK+++ E+   E++ E E +
Sbjct: 87  QREETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQREELERI 141


>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon. 
          Length = 431

 Score = 28.1 bits (62), Expect = 2.3
 Identities = 16/45 (35%), Positives = 31/45 (68%)

Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           K++ E+++K  E+EE+++K+EE ++K +EE+E+   K E E    
Sbjct: 220 KKKREERRKVLEEEEQRRKQEEADRKSREEEEKRRLKEEIERRRA 264



 Score = 26.2 bits (57), Expect = 9.5
 Identities = 16/51 (31%), Positives = 31/51 (60%)

Query: 100 GNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           G  G +   G++ +K ++K++E   + EE KKK+EE+ +  E+ E+   +E
Sbjct: 190 GAEGAQVEAGKEFEKLKQKQQEAALELEELKKKREERRKVLEEEEQRRKQE 240


>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein.  This entry is a highly
           conserved protein present in eukaryotes.
          Length = 680

 Score = 28.0 bits (62), Expect = 2.3
 Identities = 13/43 (30%), Positives = 23/43 (53%)

Query: 108 QGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
             ++K K   +  EK+ K E + +   EK+  EEK+ ++  EE
Sbjct: 469 SAKQKDKQSMQSMEKRLKSEADSRVNAEKQLAEEKKRKKEEEE 511



 Score = 26.0 bits (57), Expect = 10.0
 Identities = 13/43 (30%), Positives = 24/43 (55%)

Query: 97  KKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKERE 139
           K++  +  +K + + K K EE    +K+ +E  K ++ EKE E
Sbjct: 535 KQDLEMEIKKLEHDLKLKEEECRMLEKEAQELRKYQESEKETE 577


>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457).  This is
           a family of uncharacterized proteins.
          Length = 449

 Score = 28.0 bits (62), Expect = 2.3
 Identities = 10/45 (22%), Positives = 30/45 (66%)

Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           K++   +K  +E EEE  ++E+++++  ++ + E+E  ++++ +E
Sbjct: 33  KKENAIRKLGKEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDE 77


>gnl|CDD|237631 PRK14162, PRK14162, heat shock protein GrpE; Provisional.
          Length = 194

 Score = 27.5 bits (61), Expect = 2.4
 Identities = 10/37 (27%), Positives = 24/37 (64%)

Query: 111 KKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
           K++   EK+  +++  +E  KK+ ++  +EE +E++N
Sbjct: 3   KEEFPSEKDLPQEETTDEAPKKEAKEAPKEEDQEKQN 39



 Score = 26.0 bits (57), Expect = 8.4
 Identities = 12/32 (37%), Positives = 19/32 (59%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEE 141
           EK    EE  +E  KKE +E  K+E++E++  
Sbjct: 9   EKDLPQEETTDEAPKKEAKEAPKEEDQEKQNP 40


>gnl|CDD|202551 pfam03142, Chitin_synth_2, Chitin synthase.  Members of this family
           are fungal chitin synthase EC:2.4.1.16 enzymes. They
           catalyze chitin synthesis as follows:
           UDP-N-acetyl-D-glucosamine +
           {(1,4)-(N-acetyl-beta-D-glucosaminyl)}(N) <=> UDP +
           {(1,4)-(N-acetyl-beta-D-glucosaminyl)}(N+1).
          Length = 527

 Score = 27.8 bits (62), Expect = 2.4
 Identities = 15/35 (42%), Positives = 20/35 (57%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKERE 139
           R  +GEKKKK   ++E K    +   K+ EE ERE
Sbjct: 493 RVVEGEKKKKAHGEDEGKFDSSKIPMKRWEEYERE 527


>gnl|CDD|220172 pfam09309, FCP1_C, FCP1, C-terminal.  The C-terminal domain of
           FCP-1 is required for interaction with the carboxy
           terminal domain of RAP74. Interaction relies extensively
           on van der Waals contacts between hydrophobic residues
           situated within alpha-helices in both domains.
          Length = 263

 Score = 27.7 bits (61), Expect = 2.5
 Identities = 15/51 (29%), Positives = 26/51 (50%)

Query: 94  SETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKRE 144
             + +  N+G R  +G K+K NEE EE+ + +   E   ++E+    E  E
Sbjct: 200 PSSSERSNIGGRGPRGHKRKLNEEDEEDAESESSFESSNEDEEGSSSEADE 250


>gnl|CDD|217787 pfam03910, Adeno_PV, Adenovirus minor core protein PV. 
          Length = 336

 Score = 27.9 bits (62), Expect = 2.5
 Identities = 11/32 (34%), Positives = 21/32 (65%), Gaps = 3/32 (9%)

Query: 118 KEEEKKKKEEEEKKKKEEKEREEEKREEENVE 149
           K+E K +K +  KK+K+ +E++E    ++ VE
Sbjct: 25  KDEAKPRKIKRVKKRKKREEKDE---LDDEVE 53


>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
           membranes [Cell envelope biogenesis, outer membrane].
          Length = 244

 Score = 27.4 bits (61), Expect = 2.5
 Identities = 7/37 (18%), Positives = 22/37 (59%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
           + K+K + +++ KK K + + K K + + + + + ++
Sbjct: 83  KPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKK 119



 Score = 27.4 bits (61), Expect = 3.2
 Identities = 10/40 (25%), Positives = 21/40 (52%)

Query: 111 KKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           K+K   EK+ +K K + + K K + K + + K ++   + 
Sbjct: 85  KEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKKPPSKT 124



 Score = 26.7 bits (59), Expect = 5.6
 Identities = 9/40 (22%), Positives = 19/40 (47%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVE 149
           EK K  ++ ++ K K + + K K + K + + K+      
Sbjct: 86  EKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKKPPSKTA 125



 Score = 26.3 bits (58), Expect = 6.7
 Identities = 7/32 (21%), Positives = 15/32 (46%)

Query: 111 KKKKNEEKEEEKKKKEEEEKKKKEEKEREEEK 142
           KK K + K + K K + + + K ++   +   
Sbjct: 95  KKPKPKPKPKPKPKPKVKPQPKPKKPPSKTAA 126


>gnl|CDD|236278 PRK08506, PRK08506, replicative DNA helicase; Provisional.
          Length = 472

 Score = 27.7 bits (62), Expect = 2.6
 Identities = 14/33 (42%), Positives = 23/33 (69%), Gaps = 1/33 (3%)

Query: 113 KKNEEKEEEKK-KKEEEEKKKKEEKEREEEKRE 144
           K+ EEKE+EKK KKE +E+++   + +  E+ E
Sbjct: 391 KEREEKEKEKKAKKEGKEERRIHFQNKSIEEAE 423



 Score = 27.7 bits (62), Expect = 2.6
 Identities = 13/33 (39%), Positives = 21/33 (63%), Gaps = 1/33 (3%)

Query: 118 KEEEKKKKEEEEKKK-KEEKEREEEKREEENVE 149
           KE E+K+KE++ KK+ KEE+    + +  E  E
Sbjct: 391 KEREEKEKEKKAKKEGKEERRIHFQNKSIEEAE 423


>gnl|CDD|220102 pfam09073, BUD22, BUD22.  BUD22 has been shown in yeast to be a
           nuclear protein involved in bud-site selection. It plays
           a role in positioning the proximal bud pole signal. More
           recently it has been shown to be involved in ribosome
           biogenesis.
          Length = 424

 Score = 27.9 bits (62), Expect = 2.6
 Identities = 12/45 (26%), Positives = 29/45 (64%)

Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           K + + KK  +++ +E   K++EE+ + E++ + EE  E+++ +E
Sbjct: 150 KAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDE 194


>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
          Length = 191

 Score = 27.5 bits (61), Expect = 2.7
 Identities = 15/43 (34%), Positives = 26/43 (60%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
           +K++   +KK   K+ +KKKKE+EE  +   +E  + +  EEN
Sbjct: 52  KKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEEN 94


>gnl|CDD|241486 cd13332, FERM_C_JAK1, Janus kinase 1 FERM domain C-lobe.  JAK1 is a
           tyrosine kinase protein essential in signaling type I
           and type II cytokines. It interacts with the gamma chain
           of type I cytokine receptors to elicit signals from the
           IL-2 receptor family, the IL-4 receptor family, the
           gp130 receptor family, ciliary neurotrophic factor
           receptor (CNTF-R), neurotrophin-1 receptor (NNT-1R) and
           Leptin-R). It also is involved in transducing a signal
           by type I (IFN-alpha/beta) and type II (IFN-gamma)
           interferons, and members of the IL-10 family via type II
           cytokine receptors. JAK (also called Just Another
           Kinase) is a family of intracellular, non-receptor
           tyrosine kinases that transduce cytokine-mediated
           signals via the JAK-STAT pathway. The JAK family in
           mammals consists of 4 members: JAK1, JAK2, JAK3 and
           TYK2. JAKs are composed of seven JAK homology (JH)
           domains (JH1-JH7) . The C-terminal JH1 domain is the
           main catalytic domain, followed by JH2, which is often
           referred to as a pseudokinase domain, followed by
           JH3-JH4 which is homologous to the SH2 domain, and
           lastly JH5-JH7 which is a FERM domain.  Named after
           Janus, the two-faced Roman god of doorways, JAKs possess
           two near-identical phosphate-transferring domains; one
           which displays the kinase activity (JH1), while the
           other negatively regulates the kinase activity of the
           first (JH2). The FERM domain has a cloverleaf tripart
           structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
           C-lobe/A-lobe,A-lobe, B-lobe, C-lobe/F1, F2, F3). The
           C-lobe/F3 within the FERM domain is part of the PH
           domain family. The FERM domain is found in the
           cytoskeletal-associated proteins such as ezrin, moesin,
           radixin, 4.1R, and merlin. These proteins provide a link
           between the membrane and cytoskeleton and are involved
           in signal transduction pathways. The FERM domain is also
           found in protein tyrosine phosphatases (PTPs) , the
           tyrosine kinases FAK and JAK, in addition to other
           proteins involved in signaling. This domain is
           structurally similar to the PH and PTB domains and
           consequently is capable of binding to both peptides and
           phospholipids at different sites.
          Length = 198

 Score = 27.5 bits (61), Expect = 2.8
 Identities = 17/42 (40%), Positives = 23/42 (54%), Gaps = 6/42 (14%)

Query: 100 GNLGFRKR------QGEKKKKNEEKEEEKKKKEEEEKKKKEE 135
           GN G   R        EKKKK + K+ + K K++E+KKK  E
Sbjct: 74  GNTGISWRRKPATTAVEKKKKGKSKKNKLKGKKDEDKKKARE 115


>gnl|CDD|148072 pfam06244, DUF1014, Protein of unknown function (DUF1014).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 122

 Score = 26.6 bits (59), Expect = 2.9
 Identities = 13/33 (39%), Positives = 18/33 (54%)

Query: 116 EEKEEEKKKKEEEEKKKKEEKEREEEKREEENV 148
           E  E+E++K E  E+K K+   RE    E  NV
Sbjct: 7   ERLEKEQRKLEAPEEKSKKNAAREVPLEENLNV 39


>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
           Mitofilin controls mitochondrial cristae morphology.
           Mitofilin is enriched in the narrow space between the
           inner boundary and the outer membranes, where it forms a
           homotypic interaction and assembles into a large
           multimeric protein complex. The first 78 amino acids
           contain a typical amino-terminal-cleavable mitochondrial
           presequence rich in positive-charged and hydroxylated
           residues and a membrane anchor domain. In addition, it
           has three centrally located coiled coil domains.
          Length = 493

 Score = 27.7 bits (62), Expect = 3.0
 Identities = 15/62 (24%), Positives = 33/62 (53%), Gaps = 3/62 (4%)

Query: 89  LLEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENV 148
           L E ++E ++E     ++++ E   K EE+   +    E ++   E++ R E +RE+E +
Sbjct: 179 LAELKAEEEEELERALKEKREELLSKLEEELLARL---ESKEAALEKQLRLEFEREKEEL 235

Query: 149 EE 150
            +
Sbjct: 236 RK 237


>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
           domain.  This domain is found in a number of different
           types of plant proteins including NAM-like proteins.
          Length = 147

 Score = 27.0 bits (60), Expect = 3.0
 Identities = 13/42 (30%), Positives = 27/42 (64%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
           R+ + + KK+  EKE+EK+++  +   + E++  E EK++ E
Sbjct: 73  RRDKLKAKKEEAEKEKEKEERFMKALAEAEKERAELEKKKAE 114



 Score = 26.2 bits (58), Expect = 5.1
 Identities = 17/47 (36%), Positives = 28/47 (59%), Gaps = 1/47 (2%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKE-EKEREEEKREEENVEE 150
           + R+ + K K EE E+EK+K+E   K   E EKER E ++++   + 
Sbjct: 71  KLRRDKLKAKKEEAEKEKEKEERFMKALAEAEKERAELEKKKAEAKL 117



 Score = 25.4 bits (56), Expect = 9.3
 Identities = 12/46 (26%), Positives = 31/46 (67%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           RK+  EK ++++ K ++++ ++E+EK+++  K   E ++E   +E+
Sbjct: 65  RKKAKEKLRRDKLKAKKEEAEKEKEKEERFMKALAEAEKERAELEK 110


>gnl|CDD|237147 PRK12586, PRK12586, putative monovalent cation/H+ antiporter
           subunit G; Reviewed.
          Length = 145

 Score = 27.0 bits (60), Expect = 3.1
 Identities = 8/33 (24%), Positives = 13/33 (39%)

Query: 113 KKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
           +KN+            E+   E  +   +KREE
Sbjct: 104 RKNDAHTHASILLSSNEQNSTEALQLRAKKREE 136


>gnl|CDD|163616 cd00840, MPP_Mre11_N, Mre11 nuclease, N-terminal
          metallophosphatase domain.  Mre11 (also known as SbcD
          in Escherichia coli) is a subunit of the MRX protein
          complex. This complex includes: Mre11, Rad50, and
          Xrs2/Nbs1, and plays a vital role in several nuclear
          processes including DNA double-strand break repair,
          telomere length maintenance, cell cycle checkpoint
          control, and meiotic recombination, in eukaryotes.
          During double-strand break repair, the MRX complex is
          required to hold the two ends of a broken chromosome
          together.  In vitro studies show that Mre11 has 3'-5'
          exonuclease activity on dsDNA templates and
          endonuclease activity on dsDNA and ssDNA templates. In
          addition to the N-terminal phosphatase domain, the
          eukaryotic MRE11 members of this family have a
          C-terminal DNA binding domain (not included in this
          alignment model).  MRE11-like proteins are found in
          prokaryotes and archaea was well as in eukaryotes.
          Mre11 belongs to the metallophosphatase (MPP)
          superfamily.  MPPs are functionally diverse, but all
          share a conserved domain with an active site consisting
          of two metal ions (usually manganese, iron, or zinc)
          coordinated with octahedral geometry by a cage of
          histidine, aspartate, and asparagine residues. The MPP
          superfamily includes: Mre11/SbcD-like exonucleases,
          Dbr1-like RNA lariat debranching enzymes, YfcE-like
          phosphodiesterases, purple acid phosphatases (PAPs),
          YbbF-like UDP-2,3-diacylglucosamine hydrolases, and
          acid sphingomyelinases (ASMases).  The conserved domain
          is a double beta-sheet sandwich with a di-metal active
          site made up of residues located at the C-terminal side
          of the sheets. This domain is thought to allow for
          productive metal coordination.
          Length = 223

 Score = 27.3 bits (61), Expect = 3.1
 Identities = 11/43 (25%), Positives = 20/43 (46%), Gaps = 1/43 (2%)

Query: 38 LTYIIAYIRDFACDYYFI-GDSFETSVPWDKTVLLCINVKKRL 79
             I+    +   D+  I GD F+++ P  + + L I   +RL
Sbjct: 30 FEEIVELAIEEKVDFVLIAGDLFDSNNPSPEALELLIEALRRL 72


>gnl|CDD|215521 PLN02967, PLN02967, kinase.
          Length = 581

 Score = 27.7 bits (61), Expect = 3.2
 Identities = 17/68 (25%), Positives = 28/68 (41%), Gaps = 8/68 (11%)

Query: 91  EFRSETKKEGNLGFRKRQGEKKKK--------NEEKEEEKKKKEEEEKKKKEEKEREEEK 142
           E         +   +K     ++K         EEK E+K +K  + KK  E+ E +  +
Sbjct: 97  ELVVNEDAALDKESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKMDEDVEDQGSE 156

Query: 143 REEENVEE 150
            E  +VEE
Sbjct: 157 SEVSDVEE 164


>gnl|CDD|219355 pfam07267, Nucleo_P87, Nucleopolyhedrovirus capsid protein P87.
           This family consists of several Nucleopolyhedrovirus
           capsid protein P87 sequences. P87 is expressed late in
           infection and concentrated in infected cell nuclei.
          Length = 606

 Score = 27.5 bits (61), Expect = 3.3
 Identities = 11/46 (23%), Positives = 23/46 (50%)

Query: 102 LGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
              RKR+  +     E   ++ + + +E +   EKER+  + E++N
Sbjct: 353 GPTRKRKRRRVPPLPEYSSDEDEDDSDEDEVDYEKERKRRREEDKN 398


>gnl|CDD|235185 PRK03980, PRK03980, flap endonuclease-1; Provisional.
          Length = 292

 Score = 27.1 bits (61), Expect = 3.3
 Identities = 12/30 (40%), Positives = 19/30 (63%)

Query: 114 KNEEKEEEKKKKEEEEKKKKEEKEREEEKR 143
           K EE EE ++ +EE E+K +E KE  + + 
Sbjct: 40  KAEEIEERREVREEAEEKYEEAKEEGDLEE 69


>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572).  Family of
           eukaryotic proteins with undetermined function.
          Length = 321

 Score = 27.4 bits (61), Expect = 3.3
 Identities = 11/50 (22%), Positives = 24/50 (48%)

Query: 101 NLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           N    K   E++++ E++ EE+   +  +K +    + + E    E +EE
Sbjct: 107 NYEADKLDEEQEERVEKEREEELAGDAMKKLENRTADSKREMEVLERLEE 156


>gnl|CDD|224143 COG1222, RPT1, ATP-dependent 26S proteasome regulatory subunit
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 406

 Score = 27.2 bits (61), Expect = 3.3
 Identities = 13/63 (20%), Positives = 25/63 (39%), Gaps = 5/63 (7%)

Query: 93  RSETKKEGNLGFRKRQG-----EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
                 +  LG  +         K +  + K  EK+K+    ++++ E E    KRE + 
Sbjct: 1   EELDALDEILGDLESYEPQEYLNKLEDTKLKLLEKEKRLLLLEEQRLEAEGLRLKREVDR 60

Query: 148 VEE 150
           + E
Sbjct: 61  LRE 63


>gnl|CDD|217206 pfam02731, SKIP_SNW, SKIP/SNW domain.  This domain is found in
           chromatin proteins.
          Length = 158

 Score = 26.9 bits (60), Expect = 3.4
 Identities = 13/32 (40%), Positives = 22/32 (68%), Gaps = 2/32 (6%)

Query: 116 EEKEEEKKKKEEEEKKKKEEKERE--EEKREE 145
            ++ E +++  E+EK++KEEK RE  +  REE
Sbjct: 126 RQRAELQRQLAEKEKQEKEEKLRELAQRAREE 157


>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2.  PPP4R2 (protein phosphatase 4 core
           regulatory subunit R2) is the regulatory subunit of the
           histone H2A phosphatase complex. It has been shown to
           confer resistance to the anticancer drug cisplatin in
           yeast, and may confer resistance in higher eukaryotes.
          Length = 285

 Score = 27.1 bits (60), Expect = 3.6
 Identities = 15/40 (37%), Positives = 27/40 (67%)

Query: 111 KKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           K KK++++E++ +  +  E+K+ +E E EEE  EEE  E+
Sbjct: 243 KNKKSDDEEDDDQDGDYVEEKELKEDEEEEETEEEEEEED 282


>gnl|CDD|234306 TIGR03674, fen_arch, flap structure-specific endonuclease.
           Endonuclease that cleaves the 5'-overhanging flap
           structure that is generated by displacement synthesis
           when DNA polymerase encounters the 5'-end of a
           downstream Okazaki fragment. Has 5'-endo-/exonuclease
           and 5'-pseudo-Y-endonuclease activities. Cleaves the
           junction between single and double-stranded regions of
           flap DNA.
          Length = 338

 Score = 27.2 bits (61), Expect = 3.6
 Identities = 10/30 (33%), Positives = 18/30 (60%)

Query: 114 KNEEKEEEKKKKEEEEKKKKEEKEREEEKR 143
           K E  EE ++ +EE E+K +E  E+ + + 
Sbjct: 87  KAETLEERREIREEAEEKWEEALEKGDLEE 116


>gnl|CDD|236277 PRK08493, PRK08493, NADH dehydrogenase subunit G; Validated.
          Length = 819

 Score = 27.4 bits (61), Expect = 3.7
 Identities = 17/65 (26%), Positives = 30/65 (46%)

Query: 77  KRLTRECTVSNGLLEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEK 136
           K+   E  +   L EF+    KE  L   +   EK  +  E+  E +++ ++E  KK +K
Sbjct: 440 KKFLEEEAILKSLEEFKQSIVKEAALSILEEIREKVLEQAEQGCENQEEVKKEVPKKVKK 499

Query: 137 EREEE 141
             E +
Sbjct: 500 IPEVD 504



 Score = 26.2 bits (58), Expect = 8.8
 Identities = 11/64 (17%), Positives = 24/64 (37%)

Query: 87  NGLLEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
               +   +      L   + +  ++ +   + +E+ KKE  +K KK  +       EE 
Sbjct: 453 EEFKQSIVKEAALSILEEIREKVLEQAEQGCENQEEVKKEVPKKVKKIPEVDTYLLLEEL 512

Query: 147 NVEE 150
            + E
Sbjct: 513 GINE 516


>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355).  This
           family of proteins is found in bacteria and viruses.
           Proteins in this family are typically between 180 and
           214 amino acids in length.
          Length = 125

 Score = 26.5 bits (59), Expect = 3.7
 Identities = 19/60 (31%), Positives = 34/60 (56%)

Query: 91  EFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           E ++ T KE +    K + + +KK EEK+ E +K  +   ++K E E E+ ++E E +E 
Sbjct: 4   EEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEELEA 63



 Score = 26.1 bits (58), Expect = 5.4
 Identities = 12/36 (33%), Positives = 19/36 (52%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
           E +K  +   EEK + E E+ +K+ E+   E  R E
Sbjct: 34  EAEKLAKMSAEEKAEYELEKLEKELEELEAELARRE 69


>gnl|CDD|215364 PLN02678, PLN02678, seryl-tRNA synthetase.
          Length = 448

 Score = 27.4 bits (61), Expect = 3.8
 Identities = 8/40 (20%), Positives = 18/40 (45%)

Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
           K   + K   E+  E   + +E +K+  E++   +E +  
Sbjct: 61  KEVAKLKIAKEDATELIAETKELKKEITEKEAEVQEAKAA 100


>gnl|CDD|225381 COG2825, HlpA, Outer membrane protein [Cell envelope biogenesis,
           outer membrane].
          Length = 170

 Score = 27.0 bits (60), Expect = 3.8
 Identities = 10/42 (23%), Positives = 22/42 (52%)

Query: 104 FRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
           F+KRQ E +K  +E + ++ K +++ K +      + E   +
Sbjct: 53  FKKRQKELQKMQKELKAKEAKLQDDGKMEALSDRAKAEAEIK 94


>gnl|CDD|240292 PTZ00148, PTZ00148, 40S ribosomal protein S8; Provisional.
          Length = 205

 Score = 26.9 bits (60), Expect = 3.9
 Identities = 12/40 (30%), Positives = 25/40 (62%)

Query: 111 KKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           KKKK +++EE KK K++ E    + + R++  + ++ + E
Sbjct: 123 KKKKYKKEEENKKAKKKSESLVAKLEVRKKNHKIDKALLE 162


>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein.  This
           family includes archaebacterial L12, eukaryotic P0, P1
           and P2.
          Length = 88

 Score = 26.0 bits (58), Expect = 3.9
 Identities = 9/21 (42%), Positives = 14/21 (66%)

Query: 121 EKKKKEEEEKKKKEEKEREEE 141
                 EEEKK++EE+E E++
Sbjct: 61  AAAAAAEEEKKEEEEEEEEDD 81



 Score = 25.3 bits (56), Expect = 5.9
 Identities = 8/20 (40%), Positives = 12/20 (60%)

Query: 127 EEEKKKKEEKEREEEKREEE 146
                 +EEK+ EEE+ EE+
Sbjct: 61  AAAAAAEEEKKEEEEEEEED 80



 Score = 25.3 bits (56), Expect = 7.0
 Identities = 9/21 (42%), Positives = 12/21 (57%)

Query: 121 EKKKKEEEEKKKKEEKEREEE 141
                   E++KKEE+E EEE
Sbjct: 59  AAAAAAAAEEEKKEEEEEEEE 79



 Score = 25.3 bits (56), Expect = 7.9
 Identities = 7/21 (33%), Positives = 14/21 (66%)

Query: 109 GEKKKKNEEKEEEKKKKEEEE 129
                  EEK+EE++++EE++
Sbjct: 61  AAAAAAEEEKKEEEEEEEEDD 81


>gnl|CDD|218538 pfam05285, SDA1, SDA1.  This family consists of several SDA1
           protein homologues. SDA1 is a Saccharomyces cerevisiae
           protein which is involved in the control of the actin
           cytoskeleton. The protein is essential for cell
           viability and is localised in the nucleus.
          Length = 317

 Score = 26.9 bits (60), Expect = 4.0
 Identities = 11/34 (32%), Positives = 20/34 (58%)

Query: 117 EKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           E+E+++  K+ +E   +E  E +EE+  EE   E
Sbjct: 135 EEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAE 168


>gnl|CDD|218435 pfam05104, Rib_recp_KP_reg, Ribosome receptor lysine/proline rich
           region.  This highly conserved region is found towards
           the C-terminus of the transmembrane domain. The function
           is unclear.
          Length = 151

 Score = 26.5 bits (58), Expect = 4.1
 Identities = 17/40 (42%), Positives = 23/40 (57%)

Query: 111 KKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           KK+K  EK+ + KKKEE+   K  E E  +E  E E + E
Sbjct: 29  KKEKVSEKKGKSKKKEEKPNGKIPEHEPNQEVTEVEVIIE 68


>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
          Length = 158

 Score = 26.6 bits (59), Expect = 4.1
 Identities = 10/35 (28%), Positives = 26/35 (74%)

Query: 116 EEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           E ++E+K+  +E E+K+  + ++E+++++E+ V E
Sbjct: 94  ESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAE 128



 Score = 26.6 bits (59), Expect = 4.5
 Identities = 13/30 (43%), Positives = 23/30 (76%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKERE 139
           EK++K   KE E+K+  + +K+KKE+KE++
Sbjct: 96  EKEQKEVSKETEEKEAIKAKKEKKEKKEKK 125



 Score = 25.9 bits (57), Expect = 6.6
 Identities = 9/32 (28%), Positives = 24/32 (75%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEE 141
           E +K+ +E  +E ++KE  + KK++++++E++
Sbjct: 94  ESEKEQKEVSKETEEKEAIKAKKEKKEKKEKK 125


>gnl|CDD|215505 PLN02935, PLN02935, Bifunctional NADH kinase/NAD(+) kinase.
          Length = 508

 Score = 27.1 bits (60), Expect = 4.2
 Identities = 13/44 (29%), Positives = 22/44 (50%)

Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVE 149
           +R  E K   + +  E K+K E E+ + ++ E +E    E N E
Sbjct: 51  RRVAEGKALAQAEAAEWKRKYELERARNQQLEHKELSSGECNEE 94


>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3.  This family
           of proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). Shr3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of Shr3, AAPs are retained in the ER.
          Length = 196

 Score = 26.6 bits (59), Expect = 4.3
 Identities = 10/36 (27%), Positives = 19/36 (52%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
           + K+K E   EE+K+      KK    ++ E K+++
Sbjct: 161 DAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196



 Score = 26.6 bits (59), Expect = 4.6
 Identities = 11/46 (23%), Positives = 23/46 (50%)

Query: 99  EGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKRE 144
           +  L + +R+  K+K+    EE K+      KK    ++ E +K++
Sbjct: 151 QAGLWYAERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196



 Score = 26.2 bits (58), Expect = 6.3
 Identities = 13/35 (37%), Positives = 16/35 (45%), Gaps = 5/35 (14%)

Query: 91  EFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKK 125
           EF +E +KE           KK    +K E KKKK
Sbjct: 167 EFAAEERKE-----ALAAAAKKSATPQKVETKKKK 196


>gnl|CDD|131526 TIGR02473, flagell_FliJ, flagellar export protein FliJ.  Members of
           this family are the FliJ protein found, in nearly every
           case, in the midst of other flagellar biosynthesis genes
           in bacgterial genomes. Typically the fliJ gene is found
           adjacent to the gene for the flagellum-specific ATPase
           FliI. Sequence scoring in the gray zone between trusted
           and noise cutoffs include both probable FliJ proteins
           and components of bacterial type III secretion systems.
          Length = 141

 Score = 26.5 bits (59), Expect = 4.3
 Identities = 14/47 (29%), Positives = 28/47 (59%)

Query: 104 FRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
            ++    K+++  E   E K  E+ ++KK++E   EE KRE++ ++E
Sbjct: 87  LQQEVEAKRERLLEARRELKALEKLKEKKQKEYRAEEAKREQKEMDE 133


>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6.  RPL6
           contains KOW motif that has an extra ribosomal role as
           an oncogenic. KOW domain is known as an RNA-binding
           motif that is shared so far among some families of
           ribosomal proteins, the essential bacterial
           transcriptional elongation factor NusG, the eukaryotic
           chromatin elongation factor Spt5, the higher eukaryotic
           KIN17 proteins and Mtr4. .
          Length = 152

 Score = 26.4 bits (59), Expect = 4.4
 Identities = 13/36 (36%), Positives = 26/36 (72%), Gaps = 2/36 (5%)

Query: 104 FRKRQGEKKKKNEEKEEEKKKKEEE--EKKKKEEKE 137
           F++++ +KKKK E +  E+KKK+    E++K+++K 
Sbjct: 81  FKRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKA 116



 Score = 25.6 bits (57), Expect = 8.3
 Identities = 11/31 (35%), Positives = 23/31 (74%)

Query: 114 KNEEKEEEKKKKEEEEKKKKEEKEREEEKRE 144
           K ++K+++KKK+ E  ++KK++    EE++E
Sbjct: 82  KRKKKKKKKKKEGEFFEEKKKKYVVSEERKE 112



 Score = 25.6 bits (57), Expect = 9.0
 Identities = 13/38 (34%), Positives = 26/38 (68%)

Query: 113 KKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           K+ ++K+++KK+ E  E+KKK+    EE K +++ V+ 
Sbjct: 82  KRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKAVDA 119


>gnl|CDD|240341 PTZ00272, PTZ00272, heat shock protein 83 kDa (Hsp83); Provisional.
          Length = 701

 Score = 26.9 bits (59), Expect = 4.5
 Identities = 15/41 (36%), Positives = 28/41 (68%), Gaps = 1/41 (2%)

Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEE-KEREEEKREE 145
           ++  EK+  +E++E+ KK  E+ E+ K EE KE +E K+++
Sbjct: 208 EKTTEKEVTDEDEEDTKKADEDGEEPKVEEVKEGDEGKKKK 248


>gnl|CDD|197876 smart00792, Agouti, Agouti protein.  The agouti protein regulates
           pigmentation in the mouse hair follicle producing a
           black hair with a subapical yellow band. A highly
           homologous protein agouti signal protein (ASIP) is
           present in humans and is expressed at highest levels in
           adipose tissue where it may play a role in energy
           homeostasis and possibly human pigmentation.
          Length = 124

 Score = 26.4 bits (58), Expect = 4.5
 Identities = 10/25 (40%), Positives = 14/25 (56%)

Query: 111 KKKKNEEKEEEKKKKEEEEKKKKEE 135
           K KK   +E EKK  +++EKK    
Sbjct: 53  KPKKISAEEAEKKLLQKKEKKALTN 77


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 26.9 bits (60), Expect = 4.7
 Identities = 18/57 (31%), Positives = 32/57 (56%), Gaps = 8/57 (14%)

Query: 102 LGFRKRQ--GEKKKKNEEK---EEEKKKKEEEEKKKK---EEKEREEEKREEENVEE 150
           L F +R    ++ K  EE    E E+ KK E E+ ++   EE++ EEE+  +E+ ++
Sbjct: 245 LTFDRRAQPTDRTKTEEELAKEEAERLKKLEAERLRRMRGEEEDDEEEEDSKESADD 301


>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1.  All
           proteins in this family for which functions are known
           are cyclin dependent protein kinases that are components
           of TFIIH, a complex that is involved in nucleotide
           excision repair and transcription initiation. Also known
           as MAT1 (menage a trois 1). This family is based on the
           phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
           Stanford University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 309

 Score = 26.7 bits (59), Expect = 4.8
 Identities = 13/60 (21%), Positives = 30/60 (50%)

Query: 73  INVKKRLTRECTVSNGLLEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKK 132
           I   ++  ++    N     R + + E  L F K + E+++   +KEEE+++  + + K+
Sbjct: 122 IETYQKENKDVIQKNKEKSTREQEELEEALEFEKEEEEQRRLLLQKEEEEQQMNKRKNKQ 181


>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
          Length = 211

 Score = 26.6 bits (59), Expect = 4.9
 Identities = 18/43 (41%), Positives = 24/43 (55%), Gaps = 2/43 (4%)

Query: 110 EKKKKNEEKEEEKKKKEE--EEKKKKEEKEREEEKREEENVEE 150
            K  K+E  EE+  K+ E  EE K KEE    EE  +EE +E+
Sbjct: 5   CKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIED 47



 Score = 26.6 bits (59), Expect = 5.3
 Identities = 15/41 (36%), Positives = 26/41 (63%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           +  K+NE KEE+K K+E+ E ++ E++E  E+  E   V+ 
Sbjct: 16  DCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKI 56


>gnl|CDD|205200 pfam13019, Telomere_Sde2, Telomere stability and silencing.  Sde2
           has been identified in fission yeast as an important
           factor in telomere formation and maintenance. This is a
           more N-terminal domain on these nuclear proteins, and is
           essential for telomeric silencing and genomic stability.
          Length = 163

 Score = 26.3 bits (58), Expect = 5.0
 Identities = 13/33 (39%), Positives = 16/33 (48%), Gaps = 2/33 (6%)

Query: 115 NEEKE--EEKKKKEEEEKKKKEEKEREEEKREE 145
           NE K   E    K E EKK+KE + +  E   E
Sbjct: 127 NEAKALAEYLAIKPEMEKKEKEARRKRWESVVE 159


>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
          Length = 508

 Score = 27.0 bits (60), Expect = 5.1
 Identities = 12/57 (21%), Positives = 25/57 (43%)

Query: 91  EFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
             R E ++E     +K +    +  +    E + +E E+     E E EE +++ +N
Sbjct: 74  REREELQREEERLVQKEEQLDARAEKLDNLENQLEEREKALSARELELEELEKQLDN 130



 Score = 26.2 bits (58), Expect = 9.5
 Identities = 10/61 (16%), Positives = 26/61 (42%)

Query: 89  LLEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENV 148
           LL      +++     R+    ++++  +KEE+   + E+    + + E  E+      +
Sbjct: 60  LLLRERNQQRQEARREREELQREEERLVQKEEQLDARAEKLDNLENQLEEREKALSAREL 119

Query: 149 E 149
           E
Sbjct: 120 E 120


>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
          Length = 135

 Score = 26.2 bits (58), Expect = 5.3
 Identities = 11/27 (40%), Positives = 21/27 (77%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEK 136
            K+KK ++K+++KKKK+  +K  K++K
Sbjct: 109 AKQKKQKKKKKKKKKKKTSKKAAKKKK 135



 Score = 26.2 bits (58), Expect = 6.0
 Identities = 10/28 (35%), Positives = 21/28 (75%)

Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKK 133
           K + ++KK+ ++K+++KKKK  ++  KK
Sbjct: 106 KEKAKQKKQKKKKKKKKKKKTSKKAAKK 133


>gnl|CDD|237028 PRK12267, PRK12267, methionyl-tRNA synthetase; Reviewed.
          Length = 648

 Score = 26.7 bits (60), Expect = 5.3
 Identities = 12/29 (41%), Positives = 17/29 (58%)

Query: 116 EEKEEEKKKKEEEEKKKKEEKEREEEKRE 144
           EE    K++ E    K+ EEKE++ EK E
Sbjct: 514 EEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542



 Score = 25.9 bits (58), Expect = 9.2
 Identities = 9/31 (29%), Positives = 17/31 (54%)

Query: 116 EEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
            E+E    K++ E    KE +E+E++  + E
Sbjct: 512 VEEEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542


>gnl|CDD|205219 pfam13038, DUF3899, Domain of unknown function (DUF3899).  Putative
           Tryptophanyl-tRNA synthetase.
          Length = 92

 Score = 25.7 bits (57), Expect = 5.3
 Identities = 13/34 (38%), Positives = 22/34 (64%), Gaps = 2/34 (5%)

Query: 103 GFRK--RQGEKKKKNEEKEEEKKKKEEEEKKKKE 134
           GFR+  RQ  +KK+ +  EE++   +++ KKKK 
Sbjct: 33  GFRRLFRQLVEKKEKDSIEEDEFANDKKVKKKKY 66


>gnl|CDD|219996 pfam08738, Gon7, Gon7 family.  In S. cerevisiae Gon7 is a member of
           the KEOPS protein complex. A protein complex proposed to
           be involved in transcription and promoting telomere
           uncapping and telomere elongation.
          Length = 97

 Score = 25.5 bits (56), Expect = 5.7
 Identities = 11/26 (42%), Positives = 18/26 (69%)

Query: 116 EEKEEEKKKKEEEEKKKKEEKEREEE 141
           E  EE+KKK   + + +KE +++EEE
Sbjct: 69  ERMEEDKKKAAAKGRGEKEIEDKEEE 94


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 26.6 bits (59), Expect = 5.8
 Identities = 15/46 (32%), Positives = 23/46 (50%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           R+R  E + + EE EE  ++ +E+ +  KEE E  E   EE     
Sbjct: 308 RERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLL 353


>gnl|CDD|235396 PRK05299, rpsB, 30S ribosomal protein S2; Provisional.
          Length = 258

 Score = 26.3 bits (59), Expect = 5.8
 Identities = 14/34 (41%), Positives = 24/34 (70%)

Query: 107 RQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREE 140
           RQG   +  EE+EEE +++EEEE++++ E+   E
Sbjct: 225 RQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258



 Score = 25.9 bits (58), Expect = 9.8
 Identities = 11/34 (32%), Positives = 23/34 (67%)

Query: 113 KKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
           ++    E  ++++EE E++++EE+E E E+ E E
Sbjct: 225 RQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258


>gnl|CDD|185616 PTZ00436, PTZ00436, 60S ribosomal protein L19-like protein;
           Provisional.
          Length = 357

 Score = 26.8 bits (58), Expect = 5.9
 Identities = 16/53 (30%), Positives = 29/53 (54%), Gaps = 6/53 (11%)

Query: 93  RSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
           ++E KKE  L       E+      K+E+ + K  +++ +K EK+RE  +RE+
Sbjct: 146 KNEKKKERQLA------EQLAAKRLKDEQHRHKARKQELRKREKDRERARRED 192


>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family.  This
           family includes gbp a protein from Soybean that binds to
           GAGA element dinucleotide repeat DNA. It seems likely
           that the this domain mediates DNA binding. This putative
           domain contains several conserved cysteines and a
           histidine suggesting this may be a zinc-binding DNA
           interaction domain.
          Length = 301

 Score = 26.4 bits (58), Expect = 5.9
 Identities = 9/57 (15%), Positives = 17/57 (29%), Gaps = 2/57 (3%)

Query: 93  RSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVE 149
             E K+       +     K    +K   KKK     +  K        K + ++ +
Sbjct: 141 PPEAKEVKKPKKGQSPKVPKAPKPKKP--KKKGSVSNRSVKMPGIDPRSKPDWKSQD 195


>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
           [Translation, ribosomal structure and biogenesis].
          Length = 109

 Score = 25.8 bits (57), Expect = 6.1
 Identities = 9/33 (27%), Positives = 17/33 (51%)

Query: 109 GEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEE 141
                   E   E  + EEEEK+++ E+E +++
Sbjct: 70  AAAAAAGAEAAAEADEAEEEEKEEEAEEESDDD 102



 Score = 25.1 bits (55), Expect = 9.7
 Identities = 7/33 (21%), Positives = 16/33 (48%)

Query: 108 QGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREE 140
                   E   E  + +EEE++++ EE+  ++
Sbjct: 70  AAAAAAGAEAAAEADEAEEEEKEEEAEEESDDD 102


>gnl|CDD|227084 COG4741, COG4741, Predicted secreted endonuclease distantly related
           to archaeal Holliday junction resolvase [Nucleotide
           transport and metabolism].
          Length = 175

 Score = 26.3 bits (58), Expect = 6.2
 Identities = 16/51 (31%), Positives = 30/51 (58%), Gaps = 4/51 (7%)

Query: 99  EGNLGFRKRQGEKKKKNEEKE----EEKKKKEEEEKKKKEEKEREEEKREE 145
           +G +  + R+ E+  +  E+E    E + +KEEE K K+  +++ EE RE+
Sbjct: 28  QGKVESKARELEETLQKAERERLVNEAQARKEEEWKLKEWIEKKIEEARED 78


>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II).  Bone
           sialoprotein (BSP) is a major structural protein of the
           bone matrix that is specifically expressed by
           fully-differentiated osteoblasts. The expression of bone
           sialoprotein (BSP) is normally restricted to mineralised
           connective tissues of bones and teeth where it has been
           associated with mineral crystal formation. However, it
           has been found that ectopic expression of BSP occurs in
           various lesions, including oral and extraoral
           carcinomas, in which it has been associated with the
           formation of microcrystalline deposits and the
           metastasis of cancer cells to bone.
          Length = 291

 Score = 26.6 bits (58), Expect = 6.4
 Identities = 18/53 (33%), Positives = 30/53 (56%)

Query: 96  TKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENV 148
           T   GN+G    Q  KK  N  K+  K+ + +E+++++EE+E EE + EE   
Sbjct: 105 TPGTGNIGLAALQLPKKAGNAGKKATKEDESDEDEEEEEEEEEEEAEVEENEQ 157


>gnl|CDD|225871 COG3334, COG3334, Uncharacterized conserved protein [Function
           unknown].
          Length = 192

 Score = 26.3 bits (58), Expect = 6.5
 Identities = 10/46 (21%), Positives = 22/46 (47%)

Query: 102 LGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
           L   +++  +K K+  +  E+ K  E++K + ++ E E E      
Sbjct: 72  LYALQKELLEKLKDLAEVNERLKALEKKKAELKDLEEEREGILRSK 117


>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2.  It is vital for
           effective cell-replication that replication is not
           stalled at any point by, for instance, damaged bases.
           Rtf2 stabilizes the replication fork stalled at the
           site-specific replication barrier RTS1 by preventing
           replication restart until completion of DNA synthesis by
           a converging replication fork initiated at a flanking
           origin. The RTS1 element terminates replication forks
           that are moving in the cen2-distal direction while
           allowing forks moving in the cen2-proximal direction to
           pass through the region. Rtf2 contains a C2HC2 motif
           related to the C3HC4 RING-finger motif, and would appear
           to fold up, creating a RING finger-like structure but
           forming only one functional Zn2+ ion-binding site.
          Length = 254

 Score = 26.2 bits (58), Expect = 6.6
 Identities = 12/35 (34%), Positives = 20/35 (57%)

Query: 115 NEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVE 149
            EE E  K + EEE  KKK++K++++ K+      
Sbjct: 166 EEEVELLKARLEEERAKKKKKKKKKKTKKNNATGS 200


>gnl|CDD|220112 pfam09110, HAND, HAND.  The HAND domain adopts a secondary
           structure consisting of four alpha helices, three of
           which (H2, H3, H4) form an L-like configuration. Helix
           H2 runs antiparallel to helices H3 and H4, packing
           closely against helix H4, whilst helix H1 reposes in the
           concave surface formed by these three helices and runs
           perpendicular to them. The domain confers DNA and
           nucleosome binding properties to the protein.
          Length = 109

 Score = 25.7 bits (57), Expect = 6.7
 Identities = 15/55 (27%), Positives = 32/55 (58%), Gaps = 8/55 (14%)

Query: 104 FRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKER--------EEEKREEENVEE 150
           ++K+ G K   ++  + +++ +EE E K+K E+E+        EEE+ E++ + E
Sbjct: 53  YKKKIGYKVPLDDVGDGDEEDEEEREAKRKLEQEKIDNAEPLTEEEEEEKQRLLE 107


>gnl|CDD|191249 pfam05279, Asp-B-Hydro_N, Aspartyl beta-hydroxylase N-terminal
           region.  This family includes the N-terminal regions of
           the junctin, junctate and aspartyl beta-hydroxylase
           proteins. Junctate is an integral ER/SR membrane calcium
           binding protein, which comes from an alternatively
           spliced form of the same gene that generates aspartyl
           beta-hydroxylase and junctin. Aspartyl beta-hydroxylase
           catalyzes the post-translational hydroxylation of
           aspartic acid or asparagine residues contained within
           epidermal growth factor (EGF) domains of proteins.
          Length = 240

 Score = 26.4 bits (58), Expect = 6.8
 Identities = 13/52 (25%), Positives = 24/52 (46%)

Query: 95  ETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
           E   E       R+ ++K + E KE+ +   E+    K+EE    +E + +E
Sbjct: 83  EPHAEEEGQLAVRKTKQKVEEEVKEQLQSLLEKIVVSKQEEDGPGKEPQLDE 134


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 26.6 bits (59), Expect = 6.9
 Identities = 12/31 (38%), Positives = 20/31 (64%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEE 135
           RK + ++ K+   KE E++ KEE++ KKK  
Sbjct: 277 RKTKAQRNKEKRRKELEREAKEEKQLKKKLA 307


>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
          Length = 742

 Score = 26.3 bits (58), Expect = 7.1
 Identities = 12/71 (16%), Positives = 25/71 (35%)

Query: 76  KKRLTRECTVSNGLLEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEE 135
           KK++ ++  + +    F                   +K   + E+ KKK    +   K++
Sbjct: 80  KKKIKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTEKLKKKITVNKSTNKKK 139

Query: 136 KEREEEKREEE 146
           K+    K E  
Sbjct: 140 KKVLSSKDELI 150


>gnl|CDD|130214 TIGR01144, ATP_synt_b, ATP synthase, F0 subunit b.  This model
           describes the F1/F0 ATP synthase b subunit in bacteria
           only. Scoring just below the trusted cutoff are the
           N-terminal domains of Mycobacterial b/delta fusion
           proteins and a subunit from an archaeon, Methanosarcina
           barkeri, in which the ATP synthase homolog differs in
           architecture and is not experimentally confirmed. This
           model helps resolve b from the related b' subunit.
           Within the family is an example from a
           sodium-translocating rather than proton-translocating
           ATP synthase [Energy metabolism, ATP-proton motive force
           interconversion].
          Length = 147

 Score = 25.8 bits (57), Expect = 7.2
 Identities = 14/40 (35%), Positives = 21/40 (52%)

Query: 111 KKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
            K+ +E  EE K +  EE +K K +   E E  +E+  EE
Sbjct: 71  NKRGSEILEEAKAEAREEREKIKAQARAEIEAEKEQAREE 110


>gnl|CDD|223649 COG0576, GrpE, Molecular chaperone GrpE (heat shock protein)
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 193

 Score = 26.1 bits (58), Expect = 7.2
 Identities = 19/39 (48%), Positives = 24/39 (61%)

Query: 112 KKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
               E+K EE   +E EE +K EE+E EEE+ EEEN  E
Sbjct: 1   MSDKEQKTEEPDAEETEEAEKSEEEEAEEEEPEEENELE 39


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 26.6 bits (59), Expect = 7.3
 Identities = 15/40 (37%), Positives = 22/40 (55%)

Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVE 149
           E + K +E  EE  + EE+ ++ KEE E  E + EE   E
Sbjct: 327 ELESKLDELAEELAELEEKLEELKEELESLEAELEELEAE 366


>gnl|CDD|116948 pfam08367, M16C_assoc, Peptidase M16C associated.  This domain
           appears in eukaryotes as well as bacteria and tends to
           be found near the C-terminus of the metalloprotease M16C
           (pfam05193).
          Length = 248

 Score = 26.0 bits (58), Expect = 7.4
 Identities = 14/39 (35%), Positives = 22/39 (56%), Gaps = 2/39 (5%)

Query: 110 EKKKKNEEKEEEKKKKE--EEEKKKKEEKEREEEKREEE 146
           EK  + E ++ EKKK    EE+K+K  E+  E E+ +  
Sbjct: 9   EKLNEEEREKLEKKKSSLSEEDKEKIIERGLELEELQST 47


>gnl|CDD|150531 pfam09871, DUF2098, Uncharacterized protein conserved in archaea
           (DUF2098).  This domain, found in various hypothetical
           prokaryotic proteins, has no known function.
          Length = 91

 Score = 25.3 bits (56), Expect = 7.7
 Identities = 12/34 (35%), Positives = 20/34 (58%)

Query: 113 KKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
           K  E+KEE ++ KEE  ++ K+E+E  E+     
Sbjct: 51  KVKEKKEEREEDKEELIERIKKEEETFEDVDLGS 84


>gnl|CDD|130706 TIGR01645, half-pint, poly-U binding splicing factor, half-pint
           family.  The proteins represented by this model contain
           three RNA recognition motifs (rrm: pfam00076) and have
           been characterized as poly-pyrimidine tract binding
           proteins associated with RNA splicing factors. In the
           case of PUF60 (GP|6176532), in complex with p54, and in
           the presence of U2AF, facilitates association of U2
           snRNP with pre-mRNA.
          Length = 612

 Score = 26.2 bits (57), Expect = 7.9
 Identities = 12/64 (18%), Positives = 23/64 (35%), Gaps = 2/64 (3%)

Query: 88  GLLEFRSETKKEGNLGFRKRQGEKKKKN--EEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
              E   E +   +       GE       E K+++K+K+ EE + K      +     +
Sbjct: 431 AWKEPSKEDQTSEDGKMLAIMGEAAAALALEPKKKKKEKEGEELQPKLVMNSEDASLASQ 490

Query: 146 ENVE 149
           E + 
Sbjct: 491 EGMS 494


>gnl|CDD|232958 TIGR00399, metG_C_term, methionyl-tRNA synthetase C-terminal
           region/beta chain.  The methionyl-tRNA synthetase (metG)
           is a class I amino acyl-tRNA ligase. This model
           describes a region of the methionyl-tRNA synthetase that
           is present at the C-terminus of MetG in some species (E.
           coli, B. subtilis, Thermotoga maritima, Methanobacterium
           thermoautotrophicum), and as a separate beta chain in
           Aquifex aeolicus. It is absent in a number of other
           species (e.g. Mycoplasma genitalium, Mycobacterium
           tuberculosis), while Pyrococcus horikoshii has both a
           full length MetG and a second protein homologous to the
           beta chain only. Proteins hit by This model should
           called methionyl-tRNA synthetase beta chain if and only
           if the model metG hits a separate protein not also hit
           by This model [Protein synthesis, tRNA aminoacylation].
          Length = 137

 Score = 25.8 bits (57), Expect = 8.0
 Identities = 12/31 (38%), Positives = 21/31 (67%)

Query: 116 EEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
           ++K EE K K  ++K+KK+E E+  E ++E 
Sbjct: 1   DKKIEELKLKGAKKKEKKDEGEKALEPQKET 31


>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO.  Bacillus
           spores are protected by a protein shell consisting of
           over 50 different polypeptides, known as the coat. This
           family of proteins has an important morphogenetic role
           in coat assembly, it is involved in the assembly of at
           least 5 different coat proteins including CotB, CotG,
           CotS, CotSA and CotW. It is likely to act at a late
           stage of coat assembly.
          Length = 185

 Score = 25.9 bits (57), Expect = 8.1
 Identities = 18/45 (40%), Positives = 29/45 (64%)

Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
            ++ E+K + EE E+EK+  E E +   E++E+EE  +EEE  EE
Sbjct: 52  SKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEEEKEEE 96


>gnl|CDD|192445 pfam10018, Med4, Vitamin-D-receptor interacting Mediator subunit 4.
            Members of this family function as part of the Mediator
           (Med) complex, which links DNA-bound transcriptional
           regulators and the general transcription machinery,
           particularly the RNA polymerase II enzyme. They play a
           role in basal transcription by mediating activation or
           repression according to the specific complement of
           transcriptional regulators bound to the promoter.
          Length = 176

 Score = 25.8 bits (57), Expect = 8.2
 Identities = 14/37 (37%), Positives = 23/37 (62%)

Query: 109 GEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
           G +    + +EEE++K  E E K +EE ERE ++R +
Sbjct: 140 GIEPPGFDPEEEEERKAREAEAKTEEEDEREAQERNQ 176


>gnl|CDD|234352 TIGR03779, Bac_Flav_CT_M, Bacteroides conjugative transposon TraM
           protein.  Members of this protein family are designated
           TraM and are found in a proposed transfer region of a
           class of conjugative transposon found in the Bacteroides
           lineage [Cellular processes, DNA transformation].
          Length = 410

 Score = 26.2 bits (58), Expect = 8.4
 Identities = 9/33 (27%), Positives = 16/33 (48%)

Query: 113 KKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
            K +E++E  ++ EE E +   E     E  E+
Sbjct: 140 PKTDEEKELLREVEELESRLATEPSPAPELEEQ 172


>gnl|CDD|185582 PTZ00373, PTZ00373, 60S Acidic ribosomal protein P2; Provisional.
          Length = 112

 Score = 25.3 bits (55), Expect = 8.7
 Identities = 11/17 (64%), Positives = 15/17 (88%)

Query: 114 KNEEKEEEKKKKEEEEK 130
           K E K+EEKK++EEEE+
Sbjct: 87  KAEAKKEEKKEEEEEEE 103


>gnl|CDD|240407 PTZ00412, PTZ00412, leucyl aminopeptidase; Provisional.
          Length = 569

 Score = 26.1 bits (57), Expect = 8.7
 Identities = 9/29 (31%), Positives = 20/29 (68%)

Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKE 134
           KR+G KK +N +K+ +  +K+   +++K+
Sbjct: 541 KRKGIKKAQNAKKDLKVVQKKRHGRRRKQ 569


>gnl|CDD|184416 PRK13955, mscL, large-conductance mechanosensitive channel;
           Provisional.
          Length = 130

 Score = 25.6 bits (56), Expect = 8.7
 Identities = 9/20 (45%), Positives = 14/20 (70%)

Query: 122 KKKKEEEEKKKKEEKEREEE 141
             KKEEE++++  E  +EEE
Sbjct: 94  TSKKEEEKEEEIPEPTKEEE 113


>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
          Length = 646

 Score = 26.0 bits (58), Expect = 8.8
 Identities = 17/73 (23%), Positives = 31/73 (42%), Gaps = 14/73 (19%)

Query: 85  VSNGLLEFRSETKKEGNLGFRKRQGEKKKKNEEK------------EEEKKKKEEEEKKK 132
           +S  L + RS   K      R+ + +   K+ E+            E+E K+ EEE  + 
Sbjct: 55  LSEALDKLRSYLPKLN--PLREEKKKVSVKSLEELIKDVEEELEKIEKEIKELEEEISEL 112

Query: 133 KEEKEREEEKREE 145
           + E +  E++ E 
Sbjct: 113 ENEIKELEQEIER 125


>gnl|CDD|218688 pfam05680, ATP-synt_E, ATP synthase E chain.  This family consists
           of several ATP synthase E chain sequences which are
           components of the CF(0) subunit.
          Length = 83

 Score = 24.7 bits (54), Expect = 8.8
 Identities = 8/35 (22%), Positives = 20/35 (57%)

Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKERE 139
           +K +  ++ + +EK  EK K    +KK+ +++ + 
Sbjct: 34  KKEEKIREYEAQEKLIEKAKAAYAKKKQAKQEAKS 68


>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
          Length = 470

 Score = 26.2 bits (58), Expect = 8.9
 Identities = 11/43 (25%), Positives = 22/43 (51%)

Query: 108 QGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
              K++K + K++  KKK ++ KK  ++ +    +   EN E 
Sbjct: 8   NEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEA 50


>gnl|CDD|226138 COG3611, DnaB, Replication initiation/membrane attachment protein
           [DNA replication, recombination, and repair].
          Length = 417

 Score = 26.2 bits (58), Expect = 9.1
 Identities = 17/56 (30%), Positives = 29/56 (51%), Gaps = 2/56 (3%)

Query: 97  KKEGNLGFRKRQGEKKKKNEEK--EEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           +KE  L  ++    K+K+N  K    E KK   +E++ K E  + EE +  EN ++
Sbjct: 360 RKELALAKKQSSTTKRKENLPKWSNPEYKKTTSQEEQAKLEFIKLEELKRLENGKQ 415


>gnl|CDD|204614 pfam11221, Med21, Subunit 21 of Mediator complex.  Med21 has been
           known as Srb7 in yeasts, hSrb7 in humans and Trap 19 in
           Drosophila. The heterodimer of the two subunits Med7 and
           Med21 appears to act as a hinge between the middle and
           the tail regions of Mediator.
          Length = 132

 Score = 25.3 bits (56), Expect = 9.4
 Identities = 13/34 (38%), Positives = 24/34 (70%)

Query: 115 NEEKEEEKKKKEEEEKKKKEEKEREEEKREEENV 148
           + E+E+ ++ KE EE+ ++ E EREE  +E+E +
Sbjct: 85  SSEEEQLRRIKELEEELREVEAEREEAVKEKEKL 118


>gnl|CDD|241412 cd13258, PH_PLEKHJ1, Pleckstrin homology domain containing, family
           J member 1 Pleckstrin homology (PH) domain.  PLEKHJ1
           (also called GNRPX2/Guanine nucleotide-releasing protein
           x ). It contains a single PH domain. Very little
           information is known about PLEKHJ1. PLEKHJ1 has been
           shown to interact with IKBKG (inhibitor of kappa light
           polypeptide gene enhancer in B-cells, kinase gamma) and
           KRT33B (keratin 33B). PH domains have diverse functions,
           but in general are involved in targeting proteins to the
           appropriate cellular location or in the interaction with
           a binding partner. They share little sequence
           conservation, but all have a common fold, which is
           electrostatically polarized. Less than 10% of PH domains
           bind phosphoinositide phosphates (PIPs) with high
           affinity and specificity. PH domains are distinguished
           from other PIP-binding domains by their specific
           high-affinity binding to PIPs with two vicinal phosphate
           groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3
           which results in targeting some PH domain proteins to
           the plasma membrane. A few display strong specificity in
           lipid binding. Any specificity is usually determined by
           loop regions or insertions in the N-terminus of the
           domain, which are not conserved across all PH domains.
           PH domains are found in cellular signaling proteins such
           as serine/threonine kinase, tyrosine kinases, regulators
           of G-proteins, endocytotic GTPases, adaptors, as well as
           cytoskeletal associated molecules and in lipid
           associated enzymes.
          Length = 144

 Score = 25.6 bits (56), Expect = 9.7
 Identities = 9/24 (37%), Positives = 12/24 (50%)

Query: 42  IAYIRDFACDYYFIGDSFETSVPW 65
           I++I +    YYF   S E  V W
Sbjct: 86  ISFIDEPERKYYFECRSEEQCVEW 109


>gnl|CDD|224403 COG1486, CelF, Alpha-galactosidases/6-phospho-beta-glucosidases,
           family 4 of glycosyl hydrolases [Carbohydrate transport
           and metabolism].
          Length = 442

 Score = 26.1 bits (58), Expect = 9.8
 Identities = 13/40 (32%), Positives = 22/40 (55%)

Query: 111 KKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
           K+KK +E E+   +  E  K++KE  E  ++   +E  EE
Sbjct: 268 KEKKPDEIEKFGTRANEVMKREKELFELYKKPELKEKPEE 307


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.308    0.131    0.364 

Gapped
Lambda     K      H
   0.267   0.0812    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 7,799,653
Number of extensions: 743794
Number of successful extensions: 15107
Number of sequences better than 10.0: 1
Number of HSP's gapped: 9292
Number of HSP's successfully gapped: 2625
Length of query: 150
Length of database: 10,937,602
Length adjustment: 88
Effective length of query: 62
Effective length of database: 7,034,450
Effective search space: 436135900
Effective search space used: 436135900
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 54 (24.4 bits)