RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy7478
         (180 letters)



>gnl|CDD|215772 pfam00179, UQ_con, Ubiquitin-conjugating enzyme.  Proteins destined
           for proteasome-mediated degradation may be
           ubiquitinated. Ubiquitination follows conjugation of
           ubiquitin to a conserved cysteine residue of UBC
           homologues. TSG101 is one of several UBC homologues that
           lacks this active site cysteine.
          Length = 139

 Score = 72.6 bits (179), Expect = 7e-17
 Identities = 26/97 (26%), Positives = 47/97 (48%), Gaps = 17/97 (17%)

Query: 91  TLSVQIPLHRVHEQGVPPVV------YPPSIYNHQSTLYSGTVCLNVINQE-WTPLYDLS 143
            L ++ P     +   PP V      Y P++        SG +CL+++  E W+P   + 
Sbjct: 47  KLDIEFPEDYPFK---PPKVKFTTKIYHPNVDP------SGEICLDILKDENWSPALTIE 97

Query: 144 NIFETFLPQLLTYPNPTDPLNGDAAAMYLDKPDKYKE 180
            +    +  LL+ PNP DPLN +AA +Y    +++++
Sbjct: 98  QVL-LSIQSLLSEPNPEDPLNAEAAKLYRKNREEFEK 133


>gnl|CDD|238117 cd00195, UBCc, Ubiquitin-conjugating enzyme E2, catalytic (UBCc)
           domain. This is part of the ubiquitin-mediated protein
           degradation pathway in which a thiol-ester linkage forms
           between a conserved cysteine and the C-terminus of
           ubiquitin and complexes with ubiquitin protein ligase
           enzymes, E3.  This pathway regulates many fundamental
           cellular processes.  There are also other E2s which form
           thiol-ester linkages without the use of E3s as well as
           several UBC homologs (TSG101, Mms2, Croc-1 and similar
           proteins) which lack the active site cysteine essential
           for ubiquitination and appear to function in DNA repair
           pathways which were omitted from the scope of this CD.
          Length = 141

 Score = 69.2 bits (170), Expect = 2e-15
 Identities = 25/81 (30%), Positives = 43/81 (53%), Gaps = 14/81 (17%)

Query: 107 PPVV------YPPSIYNHQSTLYSGTVCLNVINQE-WTPLYDLSNIFETFLPQLLTYPNP 159
           PP V      Y P++        +G +CL+++    W+P Y L  +  +    LL  PNP
Sbjct: 62  PPKVRFVTKIYHPNVDE------NGKICLSILKTHGWSPAYTLRTVLLSLQS-LLNEPNP 114

Query: 160 TDPLNGDAAAMYLDKPDKYKE 180
           +DPLN +AA +Y +  +++K+
Sbjct: 115 SDPLNAEAAKLYKENREEFKK 135


>gnl|CDD|227410 COG5078, COG5078, Ubiquitin-protein ligase [Posttranslational
           modification, protein turnover, chaperones].
          Length = 153

 Score = 68.1 bits (167), Expect = 6e-15
 Identities = 25/74 (33%), Positives = 41/74 (55%), Gaps = 6/74 (8%)

Query: 112 PPSIY-----NHQSTLYSGTVCLNVINQEWTPLYDLSNIFETFLPQLLTYPNPTDPLNGD 166
           PP +       H +   SG VCL+++   W+P+Y L  I  +    LL+ PNP  PLN +
Sbjct: 69  PPKVRFTTKIFHPNVDPSGNVCLDILKDRWSPVYTLETILLSLQSLLLS-PNPDSPLNTE 127

Query: 167 AAAMYLDKPDKYKE 180
           AA +Y +  ++Y++
Sbjct: 128 AATLYREDKEEYEK 141


>gnl|CDD|214562 smart00212, UBCc, Ubiquitin-conjugating enzyme E2, catalytic domain
           homologues.  Proteins destined for proteasome-mediated
           degradation may be ubiquitinated. Ubiquitination follows
           conjugation of ubiquitin to a conserved cysteine residue
           of UBC homologues. This pathway functions in regulating
           many fundamental processes required for cell
           viability.TSG101 is one of several UBC homologues that
           lacks this active site cysteine.
          Length = 145

 Score = 60.4 bits (147), Expect = 3e-12
 Identities = 28/81 (34%), Positives = 44/81 (54%), Gaps = 14/81 (17%)

Query: 107 PPVV------YPPSIYNHQSTLYSGTVCLNVINQE-WTPLYDLSNIFETFLPQLLTYPNP 159
           PP V      Y P++ +      SG +CL+++ QE W+P   L  +  + L  LL+ PNP
Sbjct: 61  PPKVKFITKIYHPNVDS------SGEICLDILKQEKWSPALTLETVLLS-LQSLLSEPNP 113

Query: 160 TDPLNGDAAAMYLDKPDKYKE 180
             PLN DAA +Y    +++K+
Sbjct: 114 DSPLNADAAELYKKNREEFKK 134


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 44.5 bits (106), Expect = 8e-06
 Identities = 17/56 (30%), Positives = 38/56 (67%)

Query: 8   QVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
           +    K  KK++EE+ E+       +++EE+++EEK+K++++KEEE+++  E++  
Sbjct: 413 KKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEE 468



 Score = 43.4 bits (103), Expect = 3e-05
 Identities = 20/63 (31%), Positives = 37/63 (58%)

Query: 1   LQNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
           L  S        K  +K +++ +EE    +++    +KK+EE+++EK+KKEEEK++E E+
Sbjct: 402 LTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEE 461

Query: 61  RRT 63
              
Sbjct: 462 AEE 464



 Score = 43.0 bits (102), Expect = 3e-05
 Identities = 23/50 (46%), Positives = 38/50 (76%)

Query: 11  KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
           + + +K+KK++        EEEEEE+EKK+EEK++E+++ EEEK++E EK
Sbjct: 423 REEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEK 472



 Score = 37.6 bits (88), Expect = 0.002
 Identities = 19/45 (42%), Positives = 35/45 (77%)

Query: 10  SKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEK 54
             +  +KK++EEE+E+    EE+EEEEE+ +EEK++E++KK+++ 
Sbjct: 433 KAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKKQA 477



 Score = 28.3 bits (64), Expect = 2.3
 Identities = 13/34 (38%), Positives = 19/34 (55%)

Query: 30  EEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
           EEE E     K+  KK KK  E+ +KK  E+++ 
Sbjct: 396 EEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKE 429



 Score = 26.4 bits (59), Expect = 9.6
 Identities = 18/58 (31%), Positives = 25/58 (43%), Gaps = 4/58 (6%)

Query: 32  EEEEEE----KKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIGKPALKPRPTRCVEGE 85
            EEE E     KK  KK +K  ++ EKK+E EK+        GK   +       + E
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKE 452


>gnl|CDD|177768 PLN00172, PLN00172, ubiquitin conjugating enzyme; Provisional.
          Length = 147

 Score = 41.7 bits (97), Expect = 3e-05
 Identities = 22/73 (30%), Positives = 42/73 (57%), Gaps = 2/73 (2%)

Query: 107 PPVVYPPSIYNHQSTLYSGTVCLNVINQEWTPLYDLSNIFETFLPQLLTYPNPTDPLNGD 166
           P V +   IY H +   +G++CL+++  +W+P   +S +  + +  LLT PNP DPL  +
Sbjct: 65  PKVQFTTKIY-HPNINSNGSICLDILRDQWSPALTVSKVLLS-ISSLLTDPNPDDPLVPE 122

Query: 167 AAAMYLDKPDKYK 179
            A ++ +   +Y+
Sbjct: 123 IARVFKENRSRYE 135


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 42.1 bits (99), Expect = 7e-05
 Identities = 18/54 (33%), Positives = 36/54 (66%)

Query: 6  TTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
            +V   +  ++K+E+++EE    ++EEE +E++++E+KK+K KK +E   E E
Sbjct: 32 EKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWE 85



 Score = 35.1 bits (81), Expect = 0.014
 Identities = 18/44 (40%), Positives = 33/44 (75%)

Query: 19 KEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
          KE E E     EEEE+EE+K++EEK  +K+++ +E++++ EK++
Sbjct: 29 KEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKK 72



 Score = 34.4 bits (79), Expect = 0.023
 Identities = 21/55 (38%), Positives = 34/55 (61%)

Query: 9  VSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
          V K   ++   EEE+EE    +EEEE+   K+EE  +E++K+E++KK +  K  T
Sbjct: 27 VEKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETT 81



 Score = 32.8 bits (75), Expect = 0.088
 Identities = 17/47 (36%), Positives = 30/47 (63%)

Query: 10 SKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKK 56
           K ++++++++  D+E    EEEE+EE+KKK +K KE   + E   K
Sbjct: 43 EKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELLNK 89



 Score = 29.0 bits (65), Expect = 1.3
 Identities = 18/51 (35%), Positives = 35/51 (68%), Gaps = 1/51 (1%)

Query: 15 RKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLT 65
          ++ +KE  DEE    E+EE++EE++K   K+E+  +EEEK+++ +K + + 
Sbjct: 29 KEVEKEVPDEEEEE-EKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVK 78


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 41.5 bits (97), Expect = 1e-04
 Identities = 18/48 (37%), Positives = 36/48 (75%)

Query: 17  KKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTL 64
           +K  +      GG+ EEEEEE+++EE+++E++++EEE+++E E+  +L
Sbjct: 849 EKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEEPLSL 896



 Score = 39.6 bits (92), Expect = 5e-04
 Identities = 18/47 (38%), Positives = 34/47 (72%)

Query: 13  KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
           K+ +K  +      GG  EEEEEEE+++EE+++E++++EEE+++  E
Sbjct: 846 KQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892



 Score = 35.7 bits (82), Expect = 0.011
 Identities = 16/44 (36%), Positives = 35/44 (79%)

Query: 17  KKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
           K+ E+  + GGG +  + EEE+++EE+++E++++EEE+++E E+
Sbjct: 846 KQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEE 889



 Score = 34.6 bits (79), Expect = 0.026
 Identities = 19/44 (43%), Positives = 32/44 (72%), Gaps = 4/44 (9%)

Query: 19  KEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
           +EEE+EE    EEEEEEEE+++EE+++E++ +E    +  E R+
Sbjct: 864 EEEEEEE----EEEEEEEEEEEEEEEEEEENEEPLSLEWPETRQ 903



 Score = 31.9 bits (72), Expect = 0.18
 Identities = 16/42 (38%), Positives = 30/42 (71%)

Query: 16  KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
           ++++EEE+EE    EEEEEEEE+++E ++    +  E ++K+
Sbjct: 864 EEEEEEEEEEEEEEEEEEEEEEEEEENEEPLSLEWPETRQKQ 905



 Score = 30.7 bits (69), Expect = 0.42
 Identities = 16/47 (34%), Positives = 26/47 (55%)

Query: 13  KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
           +R  + + E  EE GG  E+E E E K E + + +   E + ++EGE
Sbjct: 650 ERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGE 696



 Score = 27.7 bits (61), Expect = 4.7
 Identities = 12/47 (25%), Positives = 22/47 (46%)

Query: 15  RKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
           R  ++ E   E  G   EE   E ++E + + K + E E +   E++
Sbjct: 644 RTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERK 690



 Score = 27.3 bits (60), Expect = 5.2
 Identities = 17/45 (37%), Positives = 24/45 (53%), Gaps = 5/45 (11%)

Query: 20  EEEDEEGGGGEEEEEEEE-----KKKEEKKKEKKKKEEEKKKEGE 59
           E E E G  GEE E+E E     K + E + ++K+ E E + E E
Sbjct: 729 EGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAE 773



 Score = 27.3 bits (60), Expect = 6.5
 Identities = 15/47 (31%), Positives = 22/47 (46%)

Query: 13  KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
           K    K E E EE     E E E  + + E +  ++ +E E + EGE
Sbjct: 702 KEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGE 748



 Score = 26.9 bits (59), Expect = 7.2
 Identities = 14/42 (33%), Positives = 24/42 (57%)

Query: 18  KKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
           + EE ++EG G  E + E E + + K+ E + + E + KE E
Sbjct: 737 EGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDE 778



 Score = 26.9 bits (59), Expect = 8.7
 Identities = 16/46 (34%), Positives = 26/46 (56%), Gaps = 3/46 (6%)

Query: 17  KKKEEEDEEGG---GGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
           + +E EDE  G   G  E E E ++K+ E + E + + +E + EGE
Sbjct: 737 EGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGE 782


>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
          Length = 651

 Score = 38.1 bits (89), Expect = 0.002
 Identities = 15/35 (42%), Positives = 25/35 (71%)

Query: 31  EEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLT 65
           E+EE+E  K++++ ++ KK+EE+KKKE EK     
Sbjct: 556 EKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAK 590



 Score = 32.3 bits (74), Expect = 0.11
 Identities = 15/42 (35%), Positives = 28/42 (66%), Gaps = 7/42 (16%)

Query: 16  KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
           +++KEE+       E  +E++  +K +K++EKKKKE EK ++
Sbjct: 554 QREKEEK-------EALKEQKRLRKLKKQEEKKKKELEKLEK 588



 Score = 30.4 bits (69), Expect = 0.57
 Identities = 15/37 (40%), Positives = 25/37 (67%)

Query: 13  KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKK 49
           +R K++KE   E+    + +++EE+KKKE +K EK K
Sbjct: 554 QREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAK 590



 Score = 28.5 bits (64), Expect = 2.6
 Identities = 7/33 (21%), Positives = 21/33 (63%)

Query: 33  EEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLT 65
           +++EE ++E+++KE  K+++  +K  ++     
Sbjct: 548 DDKEELQREKEEKEALKEQKRLRKLKKQEEKKK 580



 Score = 27.7 bits (62), Expect = 3.7
 Identities = 15/48 (31%), Positives = 30/48 (62%), Gaps = 4/48 (8%)

Query: 8   QVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKK 55
              K + +++K+E+E  +    E++   + KK+EEKKK++ +K E+ K
Sbjct: 547 LDDKEELQREKEEKEALK----EQKRLRKLKKQEEKKKKELEKLEKAK 590


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 37.0 bits (86), Expect = 0.002
 Identities = 23/50 (46%), Positives = 35/50 (70%)

Query: 11  KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
           +Y   KKKK+E  EE    ++E EE++K K +KKK KKKK+++K K+ +K
Sbjct: 56  EYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDK 105



 Score = 34.3 bits (79), Expect = 0.018
 Identities = 8/45 (17%), Positives = 32/45 (71%)

Query: 13  KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
           +++K K +++  +    ++++++++KK ++ +K+ +K+ E+K ++
Sbjct: 80  EKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLED 124



 Score = 33.9 bits (78), Expect = 0.023
 Identities = 12/47 (25%), Positives = 32/47 (68%)

Query: 11  KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
           +Y+ ++K K ++ +     ++++++++ KK++K ++K +KE E K E
Sbjct: 77  EYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLE 123



 Score = 32.4 bits (74), Expect = 0.071
 Identities = 13/67 (19%), Positives = 36/67 (53%)

Query: 7   TQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTF 66
            +  + ++ K KK++  ++    ++++++++  K EKK EK+ +++ +        TL+ 
Sbjct: 76  KEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLST 135

Query: 67  IRIGKPA 73
           +   KP 
Sbjct: 136 LSELKPR 142



 Score = 32.0 bits (73), Expect = 0.10
 Identities = 13/47 (27%), Positives = 32/47 (68%)

Query: 15  RKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
            K KKE E+++    ++++ +++K K++ KK+ KK ++ +KK+ ++ 
Sbjct: 72  EKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEA 118



 Score = 31.6 bits (72), Expect = 0.13
 Identities = 11/45 (24%), Positives = 28/45 (62%)

Query: 16  KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
           KK+ EE+ +     ++ +++++K K++K  +K  K E+K ++  +
Sbjct: 75  KKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAE 119



 Score = 29.3 bits (66), Expect = 0.91
 Identities = 11/50 (22%), Positives = 29/50 (58%)

Query: 15  RKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTL 64
            + +K +++ E     + ++++ KKK++K K+KK  +++ K E +  +  
Sbjct: 69  EEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEA 118



 Score = 28.1 bits (63), Expect = 2.2
 Identities = 11/45 (24%), Positives = 28/45 (62%)

Query: 13  KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
               +K ++E EE    + ++++ +KKK++ K +K  K+++K ++
Sbjct: 68  AEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEK 112



 Score = 26.6 bits (59), Expect = 6.4
 Identities = 11/56 (19%), Positives = 25/56 (44%)

Query: 10  SKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLT 65
              K++ K K+++D++     E+++E+E + + +   K   E        K R   
Sbjct: 90  KSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTLSELKPRKYA 145


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 37.9 bits (88), Expect = 0.002
 Identities = 14/48 (29%), Positives = 30/48 (62%)

Query: 8   QVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKK 55
           +V       ++ E + +E    +EE E+++K++ ++ KEK++K+E KK
Sbjct: 387 EVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434



 Score = 37.5 bits (87), Expect = 0.002
 Identities = 15/50 (30%), Positives = 31/50 (62%)

Query: 13  KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
           K+  + K+E D       + +EE+ K++E +KK+K++ +E+K+K  +  R
Sbjct: 383 KKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDER 432



 Score = 32.8 bits (75), Expect = 0.075
 Identities = 14/51 (27%), Positives = 28/51 (54%), Gaps = 3/51 (5%)

Query: 11  KYKRRKKKKEEEDEEGGGGEEEE---EEEEKKKEEKKKEKKKKEEEKKKEG 58
           K    K + +  +E     +EE+   EE EKK++E+  E K+K ++ +++ 
Sbjct: 384 KLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434



 Score = 28.6 bits (64), Expect = 1.8
 Identities = 13/48 (27%), Positives = 27/48 (56%)

Query: 2   QNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKK 49
           +    T  S+    K K+E+  +E    +++E+ +E K++ +K E+KK
Sbjct: 387 EVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434



 Score = 28.6 bits (64), Expect = 2.1
 Identities = 15/48 (31%), Positives = 30/48 (62%)

Query: 13  KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
           K   KK ++  ++ G  ++E +  E+ + + K+EK K+EE +KK+ E+
Sbjct: 372 KEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQ 419



 Score = 27.5 bits (61), Expect = 4.2
 Identities = 8/48 (16%), Positives = 25/48 (52%)

Query: 3   NSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKK 50
                +    +  + K +EE  +    E++++E+  + +EK+++ ++K
Sbjct: 386 GEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERK 433


>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 156 to 174 amino acids in length. This domain is
           found associated with pfam07780, pfam01728.
          Length = 154

 Score = 36.5 bits (85), Expect = 0.002
 Identities = 20/59 (33%), Positives = 35/59 (59%), Gaps = 2/59 (3%)

Query: 8   QVSKYKRRKKKKEEEDEEGGGGEEEEEEEEK--KKEEKKKEKKKKEEEKKKEGEKRRTL 64
           +V K     KK++EE+EE     EE +EEE+  +  EK+  K K+E+ ++ E +++  L
Sbjct: 89  KVRKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKEIL 147



 Score = 34.6 bits (80), Expect = 0.010
 Identities = 16/51 (31%), Positives = 32/51 (62%)

Query: 13  KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
           K +++++EEE E     EEE+ +E  +KE  K +++K+ E ++K+ E  + 
Sbjct: 99  KEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKEILKE 149


>gnl|CDD|184287 PRK13735, PRK13735, conjugal transfer mating pair stabilization
           protein TraG; Provisional.
          Length = 942

 Score = 37.4 bits (87), Expect = 0.002
 Identities = 12/48 (25%), Positives = 21/48 (43%)

Query: 1   LQNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKK 48
           LQN   T+      +  +++   E   G +  EE  ++ KE + K K 
Sbjct: 895 LQNHHKTEALSQNNKYNEEKSAQERMPGADSPEELMKRAKEYQDKHKG 942



 Score = 30.5 bits (69), Expect = 0.50
 Identities = 8/59 (13%), Positives = 23/59 (38%), Gaps = 2/59 (3%)

Query: 1   LQNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKK--EEKKKEKKKKEEEKKKE 57
           ++    T   +Y   +   + E         EE+  +++    +  +E  K+ +E + +
Sbjct: 881 IRGEENTVKGQYSELQNHHKTEALSQNNKYNEEKSAQERMPGADSPEELMKRAKEYQDK 939


>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
           This family represents Cwf15/Cwc15 (from
           Schizosaccharomyces pombe and Saccharomyces cerevisiae
           respectively) and their homologues. The function of
           these proteins is unknown, but they form part of the
           spliceosome and are thus thought to be involved in mRNA
           splicing.
          Length = 241

 Score = 37.0 bits (86), Expect = 0.002
 Identities = 17/49 (34%), Positives = 28/49 (57%), Gaps = 6/49 (12%)

Query: 20  EEEDEEGGGGEEEEEEE------EKKKEEKKKEKKKKEEEKKKEGEKRR 62
           +  D++    + E+E        EK K+E+ +EK+++EEEK  E EK R
Sbjct: 129 DSSDDDSDDDDSEDETAALLRELEKIKKERAEEKEREEEEKAAEEEKAR 177



 Score = 26.2 bits (58), Expect = 9.0
 Identities = 11/25 (44%), Positives = 19/25 (76%)

Query: 30  EEEEEEEEKKKEEKKKEKKKKEEEK 54
           +E  EE+E+++EEK  E++K  EE+
Sbjct: 156 KERAEEKEREEEEKAAEEEKAREEE 180


>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3.  This family of
           proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). SH3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of SH3, AAPs are retained in the ER.
          Length = 196

 Score = 36.2 bits (84), Expect = 0.004
 Identities = 13/37 (35%), Positives = 18/37 (48%)

Query: 20  EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKK 56
           ++         EE E     KEE+  E +KKE +KKK
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196



 Score = 35.8 bits (83), Expect = 0.006
 Identities = 13/46 (28%), Positives = 21/46 (45%), Gaps = 5/46 (10%)

Query: 12  YKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
           Y   K  K  E+        EE E     +E++  + +K+E KKK+
Sbjct: 156 YAEWKDAKLLEE-----FAAEEAEAAAAAKEEESAEGEKKESKKKK 196



 Score = 28.5 bits (64), Expect = 1.8
 Identities = 9/32 (28%), Positives = 15/32 (46%)

Query: 30  EEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
           ++ +  EE   EE +     KEEE  +  +K 
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKE 191



 Score = 27.3 bits (61), Expect = 3.4
 Identities = 11/31 (35%), Positives = 16/31 (51%)

Query: 30  EEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
            EE   EE +     KE++  E EKK+  +K
Sbjct: 165 LEEFAAEEAEAAAAAKEEESAEGEKKESKKK 195


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 37.0 bits (85), Expect = 0.004
 Identities = 21/54 (38%), Positives = 32/54 (59%), Gaps = 4/54 (7%)

Query: 13   KRRKKKKEEEDEEGGGGEEEEEEEEKKKEE----KKKEKKKKEEEKKKEGEKRR 62
               + KK EED++     ++ EE+EKK  E    + +E KK EE KKKE E+++
Sbjct: 1663 AAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKK 1716



 Score = 34.0 bits (77), Expect = 0.036
 Identities = 19/50 (38%), Positives = 32/50 (64%)

Query: 13   KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
            K+ ++ K++E EE    EE ++ EE+ K +  +E KK EE+KKK  E ++
Sbjct: 1633 KKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKK 1682



 Score = 34.0 bits (77), Expect = 0.038
 Identities = 24/56 (42%), Positives = 32/56 (57%), Gaps = 4/56 (7%)

Query: 11   KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEE----KKKEKKKKEEEKKKEGEKRR 62
            K K  + KK EEDE+      ++E EE KK E    K+ E+KKK EE KK  E+ +
Sbjct: 1674 KKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENK 1729



 Score = 34.0 bits (77), Expect = 0.040
 Identities = 18/52 (34%), Positives = 28/52 (53%)

Query: 11   KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
            K    KKK EE  ++    ++  E ++K  E KK E+ KK +E KK  E ++
Sbjct: 1484 KADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKK 1535



 Score = 33.6 bits (76), Expect = 0.050
 Identities = 20/55 (36%), Positives = 33/55 (60%), Gaps = 3/55 (5%)

Query: 11   KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKK---EKKKKEEEKKKEGEKRR 62
            K +  +KKKEE  ++    +++ EE++K  E KKK   +KKK +E KK    K++
Sbjct: 1365 KAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKK 1419



 Score = 33.6 bits (76), Expect = 0.053
 Identities = 22/60 (36%), Positives = 34/60 (56%), Gaps = 3/60 (5%)

Query: 6    TTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKK---EKKKKEEEKKKEGEKRR 62
              +  K     KKK EE ++    +++ EE++KK +E KK    KKK +E KKK  EK++
Sbjct: 1373 KEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKK 1432



 Score = 33.6 bits (76), Expect = 0.060
 Identities = 20/57 (35%), Positives = 31/57 (54%), Gaps = 2/57 (3%)

Query: 8    QVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKK--EKKKKEEEKKKEGEKRR 62
            +  K    KKK EE+ ++    ++    ++K  E KKK  EKKK +E KKK  E ++
Sbjct: 1389 EKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKK 1445



 Score = 33.2 bits (75), Expect = 0.062
 Identities = 21/71 (29%), Positives = 36/71 (50%), Gaps = 3/71 (4%)

Query: 1    LQNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKK---EKKKKEEEKKKE 57
            L+     +  K +  KK +EE   +     ++ EE++KK EE KK   ++KK  E  KKE
Sbjct: 1638 LKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKE 1697

Query: 58   GEKRRTLTFIR 68
             E+ +    ++
Sbjct: 1698 AEEAKKAEELK 1708



 Score = 32.8 bits (74), Expect = 0.083
 Identities = 16/53 (30%), Positives = 26/53 (49%)

Query: 10   SKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
            +K K    KK+ E+ +      + E E    E +  E+K +  EKKKE  K++
Sbjct: 1327 AKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKK 1379



 Score = 32.4 bits (73), Expect = 0.12
 Identities = 20/56 (35%), Positives = 34/56 (60%), Gaps = 4/56 (7%)

Query: 11   KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKK----EKKKKEEEKKKEGEKRR 62
            K ++ KKK+ EE ++    ++ EEE + K  E+ K    +KKK EE KK E ++++
Sbjct: 1634 KVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKK 1689



 Score = 32.4 bits (73), Expect = 0.13
 Identities = 11/53 (20%), Positives = 24/53 (45%)

Query: 11   KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
            K K  + KK+ +  +    E ++  E  K E +    + +  E+K E  +++ 
Sbjct: 1321 KKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKK 1373



 Score = 32.4 bits (73), Expect = 0.14
 Identities = 19/55 (34%), Positives = 29/55 (52%), Gaps = 3/55 (5%)

Query: 11   KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEE---KKKEKKKKEEEKKKEGEKRR 62
                    + E  EE     E+++EE KKK +   KK E+KKK +E KK+ E+ +
Sbjct: 1350 AEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDK 1404



 Score = 32.4 bits (73), Expect = 0.14
 Identities = 22/57 (38%), Positives = 33/57 (57%), Gaps = 2/57 (3%)

Query: 8    QVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKK--EKKKKEEEKKKEGEKRR 62
            +  K     KKK +E ++    +++ +E +KK EEKKK  E KKK EE KK  E ++
Sbjct: 1395 EAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKK 1451



 Score = 32.0 bits (72), Expect = 0.18
 Identities = 20/55 (36%), Positives = 30/55 (54%), Gaps = 3/55 (5%)

Query: 11   KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKK---EKKKKEEEKKKEGEKRR 62
            K K  + KK +E ++     ++ +E +KK EE KK   E KK  E KKK  E ++
Sbjct: 1463 KKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKK 1517



 Score = 32.0 bits (72), Expect = 0.20
 Identities = 22/64 (34%), Positives = 34/64 (53%), Gaps = 11/64 (17%)

Query: 10   SKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEE-----------KKKEKKKKEEEKKKEG 58
            +K K  + KK EE+++     +++E EEKKK E           K  E+ KK EE KK+ 
Sbjct: 1618 AKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKA 1677

Query: 59   EKRR 62
            E+ +
Sbjct: 1678 EEAK 1681



 Score = 31.6 bits (71), Expect = 0.21
 Identities = 22/54 (40%), Positives = 31/54 (57%), Gaps = 2/54 (3%)

Query: 11   KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKK--EKKKKEEEKKKEGEKRR 62
            K K  + KK +E ++     ++ EE +KK EE KK  E KKK EE KK  E ++
Sbjct: 1437 KKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKK 1490



 Score = 31.6 bits (71), Expect = 0.22
 Identities = 18/49 (36%), Positives = 25/49 (51%), Gaps = 3/49 (6%)

Query: 17   KKKEEEDEEGGGGEEEEEEEEKKKEEKKK---EKKKKEEEKKKEGEKRR 62
            KKK EE ++    +++ EE +KK +  KK   E KK  E  K E E   
Sbjct: 1308 KKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAA 1356



 Score = 31.6 bits (71), Expect = 0.23
 Identities = 15/52 (28%), Positives = 24/52 (46%)

Query: 11   KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
            K    KKK EE  ++    +++ EE +K  E  K E +   +E +   EK  
Sbjct: 1316 KADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAE 1367



 Score = 31.6 bits (71), Expect = 0.24
 Identities = 12/56 (21%), Positives = 25/56 (44%)

Query: 8    QVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
            +  K     KKK +  ++     ++  E  K + E   ++ +  EEK +  EK++ 
Sbjct: 1319 EAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKE 1374



 Score = 31.6 bits (71), Expect = 0.26
 Identities = 23/53 (43%), Positives = 35/53 (66%), Gaps = 1/53 (1%)

Query: 8    QVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKK-EEKKKEKKKKEEEKKKEGE 59
            + +K K  + KKE E+++    E +++EEEKKK    KKE++KK EE +KE E
Sbjct: 1726 EENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKE 1778



 Score = 31.3 bits (70), Expect = 0.27
 Identities = 19/53 (35%), Positives = 30/53 (56%)

Query: 10   SKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
            +K K  + KK  E ++     ++ EE +K  E KK E+ KK +E KK  EK++
Sbjct: 1495 AKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKK 1547



 Score = 31.3 bits (70), Expect = 0.28
 Identities = 21/55 (38%), Positives = 34/55 (61%), Gaps = 2/55 (3%)

Query: 8    QVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
            ++ K +  +KKK EE ++     + + EE KK+ E  ++KKK EE KK E EK++
Sbjct: 1706 ELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAE--EDKKKAEEAKKDEEEKKK 1758



 Score = 30.9 bits (69), Expect = 0.36
 Identities = 21/61 (34%), Positives = 32/61 (52%), Gaps = 9/61 (14%)

Query: 11   KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKK---------EKKKKEEEKKKEGEKR 61
            K K  +KKK +E ++     ++ +E +KK EE KK         E KK +E KKK  E +
Sbjct: 1424 KKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAK 1483

Query: 62   R 62
            +
Sbjct: 1484 K 1484



 Score = 30.9 bits (69), Expect = 0.39
 Identities = 16/52 (30%), Positives = 27/52 (51%)

Query: 11   KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
              K  +  K E +      E  EE+ E  +++K++ KKK +  KKK  EK++
Sbjct: 1341 AKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKK 1392



 Score = 30.9 bits (69), Expect = 0.41
 Identities = 17/55 (30%), Positives = 31/55 (56%)

Query: 8    QVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
            +  K     KKK +E ++    +++ +E +K +E KK ++ KK EE KK  E ++
Sbjct: 1487 EAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKK 1541



 Score = 30.9 bits (69), Expect = 0.46
 Identities = 20/52 (38%), Positives = 32/52 (61%), Gaps = 2/52 (3%)

Query: 13   KRRKKKKEEEDEEGGGGEEEEEEEEKKKEE--KKKEKKKKEEEKKKEGEKRR 62
            + +K  + ++ EE    +E ++ EEKKK +  KK E+ KK EEKKK  E ++
Sbjct: 1520 EAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKK 1571



 Score = 30.5 bits (68), Expect = 0.55
 Identities = 21/74 (28%), Positives = 40/74 (54%)

Query: 1    LQNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
            L+     +  K +  KK +EE   +    ++E EE++KK EE KK++++K++    + E+
Sbjct: 1707 LKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEE 1766

Query: 61   RRTLTFIRIGKPAL 74
             +    IR  K A+
Sbjct: 1767 EKKAEEIRKEKEAV 1780



 Score = 30.5 bits (68), Expect = 0.58
 Identities = 17/56 (30%), Positives = 30/56 (53%), Gaps = 1/56 (1%)

Query: 8    QVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKK-EKKKKEEEKKKEGEKRR 62
            +  K     KK +E  ++    +++ +E +K  E KKK ++ KK EE KK  E ++
Sbjct: 1474 EAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKK 1529



 Score = 30.1 bits (67), Expect = 0.66
 Identities = 19/49 (38%), Positives = 31/49 (63%), Gaps = 2/49 (4%)

Query: 15   RKKKKEEEDEEGGGGEEEEEEEEKKKEE--KKKEKKKKEEEKKKEGEKR 61
            +K ++ ++ +E    EE+++ +E KK E  KK E+KKK EE KK  E +
Sbjct: 1528 KKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDK 1576



 Score = 30.1 bits (67), Expect = 0.70
 Identities = 14/56 (25%), Positives = 24/56 (42%), Gaps = 3/56 (5%)

Query: 8    QVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKE---KKKKEEEKKKEGEK 60
            +  K     KK +E  ++    +++ +  +KK EE KK     K + E    E E 
Sbjct: 1306 EAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEA 1361



 Score = 29.7 bits (66), Expect = 0.87
 Identities = 20/51 (39%), Positives = 31/51 (60%), Gaps = 1/51 (1%)

Query: 13   KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKK-EKKKKEEEKKKEGEKRR 62
            K+  + K++ DE     E ++ +E KK EE KK ++ KK EEKKK  E ++
Sbjct: 1503 KKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKK 1553



 Score = 29.7 bits (66), Expect = 0.95
 Identities = 21/56 (37%), Positives = 33/56 (58%), Gaps = 3/56 (5%)

Query: 10   SKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKK---EKKKKEEEKKKEGEKRR 62
            +K K  + KK+ E+++     +++ EE KK +E KK   E KK EE KKK  E ++
Sbjct: 1416 AKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKK 1471



 Score = 29.7 bits (66), Expect = 0.98
 Identities = 18/59 (30%), Positives = 32/59 (54%), Gaps = 4/59 (6%)

Query: 8    QVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEK----KKKEEEKKKEGEKRR 62
            ++ K +   K K  E+ +    ++++ EE KK EE +K+     KK+ EE KK  E ++
Sbjct: 1651 ELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKK 1709



 Score = 29.3 bits (65), Expect = 1.2
 Identities = 19/47 (40%), Positives = 30/47 (63%), Gaps = 1/47 (2%)

Query: 17   KKKEEEDEEGGGGEEEEEEEEKKKEEKKK-EKKKKEEEKKKEGEKRR 62
            KKK EE ++    +++ EE +K  E KKK E+ KK +E KK+ E+ +
Sbjct: 1450 KKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAK 1496



 Score = 29.3 bits (65), Expect = 1.5
 Identities = 19/73 (26%), Positives = 34/73 (46%)

Query: 11   KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIG 70
            K    KKK EE  +     ++ EE ++K  E KK  + KK+ ++ K+ E+ +     +  
Sbjct: 1471 KADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKA 1530

Query: 71   KPALKPRPTRCVE 83
            + A K    +  E
Sbjct: 1531 EEAKKADEAKKAE 1543



 Score = 29.0 bits (64), Expect = 1.6
 Identities = 17/53 (32%), Positives = 30/53 (56%), Gaps = 3/53 (5%)

Query: 13   KRRKKKKEEEDEEGGGGEEEEEEEEKKKEE---KKKEKKKKEEEKKKEGEKRR 62
            K+  + K+    +    E +++ EEKKK +   KK E+ KK +E KK+ E+ +
Sbjct: 1405 KKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAK 1457



 Score = 29.0 bits (64), Expect = 1.8
 Identities = 19/53 (35%), Positives = 32/53 (60%), Gaps = 3/53 (5%)

Query: 13   KRRKKKKEEEDEEGGGGEEEEEEEEKKKEE---KKKEKKKKEEEKKKEGEKRR 62
            + RK  + ++ EE    +E ++ EEKKK +   KK E+ KK +E KK+ E+ +
Sbjct: 1276 EARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAK 1328



 Score = 28.6 bits (63), Expect = 2.2
 Identities = 21/67 (31%), Positives = 35/67 (52%), Gaps = 2/67 (2%)

Query: 11   KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIG 70
            K  + ++ K+ E+ +    E ++ EEEKKK E+   KKK+ EEKKK  E ++     +I 
Sbjct: 1605 KKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQL--KKKEAEEKKKAEELKKAEEENKIK 1662

Query: 71   KPALKPR 77
                  +
Sbjct: 1663 AAEEAKK 1669



 Score = 28.6 bits (63), Expect = 2.4
 Identities = 20/50 (40%), Positives = 30/50 (60%), Gaps = 2/50 (4%)

Query: 15   RKKKKEEEDEEGGGGEEEEEEEEKKKEEKKK--EKKKKEEEKKKEGEKRR 62
            + ++  + DE     E+++ +E KK EEKKK  E KKK EE KK  E ++
Sbjct: 1273 KAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKK 1322



 Score = 28.6 bits (63), Expect = 2.5
 Identities = 19/54 (35%), Positives = 32/54 (59%), Gaps = 4/54 (7%)

Query: 13   KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEK----KKKEEEKKKEGEKRR 62
            K+ ++KK+ ++ +    E ++ +E KKK E+ K+K    KKK EE KK  E  +
Sbjct: 1296 KKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAK 1349



 Score = 28.2 bits (62), Expect = 2.7
 Identities = 15/50 (30%), Positives = 29/50 (58%)

Query: 13   KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
             ++  + ++ +E+    E ++ EE KK EEKKK ++ K+ E+ K    R+
Sbjct: 1533 AKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRK 1582



 Score = 28.2 bits (62), Expect = 2.8
 Identities = 17/55 (30%), Positives = 33/55 (60%)

Query: 8    QVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
            +++ + RR+   + E+       ++ EE++K  E KK E+KKK +E KK+ E+ +
Sbjct: 1261 RMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAK 1315



 Score = 27.8 bits (61), Expect = 4.0
 Identities = 19/54 (35%), Positives = 29/54 (53%), Gaps = 4/54 (7%)

Query: 13   KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKE----KKKKEEEKKKEGEKRR 62
            K  +KKK +E ++     ++ +E +KK EE KK+    KKK EE KK     + 
Sbjct: 1297 KAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKA 1350



 Score = 27.4 bits (60), Expect = 5.1
 Identities = 16/65 (24%), Positives = 36/65 (55%)

Query: 13   KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIGKP 72
            +++K  + ++ EE    EE+++ EE KK E+ K    ++ E+ K+ E+ R    +++ + 
Sbjct: 1544 EKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEE 1603

Query: 73   ALKPR 77
              K +
Sbjct: 1604 EKKMK 1608



 Score = 27.4 bits (60), Expect = 5.4
 Identities = 20/55 (36%), Positives = 30/55 (54%), Gaps = 3/55 (5%)

Query: 11   KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKK---EKKKKEEEKKKEGEKRR 62
            K K  + KK EE ++    +++ EE +K  E KKK    KKK +  KKK  E ++
Sbjct: 1289 KKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKK 1343



 Score = 27.4 bits (60), Expect = 6.2
 Identities = 18/55 (32%), Positives = 30/55 (54%), Gaps = 4/55 (7%)

Query: 10   SKYKRRKKKKEEEDEEGGGGEEEEEEEE--KKKEEKKK--EKKKKEEEKKKEGEK 60
            +  +    +++ E  E    E +++ +   KK EEKKK  E KKK EE KK+ ++
Sbjct: 1355 AADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADE 1409



 Score = 27.4 bits (60), Expect = 6.3
 Identities = 21/68 (30%), Positives = 33/68 (48%), Gaps = 1/68 (1%)

Query: 14   RRKKKKEEEDEEGGGGEEEEEEEEKKKEE-KKKEKKKKEEEKKKEGEKRRTLTFIRIGKP 72
            R+ +   + +EE    E  + E+ KK E  KK E+ KK+ E+ K+ E+ R    IR  + 
Sbjct: 1200 RKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEE 1259

Query: 73   ALKPRPTR 80
            A      R
Sbjct: 1260 ARMAHFAR 1267



 Score = 27.0 bits (59), Expect = 8.4
 Identities = 16/52 (30%), Positives = 32/52 (61%), Gaps = 2/52 (3%)

Query: 11   KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKK--EKKKKEEEKKKEGEK 60
            K K  +KKK +E ++    ++++ +E KK    KK  ++ KK+ E+KK+ ++
Sbjct: 1384 KKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADE 1435


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 36.7 bits (85), Expect = 0.004
 Identities = 17/66 (25%), Positives = 36/66 (54%), Gaps = 4/66 (6%)

Query: 1   LQNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKK----KEKKKKEEEKKK 56
            Q    +     ++RKKK++++ EE    +  E+E  K+ E+++    ++KK+ EE  K+
Sbjct: 67  QQQQQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQ 126

Query: 57  EGEKRR 62
              K++
Sbjct: 127 AALKQK 132



 Score = 29.8 bits (67), Expect = 0.87
 Identities = 12/43 (27%), Positives = 20/43 (46%)

Query: 17  KKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
           K K E + +      ++   E KK+ + +  KK   E KK+ E
Sbjct: 146 KAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAE 188



 Score = 29.4 bits (66), Expect = 0.94
 Identities = 10/49 (20%), Positives = 22/49 (44%)

Query: 11  KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
           +   +++KK+ E+       ++++ EE   +     K K E E K+   
Sbjct: 110 RLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAA 158



 Score = 28.6 bits (64), Expect = 1.8
 Identities = 15/41 (36%), Positives = 19/41 (46%)

Query: 16  KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKK 56
           K K E E +      ++   E KKK E +  KK   E KKK
Sbjct: 146 KAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKK 186



 Score = 27.5 bits (61), Expect = 4.1
 Identities = 16/46 (34%), Positives = 21/46 (45%), Gaps = 1/46 (2%)

Query: 16  KKKKEEEDEEGGGGEEEEE-EEEKKKEEKKKEKKKKEEEKKKEGEK 60
           KKK E E       E +++ E E KK+   + KKK   E K    K
Sbjct: 184 KKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAAK 229



 Score = 27.1 bits (60), Expect = 5.8
 Identities = 15/49 (30%), Positives = 21/49 (42%), Gaps = 4/49 (8%)

Query: 11  KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
           K     KKK E +      ++   E +KK E +   K   E +KK E E
Sbjct: 162 KAAAEAKKKAEAEAA----KKAAAEAKKKAEAEAAAKAAAEAKKKAEAE 206



 Score = 27.1 bits (60), Expect = 5.9
 Identities = 11/59 (18%), Positives = 26/59 (44%)

Query: 1   LQNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
           LQ     +  + K+ +K++    E+    EE  ++   K+++ ++   K     K + E
Sbjct: 92  LQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAE 150



 Score = 27.1 bits (60), Expect = 6.2
 Identities = 14/45 (31%), Positives = 19/45 (42%), Gaps = 4/45 (8%)

Query: 16  KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
           KKK E E ++    +   E ++K   E K    K   E K   EK
Sbjct: 200 KKKAEAEAKK----KAAAEAKKKAAAEAKAAAAKAAAEAKAAAEK 240


>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
          Length = 470

 Score = 36.2 bits (84), Expect = 0.005
 Identities = 24/114 (21%), Positives = 45/114 (39%), Gaps = 6/114 (5%)

Query: 2   QNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
           Q +      K  ++ KK  ++D+        E +E + K+  KK+KKKK+++KKK   + 
Sbjct: 15  QQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKKKKKNLGEA 74

Query: 62  RTLTFIRIGKPALKPR----PTRCVEGEGAPTRTLSVQ--IPLHRVHEQGVPPV 109
             L +      +          R +          +    IP+ +  + G  PV
Sbjct: 75  YDLAYDLPVVWSSAAFQDNSHIRKLGNWPEQEWKQTQPPTIPVSKQFKDGEYPV 128


>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
          Length = 1832

 Score = 36.6 bits (85), Expect = 0.005
 Identities = 17/30 (56%), Positives = 23/30 (76%), Gaps = 2/30 (6%)

Query: 33  EEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
           +  EEK K++KKKEKKK+EE K++  EK R
Sbjct: 739 DSVEEKTKKKKKKEKKKEEEYKRE--EKAR 766



 Score = 32.0 bits (73), Expect = 0.17
 Identities = 13/27 (48%), Positives = 23/27 (85%)

Query: 31  EEEEEEEKKKEEKKKEKKKKEEEKKKE 57
           EE+ +++KKKE+KK+E+ K+EE+ + E
Sbjct: 742 EEKTKKKKKKEKKKEEEYKREEKARIE 768



 Score = 30.8 bits (70), Expect = 0.46
 Identities = 20/67 (29%), Positives = 27/67 (40%), Gaps = 18/67 (26%)

Query: 11   KYKRRKKKKEEEDEEGGGGEEEEEEE------------EKKKEE------KKKEKKKKEE 52
            K    + + +EE E  G GE E ++E            EK  EE       KK K KK+ 
Sbjct: 1516 KSIEIENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAESDIKKRKNKKQY 1575

Query: 53   EKKKEGE 59
            +   E E
Sbjct: 1576 KSNTEAE 1582


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 36.0 bits (83), Expect = 0.006
 Identities = 21/60 (35%), Positives = 39/60 (65%)

Query: 3   NSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
                QV + K++KK+K +E+ +    +EE +E+   KE++K+++KK EE + +E EK+R
Sbjct: 107 EKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKR 166



 Score = 34.5 bits (79), Expect = 0.024
 Identities = 20/76 (26%), Positives = 43/76 (56%)

Query: 5   STTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTL 64
           + T+ +K  + +  KEEE E+    EE+++++EK KEE K  K K+E ++K+  +++   
Sbjct: 90  AKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKE 149

Query: 65  TFIRIGKPALKPRPTR 80
              ++ +P  +    +
Sbjct: 150 KEKKVEEPRDREEEKK 165



 Score = 34.1 bits (78), Expect = 0.033
 Identities = 22/58 (37%), Positives = 34/58 (58%)

Query: 5   STTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
           S    +K K  K+ K E  +E    +E+ +EE+KKK+EK KE+ K  + K++  EKR 
Sbjct: 85  SKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRP 142



 Score = 32.6 bits (74), Expect = 0.095
 Identities = 20/67 (29%), Positives = 31/67 (46%)

Query: 13  KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIGKP 72
           KR +K   +         +E + E  K+EEK+KE+ K+E++KKKE  K          + 
Sbjct: 78  KRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEA 137

Query: 73  ALKPRPT 79
             K  P 
Sbjct: 138 KEKRPPK 144



 Score = 32.2 bits (73), Expect = 0.12
 Identities = 14/57 (24%), Positives = 34/57 (59%)

Query: 6   TTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
                   + K  KE ++E G   E+E+E+ +++K++KK++ K++ +++K + E + 
Sbjct: 83  GGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKE 139



 Score = 31.8 bits (72), Expect = 0.19
 Identities = 19/55 (34%), Positives = 27/55 (49%), Gaps = 3/55 (5%)

Query: 10  SKYKRRKKKKEEEDEEGGGGEEEEE---EEEKKKEEKKKEKKKKEEEKKKEGEKR 61
            K   ++K+KE+E +     + EEE   E  + K   KK  KKK   KKKE  + 
Sbjct: 139 EKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEE 193



 Score = 27.9 bits (62), Expect = 3.6
 Identities = 14/56 (25%), Positives = 33/56 (58%)

Query: 8   QVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
           +V K   +    + +  +    E  +EEE++K++ K+++KKKKE+ K++  +++  
Sbjct: 79  RVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPK 134


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 36.1 bits (84), Expect = 0.007
 Identities = 17/47 (36%), Positives = 26/47 (55%), Gaps = 2/47 (4%)

Query: 18  KKEEEDEEGGGGEEEEEEEEK--KKEEKKKEKKKKEEEKKKEGEKRR 62
             E E+EEG  G     E +K  KK+ K ++K +KEE +K   +K+ 
Sbjct: 393 LAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKA 439



 Score = 36.1 bits (84), Expect = 0.007
 Identities = 16/47 (34%), Positives = 23/47 (48%), Gaps = 5/47 (10%)

Query: 13  KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
           K+RK +K+ E EE      E+   +KK E   K+ K  + E KK   
Sbjct: 417 KQRKAEKKAEKEE-----AEKAAAKKKAEAAAKKAKGPDGETKKVDP 458


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 35.7 bits (83), Expect = 0.007
 Identities = 18/66 (27%), Positives = 39/66 (59%), Gaps = 11/66 (16%)

Query: 8   QVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKK-----------KKEEEKKK 56
            + K  + ++++EE+  +    E +EE +EKK+E+KK+E++           +K EEK++
Sbjct: 257 VLRKVDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKER 316

Query: 57  EGEKRR 62
           + + R+
Sbjct: 317 KKQARK 322



 Score = 31.1 bits (71), Expect = 0.24
 Identities = 15/39 (38%), Positives = 26/39 (66%), Gaps = 3/39 (7%)

Query: 30  EEEEEEEEKKKEEKKKE---KKKKEEEKKKEGEKRRTLT 65
           EEEEE+  K  EE+++E   +KK+E++K++   K   L+
Sbjct: 266 EEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLS 304


>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
          Length = 106

 Score = 33.8 bits (78), Expect = 0.013
 Identities = 11/30 (36%), Positives = 19/30 (63%)

Query: 29  GEEEEEEEEKKKEEKKKEKKKKEEEKKKEG 58
                  EEKK+EE+++E+K++ EE+   G
Sbjct: 71  AAAAAAAEEKKEEEEEEEEKEESEEEAAAG 100



 Score = 31.8 bits (73), Expect = 0.055
 Identities = 10/26 (38%), Positives = 18/26 (69%)

Query: 28 GGEEEEEEEEKKKEEKKKEKKKKEEE 53
                 EE+K++EE+++EK++ EEE
Sbjct: 71 AAAAAAAEEKKEEEEEEEEKEESEEE 96



 Score = 29.2 bits (66), Expect = 0.52
 Identities = 6/26 (23%), Positives = 18/26 (69%)

Query: 27 GGGEEEEEEEEKKKEEKKKEKKKKEE 52
                 EE+++++EE++++++ +EE
Sbjct: 71 AAAAAAAEEKKEEEEEEEEKEESEEE 96



 Score = 28.8 bits (65), Expect = 0.76
 Identities = 9/26 (34%), Positives = 17/26 (65%)

Query: 34 EEEEKKKEEKKKEKKKKEEEKKKEGE 59
                 EEKK+E++++EE+++ E E
Sbjct: 71 AAAAAAAEEKKEEEEEEEEKEESEEE 96



 Score = 28.0 bits (63), Expect = 1.3
 Identities = 8/25 (32%), Positives = 16/25 (64%)

Query: 22 EDEEGGGGEEEEEEEEKKKEEKKKE 46
                  E++EEEEE++++E+ +E
Sbjct: 71 AAAAAAAEEKKEEEEEEEEKEESEE 95



 Score = 27.2 bits (61), Expect = 2.1
 Identities = 12/25 (48%), Positives = 16/25 (64%), Gaps = 4/25 (16%)

Query: 18 KKEEEDEEGGGGEEEEEEEEKKKEE 42
             EE +E    EEEEEEE+++ EE
Sbjct: 75 AAAEEKKE----EEEEEEEKEESEE 95



 Score = 26.8 bits (60), Expect = 3.4
 Identities = 10/26 (38%), Positives = 16/26 (61%)

Query: 21 EEDEEGGGGEEEEEEEEKKKEEKKKE 46
                   ++EEEEEE++KEE ++E
Sbjct: 71 AAAAAAAEEKKEEEEEEEEKEESEEE 96



 Score = 26.1 bits (58), Expect = 6.5
 Identities = 11/23 (47%), Positives = 14/23 (60%)

Query: 20 EEEDEEGGGGEEEEEEEEKKKEE 42
              EE    EEEEEE+E+ +EE
Sbjct: 74 AAAAEEKKEEEEEEEEKEESEEE 96


>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain.  This is a family of proteins of
          approximately 300 residues, found in plants and
          vertebrates. They contain a highly conserved DDRGK
          motif.
          Length = 189

 Score = 34.7 bits (80), Expect = 0.013
 Identities = 21/62 (33%), Positives = 38/62 (61%)

Query: 8  QVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFI 67
          ++   KR K ++++   +    EEEE EE KK EEK++ ++K+EEE ++E EK++     
Sbjct: 2  KIGAKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEER 61

Query: 68 RI 69
          + 
Sbjct: 62 KE 63



 Score = 34.3 bits (79), Expect = 0.018
 Identities = 22/48 (45%), Positives = 37/48 (77%)

Query: 12 YKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
           +R ++KK EE  EG   EEEE EEE++K+++++E+K++EE+ +KE E
Sbjct: 26 EEREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQE 73



 Score = 29.7 bits (67), Expect = 0.59
 Identities = 22/52 (42%), Positives = 37/52 (71%), Gaps = 2/52 (3%)

Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEK--KKEKKKKEEEKKKEGEKRR 62
          +++++ +EEE EE    EE+ E E K++EE   ++EKKK+EEE+K+  E+ R
Sbjct: 18 RQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQAR 69



 Score = 29.3 bits (66), Expect = 0.82
 Identities = 16/43 (37%), Positives = 31/43 (72%)

Query: 15 RKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
          R+ +++EE+E     E+++EEEE+K+ E++  K+++E EK K 
Sbjct: 38 REGERKEEEELEEEREKKKEEEERKEREEQARKEQEEYEKLKS 80



 Score = 28.1 bits (63), Expect = 2.3
 Identities = 14/51 (27%), Positives = 29/51 (56%)

Query: 9  VSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
            + K  ++ +EE +++    E +E EE+ +KE+++ EK K     ++EG 
Sbjct: 39 EGERKEEEELEEEREKKKEEEERKEREEQARKEQEEYEKLKSSFVVEEEGT 89


>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
           includes archaeal L12p, the protein that is functionally
           equivalent to L7/L12 in bacteria and the P1 and P2
           proteins in eukaryotes. L12p is homologous to P1 and P2
           but is not homologous to bacterial L7/L12. It is located
           in the L12 stalk, with proteins L10, L11, and 23S rRNA.
           L12p is the only protein in the ribosome to occur as
           multimers, always appearing as sets of dimers. Recent
           data indicate that most archaeal species contain six
           copies of L12p (three homodimers), while eukaryotes have
           four copies (two heterodimers), and bacteria may have
           four or six copies (two or three homodimers), depending
           on the species. The organization of proteins within the
           stalk has been characterized primarily in bacteria,
           where L7/L12 forms either two or three homodimers and
           each homodimer binds to the extended C-terminal helix of
           L10. L7/L12 is attached to the ribosome through L10 and
           is the only ribosomal protein that does not directly
           interact with rRNA. Archaeal L12p is believed to
           function in a similar fashion. However, hybrid ribosomes
           containing the large subunit from E. coli with an
           archaeal stalk are able to bind archaeal and eukaryotic
           elongation factors but not bacterial elongation factors.
           In several mesophilic and thermophilic archaeal species,
           the binding of 23S rRNA to protein L11 and to the
           L10/L12p pentameric complex was found to be
           temperature-dependent and cooperative.
          Length = 106

 Score = 33.6 bits (77), Expect = 0.015
 Identities = 14/29 (48%), Positives = 23/29 (79%)

Query: 30  EEEEEEEEKKKEEKKKEKKKKEEEKKKEG 58
             EE+ EEK++E+KK+E+K++EEE+   G
Sbjct: 73  AAEEKAEEKEEEKKKEEEKEEEEEEALAG 101



 Score = 31.3 bits (71), Expect = 0.083
 Identities = 14/24 (58%), Positives = 20/24 (83%)

Query: 30 EEEEEEEEKKKEEKKKEKKKKEEE 53
             EE+ E+K+EEKKKE++K+EEE
Sbjct: 72 AAAEEKAEEKEEEKKKEEEKEEEE 95



 Score = 30.5 bits (69), Expect = 0.16
 Identities = 13/26 (50%), Positives = 20/26 (76%)

Query: 34 EEEEKKKEEKKKEKKKKEEEKKKEGE 59
              ++K E+K+E+KKKEEEK++E E
Sbjct: 71 AAAAEEKAEEKEEEKKKEEEKEEEEE 96



 Score = 30.2 bits (68), Expect = 0.24
 Identities = 13/27 (48%), Positives = 21/27 (77%)

Query: 33 EEEEEKKKEEKKKEKKKKEEEKKKEGE 59
              E+K EEK++EKKK+EE++++E E
Sbjct: 71 AAAAEEKAEEKEEEKKKEEEKEEEEEE 97


>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR).  This
           family consists of several bovine specific leukaemia
           virus receptors which are thought to function as
           transmembrane proteins, although their exact function is
           unknown.
          Length = 561

 Score = 34.7 bits (79), Expect = 0.018
 Identities = 24/89 (26%), Positives = 44/89 (49%), Gaps = 2/89 (2%)

Query: 1   LQNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
           L +S    V K++  +  K  E  +    E++ ++   KK+EKK+++K+++++KKKE E 
Sbjct: 167 LADSEKLPVQKHRNAETSKSPEKGDVPAVEKKSKK--PKKKEKKEKEKERDKDKKKEVEG 224

Query: 61  RRTLTFIRIGKPALKPRPTRCVEGEGAPT 89
            ++L       PA         E   A T
Sbjct: 225 FKSLLLALDDSPASAASVAEADEASLANT 253



 Score = 31.6 bits (71), Expect = 0.21
 Identities = 12/31 (38%), Positives = 19/31 (61%)

Query: 31  EEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
           EEE    ++ E+ K+EKKK+E+EK+      
Sbjct: 85  EEERRHRQRLEKDKREKKKREKEKRGRRRHH 115



 Score = 30.0 bits (67), Expect = 0.78
 Identities = 12/33 (36%), Positives = 21/33 (63%)

Query: 32  EEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTL 64
           EEE   +++ EK K +KKK E++K+   +  +L
Sbjct: 85  EEERRHRQRLEKDKREKKKREKEKRGRRRHHSL 117



 Score = 27.7 bits (61), Expect = 4.2
 Identities = 14/46 (30%), Positives = 25/46 (54%)

Query: 17  KKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
               + + +     E EE ++  K +KKK++K+KEE+KKK+     
Sbjct: 256 GTAPDSEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHH 301


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 34.7 bits (80), Expect = 0.019
 Identities = 22/81 (27%), Positives = 36/81 (44%)

Query: 3   NSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
             S   +S +   + K      +    E++  E +KK E+KKK+KK+K+E K +   K  
Sbjct: 33  VLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLG 92

Query: 63  TLTFIRIGKPALKPRPTRCVE 83
             T  +  K   KP   +  E
Sbjct: 93  FKTPKKSKKTKKKPPKPKPNE 113



 Score = 32.8 bits (75), Expect = 0.070
 Identities = 18/79 (22%), Positives = 35/79 (44%)

Query: 3   NSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
             ST    + K      +++ +E    E +++ E+KKK++K+K++ K E E K   +  +
Sbjct: 38  ILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPK 97

Query: 63  TLTFIRIGKPALKPRPTRC 81
                +   P  KP     
Sbjct: 98  KSKKTKKKPPKPKPNEDVD 116



 Score = 28.5 bits (64), Expect = 1.9
 Identities = 15/54 (27%), Positives = 26/54 (48%)

Query: 10  SKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
            K +++KKKK+E+ E    GE +   +  KK +K K+K  K +  +        
Sbjct: 68  KKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAFNK 121


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 34.6 bits (80), Expect = 0.020
 Identities = 16/37 (43%), Positives = 28/37 (75%)

Query: 20  EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKK 56
           EEEDEE    ++E++EEE+++E++KK+KK  E  + +
Sbjct: 354 EEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSE 390



 Score = 33.8 bits (78), Expect = 0.042
 Identities = 13/38 (34%), Positives = 27/38 (71%)

Query: 19  KEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKK 56
           +EE++E+    +E++EEEE++++EKKK+K  +    + 
Sbjct: 354 EEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSEL 391



 Score = 31.1 bits (71), Expect = 0.30
 Identities = 15/38 (39%), Positives = 28/38 (73%)

Query: 20  EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
           +EE++E     ++E++EE+++EEK+K+KKK  E  + E
Sbjct: 353 DEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSE 390



 Score = 30.0 bits (68), Expect = 0.79
 Identities = 13/47 (27%), Positives = 29/47 (61%)

Query: 20  EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTF 66
           E+ D      +EE+E+ + + +E+++E++K++++KK     R  L F
Sbjct: 347 EDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSELPF 393


>gnl|CDD|240576 cd12932, RRP7_like, RRP7 domain ribosomal RNA-processing protein 7
           (Rrp7p), ribosomal RNA-processing protein 7 homolog A
           (Rrp7A), and similar proteins.  This CD corresponds to
           the RRP7 domain of Rrp7p and Rrp7A. Rrp7p is encoded by
           YCL031C gene from Saccharomyces cerevisiae. It is an
           essential yeast protein involved in pre-rRNA processing
           and ribosome assembly, and is speculated to be required
           for correct assembly of rpS27 into the pre-ribosomal
           particle. Rrp7A, also termed gastric cancer antigen
           Zg14, is the Rrp7p homolog mainly found in Metazoans.
           The cellular function of Rrp7A remains unclear
           currently. Both Rrp7p and Rrp7A harbor an N-terminal RNA
           recognition motif (RRM), also termed RBD (RNA binding
           domain) or RNP (ribonucleoprotein domain), and a
           C-terminal RRP7 domain.
          Length = 118

 Score = 33.4 bits (77), Expect = 0.021
 Identities = 21/65 (32%), Positives = 32/65 (49%), Gaps = 20/65 (30%)

Query: 13  KRRKKKKEEEDEEG------------GGGEEEEEEEEKKKEEKKKEKKKKE--------E 52
           +  K+ + E DE+G               EE  E + K+KE+KKK+KK+ E        E
Sbjct: 39  EEAKEARNEPDEDGFVTVTRGGRKGKTAREEAVEAKAKEKEKKKKKKKELEDFYRFQIRE 98

Query: 53  EKKKE 57
           +KK+E
Sbjct: 99  KKKEE 103


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
           family consists of several hypothetical bacterial
           proteins of around 200 residues in length. The function
           of this family is unknown.
          Length = 214

 Score = 33.9 bits (78), Expect = 0.022
 Identities = 13/55 (23%), Positives = 24/55 (43%)

Query: 5   STTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
                    +   + EE  EE       E++E+K   EK+ E+ ++E E++ E  
Sbjct: 48  EQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEES 102



 Score = 30.1 bits (68), Expect = 0.45
 Identities = 20/64 (31%), Positives = 32/64 (50%), Gaps = 6/64 (9%)

Query: 16  KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIGKPALK 75
            + KE++ +     EE EEE E++ EE   E +K+ EEK +   ++       I  P+ K
Sbjct: 75  SEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTESNVEK------EITNPSWK 128

Query: 76  PRPT 79
           P  T
Sbjct: 129 PVGT 132



 Score = 30.1 bits (68), Expect = 0.51
 Identities = 15/47 (31%), Positives = 31/47 (65%), Gaps = 2/47 (4%)

Query: 16  KKKKEEEDEEGGGGEEEEEE--EEKKKEEKKKEKKKKEEEKKKEGEK 60
            ++ +EE++E    E++E++   EK+ EE ++E ++++EE   E EK
Sbjct: 62  IEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEK 108



 Score = 29.7 bits (67), Expect = 0.61
 Identities = 14/56 (25%), Positives = 27/56 (48%)

Query: 5  STTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
          +     + K+   ++  E EE    E+E    E K+++   EK+ +E E++ E E 
Sbjct: 44 AAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEED 99



 Score = 29.3 bits (66), Expect = 0.75
 Identities = 11/55 (20%), Positives = 30/55 (54%)

Query: 3   NSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
           +    + S  +   + +E ++EE      E++E++   E++ +E +++ EE+ +E
Sbjct: 47  DEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEE 101



 Score = 28.5 bits (64), Expect = 1.7
 Identities = 11/59 (18%), Positives = 30/59 (50%)

Query: 2   QNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
             +   +  K   ++  + EE +E        E++E K + +K++++ +EE ++++ E 
Sbjct: 44  AAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEES 102



 Score = 28.5 bits (64), Expect = 1.7
 Identities = 13/45 (28%), Positives = 24/45 (53%)

Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
           +   +E E     ++E  E E+ KEE+K+    +++E K + EK
Sbjct: 42 DQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEK 86



 Score = 28.2 bits (63), Expect = 2.3
 Identities = 16/59 (27%), Positives = 33/59 (55%), Gaps = 4/59 (6%)

Query: 2  QNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
          Q ++  Q +K    ++  E E+ +    EEE+E    + +E K + +K++EE ++E E+
Sbjct: 43 QAAADEQEAKKSDDQETAEIEEVK----EEEKEAANSEDKEDKGDAEKEDEESEEENEE 97



 Score = 27.4 bits (61), Expect = 3.3
 Identities = 16/54 (29%), Positives = 25/54 (46%)

Query: 3   NSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKK 56
            +S  +  K    K+ +E E+E     EE  +E EK+ EEK +   +KE     
Sbjct: 73  ANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTESNVEKEITNPS 126


>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
           RPA34.5.  This is a family of proteins conserved from
           yeasts to human. Subunit A34.5 of RNA polymerase I is a
           non-essential subunit which is thought to help Pol I
           overcome topological constraints imposed on ribosomal
           DNA during the process of transcription.
          Length = 193

 Score = 33.9 bits (78), Expect = 0.023
 Identities = 22/52 (42%), Positives = 32/52 (61%)

Query: 10  SKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
           S+     +  E+E       E E EEEEKK+++KKKE KK+++EKK + EK 
Sbjct: 126 SELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKM 177



 Score = 30.8 bits (70), Expect = 0.22
 Identities = 17/61 (27%), Positives = 39/61 (63%)

Query: 2   QNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
           + S     +K ++  + +EEE +E    +E ++E+++KK++K+K  + K  +KKK+ +K+
Sbjct: 133 ETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKK 192

Query: 62  R 62
           +
Sbjct: 193 K 193



 Score = 30.1 bits (68), Expect = 0.39
 Identities = 23/53 (43%), Positives = 35/53 (66%), Gaps = 4/53 (7%)

Query: 13  KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLT 65
           K    K E+E E     EEEE++E+KKK+E KKEKK+K+++K+K  E + +  
Sbjct: 137 KETTAKVEKEAEV----EEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKK 185



 Score = 29.7 bits (67), Expect = 0.53
 Identities = 17/61 (27%), Positives = 38/61 (62%)

Query: 2   QNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
             S T++     + +K+ E E+EE    ++++E +++KKE+K K++K  E +  K+ +K+
Sbjct: 130 SESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKK 189

Query: 62  R 62
           +
Sbjct: 190 K 190



 Score = 29.3 bits (66), Expect = 0.77
 Identities = 17/48 (35%), Positives = 29/48 (60%), Gaps = 3/48 (6%)

Query: 18  KKEEEDEEGGGGEE---EEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
            +   + E    E     E+E E ++EEKK++KKKKE +K+K+ +K +
Sbjct: 126 SELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDK 173



 Score = 28.1 bits (63), Expect = 1.8
 Identities = 13/40 (32%), Positives = 20/40 (50%)

Query: 21  EEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
             D        E E  EK+   K +++ + EEE+KKE +K
Sbjct: 120 APDGPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKK 159



 Score = 27.0 bits (60), Expect = 5.3
 Identities = 11/41 (26%), Positives = 23/41 (56%)

Query: 20  EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
            +      G E E  E+E   + +K+ + ++EE+K+K+ +K
Sbjct: 121 PDGPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKK 161


>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P.  This model
          represents the L12P protein of the large (50S) subunit
          of the archaeal ribosome.
          Length = 105

 Score = 32.3 bits (74), Expect = 0.033
 Identities = 9/33 (27%), Positives = 20/33 (60%)

Query: 26 GGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEG 58
                   EEE+++EE+++E++++ EE+   G
Sbjct: 67 AAAAAAAAAEEEEEEEEEEEEEEEESEEEAMAG 99



 Score = 32.3 bits (74), Expect = 0.042
 Identities = 7/28 (25%), Positives = 17/28 (60%)

Query: 26 GGGGEEEEEEEEKKKEEKKKEKKKKEEE 53
                    EE+++EE+++E++++E E
Sbjct: 66 AAAAAAAAAAEEEEEEEEEEEEEEEESE 93



 Score = 28.9 bits (65), Expect = 0.55
 Identities = 6/33 (18%), Positives = 19/33 (57%)

Query: 27 GGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
                    E+++EE+++E++++EE +++   
Sbjct: 66 AAAAAAAAAAEEEEEEEEEEEEEEEESEEEAMA 98



 Score = 26.2 bits (58), Expect = 6.2
 Identities = 13/25 (52%), Positives = 17/25 (68%), Gaps = 4/25 (16%)

Query: 18 KKEEEDEEGGGGEEEEEEEEKKKEE 42
             EE+EE    EEEEEEEE+++ E
Sbjct: 73 AAAEEEEE----EEEEEEEEEEESE 93



 Score = 25.8 bits (57), Expect = 8.5
 Identities = 12/23 (52%), Positives = 14/23 (60%)

Query: 20 EEEDEEGGGGEEEEEEEEKKKEE 42
              EE    EEEEEEEE++ EE
Sbjct: 72 AAAAEEEEEEEEEEEEEEEESEE 94


>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein.  The YqfQ-like protein family
           includes the B. subtilis YqfQ protein, also known as
           VrrA, which is functionally uncharacterized. This family
           of proteins is found in bacteria. Proteins in this
           family are typically between 146 and 237 amino acids in
           length. There are two conserved sequence motifs: QYGP
           and PKLY.
          Length = 155

 Score = 32.8 bits (75), Expect = 0.043
 Identities = 14/43 (32%), Positives = 22/43 (51%)

Query: 14  RRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKK 56
           R     ++E+EE      +E E+E   E K + K+KK+ E  K
Sbjct: 89  RELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPK 131



 Score = 32.4 bits (74), Expect = 0.056
 Identities = 11/43 (25%), Positives = 23/43 (53%)

Query: 15  RKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
           R+    +++EE    E  +E E++   E K E K+K++ +  +
Sbjct: 89  RELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPK 131



 Score = 32.0 bits (73), Expect = 0.075
 Identities = 18/66 (27%), Positives = 35/66 (53%), Gaps = 1/66 (1%)

Query: 12  YKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIGK 71
           ++      +EE+E      +E E+E+  + + + ++KKK E  K + EK +  T  +  K
Sbjct: 88  FRELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPK 147

Query: 72  PALKPR 77
           P+ KP+
Sbjct: 148 PS-KPK 152



 Score = 28.6 bits (64), Expect = 1.2
 Identities = 11/53 (20%), Positives = 28/53 (52%)

Query: 28  GGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIGKPALKPRPTR 80
             ++EEEE E++  ++ +++   E + + + +K+R +   +  K   K  P +
Sbjct: 93  SSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKK 145


>gnl|CDD|185618 PTZ00438, PTZ00438, gamete antigen 27/25-like protein; Provisional.
          Length = 374

 Score = 33.5 bits (76), Expect = 0.044
 Identities = 19/59 (32%), Positives = 34/59 (57%)

Query: 1   LQNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
           L+  +    ++ +R  +K+EEEDE+    EE EE E  ++E    E  +K++EK+ + E
Sbjct: 90  LEGLNIIVKNEEERGTQKEEEEDEDVEEIEEVEEVEVVEEEYDDDEDSEKDDEKESDAE 148


>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex.  This
          entry is characterized by proteins with alternating
          conserved and low-complexity regions. Bud13 together
          with Snu17p and a newly identified factor,
          Pml1p/Ylr016c, form a novel trimeric complex. called
          The RES complex, pre-mRNA retention and splicing
          complex. Subunits of this complex are not essential for
          viability of yeasts but they are required for efficient
          splicing in vitro and in vivo. Furthermore,
          inactivation of this complex causes pre-mRNA leakage
          from the nucleus. Bud13 contains a unique,
          phylogenetically conserved C-terminal region of unknown
          function.
          Length = 141

 Score = 32.7 bits (75), Expect = 0.045
 Identities = 17/38 (44%), Positives = 26/38 (68%)

Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKK 50
          K  K++KEE+++E G G  ++EE EK+ EE +K K K 
Sbjct: 25 KEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKP 62



 Score = 30.4 bits (69), Expect = 0.25
 Identities = 18/46 (39%), Positives = 31/46 (67%), Gaps = 4/46 (8%)

Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
          +K++E+E E+    EE+E +EEK+KE  K   +K+E EK+ E  ++
Sbjct: 16 EKREEKEREK----EEKERKEEKEKEWGKGLVQKEEREKRLEELEK 57



 Score = 30.4 bits (69), Expect = 0.30
 Identities = 16/52 (30%), Positives = 32/52 (61%), Gaps = 4/52 (7%)

Query: 9  VSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
          + + +  K++++EE E     +EE+E+E  K   +K+E++K+ EE +K   K
Sbjct: 14 IEEKREEKEREKEEKER----KEEKEKEWGKGLVQKEEREKRLEELEKAKNK 61



 Score = 27.7 bits (62), Expect = 2.2
 Identities = 13/40 (32%), Positives = 30/40 (75%)

Query: 23 DEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
          D+ G   + EE+ EEK++E+++KE+K+++E++  +G  ++
Sbjct: 6  DKSGRIIDIEEKREEKEREKEEKERKEEKEKEWGKGLVQK 45


>gnl|CDD|240577 cd12950, RRP7_Rrp7p, RRP7 domain ribosomal RNA-processing protein 7
           (Rrp7p) and similar proteins.  This CD corresponds to
           the RRP7 domain of Rrp7p. Rrp7p is encoded by YCL031C
           gene from Saccharomyces cerevisiae. It is an essential
           yeast protein involved in pre-rRNA processing and
           ribosome assembly. Rrp7p contains an N-terminal RNA
           recognition motif (RRM), also termed RBD (RNA binding
           domain) or RNP (ribonucleoprotein domain), and a
           C-terminal RRP7 domain.
          Length = 128

 Score = 32.3 bits (74), Expect = 0.048
 Identities = 24/68 (35%), Positives = 35/68 (51%), Gaps = 23/68 (33%)

Query: 13  KRRKKKKEEEDEEG-------------GGGEEEEEEEEKKKEEKKKEKKKKE-------- 51
           + RK+ + E DE+G              G E  +  EE+KKE++KK+KKKKE        
Sbjct: 39  RERKRLRNEPDEDGFVTVVRGGRKGPAAGEEAGKAAEEEKKEKEKKKKKKKELEDFYRFQ 98

Query: 52  --EEKKKE 57
             E+KK+E
Sbjct: 99  LREKKKEE 106



 Score = 25.7 bits (57), Expect = 9.6
 Identities = 20/67 (29%), Positives = 35/67 (52%), Gaps = 10/67 (14%)

Query: 6   TTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKK-------KEKKKKEEE---KK 55
            T V   ++     EE  +     ++E+E+++KKK+E +       +EKKK+E+    KK
Sbjct: 54  VTVVRGGRKGPAAGEEAGKAAEEEKKEKEKKKKKKKELEDFYRFQLREKKKEEQADLLKK 113

Query: 56  KEGEKRR 62
            E +K R
Sbjct: 114 FEEDKER 120


>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.
          This domain is found to the N-terminus of bacterial
          signal peptidases of the S49 family (pfam01343).
          Length = 154

 Score = 32.5 bits (75), Expect = 0.051
 Identities = 18/53 (33%), Positives = 30/53 (56%), Gaps = 6/53 (11%)

Query: 11 KYKRRKKKKEEE--DEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
          +YK  K+  E    D++    E +  E+ +KK EK K K +K++ KK+E + R
Sbjct: 50 EYKDLKESLEAALLDKK----ELKAWEKAEKKAEKAKAKAEKKKAKKEEPKPR 98



 Score = 27.5 bits (62), Expect = 2.5
 Identities = 10/28 (35%), Positives = 16/28 (57%), Gaps = 2/28 (7%)

Query: 35 EEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
          ++E K  EK   +KK E+ K K  +K+ 
Sbjct: 65 KKELKAWEK--AEKKAEKAKAKAEKKKA 90


>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase;
          Provisional.
          Length = 330

 Score = 33.3 bits (77), Expect = 0.054
 Identities = 12/54 (22%), Positives = 26/54 (48%), Gaps = 9/54 (16%)

Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTL 64
          +YK  K++ +             +++E K   K ++KK+K+E K  + + +  L
Sbjct: 48 QYKEMKEELKAA---------LLDKKELKAWHKAQKKKEKQEAKAAKAKSKPRL 92


>gnl|CDD|218747 pfam05786, Cnd2, Condensin complex subunit 2.  This family consists
           of several Barren protein homologues from several
           eukaryotic organisms. In Drosophila Barren (barr) is
           required for sister-chromatid segregation in mitosis.
           barr encodes a novel protein that is present in
           proliferating cells and has homologues in yeast and
           human. Mitotic defects in barr embryos become apparent
           during cycle 16, resulting in a loss of PNS and CNS
           neurons. Centromeres move apart at the
           metaphase-anaphase transition and Cyclin B is degraded,
           but sister chromatids remain connected, resulting in
           chromatin bridging. Barren protein localises to
           chromatin throughout mitosis. Colocalisation and
           biochemical experiments indicate that Barren associates
           with Topoisomerase II throughout mitosis and alters the
           activity of Topoisomerase II. It has been suggested that
           this association is required for proper chromosomal
           segregation by facilitating the decatenation of
           chromatids at anaphase. This family forms one of the
           three non-structural maintenance of chromosomes (SMC)
           subunits of the mitotic condensation complex along with
           Cnd1 and Cnd3.
          Length = 719

 Score = 33.2 bits (76), Expect = 0.055
 Identities = 15/45 (33%), Positives = 25/45 (55%)

Query: 13  KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
               K+  E   EG  GE+++EE    +E+  K+K KK+ ++K E
Sbjct: 151 DSDNKEAPETGREGDDGEDDDEEGSDGEEDGAKKKAKKKRQRKPE 195



 Score = 28.2 bits (63), Expect = 2.6
 Identities = 13/50 (26%), Positives = 26/50 (52%), Gaps = 2/50 (4%)

Query: 16  KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLT 65
              KE  +    G + E+++EE    E+   KKK +++++++ E   TL 
Sbjct: 152 SDNKEAPETGREGDDGEDDDEEGSDGEEDGAKKKAKKKRQRKPES--TLE 199



 Score = 27.0 bits (60), Expect = 6.0
 Identities = 12/38 (31%), Positives = 23/38 (60%)

Query: 20  EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
           + ED++  G + EE+  +KK ++K++ K +   EK  E
Sbjct: 166 DGEDDDEEGSDGEEDGAKKKAKKKRQRKPESTLEKNFE 203


>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
           domain.  This domain is found in a number of different
           types of plant proteins including NAM-like proteins.
          Length = 147

 Score = 32.3 bits (74), Expect = 0.057
 Identities = 17/49 (34%), Positives = 31/49 (63%)

Query: 15  RKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
           RKK KE+   +    ++EE E+EK+KEE+  +   + E+++ E EK++ 
Sbjct: 65  RKKAKEKLRRDKLKAKKEEAEKEKEKEERFMKALAEAEKERAELEKKKA 113


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
          subunit.  This is a family of proteins which are
          subunits of the eukaryotic translation initiation
          factor 3 (eIF3). In yeast it is called Hcr1. The
          Saccharomyces cerevisiae protein eIF3j (HCR1) has been
          shown to be required for processing of 20S pre-rRNA and
          binds to 18S rRNA and eIF3 subunits Rpg1p and Prt1p.
          Length = 242

 Score = 32.7 bits (75), Expect = 0.062
 Identities = 14/43 (32%), Positives = 25/43 (58%)

Query: 20 EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
          EEEDEE    + +   + K K+  K + ++KE+ K+++ EK  
Sbjct: 39 EEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGL 81



 Score = 30.4 bits (69), Expect = 0.41
 Identities = 11/43 (25%), Positives = 26/43 (60%)

Query: 20 EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
          +EE++E    E+ +   + K ++  K K +++E+ K+E E++ 
Sbjct: 38 DEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKG 80



 Score = 30.0 bits (68), Expect = 0.50
 Identities = 12/48 (25%), Positives = 26/48 (54%)

Query: 20 EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFI 67
          EE++E+     +   + + KK  K K ++K++ +++KE +  R L   
Sbjct: 40 EEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEED 87


>gnl|CDD|215595 PLN03130, PLN03130, ABC transporter C family member; Provisional.
          Length = 1622

 Score = 33.2 bits (76), Expect = 0.065
 Identities = 14/47 (29%), Positives = 19/47 (40%)

Query: 18  KKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTL 64
           K EE  EE G  E+++   +          KK    KKK  E +  L
Sbjct: 842 KMEEYVEENGEEEDDQTSSKPVANGNANNLKKDSSSKKKSKEGKSVL 888


>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
          Length = 434

 Score = 33.0 bits (76), Expect = 0.076
 Identities = 13/28 (46%), Positives = 16/28 (57%)

Query: 36  EEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
            EKK++EK+K K KK     K   KRR 
Sbjct: 400 AEKKEKEKEKPKVKKRHRDTKNIGKRRK 427



 Score = 32.2 bits (74), Expect = 0.12
 Identities = 10/28 (35%), Positives = 20/28 (71%)

Query: 35  EEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
            E+K+KE++K + KK+  + K  G++R+
Sbjct: 400 AEKKEKEKEKPKVKKRHRDTKNIGKRRK 427



 Score = 30.7 bits (70), Expect = 0.35
 Identities = 15/35 (42%), Positives = 20/35 (57%), Gaps = 2/35 (5%)

Query: 38  KKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIGKP 72
           KK   K+ EKK+KE+EK K  ++ R      IGK 
Sbjct: 393 KKVLAKRAEKKEKEKEKPKVKKRHRDTK--NIGKR 425



 Score = 29.5 bits (67), Expect = 0.92
 Identities = 11/44 (25%), Positives = 22/44 (50%)

Query: 13  KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKK 56
             +K  K  +       E++E+E+EK K +K+    K   +++K
Sbjct: 384 SEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRK 427



 Score = 28.4 bits (64), Expect = 2.2
 Identities = 10/42 (23%), Positives = 23/42 (54%)

Query: 16  KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
           KK  +   +      E++E+E++K + KK+ +  K   K+++
Sbjct: 386 KKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRK 427



 Score = 28.0 bits (63), Expect = 3.2
 Identities = 15/45 (33%), Positives = 21/45 (46%)

Query: 33  EEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIGKPALKPR 77
              E+K  +  KK   K+ E+K+KE EK +     R  K   K R
Sbjct: 382 APSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRR 426


>gnl|CDD|200340 TIGR03927, T7SS_EssA_Firm, type VII secretion protein EssA.
          Members of this family are associated with type VII
          secretion of WXG100 family targets in the Firmicutes,
          but not in the Actinobacteria. This highly divergent
          protein family consists largely of a central region of
          highly polar low-complexity sequence containing
          occasional LF motifs in weak repeats about 17 residues
          in length, flanked by hydrophobic N- and C-terminal
          regions [Protein fate, Protein and peptide secretion
          and trafficking].
          Length = 150

 Score = 32.0 bits (73), Expect = 0.088
 Identities = 15/48 (31%), Positives = 25/48 (52%)

Query: 17 KKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTL 64
          KK  E + +    E E ++E    EE+KK   +K +EK ++ E +  L
Sbjct: 33 KKDIEINTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEELKNQL 80


>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3.  This family
           of proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). Shr3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of Shr3, AAPs are retained in the ER.
          Length = 196

 Score = 32.0 bits (73), Expect = 0.099
 Identities = 15/46 (32%), Positives = 21/46 (45%), Gaps = 5/46 (10%)

Query: 12  YKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
           Y  RK  K++E+        EE +E      KK    +K E KKK+
Sbjct: 156 YAERKDAKQKEEFA-----AEERKEALAAAAKKSATPQKVETKKKK 196



 Score = 27.3 bits (61), Expect = 3.8
 Identities = 10/31 (32%), Positives = 16/31 (51%)

Query: 30  EEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
           ++EE   E++KE      KK    +K E +K
Sbjct: 164 QKEEFAAEERKEALAAAAKKSATPQKVETKK 194


>gnl|CDD|237799 PRK14715, PRK14715, DNA polymerase II large subunit; Provisional.
          Length = 1627

 Score = 32.5 bits (74), Expect = 0.10
 Identities = 21/60 (35%), Positives = 32/60 (53%), Gaps = 1/60 (1%)

Query: 13  KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIGKP 72
           K  K+KKEE+DEE     + EE +E+ +EE+K    +  E+   E  K+     I  G+P
Sbjct: 277 KELKEKKEEKDEEKSEEVKTEEVDEEFEEEEKGFYYELYEKVNIEANKKFIKEVIA-GRP 335



 Score = 29.0 bits (65), Expect = 1.6
 Identities = 17/54 (31%), Positives = 29/54 (53%), Gaps = 3/54 (5%)

Query: 9   VSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEE---KKKEKKKKEEEKKKEGE 59
           + + K +K++K+EE  E    EE +EE E++++    +  EK   E  KK   E
Sbjct: 276 LKELKEKKEEKDEEKSEEVKTEEVDEEFEEEEKGFYYELYEKVNIEANKKFIKE 329


>gnl|CDD|221733 pfam12720, DUF3807, Protein of unknown function (DUF3807).  This is
           a family of conserved fungal proteins of unknown
           function.
          Length = 169

 Score = 31.6 bits (72), Expect = 0.10
 Identities = 11/75 (14%), Positives = 27/75 (36%)

Query: 14  RRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIGKPA 73
            R+++ +EE E    GE +   +      +    +K+ E++    +++      R     
Sbjct: 66  LRERELKEEAEAEEEGEVDASPDAGAVAGESSADRKEAEQQGAAQKRKSCRDKERKSAKD 125

Query: 74  LKPRPTRCVEGEGAP 88
            +      V+   A 
Sbjct: 126 PRGGTQDVVDKSQAS 140



 Score = 26.2 bits (58), Expect = 7.4
 Identities = 9/54 (16%), Positives = 24/54 (44%), Gaps = 4/54 (7%)

Query: 13  KRRKKKKEEEDEEGGGGEEEEE----EEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
           +R  K++ E +EEG      +      E     ++ +++   ++ K    ++R+
Sbjct: 68  ERELKEEAEAEEEGEVDASPDAGAVAGESSADRKEAEQQGAAQKRKSCRDKERK 121


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 32.8 bits (75), Expect = 0.10
 Identities = 19/58 (32%), Positives = 33/58 (56%), Gaps = 4/58 (6%)

Query: 9   VSKYKRRKKKKEEE----DEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
           VSKYK RK++ E +    +E     E+  EE EK+ E+ +++ +K E  ++ + E R 
Sbjct: 167 VSKYKERKEEAERKLERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELRE 224



 Score = 28.9 bits (65), Expect = 1.6
 Identities = 21/55 (38%), Positives = 34/55 (61%), Gaps = 3/55 (5%)

Query: 9   VSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKE---KKKKEEEKKKEGEK 60
           ++K K  +K+ EE +EE    EEE EE +++ EE +KE    K + EE ++E E+
Sbjct: 231 LAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEE 285



 Score = 28.1 bits (63), Expect = 2.8
 Identities = 11/50 (22%), Positives = 22/50 (44%)

Query: 13  KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
           +  + K++ E  +    E E   EE ++   + E+ K+E E+K       
Sbjct: 324 RLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEE 373



 Score = 27.0 bits (60), Expect = 7.1
 Identities = 14/54 (25%), Positives = 25/54 (46%)

Query: 10  SKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
            K +  K++ EE +      E+   E E+ KEE +++     EE ++  E  R 
Sbjct: 330 EKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALRE 383



 Score = 27.0 bits (60), Expect = 7.5
 Identities = 12/61 (19%), Positives = 26/61 (42%)

Query: 1   LQNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
           L+          +R ++ KE+ +      EE E   E+ ++   + ++ KEE ++K    
Sbjct: 311 LEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSAL 370

Query: 61  R 61
            
Sbjct: 371 L 371


>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6.  RPL6
           contains KOW motif that has an extra ribosomal role as
           an oncogenic. KOW domain is known as an RNA-binding
           motif that is shared so far among some families of
           ribosomal proteins, the essential bacterial
           transcriptional elongation factor NusG, the eukaryotic
           chromatin elongation factor Spt5, the higher eukaryotic
           KIN17 proteins and Mtr4. .
          Length = 152

 Score = 31.7 bits (73), Expect = 0.11
 Identities = 13/38 (34%), Positives = 25/38 (65%)

Query: 11  KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKK 48
            Y +RKKKK+++ +EG   EE++++    +E K+ +K 
Sbjct: 79  AYFKRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKA 116



 Score = 29.4 bits (67), Expect = 0.67
 Identities = 10/35 (28%), Positives = 23/35 (65%)

Query: 16  KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKK 50
           K+KK+++ ++  G   EE++++    E++KE +K 
Sbjct: 82  KRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKA 116


>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
          Length = 1066

 Score = 32.6 bits (74), Expect = 0.11
 Identities = 17/32 (53%), Positives = 23/32 (71%), Gaps = 1/32 (3%)

Query: 31 EEEEEEEKKKEEKKKEKK-KKEEEKKKEGEKR 61
          EEE E +KKKEEK KEK+ KK +  +KE + +
Sbjct: 15 EEELERKKKKEEKAKEKELKKLKAAQKEAKAK 46



 Score = 32.6 bits (74), Expect = 0.12
 Identities = 15/31 (48%), Positives = 22/31 (70%)

Query: 32 EEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
          E E E+K   E++ E+KKK+EEK KE E ++
Sbjct: 5  ESEAEKKILTEEELERKKKKEEKAKEKELKK 35



 Score = 30.3 bits (68), Expect = 0.68
 Identities = 13/27 (48%), Positives = 20/27 (74%)

Query: 30 EEEEEEEEKKKEEKKKEKKKKEEEKKK 56
          E++   EE+ + +KKKE+K KE+E KK
Sbjct: 9  EKKILTEEELERKKKKEEKAKEKELKK 35



 Score = 29.1 bits (65), Expect = 1.7
 Identities = 15/33 (45%), Positives = 20/33 (60%), Gaps = 2/33 (6%)

Query: 30 EEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
            EEE E KKK+E+K   K+KE +K K  +K  
Sbjct: 13 LTEEELERKKKKEEK--AKEKELKKLKAAQKEA 43



 Score = 28.3 bits (63), Expect = 2.5
 Identities = 14/35 (40%), Positives = 24/35 (68%)

Query: 26 GGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
          G   E E E++   +EE +++KKK+E+ K+KE +K
Sbjct: 1  GSRTESEAEKKILTEEELERKKKKEEKAKEKELKK 35



 Score = 28.3 bits (63), Expect = 2.9
 Identities = 14/35 (40%), Positives = 22/35 (62%)

Query: 27 GGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
          G   E E E++   EE+ + KKKKEE+ K++  K+
Sbjct: 1  GSRTESEAEKKILTEEELERKKKKEEKAKEKELKK 35



 Score = 27.9 bits (62), Expect = 3.3
 Identities = 15/44 (34%), Positives = 23/44 (52%), Gaps = 9/44 (20%)

Query: 15 RKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEG 58
          RKKKKE         E+ +E+E KK +  +KE K K + ++   
Sbjct: 20 RKKKKE---------EKAKEKELKKLKAAQKEAKAKLQAQQASD 54


>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
          Length = 259

 Score = 32.0 bits (73), Expect = 0.12
 Identities = 17/41 (41%), Positives = 24/41 (58%)

Query: 20  EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
           + EDE+       E E+ KKKEE + +KKK+ EE K   +K
Sbjct: 73  KPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKK 113



 Score = 29.7 bits (67), Expect = 0.63
 Identities = 12/41 (29%), Positives = 24/41 (58%)

Query: 18  KKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEG 58
           K E+E E      E E+ ++K++ E KK+K+ +E +  ++ 
Sbjct: 73  KPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKK 113



 Score = 27.4 bits (61), Expect = 4.3
 Identities = 10/39 (25%), Positives = 22/39 (56%)

Query: 30  EEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIR 68
           + E+E+E      + ++ KKKEE + K+ ++   L  ++
Sbjct: 73  KPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQ 111



 Score = 27.0 bits (60), Expect = 5.0
 Identities = 11/41 (26%), Positives = 24/41 (58%)

Query: 16  KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKK 56
           K + E+E        E+ +++E+ + +KKKE ++ +  +KK
Sbjct: 73  KPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKK 113


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 32.2 bits (73), Expect = 0.13
 Identities = 19/62 (30%), Positives = 30/62 (48%), Gaps = 2/62 (3%)

Query: 3   NSSTTQVSKYKRRKKKKEEEDEEG--GGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
           +SS +     +R  K   E   +      E+ EE EE+K EE+    KK ++ KK +G+K
Sbjct: 289 DSSASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKK 348

Query: 61  RR 62
             
Sbjct: 349 NG 350



 Score = 31.5 bits (71), Expect = 0.20
 Identities = 12/49 (24%), Positives = 25/49 (51%)

Query: 15  RKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
           R+ K   E       E++E+ EE ++E+ ++E    ++ KK +  K + 
Sbjct: 300 REDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKK 348



 Score = 30.7 bits (69), Expect = 0.36
 Identities = 18/52 (34%), Positives = 30/52 (57%)

Query: 11  KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
           + K  K K  E D+E  G E ++  E+  +E+ KK+KKK  + KKK  + ++
Sbjct: 209 RGKDLKIKDLEGDDEDDGDESDKGGEDGDEEKSKKKKKKLAKNKKKLDDDKK 260



 Score = 30.3 bits (68), Expect = 0.55
 Identities = 11/37 (29%), Positives = 20/37 (54%)

Query: 17  KKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEE 53
           ++ EEE  E  GG  ++ ++ KK + KK    K + +
Sbjct: 321 EESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSD 357



 Score = 29.9 bits (67), Expect = 0.69
 Identities = 16/54 (29%), Positives = 33/54 (61%), Gaps = 3/54 (5%)

Query: 10  SKYKRRKKKKEEEDEEG---GGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
            K  + K  + +++++G     G E+ +EE+ KK++KK  K KK+ +  K+G++
Sbjct: 210 GKDLKIKDLEGDDEDDGDESDKGGEDGDEEKSKKKKKKLAKNKKKLDDDKKGKR 263



 Score = 29.5 bits (66), Expect = 1.1
 Identities = 12/44 (27%), Positives = 21/44 (47%)

Query: 16  KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
           ++ ++ E+ E    EEE    +K K+ KK + KK   +K     
Sbjct: 315 EQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDS 358



 Score = 28.8 bits (64), Expect = 2.0
 Identities = 15/44 (34%), Positives = 24/44 (54%)

Query: 17  KKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
           K + E+DE+    EEE+ EEE    +K K+ KK + +K    + 
Sbjct: 311 KPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKD 354



 Score = 27.6 bits (61), Expect = 3.9
 Identities = 14/48 (29%), Positives = 29/48 (60%)

Query: 15  RKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
           + K  E +DE+ G   ++  E+  +++ KKK+KK  + +KK + +K+ 
Sbjct: 214 KIKDLEGDDEDDGDESDKGGEDGDEEKSKKKKKKLAKNKKKLDDDKKG 261



 Score = 26.8 bits (59), Expect = 6.6
 Identities = 12/44 (27%), Positives = 20/44 (45%)

Query: 16  KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
           + +  EE EE    EE    ++ KK +K K KK   ++   +  
Sbjct: 316 QDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSG 359



 Score = 26.8 bits (59), Expect = 7.6
 Identities = 14/47 (29%), Positives = 24/47 (51%), Gaps = 4/47 (8%)

Query: 13  KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
           ++ +  +E E+E+     EEE    KK ++ KK K KK    K + +
Sbjct: 315 EQDEDSEESEEEK----NEEEGGLSKKGKKLKKLKGKKNGLDKDDSD 357



 Score = 26.5 bits (58), Expect = 8.8
 Identities = 11/45 (24%), Positives = 24/45 (53%)

Query: 20  EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTL 64
            EE E+    E   + E ++ E+ ++ +++K EE+    +K + L
Sbjct: 297 PEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKL 341


>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
           complex aNOP56 subunit; Provisional.
          Length = 414

 Score = 32.2 bits (74), Expect = 0.13
 Identities = 12/29 (41%), Positives = 24/29 (82%)

Query: 34  EEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
           + EEKK +++KK+KK+K++ KK++ + R+
Sbjct: 386 KREEKKPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 26.9 bits (60), Expect = 6.6
 Identities = 14/43 (32%), Positives = 27/43 (62%), Gaps = 9/43 (20%)

Query: 29  GEEEEEEEEKKKEE---------KKKEKKKKEEEKKKEGEKRR 62
           G+E +EE  K+ EE         KKK ++KK +++KK+ ++++
Sbjct: 361 GDELKEELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKK 403


>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
          Length = 158

 Score = 31.3 bits (71), Expect = 0.13
 Identities = 14/32 (43%), Positives = 24/32 (75%)

Query: 30  EEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
           E E+E++E  KE ++KE  K ++EKK++ EK+
Sbjct: 94  ESEKEQKEVSKETEEKEAIKAKKEKKEKKEKK 125



 Score = 30.1 bits (68), Expect = 0.41
 Identities = 12/30 (40%), Positives = 21/30 (70%)

Query: 31  EEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
            EE E+E+K+  K+ E+K+  + KK++ EK
Sbjct: 92  PEESEKEQKEVSKETEEKEAIKAKKEKKEK 121



 Score = 29.7 bits (67), Expect = 0.49
 Identities = 10/31 (32%), Positives = 22/31 (70%)

Query: 30  EEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
            EE E+E+K+  ++ +EK+  + +K+K+ +K
Sbjct: 92  PEESEKEQKEVSKETEEKEAIKAKKEKKEKK 122



 Score = 28.6 bits (64), Expect = 1.4
 Identities = 25/67 (37%), Positives = 37/67 (55%), Gaps = 2/67 (2%)

Query: 21  EEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEK--KKEGEKRRTLTFIRIGKPALKPRP 78
            E+ E    E  +E EEK+  + KKEKK+K+E+K  +K  +K+ T T     K A K   
Sbjct: 92  PEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTT 151

Query: 79  TRCVEGE 85
           T+  EG+
Sbjct: 152 TKKEEGK 158



 Score = 27.4 bits (61), Expect = 2.9
 Identities = 18/60 (30%), Positives = 31/60 (51%)

Query: 19  KEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIGKPALKPRP 78
           ++E+ E     EE+E  + KK++++KKEKK  E+  KK+  K    T  +  K     + 
Sbjct: 96  EKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTTKKE 155


>gnl|CDD|218010 pfam04293, SpoVR, SpoVR like protein.  Bacillus subtilis stage V
           sporulation protein R is involved in spore cortex
           formation. Little is known about cortex biosynthesis,
           except that it depends on several sigma E controlled
           genes, including spoVR.
          Length = 427

 Score = 32.2 bits (74), Expect = 0.14
 Identities = 14/55 (25%), Positives = 27/55 (49%), Gaps = 2/55 (3%)

Query: 8   QVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
            V +YKR  K   EE++     EE EE  + +  +  +    + E+ ++E  +R+
Sbjct: 165 HVDRYKRPIKLSLEEEKA--RQEEREEYLQSQVNDLWRTIPPQREKLEEERARRK 217


>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
           fragment found in eukaryotic DNA topoisomerase (topo) IB
           proteins similar to the monomeric yeast and human topo I
           and heterodimeric topo I from Leishmania donvanni. Topo
           I enzymes are divided into:  topo type IA (bacterial)
           and type IB (eukaryotic). Topo I relaxes superhelical
           tension in duplex DNA by creating a single-strand nick,
           the broken strand can then rotate around the unbroken
           strand to remove DNA supercoils and, the nick is
           religated, liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit re-ligation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  In addition to differences in
           structure and some biochemical properties,
           Trypanosomatid parasite topo I differ from human topo I
           in their sensitivity to CPTs and other classical topo I
           inhibitors. Trypanosomatid topos I play putative roles
           in organizing the kinetoplast DNA network unique to
           these parasites.  This family may represent more than
           one structural domain.
          Length = 215

 Score = 31.5 bits (72), Expect = 0.16
 Identities = 14/28 (50%), Positives = 21/28 (75%), Gaps = 3/28 (10%)

Query: 32  EEEEEEKK---KEEKKKEKKKKEEEKKK 56
           EEE+E+KK   KEEKK  K++KE+ ++ 
Sbjct: 95  EEEKEKKKAMSKEEKKAIKEEKEKLEEP 122



 Score = 31.5 bits (72), Expect = 0.19
 Identities = 16/28 (57%), Positives = 20/28 (71%)

Query: 33  EEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
           EEE+EKKK   K+EKK  +EEK+K  E 
Sbjct: 95  EEEKEKKKAMSKEEKKAIKEEKEKLEEP 122



 Score = 26.5 bits (59), Expect = 7.7
 Identities = 12/28 (42%), Positives = 21/28 (75%), Gaps = 3/28 (10%)

Query: 37  EKKKEEKK---KEKKKKEEEKKKEGEKR 61
           E++KE+KK   KE+KK  +E+K++ E+ 
Sbjct: 95  EEEKEKKKAMSKEEKKAIKEEKEKLEEP 122


>gnl|CDD|206063 pfam13892, DBINO, DNA-binding domain.  DBINO is a DNA-binding
           domain found on global transcription activator SNF2L1
           proteins and chromatin re-modelling proteins.
          Length = 140

 Score = 31.1 bits (71), Expect = 0.16
 Identities = 16/37 (43%), Positives = 25/37 (67%), Gaps = 1/37 (2%)

Query: 31  EEEEEEEKKKEEKKK-EKKKKEEEKKKEGEKRRTLTF 66
           E+EE E +K+ EK+  E+ KKEEE ++   ++R L F
Sbjct: 82  EKEERELRKRAEKEALEQAKKEEELREAKRQQRKLNF 118


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 31.7 bits (72), Expect = 0.17
 Identities = 17/47 (36%), Positives = 24/47 (51%)

Query: 13  KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
              K K E E E+    E +++ EE+ K +   E KKK  E KK+ E
Sbjct: 130 AEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAE 176



 Score = 31.3 bits (71), Expect = 0.20
 Identities = 18/52 (34%), Positives = 27/52 (51%), Gaps = 6/52 (11%)

Query: 11  KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
           K    K+K+ EE       + ++  E K K E + EKK KEE KK+  E+ +
Sbjct: 112 KQAEEKQKQAEE------AKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAK 157



 Score = 31.0 bits (70), Expect = 0.29
 Identities = 16/52 (30%), Positives = 29/52 (55%), Gaps = 3/52 (5%)

Query: 11  KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
           K K+  + K + + E    E++ +EE KK+ E++ + K   E KKK  E ++
Sbjct: 125 KAKQAAEAKAKAEAEA---EKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKK 173



 Score = 30.6 bits (69), Expect = 0.35
 Identities = 11/45 (24%), Positives = 24/45 (53%)

Query: 16  KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
             K+ E+  +    ++++ EE K K+  + + K + E +KK  E+
Sbjct: 103 AAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEE 147



 Score = 30.2 bits (68), Expect = 0.57
 Identities = 14/47 (29%), Positives = 26/47 (55%)

Query: 16  KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
           KK+ EEE +     E +++  E KK+ + + K K E + K + E+ +
Sbjct: 149 KKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAK 195



 Score = 29.8 bits (67), Expect = 0.77
 Identities = 12/47 (25%), Positives = 26/47 (55%)

Query: 16  KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
            K+ E+  ++    +++ EE + K+  + K K + E EKK + E ++
Sbjct: 104 AKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKK 150



 Score = 27.9 bits (62), Expect = 3.3
 Identities = 11/56 (19%), Positives = 30/56 (53%), Gaps = 4/56 (7%)

Query: 7   TQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
            Q +  + R+K+ E+        E+  ++ E+  ++ ++++K+ EE K K+  + +
Sbjct: 82  KQRAAEQARQKELEQRAAA----EKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAK 133



 Score = 26.7 bits (59), Expect = 6.5
 Identities = 15/51 (29%), Positives = 28/51 (54%), Gaps = 3/51 (5%)

Query: 15  RKKKKEEEDEEGGGGEEEEEEEEKKKEEKK---KEKKKKEEEKKKEGEKRR 62
           R+KK E++ EE       E+  +K+ E++    K  K+ E+  K+  EK++
Sbjct: 69  RQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQK 119



 Score = 26.7 bits (59), Expect = 7.0
 Identities = 13/41 (31%), Positives = 20/41 (48%)

Query: 16  KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKK 56
            KKK  E ++    E + + E K K + ++ K K E  K K
Sbjct: 164 AKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKAKAEAAKAK 204



 Score = 26.7 bits (59), Expect = 8.1
 Identities = 11/53 (20%), Positives = 23/53 (43%), Gaps = 3/53 (5%)

Query: 13  KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKK---EKKKKEEEKKKEGEKRR 62
             +++  E+  ++        E+  K+ E+  K   EK+K+ EE K +     
Sbjct: 80  AEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEA 132


>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein.  The proteins in this family
           are designated YL1. These proteins have been shown to be
           DNA-binding and may be a transcription factor.
          Length = 238

 Score = 31.6 bits (72), Expect = 0.19
 Identities = 20/61 (32%), Positives = 34/61 (55%), Gaps = 2/61 (3%)

Query: 20  EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEG--EKRRTLTFIRIGKPALKPR 77
           + ED+E    +EEE E+E ++EE+ K+KK+ + +  KE   +K++          A  PR
Sbjct: 60  DSEDDEPESDDEEEGEKELQREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPR 119

Query: 78  P 78
           P
Sbjct: 120 P 120


>gnl|CDD|215521 PLN02967, PLN02967, kinase.
          Length = 581

 Score = 31.6 bits (71), Expect = 0.21
 Identities = 15/50 (30%), Positives = 25/50 (50%), Gaps = 4/50 (8%)

Query: 11  KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
           K  RR ++K          + EEE+ EKK  +++K KK  E+ + +  E 
Sbjct: 112 KTPRRTRRKAA----AASSDVEEEKTEKKVRKRRKVKKMDEDVEDQGSES 157



 Score = 27.3 bits (60), Expect = 5.3
 Identities = 20/55 (36%), Positives = 28/55 (50%), Gaps = 2/55 (3%)

Query: 7   TQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEK--KKEGE 59
           T+    KRRK KK +ED E  G E E  + E+ +     E + +EE    K +GE
Sbjct: 133 TEKKVRKRRKVKKMDEDVEDQGSESEVSDVEESEFVTSLENESEEELDLEKDDGE 187



 Score = 26.9 bits (59), Expect = 6.4
 Identities = 18/71 (25%), Positives = 25/71 (35%), Gaps = 12/71 (16%)

Query: 4   SSTTQVSKYKRRKKKKEEEDEEGGGGEEE---EEEEEKKKEEKKKEKKKK---------E 51
           S        KR  KK   E  E      E    E+    KE KK  ++ +          
Sbjct: 69  SKKKPTRSVKRATKKTVVEISEPLEEGSELVVNEDAALDKESKKTPRRTRRKAAAASSDV 128

Query: 52  EEKKKEGEKRR 62
           EE+K E + R+
Sbjct: 129 EEEKTEKKVRK 139



 Score = 26.9 bits (59), Expect = 7.7
 Identities = 13/69 (18%), Positives = 20/69 (28%), Gaps = 3/69 (4%)

Query: 15  RKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRT---LTFIRIGK 71
            +KK E         +E     +KK     K   KK   +  E  +  +   +       
Sbjct: 48  SRKKIESALAVDEEPDENGAVSKKKPTRSVKRATKKTVVEISEPLEEGSELVVNEDAALD 107

Query: 72  PALKPRPTR 80
              K  P R
Sbjct: 108 KESKKTPRR 116


>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
           [Translation, ribosomal structure and biogenesis].
          Length = 109

 Score = 30.1 bits (68), Expect = 0.22
 Identities = 8/25 (32%), Positives = 17/25 (68%)

Query: 29  GEEEEEEEEKKKEEKKKEKKKKEEE 53
           G E   E ++ +EE+K+E+ ++E +
Sbjct: 76  GAEAAAEADEAEEEEKEEEAEEESD 100



 Score = 30.1 bits (68), Expect = 0.23
 Identities = 8/26 (30%), Positives = 18/26 (69%)

Query: 28  GGEEEEEEEEKKKEEKKKEKKKKEEE 53
           G E   E +E ++EEK++E +++ ++
Sbjct: 76  GAEAAAEADEAEEEEKEEEAEEESDD 101



 Score = 28.5 bits (64), Expect = 0.90
 Identities = 9/26 (34%), Positives = 12/26 (46%)

Query: 17 KKKEEEDEEGGGGEEEEEEEEKKKEE 42
               E        EEEE+EE+ +EE
Sbjct: 73 AAAGAEAAAEADEAEEEEKEEEAEEE 98



 Score = 28.1 bits (63), Expect = 1.0
 Identities = 9/32 (28%), Positives = 15/32 (46%)

Query: 26 GGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
           G         E   E  + E+++KEEE ++E
Sbjct: 67 AGAAAAAAAGAEAAAEADEAEEEEKEEEAEEE 98



 Score = 27.0 bits (60), Expect = 2.8
 Identities = 9/23 (39%), Positives = 14/23 (60%)

Query: 24 EEGGGGEEEEEEEEKKKEEKKKE 46
                E +E EEE+K+EE ++E
Sbjct: 76 GAEAAAEADEAEEEEKEEEAEEE 98



 Score = 26.6 bits (59), Expect = 3.7
 Identities = 10/27 (37%), Positives = 13/27 (48%)

Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEE 42
               E   E    EEEE+EEE ++E 
Sbjct: 73 AAAGAEAAAEADEAEEEEKEEEAEEES 99



 Score = 26.6 bits (59), Expect = 4.3
 Identities = 6/34 (17%), Positives = 17/34 (50%)

Query: 26  GGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
           G         E   + ++ +E++K+EE +++  +
Sbjct: 68  GAAAAAAAGAEAAAEADEAEEEEKEEEAEEESDD 101



 Score = 26.2 bits (58), Expect = 6.3
 Identities = 8/23 (34%), Positives = 14/23 (60%)

Query: 24 EEGGGGEEEEEEEEKKKEEKKKE 46
           E     +E EEEEK++E +++ 
Sbjct: 77 AEAAAEADEAEEEEKEEEAEEES 99



 Score = 25.8 bits (57), Expect = 7.5
 Identities = 8/27 (29%), Positives = 12/27 (44%)

Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEE 42
                 +      E EEEE+E++ EE
Sbjct: 71 AAAAAGAEAAAEADEAEEEEKEEEAEE 97


>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein.  CDC45 is an essential gene
           required for initiation of DNA replication in S.
           cerevisiae, forming a complex with MCM5/CDC46.
           Homologues of CDC45 have been identified in human, mouse
           and smut fungus among others.
          Length = 583

 Score = 31.5 bits (72), Expect = 0.22
 Identities = 12/50 (24%), Positives = 28/50 (56%), Gaps = 3/50 (6%)

Query: 20  EEEDEEGGGGEEEEEEEEKKKEE---KKKEKKKKEEEKKKEGEKRRTLTF 66
           +EEDEE    E++E++++   ++    ++   ++   +++  EKR  L F
Sbjct: 134 DEEDEESSKSEDDEDDDDDDDDDDIATRERSLERRRRRREWEEKRAELEF 183



 Score = 30.3 bits (69), Expect = 0.47
 Identities = 9/42 (21%), Positives = 27/42 (64%)

Query: 19  KEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
           K E+DE+    +++++   +++  +++ ++++ EEK+ E E 
Sbjct: 142 KSEDDEDDDDDDDDDDIATRERSLERRRRRREWEEKRAELEF 183



 Score = 27.6 bits (62), Expect = 3.6
 Identities = 9/48 (18%), Positives = 24/48 (50%)

Query: 15  RKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
           R  +++++D+E    E+EE  + +  E+   +    +   ++   +RR
Sbjct: 121 RDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATRERSLERR 168


>gnl|CDD|212055 cd11486, SLC5sbd_SGLT1, Na(+)/glucose cotransporter SGLT1;solute
           binding domain.  Human SGLT1 (hSGLT1) is a
           high-affinity/low-capacity glucose transporter, which
           can also transport galactose. In the transport
           mechanism, two Na+ ions first bind to the extracellular
           side of the transporter and induce a conformational
           change in the glucose binding site. This results in an
           increased affinity for glucose. A second conformational
           change in the transporter follows, bringing the Na+ and
           glucose binding sites to the inner surface of the
           membrane. Glucose is then released, followed by the Na+
           ions. In the process, hSGLT1 is also able to transport
           water and urea and may be a major pathway for transport
           of these across the intestinal brush-border membrane.
           hSGLT1 is encoded by the SLC5A1 gene and expressed
           mostly in the intestine, but also in the trachea,
           kidney, heart, brain, testis, and prostate. The
           WHO/UNICEF oral rehydration solution (ORS) for the
           treatment of secretory diarrhea contains salt and
           glucose. The glucose, along with sodium ions, is
           transported by hSGLT1 and water is either co-transported
           along with these or follows by osmosis. Mutations in
           SGLT1 are associated with intestinal glucose galactose
           malabsorption (GGM). Up-regulation of intestinal SGLT1
           may protect against enteric infections. SGLT1 is
           expressed in colorectal, head and neck, and prostate
           tumors. Epidermal growth factor receptor (EGFR)
           functions in cell survival by stabilizing SGLT1, and
           thereby maintaining intracellular glucose levels. SGLT1
           is predicted to have 14 membrane-spanning regions. This
           subgroup belongs to the solute carrier 5
           (SLC5)transporter family.
          Length = 635

 Score = 31.4 bits (71), Expect = 0.23
 Identities = 9/33 (27%), Positives = 18/33 (54%)

Query: 15  RKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEK 47
           R   +E  D +     E+E+E E + +E++K+ 
Sbjct: 538 RNSTEERIDLDADDWTEDEDENEMETDEERKKP 570



 Score = 28.7 bits (64), Expect = 2.0
 Identities = 14/58 (24%), Positives = 22/58 (37%), Gaps = 13/58 (22%)

Query: 10  SKYKRRKKKKEEEDEEGGGGEEEEEEEEKKK-------------EEKKKEKKKKEEEK 54
           S  +R     ++  E+    E E +EE KK              ++ K  K  +EEE 
Sbjct: 540 STEERIDLDADDWTEDEDENEMETDEERKKPGCCRKAYNWFCGFDQGKAPKLTEEEEA 597


>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
           eukaryotic snRNP [Transcription].
          Length = 564

 Score = 31.6 bits (72), Expect = 0.23
 Identities = 18/51 (35%), Positives = 25/51 (49%), Gaps = 3/51 (5%)

Query: 12  YKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
           Y    K   E+ E   G +  EE  E+  EE   + KKK+ +KK+  EK R
Sbjct: 395 YYESAKTALEKAE---GKKAIEEIREELIEEGLLKSKKKKRKKKEWFEKFR 442



 Score = 28.9 bits (65), Expect = 1.5
 Identities = 14/47 (29%), Positives = 25/47 (53%)

Query: 10  SKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKK 56
           + Y+  K   E+ + +    E  EE  E+   + KK+K+KK+E  +K
Sbjct: 394 AYYESAKTALEKAEGKKAIEEIREELIEEGLLKSKKKKRKKKEWFEK 440



 Score = 26.6 bits (59), Expect = 9.1
 Identities = 19/82 (23%), Positives = 29/82 (35%), Gaps = 7/82 (8%)

Query: 10  SKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKK-----EEEKKKEGEKRRTL 64
            KYK+ K  K   D +    +E     E  K   +K + KK      EE  +EG  +   
Sbjct: 370 KKYKKLKGAKVNLDRQLSELKEAIAYYESAKTALEKAEGKKAIEEIREELIEEGLLKSKK 429

Query: 65  TFIRIGKPALKPRPTRCVEGEG 86
              +  K     +    V  +G
Sbjct: 430 KKRK--KKEWFEKFRWFVSSDG 449


>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota).  DNA
           Topoisomerase I (eukaryota), DNA topoisomerase V,
           Vaccina virus topoisomerase, Variola virus
           topoisomerase, Shope fibroma virus topoisomeras.
          Length = 391

 Score = 31.2 bits (71), Expect = 0.24
 Identities = 23/55 (41%), Positives = 37/55 (67%)

Query: 9   VSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
           +S  KR+ K K E D E    E +E+++EKKKEEKKK++ ++ EE+ ++ E + T
Sbjct: 306 ISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEERIEKLEVQAT 360



 Score = 28.5 bits (64), Expect = 2.3
 Identities = 15/53 (28%), Positives = 28/53 (52%)

Query: 13  KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLT 65
           KR KK     +       + + + E+  E+   E K+K++EKKKE +K++ + 
Sbjct: 294 KRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIE 346



 Score = 26.9 bits (60), Expect = 6.7
 Identities = 13/45 (28%), Positives = 26/45 (57%)

Query: 16  KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
            +   +   +     E + E+   + ++KK++KKKEE+KKK+ E+
Sbjct: 303 FEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIER 347


>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34.  This family represents
           herpes virus protein U79 and cytomegalovirus early
           phosphoprotein P34 (UL112).
          Length = 238

 Score = 31.0 bits (70), Expect = 0.24
 Identities = 12/46 (26%), Positives = 31/46 (67%)

Query: 17  KKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
           +KK +++     G+++E+   +  ++ K++++KK+EEK++  E +R
Sbjct: 152 RKKSDDEHRKRSGKQKEKRRVEDSQKHKEDRRKKQEEKRRNDEDKR 197



 Score = 29.5 bits (66), Expect = 0.90
 Identities = 15/54 (27%), Positives = 31/54 (57%), Gaps = 2/54 (3%)

Query: 10  SKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKE--KKKKEEEKKKEGEKR 61
            K  R   +K+ +DE      +++E+   +  +K KE  +KK+EE+++ + +KR
Sbjct: 144 EKECRALSRKKSDDEHRKRSGKQKEKRRVEDSQKHKEDRRKKQEEKRRNDEDKR 197



 Score = 29.1 bits (65), Expect = 1.2
 Identities = 15/60 (25%), Positives = 27/60 (45%), Gaps = 7/60 (11%)

Query: 10  SKYKRRKKKKEEE---DEEGGGGEEEEEEE----EKKKEEKKKEKKKKEEEKKKEGEKRR 62
              + R+KK+EE+   DE+   G              K+E  KEK++K  + ++  E + 
Sbjct: 177 KHKEDRRKKQEEKRRNDEDKRPGGGGGSSGGQSGLSTKDEPPKEKRQKHHDPERRLEPQS 236


>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 926

 Score = 31.5 bits (71), Expect = 0.25
 Identities = 22/107 (20%), Positives = 38/107 (35%), Gaps = 12/107 (11%)

Query: 11  KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEK---------KKEGEKR 61
           K   R K+K +E  +G   +E + E    + E ++   K+ EEK          K+ +  
Sbjct: 792 KAAARAKQKPKE--KGPNDKEIKIESPSVETEGERCTIKQREEKGIDAPAILNVKKKKPY 849

Query: 62  RTLTFIRIGKPALKPRPTRCVEGEGAPTRTLSVQIPLHRVHEQGVPP 108
           +     RI  P      +   +    P R     + + R  E   P 
Sbjct: 850 KVDNMTRI-LPQQSRYISFIKDDRFVPVRKFKGGVVVLRDREPKEPV 895



 Score = 30.7 bits (69), Expect = 0.46
 Identities = 11/52 (21%), Positives = 25/52 (48%), Gaps = 5/52 (9%)

Query: 6   TTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
           +T +    R K+K +E+     G  ++E + E    E + E+   ++ ++K 
Sbjct: 788 STTIKAAARAKQKPKEK-----GPNDKEIKIESPSVETEGERCTIKQREEKG 834


>gnl|CDD|236498 PRK09401, PRK09401, reverse gyrase; Reviewed.
          Length = 1176

 Score = 31.4 bits (72), Expect = 0.26
 Identities = 18/50 (36%), Positives = 25/50 (50%)

Query: 11  KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
           +YK+ K        E GGG E E E  +K+E +K      E EK +E E+
Sbjct: 828 EYKKSKGYVLVIKLENGGGLELEGEFSEKEEAEKFYNNLIEVEKVEEKEE 877



 Score = 29.1 bits (66), Expect = 1.5
 Identities = 22/56 (39%), Positives = 28/56 (50%), Gaps = 7/56 (12%)

Query: 9   VSKYKRRKKKKEE----EDEEGGGGEEEEEEEEKKKEEKKKEKKK---KEEEKKKE 57
           V +YK  KK K      + E GGG E E E  EK++ EK         K EEK++E
Sbjct: 823 VERYKEYKKSKGYVLVIKLENGGGLELEGEFSEKEEAEKFYNNLIEVEKVEEKEEE 878


>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
          Length = 211

 Score = 30.9 bits (70), Expect = 0.26
 Identities = 16/46 (34%), Positives = 25/46 (54%)

Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
            K+ E  EE  G EE+ E EE +KEE  ++ ++  E K +E +  
Sbjct: 17 CCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDE 62



 Score = 30.5 bits (69), Expect = 0.31
 Identities = 16/45 (35%), Positives = 26/45 (57%)

Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
          K  K E  EE    E E +EE+K KEE  + ++ ++EE  ++ E+
Sbjct: 6  KDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEE 50



 Score = 29.7 bits (67), Expect = 0.58
 Identities = 17/39 (43%), Positives = 21/39 (53%)

Query: 18 KKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKK 56
           + EE E+    E+ EE  E K EE K E  K +EE KK
Sbjct: 34 LEFEEIEKEEIIEDSEESNEVKIEELKDENNKLKEENKK 72



 Score = 29.7 bits (67), Expect = 0.64
 Identities = 17/40 (42%), Positives = 22/40 (55%)

Query: 20 EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
          EE ++E    + EE  E K +E K +  K KEE KK E E
Sbjct: 37 EEIEKEEIIEDSEESNEVKIEELKDENNKLKEENKKLENE 76



 Score = 29.0 bits (65), Expect = 1.2
 Identities = 15/50 (30%), Positives = 25/50 (50%)

Query: 17 KKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTF 66
            KE E++E   G+EE+ E E+ ++E+  E  ++  E K E  K      
Sbjct: 17 CCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDENNKL 66



 Score = 28.6 bits (64), Expect = 1.5
 Identities = 13/50 (26%), Positives = 27/50 (54%)

Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
          + + +  K E  +E+     E +EE++ K+E+ + E+ +KEE  +   E 
Sbjct: 2  EKECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEES 51


>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin.  Nucleoplasmins are also
           known as chromatin decondensation proteins. They bind to
           core histones and transfer DNA to them in a reaction
           that requires ATP. This is thought to play a role in the
           assembly of regular nucleosomal arrays.
          Length = 146

 Score = 30.4 bits (69), Expect = 0.28
 Identities = 13/30 (43%), Positives = 21/30 (70%)

Query: 20  EEEDEEGGGGEEEEEEEEKKKEEKKKEKKK 49
           +EEDEE    EE+++E+E ++EE   +K K
Sbjct: 117 DEEDEEEEDDEEDDDEDESEEEESPVKKVK 146



 Score = 29.6 bits (67), Expect = 0.46
 Identities = 12/31 (38%), Positives = 22/31 (70%), Gaps = 3/31 (9%)

Query: 20  EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKK 50
           E+E+EE    +EE+++E++ +EE+   KK K
Sbjct: 119 EDEEEED---DEEDDDEDESEEEESPVKKVK 146


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
           Rpc31.  RNA polymerase III contains seventeen subunits
           in yeasts and in human cells. Twelve of these are akin
           to RNA polymerase I or II and the other five are RNA pol
           III-specific, and form the functionally distinct groups
           (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
           Rpc34 and Rpc82 form a cluster of enzyme-specific
           subunits that contribute to transcription initiation in
           S.cerevisiae and H.sapiens. There is evidence that these
           subunits are anchored at or near the N-terminal Zn-fold
           of Rpc1, itself prolonged by a highly conserved but RNA
           polymerase III-specific domain.
          Length = 221

 Score = 30.9 bits (70), Expect = 0.29
 Identities = 18/55 (32%), Positives = 33/55 (60%), Gaps = 2/55 (3%)

Query: 7   TQVSKYKRRK--KKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
             +SK+KR+     +EEED +      E++ +E + E+  +E +K EEE+++E E
Sbjct: 130 LALSKFKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEE 184



 Score = 27.0 bits (60), Expect = 5.0
 Identities = 13/38 (34%), Positives = 26/38 (68%)

Query: 16  KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEE 53
           KK KE E E+    +E++EEEE+++EE+ ++    +++
Sbjct: 158 KKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDD 195


>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97
          family.  This model family represents the major capsid
          protein component of the heads (capsids) of
          bacteriophage HK97, phi-105, P27, and related phage.
          This model represents one of several analogous families
          lacking detectable sequence similarity. The gene
          encoding this component is typically located in an
          operon encoding the small and large terminase subunits,
          the portal protein and the prohead or maturation
          protease [Mobile and extrachromosomal element
          functions, Prophage functions].
          Length = 384

 Score = 31.2 bits (71), Expect = 0.29
 Identities = 10/70 (14%), Positives = 22/70 (31%)

Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIG 70
            +   +  +E  +     +  E EEEK + +  KE+  K + +    E+       +  
Sbjct: 4  LREALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPA 63

Query: 71 KPALKPRPTR 80
                    
Sbjct: 64 ASGEGGGGEE 73


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 30.8 bits (70), Expect = 0.32
 Identities = 17/49 (34%), Positives = 26/49 (53%), Gaps = 8/49 (16%)

Query: 20  EEEDEEGGGGEEEEEE--------EEKKKEEKKKEKKKKEEEKKKEGEK 60
           E +DE    G E E E        + K K ++ KEK++KE E++ + EK
Sbjct: 252 ESDDESAWEGFESEYEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEK 300



 Score = 29.3 bits (66), Expect = 1.2
 Identities = 9/44 (20%), Positives = 21/44 (47%)

Query: 19  KEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
            EE D++G    ++E   E  + E +   K    ++K + ++ +
Sbjct: 242 LEESDDDGEEESDDESAWEGFESEYEPINKPVRPKRKTKAQRNK 285



 Score = 28.5 bits (64), Expect = 1.8
 Identities = 10/43 (23%), Positives = 21/43 (48%)

Query: 20  EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
           +E   EG   E E   +  + + K K ++ KE+ +K+   + +
Sbjct: 255 DESAWEGFESEYEPINKPVRPKRKTKAQRNKEKRRKELEREAK 297



 Score = 26.6 bits (59), Expect = 9.5
 Identities = 11/43 (25%), Positives = 20/43 (46%)

Query: 20  EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
           EEE ++    E  E E E   +  + ++K K +  K++  K  
Sbjct: 250 EEESDDESAWEGFESEYEPINKPVRPKRKTKAQRNKEKRRKEL 292


>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
           binding fragment.  Topoisomerase I promotes the
           relaxation of DNA superhelical tension by introducing a
           transient single-stranded break in duplex DNA and are
           vital for the processes of replication, transcription,
           and recombination. This family may be more than one
           structural domain.
          Length = 215

 Score = 30.6 bits (70), Expect = 0.32
 Identities = 12/28 (42%), Positives = 20/28 (71%), Gaps = 3/28 (10%)

Query: 32  EEEEEEKK---KEEKKKEKKKKEEEKKK 56
           E E+E+KK   KEEKK  K++K++ ++ 
Sbjct: 96  EAEKEKKKAMSKEEKKAIKEEKDKLEEP 123



 Score = 30.6 bits (70), Expect = 0.35
 Identities = 15/28 (53%), Positives = 18/28 (64%)

Query: 33  EEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
           E E+EKKK   K+EKK  +EEK K  E 
Sbjct: 96  EAEKEKKKAMSKEEKKAIKEEKDKLEEP 123


>gnl|CDD|237028 PRK12267, PRK12267, methionyl-tRNA synthetase; Reviewed.
          Length = 648

 Score = 30.9 bits (71), Expect = 0.33
 Identities = 10/47 (21%), Positives = 23/47 (48%), Gaps = 7/47 (14%)

Query: 32  EEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTF-------IRIGK 71
           + EEE    +E+ +    KE E+K++  ++  +T        +R+ +
Sbjct: 511 DVEEEIAYIKEQMEGSAPKEPEEKEKKPEKPEITIDDFDKVELRVAE 557



 Score = 29.4 bits (67), Expect = 1.1
 Identities = 9/31 (29%), Positives = 17/31 (54%)

Query: 30  EEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
           + EEE    K++ +    K+ EE++KK  + 
Sbjct: 511 DVEEEIAYIKEQMEGSAPKEPEEKEKKPEKP 541



 Score = 28.2 bits (64), Expect = 3.0
 Identities = 7/30 (23%), Positives = 18/30 (60%)

Query: 30  EEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
           EEE    +++ E    ++ +++E+K ++ E
Sbjct: 513 EEEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542


>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
          Length = 619

 Score = 30.9 bits (71), Expect = 0.34
 Identities = 10/33 (30%), Positives = 15/33 (45%)

Query: 20  EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEE 52
            E +E     +EEEEE+E        E +  E+
Sbjct: 187 SELEELDDDEDEEEEEDENDDSLAADESELPEK 219



 Score = 27.5 bits (62), Expect = 4.4
 Identities = 6/38 (15%), Positives = 16/38 (42%)

Query: 20  EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
                      +++E+EE++++E        E E  ++
Sbjct: 182 PAHVGSELEELDDDEDEEEEEDENDDSLAADESELPEK 219



 Score = 27.1 bits (61), Expect = 5.4
 Identities = 8/51 (15%), Positives = 23/51 (45%)

Query: 18  KKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIR 68
            +E+    G   EE +++E++++EE + +     +E +   +       + 
Sbjct: 178 AEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADESELPEKVLEKFKALA 228


>gnl|CDD|220223 pfam09405, Btz, CASC3/Barentsz eIF4AIII binding.  This domain is
          found on CASC3 (cancer susceptibility candidate gene 3
          protein) which is also known as Barentsz (Btz). CASC3
          is a component of the EJC (exon junction complex) which
          is a complex that is involved in post-transcriptional
          regulation of mRNA in metazoa. The complex is formed by
          the association of four proteins (eIF4AIII, Barentsz,
          Mago, and Y14), mRNA, and ATP. This domain wraps around
          eIF4AIII and stacks against the 5' nucleotide.
          Length = 116

 Score = 29.7 bits (67), Expect = 0.34
 Identities = 12/40 (30%), Positives = 21/40 (52%)

Query: 18 KKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
          K E E + G     E  E ++ KE K++E +K ++E  + 
Sbjct: 1  KVESERQSGRTPSAEPTEPKEDKERKRREHEKYDDEDDEN 40


>gnl|CDD|237869 PRK14962, PRK14962, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 472

 Score = 30.9 bits (70), Expect = 0.34
 Identities = 13/51 (25%), Positives = 28/51 (54%)

Query: 5   STTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKK 55
           +T   S   +    +E+ D      +E+++EE K KEEK+++ + ++  K+
Sbjct: 331 ATRFSSPNVQENDVEEKNDNSNVQQKEKKKEESKAKEEKQEDIEFEKRFKE 381



 Score = 29.0 bits (65), Expect = 1.5
 Identities = 13/50 (26%), Positives = 30/50 (60%)

Query: 7   TQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKK 56
           T+ S    ++   EE+++     ++E+++EE K +E+K+E  + E+  K+
Sbjct: 332 TRFSSPNVQENDVEEKNDNSNVQQKEKKKEESKAKEEKQEDIEFEKRFKE 381


>gnl|CDD|218538 pfam05285, SDA1, SDA1.  This family consists of several SDA1
           protein homologues. SDA1 is a Saccharomyces cerevisiae
           protein which is involved in the control of the actin
           cytoskeleton. The protein is essential for cell
           viability and is localised in the nucleus.
          Length = 317

 Score = 30.8 bits (70), Expect = 0.34
 Identities = 17/54 (31%), Positives = 32/54 (59%)

Query: 20  EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIGKPA 73
           E+E+E+    ++ +E+ +++  E+ +E+  +EEE + E EK   L   RI  PA
Sbjct: 133 EDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEKASELATTRILTPA 186



 Score = 28.5 bits (64), Expect = 2.1
 Identities = 12/41 (29%), Positives = 23/41 (56%)

Query: 20  EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
           +E +      EEE++E  KK +E   E+  +E+E++   E+
Sbjct: 125 KEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEE 165



 Score = 28.5 bits (64), Expect = 2.2
 Identities = 8/42 (19%), Positives = 25/42 (59%)

Query: 20  EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
            E++EE     ++ +E+  ++  ++ E++  EEE+ +  +++
Sbjct: 132 SEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEK 173



 Score = 28.5 bits (64), Expect = 2.2
 Identities = 12/36 (33%), Positives = 22/36 (61%)

Query: 13  KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKK 48
           K    KK +ED +    EE+EEE  +++E + +++K
Sbjct: 138 KDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEK 173



 Score = 27.7 bits (62), Expect = 3.1
 Identities = 12/37 (32%), Positives = 22/37 (59%)

Query: 16  KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEE 52
           +  K+ +++      EE+EEE  ++EE + EK+K  E
Sbjct: 140 EAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEKASE 176



 Score = 27.7 bits (62), Expect = 3.5
 Identities = 13/40 (32%), Positives = 22/40 (55%)

Query: 21  EEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
           E D+E    + E+EEE+ +  +K KE   +E  ++ E E 
Sbjct: 122 ESDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEA 161



 Score = 27.3 bits (61), Expect = 4.5
 Identities = 13/44 (29%), Positives = 24/44 (54%), Gaps = 3/44 (6%)

Query: 19  KEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
           KE E  +    E+EEE++E  K+ K+   ++  EE ++E  +  
Sbjct: 125 KEIESSDS---EDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEE 165


>gnl|CDD|221250 pfam11831, Myb_Cef, pre-mRNA splicing factor component.  This
           family is a region of the Myb-Related Cdc5p/Cef1
           proteins, in fungi, and is part of the pre-mRNA splicing
           factor complex.
          Length = 363

 Score = 30.8 bits (70), Expect = 0.35
 Identities = 18/44 (40%), Positives = 27/44 (61%), Gaps = 3/44 (6%)

Query: 20  EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
           EEE+EE    EE EEE E+   ++   K+  EE K++E  +RR+
Sbjct: 151 EEEEEEP---EEMEEELEEDAADRDARKRAAEEAKEQEELRRRS 191


>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
           (TAF4) is one of several TAFs that bind TBP and is
           involved in forming Transcription Factor IID (TFIID)
           complex.  The TATA Binding Protein (TBP) Associated
           Factor 4 (TAF4) is one of several TAFs that bind TBP and
           are involved in forming the Transcription Factor IID
           (TFIID) complex. TFIID is one of seven General
           Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
           TFIIF, and TFIID) that are involved in accurate
           initiation of transcription by RNA polymerase II in
           eukaryote. TFIID plays an important role in the
           recognition of promoter DNA and assembly of the
           pre-initiation complex. TFIID complex is composed of the
           TBP and at least 13 TAFs. TAFs from various species were
           originally named by their predicted molecular weight or
           their electrophoretic mobility in polyacrylamide gels. A
           new, unified nomenclature for the pol II TAFs has been
           suggested to show the relationship between TAF orthologs
           and paralogs. Several hypotheses are proposed for TAFs
           functions such as serving as activator-binding sites,
           core-promoter recognition or a role in essential
           catalytic activity. Each TAF, with the help of a
           specific activator, is required only for the expression
           of subset of genes and is not universally involved for
           transcription as are GTFs. In yeast and human cells,
           TAFs have been found as components of other complexes
           besides TFIID.   Several TAFs interact via histone-fold
           (HFD) motifs; HFD is the interaction motif involved in
           heterodimerization of the core histones and their
           assembly into nucleosome octamers. The minimal HFD
           contains three alpha-helices linked by two loops and is
           found in core histones, TAFS and many other
           transcription factors. TFIID has a histone octamer-like
           substructure. TAF4 domain interacts with TAF12 and makes
           a novel histone-like heterodimer that binds DNA and has
           a core promoter function of a subset of genes.
          Length = 212

 Score = 30.4 bits (69), Expect = 0.36
 Identities = 11/44 (25%), Positives = 27/44 (61%)

Query: 13  KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKK 56
           +  ++K++EE+ E      +   E+ + ++K KE +K+E+E+ +
Sbjct: 124 REEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEEMR 167



 Score = 30.0 bits (68), Expect = 0.50
 Identities = 20/61 (32%), Positives = 33/61 (54%), Gaps = 10/61 (16%)

Query: 9   VSKYKRRKKKKEEEDEEGGGGEEEEEEEE------KKKEEKKKEKKKKEEEKKKEGEKRR 62
           V K  R  ++ E E+EE    + +EEE E      K + E+ + K+K +E +K+E E+ R
Sbjct: 112 VRKQLRFLEQLEREEEE----KRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEEMR 167

Query: 63  T 63
            
Sbjct: 168 H 168



 Score = 26.5 bits (59), Expect = 7.6
 Identities = 12/50 (24%), Positives = 25/50 (50%), Gaps = 4/50 (8%)

Query: 15  RKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTL 64
           RK+ +  E  E     EEEE+ ++++ E+     K   E+ +  +K + +
Sbjct: 113 RKQLRFLEQLE----REEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEM 158


>gnl|CDD|215565 PLN03083, PLN03083, E3 UFM1-protein ligase 1 homolog; Provisional.
          Length = 803

 Score = 30.9 bits (70), Expect = 0.39
 Identities = 18/60 (30%), Positives = 30/60 (50%), Gaps = 2/60 (3%)

Query: 4   SSTTQVSKYKRRKKKKEEEDEEGGGGEEE--EEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
           SS    +  K  KKKK +     GG  E   ++EE+  K+ KK +KK +++  K   + +
Sbjct: 409 SSPAASNSDKGSKKKKGKSTSTKGGTAESIPDDEEDAPKKGKKNQKKGRDKSSKVPSDSK 468


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 30.5 bits (70), Expect = 0.39
 Identities = 18/52 (34%), Positives = 26/52 (50%)

Query: 9  VSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
          V K     K KE E+E     EE ++E E  K+E   E K++  + + E EK
Sbjct: 24 VRKKIAEAKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEK 75



 Score = 28.6 bits (65), Expect = 2.1
 Identities = 12/47 (25%), Positives = 29/47 (61%)

Query: 8   QVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEK 54
            + +     +K+EEE E+     E++++E +KKEE+ +E  +++ ++
Sbjct: 97  NLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQE 143



 Score = 27.8 bits (63), Expect = 3.4
 Identities = 14/64 (21%), Positives = 31/64 (48%), Gaps = 4/64 (6%)

Query: 1  LQNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
           +  +  ++ + +   K+  EE ++    E E  ++E   E K++  K + E +K+  E+
Sbjct: 25 RKKIAEAKIKEAEEEAKRILEEAKK----EAEAIKKEALLEAKEEIHKLRNEFEKELRER 80

Query: 61 RRTL 64
          R  L
Sbjct: 81 RNEL 84


>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355).  This
          family of proteins is found in bacteria and viruses.
          Proteins in this family are typically between 180 and
          214 amino acids in length.
          Length = 125

 Score = 29.5 bits (67), Expect = 0.39
 Identities = 14/51 (27%), Positives = 27/51 (52%)

Query: 7  TQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
            ++K K + +KK+EE +       +   EEK + E +K +K+ EE + + 
Sbjct: 15 KAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEELEAEL 65


>gnl|CDD|219355 pfam07267, Nucleo_P87, Nucleopolyhedrovirus capsid protein P87.
           This family consists of several Nucleopolyhedrovirus
           capsid protein P87 sequences. P87 is expressed late in
           infection and concentrated in infected cell nuclei.
          Length = 606

 Score = 30.6 bits (69), Expect = 0.42
 Identities = 12/53 (22%), Positives = 31/53 (58%)

Query: 2   QNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEK 54
            +S  +     ++RK+++     E    E+E++ +E + + +K+ K+++EE+K
Sbjct: 345 SDSEASDSGPTRKRKRRRVPPLPEYSSDEDEDDSDEDEVDYEKERKRRREEDK 397



 Score = 27.5 bits (61), Expect = 4.9
 Identities = 12/52 (23%), Positives = 28/52 (53%)

Query: 4   SSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKK 55
            S    S   R++K++          +E+E++ ++ + + +KE+K++ EE K
Sbjct: 346 DSEASDSGPTRKRKRRRVPPLPEYSSDEDEDDSDEDEVDYEKERKRRREEDK 397


>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
          Length = 135

 Score = 29.7 bits (67), Expect = 0.45
 Identities = 12/28 (42%), Positives = 21/28 (75%)

Query: 38  KKKEEKKKEKKKKEEEKKKEGEKRRTLT 65
           K+K ++KK+KKKK+++KKK+  K+    
Sbjct: 106 KEKAKQKKQKKKKKKKKKKKTSKKAAKK 133



 Score = 29.3 bits (66), Expect = 0.62
 Identities = 12/25 (48%), Positives = 21/25 (84%)

Query: 32  EEEEEEKKKEEKKKEKKKKEEEKKK 56
           +E+ ++KK+++KKK+KKKK+  KK 
Sbjct: 106 KEKAKQKKQKKKKKKKKKKKTSKKA 130



 Score = 28.9 bits (65), Expect = 0.77
 Identities = 12/30 (40%), Positives = 24/30 (80%)

Query: 31  EEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
           +E+ +++K+K++KKK+KKKK  +K  + +K
Sbjct: 106 KEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135



 Score = 28.1 bits (63), Expect = 1.6
 Identities = 13/29 (44%), Positives = 24/29 (82%)

Query: 33  EEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
           +E+ ++KK++KKK+KKKK++  KK  +K+
Sbjct: 106 KEKAKQKKQKKKKKKKKKKKTSKKAAKKK 134



 Score = 27.0 bits (60), Expect = 4.0
 Identities = 11/28 (39%), Positives = 21/28 (75%)

Query: 36  EEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
           +EK K++K+K+KKKK+++KK   +  + 
Sbjct: 106 KEKAKQKKQKKKKKKKKKKKTSKKAAKK 133



 Score = 26.2 bits (58), Expect = 8.0
 Identities = 12/38 (31%), Positives = 26/38 (68%), Gaps = 4/38 (10%)

Query: 13  KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKK 50
           K+  K+K ++ ++    ++++++++KKK  KK  KKKK
Sbjct: 102 KQVLKEKAKQKKQ----KKKKKKKKKKKTSKKAAKKKK 135



 Score = 25.8 bits (57), Expect = 8.9
 Identities = 11/27 (40%), Positives = 22/27 (81%)

Query: 30  EEEEEEEEKKKEEKKKEKKKKEEEKKK 56
            +++++++KKK++KKK+  KK  +KKK
Sbjct: 109 AKQKKQKKKKKKKKKKKTSKKAAKKKK 135


>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
          (DUF2058).  This domain, found in various prokaryotic
          proteins, has no known function.
          Length = 177

 Score = 29.9 bits (68), Expect = 0.46
 Identities = 12/54 (22%), Positives = 31/54 (57%), Gaps = 4/54 (7%)

Query: 8  QVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEK----KKKEEEKKKE 57
          +  K KR+++K+  +  + G  E ++  EE K E+ ++++    +++ E ++K 
Sbjct: 19 KAKKEKRKQRKQARKGADDGDDELKQAAEEAKAEKAERDRELNRQRQAEAEQKA 72


>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
            biogenesis [Translation, ribosomal structure and
            biogenesis].
          Length = 1077

 Score = 30.5 bits (68), Expect = 0.47
 Identities = 16/50 (32%), Positives = 28/50 (56%)

Query: 13   KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
            K R K +EE++        +EEE  KK++E+++  +K   +  KE  K+R
Sbjct: 1023 KERIKDQEEKERMESLQRAKEEEIGKKEKEREQRIRKTIHDNYKEMAKKR 1072



 Score = 28.6 bits (63), Expect = 2.4
 Identities = 14/46 (30%), Positives = 25/46 (54%)

Query: 30   EEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIGKPALK 75
            +EE+E  E  +  K++E  KKE+E+++   K     +  + K  LK
Sbjct: 1029 QEEKERMESLQRAKEEEIGKKEKEREQRIRKTIHDNYKEMAKKRLK 1074


>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19. 
           Med19 represents a family of conserved proteins which
           are members of the multi-protein co-activator Mediator
           complex. Mediator is required for activation of RNA
           polymerase II transcription by DNA binding
           transactivators.
          Length = 178

 Score = 29.8 bits (67), Expect = 0.50
 Identities = 18/52 (34%), Positives = 32/52 (61%)

Query: 11  KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
           K++ +    EE   +  G +  E++ +KKK E  KE+KKK++EKKK+ ++  
Sbjct: 119 KHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKKRHS 170



 Score = 29.0 bits (65), Expect = 0.89
 Identities = 15/56 (26%), Positives = 35/56 (62%)

Query: 1   LQNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKK 56
            ++    +  + +    ++   D EG  G E++ +++K +++K+++KKKKE++KKK
Sbjct: 111 KKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKK 166



 Score = 26.7 bits (59), Expect = 6.5
 Identities = 19/46 (41%), Positives = 26/46 (56%), Gaps = 10/46 (21%)

Query: 13  KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEG 58
           K+ KKKK E+D+E           +KKK+EKKK+KK+   E    G
Sbjct: 142 KKHKKKKHEDDKE----------RKKKKKEKKKKKKRHSPEHPGVG 177



 Score = 26.3 bits (58), Expect = 7.7
 Identities = 17/56 (30%), Positives = 30/56 (53%)

Query: 6   TTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
             Q  K K + K K+   ++    E   + E  K  EKK +KKK E++K+++ +K+
Sbjct: 105 HIQPPKKKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKK 160


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
           Fas-binding protein) is thought to play a role in
           apoptosis, but precise role played by Daxx remains to be
           determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 30.3 bits (68), Expect = 0.52
 Identities = 18/44 (40%), Positives = 33/44 (75%)

Query: 16  KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
              +E E+EE    EEEEEEEE+++E++ +E++ ++EE+++E E
Sbjct: 434 MASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVE 477



 Score = 26.8 bits (59), Expect = 8.2
 Identities = 18/42 (42%), Positives = 29/42 (69%)

Query: 18  KKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
           ++EE  EE    EEEEEEEE++ EE++ E +++EEE + +  
Sbjct: 440 EEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNG 481



 Score = 26.4 bits (58), Expect = 9.6
 Identities = 15/43 (34%), Positives = 29/43 (67%)

Query: 16  KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEG 58
           ++++EEE+EE    +E EEEE + +EE+++ +     E++ EG
Sbjct: 446 EEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEG 488


>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
           subunit 1; Provisional.
          Length = 319

 Score = 30.0 bits (68), Expect = 0.53
 Identities = 13/35 (37%), Positives = 24/35 (68%)

Query: 18  KKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEE 52
           K EEE+EE    E E+E+EE + EE++++  + ++
Sbjct: 285 KAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGDK 319



 Score = 28.9 bits (65), Expect = 1.3
 Identities = 12/35 (34%), Positives = 25/35 (71%)

Query: 16  KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKK 50
           K ++EEE+++    E+E+EE+E ++EE+  ++  K
Sbjct: 285 KAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGDK 319



 Score = 26.9 bits (60), Expect = 6.1
 Identities = 9/35 (25%), Positives = 25/35 (71%)

Query: 13  KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEK 47
           K  ++++E++  E    +EE+E+EE+++++ + +K
Sbjct: 285 KAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGDK 319


>gnl|CDD|239570 cd03488, Topoisomer_IB_N_htopoI_like, Topoisomer_IB_N_htopoI_like :
           N-terminal DNA binding fragment found in eukaryotic DNA
           topoisomerase (topo) IB proteins similar to the
           monomeric yeast and human topo I.  Topo I enzymes are
           divided into:  topo type IA (bacterial) and type IB
           (eukaryotic). Topo I relaxes superhelical tension in
           duplex DNA by creating a single-strand nick, the broken
           strand can then rotate around the unbroken strand to
           remove DNA supercoils and, the nick is religated,
           liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit religation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  This family may represent more than
           one structural domain.
          Length = 215

 Score = 30.0 bits (68), Expect = 0.53
 Identities = 13/30 (43%), Positives = 21/30 (70%), Gaps = 3/30 (10%)

Query: 32  EEEEEEKK---KEEKKKEKKKKEEEKKKEG 58
           + ++EEKK   KEEKK  K +KE+ +++ G
Sbjct: 95  KAQKEEKKAMSKEEKKAIKAEKEKLEEEYG 124



 Score = 29.2 bits (66), Expect = 0.83
 Identities = 11/28 (39%), Positives = 19/28 (67%)

Query: 33  EEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
           + ++E+KK   K+EKK  + EK+K  E+
Sbjct: 95  KAQKEEKKAMSKEEKKAIKAEKEKLEEE 122


>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
           Provisional.
          Length = 2849

 Score = 30.4 bits (68), Expect = 0.54
 Identities = 12/45 (26%), Positives = 31/45 (68%)

Query: 14  RRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEG 58
           R     +++DE+    +++EE++E+++EE+++ K   +E+++ EG
Sbjct: 148 RDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEG 192



 Score = 28.9 bits (64), Expect = 2.0
 Identities = 15/53 (28%), Positives = 29/53 (54%), Gaps = 5/53 (9%)

Query: 13  KRRKKKKEEEDEEGGG-----GEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
           +RR +   EED           ++E+E+E+   EE  +E++++EEE K   ++
Sbjct: 134 RRRARHLAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDE 186



 Score = 28.5 bits (63), Expect = 2.6
 Identities = 20/54 (37%), Positives = 29/54 (53%), Gaps = 3/54 (5%)

Query: 20  EEEDEEGGGGEEEEEEEEKK---KEEKKKEKKKKEEEKKKEGEKRRTLTFIRIG 70
           E++D+E    EEEEEEEE K    E+++ E  +    +K E +K     FI  G
Sbjct: 162 EDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGEDFTYEKSEVDKTDCFKFIEAG 215



 Score = 27.7 bits (61), Expect = 4.2
 Identities = 11/45 (24%), Positives = 30/45 (66%)

Query: 22  EDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTF 66
           +D++    E++++EE+ ++EE+++E+ K  +++ +E E     T+
Sbjct: 154 DDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGEDFTY 198


>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
          Length = 1437

 Score = 30.2 bits (69), Expect = 0.58
 Identities = 16/53 (30%), Positives = 28/53 (52%)

Query: 20  EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIGKP 72
            +E+ E    ++EEE+E+  KE  +  KK + E+KK+          ++IGK 
Sbjct: 168 SKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPVQIGKK 220



 Score = 28.3 bits (64), Expect = 2.7
 Identities = 14/43 (32%), Positives = 24/43 (55%)

Query: 18  KKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
             + ++E      ++EEE+EK  +E  +  KK E EKKK+ + 
Sbjct: 165 IDDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKN 207


>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein.  This
          family includes archaebacterial L12, eukaryotic P0, P1
          and P2.
          Length = 88

 Score = 28.7 bits (65), Expect = 0.59
 Identities = 8/19 (42%), Positives = 13/19 (68%)

Query: 35 EEEKKKEEKKKEKKKKEEE 53
                EE+KKE++++EEE
Sbjct: 61 AAAAAAEEEKKEEEEEEEE 79



 Score = 28.4 bits (64), Expect = 0.66
 Identities = 7/20 (35%), Positives = 14/20 (70%)

Query: 34 EEEEKKKEEKKKEKKKKEEE 53
                +EEKK+E++++EE+
Sbjct: 61 AAAAAAEEEKKEEEEEEEED 80



 Score = 28.0 bits (63), Expect = 0.89
 Identities = 8/21 (38%), Positives = 12/21 (57%)

Query: 26 GGGGEEEEEEEEKKKEEKKKE 46
                   EEEKK+EE+++E
Sbjct: 58 AAAAAAAAAEEEKKEEEEEEE 78



 Score = 26.4 bits (59), Expect = 3.3
 Identities = 7/21 (33%), Positives = 13/21 (61%)

Query: 39 KKEEKKKEKKKKEEEKKKEGE 59
                 E++KKEEE+++E +
Sbjct: 60 AAAAAAAEEEKKEEEEEEEED 80



 Score = 26.0 bits (58), Expect = 4.7
 Identities = 7/21 (33%), Positives = 13/21 (61%)

Query: 26 GGGGEEEEEEEEKKKEEKKKE 46
                  EEE+K++EE+++E
Sbjct: 59 AAAAAAAAEEEKKEEEEEEEE 79



 Score = 26.0 bits (58), Expect = 5.0
 Identities = 4/26 (15%), Positives = 15/26 (57%)

Query: 28 GGEEEEEEEEKKKEEKKKEKKKKEEE 53
                     ++E+K++E++++E++
Sbjct: 56 AAAAAAAAAAAEEEKKEEEEEEEEDD 81



 Score = 25.3 bits (56), Expect = 8.7
 Identities = 7/21 (33%), Positives = 13/21 (61%)

Query: 37 EKKKEEKKKEKKKKEEEKKKE 57
                  +E+KK+EEE+++E
Sbjct: 59 AAAAAAAAEEEKKEEEEEEEE 79


>gnl|CDD|218771 pfam05835, Synaphin, Synaphin protein.  This family consists of
          several eukaryotic synaphin 1 and 2 proteins.
          Synaphin/complexin is a cytosolic protein that
          preferentially binds to syntaxin within the SNARE
          complex. Synaphin promotes SNAREs to form precomplexes
          that oligomerise into higher order structures. A
          peptide from the central, syntaxin binding domain of
          synaphin competitively inhibits these two proteins from
          interacting and prevents SNARE complexes from
          oligomerising. It is thought that oligomerisation of
          SNARE complexes into a higher order structure creates a
          SNARE scaffold for efficient, regulated fusion of
          synaptic vesicles. Synaphin promotes neuronal
          exocytosis by promoting interaction between the
          complementary syntaxin and synaptobrevin transmembrane
          regions that reside in opposing membranes prior to
          fusion.
          Length = 139

 Score = 29.5 bits (66), Expect = 0.60
 Identities = 15/53 (28%), Positives = 30/53 (56%)

Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIR 68
          K     +++EG   + EEE+EE ++  ++ E+++K + +K E E+      IR
Sbjct: 17 KGMLGGKEDEGDESDAEEEDEEIQEALREAEEERKAKHRKMEEEREVMRQGIR 69


>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2.  It is vital for
           effective cell-replication that replication is not
           stalled at any point by, for instance, damaged bases.
           Rtf2 stabilizes the replication fork stalled at the
           site-specific replication barrier RTS1 by preventing
           replication restart until completion of DNA synthesis by
           a converging replication fork initiated at a flanking
           origin. The RTS1 element terminates replication forks
           that are moving in the cen2-distal direction while
           allowing forks moving in the cen2-proximal direction to
           pass through the region. Rtf2 contains a C2HC2 motif
           related to the C3HC4 RING-finger motif, and would appear
           to fold up, creating a RING finger-like structure but
           forming only one functional Zn2+ ion-binding site.
          Length = 254

 Score = 30.0 bits (68), Expect = 0.62
 Identities = 13/28 (46%), Positives = 16/28 (57%)

Query: 33  EEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
           EEE  KKK++KKK+K KK        E 
Sbjct: 177 EEERAKKKKKKKKKKTKKNNATGSSAEA 204



 Score = 28.9 bits (65), Expect = 1.2
 Identities = 13/29 (44%), Positives = 20/29 (68%)

Query: 29  GEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
             EEE E  K + E+++ KKKK+++KKK 
Sbjct: 164 PTEEEVELLKARLEEERAKKKKKKKKKKT 192



 Score = 27.3 bits (61), Expect = 4.1
 Identities = 12/29 (41%), Positives = 16/29 (55%)

Query: 31  EEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
           EEE  ++KKK++KKK KK        E  
Sbjct: 177 EEERAKKKKKKKKKKTKKNNATGSSAEAT 205


>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62. 
          Length = 217

 Score = 29.8 bits (67), Expect = 0.62
 Identities = 13/40 (32%), Positives = 22/40 (55%)

Query: 10 SKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKK 49
          S+ K  +K K     +     ++   +EKK+E+KK +KKK
Sbjct: 38 SQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKKKK 77



 Score = 26.7 bits (59), Expect = 7.2
 Identities = 11/51 (21%), Positives = 21/51 (41%)

Query: 10 SKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
           K       K    ++     +   + +  +  KK   ++K+EEKKK  +K
Sbjct: 26 DKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKKK 76



 Score = 26.3 bits (58), Expect = 9.4
 Identities = 7/41 (17%), Positives = 15/41 (36%)

Query: 22 EDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
          E E+    +++   E   K   + +  +K +   K     R
Sbjct: 17 ESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAER 57


>gnl|CDD|100109 cd05831, Ribosomal_P1, Ribosomal protein P1. This subfamily
          represents the eukaryotic large ribosomal protein P1.
          Eukaryotic P1 and P2 are functionally equivalent to the
          bacterial protein L7/L12, but are not homologous to
          L7/L12. P1 is located in the L12 stalk, with proteins
          P2, P0, L11, and 28S rRNA. P1 and P2 are the only
          proteins in the ribosome to occur as multimers, always
          appearing as sets of heterodimers. Recent data indicate
          that eukaryotes have four copies (two heterodimers),
          while most archaeal species contain six copies of L12p
          (three homodimers) and bacteria may have four or six
          copies (two or three homodimers), depending on the
          species. Experiments using S. cerevisiae P1 and P2
          indicate that P1 proteins are positioned more
          internally with limited reactivity in the C-terminal
          domains, while P2 proteins seem to be more externally
          located and are more likely to interact with other
          cellular components. In lower eukaryotes, P1 and P2 are
          further subdivided into P1A, P1B, P2A, and P2B, which
          form P1A/P2B and P1B/P2A heterodimers. Some plant
          species have a third P-protein, called P3, which is not
          homologous to P1 and P2. In humans, P1 and P2 are
          strongly autoimmunogenic. They play a significant role
          in the etiology and pathogenesis of systemic lupus
          erythema (SLE). In addition, the ribosome-inactivating
          protein trichosanthin (TCS) interacts with human P0,
          P1, and P2, with its primary binding site located in
          the C-terminal region of P2. TCS inactivates the
          ribosome by depurinating a specific adenine in the
          sarcin-ricin loop of 28S rRNA.
          Length = 103

 Score = 28.8 bits (65), Expect = 0.63
 Identities = 7/21 (33%), Positives = 13/21 (61%)

Query: 26 GGGGEEEEEEEEKKKEEKKKE 46
                 E ++E+KKEE+++E
Sbjct: 73 AAAAAAAEAKKEEKKEEEEEE 93



 Score = 28.4 bits (64), Expect = 0.89
 Identities = 8/27 (29%), Positives = 15/27 (55%)

Query: 27 GGGEEEEEEEEKKKEEKKKEKKKKEEE 53
                    E KKEEKK+E++++ ++
Sbjct: 70 AAAAAAAAAAEAKKEEKKEEEEEESDD 96



 Score = 27.7 bits (62), Expect = 1.6
 Identities = 7/21 (33%), Positives = 13/21 (61%)

Query: 26 GGGGEEEEEEEEKKKEEKKKE 46
                E ++EEKK+EE+++ 
Sbjct: 74 AAAAAAEAKKEEKKEEEEEES 94



 Score = 25.4 bits (56), Expect = 9.6
 Identities = 9/19 (47%), Positives = 14/19 (73%)

Query: 39 KKEEKKKEKKKKEEEKKKE 57
             E KKE+KK+EEE++ +
Sbjct: 77 AAAEAKKEEKKEEEEEESD 95


>gnl|CDD|218311 pfam04888, SseC, Secretion system effector C (SseC) like family. 
          SseC is a secreted protein that forms a complex
          together with SecB and SecD on the surface of
          Salmonella. All these proteins are secreted by the type
          III secretion system. Many mucosal pathogens use type
          III secretion systems for the injection of effector
          proteins into target cells. SecB, SseC and SecD are
          inserted into the target cell membrane. where they form
          a small pore or translocon. In addition to SseC, this
          family includes the bacterial secreted proteins PopB,
          PepB, YopB and EspD which are thought to be directly
          involved in pore formation, and type III secretion
          system translocon.
          Length = 303

 Score = 29.7 bits (67), Expect = 0.64
 Identities = 11/30 (36%), Positives = 20/30 (66%)

Query: 31 EEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
          E   ++++KK E+ +E+ KK  EK +E +K
Sbjct: 25 ERARDKQEKKAEEYQEQIKKAIEKAEEAKK 54


>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
           complex component).  RNases H are enzymes that
           specifically hydrolyse RNA when annealed to a
           complementary DNA and are present in all living
           organisms. In yeast RNase H2 is composed of a complex of
           three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
           family represents the homologues of Ydr279p. It is not
           known whether non yeast proteins in this family fulfil
           the same function.
          Length = 287

 Score = 30.0 bits (68), Expect = 0.67
 Identities = 11/47 (23%), Positives = 26/47 (55%)

Query: 10  SKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKK 56
              K++++ +E+ +      E++ + +E+ K++K KE K  +  KK 
Sbjct: 229 ESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKGVKALKKV 275



 Score = 27.7 bits (62), Expect = 3.1
 Identities = 13/32 (40%), Positives = 19/32 (59%)

Query: 30  EEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
           E EE+ E  +   +KK K K+E +KKK  E +
Sbjct: 236 ETEEDVEAAESRAEKKRKSKEEIKKKKPKESK 267


>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
          Length = 330

 Score = 29.8 bits (68), Expect = 0.67
 Identities = 9/24 (37%), Positives = 18/24 (75%)

Query: 30  EEEEEEEEKKKEEKKKEKKKKEEE 53
           +    EEE+++EE+++E++  EEE
Sbjct: 297 QAAAAEEEEEEEEEEEEEEPSEEE 320



 Score = 27.5 bits (62), Expect = 4.4
 Identities = 11/42 (26%), Positives = 23/42 (54%), Gaps = 5/42 (11%)

Query: 23  DEEGGGGEEEEE-----EEEKKKEEKKKEKKKKEEEKKKEGE 59
            ++    EE +E      +    EE+++E++++EEE+  E E
Sbjct: 279 ADKDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEEE 320



 Score = 26.4 bits (59), Expect = 8.6
 Identities = 9/38 (23%), Positives = 23/38 (60%)

Query: 16  KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEE 53
           K   +EE +E    + +    E+++EE+++E++++  E
Sbjct: 281 KDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSE 318



 Score = 26.4 bits (59), Expect = 9.3
 Identities = 8/38 (21%), Positives = 21/38 (55%)

Query: 23  DEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
           D++    E +E    + +    +E++++EEE+++E   
Sbjct: 280 DKDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPS 317


>gnl|CDD|233042 TIGR00598, rad14, DNA repair protein.  All proteins in this family
           for which functions are known are used for the
           recognition of DNA damage as part of nucleotide excision
           repair. This family is based on the phylogenomic
           analysis of JA Eisen (1999, Ph.D. Thesis, Stanford
           University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 172

 Score = 29.4 bits (66), Expect = 0.67
 Identities = 15/36 (41%), Positives = 25/36 (69%)

Query: 28  GGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
           G EE  +EE++++EE K+E K+K+ EKK +  +R  
Sbjct: 91  GSEEALDEEKERREESKEEMKEKKFEKKLKELRRAV 126


>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2.  PPP4R2 (protein phosphatase 4 core
           regulatory subunit R2) is the regulatory subunit of the
           histone H2A phosphatase complex. It has been shown to
           confer resistance to the anticancer drug cisplatin in
           yeast, and may confer resistance in higher eukaryotes.
          Length = 285

 Score = 29.8 bits (67), Expect = 0.69
 Identities = 16/52 (30%), Positives = 32/52 (61%)

Query: 3   NSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEK 54
                + S  K +K   EE+D++ G   EE+E +E ++EE+ +E++++E+E 
Sbjct: 233 QDDGPKGSDVKNKKSDDEEDDDQDGDYVEEKELKEDEEEEETEEEEEEEDED 284



 Score = 28.3 bits (63), Expect = 2.4
 Identities = 15/43 (34%), Positives = 30/43 (69%)

Query: 17  KKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
           K K+ +DEE    + +  EE++ KE++++E+ ++EEE++ E E
Sbjct: 243 KNKKSDDEEDDDQDGDYVEEKELKEDEEEEETEEEEEEEDEDE 285


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 30.0 bits (68), Expect = 0.72
 Identities = 17/65 (26%), Positives = 34/65 (52%)

Query: 11  KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIG 70
           +Y+  +++  E   E  G   E EE EK++EE KK  +K +EE ++  + ++ L  +   
Sbjct: 660 EYEELREEYLELSRELAGLRAELEELEKRREEIKKTLEKLKEELEEREKAKKELEKLEKA 719

Query: 71  KPALK 75
              ++
Sbjct: 720 LERVE 724



 Score = 27.3 bits (61), Expect = 5.0
 Identities = 16/50 (32%), Positives = 29/50 (58%)

Query: 13  KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
           + R K+ EE++E     +++ +E EK+ EE ++  +  EE K K+ E  R
Sbjct: 327 EERIKELEEKEERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELER 376



 Score = 27.3 bits (61), Expect = 5.9
 Identities = 21/56 (37%), Positives = 32/56 (57%), Gaps = 3/56 (5%)

Query: 8   QVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKK---EEKKKEKKKKEEEKKKEGEK 60
           ++ K ++  K+ EE  EE    E+E E  E  K   EEK +E +++ EE KKE E+
Sbjct: 222 ELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEE 277


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 728

 Score = 30.0 bits (68), Expect = 0.74
 Identities = 16/59 (27%), Positives = 32/59 (54%)

Query: 2   QNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
             S   +       K+KKE ++EE    EEE + E+   +  K+ +K ++EE+++E ++
Sbjct: 434 AESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDE 492



 Score = 28.9 bits (65), Expect = 1.6
 Identities = 20/55 (36%), Positives = 28/55 (50%), Gaps = 10/55 (18%)

Query: 16  KKKKEEEDEEGGG------GEEEEEEEEKKKEEKKKEKKKK----EEEKKKEGEK 60
            +KKEE D E         GEEE +EEE ++  KK   ++K      EK+ E +K
Sbjct: 384 ARKKEENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKK 438


>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207).  This
           family is found in eukaryotes; it has several conserved
           tryptophan residues. The function is not known.
          Length = 261

 Score = 29.7 bits (67), Expect = 0.76
 Identities = 14/57 (24%), Positives = 32/57 (56%), Gaps = 4/57 (7%)

Query: 11  KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEE----KKKEGEKRRT 63
             K+ +++K+ +       ++E E+E ++ E +++  K+K EE    K ++  K+RT
Sbjct: 94  SAKQAQRQKKLQKLLEEKQKQEREKEREEAELRQRLAKEKYEEWCRQKAQQAAKQRT 150



 Score = 28.5 bits (64), Expect = 1.7
 Identities = 13/44 (29%), Positives = 29/44 (65%), Gaps = 5/44 (11%)

Query: 14  RRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
             KK K+++ +      EEE  +++KK+++++E+K+K EE  ++
Sbjct: 188 ELKKLKQQQQKR-----EEERRKQRKKQQEEEERKQKAEEAWQK 226



 Score = 28.2 bits (63), Expect = 2.7
 Identities = 12/59 (20%), Positives = 38/59 (64%), Gaps = 4/59 (6%)

Query: 2   QNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
                  VS+ + +K+ +E E ++     ++++++ +++  K+++K+++EEE+K++ E+
Sbjct: 168 SAKPERNVSQEEAKKRLQEWELKK----LKQQQQKREEERRKQRKKQQEEEERKQKAEE 222



 Score = 26.2 bits (58), Expect = 9.1
 Identities = 12/56 (21%), Positives = 30/56 (53%), Gaps = 3/56 (5%)

Query: 11  KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKE---KKKKEEEKKKEGEKRRT 63
           + K +K  +E++ +E     EE E  ++  +EK +E   +K ++  K++  + ++ 
Sbjct: 101 QKKLQKLLEEKQKQEREKEREEAELRQRLAKEKYEEWCRQKAQQAAKQRTPKHKKE 156


>gnl|CDD|224403 COG1486, CelF, Alpha-galactosidases/6-phospho-beta-glucosidases,
           family 4 of glycosyl hydrolases [Carbohydrate transport
           and metabolism].
          Length = 442

 Score = 29.9 bits (68), Expect = 0.77
 Identities = 17/52 (32%), Positives = 27/52 (51%), Gaps = 3/52 (5%)

Query: 12  YKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
           Y  ++KK +E ++ G    E  + E   KE  +  KK + +EK +E EKR  
Sbjct: 265 YDMKEKKPDEIEKFGTRANEVMKRE---KELFELYKKPELKEKPEELEKRIG 313


>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
          Length = 1036

 Score = 29.8 bits (67), Expect = 0.77
 Identities = 19/48 (39%), Positives = 25/48 (52%), Gaps = 3/48 (6%)

Query: 13  KRR---KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
           KRR   K  KEE + E    E+   EEEK   E  + + K E EK++E
Sbjct: 254 KRRELEKLAKEEAERERQAEEQRRREEEKAAMEADRAQAKAEVEKRRE 301



 Score = 28.3 bits (63), Expect = 2.9
 Identities = 11/31 (35%), Positives = 22/31 (70%)

Query: 30  EEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
           EE+  E EK  +E+ + +++ EE++++E EK
Sbjct: 252 EEKRRELEKLAKEEAERERQAEEQRRREEEK 282


>gnl|CDD|227474 COG5145, RAD14, DNA excision repair protein [DNA replication,
           recombination, and repair].
          Length = 292

 Score = 29.6 bits (66), Expect = 0.77
 Identities = 14/38 (36%), Positives = 27/38 (71%)

Query: 28  GGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLT 65
           G  EE + E++++E+ K ++K+K+ EKK +  +R+T T
Sbjct: 209 GSLEELDREKQRREKMKDDRKEKKLEKKIKELRRKTRT 246


>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992).  This
           bacterial family of proteins has no known function.
           However, the cis-regulatory yjdF motif, just upstream
           from the gene encoding the proteins for this family, is
           a small non-coding RNA, Rfam:RF01764. The yjdF motif is
           found in many Firmicutes, including Bacillus subtilis.
           In most cases, it resides in potential 5' UTRs of
           homologues of the yjdF gene whose function is unknown.
           However, in Streptococcus thermophilus, a yjdF RNA motif
           is associated with an operon whose protein products
           synthesise nicotinamide adenine dinucleotide (NAD+).
           Also, the S. thermophilus yjdF RNA lacks typical yjdF
           motif consensus features downstream of and including the
           P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
           S. thermophilus RNAs might sense a distinct compound
           that structurally resembles the ligand bound by other
           yjdF RNAs. On the ohter hand, perhaps these RNAs have an
           alternative solution forming a similar binding site, as
           is observed with some SAM riboswitches.
          Length = 132

 Score = 28.8 bits (65), Expect = 0.78
 Identities = 13/30 (43%), Positives = 23/30 (76%)

Query: 33  EEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
            E E  K+E+KK+ K+KKEEEK+++ + ++
Sbjct: 93  LEHERNKQEKKKRSKEKKEEEKERKRQLKQ 122


>gnl|CDD|234032 TIGR02855, spore_yabG, sporulation peptidase YabG.  Members of this
           family are the protein YabG, demonstrated for Bacillus
           subtilis to be an endopeptidase able to release
           N-terminal peptides from a number of sporulation
           proteins, including CotT, CotF, and SpoIVA. It appears
           to be expressed under control of sigma-K [Cellular
           processes, Sporulation and germination].
          Length = 283

 Score = 29.6 bits (67), Expect = 0.83
 Identities = 16/63 (25%), Positives = 27/63 (42%), Gaps = 8/63 (12%)

Query: 9   VSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIR 68
           V + +R+K+KK  +        E  E   +   +  K  K+K+E +   G K   L F  
Sbjct: 52  VDESERKKRKKRFK--------ERMETSYRLFRQDYKLMKEKKEYRATGGYKNNQLYFGM 103

Query: 69  IGK 71
            G+
Sbjct: 104 PGR 106


>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family.  This family of proteins includes
           MND1 from S. cerevisiae. The mnd1 protein forms a
           complex with hop2 to promote homologous chromosome
           pairing and meiotic double-strand break repair.
          Length = 188

 Score = 29.1 bits (66), Expect = 0.84
 Identities = 15/55 (27%), Positives = 30/55 (54%), Gaps = 1/55 (1%)

Query: 6   TTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
             ++ + K+R  + + + E+   G EE EE   +  E+ K+ +K+ ++ K E EK
Sbjct: 75  KKELEELKQRIAELQAQIEKLKKGREETEERT-ELLEELKQLEKELKKLKAELEK 128


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 29.6 bits (67), Expect = 0.86
 Identities = 11/43 (25%), Positives = 26/43 (60%)

Query: 18  KKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
             +E+D+E    ++ ++E+E+KKE K+ EK   +++   + + 
Sbjct: 151 DDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDD 193



 Score = 26.5 bits (59), Expect = 9.7
 Identities = 9/43 (20%), Positives = 24/43 (55%)

Query: 17  KKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
              E++DE+    + ++E+EEKK+ ++ ++    ++    E +
Sbjct: 151 DDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDD 193


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
            found at the N terminus of SMC proteins. The SMC
            (structural maintenance of chromosomes) superfamily
            proteins have ATP-binding domains at the N- and
            C-termini, and two extended coiled-coil domains separated
            by a hinge in the middle. The eukaryotic SMC proteins
            form two kind of heterodimers: the SMC1/SMC3 and the
            SMC2/SMC4 types. These heterodimers constitute an
            essential part of higher order complexes, which are
            involved in chromatin and DNA dynamics. This family also
            includes the RecF and RecN proteins that are involved in
            DNA metabolism and recombination.
          Length = 1162

 Score = 29.9 bits (67), Expect = 0.87
 Identities = 17/57 (29%), Positives = 27/57 (47%)

Query: 13   KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRI 69
            KR    KEE          E EE+E++  + + +K++ EEEKK+   +    T  R 
Sbjct: 964  KRLLLAKEELGNVNLMAIAEFEEKEERYNKDELKKERLEEEKKELLREIIEETCQRF 1020



 Score = 29.6 bits (66), Expect = 1.00
 Identities = 14/57 (24%), Positives = 28/57 (49%)

Query: 1    LQNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
             +  +  +  +  +R    +EE          E EE++++  K + KK++ EE+KKE
Sbjct: 951  KEEDNKEEEEERNKRLLLAKEELGNVNLMAIAEFEEKEERYNKDELKKERLEEEKKE 1007



 Score = 28.8 bits (64), Expect = 1.9
 Identities = 16/63 (25%), Positives = 30/63 (47%), Gaps = 2/63 (3%)

Query: 11  KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKE--EEKKKEGEKRRTLTFIR 68
           K +R KK  EE +       + EE + ++ + K++ KK  E  + K+K   +   L ++ 
Sbjct: 172 KKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEENLLYLD 231

Query: 69  IGK 71
             K
Sbjct: 232 YLK 234



 Score = 28.8 bits (64), Expect = 2.1
 Identities = 10/47 (21%), Positives = 24/47 (51%)

Query: 16  KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
           +++ E E+E  G  E+ +++E  KK  ++ E   +     +E + + 
Sbjct: 154 ERRLEIEEEAAGSREKRKKKERLKKLIEETENLAELIIDLEELKLQE 200



 Score = 28.4 bits (63), Expect = 2.3
 Identities = 20/62 (32%), Positives = 30/62 (48%)

Query: 10  SKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRI 69
              K+  +K+EE   +     +EEE+E+K +EE+ K   K+EEE K E  K         
Sbjct: 255 ESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDE 314

Query: 70  GK 71
            K
Sbjct: 315 EK 316



 Score = 28.0 bits (62), Expect = 3.1
 Identities = 18/54 (33%), Positives = 30/54 (55%)

Query: 16  KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRI 69
           ++K E   +E    EEEE+ E+ K EE+K+EK K +EE+ +  E+        +
Sbjct: 765 EEKSELSLKEKELAEEEEKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAELL 818



 Score = 27.6 bits (61), Expect = 5.2
 Identities = 13/60 (21%), Positives = 30/60 (50%)

Query: 16  KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIGKPALK 75
           K+++          EEE+E++ +++E K   K+++E + +    +RR +      K + K
Sbjct: 263 KEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEK 322


>gnl|CDD|216095 pfam00748, Calpain_inhib, Calpain inhibitor.  This region is found
           multiple times in calpain inhibitor proteins.
          Length = 131

 Score = 28.6 bits (64), Expect = 0.92
 Identities = 16/36 (44%), Positives = 24/36 (66%)

Query: 29  GEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTL 64
           G+ E + EEKK  E K ++K KEE+ +K GE+  T+
Sbjct: 72  GQREPDPEEKKPVEDKVKEKAKEEKLEKLGEREDTI 107


>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
           component YidC; Validated.
          Length = 429

 Score = 29.4 bits (66), Expect = 0.92
 Identities = 9/74 (12%), Positives = 29/74 (39%), Gaps = 2/74 (2%)

Query: 3   NSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
            +        + +K +  E++E      ++E  ++++  E++  ++ ++E        R 
Sbjct: 321 RAPELHAENAEIKKTRTAEKNEAKA--RKKEIAQKRRAAEREINREARQERAAAMARARA 378

Query: 63  TLTFIRIGKPALKP 76
               ++  K  L  
Sbjct: 379 RRAAVKAKKKGLID 392


>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
           chain; Provisional.
          Length = 1033

 Score = 29.8 bits (67), Expect = 0.92
 Identities = 12/56 (21%), Positives = 25/56 (44%)

Query: 13  KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIR 68
              +  ++ + ++     E EE  +    E+++EK++  EE      +R    FIR
Sbjct: 782 YLMQAHQKGQLKDTIDVAEPEEPGDPLTAEEQEEKEQLLEEGFSTWSRRDFNAFIR 837



 Score = 29.4 bits (66), Expect = 1.3
 Identities = 11/43 (25%), Positives = 27/43 (62%)

Query: 20 EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
          ++++      +E+EE++E+ +   K E  K+E+ + KE +K++
Sbjct: 30 DDDEVPAEDEDEDEEDDEEAESPAKAEISKREKARLKELKKQK 72


>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
           This is a family of fungal proteins whose function is
           unknown.
          Length = 130

 Score = 28.8 bits (65), Expect = 0.96
 Identities = 13/48 (27%), Positives = 24/48 (50%), Gaps = 8/48 (16%)

Query: 13  KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
           KRR   +           E  +E  +K++  +K ++KK + ++KE EK
Sbjct: 91  KRRIALRL--------RRERTKERAEKEKRTRKNREKKFKRRQKEKEK 130


>gnl|CDD|227880 COG5593, COG5593, Nucleic-acid-binding protein possibly involved in
           ribosomal biogenesis [Translation, ribosomal structure
           and biogenesis].
          Length = 821

 Score = 29.6 bits (66), Expect = 0.96
 Identities = 16/38 (42%), Positives = 21/38 (55%)

Query: 20  EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
            E  EE    EEE+EEEE K+   K+ KKK+ +   K 
Sbjct: 768 AESSEEDESSEEEKEEEENKEVSAKRAKKKQRKNMLKS 805



 Score = 27.3 bits (60), Expect = 5.5
 Identities = 12/43 (27%), Positives = 25/43 (58%)

Query: 20  EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
           + E +      E  EE+E  +EEK++E+ K+   K+ + ++R+
Sbjct: 758 DGEQDNSDSFAESSEEDESSEEEKEEEENKEVSAKRAKKKQRK 800


>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin.
          Trichoplein or mitostatin, was first defined as a
          meiosis-specific nuclear structural protein. It has
          since been linked with mitochondrial movement. It is
          associated with the mitochondrial outer membrane, and
          over-expression leads to reduction in mitochondrial
          motility whereas lack of it enhances mitochondrial
          movement. The activity appears to be mediated through
          binding the mitochondria to the actin intermediate
          filaments (IFs).
          Length = 349

 Score = 29.5 bits (67), Expect = 0.97
 Identities = 16/43 (37%), Positives = 28/43 (65%)

Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKK 55
          K+R K +E+E+E       EEE  +   EE+++E+K+KEE ++
Sbjct: 29 KKRIKAEEKEEERRIDEMMEEERLKALAEEEERERKRKEERRE 71


>gnl|CDD|235971 PRK07219, PRK07219, DNA topoisomerase I; Validated.
          Length = 822

 Score = 29.6 bits (67), Expect = 0.98
 Identities = 9/53 (16%), Positives = 17/53 (32%), Gaps = 4/53 (7%)

Query: 11  KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
            Y  ++KK++E         E  +    K  EK K+   +        +    
Sbjct: 763 NYTEKQKKEKES----KSELEALKGVGAKTAEKLKDAGVETVTDLTAADPDAV 811


>gnl|CDD|239285 cd02987, Phd_like_Phd, Phosducin (Phd)-like family, Phd
          subfamily; Phd is a cytosolic regulator of G protein
          functions. It specifically binds G protein betagamma
          (Gbg)-subunits with high affinity, resulting in the
          solubilization of Gbg from the plasma membrane. This
          impedes the formation of a functional G protein trimer
          (G protein alphabetagamma), thereby inhibiting G
          protein-mediated signal transduction. Phd also inhibits
          the GTPase activity of G protein alpha. Phd can be
          phosphorylated by protein kinase A and G
          protein-coupled receptor kinase 2, leading to its
          inactivation. Phd was originally isolated from the
          retina, where it is highly expressed and has been
          implicated to play an important role in light
          adaptation. It is also found in the pineal gland,
          liver, spleen, striated muscle and the brain. The
          C-terminal domain of Phd adopts a thioredoxin fold, but
          it does not contain a CXXC motif. Phd interacts with G
          protein beta mostly through the N-terminal helical
          domain.
          Length = 175

 Score = 28.8 bits (65), Expect = 1.0
 Identities = 11/54 (20%), Positives = 31/54 (57%), Gaps = 6/54 (11%)

Query: 9  VSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
          ++ +++ K+ KE E       E+++++E+K++  ++  +++ +E   K    RR
Sbjct: 13 INDWRKFKQLKESEQ------EDDDDDEDKEEFLQQYREQRMQEMHAKLPFGRR 60


>gnl|CDD|218734 pfam05758, Ycf1, Ycf1.  The chloroplast genomes of most higher
           plants contain two giant open reading frames designated
           ycf1 and ycf2. Although the function of Ycf1 is unknown,
           it is known to be an essential gene.
          Length = 832

 Score = 29.6 bits (67), Expect = 1.0
 Identities = 15/68 (22%), Positives = 30/68 (44%), Gaps = 10/68 (14%)

Query: 2   QNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEE----------EEKKKEEKKKEKKKKE 51
           +   T++  + +     + E   E  G ++E+E           EEK+  +K ++  K E
Sbjct: 225 KLKETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLE 284

Query: 52  EEKKKEGE 59
             K+K+ E
Sbjct: 285 ILKEKKDE 292



 Score = 29.2 bits (66), Expect = 1.5
 Identities = 10/29 (34%), Positives = 20/29 (68%)

Query: 29  GEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
            +++E +     EEK K+++KKE+ K++E
Sbjct: 648 EKKKEFKILDYTEEKTKKEEKKEKNKREE 676



 Score = 28.0 bits (63), Expect = 2.8
 Identities = 13/37 (35%), Positives = 23/37 (62%)

Query: 34  EEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIG 70
            EE+ KKEEKK++ K++E+E+ +  E   T+   +  
Sbjct: 659 TEEKTKKEEKKEKNKREEKERIEIAETWDTIPLAQAI 695



 Score = 28.0 bits (63), Expect = 3.4
 Identities = 13/31 (41%), Positives = 21/31 (67%)

Query: 24  EEGGGGEEEEEEEEKKKEEKKKEKKKKEEEK 54
           E+    +  +  EEK K+E+KKEK K+EE++
Sbjct: 648 EKKKEFKILDYTEEKTKKEEKKEKNKREEKE 678



 Score = 26.9 bits (60), Expect = 6.6
 Identities = 10/32 (31%), Positives = 23/32 (71%)

Query: 30  EEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
           E+++E +     E+K +K++K+E+ K+E ++R
Sbjct: 648 EKKKEFKILDYTEEKTKKEEKKEKNKREEKER 679


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 29.6 bits (66), Expect = 1.1
 Identities = 9/39 (23%), Positives = 19/39 (48%)

Query: 20   EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEG 58
            E+ +E+G     +E EE  +   K  E+ ++ E  + + 
Sbjct: 4060 EKMNEDGFEENVQENEESTEDGVKSDEELEQGEVPEDQA 4098



 Score = 26.5 bits (58), Expect = 9.7
 Identities = 8/38 (21%), Positives = 19/38 (50%)

Query: 23   DEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
            +++    E+  EE  ++ EE  ++  K +EE ++    
Sbjct: 4057 EDDEKMNEDGFEENVQENEESTEDGVKSDEELEQGEVP 4094


>gnl|CDD|216799 pfam01940, DUF92, Integral membrane protein DUF92.  Members of
          this family have several predicted transmembrane
          helices. The function of these prokaryotic proteins is
          unknown.
          Length = 224

 Score = 29.0 bits (66), Expect = 1.1
 Identities = 5/20 (25%), Positives = 8/20 (40%)

Query: 9  VSKYKRRKKKKEEEDEEGGG 28
           +K   ++K      E  GG
Sbjct: 55 ATKLGYKRKAALGIAEARGG 74


>gnl|CDD|177433 PHA02608, 67, prohead core protein; Provisional.
          Length = 80

 Score = 27.8 bits (62), Expect = 1.1
 Identities = 7/35 (20%), Positives = 21/35 (60%)

Query: 25 EGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
          EG   E+++++E+   ++ K +K   +++  ++ E
Sbjct: 46 EGEEPEDDDDDEDDDDDDDKDDKDDDDDDDDEDDE 80



 Score = 27.4 bits (61), Expect = 1.2
 Identities = 7/32 (21%), Positives = 19/32 (59%)

Query: 29 GEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
          GEE E++++ + ++   +K  K+++   + E 
Sbjct: 47 GEEPEDDDDDEDDDDDDDKDDKDDDDDDDDED 78



 Score = 25.1 bits (55), Expect = 9.7
 Identities = 6/34 (17%), Positives = 19/34 (55%)

Query: 20 EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEE 53
          E E+ E    +E++++++ K ++   +    E++
Sbjct: 46 EGEEPEDDDDDEDDDDDDDKDDKDDDDDDDDEDD 79


>gnl|CDD|226138 COG3611, DnaB, Replication initiation/membrane attachment protein
           [DNA replication, recombination, and repair].
          Length = 417

 Score = 29.3 bits (66), Expect = 1.1
 Identities = 11/54 (20%), Positives = 25/54 (46%)

Query: 7   TQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
            +++  K++    + ++        E ++   ++E+ K E  K EE K+ E  K
Sbjct: 361 KELALAKKQSSTTKRKENLPKWSNPEYKKTTSQEEQAKLEFIKLEELKRLENGK 414


>gnl|CDD|222613 pfam14235, DUF4337, Domain of unknown function (DUF4337).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 187 and 201 amino
           acids in length. There is a single completely conserved
           residue Q that may be functionally important.
          Length = 158

 Score = 28.7 bits (65), Expect = 1.1
 Identities = 8/32 (25%), Positives = 16/32 (50%), Gaps = 2/32 (6%)

Query: 33  EEEEEKKKEEKKKEKKKKE--EEKKKEGEKRR 62
           + +  + K+EK + + + +  E K KE E   
Sbjct: 73  QAKIARYKKEKARYRSEAKELEAKAKEAEAES 104



 Score = 27.2 bits (61), Expect = 3.5
 Identities = 8/36 (22%), Positives = 19/36 (52%), Gaps = 3/36 (8%)

Query: 7   TQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEE 42
            ++++YK+ K +   E +E    E + +E E + + 
Sbjct: 74  AKIARYKKEKARYRSEAKE---LEAKAKEAEAESDH 106


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
           consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 29.2 bits (65), Expect = 1.2
 Identities = 12/45 (26%), Positives = 25/45 (55%)

Query: 15  RKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
           R+K++E ++          +E+++  E +K+E +K + E KK  E
Sbjct: 251 RQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDE 295



 Score = 28.4 bits (63), Expect = 2.5
 Identities = 15/57 (26%), Positives = 30/57 (52%)

Query: 1   LQNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
           +  +   ++ K +   KK +EE  +    +  + ++E K  EK+ E K+ E +KK+E
Sbjct: 275 VAENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKRE 331



 Score = 27.7 bits (61), Expect = 3.8
 Identities = 13/55 (23%), Positives = 27/55 (49%), Gaps = 10/55 (18%)

Query: 16  KKKKEEEDEEGGGGEEEEEEEEKKKEEKK----------KEKKKKEEEKKKEGEK 60
           K +++ +  +    ++ +E  +K++E K           KE K+  E +K+E EK
Sbjct: 231 KAQQKADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEK 285



 Score = 27.3 bits (60), Expect = 5.1
 Identities = 12/42 (28%), Positives = 21/42 (50%)

Query: 16  KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
           K+ K+  + +    E+ + E +K  EE  K K  K  + K+E
Sbjct: 270 KEDKQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQE 311



 Score = 26.5 bits (58), Expect = 9.2
 Identities = 11/44 (25%), Positives = 23/44 (52%)

Query: 19  KEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
           KE E +E     ++ +EE  KK+    + ++K +  +   +K+R
Sbjct: 204 KERESQEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQR 247


>gnl|CDD|221952 pfam13166, AAA_13, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           conjugative transfer proteins. This family includes the
           PrrC protein that is thought to be the active component
           of the anticodon nuclease.
          Length = 713

 Score = 29.2 bits (66), Expect = 1.2
 Identities = 13/42 (30%), Positives = 25/42 (59%)

Query: 30  EEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIGK 71
           ++E ++ E+K E+ + E +KKEEE +K   K     + ++ K
Sbjct: 101 KKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAK 142



 Score = 28.4 bits (64), Expect = 2.5
 Identities = 17/60 (28%), Positives = 28/60 (46%)

Query: 17  KKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIGKPALKP 76
           KK EE+ E+     E++EEE +K + K  +K  K+  KK +      L  +   K   + 
Sbjct: 105 KKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKYDSNLSEALKGLNYKKNFKEK 164


>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
          Length = 213

 Score = 28.7 bits (65), Expect = 1.2
 Identities = 11/55 (20%), Positives = 31/55 (56%)

Query: 9   VSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
            +K K + ++ E++ ++     E+E+   ++++ K K  +KK + + ++ E + T
Sbjct: 105 EAKAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHT 159


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 29.2 bits (66), Expect = 1.2
 Identities = 7/41 (17%), Positives = 23/41 (56%)

Query: 2    QNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEE 42
             + S+  + + +++K     ED++    ++ E+E+++  E+
Sbjct: 1346 ASQSSRLLRRPRKKKSDSSSEDDDDSEVDDSEDEDDEDDED 1386



 Score = 26.5 bits (59), Expect = 9.1
 Identities = 16/59 (27%), Positives = 29/59 (49%)

Query: 4    SSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
             +TT    +     K EE  EE    EE+E  +E++ + K K K  K  + K + ++++
Sbjct: 1122 KNTTPKDMWLEDLDKFEEALEEQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKK 1180


>gnl|CDD|151656 pfam11214, Med2, Mediator complex subunit 2.  This family of
          mediator complex subunit 2 proteins is conserved in
          fungi. Cyclin-dependent kinase CDK8 or Srb10 interacts
          with and phosphorylates Med2. Post-translational
          modifications of Mediator subunits are important for
          regulation of gene expression.
          Length = 99

 Score = 27.8 bits (62), Expect = 1.3
 Identities = 12/20 (60%), Positives = 19/20 (95%)

Query: 37 EKKKEEKKKEKKKKEEEKKK 56
          E KK+++++E+KKKEEE+KK
Sbjct: 80 ENKKKQEEEERKKKEEEEKK 99


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 29.3 bits (65), Expect = 1.3
 Identities = 17/43 (39%), Positives = 29/43 (67%), Gaps = 1/43 (2%)

Query: 18  KKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
           KK EE  E    E E++  E+++ EK+KEK++ E E+++E E+
Sbjct: 579 KKREEAVEKAKREAEQKAREEREREKEKEKER-EREREREAER 620



 Score = 28.5 bits (63), Expect = 2.3
 Identities = 13/31 (41%), Positives = 23/31 (74%)

Query: 32  EEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
           EE  E+ K+E ++K ++++E EK+KE E+ R
Sbjct: 582 EEAVEKAKREAEQKAREEREREKEKEKERER 612


>gnl|CDD|239571 cd03489, Topoisomer_IB_N_LdtopoI_like,
           Topoisomer_IB_N_LdtopoI_like: N-terminal DNA binding
           fragment found in eukaryotic DNA topoisomerase (topo) IB
           proteins similar to the heterodimeric topo I from
           Leishmania donvanni. Topo I enzymes are divided into:
           topo type IA (bacterial) and type IB (eukaryotic). Topo
           I relaxes superhelical tension in duplex DNA by creating
           a single-strand nick, the broken strand can then rotate
           around the unbroken strand to remove DNA supercoils and,
           the nick is religated, liberating topo I. These enzymes
           regulate the topological changes that accompany DNA
           replication, transcription and other nuclear processes. 
           Human topo I is the target of a diverse set of
           anticancer drugs including camptothecins (CPTs). CPTs
           bind to the topo I-DNA complex and inhibit re-ligation
           of the single-strand nick, resulting in the accumulation
           of topo I-DNA adducts. In addition to differences in
           structure and some biochemical properties,
           Trypanosomatid parasite topo I differ from human topo I
           in their sensitivity to CPTs and other classical topo I
           inhibitors. Trypanosomatid topo I play putative roles in
           organizing the kinetoplast DNA network unique to these
           parasites.  This family may represent more than one
           structural domain.
          Length = 212

 Score = 28.7 bits (64), Expect = 1.3
 Identities = 15/29 (51%), Positives = 18/29 (62%)

Query: 34  EEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
            E+EKKK   K+EKK  +EEK KE E   
Sbjct: 93  REKEKKKSRTKEEKKALKEEKDKEAEPYM 121


>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein.  This
           family includes the radial spoke head proteins RSP4 and
           RSP6 from Chlamydomonas reinhardtii, and several
           eukaryotic homologues, including mammalian RSHL1, the
           protein product of a familial ciliary dyskinesia
           candidate gene.
          Length = 481

 Score = 29.3 bits (66), Expect = 1.3
 Identities = 14/34 (41%), Positives = 19/34 (55%)

Query: 9   VSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEE 42
            +     +K +EEE E+    EEEEE EE + EE
Sbjct: 339 CTWVNPEQKDEEEEQEDEEEEEEEEEPEEPEPEE 372



 Score = 27.7 bits (62), Expect = 4.2
 Identities = 13/28 (46%), Positives = 20/28 (71%)

Query: 16  KKKKEEEDEEGGGGEEEEEEEEKKKEEK 43
           ++K EEE++E    EEEEEE E+ + E+
Sbjct: 345 EQKDEEEEQEDEEEEEEEEEPEEPEPEE 372



 Score = 27.3 bits (61), Expect = 5.5
 Identities = 17/39 (43%), Positives = 28/39 (71%)

Query: 17  KKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKK 55
           ++ EEE+EE   GE +EE+E +++EE++ E+  KEEE  
Sbjct: 168 REGEEEEEEEEVGEADEEDEGEEEEEEEPEEVPKEEEGT 206



 Score = 26.6 bits (59), Expect = 8.3
 Identities = 14/28 (50%), Positives = 21/28 (75%)

Query: 20  EEEDEEGGGGEEEEEEEEKKKEEKKKEK 47
           E++DEE    +EEEEEEE++ EE + E+
Sbjct: 345 EQKDEEEEQEDEEEEEEEEEPEEPEPEE 372


>gnl|CDD|240341 PTZ00272, PTZ00272, heat shock protein 83 kDa (Hsp83); Provisional.
          Length = 701

 Score = 29.3 bits (65), Expect = 1.3
 Identities = 22/52 (42%), Positives = 30/52 (57%), Gaps = 5/52 (9%)

Query: 13  KRRKKKKEEEDEEGGG-----GEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
           K  +K+  +EDEE        GEE + EE K+ +E KK+K KK +E  KE E
Sbjct: 209 KTTEKEVTDEDEEDTKKADEDGEEPKVEEVKEGDEGKKKKTKKVKEVTKEYE 260



 Score = 27.3 bits (60), Expect = 5.8
 Identities = 18/55 (32%), Positives = 31/55 (56%), Gaps = 4/55 (7%)

Query: 30  EEEEEEEEKKKEE----KKKEKKKKEEEKKKEGEKRRTLTFIRIGKPALKPRPTR 80
           +E+EE+ +K  E+    K +E K+ +E KKK+ +K + +T     +   KP  TR
Sbjct: 217 DEDEEDTKKADEDGEEPKVEEVKEGDEGKKKKTKKVKEVTKEYEVQNKHKPLWTR 271


>gnl|CDD|240370 PTZ00342, PTZ00342, acyl-CoA synthetase; Provisional.
          Length = 746

 Score = 29.3 bits (66), Expect = 1.3
 Identities = 13/60 (21%), Positives = 25/60 (41%)

Query: 1   LQNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
           +        S       K  +E+++G     E E+      E  KEK +K ++ K++ +K
Sbjct: 218 INKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEYDKEKLEKIKDLKEKAKK 277



 Score = 28.9 bits (65), Expect = 1.7
 Identities = 11/55 (20%), Positives = 22/55 (40%), Gaps = 3/55 (5%)

Query: 9   VSKYKRRKKKKEEEDE---EGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
           + K K     KEE++        G +  +EE+K  +   + +       + + EK
Sbjct: 210 LIKSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEYDKEK 264


>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
           Genome duplication is precisely regulated by
           cyclin-dependent kinases CDKs, which bring about the
           onset of S phase by activating replication origins and
           then prevent relicensing of origins until mitosis is
           completed. The optimum sequence motif for CDK
           phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
           to have at least 11 potential phosphorylation sites.
           Drc1 is required for DNA synthesis and S-M replication
           checkpoint control. Drc1 associates with Cdc2 and is
           phosphorylated at the onset of S phase when Cdc2 is
           activated. Thus Cdc2 promotes DNA replication by
           phosphorylating Drc1 and regulating its association with
           Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
           substrates required for DNA replication.
          Length = 397

 Score = 29.0 bits (65), Expect = 1.4
 Identities = 14/49 (28%), Positives = 25/49 (51%), Gaps = 2/49 (4%)

Query: 14  RRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
             K   E     GG   +E++E+E  +E K++ +KK++ +KK    K  
Sbjct: 324 DEKSLSEFLGYMGG--IDEDDEDEDDEESKEEVEKKQKVKKKPRKRKVN 370


>gnl|CDD|220838 pfam10659, Trypan_glycop_C, Trypanosome variant surface
          glycoprotein C-terminal domain.  The trypanosome
          parasite expresses these proteins to evade the immune
          response.
          Length = 98

 Score = 27.8 bits (62), Expect = 1.4
 Identities = 10/44 (22%), Positives = 19/44 (43%)

Query: 17 KKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
          K+ + +       +    E   +K + KK+KK  ++  K EG  
Sbjct: 30 KEGKAKKNGAPVTQTAGTETTTEKCKGKKDKKDCKKGCKWEGNT 73



 Score = 27.0 bits (60), Expect = 2.4
 Identities = 11/40 (27%), Positives = 17/40 (42%)

Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKK 55
           KK      +  G E   E+ + KK++K  +K  K E   
Sbjct: 34 AKKNGAPVTQTAGTETTTEKCKGKKDKKDCKKGCKWEGNT 73



 Score = 27.0 bits (60), Expect = 2.5
 Identities = 9/44 (20%), Positives = 18/44 (40%)

Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
          K+ K +++          E   +K + KK +K  K+  K +   
Sbjct: 30 KEGKAKKNGAPVTQTAGTETTTEKCKGKKDKKDCKKGCKWEGNT 73



 Score = 27.0 bits (60), Expect = 2.6
 Identities = 15/46 (32%), Positives = 19/46 (41%)

Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKK 56
          K K  K KK         G E   E+ K K++KK  KK  + E   
Sbjct: 28 KPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKKDCKKGCKWEGNT 73



 Score = 25.5 bits (56), Expect = 7.9
 Identities = 9/42 (21%), Positives = 18/42 (42%), Gaps = 3/42 (7%)

Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
          K K+ +  + G    +    E      +K + KK +++ KK 
Sbjct: 28 KPKEGKAKKNGAPVTQTAGTE---TTTEKCKGKKDKKDCKKG 66


>gnl|CDD|215038 PLN00040, PLN00040, Protein MAK16 homolog; Provisional.
          Length = 233

 Score = 28.6 bits (64), Expect = 1.4
 Identities = 15/45 (33%), Positives = 20/45 (44%)

Query: 18  KKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
            K  E EE    EEE  + +K    KKK +   E E + E E + 
Sbjct: 186 NKVLEMEEVEEAEEELPKSDKNPNSKKKSRVHVEIEYEDEIEYKS 230


>gnl|CDD|234659 PRK00137, rplI, 50S ribosomal protein L9; Reviewed.
          Length = 147

 Score = 28.1 bits (64), Expect = 1.4
 Identities = 8/34 (23%), Positives = 14/34 (41%)

Query: 35 EEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIR 68
          E  + + E K  ++  E E   E  +  T+T   
Sbjct: 48 EARRAELEAKAAEELAEAEALAEKLEGLTVTIKA 81


>gnl|CDD|217829 pfam03985, Paf1, Paf1.  Members of this family are components of
           the RNA polymerase II associated Paf1 complex. The Paf1
           complex functions during the elongation phase of
           transcription in conjunction with Spt4-Spt5 and
           Spt16-Pob3i.
          Length = 431

 Score = 28.9 bits (65), Expect = 1.4
 Identities = 11/44 (25%), Positives = 20/44 (45%)

Query: 20  EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
           +EE+E+     EEEE E+ ++E  +  +    E     G    +
Sbjct: 375 DEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSES 418



 Score = 26.6 bits (59), Expect = 8.9
 Identities = 7/44 (15%), Positives = 19/44 (43%)

Query: 16  KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
            +++E ED E  G +  E+   +   +   + + K +++     
Sbjct: 385 HEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESASDS 428


>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
           envelope biogenesis, outer membrane].
          Length = 387

 Score = 28.8 bits (64), Expect = 1.5
 Identities = 18/63 (28%), Positives = 38/63 (60%)

Query: 2   QNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
             S  +   K ++++KKKEE+  E    ++  E+E  K+ EK++ K ++++++ +E EK+
Sbjct: 67  IQSQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQ 126

Query: 62  RTL 64
             L
Sbjct: 127 AQL 129



 Score = 28.0 bits (62), Expect = 2.8
 Identities = 16/47 (34%), Positives = 20/47 (42%)

Query: 16  KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
           K K   E ++      +  EE K K E    KKK E E K   EK +
Sbjct: 162 KLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAK 208



 Score = 26.8 bits (59), Expect = 6.6
 Identities = 14/56 (25%), Positives = 26/56 (46%), Gaps = 1/56 (1%)

Query: 8   QVSKYKRRKKKKEEEDEEGGGGEEE-EEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
           +  K  + ++K++EE       E++ + E  K K   +  K K   E KK+ E+  
Sbjct: 122 EAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKLKAAAEAKKKAEEAA 177


>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
          Length = 746

 Score = 29.0 bits (66), Expect = 1.5
 Identities = 10/29 (34%), Positives = 18/29 (62%)

Query: 34 EEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
          EEEE +KEE K+E +++ + + +E     
Sbjct: 39 EEEEARKEEAKREAEEEAKAEAEEAAAAE 67



 Score = 27.1 bits (61), Expect = 5.7
 Identities = 14/42 (33%), Positives = 19/42 (45%)

Query: 21 EEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
          EE+E      + E EEE K E ++    + EEE K E     
Sbjct: 39 EEEEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAAAA 80



 Score = 27.1 bits (61), Expect = 6.6
 Identities = 15/53 (28%), Positives = 27/53 (50%)

Query: 13  KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLT 65
           K+ K+K     + G GG+       +++  KKK+K+K  E+  +E     T+T
Sbjct: 131 KKPKRKAARGGKRGKGGKGRRRRRGRRRRRKKKKKQKPTEKIPREVVIPETIT 183



 Score = 26.7 bits (60), Expect = 8.0
 Identities = 12/44 (27%), Positives = 19/44 (43%)

Query: 20 EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
          EEE+      + E EEE K + E+    + +EE K +       
Sbjct: 39 EEEEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAAAAPA 82


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
           the C subunit of DNA polymerase delta. It carries the
           essential residues for binding to the Pol1 subunit of
           polymerase alpha, from residues 293-332, which are
           characterized by the motif D--G--VT, referred to as the
           DPIM motif. The first 160 residues of the protein form
           the minimal domain for binding to the B subunit, Cdc1,
           of polymerase delta, the final 10 C-terminal residues,
           362-372, being the DNA sliding clamp, PCNA, binding
           motif.
          Length = 427

 Score = 29.0 bits (65), Expect = 1.5
 Identities = 22/61 (36%), Positives = 30/61 (49%), Gaps = 9/61 (14%)

Query: 11  KYKRRKKKKEEEDE---------EGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
           K KR++ KK  EDE              EEEE EE +     KKE++K+E     +G +R
Sbjct: 292 KEKRKRLKKMMEDEDEDEEMEIVPESPVEEEESEEPEPPPLPKKEEEKEEVTVSPDGGRR 351

Query: 62  R 62
           R
Sbjct: 352 R 352



 Score = 27.9 bits (62), Expect = 3.5
 Identities = 17/47 (36%), Positives = 29/47 (61%)

Query: 15  RKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
           R    E+E  E  G +E+E+E+E K   ++ + +++ EEK+KE  KR
Sbjct: 251 RDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKRKR 297


>gnl|CDD|185582 PTZ00373, PTZ00373, 60S Acidic ribosomal protein P2; Provisional.
          Length = 112

 Score = 28.0 bits (62), Expect = 1.5
 Identities = 10/24 (41%), Positives = 16/24 (66%)

Query: 34  EEEEKKKEEKKKEKKKKEEEKKKE 57
                K E KK+EKK++EEE++ +
Sbjct: 82  ATAGAKAEAKKEEKKEEEEEEEDD 105


>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168).  This
          family consists of several hypothetical eukaryotic
          proteins of unknown function.
          Length = 142

 Score = 28.1 bits (63), Expect = 1.6
 Identities = 20/47 (42%), Positives = 33/47 (70%), Gaps = 4/47 (8%)

Query: 15 RKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
           K KKE EDEE     +++ EE+K+K+E+K  KK+ + +KKK+ +K+
Sbjct: 57 EKWKKETEDEEF----QQKREEKKRKDEEKTAKKRAKRQKKKQKKKK 99



 Score = 27.3 bits (61), Expect = 3.0
 Identities = 13/46 (28%), Positives = 28/46 (60%)

Query: 11  KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKK 56
           K K++K KK  + EE  G +  EE  ++++E ++ ++++  E  +K
Sbjct: 97  KKKKKKAKKGNKKEEKEGSKSSEESSDEEEEGEEDKQEEPVEIMEK 142


>gnl|CDD|218517 pfam05236, TAF4, Transcription initiation factor TFIID component
           TAF4 family.  This region of similarity is found in
           Transcription initiation factor TFIID component TAF4.
          Length = 255

 Score = 28.5 bits (64), Expect = 1.6
 Identities = 16/47 (34%), Positives = 25/47 (53%), Gaps = 4/47 (8%)

Query: 17  KKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
           K+KEEE+       E   E   +  E+ + K+K +EE+K E E+ R 
Sbjct: 121 KQKEEEERR----VERRRELGLEDPEQLRLKQKAKEEQKAESEETRH 163



 Score = 27.0 bits (60), Expect = 5.9
 Identities = 8/33 (24%), Positives = 22/33 (66%), Gaps = 2/33 (6%)

Query: 30  EEEEEEEEKKKEEKKKEK--KKKEEEKKKEGEK 60
            ++++EEE+++ E+++E   +  E+ + K+  K
Sbjct: 119 AQKQKEEEERRVERRRELGLEDPEQLRLKQKAK 151


>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT.  This family
           consists of several bacterial cobalamin biosynthesis
           (CobT) proteins. CobT is involved in the transformation
           of precorrin-3 into cobyrinic acid.
          Length = 282

 Score = 28.6 bits (64), Expect = 1.6
 Identities = 8/35 (22%), Positives = 17/35 (48%)

Query: 19  KEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEE 53
           KE+ED++ G  EE    +   ++     ++ +  E
Sbjct: 231 KEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGE 265


>gnl|CDD|225340 COG2719, SpoVR, Uncharacterized conserved protein [Function
           unknown].
          Length = 495

 Score = 28.9 bits (65), Expect = 1.7
 Identities = 18/74 (24%), Positives = 30/74 (40%), Gaps = 5/74 (6%)

Query: 8   QVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEK---KKEKKKKEEEKKKEGEK--RR 62
            V +YKR KK   EE+       EE  + +     +   KK  +   ++ ++   +    
Sbjct: 172 GVDRYKRPKKISSEEERARQEEREEYLQSQVNDLWRTLPKKPGEAAVKDARRFPSEPEED 231

Query: 63  TLTFIRIGKPALKP 76
            L FI    P L+P
Sbjct: 232 LLYFIEKNSPLLEP 245


>gnl|CDD|215656 pfam00012, HSP70, Hsp70 protein.  Hsp70 chaperones help to fold
           many proteins. Hsp70 assisted folding involves repeated
           cycles of substrate binding and release. Hsp70 activity
           is ATP dependent. Hsp70 proteins are made up of two
           regions: the amino terminus is the ATPase domain and the
           carboxyl terminus is the substrate binding region.
          Length = 598

 Score = 28.7 bits (65), Expect = 1.7
 Identities = 14/53 (26%), Positives = 23/53 (43%), Gaps = 3/53 (5%)

Query: 10  SKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKE---KKKKEEEKKKEGE 59
           +K K   K+++       G  ++E E   K  E+      K+K+  E K E E
Sbjct: 482 AKDKGTGKEQKITITASSGLSDDEIERMVKDAEEYAAEDKKRKERIEAKNEAE 534



 Score = 28.7 bits (65), Expect = 1.9
 Identities = 10/45 (22%), Positives = 22/45 (48%), Gaps = 3/45 (6%)

Query: 18  KKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
           K+++       G  ++E E   K+    E+   E++K+KE  + +
Sbjct: 489 KEQKITITASSGLSDDEIERMVKDA---EEYAAEDKKRKERIEAK 530


>gnl|CDD|240397 PTZ00390, PTZ00390, ubiquitin-conjugating enzyme; Provisional.
          Length = 152

 Score = 28.2 bits (63), Expect = 1.7
 Identities = 18/62 (29%), Positives = 32/62 (51%), Gaps = 2/62 (3%)

Query: 107 PPVVYPPSIYNHQSTLYSGTVCLNVINQEWTPLYDLSNIFETFLPQLLTYPNPTDPLNGD 166
           P V +   IY H +    G +CL+++  +W+P   +  +  + +  LL+ P P DPL+  
Sbjct: 66  PKVRFLTKIY-HPNIDKLGRICLDILKDKWSPALQIRTVLLS-IQALLSAPEPDDPLDTS 123

Query: 167 AA 168
            A
Sbjct: 124 VA 125


>gnl|CDD|162338 TIGR01398, FlhA, flagellar biosynthesis protein FlhA.  This model
           describes flagellar biosynthesis protein FlhA, one of a
           large number of genes associated with the biosynthesis
           of functional bacterial flagella. Homologs of many such
           proteins, including FlhA, function in type III protein
           secretion systems. A separate model describes InvA
           (Salmonella enterica), LcrD (Yersinia enterocolitica),
           HrcV (Xanthomonas), etc., all of which score below the
           noise cutoff for this model [Cellular processes,
           Chemotaxis and motility].
          Length = 678

 Score = 28.8 bits (65), Expect = 1.7
 Identities = 12/36 (33%), Positives = 19/36 (52%), Gaps = 4/36 (11%)

Query: 12  YKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEK 47
           Y RR+ K+EEE       E  + +EE  +EE++   
Sbjct: 308 YLRRRSKQEEEAAA----EAAKAQEEAAEEEEESIN 339



 Score = 28.4 bits (64), Expect = 2.5
 Identities = 9/24 (37%), Positives = 13/24 (54%)

Query: 36  EEKKKEEKKKEKKKKEEEKKKEGE 59
             K++EE   E  K +EE  +E E
Sbjct: 312 RSKQEEEAAAEAAKAQEEAAEEEE 335


>gnl|CDD|133433 cd05297, GH4_alpha_glucosidase_galactosidase, Glycoside Hydrolases
           Family 4; Alpha-glucosidases and alpha-galactosidases.
           Glucosidases cleave glycosidic bonds to release glucose
           from oligosaccharides. Alpha-glucosidases and
           alpha-galactosidases release alpha-D-glucose and
           alpha-D-galactose, respectively, via the hydrolysis of
           alpha-glycopyranoside bonds. Some bacteria
           simultaneously translocate and phosphorylate
           disaccharides via the phosphoenolpyruvate-dependent
           phosphotransferase system (PEP-PTS). After
           translocation, these phospho-disaccharides may be
           hydrolyzed by the GH4 glycoside hydrolases such as the
           alpha-glucosidases. Other organsisms (such as archaea
           and Thermotoga maritima) lack the PEP-PTS system, but
           have several enzymes normally associated with the
           PEP-PTS operon. Alpha-glucosidases and
           alpha-galactosidases are part of the NAD(P)-binding
           Rossmann fold superfamily, which includes a wide variety
           of protein families including the NAD(P)-binding domains
           of alcohol dehydrogenases, tyrosine-dependent
           oxidoreductases, glyceraldehyde-3-phosphate
           dehydrogenases, formate/glycerate dehydrogenases,
           siroheme synthases, 6-phosphogluconate dehydrogenases,
           aminoacid dehydrogenases, repressor rex, and NAD-binding
           potassium channel domains, among others.
          Length = 423

 Score = 28.7 bits (65), Expect = 1.7
 Identities = 14/53 (26%), Positives = 21/53 (39%), Gaps = 5/53 (9%)

Query: 12  YKRRKKKKEEEDEEGGGG-----EEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
            K  KK    E  E   G     +  E  EE+ K    +  K++ +  K+ GE
Sbjct: 259 RKETKKIWYGEFNEDEYGGRDEEQGWEWYEERLKLILAEIDKEELDPVKRSGE 311


>gnl|CDD|205480 pfam13300, DUF4078, Domain of unknown function (DUF4078).  This
          family is found from fungi to humans, but its exact
          function is not known.
          Length = 88

 Score = 27.2 bits (61), Expect = 1.7
 Identities = 19/50 (38%), Positives = 35/50 (70%), Gaps = 4/50 (8%)

Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
          + RK++ EE ++      EE E E K++EE+K+++K+  EE++K+ E+RR
Sbjct: 43 EERKEQMEELEKA----REETERERKEREERKEKRKRAIEERRKKIEERR 88


>gnl|CDD|179798 PRK04239, PRK04239, hypothetical protein; Provisional.
          Length = 110

 Score = 27.5 bits (62), Expect = 1.8
 Identities = 15/61 (24%), Positives = 30/61 (49%), Gaps = 2/61 (3%)

Query: 14 RRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIGKPA 73
          RR+K +E + +     + +EE+EE + +   + +K+    +    E R  L  I++ KP 
Sbjct: 7  RRRKLEELQKQAQEQQQAQEEQEEAQAQA--EAQKQAILRQILTPEARERLNRIKLVKPE 64

Query: 74 L 74
           
Sbjct: 65 F 65


>gnl|CDD|215456 PLN02850, PLN02850, aspartate-tRNA ligase.
          Length = 530

 Score = 28.5 bits (64), Expect = 1.9
 Identities = 9/35 (25%), Positives = 16/35 (45%)

Query: 31 EEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLT 65
           EE  E+  K+  KK   K E+ +++   K    +
Sbjct: 6  VEESGEKISKKAAKKAAAKAEKLRREATAKAAAAS 40


>gnl|CDD|218574 pfam05389, MecA, Negative regulator of genetic competence (MecA).
           This family contains several bacterial MecA proteins.
           The development of competence in Bacillus subtilis is
           regulated by growth conditions and several regulatory
           genes. In complex media competence development is poor,
           and there is little or no expression of late competence
           genes. Mec mutations permit competence development and
           late competence gene expression in complex media,
           bypassing the requirements for many of the competence
           regulatory genes. The mecA gene product acts negatively
           in the development of competence. Null mutations in mecA
           allow expression of a late competence gene comG, under
           conditions where it is not normally expressed, including
           in complex media and in cells mutant for several
           competence regulatory genes. Overexpression of MecA
           inhibits comG transcription.
          Length = 212

 Score = 28.3 bits (64), Expect = 1.9
 Identities = 11/49 (22%), Positives = 23/49 (46%)

Query: 19  KEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFI 67
           K + D+     E+E+E+     E+K ++K   E E   +  K  +  ++
Sbjct: 77  KSDSDDGLNLEEDEDEDISDLLEQKFRDKGLDEMEDTLDDSKEESTEYV 125


>gnl|CDD|218328 pfam04921, XAP5, XAP5, circadian clock regulator.  This protein
          is found in a wide range of eukaryotes. It is a nuclear
          protein and is suggested to be DNA binding. In plants,
          this family is essential for correct circadian clock
          functioning by acting as a light-quality regulator
          coordinating the activities of blue and red light
          signalling pathways during plant growth - inhibiting
          growth in red light but promoting growth in blue light.
          Length = 233

 Score = 28.5 bits (64), Expect = 1.9
 Identities = 16/51 (31%), Positives = 28/51 (54%), Gaps = 5/51 (9%)

Query: 13 KRRKKKKE-----EEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEG 58
          K++KKKK+      +D+E    +E E+E++  KE  + ++      KKK G
Sbjct: 1  KKKKKKKKSKLSFGDDDEEEDEDEGEDEKKVPKESSEPDEANVNPNKKKIG 51


>gnl|CDD|225651 COG3109, ProQ, Activator of osmoprotectant transporter ProP [Signal
           transduction mechanisms].
          Length = 208

 Score = 28.3 bits (63), Expect = 2.0
 Identities = 16/67 (23%), Positives = 23/67 (34%), Gaps = 2/67 (2%)

Query: 30  EEEEEEEEKKKEEKKKE--KKKKEEEKKKEGEKRRTLTFIRIGKPALKPRPTRCVEGEGA 87
           EE  E   K+  E K     ++ E++ KK  E           +PA   RP      E  
Sbjct: 93  EEHVEHARKQLAEAKARVQAQRAEQQAKKREEAPAAGEKPTAERPATAARPKPKAPREEV 152

Query: 88  PTRTLSV 94
            T    +
Sbjct: 153 HTPVSDI 159


>gnl|CDD|221893 pfam13010, pRN1_helical, Primase helical domain.  This alpha
          helical domain is found in a set of bacterial plasmid
          replication proteins. The domain is found to the
          C-terminus of the primase/polymerase domain. Mutants of
          this domain are defective in template binding,
          dinucleotide formation and conformation change prior to
          DNA extension.
          Length = 135

 Score = 27.9 bits (62), Expect = 2.1
 Identities = 13/35 (37%), Positives = 20/35 (57%)

Query: 28 GGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
           G++EEE+ E+  E+ K+E  K +  K K  E  R
Sbjct: 1  EGKKEEEDSEEDFEKLKEEMAKYDRFKGKTVEAIR 35


>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon. 
          Length = 431

 Score = 28.5 bits (63), Expect = 2.2
 Identities = 17/51 (33%), Positives = 33/51 (64%)

Query: 11  KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
           K KR +++K  E+EE    +EE + + +++EEK++ K++ E  + +  EKR
Sbjct: 220 KKKREERRKVLEEEEQRRKQEEADRKSREEEEKRRLKEEIERRRAEAAEKR 270



 Score = 27.3 bits (60), Expect = 4.5
 Identities = 15/48 (31%), Positives = 24/48 (50%)

Query: 14  RRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
             +  +EEE EE     EE EE E   + ++K   +  EE +KE ++ 
Sbjct: 107 ENETVEEEEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKEP 154



 Score = 27.3 bits (60), Expect = 4.6
 Identities = 13/55 (23%), Positives = 34/55 (61%)

Query: 3   NSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
            + T +  + +  ++++EE +E  G  + E++ + +  EE +KE+K+ E E++++
Sbjct: 107 ENETVEEEEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKEPEPEEEEK 161



 Score = 27.3 bits (60), Expect = 5.2
 Identities = 16/47 (34%), Positives = 25/47 (53%)

Query: 16  KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
             ++EE++E     EE EE E   K E+K + +  EE +K+E E   
Sbjct: 110 TVEEEEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKEPEP 156



 Score = 26.9 bits (59), Expect = 6.6
 Identities = 17/73 (23%), Positives = 35/73 (47%)

Query: 3   NSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
           N +  +  K + R++++E E+ EG    E++ +    +E +K+EK+ + EE++K      
Sbjct: 108 NETVEEEEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKEPEPEEEEKPKRGSL 167

Query: 63  TLTFIRIGKPALK 75
                      LK
Sbjct: 168 EENNGEFMTHKLK 180


>gnl|CDD|237875 PRK14974, PRK14974, cell division protein FtsY; Provisional.
          Length = 336

 Score = 28.4 bits (64), Expect = 2.3
 Identities = 15/45 (33%), Positives = 28/45 (62%)

Query: 15 RKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
           +K +E+ +EE      E EEEE++++E++K++K    +K K  E
Sbjct: 13 VEKVEEKIEEEEEEEAPEAEEEEEEEDEEEKKEKPGFFDKAKITE 57



 Score = 27.2 bits (61), Expect = 5.2
 Identities = 14/38 (36%), Positives = 26/38 (68%)

Query: 30 EEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFI 67
          EEEEEEE  + EE+++E+ ++E+++K     +  +T I
Sbjct: 21 EEEEEEEAPEAEEEEEEEDEEEKKEKPGFFDKAKITEI 58


>gnl|CDD|222160 pfam13476, AAA_23, AAA domain. 
          Length = 204

 Score = 27.9 bits (62), Expect = 2.3
 Identities = 20/48 (41%), Positives = 33/48 (68%)

Query: 10  SKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
            K+KR++KK+  E+ E    E E+E++  +K  ++KEKKK+ EE K+E
Sbjct: 148 LKFKRKEKKERLEELEKELEELEDEKDLLEKLLEEKEKKKELEELKEE 195


>gnl|CDD|223014 PHA03231, PHA03231, glycoprotein BALF4; Provisional.
          Length = 829

 Score = 28.4 bits (64), Expect = 2.4
 Identities = 10/70 (14%), Positives = 28/70 (40%), Gaps = 7/70 (10%)

Query: 6   TTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKK-------KEEKKKEKKKKEEEKKKEG 58
           TT+    +   +    ++ E    +E + EE ++          +++++KK +++     
Sbjct: 742 TTKALPSEATGEYGNGDESEESDFDEYKLEEAREMLLAMHLLSAEERQEKKAKKKNSGPA 801

Query: 59  EKRRTLTFIR 68
                L  +R
Sbjct: 802 LLSSALDRLR 811


>gnl|CDD|221654 pfam12589, WBS_methylT, Methyltransferase involved in
          Williams-Beuren syndrome.  This domain family is found
          in eukaryotes, and is typically between 72 and 83 amino
          acids in length. The family is found in association
          with pfam08241. This family is made up of
          S-adenosylmethionine-dependent methyltransferases. The
          proteins are deleted in Williams-Beuren syndrome (WBS),
          a complex developmental disorder with multisystemic
          manifestations including supravalvular aortic stenosis
          (SVAS) and a specific cognitive phenotype.
          Length = 85

 Score = 26.9 bits (60), Expect = 2.4
 Identities = 12/42 (28%), Positives = 22/42 (52%)

Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
              EE +E+    +  +     ++  +KK+KKKK ++K KE
Sbjct: 14 NGLGEEGEEDDEQIDASKVRRISQRNRRKKKKKKKLKKKSKE 55


>gnl|CDD|225826 COG3288, PntA, NAD/NADP transhydrogenase alpha subunit [Energy
           production and conversion].
          Length = 356

 Score = 28.1 bits (63), Expect = 2.5
 Identities = 13/37 (35%), Positives = 18/37 (48%), Gaps = 1/37 (2%)

Query: 16  KKKKEEEDEEGGGG-EEEEEEEEKKKEEKKKEKKKKE 51
            K    EDEE  GG  +E  EE   K+ +   ++ KE
Sbjct: 208 AKFLAVEDEESAGGYAKEMSEEFIAKQAELVAEQAKE 244


>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II).  Bone
          sialoprotein (BSP) is a major structural protein of the
          bone matrix that is specifically expressed by
          fully-differentiated osteoblasts. The expression of
          bone sialoprotein (BSP) is normally restricted to
          mineralised connective tissues of bones and teeth where
          it has been associated with mineral crystal formation.
          However, it has been found that ectopic expression of
          BSP occurs in various lesions, including oral and
          extraoral carcinomas, in which it has been associated
          with the formation of microcrystalline deposits and the
          metastasis of cancer cells to bone.
          Length = 291

 Score = 28.1 bits (62), Expect = 2.5
 Identities = 14/49 (28%), Positives = 22/49 (44%)

Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
          ++  +      E+   G   EEE EEE   EE+  E     E+++ E E
Sbjct: 41 RFSVQSGSDSSEENGDGDSSEEEGEEETSNEEENNEDSDGNEDEEAEAE 89


>gnl|CDD|217451 pfam03249, TSA, Type specific antigen.  There are several antigenic
           variants in Rickettsia tsutsugamushi, and a
           type-specific antigen (TSA) of 56-kilodaltons located on
           the rickettsial surface is responsible for the
           variation. TSA proteins are probably integral membrane
           proteins.
          Length = 502

 Score = 28.3 bits (63), Expect = 2.5
 Identities = 11/35 (31%), Positives = 18/35 (51%)

Query: 32  EEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTF 66
           ++EE+   +     +KK+   EK KEG K +   F
Sbjct: 369 QQEEDAGNQGGGDCKKKQGASEKSKEGGKGKETEF 403



 Score = 26.8 bits (59), Expect = 7.8
 Identities = 11/45 (24%), Positives = 20/45 (44%), Gaps = 2/45 (4%)

Query: 15  RKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
           +K  ++   ++      +   + KKK+     +K KE  K KE E
Sbjct: 360 KKAMEKLAAQQEEDAGNQGGGDCKKKQ--GASEKSKEGGKGKETE 402


>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein. 
          Length = 198

 Score = 27.8 bits (62), Expect = 2.5
 Identities = 11/59 (18%), Positives = 27/59 (45%)

Query: 6   TTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTL 64
           TT       +K KK+++++     ++ E +++K +E +      +E    +E     +L
Sbjct: 57  TTPRKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENENDPDNNEESGDSQESASANSL 115


>gnl|CDD|217553 pfam03430, TATR, Trans-activating transcriptional regulator.  This
           family of trans-activating transcriptional regulator
           (TATR), also known as intermediate early protein 1, are
           common to the Nucleopolyhedroviruses.
          Length = 571

 Score = 28.3 bits (63), Expect = 2.5
 Identities = 11/59 (18%), Positives = 25/59 (42%), Gaps = 4/59 (6%)

Query: 4   SSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
           S T QV+  KR+  + + + +      +  + ++   + KK++K KK   +        
Sbjct: 117 SETKQVTVSKRKGSELDSDSDSS----DSSKGKKLVNKPKKRQKYKKATIQSPTSLTEE 171


>gnl|CDD|227701 COG5414, COG5414, TATA-binding protein-associated factor
           [Transcription].
          Length = 392

 Score = 28.1 bits (62), Expect = 2.6
 Identities = 12/53 (22%), Positives = 30/53 (56%)

Query: 5   STTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
              +VS+  + ++++E E+ E    E + +  ++  EEK+++ + +E E+  E
Sbjct: 285 ENKEVSEGDKEQQQEEVENAEAHKEEVQSDRPDEIGEEKEEDDENEENERHTE 337


>gnl|CDD|135898 PRK06397, PRK06397, V-type ATP synthase subunit H; Validated.
          Length = 111

 Score = 27.2 bits (60), Expect = 2.6
 Identities = 14/44 (31%), Positives = 22/44 (50%)

Query: 18 KKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
          K E+E+E      + EE+ +K +EE          E +KE EK+
Sbjct: 30 KNEQENEIKEAKSKYEEKAKKTEEESLNMYNAALMEARKEAEKK 73


>gnl|CDD|224667 COG1753, COG1753, Predicted antotoxin, copG family [Signal
          transduction mechanisms].
          Length = 74

 Score = 26.3 bits (58), Expect = 2.7
 Identities = 14/39 (35%), Positives = 20/39 (51%)

Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKK 49
          +   +K+   E      G   EEE E+ KKEEK+ EK +
Sbjct: 32 ELIEKKRGNLEVLMRAFGTLSEEEVEKIKKEEKEAEKMR 70


>gnl|CDD|218437 pfam05107, DUF694, Family of unknown function (DUF694).  Family of
           hypothetical bacterial proteins.
          Length = 278

 Score = 28.0 bits (63), Expect = 2.7
 Identities = 19/47 (40%), Positives = 29/47 (61%)

Query: 14  RRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
           + K+K  +E  E     +  EE  K+K++KKK+KKKKEE ++   EK
Sbjct: 63  QEKEKLNDEHGELYERADRAEELLKEKKKKKKKKKKKEELREWMCEK 109


>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain.  Axonemal
           dynein light chain proteins play a dynamic role in
           flagellar and cilia motility. Eukaryotic cilia and
           flagella are complex organelles consisting of a core
           structure, the axoneme, which is composed of nine
           microtubule doublets forming a cylinder that surrounds a
           pair of central singlet microtubules. This
           ultra-structural arrangement seems to be one of the most
           stable micro-tubular assemblies known and is responsible
           for the flagellar and ciliary movement of a large number
           of organisms ranging from protozoan to mammals. This
           light chain interacts directly with the N-terminal half
           of the heavy chains.
          Length = 189

 Score = 27.6 bits (62), Expect = 2.7
 Identities = 18/53 (33%), Positives = 30/53 (56%), Gaps = 7/53 (13%)

Query: 10  SKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
           S+ ++  KK EEE       +EE E+   + E K +  +K+EEE+++  EKR 
Sbjct: 123 SELEQEIKKLEEE-------KEELEKRVAELEAKLEAIEKREEEERQIEEKRH 168



 Score = 26.4 bits (59), Expect = 8.2
 Identities = 18/47 (38%), Positives = 28/47 (59%), Gaps = 5/47 (10%)

Query: 18  KKEEEDEEGGGGEEEEEEEEKKKEEKKKE--KKKKEEEKKKEGEKRR 62
           +K  + E+G   + E E+E KK EE+K+E  K+  E E K E  ++R
Sbjct: 113 RKALQAEQG---KSELEQEIKKLEEEKEELEKRVAELEAKLEAIEKR 156


>gnl|CDD|241486 cd13332, FERM_C_JAK1, Janus kinase 1 FERM domain C-lobe.  JAK1 is a
           tyrosine kinase protein essential in signaling type I
           and type II cytokines. It interacts with the gamma chain
           of type I cytokine receptors to elicit signals from the
           IL-2 receptor family, the IL-4 receptor family, the
           gp130 receptor family, ciliary neurotrophic factor
           receptor (CNTF-R), neurotrophin-1 receptor (NNT-1R) and
           Leptin-R). It also is involved in transducing a signal
           by type I (IFN-alpha/beta) and type II (IFN-gamma)
           interferons, and members of the IL-10 family via type II
           cytokine receptors. JAK (also called Just Another
           Kinase) is a family of intracellular, non-receptor
           tyrosine kinases that transduce cytokine-mediated
           signals via the JAK-STAT pathway. The JAK family in
           mammals consists of 4 members: JAK1, JAK2, JAK3 and
           TYK2. JAKs are composed of seven JAK homology (JH)
           domains (JH1-JH7) . The C-terminal JH1 domain is the
           main catalytic domain, followed by JH2, which is often
           referred to as a pseudokinase domain, followed by
           JH3-JH4 which is homologous to the SH2 domain, and
           lastly JH5-JH7 which is a FERM domain.  Named after
           Janus, the two-faced Roman god of doorways, JAKs possess
           two near-identical phosphate-transferring domains; one
           which displays the kinase activity (JH1), while the
           other negatively regulates the kinase activity of the
           first (JH2). The FERM domain has a cloverleaf tripart
           structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
           C-lobe/A-lobe,A-lobe, B-lobe, C-lobe/F1, F2, F3). The
           C-lobe/F3 within the FERM domain is part of the PH
           domain family. The FERM domain is found in the
           cytoskeletal-associated proteins such as ezrin, moesin,
           radixin, 4.1R, and merlin. These proteins provide a link
           between the membrane and cytoskeleton and are involved
           in signal transduction pathways. The FERM domain is also
           found in protein tyrosine phosphatases (PTPs) , the
           tyrosine kinases FAK and JAK, in addition to other
           proteins involved in signaling. This domain is
           structurally similar to the PH and PTB domains and
           consequently is capable of binding to both peptides and
           phospholipids at different sites.
          Length = 198

 Score = 27.9 bits (62), Expect = 2.8
 Identities = 13/26 (50%), Positives = 19/26 (73%)

Query: 34  EEEEKKKEEKKKEKKKKEEEKKKEGE 59
           E+++K K +K K K KK+E+KKK  E
Sbjct: 90  EKKKKGKSKKNKLKGKKDEDKKKARE 115


>gnl|CDD|100111 cd05833, Ribosomal_P2, Ribosomal protein P2. This subfamily
          represents the eukaryotic large ribosomal protein P2.
          Eukaryotic P1 and P2 are functionally equivalent to the
          bacterial protein L7/L12, but are not homologous to
          L7/L12. P2 is located in the L12 stalk, with proteins
          P1, P0, L11, and 28S rRNA. P1 and P2 are the only
          proteins in the ribosome to occur as multimers, always
          appearing as sets of heterodimers. Recent data indicate
          that eukaryotes have four copies (two heterodimers),
          while most archaeal species contain six copies of L12p
          (three homodimers). Bacteria may have four or six
          copies of L7/L12 (two or three homodimers) depending on
          the species. Experiments using S. cerevisiae P1 and P2
          indicate that P1 proteins are positioned more
          internally with limited reactivity in the C-terminal
          domains, while P2 proteins seem to be more externally
          located and are more likely to interact with other
          cellular components. In lower eukaryotes, P1 and P2 are
          further subdivided into P1A, P1B, P2A, and P2B, which
          form P1A/P2B and P1B/P2A heterodimers. Some plants have
          a third P-protein, called P3, which is not homologous
          to P1 and P2. In humans, P1 and P2 are strongly
          autoimmunogenic. They play a significant role in the
          etiology and pathogenesis of systemic lupus erythema
          (SLE). In addition, the ribosome-inactivating protein
          trichosanthin (TCS) interacts with human P0, P1, and
          P2, with its primary binding site in the C-terminal
          region of P2. TCS inactivates the ribosome by
          depurinating a specific adenine in the sarcin-ricin
          loop of 28S rRNA.
          Length = 109

 Score = 26.8 bits (60), Expect = 2.8
 Identities = 6/21 (28%), Positives = 11/21 (52%)

Query: 26 GGGGEEEEEEEEKKKEEKKKE 46
                   ++E+KKEE ++E
Sbjct: 78 AAAAAAAAAKKEEKKEESEEE 98


>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM).
          This family consists of several Plasmodium falciparum
          SPAM (secreted polymorphic antigen associated with
          merozoites) proteins. Variation among SPAM alleles is
          the result of deletions and amino acid substitutions in
          non-repetitive sequences within and flanking the
          alanine heptad-repeat domain. Heptad repeats in which
          the a and d position contain hydrophobic residues
          generate amphipathic alpha-helices which give rise to
          helical bundles or coiled-coil structures in proteins.
          SPAM is an example of a P. falciparum antigen in which
          a repetitive sequence has features characteristic of a
          well-defined structural element.
          Length = 164

 Score = 27.5 bits (61), Expect = 2.9
 Identities = 11/50 (22%), Positives = 26/50 (52%)

Query: 2  QNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKE 51
          Q     +  + +   ++ E+ ++E    E+EEEEEE +++    +  +K+
Sbjct: 48 QEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDLKDIEKK 97



 Score = 26.7 bits (59), Expect = 5.6
 Identities = 15/51 (29%), Positives = 33/51 (64%)

Query: 14 RRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTL 64
          + +K++++E+EE    EE EE E+ + EE+  E +++EEE +++    + +
Sbjct: 44 KDEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDLKDI 94



 Score = 26.4 bits (58), Expect = 6.5
 Identities = 13/47 (27%), Positives = 31/47 (65%)

Query: 17 KKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
          +++EEEDEE     E+ E+EE+  E++++E++ +E+    +  +++ 
Sbjct: 52 EEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDLKDIEKKN 98


>gnl|CDD|219405 pfam07418, PCEMA1, Acidic phosphoprotein precursor PCEMA1.  This
           family consists of several acidic phosphoprotein
           precursor PCEMA1 sequences which appear to be found
           exclusively in Plasmodium chabaudi. PCEMA1 is an antigen
           that is associated with the membrane of the infected
           erythrocyte throughout the entire intraerythrocytic
           cycle. The exact function of this family is unclear.
          Length = 286

 Score = 27.9 bits (62), Expect = 2.9
 Identities = 8/60 (13%), Positives = 26/60 (43%)

Query: 1   LQNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
           L+    + +   +   + KE+   E    + ++    + + E + E + +EE  +++  +
Sbjct: 222 LEAEVISYLKDGENDNEVKEKIRREYREWKGDKANTNETEIEDESEDEYEEEAGEEQENE 281


>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
           [Transcription / Chromatin structure and dynamics].
          Length = 279

 Score = 28.0 bits (62), Expect = 2.9
 Identities = 12/37 (32%), Positives = 25/37 (67%)

Query: 17  KKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEE 53
           +++EEE E    GE+  + E ++ ++K+ E+++ EEE
Sbjct: 210 EEEEEEAEGSDDGEDVVDYEGERIDKKQGEEEEMEEE 246



 Score = 26.9 bits (59), Expect = 6.7
 Identities = 14/45 (31%), Positives = 23/45 (51%)

Query: 18  KKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
           ++EEEDEE G     E   E  +EE+++ +   + E   + E  R
Sbjct: 188 REEEEDEEVGSDSYGEGNRELNEEEEEEAEGSDDGEDVVDYEGER 232


>gnl|CDD|219951 pfam08644, SPT16, FACT complex subunit (SPT16/CDC68).  Proteins in
           this family are subunits the FACT complex. The FACT
           complex plays a role in transcription initiation and
           promotes binding of TATA-binding protein (TBP) to a TATA
           box in chromatin.
          Length = 152

 Score = 27.6 bits (62), Expect = 2.9
 Identities = 15/45 (33%), Positives = 22/45 (48%), Gaps = 7/45 (15%)

Query: 39  KKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIGKPA----LKPRPT 79
           +K  KK+E ++KE+    E EK   L   R  +P     +  RPT
Sbjct: 91  QKRVKKRETERKEKADVVEQEK---LILNRNKRPPRLKDVYIRPT 132


>gnl|CDD|218337 pfam04939, RRS1, Ribosome biogenesis regulatory protein (RRS1).
           This family consists of several eukaryotic ribosome
           biogenesis regulatory (RRS1) proteins. RRS1 is a nuclear
           protein that is essential for the maturation of 25 S
           rRNA and the 60 S ribosomal subunit assembly in
           Saccharomyces cerevisiae.
          Length = 164

 Score = 27.6 bits (62), Expect = 3.0
 Identities = 13/55 (23%), Positives = 30/55 (54%), Gaps = 3/55 (5%)

Query: 9   VSKYKRRKKKKEEEDE---EGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
           V ++  ++  K+ E +   E    ++ +E+   KK  +KKE+  K E+++ + +K
Sbjct: 110 VPRWGYKRANKDLEKDWIIEVKPNDDPKEDPFAKKRREKKERVAKNEKRELKNKK 164


>gnl|CDD|218482 pfam05178, Kri1, KRI1-like family.  The yeast member of this
          family (Kri1p) is found to be required for 40S ribosome
          biogenesis in the nucleolus.
          Length = 99

 Score = 26.8 bits (60), Expect = 3.0
 Identities = 11/24 (45%), Positives = 17/24 (70%)

Query: 30 EEEEEEEEKKKEEKKKEKKKKEEE 53
          E +EEE+ +++EE K+ K  K EE
Sbjct: 2  ERKEEEKAQREEELKRLKNLKREE 25


>gnl|CDD|218744 pfam05781, MRVI1, MRVI1 protein.  This family consists of mammalian
           MRVI1 proteins which are related to the
           lymphoid-restricted membrane protein (JAW1) and the IP3
           receptor associated cGMP kinase substrates A and B
           (IRAGA and IRAGB). The function of MRVI1 is unknown
           although mutations in the Mrvi1 gene induces myeloid
           leukaemia by altering the expression of a gene important
           for myeloid cell growth and/or differentiation so it has
           been speculated that Mrvi1 is a tumour suppressor gene.
           IRAG is very similar in sequence to MRVI1 and is an
           essential NO/cGKI-dependent regulator of IP3-induced
           calcium release. Activation of cGKI decreases
           IP3-stimulated elevations in intracellular calcium,
           induces smooth muscle relaxation and contributes to the
           antiproliferative and pro-apoptotic effects of NO/cGMP.
           Jaw1 is a member of a class of proteins with
           COOH-terminal hydrophobic membrane anchors and is
           structurally similar to proteins involved in vesicle
           targeting and fusion. This suggests that the function
           and/or the structure of the ER in lymphocytes may be
           modified by lymphoid-restricted resident ER proteins.
          Length = 538

 Score = 28.1 bits (62), Expect = 3.2
 Identities = 11/53 (20%), Positives = 26/53 (49%)

Query: 9   VSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
           +S Y     ++E++ ++     E E EE  ++  K    ++  E ++ + E+R
Sbjct: 406 ISTYSWADAEEEKKTKKLQDLREPEGEEAVERTRKPSLSEEVAETEEWDKEQR 458


>gnl|CDD|227084 COG4741, COG4741, Predicted secreted endonuclease distantly
          related to archaeal Holliday junction resolvase
          [Nucleotide transport and metabolism].
          Length = 175

 Score = 27.4 bits (61), Expect = 3.2
 Identities = 15/50 (30%), Positives = 23/50 (46%)

Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
          + + + K  E EE     E E    + +  K++E K KE  +KK  E R 
Sbjct: 28 QGKVESKARELEETLQKAERERLVNEAQARKEEEWKLKEWIEKKIEEARE 77


>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein.  This is a family of
           fungal and plant proteins and contains many hypothetical
           proteins. VID27 is a cytoplasmic protein that plays a
           potential role in vacuolar protein degradation.
          Length = 794

 Score = 28.2 bits (63), Expect = 3.2
 Identities = 16/44 (36%), Positives = 24/44 (54%)

Query: 20  EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
           E ED      +EEEE+EE+++EE + E   KE    +E E+   
Sbjct: 380 EIEDANTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDV 423


>gnl|CDD|227666 COG5374, COG5374, Uncharacterized conserved protein [Function
           unknown].
          Length = 192

 Score = 27.5 bits (61), Expect = 3.2
 Identities = 15/60 (25%), Positives = 22/60 (36%), Gaps = 13/60 (21%)

Query: 11  KYKRRKKKKEEED----EEGGGGEEEEEEEEKKKEEKKK---------EKKKKEEEKKKE 57
           K   +  K E +           +   E  +K +EE  K         E+ +KE  KKKE
Sbjct: 133 KKGGKIDKMEADSTDLKARLRKAQILLEGLQKNQEELFKLLDKYNELREQVQKESSKKKE 192


>gnl|CDD|239286 cd02988, Phd_like_VIAF, Phosducin (Phd)-like family, Viral
          inhibitor of apoptosis (IAP)-associated factor (VIAF)
          subfamily; VIAF is a Phd-like protein that functions in
          caspase activation during apoptosis. It was identified
          as an IAP binding protein through a screen of a human
          B-cell library using a prototype IAP. VIAF lacks a
          consensus IAP binding motif and while it does not
          function as an IAP antagonist, it still plays a
          regulatory role in the complete activation of caspases.
          VIAF itself is a substrate for IAP-mediated
          ubiquitination, suggesting that it may be a target of
          IAPs in the prevention of cell death. The similarity of
          VIAF to Phd points to a potential role distinct from
          apoptosis regulation. Phd functions as a cytosolic
          regulator of G protein by specifically binding to G
          protein betagamma (Gbg)-subunits. The C-terminal domain
          of Phd adopts a thioredoxin fold, but it does not
          contain a CXXC motif. Phd interacts with G protein beta
          mostly through the N-terminal helical domain.
          Length = 192

 Score = 27.6 bits (62), Expect = 3.3
 Identities = 14/47 (29%), Positives = 22/47 (46%), Gaps = 4/47 (8%)

Query: 17 KKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKK----KKEEEKKKEGE 59
          +   E+       EE +EEE+ +  E+ + K+    K   EK K GE
Sbjct: 37 ENALEKKLLDELDEELDEEEDDRFLEEYRRKRLAEMKALAEKSKFGE 83


>gnl|CDD|148679 pfam07218, RAP1, Rhoptry-associated protein 1 (RAP-1).  This family
           consists of several rhoptry-associated protein 1 (RAP-1)
           sequences which appear to be specific to Plasmodium
           falciparum.
          Length = 790

 Score = 28.1 bits (62), Expect = 3.3
 Identities = 19/46 (41%), Positives = 26/46 (56%)

Query: 19  KEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTL 64
           K  E+EE    EE EE EE KKE  +K   + +EEK K  E+ + +
Sbjct: 239 KVAEEEEFELDEEHEEAEEDKKEALEKIGAEGDEEKFKFDEEIKFI 284


>gnl|CDD|193409 pfam12936, Kri1_C, KRI1-like family C-terminal.  The yeast member
          of this family (Kri1p) is found to be required for 40S
          ribosome biogenesis in the nucleolus. This is the
          C-terminal domain of the family.
          Length = 93

 Score = 26.4 bits (59), Expect = 3.4
 Identities = 10/24 (41%), Positives = 18/24 (75%)

Query: 34 EEEEKKKEEKKKEKKKKEEEKKKE 57
          ++EE++K++KK  KK +  E KK+
Sbjct: 68 DKEERRKDKKKYGKKARLREWKKK 91


>gnl|CDD|203444 pfam06424, PRP1_N, PRP1 splicing factor, N-terminal.  This domain
          is specific to the N-terminal part of the prp1 splicing
          factor, which is involved in mRNA splicing (and
          possibly also poly(A)+ RNA nuclear export and cell
          cycle progression). This domain is specific to the N
          terminus of the RNA splicing factor encoded by prp1. It
          is involved in mRNA splicing and possibly also
          poly(A)and RNA nuclear export and cell cycle
          progression.
          Length = 131

 Score = 26.9 bits (60), Expect = 3.4
 Identities = 13/47 (27%), Positives = 26/47 (55%), Gaps = 2/47 (4%)

Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
            K ++EDEE       E  +E+  E +KK +++KE+E+ ++  +  
Sbjct: 53 DGKYDDEDEEAD--RIYESIDERMDERRKKRREQKEKEEIEKYREEN 97


>gnl|CDD|216057 pfam00680, RdRP_1, RNA dependent RNA polymerase. 
          Length = 479

 Score = 28.0 bits (63), Expect = 3.4
 Identities = 15/62 (24%), Positives = 20/62 (32%), Gaps = 15/62 (24%)

Query: 16  KKKKEEEDEEGGGGEEEEEEEEKKKEEKKK------------EKKKKEEEKKKEGEKRRT 63
           KK+    DE G G   +      ++ E K             E +  E   K E  K R 
Sbjct: 120 KKRDLFNDETGDGKLAKVLYSCLREYEGKYGPALVWITCLKDELRPLE---KVEAGKTRL 176

Query: 64  LT 65
            T
Sbjct: 177 FT 178


>gnl|CDD|218896 pfam06098, Radial_spoke_3, Radial spoke protein 3.  This family
           consists of several radial spoke protein 3 (RSP3)
           sequences. Eukaryotic cilia and flagella present in
           diverse types of cells perform motile, sensory, and
           developmental functions in organisms from protists to
           humans. They are centred by precisely organised,
           microtubule-based structures, the axonemes. The axoneme
           consists of two central singlet microtubules, called the
           central pair, and nine outer doublet microtubules. These
           structures are well-conserved during evolution. The
           outer doublet microtubules, each composed of A and B
           sub-fibres, are connected to each other by nexin links,
           while the central pair is held at the centre of the
           axoneme by radial spokes. The radial spokes are T-shaped
           structures extending from the A-tubule of each outer
           doublet microtubule to the centre of the axoneme. Radial
           spoke protein 3 (RSP3), is present at the proximal end
           of the spoke stalk and helps in anchoring the radial
           spoke to the outer doublet. It is thought that radial
           spokes regulate the activity of inner arm dynein through
           protein phosphorylation and dephosphorylation.
          Length = 288

 Score = 27.7 bits (62), Expect = 3.4
 Identities = 12/33 (36%), Positives = 22/33 (66%)

Query: 33  EEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLT 65
           EE E +++EEK++ KK+ +E K++E E    + 
Sbjct: 179 EEAERRRREEKERRKKQDKERKQREKETAEKIA 211


>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
          Length = 646

 Score = 28.0 bits (63), Expect = 3.5
 Identities = 16/55 (29%), Positives = 29/55 (52%), Gaps = 1/55 (1%)

Query: 9   VSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKK-EGEKRR 62
             + +++K   +  +E     EEE E+ EK+ +E ++E  + E E K+ E E  R
Sbjct: 71  PLREEKKKVSVKSLEELIKDVEEELEKIEKEIKELEEEISELENEIKELEQEIER 125


>gnl|CDD|178635 PLN03086, PLN03086, PRLI-interacting factor K; Provisional.
          Length = 567

 Score = 27.9 bits (62), Expect = 3.5
 Identities = 10/32 (31%), Positives = 23/32 (71%)

Query: 31 EEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
          E+ E E+++++++ K K ++E + K+E  K+R
Sbjct: 10 EKLEREQRERKQRAKLKLERERKAKEEAAKQR 41


>gnl|CDD|218738 pfam05766, NinG, Bacteriophage Lambda NinG protein.  NinG or Rap
          is involved in recombination. Rap (recombination adept
          with plasmid) increases lambda-by-plasmid recombination
          catalyzed by Escherichia coli's RecBCD pathway.
          Length = 188

 Score = 27.3 bits (61), Expect = 3.6
 Identities = 13/41 (31%), Positives = 19/41 (46%), Gaps = 7/41 (17%)

Query: 38 KKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIGKPALKPRP 78
            K EK +EKK+K E + +  E       ++  K  LK R 
Sbjct: 33 ALKREKAQEKKRKAEAQAERRE-------LKARKEKLKTRS 66


>gnl|CDD|216792 pfam01927, DUF82, Protein of unknown function DUF82.  This
           prokaryotic protein family has no known function. The
           protein contains four conserved cysteines that may be
           involved in metal binding or disulphide bridges.
          Length = 146

 Score = 27.2 bits (61), Expect = 3.7
 Identities = 17/61 (27%), Positives = 25/61 (40%), Gaps = 11/61 (18%)

Query: 71  KPALKPRPTRCVEGEGAPTRTLSVQIPLHRVHEQGVPPVVYPPSIYNHQSTLYSGTVCLN 130
           K +LKP  +RC++  G          PL  V ++ V   V PP +Y      +    C  
Sbjct: 83  KLSLKPEFSRCLKCNG----------PLEPVSKEEVAGRV-PPYVYERYDEFWRCPKCGR 131

Query: 131 V 131
           V
Sbjct: 132 V 132


>gnl|CDD|220102 pfam09073, BUD22, BUD22.  BUD22 has been shown in yeast to be a
           nuclear protein involved in bud-site selection. It plays
           a role in positioning the proximal bud pole signal. More
           recently it has been shown to be involved in ribosome
           biogenesis.
          Length = 424

 Score = 27.9 bits (62), Expect = 3.7
 Identities = 17/48 (35%), Positives = 26/48 (54%)

Query: 13  KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
           K+ KKK+ +E  +    EE E E+E K EE  ++    EEE+  + E 
Sbjct: 156 KKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSED 203


>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
           partitioning / Cytoskeleton].
          Length = 373

 Score = 27.7 bits (62), Expect = 3.8
 Identities = 18/57 (31%), Positives = 29/57 (50%), Gaps = 1/57 (1%)

Query: 5   STTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
            TT+   Y+  + +K    +  G    +E  E +  EE+++ KKK   EK +E EKR
Sbjct: 277 ETTENLLYENYRTEKLSGLKNSGEPSLKEIHEARLNEEERELKKKF-TEKIREKEKR 332


>gnl|CDD|217049 pfam02459, Adeno_terminal, Adenoviral DNA terminal protein.  This
           protein is covalently attached to the terminii of
           replicating DNA in vivo.
          Length = 548

 Score = 27.7 bits (62), Expect = 3.8
 Identities = 15/41 (36%), Positives = 26/41 (63%)

Query: 13  KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEE 53
           +RR++      E     EEEEE  E+++EE+++E++  EEE
Sbjct: 293 RRRRRPPPSPPEPEEEEEEEEEVPEEEEEEEEEEERTFEEE 333


>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740).  This
           family consists of several uncharacterized plant
           proteins of unknown function.
          Length = 565

 Score = 27.7 bits (61), Expect = 3.9
 Identities = 17/42 (40%), Positives = 25/42 (59%), Gaps = 3/42 (7%)

Query: 19  KEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
             EE+EE    E EE+EE  +KE  K   +K EEE+++E + 
Sbjct: 139 VLEEEEEV---EMEEDEEYYEKEPGKVVDEKSEEEEEEELKT 177


>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO.  Bacillus
          spores are protected by a protein shell consisting of
          over 50 different polypeptides, known as the coat. This
          family of proteins has an important morphogenetic role
          in coat assembly, it is involved in the assembly of at
          least 5 different coat proteins including CotB, CotG,
          CotS, CotSA and CotW. It is likely to act at a late
          stage of coat assembly.
          Length = 185

 Score = 27.1 bits (60), Expect = 3.9
 Identities = 17/51 (33%), Positives = 29/51 (56%), Gaps = 1/51 (1%)

Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKK-KEEKKKEKKKKEEEKKKEGEKRR 62
           ++  +KEEE E      + +EEE+K   EE +KEK+  E E++   E++ 
Sbjct: 33 IKKADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQE 83



 Score = 27.1 bits (60), Expect = 4.2
 Identities = 16/56 (28%), Positives = 34/56 (60%)

Query: 7   TQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
            Q  +Y+  +K+KE  + E     E++E+EE  +EE+K+E+ +  ++++    KR+
Sbjct: 56  EQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEEEKEEEAEDVKQQEVFSFKRK 111


>gnl|CDD|216381 pfam01237, Oxysterol_BP, Oxysterol-binding protein. 
          Length = 335

 Score = 27.6 bits (62), Expect = 3.9
 Identities = 16/34 (47%), Positives = 26/34 (76%), Gaps = 1/34 (2%)

Query: 29  GEEEEEEEEKKK-EEKKKEKKKKEEEKKKEGEKR 61
           G+ +E EEEK + EEK++E++K+ EEK +E + R
Sbjct: 278 GDYDEAEEEKLRLEEKQRERRKEREEKGEEWKPR 311


>gnl|CDD|218660 pfam05620, DUF788, Protein of unknown function (DUF788).  This
           family consists of several eukaryotic proteins of
           unknown function.
          Length = 166

 Score = 27.3 bits (61), Expect = 4.0
 Identities = 6/25 (24%), Positives = 13/25 (52%)

Query: 26  GGGGEEEEEEEEKKKEEKKKEKKKK 50
                 E  E + K++EK +++ +K
Sbjct: 142 PSSQGAETNETKSKRQEKLEKRGEK 166



 Score = 26.9 bits (60), Expect = 4.5
 Identities = 7/29 (24%), Positives = 13/29 (44%)

Query: 26  GGGGEEEEEEEEKKKEEKKKEKKKKEEEK 54
                  +  E  + + K++EK +K  EK
Sbjct: 138 FASLPSSQGAETNETKSKRQEKLEKRGEK 166


>gnl|CDD|221756 pfam12757, DUF3812, Protein of unknown function (DUF3812).  This is
           a family of fungal proteins whose function is not known.
          Length = 126

 Score = 26.9 bits (60), Expect = 4.0
 Identities = 8/28 (28%), Positives = 20/28 (71%)

Query: 30  EEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
             +EE++  ++E K++ ++ KE E++K+
Sbjct: 98  ARDEEKKLDEEEAKRQHEEAKEREREKK 125



 Score = 26.1 bits (58), Expect = 6.3
 Identities = 9/29 (31%), Positives = 20/29 (68%)

Query: 32  EEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
             +EE+K  EE+ K + ++ +E+++E +K
Sbjct: 98  ARDEEKKLDEEEAKRQHEEAKEREREKKK 126


>gnl|CDD|199812 cd10488, MH1_R-SMAD, N-terminal Mad Homology 1 (MH1) domain of
           receptor regulated SMADs.  The MH1 is a small
           DNA-binding domain present in SMAD (small mothers
           against decapentaplegic) family of proteins, which are
           signal transducers and transcriptional modulators that
           mediate multiple signaling pathways. It binds to the
           major groove in an unusual manner via a beta hairpin
           structure.  It negatively regulates the functions of the
           MH2 domain, the C-terminal domain of SMAD. This MH1
           domain is found in all receptor regulated SMADs
           (R-SMADs) including SMAD1, SMAD2, SMAD3, SMAD5 and
           SMAD9. SMAD1 plays an essential role in bone development
           and postnatal bone formation through activation by bone
           morphogenetic protein (BMP) type 1 receptor kinase.
           SMAD2 regulates multiple cellular processes, such as
           cell proliferation, apoptosis and differentiation, while
           SMAD3 modulates signals of activin and TGF-beta. SMAD4,
           a common mediator SMAD (co-SMAD) binds R-SMADs, forming
           an oligomeric complex that binds to DNA and serves as a
           transcription factor. SMAD5 is involved in bone
           morphogenetic proteins (BMP) signal modulation, possibly
           playing a role in the pathway involving inhibition of
           hematopoietic progenitor cells by TGF-beta. SMAD9 (also
           known as SMAD8) can mediate the differentiation of
           mesenchymal stem cells (MSCs) into tendon-like cells by
           inhibiting the osteogenic pathway.
          Length = 123

 Score = 26.8 bits (59), Expect = 4.1
 Identities = 23/82 (28%), Positives = 36/82 (43%), Gaps = 14/82 (17%)

Query: 30  EEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIGKPALKPRPTRCVEGEGAPT 89
            E+  EEEK  E+  K   KK ++K +  E  + ++         +   TRCV      T
Sbjct: 12  GEQNGEEEKWAEKAVKSLVKKLKKKGQLEELEKAIS--------TQNVNTRCV------T 57

Query: 90  RTLSVQIPLHRVHEQGVPPVVY 111
              S+   L   H +G+P V+Y
Sbjct: 58  IPRSLDGRLQVSHRKGLPHVIY 79


>gnl|CDD|218652 pfam05602, CLPTM1, Cleft lip and palate transmembrane protein 1
           (CLPTM1).  This family consists of several eukaryotic
           cleft lip and palate transmembrane protein 1 sequences.
           Cleft lip with or without cleft palate is a common birth
           defect that is genetically complex. The nonsyndromic
           forms have been studied genetically using linkage and
           candidate-gene association studies with only partial
           success in defining the loci responsible for orofacial
           clefting. CLPTM1 encodes a transmembrane protein and has
           strong homology to two Caenorhabditis elegans genes,
           suggesting that CLPTM1 may belong to a new gene family.
           This family also contains the human cisplatin resistance
           related protein CRR9p which is associated with
           CDDP-induced apoptosis.
          Length = 437

 Score = 27.6 bits (62), Expect = 4.2
 Identities = 12/44 (27%), Positives = 19/44 (43%), Gaps = 7/44 (15%)

Query: 4   SSTTQVSKYKRRKKKKEE---EDEEGGGGEEEEEEEEKKKEEKK 44
                ++ Y  +KK K++     +     E+EE EEEK     K
Sbjct: 139 HFVFPLTTYLPKKKVKKKNLLGGKS----EKEEPEEEKTPAPDK 178


>gnl|CDD|218688 pfam05680, ATP-synt_E, ATP synthase E chain.  This family
          consists of several ATP synthase E chain sequences
          which are components of the CF(0) subunit.
          Length = 83

 Score = 26.3 bits (58), Expect = 4.2
 Identities = 11/27 (40%), Positives = 16/27 (59%)

Query: 30 EEEEEEEEKKKEEKKKEKKKKEEEKKK 56
          ++EE+  E + +EK  EK K    KKK
Sbjct: 34 KKEEKIREYEAQEKLIEKAKAAYAKKK 60



 Score = 25.5 bits (56), Expect = 6.0
 Identities = 12/41 (29%), Positives = 20/41 (48%)

Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKE 51
             +R  KK+EE       +E+  E+ K    KKK+ K++ 
Sbjct: 26 AKHQRSLKKKEEKIREYEAQEKLIEKAKAAYAKKKQAKQEA 66



 Score = 25.1 bits (55), Expect = 10.0
 Identities = 9/35 (25%), Positives = 20/35 (57%)

Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEK 47
          K++++K  E + +    E+ +    KKK+ K++ K
Sbjct: 33 KKKEEKIREYEAQEKLIEKAKAAYAKKKQAKQEAK 67


>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa).  Members of this
           family of proteins are part of the yeast nuclear pore
           complex-associated pre-60S ribosomal subunit. The family
           functions as a highly conserved exonuclease that is
           required for the 5'-end maturation of 5.8S and 25S
           rRNAs, demonstrating that 5'-end processing also has a
           redundant pathway. Nop25 binds late pre-60S ribosomes,
           accompanying them from the nucleolus to the nuclear
           periphery; and there is evidence for both physical and
           functional links between late 60S subunit processing and
           export.
          Length = 134

 Score = 26.9 bits (60), Expect = 4.2
 Identities = 12/48 (25%), Positives = 21/48 (43%)

Query: 12  YKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
            K RK+  +  +EE    E+ E E+ +  E+ + E   +      E E
Sbjct: 69  LKERKEALKLLEEENDDEEDAETEDTEDVEDDEWEGFPEPTVTDYEEE 116


>gnl|CDD|240400 PTZ00396, PTZ00396, Casein kinase II subunit beta; Provisional.
          Length = 251

 Score = 27.3 bits (61), Expect = 4.2
 Identities = 12/49 (24%), Positives = 22/49 (44%), Gaps = 4/49 (8%)

Query: 9   VSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
           V K K  K+ K +  E G    E+      KK  K   + +++E++  +
Sbjct: 206 VHKKKSSKEYKLQNGEFGKSAIEQFL----KKSPKGNIELQEQEQQAIQ 250


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 27.6 bits (62), Expect = 4.2
 Identities = 19/61 (31%), Positives = 33/61 (54%), Gaps = 5/61 (8%)

Query: 9   VSKYKRRKKKKEE----EDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEE-KKKEGEKRRT 63
           + + +RR  ++EE    + E     EE  E++EK+   K+K   +KEEE ++   E+R  
Sbjct: 78  LQRLERRLLQREETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQREE 137

Query: 64  L 64
           L
Sbjct: 138 L 138


>gnl|CDD|219009 pfam06404, PSK, Phytosulfokine precursor protein (PSK).  This
          family consists of several plant specific
          phytosulfokine precursor proteins. Phytosulfokines, are
          active as either a pentapeptide or a C-terminally
          truncated tetrapeptide. These compounds were first
          isolated because of their ability to stimulate cell
          division in somatic embryo cultures of Asparagus
          officinalis.
          Length = 74

 Score = 25.8 bits (57), Expect = 4.3
 Identities = 11/38 (28%), Positives = 17/38 (44%), Gaps = 4/38 (10%)

Query: 32 EEEEEEKKKEEKKKEKKKKEEEKKKEGE----KRRTLT 65
           E  + +  E  K ++ +   E + EGE     RRTL 
Sbjct: 24 HESSKIEAAESSKVDEDEVVSECEGEGEEECLMRRTLA 61


>gnl|CDD|223562 COG0488, Uup, ATPase components of ABC transporters with duplicated
           ATPase domains [General function prediction only].
          Length = 530

 Score = 27.6 bits (62), Expect = 4.4
 Identities = 8/33 (24%), Positives = 17/33 (51%)

Query: 31  EEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
            E   +E    EK++++  KE+E  + G+   +
Sbjct: 243 AERLRQEAAAYEKQQKELAKEQEWIRRGKAAAS 275


>gnl|CDD|114015 pfam05266, DUF724, Protein of unknown function (DUF724).  This
           family contains several uncharacterized proteins found
           in Arabidopsis thaliana and other plants. This region is
           often found associated with Agenet domains and may
           contain coiled-coil.
          Length = 190

 Score = 27.0 bits (60), Expect = 4.4
 Identities = 15/60 (25%), Positives = 30/60 (50%), Gaps = 4/60 (6%)

Query: 1   LQNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKK----KEEKKKEKKKKEEEKKK 56
           L++  T ++ + K  +K+  E++      + E  E E+K    + +    K+KKE E K+
Sbjct: 101 LKDDQTKKLEERKGLEKEIAEKEISRQELDSEIAELERKILELQRQAALLKEKKEAEDKE 160


>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1.  All
           proteins in this family for which functions are known
           are cyclin dependent protein kinases that are components
           of TFIIH, a complex that is involved in nucleotide
           excision repair and transcription initiation. Also known
           as MAT1 (menage a trois 1). This family is based on the
           phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
           Stanford University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 309

 Score = 27.5 bits (61), Expect = 4.4
 Identities = 15/49 (30%), Positives = 31/49 (63%)

Query: 9   VSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
           + K K +  +++EE EE    E+EEEE+ +   +K++E+++  + K K+
Sbjct: 133 IQKNKEKSTREQEELEEALEFEKEEEEQRRLLLQKEEEEQQMNKRKNKQ 181



 Score = 27.5 bits (61), Expect = 4.7
 Identities = 12/53 (22%), Positives = 34/53 (64%)

Query: 8   QVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
           ++  Y++  K   ++++E    E+EE EE  + E++++E+++   +K++E ++
Sbjct: 121 KIETYQKENKDVIQKNKEKSTREQEELEEALEFEKEEEEQRRLLLQKEEEEQQ 173


>gnl|CDD|216828 pfam01984, dsDNA_bind, Double-stranded DNA-binding domain.  This
          domain is believed to bind double-stranded DNA of 20
          bases length.
          Length = 105

 Score = 26.4 bits (59), Expect = 4.4
 Identities = 11/33 (33%), Positives = 25/33 (75%)

Query: 14 RRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKE 46
          RR +  E + ++GG  E++++EE++++EE +K+
Sbjct: 2  RRARLAELQQQQGGLEEQQQQEEQQQQEEAQKQ 34


>gnl|CDD|223683 COG0610, COG0610, Type I site-specific restriction-modification
           system, R (restriction) subunit and related helicases
           [Defense mechanisms].
          Length = 962

 Score = 27.4 bits (61), Expect = 4.6
 Identities = 15/48 (31%), Positives = 23/48 (47%)

Query: 14  RRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
           R+KKK  E   E       ++ +  +K E+     KKEEE K+  E+ 
Sbjct: 851 RQKKKLIERLIEAINQYRAKKLDTAEKLEELYILAKKEEEFKQFAEEE 898


>gnl|CDD|235396 PRK05299, rpsB, 30S ribosomal protein S2; Provisional.
          Length = 258

 Score = 27.1 bits (61), Expect = 4.6
 Identities = 11/34 (32%), Positives = 22/34 (64%)

Query: 19  KEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEE 52
           ++    E    EEEE EEE+++EE+++ ++ + E
Sbjct: 225 RQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258



 Score = 26.3 bits (59), Expect = 8.8
 Identities = 12/34 (35%), Positives = 21/34 (61%)

Query: 18  KKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKE 51
           ++    E     EEE EEEE+++EE++ E+ + E
Sbjct: 225 RQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258


>gnl|CDD|193562 cd08014, M20_Acy1_like4, M20 Peptidase Aminoacylase 1 subfamily.
           Peptidase M20 family, Uncharacterized subfamily of
           uncharacterized bacterial proteins predicted as putative
           amidohydrolases. These are a class of zinc binding
           homodimeric enzymes involved in hydrolysis of
           N-acetylated proteins. N-terminal acetylation of
           proteins is a widespread and highly conserved process
           that is involved in protection and stability of
           proteins. Several types of aminoacylases can be
           distinguished on the basis of substrate specificity.
           Aminoacylase 1 (ACY1) breaks down cytosolic aliphatic
           N-acyl-alpha-amino acids (except L-aspartate),
           especially N-acetyl-methionine and acetyl-glutamate into
           L-amino acids and an acyl group. However, ACY1 can also
           catalyze the reverse reaction, the synthesis of
           acetylated amino acids. ACY1 may also play a role in
           xenobiotic bioactivation as well as the inter-organ
           processing of amino acid-conjugated xenobiotic
           derivatives (S-substituted-N-acetyl-L-cysteine).
          Length = 372

 Score = 27.2 bits (61), Expect = 4.6
 Identities = 10/39 (25%), Positives = 17/39 (43%), Gaps = 5/39 (12%)

Query: 72  PALKPRPTRCVEGEGAPTRTLSVQIPLHRVHEQGVPPVV 110
              +       EG  AP      ++ +   +++GVPPVV
Sbjct: 255 ETAEELVREIAEGVAAPYG---ARVEV--DYQRGVPPVV 288


>gnl|CDD|235366 PRK05218, PRK05218, heat shock protein 90; Provisional.
          Length = 613

 Score = 27.4 bits (62), Expect = 4.7
 Identities = 13/32 (40%), Positives = 23/32 (71%)

Query: 29  GEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
           G+E+EEE+E+K+E +++ K   E  K+  G+K
Sbjct: 477 GKEDEEEKEEKEEAEEEFKPLLERLKEALGDK 508


>gnl|CDD|216447 pfam01346, FKBP_N, Domain amino terminal to FKBP-type
           peptidyl-prolyl isomerase.  This family is only found at
           the amino terminus of pfam00254. This domain is of
           unknown function.
          Length = 124

 Score = 26.4 bits (59), Expect = 4.9
 Identities = 8/25 (32%), Positives = 16/25 (64%)

Query: 36  EEKKKEEKKKEKKKKEEEKKKEGEK 60
           ++K + +++ + +KK EE K  GE 
Sbjct: 78  QKKLQAKQQAKAEKKAEENKAAGEA 102


>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
          Length = 191

 Score = 27.1 bits (60), Expect = 4.9
 Identities = 15/56 (26%), Positives = 25/56 (44%)

Query: 5   STTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
           S     K K   +KK+   +     +E+EE  E   EE    ++ +E +KK + E 
Sbjct: 47  SKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEENDKKVDYEL 102


>gnl|CDD|203848 pfam08079, Ribosomal_L30_N, Ribosomal L30 N-terminal domain.
          This presumed domain is found at the N-terminus of
          Ribosomal L30 proteins and has been termed RL30NT or
          NUC018.
          Length = 71

 Score = 25.7 bits (57), Expect = 5.0
 Identities = 11/31 (35%), Positives = 18/31 (58%)

Query: 38 KKKEEKKKEKKKKEEEKKKEGEKRRTLTFIR 68
          K+ E+ + ++ KK   KK   +K+R L F R
Sbjct: 7  KRNEKLRAKRAKKRAAKKAARKKKRKLIFKR 37


>gnl|CDD|217206 pfam02731, SKIP_SNW, SKIP/SNW domain.  This domain is found in
           chromatin proteins.
          Length = 158

 Score = 26.9 bits (60), Expect = 5.0
 Identities = 10/33 (30%), Positives = 22/33 (66%)

Query: 30  EEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
           EE  +  E +++  +KEK++KEE+ ++  ++ R
Sbjct: 123 EEVRQRAELQRQLAEKEKQEKEEKLRELAQRAR 155


>gnl|CDD|150747 pfam10107, Endonuc_Holl, Endonuclease related to archaeal
          Holliday junction resolvase.  This domain is found in
          various predicted bacterial endonucleases which are
          distantly related to archaeal Holliday junction
          resolvases.
          Length = 153

 Score = 26.9 bits (60), Expect = 5.0
 Identities = 13/47 (27%), Positives = 26/47 (55%)

Query: 8  QVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEK 54
          ++ K + + +K+  E  E     E E +  ++ E K +E +KKEE++
Sbjct: 12 KIIKLQGKVEKRARELFEQWRNAEREAQAMREAEAKFEEWQKKEEKE 58


>gnl|CDD|237284 PRK13108, PRK13108, prolipoprotein diacylglyceryl transferase;
           Reviewed.
          Length = 460

 Score = 27.3 bits (60), Expect = 5.0
 Identities = 4/42 (9%), Positives = 11/42 (26%)

Query: 18  KKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
           +     +          EE      +  ++ + E  +K    
Sbjct: 382 QAPAAHQVDAEAASAAPEEPAALASEAHDETEPEVPEKAAPI 423


>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32.  This family
           consists of several mammalian specific proacrosin
           binding protein sp32 sequences. sp32 is a sperm specific
           protein which is known to bind with with 55- and 53-kDa
           proacrosins and the 49-kDa acrosin intermediate. The
           exact function of sp32 is unclear, it is thought however
           that the binding of sp32 to proacrosin may be involved
           in packaging the acrosin zymogen into the acrosomal
           matrix.
          Length = 243

 Score = 26.9 bits (59), Expect = 5.1
 Identities = 15/49 (30%), Positives = 32/49 (65%)

Query: 11  KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
           + K  K K+E+   +     +E + EEK+ +E+++E++ +EE K++EG+
Sbjct: 191 QVKAPKPKQEQLLSKLQEYLQEHKTEEKQPQEEQEEEEVEEEAKQEEGQ 239



 Score = 26.5 bits (58), Expect = 7.5
 Identities = 12/45 (26%), Positives = 28/45 (62%)

Query: 16  KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
           + K  +  +E    + +E  +E K EEK+ +++++EEE ++E ++
Sbjct: 191 QVKAPKPKQEQLLSKLQEYLQEHKTEEKQPQEEQEEEEVEEEAKQ 235


>gnl|CDD|223266 COG0188, GyrA, Type IIA topoisomerase (DNA gyrase/topo II,
           topoisomerase IV), A subunit [DNA replication,
           recombination, and repair].
          Length = 804

 Score = 27.2 bits (61), Expect = 5.2
 Identities = 19/50 (38%), Positives = 26/50 (52%), Gaps = 9/50 (18%)

Query: 23  DEEGGGGEEEEEEEEKK-------KEEKKKEKKKKE--EEKKKEGEKRRT 63
           +EE    E +E E+E          EE+  +  KKE  E KKK G++RRT
Sbjct: 435 EEEKIEKELKELEKEIADLEKILASEERLLDIIKKELLEIKKKFGDERRT 484


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 27.5 bits (62), Expect = 5.2
 Identities = 16/62 (25%), Positives = 34/62 (54%), Gaps = 7/62 (11%)

Query: 16  KKKKEEEDEEGGGGEEEEEEEEKK-------KEEKKKEKKKKEEEKKKEGEKRRTLTFIR 68
           K+ ++ +       +  E  E +K       K+EKKK+K+K+++E+ K G++ + L+  +
Sbjct: 591 KELRQLQKGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEELKVGDEVKYLSLGQ 650

Query: 69  IG 70
            G
Sbjct: 651 KG 652



 Score = 27.5 bits (62), Expect = 5.2
 Identities = 15/42 (35%), Positives = 27/42 (64%), Gaps = 3/42 (7%)

Query: 16  KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
           +++ E++ EE    E   +E EK KEE +++K+K +EE+ K 
Sbjct: 529 ERELEQKAEEA---EALLKEAEKLKEELEEKKEKLQEEEDKL 567



 Score = 27.1 bits (61), Expect = 6.1
 Identities = 19/64 (29%), Positives = 36/64 (56%), Gaps = 6/64 (9%)

Query: 16  KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK--RRTLTFIRIGKPA 73
           K K+E E+++    E+ +EEE+K  EE +KE ++  +E KKE ++  +      + G  +
Sbjct: 548 KLKEELEEKK----EKLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKGGYAS 603

Query: 74  LKPR 77
           +K  
Sbjct: 604 VKAH 607


>gnl|CDD|233573 TIGR01781, Trep_dent_lipo, Treponema denticola clustered
          lipoprotein.  This model represents a family of six
          predicted lipoproteins from a region of about 20
          tandemly arranged genes in the Treponema denticola
          genome. Two other neighboring genes share the
          lipoprotein signal peptide region but do not show more
          extensive homology. The function of this locus is
          unknown.
          Length = 412

 Score = 27.2 bits (60), Expect = 5.3
 Identities = 10/50 (20%), Positives = 16/50 (32%)

Query: 9  VSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEG 58
              K  +K+ E   +       +  E+ K+   K      K E  K  G
Sbjct: 36 EKFSKMEEKENEFLSKPEYNTHVKSPEQIKELSNKISGVSGKGESDKFVG 85


>gnl|CDD|222792 PHA00435, PHA00435, capsid assembly protein.
          Length = 306

 Score = 27.1 bits (60), Expect = 5.4
 Identities = 12/22 (54%), Positives = 16/22 (72%)

Query: 20  EEEDEEGGGGEEEEEEEEKKKE 41
           EEE+ E G  +EEEE EE+ +E
Sbjct: 80  EEEEVEEGEEDEEEEGEEESEE 101


>gnl|CDD|236410 PRK09202, nusA, transcription elongation factor NusA; Validated.
          Length = 470

 Score = 27.1 bits (61), Expect = 5.5
 Identities = 11/39 (28%), Positives = 21/39 (53%)

Query: 21  EEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
           EE  E  G +EE  EE +++ ++  E +   +E+K   +
Sbjct: 389 EELLEIEGFDEETVEELRERAKEALETEALAQEEKLADD 427


>gnl|CDD|187751 cd09270, RNase_H2-B, Ribonuclease H2-B is a subunit of the
           eukaryotic RNase H complex which cleaves RNA-DNA
           hybrids.  Ribonuclease H2B is one of the three proteins
           of eukaryotic RNase H2 complex that is required for
           nucleic acid binding and hydrolysis. RNase H is
           classified into two families, type I (prokaryotic RNase
           HI, eukaryotic RNase H1 and viral RNase H) and type II
           (prokaryotic RNase HII and HIII, and eukaryotic RNase
           H2/HII). RNase H endonucleolytically hydrolyzes an RNA
           strand when it is annealed to a complementary DNA strand
           in the presence of divalent cations, in DNA replication
           and repair. The enzyme can be found in bacteria,
           archaea, and eukaryotes. Most prokaryotic and eukaryotic
           genomes contain multiple RNase H genes. Despite a lack
           of evidence for homology from sequence comparisons, type
           I and type II RNase H share a common fold and similar
           steric configurations of the four acidic active-site
           residues, suggesting identical or very similar catalytic
           mechanisms. Eukaryotic RNase HII is active during
           replication and is believed to play a role in removal of
           Okazaki fragment primers and single ribonucleotides in
           DNA-DNA duplexes. Eukaryotic RNase HII is functional
           when it forms a complex with RNase H2B and RNase H2C
           proteins. It is speculated that the two accessory
           subunits are required for correct folding of the
           catalytic subunit of RNase HII. Mutations in the three
           subunits of human RNase HII cause neurological disorder.
          Length = 211

 Score = 26.9 bits (60), Expect = 5.5
 Identities = 20/52 (38%), Positives = 30/52 (57%), Gaps = 4/52 (7%)

Query: 14  RRKKKKEEEDEEGGGGEEEEE----EEEKKKEEKKKEKKKKEEEKKKEGEKR 61
            R KKKE + +E     ++++    EEEKKK++KK  KKKK ++    G K 
Sbjct: 154 ERLKKKELDIKEAEAARDKKKSNNAEEEKKKKKKKSAKKKKLKKVAAVGMKA 205


>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain.  This
           family represents the C-terminus (approximately 300
           residues) of proteins that are involved as binding
           partners for Prp19 as part of the nuclear pore complex.
           The family in Drosophila is necessary for pre-mRNA
           splicing, and the human protein has been found in
           purifications of the spliceosome. In the past this
           family was thought, erroneously, to be associated with
           microfibrillin.
          Length = 277

 Score = 27.2 bits (60), Expect = 5.5
 Identities = 20/57 (35%), Positives = 32/57 (56%), Gaps = 1/57 (1%)

Query: 13  KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKK-KEEEKKKEGEKRRTLTFIR 68
           K+ +   +E + E    +  EEE ++K EE+K+E  K  EEE KKE E ++  T + 
Sbjct: 44  KKDRITIQEREREAAKEKALEEEAKRKAEERKRETLKIVEEEVKKELELKKRNTLLE 100


>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
           subunit G; Reviewed.
          Length = 197

 Score = 27.0 bits (59), Expect = 5.5
 Identities = 15/49 (30%), Positives = 28/49 (57%)

Query: 14  RRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
           ++KK      E+     +E EE E++ E +++E+K  E E ++E E+ R
Sbjct: 116 KKKKSLIIRQEQIEKARQEREELEERMEWERREEKIDEREDQEEQERER 164



 Score = 26.2 bits (57), Expect = 9.1
 Identities = 15/61 (24%), Positives = 36/61 (59%)

Query: 1   LQNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
           L++     + K K    ++E+ ++     EE EE  E ++ E+K ++++ +EE+++E E+
Sbjct: 107 LRSVKKDDIKKKKSLIIRQEQIEKARQEREELEERMEWERREEKIDEREDQEEQEREREE 166

Query: 61  R 61
           +
Sbjct: 167 Q 167


>gnl|CDD|236601 PRK09637, PRK09637, RNA polymerase sigma factor SigZ;
          Provisional.
          Length = 181

 Score = 26.9 bits (60), Expect = 5.5
 Identities = 13/27 (48%), Positives = 16/27 (59%)

Query: 15 RKKKKEEEDEEGGGGEEEEEEEEKKKE 41
          RKK + EE  +    E+EE EE  KKE
Sbjct: 69 RKKNRSEELPDDLLFEDEEREENAKKE 95


>gnl|CDD|189027 cd09857, PIN_EXO1, PIN domain of Exonuclease-1, a
           structure-specific, divalent-metal-ion dependent, 5'
           nuclease and homologs.  Exonuclease-1 (EXO1) is involved
           in multiple, eukaryotic DNA metabolic pathways,
           including DNA replication processes (5' flap DNA
           endonuclease activity and double stranded DNA
           5'-exonuclease activity), DNA repair processes (DNA
           mismatch repair (MMR) and post-replication repair
           (PRR)), recombination, and telomere integrity. EXO1
           functions in the MMS2 error-free branch of the PRR
           pathway in the maintenance and repair of stalled
           replication forks. Studies also suggest that EXO1 plays
           both structural and catalytic roles during MMR-mediated
           mutation avoidance. EXO1 belongs to the FEN1-EXO1-like
           family of structure-specific, 5' nucleases. These
           nucleases contain a PIN (PilT N terminus) domain with a
           helical arch/clamp region (I domain) of variable length
           (approximately 43 residues in EXO1 PIN domains) and a
           H3TH (helix-3-turn-helix) domain, an atypical
           helix-hairpin-helix-2-like region. Both the H3TH domain
           (not included here) and the helical arch/clamp region
           are involved in DNA binding. Nucleases within this group
           also have a carboxylate-rich active site that is
           involved in binding essential divalent metal ion
           cofactors (Mg2+/Mn2+). EXO1 nucleases also have
           C-terminal Mlh1- and Msh2-binding domains which allow
           interaction with MMR and PRR proteins, respectively.
          Length = 210

 Score = 26.7 bits (60), Expect = 5.8
 Identities = 7/28 (25%), Positives = 17/28 (60%)

Query: 34  EEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
           +  E+++ E+++E  +K  E  +EG + 
Sbjct: 86  KGTEEERRERREENLEKALELLREGNRS 113


>gnl|CDD|148511 pfam06936, Selenoprotein_S, Selenoprotein S (SelS).  This family
           consists of several mammalian selenoprotein S (SelS)
           sequences. SelS is a plasma membrane protein and is
           present in a variety of tissues and cell types. The
           function of this family is unknown.
          Length = 190

 Score = 26.8 bits (59), Expect = 5.8
 Identities = 8/25 (32%), Positives = 19/25 (76%)

Query: 33  EEEEEKKKEEKKKEKKKKEEEKKKE 57
           +E+ + K  + K+++K+ EEEK+++
Sbjct: 93  QEDLDAKAAKFKEKQKQLEEEKRRQ 117


>gnl|CDD|237200 PRK12778, PRK12778, putative bifunctional 2-polyprenylphenol
           hydroxylase/glutamate synthase subunit beta;
           Provisional.
          Length = 752

 Score = 27.4 bits (61), Expect = 5.8
 Identities = 12/58 (20%), Positives = 19/58 (32%), Gaps = 4/58 (6%)

Query: 13  KRRKKKKEEEDEE----GGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTF 66
           KR    K  E EE             E  K  + K++   ++    + + E R    F
Sbjct: 255 KRMGAYKTIEGEELLKLEERTAAWRAELRKSMKPKERTAIERVPMPELDPEYRAHNRF 312


>gnl|CDD|221122 pfam11490, DNA_pol3_alph_N, DNA polymerase III polC-type
           N-terminus.  This is an N-terminal domain of DNA
           polymerase III polC subunit A that is found only in
           Firmicutes. DNA polymerase polC-type III enzyme
           functions as the 'replicase' in low G + C Gram-positive
           bacteria. Purine asymmetry is a characteristic of
           organisms with a heterodimeric DNA polymerase III
           alpha-subunit constituted by polC which probably plays a
           direct role in the maintenance of strand-biased gene
           distribution; since, among prokaryotic genomes, the
           distribution of genes on the leading and lagging strands
           of the replication fork is known to be biased. The
           domain is associated with DNA_pol3_alpha pfam07733.
          Length = 180

 Score = 26.5 bits (59), Expect = 5.8
 Identities = 16/32 (50%), Positives = 22/32 (68%)

Query: 30  EEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
           EE++EEEE K  E+  E  KK+E +KK+ EK 
Sbjct: 149 EEQKEEEEAKLAEEALEALKKKEAEKKKKEKE 180


>gnl|CDD|223542 COG0466, Lon, ATP-dependent Lon protease, bacterial type
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 782

 Score = 27.1 bits (61), Expect = 5.9
 Identities = 11/32 (34%), Positives = 23/32 (71%)

Query: 29  GEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
           GE++++++E ++  +K EK K  +E K++ EK
Sbjct: 243 GEDDDDKDEVEELREKIEKLKLPKEAKEKAEK 274


>gnl|CDD|205206 pfam13025, DUF3886, Protein of unknown function (DUF3886).  This
          family of proteins is functionally uncharacterized.
          This family of proteins is found in bacteria. Proteins
          in this family are approximately 90 amino acids in
          length. There are two completely conserved L residues
          that may be functionally important.
          Length = 70

 Score = 25.4 bits (56), Expect = 5.9
 Identities = 11/26 (42%), Positives = 20/26 (76%)

Query: 31 EEEEEEEKKKEEKKKEKKKKEEEKKK 56
          + EEE+ +++EE +K +++KE EK K
Sbjct: 30 KAEEEKREEEEEARKREERKEREKNK 55



 Score = 25.4 bits (56), Expect = 6.0
 Identities = 10/25 (40%), Positives = 20/25 (80%)

Query: 30 EEEEEEEEKKKEEKKKEKKKKEEEK 54
           EEE+ EE+++  K++E+K++E+ K
Sbjct: 31 AEEEKREEEEEARKREERKEREKNK 55


>gnl|CDD|235229 PRK04156, gltX, glutamyl-tRNA synthetase; Provisional.
          Length = 567

 Score = 27.1 bits (61), Expect = 5.9
 Identities = 11/27 (40%), Positives = 18/27 (66%)

Query: 30 EEEEEEEEKKKEEKKKEKKKKEEEKKK 56
          EE+ E  E+   E  +E+++K+EEKK 
Sbjct: 64 EEQRERLEELAPELLEEEEEKKEEKKG 90



 Score = 27.1 bits (61), Expect = 6.7
 Identities = 14/28 (50%), Positives = 15/28 (53%)

Query: 29  GEEEEEEEEKKKEEKKKEKKKKEEEKKK 56
             E  EEEE+KKEEKK        EK K
Sbjct: 74  APELLEEEEEKKEEKKGLPPLPNAEKGK 101


>gnl|CDD|215590 PLN03123, PLN03123, poly [ADP-ribose] polymerase; Provisional.
          Length = 981

 Score = 27.1 bits (60), Expect = 5.9
 Identities = 13/33 (39%), Positives = 22/33 (66%)

Query: 31  EEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
           +EE+ EE+K+E KK  K+KK+     + +K +T
Sbjct: 191 KEEKAEERKQESKKGAKRKKDASGDDKSKKAKT 223


>gnl|CDD|237214 PRK12813, flgD, flagellar basal body rod modification protein;
           Reviewed.
          Length = 223

 Score = 26.6 bits (59), Expect = 6.1
 Identities = 12/34 (35%), Positives = 18/34 (52%), Gaps = 3/34 (8%)

Query: 142 LSNIFETFLPQL---LTYPNPTDPLNGDAAAMYL 172
           L++ FETFL  L   L   +P +P++    A  L
Sbjct: 28  LTSDFETFLRMLTTQLQNQDPLNPMDSSDYAAQL 61


>gnl|CDD|233194 TIGR00930, 2a30, K-Cl cotransporter.  [Transport and binding
           proteins, Other].
          Length = 953

 Score = 27.0 bits (60), Expect = 6.2
 Identities = 7/40 (17%), Positives = 18/40 (45%)

Query: 30  EEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRI 69
            + E++ E K  +    K  K  + + +   R++   +R+
Sbjct: 854 HKTEKDREAKDPKMTWTKPWKITDAELQSNVRKSYRQVRL 893


>gnl|CDD|215525 PLN02972, PLN02972, Histidyl-tRNA synthetase.
          Length = 763

 Score = 27.2 bits (60), Expect = 6.3
 Identities = 10/25 (40%), Positives = 14/25 (56%)

Query: 25  EGGGGEEEEEEEEKKKEEKKKEKKK 49
           E    E E E  +  K++KK +KKK
Sbjct: 234 EKPQAEAENEAGKSDKKDKKSKKKK 258



 Score = 26.8 bits (59), Expect = 7.4
 Identities = 11/30 (36%), Positives = 15/30 (50%)

Query: 21  EEDEEGGGGEEEEEEEEKKKEEKKKEKKKK 50
           E  E     + E E E  K ++K K+ KKK
Sbjct: 228 EGGELIEKPQAEAENEAGKSDKKDKKSKKK 257


>gnl|CDD|163660 cd07417, MPP_PP5_C, PP5, C-terminal metallophosphatase domain.
           Serine/threonine protein phosphatase-5 (PP5) is a member
           of the PPP gene family of protein phosphatases that is
           highly conserved among eukaryotes and widely expressed
           in mammalian tissues. PP5 has a C-terminal phosphatase
           domain and an extended N-terminal TPR (tetratricopeptide
           repeat) domain containing three TPR motifs.  The PPP
           (phosphoprotein phosphatase) family, to which PP5
           belongs, is one of two known protein phosphatase
           families specific for serine and threonine.  The PPP
           family also includes: PP1, PP2A, PP2B (calcineurin),
           PP4, PP6, PP7, Bsu1, RdgC, PrpE, PrpA/PrpB, and ApA4
           hydrolase. The PPP catalytic domain is defined by three
           conserved motifs (-GDXHG-, -GDXVDRG- and -GNHE-).  The
           PPP enzyme family is ancient with members found in all
           eukaryotes, and in most bacterial and archeal genomes.
           Dephosphorylation of phosphoserines and
           phosphothreonines on target proteins plays a central
           role in the regulation of many cellular processes.  PPPs
           belong to the metallophosphatase (MPP) superfamily.
           MPPs are functionally diverse, but all share a conserved
           domain with an active site consisting of two metal ions
           (usually manganese, iron, or zinc) coordinated with
           octahedral geometry by a cage of histidine, aspartate,
           and asparagine residues. The MPP superfamily includes:
           Mre11/SbcD-like exonucleases, Dbr1-like RNA lariat
           debranching enzymes, YfcE-like phosphodiesterases,
           purple acid phosphatases (PAPs), YbbF-like
           UDP-2,3-diacylglucosamine hydrolases, and acid
           sphingomyelinases (ASMases).  The conserved domain is a
           double beta-sheet sandwich with a di-metal active site
           made up of residues located at the C-terminal side of
           the sheets.  This domain is thought to allow for
           productive metal coordination.
          Length = 316

 Score = 26.8 bits (60), Expect = 6.3
 Identities = 18/43 (41%), Positives = 21/43 (48%), Gaps = 11/43 (25%)

Query: 126 TVCLNVINQEWTPLYDLSNIFETFLPQLLTYPNPTDPL--NGD 166
           TVC +   Q     YDL NIFE     L   P+ T+P   NGD
Sbjct: 63  TVCGDTHGQ----FYDLLNIFE-----LNGLPSETNPYLFNGD 96


>gnl|CDD|111875 pfam03032, Brevenin, Brevenin/esculentin/gaegurin/rugosin family.
           This family contains a number of defence peptides
          secreted from the skin of amphibians, including the
          opiate-like dermorphins and deltorphins, and the
          antimicrobial dermoseptins and temporins. The alignment
          for this family includes the signal peptide.
          Length = 46

 Score = 24.6 bits (54), Expect = 6.3
 Identities = 12/29 (41%), Positives = 19/29 (65%), Gaps = 4/29 (13%)

Query: 20 EEEDEEGGGGEEEEEEEEKKKEEKKKEKK 48
          EEE  E    +EEE E+E++ EE+ + K+
Sbjct: 22 EEEKRE----DEEENEDEEEGEEQSEVKR 46



 Score = 24.6 bits (54), Expect = 6.4
 Identities = 10/23 (43%), Positives = 15/23 (65%)

Query: 18 KKEEEDEEGGGGEEEEEEEEKKK 40
          +K E++EE    EE EE+ E K+
Sbjct: 24 EKREDEEENEDEEEGEEQSEVKR 46


>gnl|CDD|218581 pfam05416, Peptidase_C37, Southampton virus-type processing
           peptidase.  Corresponds to Merops family C37.
           Norwalk-like viruses (NLVs), including the Southampton
           virus, cause acute non-bacterial gastroenteritis in
           humans. The NLV genome encodes three open reading frames
           (ORFs). ORF1 encodes a polyprotein, which is processed
           by the viral protease into six proteins.
          Length = 535

 Score = 27.1 bits (60), Expect = 6.5
 Identities = 16/51 (31%), Positives = 20/51 (39%), Gaps = 5/51 (9%)

Query: 20  EEEDEEGGGGEEEEEEE-----EKKKEEKKKEKKKKEEEKKKEGEKRRTLT 65
            E     G   EE++E      +    E KK K KK   KK     RR L+
Sbjct: 204 XEVTYYDGCESEEDDEGWYIEPQDATPEGKKGKNKKGRGKKHNAFSRRGLS 254


>gnl|CDD|219627 pfam07897, DUF1675, Protein of unknown function (DUF1675).  The
          members of this family are sequences derived from
          hypothetical plant proteins of unknown function. One
          member of this family is annotated as a putative
          RNA-binding protein, but no evidence was found to
          support this.
          Length = 283

 Score = 26.8 bits (59), Expect = 6.8
 Identities = 9/30 (30%), Positives = 17/30 (56%)

Query: 34 EEEEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
          E EE+ ++ K+ +  ++ E K+K  EK   
Sbjct: 67 ETEEEWRKRKEMQSLRRLEAKRKRSEKEYN 96


>gnl|CDD|227563 COG5238, RNA1, Ran GTPase-activating protein (RanGAP) involved in
           mRNA processing and transport [Signal transduction
           mechanisms / RNA processing and modification].
          Length = 388

 Score = 26.8 bits (59), Expect = 6.9
 Identities = 9/38 (23%), Positives = 17/38 (44%), Gaps = 2/38 (5%)

Query: 22  EDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
             EE    +EE E+E +  E   +E  + E  + +  +
Sbjct: 343 VTEEST--DEESEDEVEIDESVIEEVAEMELLEVQVDD 378


>gnl|CDD|217502 pfam03343, SART-1, SART-1 family.  SART-1 is a protein involved
          in cell cycle arrest and pre-mRNA splicing. It has been
          shown to be a component of U4/U6 x U5 tri-snRNP complex
          in human, Schizosaccharomyces pombe and Saccharomyces
          cerevisiae. SART-1 is a known tumour antigen in a range
          of cancers recognised by T cells.
          Length = 603

 Score = 27.0 bits (60), Expect = 7.0
 Identities = 13/44 (29%), Positives = 22/44 (50%)

Query: 17 KKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
          K+  +        + +EE E K+K E+ +EK  K  EK++   K
Sbjct: 31 KESRDAAAYENWKKRQEEAEAKRKREELREKIAKAREKRERNSK 74


>gnl|CDD|215064 PLN00122, PLN00122, serine/threonine protein phosphatase 2A;
           Provisional.
          Length = 170

 Score = 26.6 bits (59), Expect = 7.0
 Identities = 10/23 (43%), Positives = 16/23 (69%)

Query: 31  EEEEEEEKKKEEKKKEKKKKEEE 53
           EE+E + K+ EEK++   K+ EE
Sbjct: 139 EEDEAKAKEVEEKREATWKRLEE 161


>gnl|CDD|240292 PTZ00148, PTZ00148, 40S ribosomal protein S8; Provisional.
          Length = 205

 Score = 26.5 bits (59), Expect = 7.2
 Identities = 12/33 (36%), Positives = 19/33 (57%)

Query: 16  KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKK 48
           KKKK +++EE    +++ E    K E +KK  K
Sbjct: 123 KKKKYKKEEENKKAKKKSESLVAKLEVRKKNHK 155


>gnl|CDD|206228 pfam14058, PcfK, PcfK-like protein.  The PcfK-like protein family
           includes the Enterococcus faecalis PcfK protein, which
           is functionally uncharacterized. This family of proteins
           is found in bacteria and viruses. Proteins in this
           family are typically between 137 and 257 amino acids in
           length. There are two completely conserved residues (D
           and L) that may be functionally important.
          Length = 136

 Score = 26.2 bits (58), Expect = 7.2
 Identities = 11/44 (25%), Positives = 20/44 (45%)

Query: 16  KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
                E  EE      +E  +  ++EE +K +K+ ++ KK E  
Sbjct: 84  VNHTVELTEEEKAEARKEALKAYQQEELRKIQKRSKKSKKAEPV 127



 Score = 25.8 bits (57), Expect = 9.6
 Identities = 12/50 (24%), Positives = 26/50 (52%)

Query: 8   QVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
           +V+     +  +EE+ E      +  ++EE +K +K+ +K KK E  + +
Sbjct: 81  RVTVNHTVELTEEEKAEARKEALKAYQQEELRKIQKRSKKSKKAEPVQGQ 130


>gnl|CDD|116948 pfam08367, M16C_assoc, Peptidase M16C associated.  This domain
          appears in eukaryotes as well as bacteria and tends to
          be found near the C-terminus of the metalloprotease
          M16C (pfam05193).
          Length = 248

 Score = 26.4 bits (59), Expect = 7.2
 Identities = 13/31 (41%), Positives = 20/31 (64%), Gaps = 3/31 (9%)

Query: 29 GEEEEEEEEKKKEEKKKEKKKK---EEEKKK 56
           +E  EE+  ++E +K EKKK    EE+K+K
Sbjct: 3  PDETLEEKLNEEEREKLEKKKSSLSEEDKEK 33


>gnl|CDD|219564 pfam07771, TSGP1, Tick salivary peptide group 1.  This contains a
           group of peptides derived from a salivary gland cDNA
           library of the tick Ixodes scapularis. Also present are
           peptides from a related tick species, Ixodes ricinus.
           They are characterized by a putative signal peptide
           indicative of secretion and conserved cysteine residues.
          Length = 120

 Score = 26.0 bits (57), Expect = 7.3
 Identities = 11/27 (40%), Positives = 18/27 (66%)

Query: 31  EEEEEEEKKKEEKKKEKKKKEEEKKKE 57
            E  E+ KKK++K K+ KK ++  KK+
Sbjct: 94  PEPTEKPKKKKKKSKKTKKPKKSSKKD 120


>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
           Transcription of the anti-viral guanylate-binding
           protein (GBP) is induced by interferon-gamma during
           macrophage induction. This family contains GBP1 and
           GPB2, both GTPases capable of binding GTP, GDP and GMP.
          Length = 297

 Score = 26.5 bits (59), Expect = 7.4
 Identities = 10/34 (29%), Positives = 19/34 (55%)

Query: 30  EEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
            E  + E  + E++   +K+KEEE+  E ++R  
Sbjct: 208 AERAKAEAAEAEQELLREKQKEEEQMMEAQERSY 241



 Score = 26.5 bits (59), Expect = 7.6
 Identities = 14/50 (28%), Positives = 26/50 (52%), Gaps = 2/50 (4%)

Query: 13  KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKK--EEEKKKEGEK 60
           K +  + E    E    E+E   E++K+EE+  E +++  +E  K+  EK
Sbjct: 202 KEKAIEAERAKAEAAEAEQELLREKQKEEEQMMEAQERSYQEHVKQLIEK 251


>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
           (nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
           phosphoribosyltransferase) [Coenzyme metabolism].
          Length = 620

 Score = 26.7 bits (59), Expect = 7.4
 Identities = 10/34 (29%), Positives = 17/34 (50%)

Query: 20  EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEE 53
             ED E G  E E  +E ++ E +  + + +E E
Sbjct: 243 NNEDSEAGREESEGSDESEEDEAEATDGEGEEGE 276


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 27.0 bits (60), Expect = 7.4
 Identities = 16/44 (36%), Positives = 25/44 (56%)

Query: 13  KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKK 56
           +  K++  E        +EE EE EK+ EE ++E ++ EEE KK
Sbjct: 392 QELKEELAELSAALEEIQEELEELEKELEELERELEELEEEIKK 435


>gnl|CDD|220535 pfam10037, MRP-S27, Mitochondrial 28S ribosomal protein S27.
           Members of this family of small ribosomal proteins
           possess one of three conserved blocks of sequence found
           in proteins that stimulate the dissociation of guanine
           nucleotides from G-proteins, leaving open the
           possibility that MRP-S27 might be a functional partner
           of GTP-binding ribosomal proteins.
          Length = 417

 Score = 26.7 bits (59), Expect = 7.4
 Identities = 10/51 (19%), Positives = 24/51 (47%)

Query: 15  RKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLT 65
            +++    ++E     E+ ++    +  K  ++KKK  E+  E  ++R  T
Sbjct: 364 VEERLPTIEKEDLELYEQRQQLWFFENRKLWQRKKKLREQADEEYQQRHAT 414


>gnl|CDD|224530 COG1614, CdhC, CO dehydrogenase/acetyl-CoA synthase beta subunit
           [Energy production and conversion].
          Length = 470

 Score = 26.7 bits (59), Expect = 7.6
 Identities = 16/59 (27%), Positives = 27/59 (45%), Gaps = 3/59 (5%)

Query: 7   TQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLT 65
           T + + +   K+K     E    EEEEEEEE   EE++  + +   E+   G +   + 
Sbjct: 381 TTIDELREFLKEKGHPVVERWAEEEEEEEEE---EEEEAAEAEAPMEEPVPGFEVPEMP 436


>gnl|CDD|201249 pfam00472, RF-1, RF-1 domain.  This domain is found in peptide
           chain release factors such as RF-1 and RF-2, and a
           number of smaller proteins of unknown function. This
           domain contains the peptidyl-tRNA hydrolase activity.
           The domain contains a highly conserved motif GGQ, where
           the glutamine is thought to coordinate the water that
           mediates the hydrolysis.
          Length = 114

 Score = 25.6 bits (57), Expect = 7.8
 Identities = 11/33 (33%), Positives = 16/33 (48%)

Query: 36  EEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIR 68
           E +KK EK K  +  +  +    EK RT  F +
Sbjct: 75  ELQKKREKTKPTRASQVRRGDRKEKIRTYNFPQ 107


>gnl|CDD|237258 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
          Length = 925

 Score = 26.9 bits (60), Expect = 7.9
 Identities = 10/42 (23%), Positives = 21/42 (50%)

Query: 11  KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEE 52
           K  +    K+   +     +E +++E+  K++KK  +K  EE
Sbjct: 883 KLNKLIIAKDVLIKLVISSDEIKQDEKTTKKKKKDLEKTDEE 924


>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein.  This family includes proteins
           related to Mpp10 (M phase phosphoprotein 10). The U3
           small nucleolar ribonucleoprotein (snoRNP) is required
           for three cleavage events that generate the mature 18S
           rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
           depletion of Mpp10, a U3 snoRNP-specific protein, halts
           18S rRNA production and impairs cleavage at the three U3
           snoRNP-dependent sites.
          Length = 613

 Score = 26.9 bits (59), Expect = 7.9
 Identities = 13/43 (30%), Positives = 23/43 (53%)

Query: 20  EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
            E   E    +E E++EEKK EE++  ++K+  E+    +K  
Sbjct: 144 SESSLEDLSDDETEDDEEKKMEEEEAGEEKESVEQATREKKFD 186


>gnl|CDD|235202 PRK04023, PRK04023, DNA polymerase II large subunit; Validated.
          Length = 1121

 Score = 26.8 bits (60), Expect = 7.9
 Identities = 15/41 (36%), Positives = 24/41 (58%), Gaps = 4/41 (9%)

Query: 11  KYKRRKKKKEEED----EEGGGGEEEEEEEEKKKEEKKKEK 47
           K  +   K + +     EE   G++E+EEEE+KK+ K K+K
Sbjct: 255 KILKYVDKLKIDGWEWLEELISGKKEKEEEEEKKKIKPKDK 295


>gnl|CDD|149180 pfam07960, CBP4, CBP4.  The CBP4 in S. cerevisiae is essential for
           the expression and activity of ubiquinol-cytochrome c
           reductase. This family appears to be fungal specific.
          Length = 128

 Score = 25.8 bits (57), Expect = 7.9
 Identities = 12/41 (29%), Positives = 25/41 (60%)

Query: 16  KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKK 56
           KK+   E ++    E ++EE E+ +EE ++ + + EE +K+
Sbjct: 88  KKRNVREFQKTKAEEAQKEELERIREELEEARAQSEEMRKE 128


>gnl|CDD|227199 COG4862, MecA, Negative regulator of genetic competence,
           sporulation and motility [Posttranslational
           modification, protein turnover, chaperones / Signal
           transduction mechanisms / Cell motility and secretion].
          Length = 224

 Score = 26.6 bits (59), Expect = 7.9
 Identities = 15/59 (25%), Positives = 31/59 (52%)

Query: 9   VSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFI 67
           V+K K   +  ++ DE     +E+  EE  ++ E+K +  +KEE+  +  ++   L +I
Sbjct: 75  VTKSKIEGEGPDDLDEFLDMPDEDGAEEGLEQLEEKFDDVEKEEQGDEAEKEPDYLYYI 133


>gnl|CDD|184416 PRK13955, mscL, large-conductance mechanosensitive channel;
           Provisional.
          Length = 130

 Score = 25.9 bits (57), Expect = 8.0
 Identities = 15/47 (31%), Positives = 22/47 (46%), Gaps = 10/47 (21%)

Query: 13  KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKK--KEEEKKKE 57
           K   KK+EE        +EEE  E  K+EE   E +   K++   K+
Sbjct: 92  KLTSKKEEE--------KEEEIPEPTKEEELLGEIRDLLKQQNSSKD 130


>gnl|CDD|181433 PRK08455, fliL, flagellar basal body-associated protein FliL;
          Reviewed.
          Length = 182

 Score = 26.2 bits (58), Expect = 8.0
 Identities = 10/36 (27%), Positives = 17/36 (47%)

Query: 27 GGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
          G  EEE+E  +K  +E +        ++ KEG   +
Sbjct: 43 GSKEEEKENAQKPTQEVQANPPANSAQEAKEGSNIQ 78


>gnl|CDD|129262 TIGR00158, L9, ribosomal protein L9.  Ribosomal protein L9
          appears to be universal in, but restricted to,
          eubacteria and chloroplast [Protein synthesis,
          Ribosomal proteins: synthesis and modification].
          Length = 148

 Score = 26.2 bits (58), Expect = 8.1
 Identities = 11/32 (34%), Positives = 15/32 (46%)

Query: 35 EEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTF 66
          E  +KK E+K    K    + KE  +  TLT 
Sbjct: 48 EARRKKLEEKLAANKAAAARLKEVLELGTLTI 79


>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6.  The surfeit locus
           protein SURF-6 is shown to be a component of the
           nucleolar matrix and has a strong binding capacity for
           nucleic acids.
          Length = 206

 Score = 26.5 bits (59), Expect = 8.3
 Identities = 23/56 (41%), Positives = 36/56 (64%), Gaps = 2/56 (3%)

Query: 11  KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEE--KKKEKKKKEEEKKKEGEKRRTL 64
           K K++KK K+E  E     E+++ E +KK+EE  KK++  KK ++KKK  +K R L
Sbjct: 151 KEKQKKKSKKEWKERKEKVEKKKAERQKKREENLKKRKDDKKNKKKKKAKKKGRIL 206



 Score = 26.1 bits (58), Expect = 9.7
 Identities = 9/32 (28%), Positives = 21/32 (65%)

Query: 31 EEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
          E   E+ ++K E++K +KK++ ++ K+ E  +
Sbjct: 5  EALLEQRRRKREQRKARKKQKRKEAKKKEDAQ 36


>gnl|CDD|236987 PRK11798, PRK11798, ClpXP protease specificity-enhancing factor;
           Provisional.
          Length = 138

 Score = 25.9 bits (58), Expect = 8.3
 Identities = 4/29 (13%), Positives = 11/29 (37%)

Query: 22  EDEEGGGGEEEEEEEEKKKEEKKKEKKKK 50
           E E     +E +++ +   E     +  +
Sbjct: 103 EPEAAYDEDEHDDDTDPDDEPPPPPRGGR 131


>gnl|CDD|132187 TIGR03143, AhpF_homolog, putative alkyl hydroperoxide reductase F
           subunit.  This family of thioredoxin reductase homologs
           is found adjacent to alkylhydroperoxide reductase C
           subunit predominantly in cases where there is only one C
           subunit in the genome and that genome is lacking the F
           subunit partner (also a thioredcxin reductase homolog)
           that is usually found (TIGR03140).
          Length = 555

 Score = 26.7 bits (59), Expect = 8.4
 Identities = 15/51 (29%), Positives = 20/51 (39%), Gaps = 6/51 (11%)

Query: 7   TQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
           T   +Y +  K      E+ G  EE EEEE K+  E    +       KK 
Sbjct: 302 TSAERYVKELK------EKLGIAEEYEEEEAKEASEASAAETTPAATTKKG 346


>gnl|CDD|100108 cd04411, Ribosomal_P1_P2_L12p, Ribosomal protein P1, P2, and
          L12p. Ribosomal proteins P1 and P2 are the eukaryotic
          proteins that are functionally equivalent to bacterial
          L7/L12. L12p is the archaeal homolog. Unlike other
          ribosomal proteins, the archaeal L12p and eukaryotic P1
          and P2 do not share sequence similarity with their
          bacterial counterparts. They are part of the ribosomal
          stalk (called the L7/L12 stalk in bacteria), along with
          28S rRNA and the proteins L11 and P0 in eukaryotes (23S
          rRNA, L11, and L10e in archaea). In bacterial
          ribosomes, L7/L12 homodimers bind the extended
          C-terminal helix of L10 to anchor the L7/L12 molecules
          to the ribosome. Eukaryotic P1/P2 heterodimers and
          archaeal L12p homodimers are believed to bind the L10
          equivalent proteins, eukaryotic P0 and archaeal L10e,
          in a similar fashion. P1 and P2 (L12p, L7/L12) are the
          only proteins in the ribosome to occur as multimers,
          always appearing as sets of dimers. Recent data
          indicate that most archaeal species contain six copies
          of L12p (three homodimers), while eukaryotes have two
          copies each of P1 and P2 (two heterodimers). Bacteria
          may have four or six copies (two or three homodimers),
          depending on the species. As in bacteria, the stalk is
          crucial for binding of initiation, elongation, and
          release factors in eukaryotes and archaea.
          Length = 105

 Score = 25.7 bits (56), Expect = 8.6
 Identities = 8/25 (32%), Positives = 15/25 (60%)

Query: 29 GEEEEEEEEKKKEEKKKEKKKKEEE 53
                E  +K EE K+E++++E+E
Sbjct: 74 TAAATAEPAEKAEEAKEEEEEEEDE 98


>gnl|CDD|227513 COG5186, PAP1, Poly(A) polymerase [RNA processing and
           modification].
          Length = 552

 Score = 26.8 bits (59), Expect = 8.6
 Identities = 16/47 (34%), Positives = 19/47 (40%)

Query: 20  EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTF 66
           EE  E+ G G   E   E    EK       EEE  ++G K    TF
Sbjct: 417 EESIEKIGSGVTLEVAYESTDHEKLANDTVPEEESMEDGMKVYCTTF 463


>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
          Length = 1068

 Score = 26.5 bits (59), Expect = 8.9
 Identities = 8/28 (28%), Positives = 16/28 (57%)

Query: 35  EEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
            EE K +E+   K + + E++++  K R
Sbjct: 583 GEETKPQEQPAPKAEAKPERQQDRRKPR 610


>gnl|CDD|233605 TIGR01865, cas_Csn1, CRISPR-associated protein Cas9/Csn1, subtype
           II/NMEMI.  CRISPR loci appear to be mobile elements with
           a wide host range. This model represents a protein found
           only in CRISPR-containing species, near other
           CRISPR-associated proteins (cas), as part of the NMENI
           subtype of CRISPR/Cas locus. The species range so far
           for this protein is animal pathogens and commensals only
           [Mobile and extrachromosomal element functions, Other].
          Length = 805

 Score = 26.6 bits (59), Expect = 9.1
 Identities = 10/26 (38%), Positives = 14/26 (53%)

Query: 33  EEEEEKKKEEKKKEKKKKEEEKKKEG 58
           EE+     +   KE+ KK E+K KE 
Sbjct: 525 EEQGTNFGKRNSKERYKKNEDKIKEF 550


>gnl|CDD|236277 PRK08493, PRK08493, NADH dehydrogenase subunit G; Validated.
          Length = 819

 Score = 26.6 bits (59), Expect = 9.2
 Identities = 14/53 (26%), Positives = 22/53 (41%)

Query: 1   LQNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEE 53
           L+    + V +      ++  E       +  E +EE KKE  KK KK  E +
Sbjct: 452 LEEFKQSIVKEAALSILEEIREKVLEQAEQGCENQEEVKKEVPKKVKKIPEVD 504



 Score = 26.6 bits (59), Expect = 9.4
 Identities = 9/39 (23%), Positives = 18/39 (46%)

Query: 16  KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEK 54
               EE  E+     E+  E +++ +++  +K KK  E 
Sbjct: 465 LSILEEIREKVLEQAEQGCENQEEVKKEVPKKVKKIPEV 503


>gnl|CDD|224035 COG1110, COG1110, Reverse gyrase [DNA replication, recombination,
           and repair].
          Length = 1187

 Score = 26.6 bits (59), Expect = 9.4
 Identities = 12/61 (19%), Positives = 28/61 (45%)

Query: 20  EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIGKPALKPRPT 79
            EE  E      +   +   ++  ++ +++  E +++  +KRR L  + +     KPR +
Sbjct: 232 SEEVIESAYELIKLRRKLYGEKRAERVREELREVEREREKKRRKLGILVVSSATGKPRGS 291

Query: 80  R 80
           R
Sbjct: 292 R 292


>gnl|CDD|151173 pfam10669, Phage_Gp23, Protein gp23 (Bacteriophage A118).  This
          is the highly conserved family of the major tail
          subunit protein.
          Length = 121

 Score = 25.8 bits (56), Expect = 9.4
 Identities = 10/26 (38%), Positives = 18/26 (69%)

Query: 31 EEEEEEEKKKEEKKKEKKKKEEEKKK 56
          E +EE +K + E++K  K+ +EE+ K
Sbjct: 47 EMKEERDKMETEREKRDKESKEERDK 72


>gnl|CDD|237147 PRK12586, PRK12586, putative monovalent cation/H+ antiporter
           subunit G; Reviewed.
          Length = 145

 Score = 25.9 bits (57), Expect = 9.4
 Identities = 8/24 (33%), Positives = 12/24 (50%)

Query: 33  EEEEEKKKEEKKKEKKKKEEEKKK 56
              E+   E  +   KK+EE +KK
Sbjct: 117 SSNEQNSTEALQLRAKKREEHRKK 140


>gnl|CDD|153327 cd07643, I-BAR_IMD_MIM, Inverse (I)-BAR, also known as the
           IRSp53/MIM homology Domain (IMD), of Missing In
           Metastasis.  The IMD domain, also called
           Inverse-Bin/Amphiphysin/Rvs (I-BAR) domain, is a
           dimerization and lipid-binding module that bends
           membranes and induces membrane protrusions. Members of
           this subfamily include missing in metastasis (MIM) or
           metastasis suppressor 1 (MTSS1), metastasis suppressor
           1-like (MTSSL) or ABBA (Actin-Bundling protein with
           BAIAP2 homology), and similar proteins. They contain an
           N-terminal IMD and a WASP homology 2 (WH2) actin-binding
           motif at the C-terminus. MIM was originally identified
           as a missing transcript from metastatic bladder and
           prostate cancer cells. It is a scaffold protein that
           functions in a signaling pathway between the PDGF
           receptor, Src kinases, and actin assembly. It may also
           function as a cofactor of the Sonic hedgehog (Shh)
           transcriptional pathway and may participate in tumor
           development and progression via this pathway. ABBA
           regulates actin and plasma membrane dynamics to promote
           the extension of radial glia, which is important in
           neuronal migration, axon guidance and neurogenesis. The
           IMD domain of MIM binds and bundles actin filaments,
           binds membranes, and interacts with the small GTPase
           Rac.
          Length = 231

 Score = 26.3 bits (58), Expect = 9.5
 Identities = 19/52 (36%), Positives = 27/52 (51%), Gaps = 6/52 (11%)

Query: 31  EEEEEEEKK------KEEKKKEKKKKEEEKKKEGEKRRTLTFIRIGKPALKP 76
           +E+ EE KK      K+  K+ KK ++E KKK  +  R     R GK  L+P
Sbjct: 103 QEKIEEWKKVANQLDKDHAKEYKKARQEIKKKSSDTIRLQKKARKGKGDLQP 154


>gnl|CDD|192481 pfam10197, Cir_N, N-terminal domain of CBF1 interacting
          co-repressor CIR.  This is a 45 residue conserved
          region at the N-terminal end of a family of proteins
          referred to as CIRs (CBF1-interacting co-repressors).
          CBF1 (centromere-binding factor 1) acts as a
          transcription factor that causes repression by binding
          specifically to GTGGGAA motifs in responsive promoters,
          and it requires CIR as a co-repressor. CIR binds to
          histone deacetylase and to SAP30 and serves as a linker
          between CBF1 and the histone deacetylase complex.
          Length = 37

 Score = 24.1 bits (53), Expect = 9.8
 Identities = 10/22 (45%), Positives = 16/22 (72%)

Query: 39 KKEEKKKEKKKKEEEKKKEGEK 60
          + E+K  E++KK EE +KE E+
Sbjct: 15 EAEQKALEEQKKIEELRKEIEE 36


>gnl|CDD|239572 cd03490, Topoisomer_IB_N_1, Topoisomer_IB_N_1: A subgroup of the
           N-terminal DNA binding fragment found in eukaryotic DNA
           topoisomerase (topo) IB. Topo IB proteins include the
           monomeric yeast and human topo I and heterodimeric topo
           I from Leishmania donvanni. Topo I enzymes are divided
           into:  topo type IA (bacterial) and type IB
           (eukaryotic). Topo I relaxes superhelical tension in
           duplex DNA by creating a single-strand nick, the broken
           strand can then rotate around the unbroken strand to
           remove DNA supercoils and, the nick is religated,
           liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit religation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  In addition to differences in
           structure and some biochemical properties,
           Trypanosomatid parasite topos I differ from human topo I
           in their sensitivity to CPTs and other classical topo I
           inhibitors. Trypanosomatid topos I have putative roles
           in organizing the kinetoplast DNA network unique to
           these parasites.  This family may represent more than
           one structural domain.
          Length = 217

 Score = 26.0 bits (57), Expect = 9.8
 Identities = 9/31 (29%), Positives = 20/31 (64%)

Query: 36  EEKKKEEKKKEKKKKEEEKKKEGEKRRTLTF 66
           EE+K+++K   K++KE +KK+  ++     +
Sbjct: 94  EEEKEKKKNLNKEEKEAKKKERAKREYPFNY 124


>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
          Length = 813

 Score = 26.6 bits (59), Expect = 9.8
 Identities = 10/52 (19%), Positives = 21/52 (40%)

Query: 5   STTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKK 56
                   KRR+  K+   E       E++ + K  ++  ++ KK   + +K
Sbjct: 745 GDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKAKKPSAKTQK 796


>gnl|CDD|178450 PLN02859, PLN02859, glutamine-tRNA ligase.
          Length = 788

 Score = 26.6 bits (59), Expect = 9.9
 Identities = 12/27 (44%), Positives = 16/27 (59%)

Query: 31  EEEEEEEKKKEEKKKEKKKKEEEKKKE 57
           E+   + +K  +KKKEK  K EEKK  
Sbjct: 181 EKTAADNEKPVKKKKEKPAKVEEKKVA 207


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.307    0.130    0.369 

Gapped
Lambda     K      H
   0.267   0.0812    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 9,430,812
Number of extensions: 910517
Number of successful extensions: 13647
Number of sequences better than 10.0: 1
Number of HSP's gapped: 8373
Number of HSP's successfully gapped: 2076
Length of query: 180
Length of database: 10,937,602
Length adjustment: 91
Effective length of query: 89
Effective length of database: 6,901,388
Effective search space: 614223532
Effective search space used: 614223532
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 43 (22.0 bits)
S2: 56 (25.2 bits)