RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy6121
         (126 letters)



>gnl|CDD|216930 pfam02210, Laminin_G_2, Laminin G domain.  This family includes
          the Thrombospondin N-terminal-like domain, a Laminin G
          subfamily.
          Length = 124

 Score = 43.2 bits (102), Expect = 2e-06
 Identities = 15/53 (28%), Positives = 27/53 (50%), Gaps = 2/53 (3%)

Query: 12 DGSDGSDYLNVAIKDGGVAINLKLAMGKLDMYIRPNRLRFDDNQWHHLSIHRR 64
           G DG D+L + ++DG + +   L  G     +  +  + +D QWH +S+ R 
Sbjct: 14 GGEDGLDFLALELEDGRLVLRYDL--GSGGSVLLLSGKKLNDGQWHRVSVSRD 64


>gnl|CDD|214598 smart00282, LamG, Laminin G domain. 
          Length = 132

 Score = 42.7 bits (101), Expect = 5e-06
 Identities = 13/52 (25%), Positives = 25/52 (48%), Gaps = 2/52 (3%)

Query: 12 DGSDGSDYLNVAIKDGGVAINLKLAMGKLDMYIRPNRLRFDDNQWHHLSIHR 63
              G DYL + ++DG +   L+  +G     +  +    +D QWH +++ R
Sbjct: 19 GSKGGGDYLALELRDGRL--VLRYDLGSGPARLTSDPTPLNDGQWHRVAVER 68


>gnl|CDD|238058 cd00110, LamG, Laminin G domain; Laminin G-like domains are
          usually Ca++ mediated receptors that can have binding
          sites for steroids, beta1 integrins, heparin,
          sulfatides, fibulin-1, and alpha-dystroglycans.
          Proteins that contain LamG domains serve a variety of
          purposes including signal transduction via cell-surface
          steroid receptors, adhesion, migration and
          differentiation through mediation of cell adhesion
          molecules.
          Length = 151

 Score = 40.9 bits (96), Expect = 3e-05
 Identities = 14/52 (26%), Positives = 25/52 (48%), Gaps = 3/52 (5%)

Query: 12 DGSDGSDYLNVAIKDGGVAINLKLAMGKLDMYIRPNRLRFDDNQWHHLSIHR 63
             +G D+L + ++DG + +   L  G L   +       +D QWH +S+ R
Sbjct: 41 GSQNGGDFLALELEDGRLVLRYDLGSGSL--VLSSKT-PLNDGQWHSVSVER 89


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 39.1 bits (92), Expect = 2e-04
 Identities = 19/52 (36%), Positives = 34/52 (65%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGRG 124
           K+ E++K+ + K      +K EEEE E++K+E++++ EEEE +EE+ E    
Sbjct: 421 KKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEK 472



 Score = 38.7 bits (91), Expect = 3e-04
 Identities = 17/48 (35%), Positives = 33/48 (68%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
             E+ +K+RE++ +E++ +    ++ EE++EE+KEK EEE+ +EE   
Sbjct: 415 IVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEA 462



 Score = 36.8 bits (86), Expect = 0.001
 Identities = 20/50 (40%), Positives = 38/50 (76%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGG 122
           +++EKKK+     ++EE+E+ E+E+ EE+KEE++E+ EEE+ +EE ++  
Sbjct: 426 EKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475



 Score = 36.8 bits (86), Expect = 0.001
 Identities = 21/54 (38%), Positives = 35/54 (64%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGRGGR 126
           K EE+KK ++ K    + ++ EEEE +E+KEE+KE+ EEE  +E+  E  +  +
Sbjct: 422 KREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475



 Score = 36.4 bits (85), Expect = 0.002
 Identities = 17/52 (32%), Positives = 33/52 (63%)

Query: 69  TGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           TG  K  +K K+  +K E++ +E+ +E++ +    +KKE+ EEEE +++  E
Sbjct: 403 TGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEE 454



 Score = 34.1 bits (79), Expect = 0.013
 Identities = 19/46 (41%), Positives = 34/46 (73%)

Query: 72  GKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
            KE++KK     K EEEE+E+ E++E E+++EE++ + E+EE +E+
Sbjct: 427 KKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEK 472



 Score = 33.7 bits (78), Expect = 0.014
 Identities = 17/41 (41%), Positives = 28/41 (68%)

Query: 72  GKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEE 112
                KKK  E++ E+E+ E+ +EEE EE +EEK+E+ E++
Sbjct: 433 KAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKK 473



 Score = 29.1 bits (66), Expect = 0.56
 Identities = 15/36 (41%), Positives = 31/36 (86%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEK 108
           +EEE++K ++++ +EEE+E+ EEE+ EE++++KK+ 
Sbjct: 442 EEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKKQA 477


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 38.4 bits (89), Expect = 3e-04
 Identities = 17/38 (44%), Positives = 26/38 (68%)

Query: 67  EGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEE 104
            G   G + E+++  E++ EEEE+E+ EEEE EE+ EE
Sbjct: 855 GGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892



 Score = 37.7 bits (87), Expect = 7e-04
 Identities = 18/31 (58%), Positives = 25/31 (80%)

Query: 83  DKVEEEEDEKLEEEEMEEQKEEKKEKVEEEE 113
           D  EEEE+E+ EEEE EE++EE++E+ E EE
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892



 Score = 37.7 bits (87), Expect = 7e-04
 Identities = 17/31 (54%), Positives = 25/31 (80%)

Query: 87  EEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
           + E+E+ EEEE EE++EE++E+ EEEE  EE
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892



 Score = 37.3 bits (86), Expect = 0.001
 Identities = 23/55 (41%), Positives = 34/55 (61%), Gaps = 3/55 (5%)

Query: 65  IQEGTGCGKEEEKKKRRE---DKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKE 116
             E  G  K++EK        D  + EE+E+ EEEE EE++EE++E+ EEEE +E
Sbjct: 838 NAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892



 Score = 36.9 bits (85), Expect = 0.001
 Identities = 24/83 (28%), Positives = 39/83 (46%), Gaps = 15/83 (18%)

Query: 53  DNQWHHLSIHRRIQEGTGC-GKEEEKKKRREDKVEEEEDEK--------------LEEEE 97
             +  H        + T    +  E++   E++ E ++DEK               EEEE
Sbjct: 810 GEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEE 869

Query: 98  MEEQKEEKKEKVEEEEVKEERRE 120
            EE++EE++E+ EEEE +EE  E
Sbjct: 870 EEEEEEEEEEEEEEEEEEEENEE 892



 Score = 29.2 bits (65), Expect = 0.58
 Identities = 16/53 (30%), Positives = 25/53 (47%)

Query: 72  GKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGRG 124
           G E+E +    ++ EE EDE   E E + + E + ++ E E   E   EG   
Sbjct: 725 GTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKED 777



 Score = 28.4 bits (63), Expect = 1.3
 Identities = 15/51 (29%), Positives = 19/51 (37%)

Query: 72  GKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGG 122
           G   E +   E   EE E     E E  E+   + E+  E E K E    G
Sbjct: 633 GDVAEAEHTGERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEG 683



 Score = 28.0 bits (62), Expect = 1.4
 Identities = 14/52 (26%), Positives = 24/52 (46%)

Query: 72  GKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGR 123
           G E E++   E K E E + ++  E   EQ+ E + + +E + K E      
Sbjct: 664 GGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEV 715



 Score = 28.0 bits (62), Expect = 1.7
 Identities = 19/55 (34%), Positives = 28/55 (50%)

Query: 67  EGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREG 121
           E  G  +  E+ +  ED+ E E + K E E   ++KE + E   E E KE+  EG
Sbjct: 727 EDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEG 781



 Score = 27.7 bits (61), Expect = 2.2
 Identities = 13/36 (36%), Positives = 25/36 (69%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEK 108
           +EEE+++  E++ EEEE+E+  EE +  +  E ++K
Sbjct: 869 EEEEEEEEEEEEEEEEEEEEENEEPLSLEWPETRQK 904



 Score = 27.7 bits (61), Expect = 2.2
 Identities = 17/56 (30%), Positives = 24/56 (42%), Gaps = 1/56 (1%)

Query: 68  GTGCGKEEEKKKRREDKV-EEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGG 122
           G   G+E E+    E +  EE   E  +E E E + E + E     E K E+   G
Sbjct: 642 GERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEG 697



 Score = 26.9 bits (59), Expect = 3.8
 Identities = 12/56 (21%), Positives = 22/56 (39%)

Query: 67  EGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGG 122
                G+  E+     ++  E E +   E E E   E K E+  E E++ +  +  
Sbjct: 652 PTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHK 707



 Score = 26.5 bits (58), Expect = 4.7
 Identities = 18/60 (30%), Positives = 25/60 (41%), Gaps = 1/60 (1%)

Query: 67  EGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKK-EKVEEEEVKEERREGGRGG 125
           E      E E +   E   +E E E  EE E  E + E + E   E E + +R+E    G
Sbjct: 709 ETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEG 768


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 37.9 bits (88), Expect = 5e-04
 Identities = 15/52 (28%), Positives = 33/52 (63%)

Query: 74  EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGRGG 125
           ++  + + E    EE + K +EE++++++ EKK+K + +E KE+R++  R  
Sbjct: 383 KKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434



 Score = 34.8 bits (80), Expect = 0.007
 Identities = 13/67 (19%), Positives = 28/67 (41%)

Query: 52  DDNQWHHLSIHRRIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEE 111
           DDN    L++ +  +E          K++   K   ++ +   ++  E + E    +  E
Sbjct: 340 DDNDLTLLALTKLYEEVKSNTDLSGDKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAE 399

Query: 112 EEVKEER 118
            + KEE+
Sbjct: 400 AKAKEEK 406


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 37.2 bits (87), Expect = 0.001
 Identities = 17/45 (37%), Positives = 29/45 (64%)

Query: 76  EKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
            +  R+ DK  EEE+EK+ +   EE++EE +EK EE++ +E   +
Sbjct: 255 PEVLRKVDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAK 299



 Score = 32.6 bits (75), Expect = 0.034
 Identities = 17/48 (35%), Positives = 32/48 (66%), Gaps = 6/48 (12%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEV-KEERR 119
           ++EE ++++E+K +EE + KL +   EEQ+     K+EE+E  K+ R+
Sbjct: 280 RQEEAQEKKEEKKKEEREAKLAKLSPEEQR-----KLEEKERKKQARK 322



 Score = 32.2 bits (74), Expect = 0.039
 Identities = 16/47 (34%), Positives = 32/47 (68%), Gaps = 2/47 (4%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERR 119
           +EEE+K  +    EEE  E+ +E++ E++KEE++ K+ +   +E+R+
Sbjct: 266 EEEEEKILKAA--EEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRK 310



 Score = 31.5 bits (72), Expect = 0.092
 Identities = 17/47 (36%), Positives = 29/47 (61%)

Query: 74  EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           EE+  K  E++ +EE  EK EE++ EE++ +  +   EE+ K E +E
Sbjct: 269 EEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKE 315


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 35.4 bits (81), Expect = 0.004
 Identities = 18/48 (37%), Positives = 30/48 (62%), Gaps = 3/48 (6%)

Query: 79  KRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGRGGR 126
           K+RE+ VE+ + E   E++  E++E +KEK +E E +E  RE  R  +
Sbjct: 579 KKREEAVEKAKREA--EQKAREEREREKEKEKERE-REREREAERAAK 623



 Score = 31.2 bits (70), Expect = 0.13
 Identities = 18/43 (41%), Positives = 26/43 (60%), Gaps = 3/43 (6%)

Query: 78  KKRRE--DKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEER 118
           KKR E  +K + E ++K  EE   E KE++KE+  E E + ER
Sbjct: 579 KKREEAVEKAKREAEQKAREERERE-KEKEKEREREREREAER 620



 Score = 31.2 bits (70), Expect = 0.13
 Identities = 14/37 (37%), Positives = 24/37 (64%), Gaps = 2/37 (5%)

Query: 73  KEE--EKKKRREDKVEEEEDEKLEEEEMEEQKEEKKE 107
           +EE  EK KR  ++   EE E+ +E+E E ++E ++E
Sbjct: 581 REEAVEKAKREAEQKAREEREREKEKEKERERERERE 617



 Score = 29.3 bits (65), Expect = 0.56
 Identities = 14/41 (34%), Positives = 23/41 (56%)

Query: 63  RRIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKE 103
           ++ +E     K E ++K RE++  E+E EK  E E E + E
Sbjct: 579 KKREEAVEKAKREAEQKAREEREREKEKEKEREREREREAE 619


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 34.4 bits (79), Expect = 0.011
 Identities = 22/44 (50%), Positives = 34/44 (77%)

Query: 76  EKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERR 119
           EK+  +E   EEEE+EK E++E EE+  +K+E+V+EEE KEE++
Sbjct: 28  EKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKK 71



 Score = 33.6 bits (77), Expect = 0.015
 Identities = 18/47 (38%), Positives = 36/47 (76%)

Query: 74  EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           +EE+++ +E+K EEEE    +EEE++E++E++++K + ++VKE   E
Sbjct: 37  DEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTE 83



 Score = 33.2 bits (76), Expect = 0.025
 Identities = 20/47 (42%), Positives = 31/47 (65%), Gaps = 3/47 (6%)

Query: 74  EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKK---EKVEEEEVKEE 117
           EEE++K  + + EE+  +K EE + EE+KEEKK   +KV+E   + E
Sbjct: 39  EEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWE 85



 Score = 32.4 bits (74), Expect = 0.044
 Identities = 14/41 (34%), Positives = 28/41 (68%)

Query: 80  RREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           +  +K   +E+E+ E+EE +E++E+  +K EE + +EE+ E
Sbjct: 29  KEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEE 69



 Score = 30.5 bits (69), Expect = 0.19
 Identities = 12/42 (28%), Positives = 29/42 (69%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEV 114
           +++E++++  DK EE ++E+ +EE+ ++ K+ K+   E E +
Sbjct: 46  EKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELL 87



 Score = 26.7 bits (59), Expect = 3.7
 Identities = 16/43 (37%), Positives = 29/43 (67%), Gaps = 1/43 (2%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEK-KEKVEEEEV 114
           KEE+K++  +   +EEE ++ EE+E +++K +K KE   E E+
Sbjct: 44  KEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWEL 86


>gnl|CDD|215212 PLN02372, PLN02372, violaxanthin de-epoxidase.
          Length = 455

 Score = 33.3 bits (76), Expect = 0.023
 Identities = 18/52 (34%), Positives = 29/52 (55%), Gaps = 1/52 (1%)

Query: 71  CGKEEEKKKRREDKVEEEEDEKLEE-EEMEEQKEEKKEKVEEEEVKEERREG 121
           CG E    +R E  VEE E   ++E  ++EE+ E++ EK+ +EE    +R  
Sbjct: 356 CGPEPPLLERLEKDVEEGEKTIVKEARQIEEELEKEVEKLGKEEESLFKRVA 407


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 32.9 bits (76), Expect = 0.035
 Identities = 20/45 (44%), Positives = 28/45 (62%)

Query: 75  EEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERR 119
           EE ++  E K EE E    E E+++E+ EEKKEK++EEE K    
Sbjct: 526 EELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEE 570



 Score = 26.3 bits (59), Expect = 6.0
 Identities = 21/53 (39%), Positives = 37/53 (69%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGRGG 125
           K +E+ + +++K++EEED+ LEE E E Q+  K+ K E +E+ +E R+  +GG
Sbjct: 548 KLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKGG 600


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 32.8 bits (74), Expect = 0.038
 Identities = 19/58 (32%), Positives = 33/58 (56%)

Query: 63   RRIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
            ++ +E       EE KK  EDK + EE +K EE+E +  +  KKE  E ++ +E +++
Sbjct: 1653 KKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKK 1710



 Score = 32.4 bits (73), Expect = 0.052
 Identities = 15/48 (31%), Positives = 32/48 (66%)

Query: 73   KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
            K EE +K+  + +++E +E  + EE+++++ E+K+K EE +  EE  +
Sbjct: 1682 KAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENK 1729



 Score = 32.4 bits (73), Expect = 0.054
 Identities = 19/48 (39%), Positives = 33/48 (68%)

Query: 73   KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
            KE E+KK+ E+  + EE+ K++ EE +++ EE K+K EE +  EE ++
Sbjct: 1710 KEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKK 1757



 Score = 31.6 bits (71), Expect = 0.086
 Identities = 20/49 (40%), Positives = 29/49 (59%), Gaps = 5/49 (10%)

Query: 73   KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKE-----KVEEEEVKE 116
            K EE KK+ E+  + EEDEK   E ++++ EE K+     K E EE K+
Sbjct: 1669 KAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKK 1717



 Score = 30.9 bits (69), Expect = 0.17
 Identities = 19/45 (42%), Positives = 29/45 (64%)

Query: 73   KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
            K EE+KK+ E   ++E +EK + EE+++ +EE K K  EE  K E
Sbjct: 1627 KAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAE 1671



 Score = 30.9 bits (69), Expect = 0.18
 Identities = 18/45 (40%), Positives = 29/45 (64%)

Query: 73   KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
            KE E+KK+ E+  + EE+ K++  E  ++ EE K+K EE +  EE
Sbjct: 1641 KEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEE 1685



 Score = 30.5 bits (68), Expect = 0.19
 Identities = 17/45 (37%), Positives = 30/45 (66%)

Query: 73   KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
            KE E+ K+ E+  ++E +EK + EE+++ +EE K K EE + + E
Sbjct: 1696 KEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAE 1740



 Score = 30.5 bits (68), Expect = 0.26
 Identities = 18/58 (31%), Positives = 34/58 (58%)

Query: 63   RRIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
            ++  E     + EEKKK  E K  EEE++    EE ++ +E+KK+  E ++ +E+ ++
Sbjct: 1632 KKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKK 1689



 Score = 30.1 bits (67), Expect = 0.27
 Identities = 19/44 (43%), Positives = 30/44 (68%)

Query: 74   EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
            EE+KKK  E K +EEE +K+   + EE+K+ ++ + E+E V EE
Sbjct: 1740 EEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEE 1783



 Score = 29.0 bits (64), Expect = 0.85
 Identities = 17/45 (37%), Positives = 22/45 (48%)

Query: 73   KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
            K EE KK  E    E E    E E  EE+ E  ++K EE + K +
Sbjct: 1337 KAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKAD 1381



 Score = 29.0 bits (64), Expect = 0.87
 Identities = 19/48 (39%), Positives = 30/48 (62%)

Query: 73   KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
            K EE KK  ED+ +  E  K E EE ++ +E KK++ EE++  EE ++
Sbjct: 1676 KAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKK 1723



 Score = 28.6 bits (63), Expect = 0.89
 Identities = 19/57 (33%), Positives = 36/57 (63%)

Query: 63   RRIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERR 119
             RI+E     +EE+K K  E K  EE   K EE +  E++++K E+++++E +E+++
Sbjct: 1592 ARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKK 1648



 Score = 28.6 bits (63), Expect = 1.0
 Identities = 17/45 (37%), Positives = 29/45 (64%)

Query: 73   KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
            K EE KK+  ++ ++ E+ K  EEE + + EE K++ EE++ K E
Sbjct: 1703 KAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAE 1747



 Score = 27.0 bits (59), Expect = 3.0
 Identities = 13/48 (27%), Positives = 31/48 (64%)

Query: 73   KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
            K+ E+ K++ D++++    K + +E +++ EEKK+  E ++  EE ++
Sbjct: 1398 KKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKK 1445



 Score = 27.0 bits (59), Expect = 3.2
 Identities = 15/47 (31%), Positives = 29/47 (61%)

Query: 74   EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
            EE KK   E+K++  E+ K  EE+ ++ +E KK + +E++  E  ++
Sbjct: 1650 EELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKK 1696



 Score = 26.6 bits (58), Expect = 4.3
 Identities = 14/45 (31%), Positives = 33/45 (73%)

Query: 73   KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
            K+ E+ K + +++++ E+EK + E++++++ E+K+K EE +  EE
Sbjct: 1613 KKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEE 1657



 Score = 26.6 bits (58), Expect = 4.4
 Identities = 20/48 (41%), Positives = 29/48 (60%), Gaps = 1/48 (2%)

Query: 73   KEEEKKKRREDKVEEEEDEKLEEEEMEE-QKEEKKEKVEEEEVKEERR 119
            K EE KK  E K  EE+ +  E ++ EE +K E+K+K EE +  EE +
Sbjct: 1529 KAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDK 1576



 Score = 26.3 bits (57), Expect = 5.4
 Identities = 10/48 (20%), Positives = 31/48 (64%)

Query: 73   KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
            K++ ++ ++ D+ +++ +E  + +E +++ EE K+K +E +   E ++
Sbjct: 1463 KKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKK 1510



 Score = 26.3 bits (57), Expect = 5.5
 Identities = 19/57 (33%), Positives = 35/57 (61%)

Query: 64   RIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
            ++ E     K EE KK  E K++ EE +K EEE+ + ++ +KKE  E+++ +E ++ 
Sbjct: 1599 KLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKA 1655



 Score = 26.3 bits (57), Expect = 5.6
 Identities = 19/55 (34%), Positives = 29/55 (52%), Gaps = 4/55 (7%)

Query: 73   KEEEKKKRREDKVEEEEDEKLEEEEM----EEQKEEKKEKVEEEEVKEERREGGR 123
              E  KK  E+  + EE +K E EE     E +K E++ K++ EE K+E  E  +
Sbjct: 1690 AAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKK 1744



 Score = 26.3 bits (57), Expect = 5.6
 Identities = 18/48 (37%), Positives = 31/48 (64%), Gaps = 1/48 (2%)

Query: 73   KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
            K EEKKK  E K + EED+K + +E+++    KK+  E ++  EE+++
Sbjct: 1386 KAEEKKKADEAKKKAEEDKK-KADELKKAAAAKKKADEAKKKAEEKKK 1432



 Score = 26.3 bits (57), Expect = 6.0
 Identities = 16/48 (33%), Positives = 27/48 (56%), Gaps = 2/48 (4%)

Query: 73   KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
            KEE KKK    K + EE +K   +E +++ EE K+K +E +     ++
Sbjct: 1373 KEEAKKKADAAKKKAEEKKK--ADEAKKKAEEDKKKADELKKAAAAKK 1418



 Score = 26.3 bits (57), Expect = 6.8
 Identities = 17/50 (34%), Positives = 32/50 (64%)

Query: 74   EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGR 123
            EEE K + E+  +E E++K + EE ++ +EEKK+    ++ +E++ E  R
Sbjct: 1725 EEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIR 1774



 Score = 26.3 bits (57), Expect = 7.2
 Identities = 19/50 (38%), Positives = 28/50 (56%), Gaps = 5/50 (10%)

Query: 73   KEEEKKKRREDKVEEEEDEKLEE-----EEMEEQKEEKKEKVEEEEVKEE 117
            K EEKKK  E K + EE +K +E     EE +++ +  K+K EE +   E
Sbjct: 1297 KAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAE 1346



 Score = 25.9 bits (56), Expect = 7.5
 Identities = 16/47 (34%), Positives = 32/47 (68%)

Query: 74   EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
            EE KK   E+K++ EE +K  EE+ ++ +E KK++ E++++   ++E
Sbjct: 1719 EELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKE 1765



 Score = 25.9 bits (56), Expect = 8.6
 Identities = 15/47 (31%), Positives = 25/47 (53%), Gaps = 1/47 (2%)

Query: 73   KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERR 119
            K EE KK  E K + EE +K + +  +++ EE K+  E  + + E  
Sbjct: 1310 KAEEAKKADEAKKKAEEAKK-KADAAKKKAEEAKKAAEAAKAEAEAA 1355


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 32.6 bits (74), Expect = 0.039
 Identities = 20/49 (40%), Positives = 33/49 (67%)

Query: 72  GKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
             +EEKKK++E   EE +D K +EE  E++  ++KEK +E++V+E R  
Sbjct: 112 QVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDR 160



 Score = 29.1 bits (65), Expect = 0.58
 Identities = 15/48 (31%), Positives = 33/48 (68%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           KE + +  +E++ E+E+ ++ ++++ E+ KEE K++  +EE KE+R  
Sbjct: 96  KEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPP 143



 Score = 28.3 bits (63), Expect = 1.3
 Identities = 14/51 (27%), Positives = 34/51 (66%), Gaps = 1/51 (1%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGR 123
           ++E+ K+ ++ K +E+  E+ ++ + +E+ +EK+   E+E+ KE++ E  R
Sbjct: 109 EKEQVKEEKK-KKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPR 158



 Score = 28.3 bits (63), Expect = 1.3
 Identities = 17/46 (36%), Positives = 29/46 (63%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEER 118
           K +E+ K R+ K E +E    +E+E E++K+ ++ +  EEE K ER
Sbjct: 123 KPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRER 168


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 32.6 bits (75), Expect = 0.041
 Identities = 12/45 (26%), Positives = 24/45 (53%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
           K+  KK+R+ +K  E+E+ +    + + +   KK K  + E K+ 
Sbjct: 412 KKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKV 456


>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 156 to 174 amino acids in length. This domain is
           found associated with pfam07780, pfam01728.
          Length = 154

 Score = 31.9 bits (73), Expect = 0.041
 Identities = 17/48 (35%), Positives = 30/48 (62%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           ++     ++E + EEEE+ ++EE + EEQ +E  EK   +  +E+RRE
Sbjct: 91  RKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRE 138



 Score = 29.9 bits (68), Expect = 0.19
 Identities = 12/48 (25%), Positives = 30/48 (62%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           KE+E+++  E +VEE ++E+  +E +E++  + K +   E  ++++  
Sbjct: 99  KEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKEI 146



 Score = 29.6 bits (67), Expect = 0.28
 Identities = 15/46 (32%), Positives = 30/46 (65%)

Query: 74  EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERR 119
            ++K+K  E++ E E +E  EEE+++E  E++  K++ E+ +E  R
Sbjct: 96  LDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENER 141



 Score = 27.3 bits (61), Expect = 1.8
 Identities = 14/46 (30%), Positives = 30/46 (65%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEER 118
           +EEE++   E+  EEE+ ++L E+E+ + K EK+ + E ++ +  +
Sbjct: 103 EEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKEILK 148



 Score = 26.9 bits (60), Expect = 2.1
 Identities = 12/49 (24%), Positives = 28/49 (57%)

Query: 72  GKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           G ++++K+  E++  E E+   EE+  E  ++E  +   E+  + ER++
Sbjct: 95  GLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQ 143


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 32.5 bits (74), Expect = 0.043
 Identities = 14/60 (23%), Positives = 35/60 (58%)

Query: 64  RIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGR 123
           R Q+     K  E++++++++ + EE ++ +  E E  K+ +KE++  +E K++  E  +
Sbjct: 66  RQQQQQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAK 125



 Score = 29.0 bits (65), Expect = 0.62
 Identities = 15/54 (27%), Positives = 29/54 (53%), Gaps = 6/54 (11%)

Query: 73  KEEEKKKRREDKVEE------EEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
            EE++KK+ + + EE       E E+L++ E E    ++++K  EE  K+   +
Sbjct: 77  AEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALK 130



 Score = 29.0 bits (65), Expect = 0.66
 Identities = 9/43 (20%), Positives = 27/43 (62%)

Query: 75  EEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
           ++K+   ++++++ E E+L  +E ++Q EE  ++   ++ + E
Sbjct: 93  QQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAE 135



 Score = 27.5 bits (61), Expect = 2.3
 Identities = 12/44 (27%), Positives = 24/44 (54%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKE 116
            E+E+ K+ E +    +++K + EE  +Q   K+++ EE   K 
Sbjct: 98  AEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKA 141



 Score = 26.3 bits (58), Expect = 4.9
 Identities = 11/44 (25%), Positives = 18/44 (40%)

Query: 74  EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
           +E+KK+  E   +    +K  EE   +     K K E E  +  
Sbjct: 114 QEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAA 157



 Score = 26.3 bits (58), Expect = 5.4
 Identities = 10/44 (22%), Positives = 20/44 (45%)

Query: 74  EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
           E+E+   +E K + EE  K    + ++ +E   +     + K E
Sbjct: 107 EKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAE 150



 Score = 25.5 bits (56), Expect = 9.7
 Identities = 16/51 (31%), Positives = 21/51 (41%), Gaps = 4/51 (7%)

Query: 74  EEEKKKRRED----KVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
             E KK+ E     K   E  +K E E   +   E K+K E E  K+   E
Sbjct: 164 AAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAE 214


>gnl|CDD|233830 TIGR02350, prok_dnaK, chaperone protein DnaK.  Members of this
           family are the chaperone DnaK, of the DnaK-DnaJ-GrpE
           chaperone system. All members of the seed alignment were
           taken from completely sequenced bacterial or archaeal
           genomes and (except for Mycoplasma sequence) found
           clustered with other genes of this systems. This model
           excludes DnaK homologs that are not DnaK itself, such as
           the heat shock cognate protein HscA (TIGR01991).
           However, it is not designed to distinguish among DnaK
           paralogs in eukaryotes. Note that a number of dnaK genes
           have shadow ORFs in the same reverse (relative to dnaK)
           reading frame, a few of which have been assigned
           glutamate dehydrogenase activity. The significance of
           this observation is unclear; lengths of such shadow ORFs
           are highly variable as if the presumptive protein
           product is not conserved [Protein fate, Protein folding
           and stabilization].
          Length = 595

 Score = 32.3 bits (74), Expect = 0.048
 Identities = 16/65 (24%), Positives = 31/65 (47%), Gaps = 12/65 (18%)

Query: 64  RIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEE------------EMEEQKEEKKEKVEE 111
            I   +G  +EE ++  +E +   EED+K +EE            + E+  +E  +K+  
Sbjct: 491 TITASSGLSEEEIERMVKEAEANAEEDKKRKEEIEARNNADSLAYQAEKTLKEAGDKLPA 550

Query: 112 EEVKE 116
           EE ++
Sbjct: 551 EEKEK 555


>gnl|CDD|217829 pfam03985, Paf1, Paf1.  Members of this family are components of
           the RNA polymerase II associated Paf1 complex. The Paf1
           complex functions during the elongation phase of
           transcription in conjunction with Spt4-Spt5 and
           Spt16-Pob3i.
          Length = 431

 Score = 32.4 bits (74), Expect = 0.049
 Identities = 14/51 (27%), Positives = 25/51 (49%)

Query: 74  EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGRG 124
           + E+    ED+ EE+  ++ EEEE E+ +EE  +  E+   +     G   
Sbjct: 366 DFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDS 416



 Score = 29.7 bits (67), Expect = 0.31
 Identities = 15/52 (28%), Positives = 31/52 (59%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGRG 124
           K  +K++ R D ++ EE ++ E+EE E++ +E +E+  E+  +E  +    G
Sbjct: 353 KMRDKRRARLDPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDG 404



 Score = 28.2 bits (63), Expect = 1.2
 Identities = 10/48 (20%), Positives = 21/48 (43%)

Query: 74  EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREG 121
           EEE+++  E + EE ED + E  +  E    +       + + +  + 
Sbjct: 376 EEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKE 423



 Score = 28.2 bits (63), Expect = 1.4
 Identities = 14/47 (29%), Positives = 26/47 (55%)

Query: 74  EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           E+E++++R D+ EEEE E  EEE  + +++   E   +     E + 
Sbjct: 374 EDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKA 420



 Score = 27.0 bits (60), Expect = 2.9
 Identities = 11/54 (20%), Positives = 19/54 (35%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGRGGR 126
            E+E ++  +   E EE+E  + EE   Q  E        +V  +         
Sbjct: 371 DEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKES 424


>gnl|CDD|237028 PRK12267, PRK12267, methionyl-tRNA synthetase; Reviewed.
          Length = 648

 Score = 32.1 bits (74), Expect = 0.053
 Identities = 9/34 (26%), Positives = 20/34 (58%)

Query: 84  KVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
           +++ EE+    +E+ME    ++ E+ E++  K E
Sbjct: 509 RIDVEEEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542



 Score = 29.0 bits (66), Expect = 0.71
 Identities = 16/71 (22%), Positives = 30/71 (42%), Gaps = 2/71 (2%)

Query: 47  NRLRFDDNQ--WHHLSIHRRIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEE 104
            +L  ++    W  L     +  GT   K E    R + + E    ++  E    ++ EE
Sbjct: 474 EQLGLEEELTSWESLLEWGGLPAGTKVAKGEPLFPRIDVEEEIAYIKEQMEGSAPKEPEE 533

Query: 105 KKEKVEEEEVK 115
           K++K E+ E+ 
Sbjct: 534 KEKKPEKPEIT 544


>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
           Provisional.
          Length = 2849

 Score = 32.3 bits (73), Expect = 0.055
 Identities = 19/54 (35%), Positives = 35/54 (64%), Gaps = 6/54 (11%)

Query: 74  EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVK-----EERREGG 122
            EE    R++ V +++DE  +E++ +E+ +E++E+ EEEE+K     +E  EGG
Sbjct: 141 AEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEE-EEEEIKGFDDEDEEDEGG 193



 Score = 30.4 bits (68), Expect = 0.22
 Identities = 13/39 (33%), Positives = 29/39 (74%)

Query: 82  EDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           +D+ E+E+D+  E++E EE++EE+ +  ++E+ ++E  E
Sbjct: 156 DDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGE 194



 Score = 26.2 bits (57), Expect = 6.6
 Identities = 21/87 (24%), Positives = 39/87 (44%), Gaps = 5/87 (5%)

Query: 34  KLAMGKLDMYIRPNRLRFDDNQWHHLSIHRRIQEGTGCGKEEEKKKRREDKVEEEEDEKL 93
           K  +G LDM I   R      +         + +      ++E +   +D  E++E+E+ 
Sbjct: 121 KAEIGDLDMIIIKRRRARHLAEEDMSPRDNFVIDD-----DDEDEDEDDDDEEDDEEEEE 175

Query: 94  EEEEMEEQKEEKKEKVEEEEVKEERRE 120
           EEEE++   +E +E    E+   E+ E
Sbjct: 176 EEEEIKGFDDEDEEDEGGEDFTYEKSE 202


>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein.  This is a family of
           fungal and plant proteins and contains many hypothetical
           proteins. VID27 is a cytoplasmic protein that plays a
           potential role in vacuolar protein degradation.
          Length = 794

 Score = 32.0 bits (73), Expect = 0.056
 Identities = 18/60 (30%), Positives = 31/60 (51%), Gaps = 3/60 (5%)

Query: 65  IQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGRG 124
           I +     + E+    R+D+ EE+E+E+ EE+E E      KE  ++EE +E+  E    
Sbjct: 372 ILDAFSALEIEDANTERDDEEEEDEEEEEEEDEDEGP---SKEHSDDEEFEEDDVESKYE 428


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 31.6 bits (72), Expect = 0.057
 Identities = 11/46 (23%), Positives = 32/46 (69%)

Query: 75  EEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           EEK+K +  K + ++ +  ++++ +++K++K EK +E+E +++  +
Sbjct: 79  EEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLED 124



 Score = 28.9 bits (65), Expect = 0.60
 Identities = 10/50 (20%), Positives = 31/50 (62%)

Query: 72  GKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREG 121
             + +KKK ++ K ++++ +  ++++  E+K+EK+ + + E++ +   E 
Sbjct: 83  KWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSET 132



 Score = 26.6 bits (59), Expect = 3.0
 Identities = 11/38 (28%), Positives = 24/38 (63%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVE 110
           K+++KK  ++D   E++DEK  E+++E+  +   E + 
Sbjct: 97  KDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLS 134



 Score = 25.4 bits (56), Expect = 8.2
 Identities = 11/50 (22%), Positives = 28/50 (56%)

Query: 63  RRIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEE 112
           +   +     K+++K K ++D  ++++ EK +E+E E++ E+  +   E 
Sbjct: 83  KWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSET 132



 Score = 25.4 bits (56), Expect = 8.8
 Identities = 10/46 (21%), Positives = 32/46 (69%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEER 118
           KE E+K++ + K ++ + +K ++++ ++ K++ K + ++E+  E++
Sbjct: 76  KEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDK 121


>gnl|CDD|219355 pfam07267, Nucleo_P87, Nucleopolyhedrovirus capsid protein P87.
           This family consists of several Nucleopolyhedrovirus
           capsid protein P87 sequences. P87 is expressed late in
           infection and concentrated in infected cell nuclei.
          Length = 606

 Score = 31.8 bits (72), Expect = 0.067
 Identities = 17/71 (23%), Positives = 35/71 (49%), Gaps = 3/71 (4%)

Query: 46  PNRLRFDDNQWHHLSIHRRIQEGTGCGKEEEKKKRR---EDKVEEEEDEKLEEEEMEEQK 102
             R RF  + +   S+     E +  G   ++K+RR     +   +EDE   +E+  + +
Sbjct: 327 LLRERFPSDLFAISSLPDSDSEASDSGPTRKRKRRRVPPLPEYSSDEDEDDSDEDEVDYE 386

Query: 103 EEKKEKVEEEE 113
           +E+K + EE++
Sbjct: 387 KERKRRREEDK 397


>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein.  Rotavirus particles
           consist of three concentric proteinaceous capsid layers.
           The innermost capsid (core) is made of VP2. The genomic
           RNA and the two minor proteins VP1 and VP3 are
           encapsidated within this layer. The N-terminus of
           rotavirus VP2 is necessary for the encapsidation of VP1
           and VP3.
          Length = 887

 Score = 31.8 bits (72), Expect = 0.070
 Identities = 15/51 (29%), Positives = 27/51 (52%), Gaps = 3/51 (5%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEE---KKEKVEEEEVKEERRE 120
              E      D+++E++DEK +++   E KE+   KKE+V  + V    +E
Sbjct: 5   NRREANINNNDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKE 55



 Score = 26.8 bits (59), Expect = 3.4
 Identities = 14/44 (31%), Positives = 26/44 (59%), Gaps = 4/44 (9%)

Query: 78  KKRREDKVEEEEDEKLEEEEMEEQKE--EKKEKV--EEEEVKEE 117
           + RRE  +   +  + +++E ++QK   E KEKV  ++EEV  +
Sbjct: 4   RNRREANINNNDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTD 47


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
           Fas-binding protein) is thought to play a role in
           apoptosis, but precise role played by Daxx remains to be
           determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 31.8 bits (72), Expect = 0.074
 Identities = 18/39 (46%), Positives = 27/39 (69%)

Query: 82  EDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           ++  EEE  E+ EEEE EE++EE++ + EE E +EE  E
Sbjct: 437 QESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEE 475



 Score = 31.0 bits (70), Expect = 0.12
 Identities = 18/49 (36%), Positives = 30/49 (61%)

Query: 74  EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGG 122
           EEE+    E++ EEEE+E+ +E E EE ++E++E+  E +   E    G
Sbjct: 440 EEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEG 488



 Score = 31.0 bits (70), Expect = 0.13
 Identities = 18/54 (33%), Positives = 30/54 (55%)

Query: 71  CGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGRG 124
             +EE  ++  E++ EEEE+E+  EEE  E +EE++E   +   +EE      G
Sbjct: 439 SEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEG 492



 Score = 31.0 bits (70), Expect = 0.15
 Identities = 18/42 (42%), Positives = 29/42 (69%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEV 114
             +E ++    + EEEE+E+ EEEE E ++EE +++ EEEEV
Sbjct: 435 ASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEV 476



 Score = 30.7 bits (69), Expect = 0.16
 Identities = 20/54 (37%), Positives = 33/54 (61%)

Query: 72  GKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGRGG 125
            +E E+++  E++ EEEE+E+ EE+E EE++ E +E+ EE E      E   G 
Sbjct: 436 SQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGS 489



 Score = 29.9 bits (67), Expect = 0.31
 Identities = 16/47 (34%), Positives = 28/47 (59%)

Query: 67  EGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEE 113
           E     +EEE+++  E++ E+E +E+  E+E EE++ E     EEE 
Sbjct: 440 EEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEM 486



 Score = 29.1 bits (65), Expect = 0.63
 Identities = 18/58 (31%), Positives = 30/58 (51%)

Query: 68  GTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGRGG 125
            +    E      +E + EE  +E+ EEEE EE++E++ E+ E E+ +EE       G
Sbjct: 424 ASSTSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNG 481



 Score = 29.1 bits (65), Expect = 0.66
 Identities = 17/58 (29%), Positives = 32/58 (55%)

Query: 63  RRIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
              +  +  G+      +  ++ E  E+E+ EEEE EE+++E +E+  E+E +EE  E
Sbjct: 420 DPSKASSTSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVE 477



 Score = 28.3 bits (63), Expect = 0.99
 Identities = 20/56 (35%), Positives = 31/56 (55%)

Query: 66  QEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREG 121
           QE       EE+++  E++ EEE++ + EE E EE++EE +     EE  E   EG
Sbjct: 437 QESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEG 492



 Score = 27.2 bits (60), Expect = 2.9
 Identities = 18/52 (34%), Positives = 32/52 (61%)

Query: 74  EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGRGG 125
           EEE+++  E++ EE+E E+ E E+ EE++E + +   EEE++      G G 
Sbjct: 446 EEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGE 497



 Score = 26.0 bits (57), Expect = 8.0
 Identities = 20/56 (35%), Positives = 26/56 (46%), Gaps = 4/56 (7%)

Query: 74  EEEKKKRREDKVEEEEDEKL----EEEEMEEQKEEKKEKVEEEEVKEERREGGRGG 125
           EE++ +  E + EEEE+E       EEEME   E   +  E EE  E R     G 
Sbjct: 458 EEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEMAGI 513


>gnl|CDD|234715 PRK00290, dnaK, molecular chaperone DnaK; Provisional.
          Length = 627

 Score = 31.6 bits (73), Expect = 0.081
 Identities = 14/65 (21%), Positives = 29/65 (44%), Gaps = 12/65 (18%)

Query: 64  RIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEE------------EMEEQKEEKKEKVEE 111
            I   +G   EE ++  ++ +   EED+K +E             + E+  +E  +KV  
Sbjct: 493 TITASSGLSDEEIERMVKDAEANAEEDKKRKELVEARNQADSLIYQTEKTLKELGDKVPA 552

Query: 112 EEVKE 116
           +E ++
Sbjct: 553 DEKEK 557


>gnl|CDD|173502 PTZ00266, PTZ00266, NIMA-related protein kinase; Provisional.
          Length = 1021

 Score = 31.6 bits (71), Expect = 0.083
 Identities = 19/51 (37%), Positives = 33/51 (64%), Gaps = 2/51 (3%)

Query: 78  KKRREDKVEEEEDEKLEEEEMEEQKEEK--KEKVEEEEVKEERREGGRGGR 126
           +K+R +++E EE E+LE E ME  + E+  +E++E E ++ +R E  R  R
Sbjct: 456 EKKRIERLEREERERLERERMERIERERLERERLERERLERDRLERDRLDR 506



 Score = 30.9 bits (69), Expect = 0.17
 Identities = 21/58 (36%), Positives = 32/58 (55%), Gaps = 1/58 (1%)

Query: 67  EGTGCGKEEEKKKRREDKV-EEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGR 123
           E     KE   +K  E K+ E++  E+LE EE E  + E+ E++E E ++ ER E  R
Sbjct: 436 ERARIEKENAHRKALEMKILEKKRIERLEREERERLERERMERIERERLERERLERER 493



 Score = 25.9 bits (56), Expect = 7.3
 Identities = 18/75 (24%), Positives = 39/75 (52%), Gaps = 3/75 (4%)

Query: 48  RLRFDDNQWHHLSIHRRIQEGTGCGKEEEKKKRR--EDKVEEEEDEKLEEEEMEEQKEEK 105
           R R +    H  ++  +I E     + E +++ R   +++E  E E+LE E +E ++ E 
Sbjct: 437 RARIEKENAHRKALEMKILEKKRIERLEREERERLERERMERIERERLERERLERERLE- 495

Query: 106 KEKVEEEEVKEERRE 120
           ++++E + +    RE
Sbjct: 496 RDRLERDRLDRLERE 510


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 31.6 bits (72), Expect = 0.085
 Identities = 12/43 (27%), Positives = 26/43 (60%)

Query: 74  EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKE 116
           EEE K       ++++++K  E + + +K++KK+K ++E   E
Sbjct: 44  EEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSE 86



 Score = 30.5 bits (69), Expect = 0.21
 Identities = 10/37 (27%), Positives = 24/37 (64%)

Query: 77  KKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEE 113
           KK ++EDK  E + +  ++++ +++K+E K + E + 
Sbjct: 55  KKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKL 91


>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
           subunit G; Reviewed.
          Length = 197

 Score = 31.2 bits (70), Expect = 0.090
 Identities = 17/45 (37%), Positives = 30/45 (66%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
           +E  + +RRE+K++E ED++ +E E EEQ  E++    E E+ E+
Sbjct: 139 EERMEWERREEKIDEREDQEEQEREREEQTIEEQSDDSEHEIIEQ 183



 Score = 28.1 bits (62), Expect = 1.2
 Identities = 15/46 (32%), Positives = 34/46 (73%), Gaps = 2/46 (4%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEER 118
           ++E+ +K R+++  EE +E++E E  EE+ +E++++ E+E  +EE+
Sbjct: 124 RQEQIEKARQER--EELEERMEWERREEKIDEREDQEEQEREREEQ 167



 Score = 25.8 bits (56), Expect = 6.6
 Identities = 20/60 (33%), Positives = 38/60 (63%), Gaps = 4/60 (6%)

Query: 64  RIQEGTGCGKEEEKKKRREDKVEEEEDEKL--EEEEMEEQK--EEKKEKVEEEEVKEERR 119
           RI++     K+++ KK++   + +E+ EK   E EE+EE+   E ++EK++E E +EE+ 
Sbjct: 102 RIRDQLRSVKKDDIKKKKSLIIRQEQIEKARQEREELEERMEWERREEKIDEREDQEEQE 161



 Score = 25.4 bits (55), Expect = 9.6
 Identities = 16/48 (33%), Positives = 29/48 (60%), Gaps = 1/48 (2%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           + EEK   RED+ EE+E E+ E+   E+  + + E +E++E + E  +
Sbjct: 146 RREEKIDEREDQ-EEQEREREEQTIEEQSDDSEHEIIEQDESETESDD 192


>gnl|CDD|221250 pfam11831, Myb_Cef, pre-mRNA splicing factor component.  This
           family is a region of the Myb-Related Cdc5p/Cef1
           proteins, in fungi, and is part of the pre-mRNA splicing
           factor complex.
          Length = 363

 Score = 31.2 bits (71), Expect = 0.093
 Identities = 13/40 (32%), Positives = 22/40 (55%)

Query: 81  REDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
            E++ E EE E+  EE+  ++   K+   E +E +E RR 
Sbjct: 151 EEEEEEPEEMEEELEEDAADRDARKRAAEEAKEQEELRRR 190


>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6.  RPL6
           contains KOW motif that has an extra ribosomal role as
           an oncogenic. KOW domain is known as an RNA-binding
           motif that is shared so far among some families of
           ribosomal proteins, the essential bacterial
           transcriptional elongation factor NusG, the eukaryotic
           chromatin elongation factor Spt5, the higher eukaryotic
           KIN17 proteins and Mtr4. .
          Length = 152

 Score = 31.0 bits (71), Expect = 0.096
 Identities = 10/41 (24%), Positives = 25/41 (60%)

Query: 76  EKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKE 116
           +KKK+++ K  E  +EK ++  + E+++E ++ V+   +  
Sbjct: 84  KKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKAVDAALLAA 124


>gnl|CDD|237046 PRK12297, obgE, GTPase CgtA; Reviewed.
          Length = 424

 Score = 31.2 bits (72), Expect = 0.11
 Identities = 17/49 (34%), Positives = 26/49 (53%), Gaps = 1/49 (2%)

Query: 75  EEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGR 123
           +E      + +EE  +  LEEEE+EE+   K E+ EE++    R E G 
Sbjct: 316 DELLYAVAELLEETPEFPLEEEEVEEEVYYKFEE-EEKDFTITRDEDGV 363


>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
          Length = 651

 Score = 31.2 bits (71), Expect = 0.11
 Identities = 16/38 (42%), Positives = 25/38 (65%)

Query: 83  DKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           D  EE + EK E+E ++EQK  +K K +EE+ K+E  +
Sbjct: 548 DDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEK 585



 Score = 28.8 bits (65), Expect = 0.70
 Identities = 11/39 (28%), Positives = 27/39 (69%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEE 111
           KEE ++++ E +  +E+    + ++ EE+K+++ EK+E+
Sbjct: 550 KEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588



 Score = 28.8 bits (65), Expect = 0.77
 Identities = 12/38 (31%), Positives = 25/38 (65%)

Query: 82  EDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERR 119
           E + E+EE E L+E++   + ++++EK ++E  K E+ 
Sbjct: 552 ELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKA 589



 Score = 28.5 bits (64), Expect = 0.91
 Identities = 12/43 (27%), Positives = 24/43 (55%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVK 115
           +  +++KR     ++EE +K E E++E+ K    E  + +E K
Sbjct: 561 EALKEQKRLRKLKKQEEKKKKELEKLEKAKIPPAEFFKRQEDK 603



 Score = 28.5 bits (64), Expect = 1.1
 Identities = 10/37 (27%), Positives = 28/37 (75%)

Query: 84  KVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           K++++E+ + E+EE E  KE+K+ +  +++ +++++E
Sbjct: 546 KLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKE 582



 Score = 26.9 bits (60), Expect = 3.5
 Identities = 10/41 (24%), Positives = 26/41 (63%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEE 113
           +E+E+K+  +++    + +K EE++ +E ++ +K K+   E
Sbjct: 555 REKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAKIPPAE 595


>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
           [Transcription / Chromatin structure and dynamics].
          Length = 279

 Score = 31.1 bits (70), Expect = 0.11
 Identities = 17/52 (32%), Positives = 24/52 (46%)

Query: 70  GCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREG 121
               EEE++    D  EEEEDE++  +   E   E  E+ EEE    +  E 
Sbjct: 173 DVDNEEEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEEEAEGSDDGED 224


>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
          Length = 330

 Score = 31.0 bits (71), Expect = 0.12
 Identities = 11/32 (34%), Positives = 20/32 (62%)

Query: 82  EDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEE 113
           ++ +  +      EEE EE++EE++E+  EEE
Sbjct: 289 KEVLSAQAQAAAAEEEEEEEEEEEEEEPSEEE 320



 Score = 29.4 bits (67), Expect = 0.43
 Identities = 15/44 (34%), Positives = 23/44 (52%)

Query: 82  EDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGRGG 125
           +D ++EE  E L  +      EE++E+ EEEE +E   E    G
Sbjct: 281 KDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEEEAAAG 324



 Score = 27.5 bits (62), Expect = 1.9
 Identities = 13/45 (28%), Positives = 26/45 (57%)

Query: 77  KKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREG 121
            K   +++++E    + +    EE++EE++E+ EEE  +EE   G
Sbjct: 280 DKDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEEEAAAG 324


>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
          Length = 1066

 Score = 31.4 bits (71), Expect = 0.12
 Identities = 15/49 (30%), Positives = 27/49 (55%)

Query: 72  GKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           G   E +  ++   EEE + K ++EE  ++KE KK K  ++E K + + 
Sbjct: 1   GSRTESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQA 49



 Score = 28.3 bits (63), Expect = 1.4
 Identities = 14/54 (25%), Positives = 32/54 (59%), Gaps = 6/54 (11%)

Query: 73  KEEEKKKRREDKVEEEEDEKLE------EEEMEEQKEEKKEKVEEEEVKEERRE 120
           +E E+KK++E+K +E+E +KL+      + +++ Q+      V ++  K+ R+ 
Sbjct: 16  EELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKKSRKR 69



 Score = 26.0 bits (57), Expect = 7.9
 Identities = 12/55 (21%), Positives = 31/55 (56%)

Query: 68  GTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGG 122
           G+    E EKK   E+++E ++ ++ + +E E +K +  +K  + +++ ++   G
Sbjct: 1   GSRTESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDG 55


>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
           component YidC; Validated.
          Length = 429

 Score = 31.0 bits (70), Expect = 0.12
 Identities = 16/82 (19%), Positives = 30/82 (36%), Gaps = 2/82 (2%)

Query: 45  RPNRLRFDDNQWHHLSIHRRIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQK-E 103
           R NRLR     W    +H    E     +  EK + +  K E  +  +  E E+  +  +
Sbjct: 309 RRNRLRMIITPWRAPELHAENAEIKK-TRTAEKNEAKARKKEIAQKRRAAEREINREARQ 367

Query: 104 EKKEKVEEEEVKEERREGGRGG 125
           E+   +     +    +  + G
Sbjct: 368 ERAAAMARARARRAAVKAKKKG 389


>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3.  This family of
           proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). SH3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of SH3, AAPs are retained in the ER.
          Length = 196

 Score = 30.8 bits (70), Expect = 0.14
 Identities = 13/37 (35%), Positives = 19/37 (51%)

Query: 87  EEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGR 123
           E +D KL EE   E+ E      EEE  + E++E  +
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKK 194



 Score = 27.7 bits (62), Expect = 1.3
 Identities = 11/35 (31%), Positives = 16/35 (45%)

Query: 81  REDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVK 115
            +D    EE    E E     KEE+  + E++E K
Sbjct: 159 WKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESK 193



 Score = 26.9 bits (60), Expect = 2.5
 Identities = 11/36 (30%), Positives = 16/36 (44%), Gaps = 1/36 (2%)

Query: 74  EEEKKKRREDKVEEE-EDEKLEEEEMEEQKEEKKEK 108
           ++ K        E E      EEE  E +K+E K+K
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKK 195


>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32.  This family
           consists of several mammalian specific proacrosin
           binding protein sp32 sequences. sp32 is a sperm specific
           protein which is known to bind with with 55- and 53-kDa
           proacrosins and the 49-kDa acrosin intermediate. The
           exact function of sp32 is unclear, it is thought however
           that the binding of sp32 to proacrosin may be involved
           in packaging the acrosin zymogen into the acrosomal
           matrix.
          Length = 243

 Score = 30.8 bits (69), Expect = 0.15
 Identities = 13/53 (24%), Positives = 30/53 (56%)

Query: 72  GKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGRG 124
           G  + K  + + +    + ++  +E   E+K+ ++E+ EEE  +E ++E G+G
Sbjct: 188 GSVQVKAPKPKQEQLLSKLQEYLQEHKTEEKQPQEEQEEEEVEEEAKQEEGQG 240



 Score = 30.0 bits (67), Expect = 0.24
 Identities = 12/49 (24%), Positives = 24/49 (48%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREG 121
                 + +  K ++E+     +E ++E K E+K+  EE+E +E   E 
Sbjct: 185 SLGGSVQVKAPKPKQEQLLSKLQEYLQEHKTEEKQPQEEQEEEEVEEEA 233



 Score = 27.7 bits (61), Expect = 1.7
 Identities = 16/60 (26%), Positives = 35/60 (58%), Gaps = 1/60 (1%)

Query: 59  LSIHRRIQEGTGCGKEEEKKKRREDKVEE-EEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
           LS+   +Q      K+E+   + ++ ++E + +EK  +EE EE++ E++ K EE +  ++
Sbjct: 184 LSLGGSVQVKAPKPKQEQLLSKLQEYLQEHKTEEKQPQEEQEEEEVEEEAKQEEGQGTDD 243


>gnl|CDD|222792 PHA00435, PHA00435, capsid assembly protein.
          Length = 306

 Score = 30.6 bits (69), Expect = 0.16
 Identities = 33/102 (32%), Positives = 45/102 (44%), Gaps = 10/102 (9%)

Query: 20  LNVAIKDGGVAINLKLAMGKLDMYIRPNRLRFDDNQWHHLSIHRRIQEGTGCGKEEEKKK 79
           L+VA +DG  AI L       D Y  P+    DD          RI+       EEE+ +
Sbjct: 34  LDVAARDGDDAIELAEPETSDDPYGNPDPFGEDDEG--------RIEVRISEDGEEEEVE 85

Query: 80  RREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREG 121
             E+  +EEE+ + E EE E   +  +E  E  E  EE  EG
Sbjct: 86  EGEE--DEEEEGEEESEEFEPLGDTPEELTEASEQLEEHEEG 125


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 30.6 bits (69), Expect = 0.18
 Identities = 16/49 (32%), Positives = 25/49 (51%), Gaps = 2/49 (4%)

Query: 74  EEEKKKRRED--KVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           EE K K+  +     E E EK  +EE ++Q EE+ +     E K++  E
Sbjct: 122 EEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAE 170



 Score = 29.8 bits (67), Expect = 0.33
 Identities = 13/45 (28%), Positives = 21/45 (46%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
           KE E++   E   ++ E    + EE ++Q EE K K   E   + 
Sbjct: 92  KELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKA 136



 Score = 28.6 bits (64), Expect = 0.74
 Identities = 11/50 (22%), Positives = 24/50 (48%), Gaps = 3/50 (6%)

Query: 74  EEEKKKRREDKVEEEEDEKLEEE---EMEEQKEEKKEKVEEEEVKEERRE 120
           +E +KK  +   E E+    E+    E+E++   +K   + E+  ++  E
Sbjct: 67  QERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEE 116



 Score = 27.9 bits (62), Expect = 1.4
 Identities = 14/48 (29%), Positives = 20/48 (41%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
            E +KK   E K + E   K + EE + + E  K K   E   +   E
Sbjct: 169 AEAKKKAEAEAKAKAEAKAKAKAEEAKAKAEAAKAKAAAEAAAKAEAE 216



 Score = 27.9 bits (62), Expect = 1.7
 Identities = 11/48 (22%), Positives = 28/48 (58%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           KE+E++K+ E + EE E ++  E+  +++ E++    +  +  E+  +
Sbjct: 65  KEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAK 112



 Score = 26.7 bits (59), Expect = 3.6
 Identities = 11/58 (18%), Positives = 29/58 (50%)

Query: 63  RRIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
            R +E       E+  K+ E   ++ E+++ + EE + ++  + +   E E +++ +E
Sbjct: 89  ARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKE 146



 Score = 26.7 bits (59), Expect = 3.8
 Identities = 14/45 (31%), Positives = 23/45 (51%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
            E EKK + E K + EE+ K +     ++K  + +K  E E K +
Sbjct: 138 AEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAK 182



 Score = 26.0 bits (57), Expect = 7.3
 Identities = 17/46 (36%), Positives = 23/46 (50%), Gaps = 3/46 (6%)

Query: 74  EEEKKKRR--EDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
           EEE K +   E K +  E +K  E E  + K E K K + EE K +
Sbjct: 153 EEEAKAKAAAEAKKKAAEAKKKAEAE-AKAKAEAKAKAKAEEAKAK 197



 Score = 25.6 bits (56), Expect = 7.9
 Identities = 15/48 (31%), Positives = 22/48 (45%), Gaps = 4/48 (8%)

Query: 74  EEEKKKR----REDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
             E K +     E K +EE  ++ EEE   +   E K+K  E + K E
Sbjct: 129 AAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAE 176


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
           family consists of several hypothetical bacterial
           proteins of around 200 residues in length. The function
           of this family is unknown.
          Length = 214

 Score = 30.5 bits (69), Expect = 0.18
 Identities = 17/52 (32%), Positives = 28/52 (53%)

Query: 66  QEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
           +E      E+++ K   +K +EE +E+ EEE+ E   E +KE  E+ E   E
Sbjct: 68  EEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTESNVE 119



 Score = 28.9 bits (65), Expect = 0.59
 Identities = 18/47 (38%), Positives = 28/47 (59%)

Query: 72  GKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEER 118
            KEEEK+    +  E++ D + E+EE EE+ EE+ E+  +E  KE  
Sbjct: 65  VKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETE 111



 Score = 26.6 bits (59), Expect = 3.0
 Identities = 14/46 (30%), Positives = 26/46 (56%)

Query: 72  GKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
             ++E  +  E K EE+E    E++E +   E++ E+ EEE  +E+
Sbjct: 54  SDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEED 99



 Score = 26.2 bits (58), Expect = 5.4
 Identities = 16/55 (29%), Positives = 30/55 (54%)

Query: 66  QEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           +E       E+K+ + + + E+EE E+  EEE EE  +E +++ EE+      +E
Sbjct: 67  EEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTESNVEKE 121



 Score = 25.9 bits (57), Expect = 7.2
 Identities = 13/48 (27%), Positives = 28/48 (58%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
            ++++    E+  EEE++    E++ ++   EK+++  EEE +EE  E
Sbjct: 54  SDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEE 101


>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein.  The YqfQ-like protein family
           includes the B. subtilis YqfQ protein, also known as
           VrrA, which is functionally uncharacterized. This family
           of proteins is found in bacteria. Proteins in this
           family are typically between 146 and 237 amino acids in
           length. There are two conserved sequence motifs: QYGP
           and PKLY.
          Length = 155

 Score = 30.1 bits (68), Expect = 0.19
 Identities = 12/45 (26%), Positives = 24/45 (53%)

Query: 74  EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEER 118
           EEE ++   D+ E+E+  + + E  E++K E  +   E+E  +  
Sbjct: 98  EEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTE 142



 Score = 29.7 bits (67), Expect = 0.28
 Identities = 12/40 (30%), Positives = 25/40 (62%)

Query: 81  REDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           RE    ++E+E+ EEE  +E ++E   + + E  ++++RE
Sbjct: 89  RELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKRE 128



 Score = 28.2 bits (63), Expect = 0.86
 Identities = 10/36 (27%), Positives = 18/36 (50%)

Query: 83  DKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEER 118
              EEEE E+   +E E++   + +   +E+ K E 
Sbjct: 94  SDDEEEETEEESTDETEQEDPPETKTESKEKKKREV 129



 Score = 26.6 bits (59), Expect = 2.9
 Identities = 14/57 (24%), Positives = 29/57 (50%), Gaps = 2/57 (3%)

Query: 64  RIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           +I        +EE++   E   E E+++    E   E KE+KK +V + + ++E+ +
Sbjct: 86  KIFRELSSSDDEEEETEEESTDETEQED--PPETKTESKEKKKREVPKPKTEKEKPK 140


>gnl|CDD|218734 pfam05758, Ycf1, Ycf1.  The chloroplast genomes of most higher
           plants contain two giant open reading frames designated
           ycf1 and ycf2. Although the function of Ycf1 is unknown,
           it is known to be an essential gene.
          Length = 832

 Score = 30.4 bits (69), Expect = 0.20
 Identities = 11/46 (23%), Positives = 23/46 (50%)

Query: 75  EEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           EE+++  + ++E   + K  ++E E   EE      EE+   ++ E
Sbjct: 233 EEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTE 278



 Score = 28.0 bits (63), Expect = 1.3
 Identities = 10/40 (25%), Positives = 21/40 (52%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEE 112
           K+ ++    E++ EE + E     E +  K+E++   EE+
Sbjct: 224 KKLKETSETEEREEETDVEIETTSETKGTKQEQEGSTEED 263



 Score = 28.0 bits (63), Expect = 1.6
 Identities = 15/44 (34%), Positives = 21/44 (47%), Gaps = 2/44 (4%)

Query: 77  KKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           KK +   + EE E+E   + E+E   E K  K E+E   EE   
Sbjct: 224 KKLKETSETEEREEET--DVEIETTSETKGTKQEQEGSTEEDPS 265



 Score = 27.7 bits (62), Expect = 2.0
 Identities = 10/32 (31%), Positives = 17/32 (53%)

Query: 90  DEKLEEEEMEEQKEEKKEKVEEEEVKEERREG 121
            E  E EE EE+ + + E   E +  ++ +EG
Sbjct: 227 KETSETEEREEETDVEIETTSETKGTKQEQEG 258



 Score = 27.7 bits (62), Expect = 2.1
 Identities = 14/46 (30%), Positives = 24/46 (52%), Gaps = 2/46 (4%)

Query: 74  EEEKKKRREDKVEEEEDEKL--EEEEMEEQKEEKKEKVEEEEVKEE 117
            E K  ++E +   EED  L  EE+E  ++ E+  +    +E K+E
Sbjct: 247 SETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLEILKEKKDE 292



 Score = 26.5 bits (59), Expect = 4.9
 Identities = 11/45 (24%), Positives = 22/45 (48%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
            + E +   E K  ++E E   EE+     EEK++  + E++ + 
Sbjct: 239 TDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKL 283



 Score = 26.1 bits (58), Expect = 5.6
 Identities = 9/32 (28%), Positives = 21/32 (65%)

Query: 88  EEDEKLEEEEMEEQKEEKKEKVEEEEVKEERR 119
           E+ ++ +  +  E+K +K+EK E+ + +E+ R
Sbjct: 648 EKKKEFKILDYTEEKTKKEEKKEKNKREEKER 679


>gnl|CDD|215681 pfam00054, Laminin_G_1, Laminin G domain. 
          Length = 131

 Score = 29.6 bits (67), Expect = 0.21
 Identities = 13/60 (21%), Positives = 27/60 (45%), Gaps = 3/60 (5%)

Query: 10 LGDGSDGSDYLNVAIKDGGVAINLKLAMGKLDMYIRPNRLRFDDNQWHHLSIHRRIQEGT 69
                  D+L + ++DG + ++  L  G     +R    + +D +WH + + R  + GT
Sbjct: 12 YNGTQTERDFLALELRDGRLEVSYDLGSGAA--VVRSGD-KLNDGKWHSVELERNGRSGT 68


>gnl|CDD|145361 pfam02161, Prog_receptor, Progesterone receptor. 
          Length = 456

 Score = 30.4 bits (68), Expect = 0.21
 Identities = 10/38 (26%), Positives = 19/38 (50%)

Query: 81  REDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEER 118
           R  + +EE DEK ++++     E  + +VE    + E 
Sbjct: 51  RSCQGQEEPDEKTQDQQSLSDVERAEPEVEASRGRGEG 88


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 30.7 bits (69), Expect = 0.21
 Identities = 14/48 (29%), Positives = 25/48 (52%)

Query: 75   EEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGG 122
            +E  ++ +     E+DEK+ E+  EE  +E +E  E+    +E  E G
Sbjct: 4044 DEDIQQDDFSDLAEDDEKMNEDGFEENVQENEESTEDGVKSDEELEQG 4091



 Score = 29.2 bits (65), Expect = 0.56
 Identities = 14/53 (26%), Positives = 21/53 (39%), Gaps = 3/53 (5%)

Query: 65   IQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
            ++ GTG G     +          ED+ LEE   EE    + +  E E  + E
Sbjct: 3812 LESGTGLGSGVGAEDITNTL---NEDDDLEELANEEDTANQSDLDESEARELE 3861



 Score = 25.7 bits (56), Expect = 9.5
 Identities = 13/39 (33%), Positives = 19/39 (48%)

Query: 75   EEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEE 113
            E+ +K  ED  EE   E  E  E   + +E+ E+ E  E
Sbjct: 4057 EDDEKMNEDGFEENVQENEESTEDGVKSDEELEQGEVPE 4095


>gnl|CDD|237063 PRK12329, nusA, transcription elongation factor NusA; Provisional.
          Length = 449

 Score = 30.5 bits (69), Expect = 0.21
 Identities = 14/43 (32%), Positives = 23/43 (53%)

Query: 74  EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKE 116
           E E+ +R E+     E   L E+E E++ E ++ + EEEE   
Sbjct: 407 EAEQAERAEEDARLRELYPLPEDEFEDEDELEEAQPEEEEEAR 449



 Score = 27.1 bits (60), Expect = 2.6
 Identities = 13/47 (27%), Positives = 21/47 (44%)

Query: 74  EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
            E   +R E++  + E E+  E E  E+ EE     E   + E+  E
Sbjct: 386 AELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFE 432



 Score = 26.3 bits (58), Expect = 5.4
 Identities = 15/44 (34%), Positives = 22/44 (50%)

Query: 74  EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
           E E  K  E   + EE+E L+ E  E  + E+ E+ EE+    E
Sbjct: 379 EAEDAKVAELISQREEEEALQREAEERLEAEQAERAEEDARLRE 422



 Score = 25.9 bits (57), Expect = 6.5
 Identities = 15/48 (31%), Positives = 21/48 (43%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           +E E++   E     EED +L E     + E + E   EE   EE  E
Sbjct: 400 REAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDELEEAQPEEEEE 447



 Score = 25.5 bits (56), Expect = 8.7
 Identities = 13/47 (27%), Positives = 22/47 (46%)

Query: 75  EEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREG 121
            E  +  ED    E   + EEEE  +++ E++ + E+ E  EE    
Sbjct: 374 AEYDQEAEDAKVAELISQREEEEALQREAEERLEAEQAERAEEDARL 420



 Score = 25.5 bits (56), Expect = 8.7
 Identities = 12/44 (27%), Positives = 23/44 (52%)

Query: 74  EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
           EE  ++  E+++E E+ E+ EE+    +     E   E+E + E
Sbjct: 395 EEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDELE 438


>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740).  This
           family consists of several uncharacterized plant
           proteins of unknown function.
          Length = 565

 Score = 30.4 bits (68), Expect = 0.23
 Identities = 15/39 (38%), Positives = 23/39 (58%)

Query: 83  DKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREG 121
           D V EEE+E   EE+ E  ++E  + V+E+  +EE  E 
Sbjct: 137 DLVLEEEEEVEMEEDEEYYEKEPGKVVDEKSEEEEEEEL 175



 Score = 30.0 bits (67), Expect = 0.29
 Identities = 17/34 (50%), Positives = 25/34 (73%)

Query: 82  EDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVK 115
           E++VE EEDE+  E+E  +  +EK E+ EEEE+K
Sbjct: 143 EEEVEMEEDEEYYEKEPGKVVDEKSEEEEEEELK 176


>gnl|CDD|217049 pfam02459, Adeno_terminal, Adenoviral DNA terminal protein.  This
           protein is covalently attached to the terminii of
           replicating DNA in vivo.
          Length = 548

 Score = 30.4 bits (69), Expect = 0.23
 Identities = 16/29 (55%), Positives = 23/29 (79%)

Query: 86  EEEEDEKLEEEEMEEQKEEKKEKVEEEEV 114
           EEEE+E+ E  E EE++EE++E+  EEEV
Sbjct: 306 EEEEEEEEEVPEEEEEEEEEEERTFEEEV 334



 Score = 29.2 bits (66), Expect = 0.53
 Identities = 16/43 (37%), Positives = 27/43 (62%)

Query: 77  KKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERR 119
           +++RR      E +E+ EEEE   ++EE++E+ EE   +EE R
Sbjct: 293 RRRRRPPPSPPEPEEEEEEEEEVPEEEEEEEEEEERTFEEEVR 335



 Score = 28.5 bits (64), Expect = 1.1
 Identities = 14/34 (41%), Positives = 23/34 (67%)

Query: 86  EEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERR 119
            E E+E+ EEEE+ E++EE++E+ E    +E R 
Sbjct: 303 PEPEEEEEEEEEVPEEEEEEEEEEERTFEEEVRA 336


>gnl|CDD|237799 PRK14715, PRK14715, DNA polymerase II large subunit; Provisional.
          Length = 1627

 Score = 30.2 bits (68), Expect = 0.24
 Identities = 14/45 (31%), Positives = 29/45 (64%)

Query: 75  EEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERR 119
           E+K+++ E+K EE + E+++EE  EE+K    E  E+  ++  ++
Sbjct: 281 EKKEEKDEEKSEEVKTEEVDEEFEEEEKGFYYELYEKVNIEANKK 325


>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19. 
           Med19 represents a family of conserved proteins which
           are members of the multi-protein co-activator Mediator
           complex. Mediator is required for activation of RNA
           polymerase II transcription by DNA binding
           transactivators.
          Length = 178

 Score = 29.8 bits (67), Expect = 0.26
 Identities = 17/62 (27%), Positives = 32/62 (51%)

Query: 62  HRRIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREG 121
              IQ      K + KK R +D + EE     E  +  E+K +KK+  +++E K++++E 
Sbjct: 103 LMHIQPPKKKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEK 162

Query: 122 GR 123
            +
Sbjct: 163 KK 164


>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
          Length = 746

 Score = 30.2 bits (69), Expect = 0.27
 Identities = 15/47 (31%), Positives = 22/47 (46%)

Query: 74  EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           EEE+ ++ E K E EE+ K E EE    + E++ K E          
Sbjct: 39  EEEEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAAAAPAEEA 85



 Score = 29.0 bits (66), Expect = 0.69
 Identities = 12/47 (25%), Positives = 18/47 (38%)

Query: 74  EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           EEE +K    +  EEE +   EE    + EE+ +         E   
Sbjct: 40  EEEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAAAAPAEEAA 86


>gnl|CDD|237622 PRK14140, PRK14140, heat shock protein GrpE; Provisional.
          Length = 191

 Score = 29.6 bits (67), Expect = 0.28
 Identities = 15/45 (33%), Positives = 30/45 (66%)

Query: 74  EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEER 118
            EEK ++ E++VEE E E+  E+E+EE+  E++ + E  + ++ +
Sbjct: 1   MEEKNEQVEEEVEETEVEEAVEDEVEEETVEEESEAELLDEEQAK 45


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 30.0 bits (68), Expect = 0.30
 Identities = 11/44 (25%), Positives = 29/44 (65%)

Query: 74  EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
           + +     +D  +E++DE  ++++++++ EEKKE  E E++ ++
Sbjct: 141 DLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDD 184



 Score = 30.0 bits (68), Expect = 0.34
 Identities = 11/45 (24%), Positives = 28/45 (62%)

Query: 74  EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEER 118
           E++ +   +D V++E++EK E +E+E+  ++     +E++ +  R
Sbjct: 154 EDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALR 198


>gnl|CDD|212055 cd11486, SLC5sbd_SGLT1, Na(+)/glucose cotransporter SGLT1;solute
           binding domain.  Human SGLT1 (hSGLT1) is a
           high-affinity/low-capacity glucose transporter, which
           can also transport galactose. In the transport
           mechanism, two Na+ ions first bind to the extracellular
           side of the transporter and induce a conformational
           change in the glucose binding site. This results in an
           increased affinity for glucose. A second conformational
           change in the transporter follows, bringing the Na+ and
           glucose binding sites to the inner surface of the
           membrane. Glucose is then released, followed by the Na+
           ions. In the process, hSGLT1 is also able to transport
           water and urea and may be a major pathway for transport
           of these across the intestinal brush-border membrane.
           hSGLT1 is encoded by the SLC5A1 gene and expressed
           mostly in the intestine, but also in the trachea,
           kidney, heart, brain, testis, and prostate. The
           WHO/UNICEF oral rehydration solution (ORS) for the
           treatment of secretory diarrhea contains salt and
           glucose. The glucose, along with sodium ions, is
           transported by hSGLT1 and water is either co-transported
           along with these or follows by osmosis. Mutations in
           SGLT1 are associated with intestinal glucose galactose
           malabsorption (GGM). Up-regulation of intestinal SGLT1
           may protect against enteric infections. SGLT1 is
           expressed in colorectal, head and neck, and prostate
           tumors. Epidermal growth factor receptor (EGFR)
           functions in cell survival by stabilizing SGLT1, and
           thereby maintaining intracellular glucose levels. SGLT1
           is predicted to have 14 membrane-spanning regions. This
           subgroup belongs to the solute carrier 5
           (SLC5)transporter family.
          Length = 635

 Score = 29.8 bits (67), Expect = 0.31
 Identities = 11/36 (30%), Positives = 16/36 (44%)

Query: 71  CGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKK 106
           C       + R D   ++  E  +E EME  +E KK
Sbjct: 534 CWSLRNSTEERIDLDADDWTEDEDENEMETDEERKK 569



 Score = 28.3 bits (63), Expect = 1.1
 Identities = 8/28 (28%), Positives = 16/28 (57%)

Query: 88  EEDEKLEEEEMEEQKEEKKEKVEEEEVK 115
           EE   L+ ++  E ++E + + +EE  K
Sbjct: 542 EERIDLDADDWTEDEDENEMETDEERKK 569



 Score = 28.3 bits (63), Expect = 1.1
 Identities = 6/28 (21%), Positives = 13/28 (46%)

Query: 86  EEEEDEKLEEEEMEEQKEEKKEKVEEEE 113
           EE  D   ++   +E + E +   E ++
Sbjct: 542 EERIDLDADDWTEDEDENEMETDEERKK 569


>gnl|CDD|237628 PRK14156, PRK14156, heat shock protein GrpE; Provisional.
          Length = 177

 Score = 29.4 bits (66), Expect = 0.32
 Identities = 16/44 (36%), Positives = 23/44 (52%)

Query: 77  KKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
            ++ + ++VEEE +    EE +EE  EE  EK E E   E   E
Sbjct: 2   SEEIKNEEVEEEVEATETEETVEEVVEETPEKSELELANERADE 45


>gnl|CDD|219342 pfam07227, DUF1423, Protein of unknown function (DUF1423).  This
           family represents a conserved region approximately 500
           residues long within a number of Arabidopsis thaliana
           proteins of unknown function.
          Length = 446

 Score = 29.7 bits (67), Expect = 0.33
 Identities = 15/49 (30%), Positives = 28/49 (57%), Gaps = 4/49 (8%)

Query: 73  KEEEKKKR----REDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
           +E E+ +R    + +K EEE   K  +  + E +EE++   EE +++EE
Sbjct: 378 REAERLQRIALAKTEKSEEEYASKYLKLRLSEAEEERQYLFEELKLQEE 426


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
           the C subunit of DNA polymerase delta. It carries the
           essential residues for binding to the Pol1 subunit of
           polymerase alpha, from residues 293-332, which are
           characterized by the motif D--G--VT, referred to as the
           DPIM motif. The first 160 residues of the protein form
           the minimal domain for binding to the B subunit, Cdc1,
           of polymerase delta, the final 10 C-terminal residues,
           362-372, being the DNA sliding clamp, PCNA, binding
           motif.
          Length = 427

 Score = 29.8 bits (67), Expect = 0.33
 Identities = 22/61 (36%), Positives = 33/61 (54%), Gaps = 7/61 (11%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEK-------KEKVEEEEVKEERREGGRGG 125
           K +  KK  ED+ E+EE E + E  +EE++ E+       K++ E+EEV      G R G
Sbjct: 294 KRKRLKKMMEDEDEDEEMEIVPESPVEEEESEEPEPPPLPKKEEEKEEVTVSPDGGRRRG 353

Query: 126 R 126
           R
Sbjct: 354 R 354



 Score = 28.3 bits (63), Expect = 1.1
 Identities = 12/49 (24%), Positives = 25/49 (51%)

Query: 72  GKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
            +E E++  + D + E+E  +    + +E ++E K   E  + +EE  E
Sbjct: 241 KEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETEE 289



 Score = 27.1 bits (60), Expect = 3.2
 Identities = 16/49 (32%), Positives = 26/49 (53%), Gaps = 5/49 (10%)

Query: 72  GKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
             E+E K   E    EEE      EE E++K ++ +K+ E+E ++E  E
Sbjct: 269 EDEDEPKPSGERSDSEEE-----TEEKEKEKRKRLKKMMEDEDEDEEME 312



 Score = 26.3 bits (58), Expect = 5.4
 Identities = 13/49 (26%), Positives = 23/49 (46%), Gaps = 1/49 (2%)

Query: 72  GKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
            K +EKK+++E      ++E  EEE  +     + E  E   + E+  E
Sbjct: 224 KKTKEKKEKKEASESTVKEES-EEESGKRDVILEDESAEPTGLDEDEDE 271


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 29.7 bits (68), Expect = 0.34
 Identities = 14/44 (31%), Positives = 34/44 (77%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKE 116
           ++ E  ++RE+++E++E E  ++++  E+KEE+ E++ EE+++E
Sbjct: 100 RKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQE 143



 Score = 27.4 bits (62), Expect = 1.9
 Identities = 13/48 (27%), Positives = 33/48 (68%), Gaps = 1/48 (2%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           K E++  ++E+ ++ +  E LE+ E E +K+EK+ + +++E++++  E
Sbjct: 86  KLEKRLLQKEENLDRK-LELLEKREEELEKKEKELEQKQQELEKKEEE 132



 Score = 26.7 bits (60), Expect = 3.9
 Identities = 18/51 (35%), Positives = 34/51 (66%), Gaps = 3/51 (5%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKE---EKKEKVEEEEVKEERRE 120
           ++EE   R+ + +E+ E+E  ++E+  EQK+   EKKE+  EE ++E+ +E
Sbjct: 93  QKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQE 143



 Score = 26.3 bits (59), Expect = 5.0
 Identities = 14/52 (26%), Positives = 29/52 (55%), Gaps = 4/52 (7%)

Query: 73  KEEEKKKRREDKVEEEE---DEKLEEEEMEEQKEEKKEKVE-EEEVKEERRE 120
           K +E ++  +  +EE +   +   +E  +E ++E  K + E E+E++E R E
Sbjct: 32  KIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNE 83



 Score = 25.5 bits (57), Expect = 8.8
 Identities = 15/45 (33%), Positives = 29/45 (64%)

Query: 75  EEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERR 119
           E++ + R +++++ E   L++EE  ++K E  EK EEE  K+E+ 
Sbjct: 74  EKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKE 118


>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
           subunit 1; Provisional.
          Length = 319

 Score = 29.6 bits (67), Expect = 0.35
 Identities = 15/44 (34%), Positives = 29/44 (65%), Gaps = 2/44 (4%)

Query: 72  GKEEEKKKRREDKVE--EEEDEKLEEEEMEEQKEEKKEKVEEEE 113
           G +EE  +   +K E  EEED+  E E+ +E+ E+++E+ +++E
Sbjct: 273 GGDEEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDE 316


>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
          Length = 211

 Score = 29.3 bits (66), Expect = 0.36
 Identities = 14/50 (28%), Positives = 29/50 (58%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGG 122
           K E  ++    + E +E++K +EE++E ++ EK+E +E+ E   E +   
Sbjct: 9   KHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEE 58



 Score = 26.6 bits (59), Expect = 3.4
 Identities = 13/42 (30%), Positives = 28/42 (66%)

Query: 76  EKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
            +K+ ++ K E  E++  +E E +E+ + K+E +E EE+++E
Sbjct: 1   MEKECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKE 42



 Score = 26.6 bits (59), Expect = 3.4
 Identities = 19/46 (41%), Positives = 26/46 (56%)

Query: 71  CGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKE 116
           C KE E K+  + K E+ E E++E+EE+ E  EE  E   EE   E
Sbjct: 17  CCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDE 62


>gnl|CDD|227084 COG4741, COG4741, Predicted secreted endonuclease distantly related
           to archaeal Holliday junction resolvase [Nucleotide
           transport and metabolism].
          Length = 175

 Score = 29.3 bits (66), Expect = 0.36
 Identities = 16/45 (35%), Positives = 26/45 (57%)

Query: 74  EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEER 118
           E  +K  RE  V E +  K EE +++E  E+K E+  E+ V++ R
Sbjct: 40  ETLQKAERERLVNEAQARKEEEWKLKEWIEKKIEEAREDAVRKSR 84



 Score = 25.5 bits (56), Expect = 7.8
 Identities = 18/49 (36%), Positives = 27/49 (55%)

Query: 72  GKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           GK E K +  E+ +++ E E+L  E    ++EE K K   E+  EE RE
Sbjct: 29  GKVESKARELEETLQKAERERLVNEAQARKEEEWKLKEWIEKKIEEARE 77


>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain.  This is a family of proteins of
           approximately 300 residues, found in plants and
           vertebrates. They contain a highly conserved DDRGK
           motif.
          Length = 189

 Score = 29.3 bits (66), Expect = 0.40
 Identities = 19/48 (39%), Positives = 33/48 (68%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           + EE+KK  E +  E ++E+  EEE E++KEE++ K  EE+ ++E+ E
Sbjct: 27  EREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQEE 74



 Score = 27.7 bits (62), Expect = 1.5
 Identities = 21/47 (44%), Positives = 36/47 (76%)

Query: 74  EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           EEE+++ R+   E+ E E+ EEEE+EE++E+KKE+ E +E +E+ R+
Sbjct: 24  EEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARK 70



 Score = 27.0 bits (60), Expect = 2.8
 Identities = 13/46 (28%), Positives = 32/46 (69%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEER 118
            EE+++  R+++ E EE+ + ++EE E ++ E++ + E+EE ++ +
Sbjct: 34  LEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQEEYEKLK 79



 Score = 26.2 bits (58), Expect = 4.7
 Identities = 17/46 (36%), Positives = 31/46 (67%)

Query: 75  EEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           + ++++RE + EE E+ K  EE+ E +++E++E  EE E K+E  E
Sbjct: 15  QARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEE 60



 Score = 26.2 bits (58), Expect = 4.9
 Identities = 18/48 (37%), Positives = 34/48 (70%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           K EEK+ RR+ +  EEE+ +  ++  E+++ E+KE+ E EE +E+++E
Sbjct: 10  KLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKE 57


>gnl|CDD|162338 TIGR01398, FlhA, flagellar biosynthesis protein FlhA.  This model
           describes flagellar biosynthesis protein FlhA, one of a
           large number of genes associated with the biosynthesis
           of functional bacterial flagella. Homologs of many such
           proteins, including FlhA, function in type III protein
           secretion systems. A separate model describes InvA
           (Salmonella enterica), LcrD (Yersinia enterocolitica),
           HrcV (Xanthomonas), etc., all of which score below the
           noise cutoff for this model [Cellular processes,
           Chemotaxis and motility].
          Length = 678

 Score = 29.5 bits (67), Expect = 0.41
 Identities = 9/33 (27%), Positives = 15/33 (45%)

Query: 79  KRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEE 111
             R    +EEE      +  EE  EE++E + +
Sbjct: 308 YLRRRSKQEEEAAAEAAKAQEEAAEEEEESIND 340


>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1.  All
           proteins in this family for which functions are known
           are cyclin dependent protein kinases that are components
           of TFIIH, a complex that is involved in nucleotide
           excision repair and transcription initiation. Also known
           as MAT1 (menage a trois 1). This family is based on the
           phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
           Stanford University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 309

 Score = 29.0 bits (65), Expect = 0.51
 Identities = 13/43 (30%), Positives = 26/43 (60%)

Query: 74  EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKE 116
           +EE ++  E + EEEE  +L  ++ EE+++  K K ++  + E
Sbjct: 144 QEELEEALEFEKEEEEQRRLLLQKEEEEQQMNKRKNKQALLDE 186



 Score = 26.3 bits (58), Expect = 4.4
 Identities = 17/44 (38%), Positives = 27/44 (61%), Gaps = 1/44 (2%)

Query: 77  KKKRREDKVEEEEDEKLE-EEEMEEQKEEKKEKVEEEEVKEERR 119
           K K +  + +EE +E LE E+E EEQ+    +K EEE+   +R+
Sbjct: 135 KNKEKSTREQEELEEALEFEKEEEEQRRLLLQKEEEEQQMNKRK 178


>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62. 
          Length = 217

 Score = 29.0 bits (65), Expect = 0.53
 Identities = 10/51 (19%), Positives = 24/51 (47%), Gaps = 2/51 (3%)

Query: 73  KEEEKKKRREDKV--EEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREG 121
           +++  +K +        E  +KL  +E +E+K++ K+K    +V   +   
Sbjct: 39  QDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKKKKVPLQVNPAQLFV 89


>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
           RPA34.5.  This is a family of proteins conserved from
           yeasts to human. Subunit A34.5 of RNA polymerase I is a
           non-essential subunit which is thought to help Pol I
           overcome topological constraints imposed on ribosomal
           DNA during the process of transcription.
          Length = 193

 Score = 28.9 bits (65), Expect = 0.54
 Identities = 16/46 (34%), Positives = 29/46 (63%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEER 118
            E  +K+      +E E E+ E++E +++KE KKEK E+++ KE+ 
Sbjct: 132 SETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKM 177



 Score = 27.8 bits (62), Expect = 1.2
 Identities = 18/50 (36%), Positives = 33/50 (66%), Gaps = 1/50 (2%)

Query: 72  GKEEEKKKRREDKVEEEED-EKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
            KE   K  +E +VEEEE  EK +++E++++K+EKK+K E+    +  ++
Sbjct: 136 EKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKK 185



 Score = 26.2 bits (58), Expect = 4.7
 Identities = 17/49 (34%), Positives = 30/49 (61%), Gaps = 1/49 (2%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREG 121
               +     +  E+E   K+E+E  E ++EEKKEK +++EVK+E++E 
Sbjct: 123 GPPSELGSESETSEKETTAKVEKEA-EVEEEEKKEKKKKKEVKKEKKEK 170


>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
          Length = 106

 Score = 28.4 bits (64), Expect = 0.58
 Identities = 9/29 (31%), Positives = 16/29 (55%)

Query: 88  EEDEKLEEEEMEEQKEEKKEKVEEEEVKE 116
                  EE+ EE++EE++++  EEE   
Sbjct: 71  AAAAAAAEEKKEEEEEEEEKEESEEEAAA 99



 Score = 26.5 bits (59), Expect = 2.5
 Identities = 11/28 (39%), Positives = 17/28 (60%)

Query: 85  VEEEEDEKLEEEEMEEQKEEKKEKVEEE 112
                    EE++ EE++EE+KE+ EEE
Sbjct: 69  AAAAAAAAAEEKKEEEEEEEEKEESEEE 96



 Score = 25.7 bits (57), Expect = 4.3
 Identities = 11/32 (34%), Positives = 17/32 (53%)

Query: 94  EEEEMEEQKEEKKEKVEEEEVKEERREGGRGG 125
                 E+K+E++E+ EE+E  EE    G G 
Sbjct: 72  AAAAAAEEKKEEEEEEEEKEESEEEAAAGLGA 103



 Score = 24.9 bits (55), Expect = 9.9
 Identities = 10/23 (43%), Positives = 14/23 (60%)

Query: 82  EDKVEEEEDEKLEEEEMEEQKEE 104
                EE+ E+ EEEE +E+ EE
Sbjct: 73  AAAAAEEKKEEEEEEEEKEESEE 95


>gnl|CDD|218274 pfam04801, Sin_N, Sin-like protein conserved region.  Family of
           higher eukaryotic proteins. SIN was identified as a
           protein that interacts specifically with SXL (sex
           lethal) in a yeast two-hybrid assay. The interaction is
           mediated by one of the SXL RNA binding domains.
          Length = 422

 Score = 28.9 bits (65), Expect = 0.58
 Identities = 15/45 (33%), Positives = 26/45 (57%)

Query: 75  EEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERR 119
           ++K+K+ ED  +E+ED   E EE  +Q   K  + E E+ ++ R 
Sbjct: 142 KDKRKKEEDTADEDEDPDEEAEEELKQVTVKFSRPETEKQRKRRE 186


>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
           includes archaeal L12p, the protein that is functionally
           equivalent to L7/L12 in bacteria and the P1 and P2
           proteins in eukaryotes. L12p is homologous to P1 and P2
           but is not homologous to bacterial L7/L12. It is located
           in the L12 stalk, with proteins L10, L11, and 23S rRNA.
           L12p is the only protein in the ribosome to occur as
           multimers, always appearing as sets of dimers. Recent
           data indicate that most archaeal species contain six
           copies of L12p (three homodimers), while eukaryotes have
           four copies (two heterodimers), and bacteria may have
           four or six copies (two or three homodimers), depending
           on the species. The organization of proteins within the
           stalk has been characterized primarily in bacteria,
           where L7/L12 forms either two or three homodimers and
           each homodimer binds to the extended C-terminal helix of
           L10. L7/L12 is attached to the ribosome through L10 and
           is the only ribosomal protein that does not directly
           interact with rRNA. Archaeal L12p is believed to
           function in a similar fashion. However, hybrid ribosomes
           containing the large subunit from E. coli with an
           archaeal stalk are able to bind archaeal and eukaryotic
           elongation factors but not bacterial elongation factors.
           In several mesophilic and thermophilic archaeal species,
           the binding of 23S rRNA to protein L11 and to the
           L10/L12p pentameric complex was found to be
           temperature-dependent and cooperative.
          Length = 106

 Score = 28.2 bits (63), Expect = 0.61
 Identities = 14/27 (51%), Positives = 20/27 (74%)

Query: 86  EEEEDEKLEEEEMEEQKEEKKEKVEEE 112
               +EK EE+E E++KEE+KE+ EEE
Sbjct: 71  AAAAEEKAEEKEEEKKKEEEKEEEEEE 97



 Score = 25.5 bits (56), Expect = 5.0
 Identities = 14/28 (50%), Positives = 21/28 (75%)

Query: 94  EEEEMEEQKEEKKEKVEEEEVKEERREG 121
            EE+ EE++EEKK++ E+EE +EE   G
Sbjct: 74  AEEKAEEKEEEKKKEEEKEEEEEEALAG 101



 Score = 25.2 bits (55), Expect = 6.4
 Identities = 13/31 (41%), Positives = 18/31 (58%)

Query: 95  EEEMEEQKEEKKEKVEEEEVKEERREGGRGG 125
               EE+ EEK+E+ ++EE KEE  E    G
Sbjct: 71  AAAAEEKAEEKEEEKKKEEEKEEEEEEALAG 101



 Score = 24.8 bits (54), Expect = 9.2
 Identities = 11/27 (40%), Positives = 19/27 (70%)

Query: 87  EEEDEKLEEEEMEEQKEEKKEKVEEEE 113
               E+  EE+ EE+K+E++++ EEEE
Sbjct: 71  AAAAEEKAEEKEEEKKKEEEKEEEEEE 97


>gnl|CDD|184732 PRK14539, PRK14539, 50S ribosomal protein L11/unknown domain fusion
           protein; Provisional.
          Length = 196

 Score = 28.8 bits (64), Expect = 0.62
 Identities = 12/50 (24%), Positives = 21/50 (42%), Gaps = 1/50 (2%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGG 122
           KEE K   +   + + ++  L+  E  E K  K + +E   + EE     
Sbjct: 146 KEEAKAAAKAAALAKAKEASLKSAE-AELKASKGKSIEVNVIGEEDEGEK 194


>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
           envelope biogenesis, outer membrane].
          Length = 387

 Score = 28.8 bits (64), Expect = 0.66
 Identities = 18/61 (29%), Positives = 33/61 (54%), Gaps = 6/61 (9%)

Query: 66  QEGTGCGKEEEKKKRREDKVEE------EEDEKLEEEEMEEQKEEKKEKVEEEEVKEERR 119
           Q+ +    E+++KK+ E   EE       E E+L++ E E  K ++++K  EE  K+ + 
Sbjct: 70  QQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQL 129

Query: 120 E 120
           E
Sbjct: 130 E 130



 Score = 27.6 bits (61), Expect = 2.0
 Identities = 14/44 (31%), Positives = 28/44 (63%)

Query: 74  EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
           E+E+ K +E + + EE EK  + E ++Q+E+ ++   E++ K E
Sbjct: 107 EKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAE 150



 Score = 26.5 bits (58), Expect = 4.8
 Identities = 11/43 (25%), Positives = 28/43 (65%)

Query: 74  EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKE 116
           E+E+ K+ E +  + ++++ + EE E+Q + ++++ EE+  K 
Sbjct: 99  EQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKA 141



 Score = 26.1 bits (57), Expect = 5.6
 Identities = 14/48 (29%), Positives = 19/48 (39%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
             E KKK  E     EE +   E    ++K E + K   E+ K E   
Sbjct: 166 AAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEA 213


>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein.  This
           family includes the radial spoke head proteins RSP4 and
           RSP6 from Chlamydomonas reinhardtii, and several
           eukaryotic homologues, including mammalian RSHL1, the
           protein product of a familial ciliary dyskinesia
           candidate gene.
          Length = 481

 Score = 28.9 bits (65), Expect = 0.73
 Identities = 13/34 (38%), Positives = 21/34 (61%)

Query: 71  CGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEE 104
           C     ++K  E++ E+EE+E+ EEE  E + EE
Sbjct: 339 CTWVNPEQKDEEEEQEDEEEEEEEEEPEEPEPEE 372



 Score = 28.5 bits (64), Expect = 0.85
 Identities = 15/40 (37%), Positives = 23/40 (57%), Gaps = 2/40 (5%)

Query: 81  REDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           R   V  E+ +  EEEE E+++EE++E+  EE   EE   
Sbjct: 338 RCTWVNPEQKD--EEEEQEDEEEEEEEEEPEEPEPEEGPP 375



 Score = 27.7 bits (62), Expect = 1.6
 Identities = 15/39 (38%), Positives = 23/39 (58%), Gaps = 3/39 (7%)

Query: 86  EEEEDEKLEEEEMEEQKEEKKEKVEEEEV---KEERREG 121
           E EE+E+ EE    ++++E +E+ EEE     KEE   G
Sbjct: 169 EGEEEEEEEEVGEADEEDEGEEEEEEEPEEVPKEEEGTG 207



 Score = 26.9 bits (60), Expect = 2.9
 Identities = 11/33 (33%), Positives = 21/33 (63%)

Query: 86  EEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEER 118
            E++DE+ E+E+ EE++EE++ +  E E     
Sbjct: 344 PEQKDEEEEQEDEEEEEEEEEPEEPEPEEGPPL 376



 Score = 26.6 bits (59), Expect = 4.4
 Identities = 15/33 (45%), Positives = 20/33 (60%)

Query: 85  VEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
           VE  E E+ EEEE   + +E+ E  EEEE + E
Sbjct: 165 VELREGEEEEEEEEVGEADEEDEGEEEEEEEPE 197



 Score = 25.8 bits (57), Expect = 7.8
 Identities = 14/31 (45%), Positives = 19/31 (61%)

Query: 86  EEEEDEKLEEEEMEEQKEEKKEKVEEEEVKE 116
           EEEE+E+ EEE  E  + E+    E  EV+E
Sbjct: 288 EEEEEEEEEEEPAERDELEENPDFEGLEVRE 318


>gnl|CDD|215656 pfam00012, HSP70, Hsp70 protein.  Hsp70 chaperones help to fold
           many proteins. Hsp70 assisted folding involves repeated
           cycles of substrate binding and release. Hsp70 activity
           is ATP dependent. Hsp70 proteins are made up of two
           regions: the amino terminus is the ATPase domain and the
           carboxyl terminus is the substrate binding region.
          Length = 598

 Score = 28.7 bits (65), Expect = 0.78
 Identities = 13/48 (27%), Positives = 26/48 (54%), Gaps = 4/48 (8%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEE----MEEQKEEKKEKVEEEEVKE 116
           K+ E+    + K +E  + K E EE    +E+  +E+ +K+ E + K+
Sbjct: 511 KDAEEYAAEDKKRKERIEAKNEAEEYVYSLEKSLKEEGDKLPEADKKK 558



 Score = 26.8 bits (60), Expect = 3.5
 Identities = 15/51 (29%), Positives = 21/51 (41%), Gaps = 7/51 (13%)

Query: 74  EEEKKKRREDKVEEEEDEKL---EEEEM----EEQKEEKKEKVEEEEVKEE 117
            ++K   +E K+       L   E E M    EE   E K++ E  E K E
Sbjct: 482 AKDKGTGKEQKITITASSGLSDDEIERMVKDAEEYAAEDKKRKERIEAKNE 532


>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
          Length = 1832

 Score = 28.9 bits (65), Expect = 0.81
 Identities = 14/48 (29%), Positives = 26/48 (54%), Gaps = 4/48 (8%)

Query: 76  EKKKR--REDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEE--VKEERR 119
              K+  R    ++ E +  +  E + +K++KKEK +EEE   +E+ R
Sbjct: 719 GLMKKIFRNWNGKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKAR 766



 Score = 28.5 bits (64), Expect = 1.00
 Identities = 13/43 (30%), Positives = 25/43 (58%)

Query: 72  GKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEV 114
           GK+ E K     + + ++ +K E+++ EE K E+K ++E  E 
Sbjct: 730 GKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKARIEIAEA 772


>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3.  This family
           of proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). Shr3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of Shr3, AAPs are retained in the ER.
          Length = 196

 Score = 28.5 bits (64), Expect = 0.82
 Identities = 8/34 (23%), Positives = 17/34 (50%)

Query: 87  EEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           E +D K +EE   E+++E      ++    ++ E
Sbjct: 158 ERKDAKQKEEFAAEERKEALAAAAKKSATPQKVE 191



 Score = 27.7 bits (62), Expect = 1.5
 Identities = 6/35 (17%), Positives = 18/35 (51%)

Query: 74  EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEK 108
            ++K++   ++ +E      ++    ++ E KK+K
Sbjct: 162 AKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196



 Score = 26.2 bits (58), Expect = 4.1
 Identities = 9/39 (23%), Positives = 18/39 (46%)

Query: 74  EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEE 112
           E +  K++E+   EE  E L     +    +K E  +++
Sbjct: 158 ERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196


>gnl|CDD|132364 TIGR03321, alt_F1F0_F0_B, alternate F1F0 ATPase, F0 subunit B.  A
           small number of taxonomically diverse prokaryotic
           species, including Methanosarcina barkeri, have what
           appears to be a second ATP synthase, in addition to the
           normal F1F0 ATPase in bacteria and A1A0 ATPase in
           archaea. These enzymes use ion gradients to synthesize
           ATP, CC and in principle may run in either direction.
           This model represents the F0 subunit B of this apparent
           second ATP synthase.
          Length = 246

 Score = 28.5 bits (64), Expect = 0.83
 Identities = 17/60 (28%), Positives = 32/60 (53%), Gaps = 3/60 (5%)

Query: 61  IHRRIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           +  R ++  G   + + KKR  ++   E +EK   EE+++Q+E    K  +EE + ER+ 
Sbjct: 34  MDAREKKIAGELADADTKKREAEQERREYEEKN--EELDQQREVLLTKA-KEEAQAERQR 90


>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
           domain.  This domain is found in a number of different
           types of plant proteins including NAM-like proteins.
          Length = 147

 Score = 28.1 bits (63), Expect = 0.84
 Identities = 14/49 (28%), Positives = 33/49 (67%)

Query: 72  GKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           G+++ K+K R DK++ +++E  +E+E EE+  +   + E+E  + E+++
Sbjct: 64  GRKKAKEKLRRDKLKAKKEEAEKEKEKEERFMKALAEAEKERAELEKKK 112


>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon. 
          Length = 431

 Score = 28.5 bits (63), Expect = 0.85
 Identities = 16/48 (33%), Positives = 28/48 (58%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           +EEEK++ RE++ E EE E + + E +    + +E  +EE+  E   E
Sbjct: 112 EEEEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKEPEPEEE 159



 Score = 26.2 bits (57), Expect = 6.6
 Identities = 19/49 (38%), Positives = 32/49 (65%), Gaps = 2/49 (4%)

Query: 74  EEEKKKRREDK--VEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           EE KKKR E +  +EEEE  + +EE   + +EE++++  +EE++  R E
Sbjct: 217 EELKKKREERRKVLEEEEQRRKQEEADRKSREEEEKRRLKEEIERRRAE 265


>gnl|CDD|218581 pfam05416, Peptidase_C37, Southampton virus-type processing
           peptidase.  Corresponds to Merops family C37.
           Norwalk-like viruses (NLVs), including the Southampton
           virus, cause acute non-bacterial gastroenteritis in
           humans. The NLV genome encodes three open reading frames
           (ORFs). ORF1 encodes a polyprotein, which is processed
           by the viral protease into six proteins.
          Length = 535

 Score = 28.7 bits (64), Expect = 0.86
 Identities = 19/56 (33%), Positives = 31/56 (55%), Gaps = 6/56 (10%)

Query: 75  EEKKKRREDK-----VEEE-EDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGRG 124
           +E KK RE++     ++E  ED +  EEE+ E++  + +  EEEE K  +R  G  
Sbjct: 259 DEYKKIREERGGKYSIQEYLEDRERYEEELAERQATEADFCEEEEAKIRQRIFGLR 314


>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO.  Bacillus
           spores are protected by a protein shell consisting of
           over 50 different polypeptides, known as the coat. This
           family of proteins has an important morphogenetic role
           in coat assembly, it is involved in the assembly of at
           least 5 different coat proteins including CotB, CotG,
           CotS, CotSA and CotW. It is likely to act at a late
           stage of coat assembly.
          Length = 185

 Score = 28.3 bits (63), Expect = 0.87
 Identities = 16/46 (34%), Positives = 29/46 (63%)

Query: 74  EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERR 119
           E+EK+    ++ +  E ++ EE   EE+KEE+ E V+++EV   +R
Sbjct: 65  EKEKEAGEPEREDIAEQQEKEEIAQEEEKEEEAEDVKQQEVFSFKR 110



 Score = 27.5 bits (61), Expect = 1.7
 Identities = 16/45 (35%), Positives = 25/45 (55%)

Query: 77  KKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREG 121
            KK  E + E+E  ++  + + EEQK E +E  +E+E  E  RE 
Sbjct: 33  IKKADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPERED 77



 Score = 26.7 bits (59), Expect = 3.6
 Identities = 15/55 (27%), Positives = 31/55 (56%)

Query: 66  QEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           +E      +E  K + E++  E E+ + E+E  E ++E+  E+ E+EE+ +E  +
Sbjct: 39  KEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEEEK 93



 Score = 25.6 bits (56), Expect = 8.7
 Identities = 16/51 (31%), Positives = 32/51 (62%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGR 123
           K +E++++ E +  E+E E  E E  +  ++++KE++ +EE KEE  E  +
Sbjct: 51  KSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEEEKEEEAEDVK 101


>gnl|CDD|148593 pfam07074, TRAP-gamma, Translocon-associated protein, gamma subunit
           (TRAP-gamma).  This family consists of several
           eukaryotic translocon-associated protein, gamma subunit
           (TRAP-gamma) sequences. The translocation site
           (translocon), at which nascent polypeptides pass through
           the endoplasmic reticulum membrane, contains a component
           previously called 'signal sequence receptor' that is now
           renamed as 'translocon-associated protein' (TRAP). The
           TRAP complex is comprised of four membrane proteins
           alpha, beta, gamma and delta which are present in a
           stoichiometric relation, and are genuine neighbors in
           intact microsomes. The gamma subunit is predicted to
           span the membrane four times.
          Length = 170

 Score = 28.2 bits (63), Expect = 0.92
 Identities = 13/40 (32%), Positives = 22/40 (55%), Gaps = 6/40 (15%)

Query: 80  RREDKVEEE------EDEKLEEEEMEEQKEEKKEKVEEEE 113
           RRE+ V  E      +D+K+  +E +E+   KK +V + E
Sbjct: 80  RREEAVTREVNRKLADDKKMSRKEKDERILWKKNEVADYE 119


>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
           partitioning / Cytoskeleton].
          Length = 373

 Score = 28.4 bits (64), Expect = 0.94
 Identities = 16/48 (33%), Positives = 28/48 (58%), Gaps = 1/48 (2%)

Query: 74  EEEKKKRREDKVEEEEDE-KLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           E E KK+  +K+ E+E   +  E+ + E+++E   K+EE + K E  E
Sbjct: 315 ERELKKKFTEKIREKEKRLEELEQNLIEERKELNSKLEEIQKKLEDLE 362



 Score = 27.7 bits (62), Expect = 1.6
 Identities = 16/48 (33%), Positives = 30/48 (62%), Gaps = 1/48 (2%)

Query: 72  GKEEEKKKRREDKVEEEEDEKLEE-EEMEEQKEEKKEKVEEEEVKEER 118
            K  EK + +E ++EE E   +EE +E+  + EE ++K+E+ E + E+
Sbjct: 320 KKFTEKIREKEKRLEELEQNLIEERKELNSKLEEIQKKLEDLEKRLEK 367


>gnl|CDD|185618 PTZ00438, PTZ00438, gamete antigen 27/25-like protein; Provisional.
          Length = 374

 Score = 28.5 bits (63), Expect = 0.96
 Identities = 18/56 (32%), Positives = 30/56 (53%)

Query: 67  EGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGG 122
           E  G     + ++ R  + EEEEDE +EE E  E+ E  +E+ +++E  E+  E  
Sbjct: 89  ELEGLNIIVKNEEERGTQKEEEEDEDVEEIEEVEEVEVVEEEYDDDEDSEKDDEKE 144



 Score = 27.3 bits (60), Expect = 2.1
 Identities = 21/64 (32%), Positives = 35/64 (54%)

Query: 59  LSIHRRIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEER 118
           L+I  + +E  G  KEEE+ +  E+  E EE E +EEE  +++  EK ++ E +   +E 
Sbjct: 93  LNIIVKNEEERGTQKEEEEDEDVEEIEEVEEVEVVEEEYDDDEDSEKDDEKESDAEGDEN 152

Query: 119 REGG 122
              G
Sbjct: 153 ELAG 156


>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97 family.
            This model family represents the major capsid protein
           component of the heads (capsids) of bacteriophage HK97,
           phi-105, P27, and related phage. This model represents
           one of several analogous families lacking detectable
           sequence similarity. The gene encoding this component is
           typically located in an operon encoding the small and
           large terminase subunits, the portal protein and the
           prohead or maturation protease [Mobile and
           extrachromosomal element functions, Prophage functions].
          Length = 384

 Score = 28.5 bits (64), Expect = 1.0
 Identities = 7/43 (16%), Positives = 21/43 (48%)

Query: 74  EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKE 116
            +E ++   +  +E      +E+  E ++E+ +    +EE+ +
Sbjct: 1   LKELREALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDK 43



 Score = 26.2 bits (58), Expect = 6.2
 Identities = 11/55 (20%), Positives = 23/55 (41%), Gaps = 1/55 (1%)

Query: 72  GKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGRGGR 126
            KE  K    E   E EE++  E + ++E+ ++   +++  E   +  E      
Sbjct: 12  AKELRKLTEDEKLAEAEEEKA-EYDALKEEIDKLDAEIDRLEELLDELEAKPAAS 65



 Score = 25.8 bits (57), Expect = 7.3
 Identities = 8/52 (15%), Positives = 20/52 (38%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGRG 124
             E ++++ E    +EE +KL+ E    ++   + + +     E        
Sbjct: 24  LAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAASGEGGGGEEEE 75


>gnl|CDD|220972 pfam11081, DUF2890, Protein of unknown function (DUF2890).  This
           family is conserved in dsDNA adenoviruses of
           vertebrates. The function is not known.
          Length = 172

 Score = 28.0 bits (62), Expect = 1.1
 Identities = 15/40 (37%), Positives = 28/40 (70%)

Query: 74  EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEE 113
           EEE    + ++VEE+E+E  + E+  ++++E+ E+VEEE 
Sbjct: 20  EEEDWDSQAEEVEEDEEEMEDWEDSLDEEDEEAEEVEEET 59


>gnl|CDD|223649 COG0576, GrpE, Molecular chaperone GrpE (heat shock protein)
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 193

 Score = 28.1 bits (63), Expect = 1.2
 Identities = 18/43 (41%), Positives = 29/43 (67%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVK 115
            ++E+K    D  E EE EK EEEE EE++ E++ ++EEE+ +
Sbjct: 2   SDKEQKTEEPDAEETEEAEKSEEEEAEEEEPEEENELEEEQQE 44


>gnl|CDD|148679 pfam07218, RAP1, Rhoptry-associated protein 1 (RAP-1).  This family
           consists of several rhoptry-associated protein 1 (RAP-1)
           sequences which appear to be specific to Plasmodium
           falciparum.
          Length = 790

 Score = 28.1 bits (62), Expect = 1.3
 Identities = 19/56 (33%), Positives = 31/56 (55%), Gaps = 6/56 (10%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEE------EVKEERREGG 122
            EEE+ +  E+  E EED+K   E++  + +E+K K +EE      +VK+    GG
Sbjct: 241 AEEEEFELDEEHEEAEEDKKEALEKIGAEGDEEKFKFDEEIKFIEHDVKDRNIAGG 296



 Score = 27.7 bits (61), Expect = 1.6
 Identities = 17/33 (51%), Positives = 22/33 (66%), Gaps = 1/33 (3%)

Query: 79  KRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEE 111
           K   DKV EEE E   +EE EE +E+KKE +E+
Sbjct: 234 KNDGDKVAEEE-EFELDEEHEEAEEDKKEALEK 265


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 28.0 bits (63), Expect = 1.3
 Identities = 18/52 (34%), Positives = 32/52 (61%)

Query: 66  QEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
             G G   EEE++   +D+ EE++D+ LEEEE +    +++E  E+E+  +E
Sbjct: 315 GLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDE 366



 Score = 26.1 bits (58), Expect = 6.2
 Identities = 15/39 (38%), Positives = 25/39 (64%), Gaps = 2/39 (5%)

Query: 82  EDKVEEEED--EKLEEEEMEEQKEEKKEKVEEEEVKEER 118
           +D  EEEED     EEE+ E++  + ++  EEEE ++E+
Sbjct: 340 DDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEK 378



 Score = 26.1 bits (58), Expect = 6.8
 Identities = 14/47 (29%), Positives = 30/47 (63%)

Query: 74  EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           EEE     ED+ ++++D + EEE+++   EE+ E+ E+ + +++  E
Sbjct: 325 EEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEE 371


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
           Rpc31.  RNA polymerase III contains seventeen subunits
           in yeasts and in human cells. Twelve of these are akin
           to RNA polymerase I or II and the other five are RNA pol
           III-specific, and form the functionally distinct groups
           (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
           Rpc34 and Rpc82 form a cluster of enzyme-specific
           subunits that contribute to transcription initiation in
           S.cerevisiae and H.sapiens. There is evidence that these
           subunits are anchored at or near the N-terminal Zn-fold
           of Rpc1, itself prolonged by a highly conserved but RNA
           polymerase III-specific domain.
          Length = 221

 Score = 27.8 bits (62), Expect = 1.3
 Identities = 11/41 (26%), Positives = 27/41 (65%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEE 113
              EKK +  +  + +E+++ +EEE EE++EE ++  ++++
Sbjct: 154 SMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDD 194



 Score = 27.4 bits (61), Expect = 1.7
 Identities = 16/42 (38%), Positives = 27/42 (64%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEV 114
             E+K K  E +  +EEDEK EEEE EE++E++    ++++ 
Sbjct: 155 MLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDD 196



 Score = 26.3 bits (58), Expect = 4.3
 Identities = 16/58 (27%), Positives = 27/58 (46%)

Query: 64  RIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREG 121
            I +  G      K KR+     EEE++  E+  M E+K ++ E  + +E  E+  E 
Sbjct: 121 GINKKAGKKLALSKFKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEE 178



 Score = 25.9 bits (57), Expect = 6.8
 Identities = 15/45 (33%), Positives = 31/45 (68%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
            EEE+    +  + E++ ++LE E+++E+ E+ +E+ EEEE ++E
Sbjct: 143 TEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDE 187


>gnl|CDD|227563 COG5238, RNA1, Ran GTPase-activating protein (RanGAP) involved in
           mRNA processing and transport [Signal transduction
           mechanisms / RNA processing and modification].
          Length = 388

 Score = 28.0 bits (62), Expect = 1.3
 Identities = 16/80 (20%), Positives = 30/80 (37%), Gaps = 3/80 (3%)

Query: 40  LDMYIRPNRLRFDD---NQWHHLSIHRRIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEE 96
           +D+    NR++      + +  +     + E        E+    E + E E DE + EE
Sbjct: 306 VDLERNGNRIKELADFGDYYEDIFEVVEVVEKQEGDVVTEESTDEESEDEVEIDESVIEE 365

Query: 97  EMEEQKEEKKEKVEEEEVKE 116
             E +  E +     E + E
Sbjct: 366 VAEMELLEVQVDDLAERLAE 385


>gnl|CDD|221775 pfam12794, MscS_TM, Mechanosensitive ion channel inner membrane
           domain 1.  The small mechanosensitive channel, MscS, is
           a part of the turgor-driven solute efflux system that
           protects bacteria from lysis in the event of osmotic
           shock. The MscS protein alone is sufficient to form a
           functional mechanosensitive channel gated directly by
           tension in the lipid bilayer. The MscS proteins are
           heptamers of three transmembrane subunits with seven
           converging M3 domains, and this domain is one of the
           inner membrane domains.
          Length = 339

 Score = 28.0 bits (63), Expect = 1.4
 Identities = 10/36 (27%), Positives = 18/36 (50%), Gaps = 3/36 (8%)

Query: 78  KKRREDKVEE--EEDEKLEEEEMEEQKEEKKEKVEE 111
           K +R + + +  EE+E+  E   E   EE +  +E 
Sbjct: 264 KAKRAEILAQRAEEEEESSEGAAETI-EEPELDLET 298


>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT.  This family
           consists of several bacterial cobalamin biosynthesis
           (CobT) proteins. CobT is involved in the transformation
           of precorrin-3 into cobyrinic acid.
          Length = 282

 Score = 27.9 bits (62), Expect = 1.5
 Identities = 10/49 (20%), Positives = 20/49 (40%)

Query: 74  EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGG 122
            E+ +   + K +E++D+  EEE        +      EE++    E  
Sbjct: 221 SEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEMEAA 269


>gnl|CDD|176483 cd08365, APC10-like1, APC10-like DOC1 domains of E3 ubiquitin
          ligases that mediate substrate ubiquitination.  This
          model represens the APC10-like DOC1 domain of
          multi-domain proteins present in E3 ubiquitin ligases.
          E3 ubiquitin ligases mediate substrate ubiquitination
          (or ubiquitylation), a component of the ubiquitin-26S
          proteasome pathway for selective proteolytic
          degradation. APC10/DOC1 domains such as those present
          in HECT (Homologous to the E6-AP Carboxyl Terminus) and
          Cullin-RING (Really Interesting New Gene) E3 ubiquitin
          ligase proteins, HECTD3, and CUL7, respectively, are
          also included here. CUL7 is a member of the Cullin-RING
          ligase family and functions as a molecular scaffold
          assembling a SCF-ROC1-like E3 ubiquitin ligase complex
          consisting of Skp1, CUL7, Fbx29 F-box protein, and ROC1
          (RING-box protein 1) and promotes ubiquitination. CUL7
          is a multi-domain protein with a C-terminal cullin
          domain that binds ROC1 and a centrally positioned
          APC10/DOC1 domain. HECTD3 contains a C-terminal HECT
          domain which contains the active site for ubiquitin
          transfer onto substrates, and an N-terminal APC10/DOC1
          domain which is responsible for substrate recognition
          and binding. An APC10/DOC1 domain homolog is also
          present in HERC2 (HECT domain and RLD2), a large
          multi-domain protein with three RCC1-like domains
          (RLDs), additional internal domains including zinc
          finger ZZ-type and Cyt-b5 (Cytochrome b5-like
          Heme/Steroid binding) domains, and a C-terminal HECT
          domain. Recent studies have shown that the protein
          complex HERC2-RNF8 coordinates ubiquitin-dependent
          assembly of DNA repair factors on damaged chromosomes.
          Also included in this hierarchy is an uncharacterized
          APC10/DOC1-like domain found in a multi-domain protein,
          which also contains CUB, zinc finger ZZ-type, and
          EF-hand domains. The APC10/DOC1 domain forms a
          beta-sandwich structure that is related in architecture
          to the galactose-binding domain-like fold; their
          sequences are quite dissimilar, however, and are not
          included here.
          Length = 131

 Score = 27.5 bits (61), Expect = 1.5
 Identities = 13/42 (30%), Positives = 21/42 (50%)

Query: 8  CWLGDGSDGSDYLNVAIKDGGVAINLKLAMGKLDMYIRPNRL 49
           W  DGS GS ++ + +K   +  +L LA+   D    P R+
Sbjct: 28 YWQSDGSQGSHWIRLKMKPDVLVRHLSLAVDATDSSYMPQRV 69


>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6.  In yeast, 15 Apg proteins
           coordinate the formation of autophagosomes. Autophagy is
           a bulk degradation process induced by starvation in
           eukaryotic cells. Apg6/Vps30p has two distinct functions
           in the autophagic process, either associated with the
           membrane or in a retrieval step of the carboxypeptidase
           Y sorting pathway.
          Length = 356

 Score = 27.9 bits (62), Expect = 1.6
 Identities = 14/38 (36%), Positives = 21/38 (55%), Gaps = 8/38 (21%)

Query: 84  KVEEEEDEKLEEEEMEEQK--------EEKKEKVEEEE 113
           K EE   ++LEE E E+          +E+KE++E EE
Sbjct: 77  KEEERLLDELEELEKEDDDLDGELVELQEEKEQLENEE 114


>gnl|CDD|221466 pfam12220, U1snRNP70_N, U1 small nuclear ribonucleoprotein of 70kDa
           MW N terminal.  This domain is found in eukaryotes. This
           domain is about 90 amino acids in length. This domain is
           found associated with pfam00076. This domain is part of
           U1 snRNP, which is the pre-mRNA binding protein of the
           penta-snRNP spliceosome complex. It extends over a
           distance of 180 A from its RNA binding domain, wraps
           around the core domain of U1 snRNP consisting of the
           seven Sm proteins and finally contacts U1-C, which is
           crucial for 5'-splice-site recognition.
          Length = 94

 Score = 26.8 bits (60), Expect = 1.6
 Identities = 16/41 (39%), Positives = 25/41 (60%), Gaps = 3/41 (7%)

Query: 82  EDKVEEEEDEKLE--EEEMEEQKEEKKEKVEEEEVKEERRE 120
           +D  +E   E  E   E+ E +K EKKEK  E++++EE +E
Sbjct: 48  KDYKDEPPPEPTETWLEKREREKREKKEK-LEKKLEEELKE 87


>gnl|CDD|235316 PRK04863, mukB, cell division protein MukB; Provisional.
          Length = 1486

 Score = 28.0 bits (63), Expect = 1.7
 Identities = 22/48 (45%), Positives = 30/48 (62%), Gaps = 8/48 (16%)

Query: 76  EKKKRREDKVEEEEDEKLEE-----EEMEEQKEEKKEKVE--EEEVKE 116
           EK +R +  +EE E E+LEE     EE +EQ+EE + + E  EEEV E
Sbjct: 348 EKIERYQADLEELE-ERLEEQNEVVEEADEQQEENEARAEAAEEEVDE 394


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
           consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 27.7 bits (61), Expect = 1.7
 Identities = 13/45 (28%), Positives = 23/45 (51%), Gaps = 1/45 (2%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
            +EE  K ++ K  + + E    E+  E KE + +K + E V E+
Sbjct: 293 NDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQK-KREPVAED 336



 Score = 26.5 bits (58), Expect = 4.7
 Identities = 15/45 (33%), Positives = 24/45 (53%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
           K E +K + E K  +EE  K ++ +  + K+E K   +E E KE 
Sbjct: 280 KREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKEL 324


>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
          Length = 158

 Score = 27.4 bits (61), Expect = 1.8
 Identities = 9/30 (30%), Positives = 20/30 (66%)

Query: 87  EEEDEKLEEEEMEEQKEEKKEKVEEEEVKE 116
            EE EK ++E  +E +E++  K ++E+ ++
Sbjct: 92  PEESEKEQKEVSKETEEKEAIKAKKEKKEK 121



 Score = 26.6 bits (59), Expect = 3.1
 Identities = 8/34 (23%), Positives = 22/34 (64%)

Query: 86  EEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERR 119
            EE +++ +E   E +++E  +  +E++ K+E++
Sbjct: 92  PEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKK 125



 Score = 25.5 bits (56), Expect = 7.9
 Identities = 12/31 (38%), Positives = 22/31 (70%)

Query: 82  EDKVEEEEDEKLEEEEMEEQKEEKKEKVEEE 112
            +K ++E  ++ EE+E  + K+EKKEK E++
Sbjct: 95  SEKEQKEVSKETEEKEAIKAKKEKKEKKEKK 125



 Score = 25.5 bits (56), Expect = 8.0
 Identities = 13/44 (29%), Positives = 30/44 (68%), Gaps = 1/44 (2%)

Query: 74  EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
            EE +K +++  +E E+++  + + E+ KE+K++KV E+  K++
Sbjct: 92  PEESEKEQKEVSKETEEKEAIKAKKEK-KEKKEKKVAEKLAKKK 134


>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
           Mitofilin controls mitochondrial cristae morphology.
           Mitofilin is enriched in the narrow space between the
           inner boundary and the outer membranes, where it forms a
           homotypic interaction and assembles into a large
           multimeric protein complex. The first 78 amino acids
           contain a typical amino-terminal-cleavable mitochondrial
           presequence rich in positive-charged and hydroxylated
           residues and a membrane anchor domain. In addition, it
           has three centrally located coiled coil domains.
          Length = 493

 Score = 27.7 bits (62), Expect = 1.9
 Identities = 15/48 (31%), Positives = 27/48 (56%), Gaps = 4/48 (8%)

Query: 75  EEKKKRREDKVEEEEDEKLEEEEMEEQKEEK----KEKVEEEEVKEER 118
           +EK++    K+EEE   +LE +E   +K+ +    +EK E  +  EE+
Sbjct: 195 KEKREELLSKLEEELLARLESKEAALEKQLRLEFEREKEELRKKYEEK 242



 Score = 27.3 bits (61), Expect = 2.5
 Identities = 12/62 (19%), Positives = 28/62 (45%), Gaps = 3/62 (4%)

Query: 59  LSIHRRIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEER 118
            S    + +     K+  + K  E++  E   ++  EE + + +EE   ++E +E   E+
Sbjct: 166 ASAKEELDQ---LSKKLAELKAEEEEELERALKEKREELLSKLEEELLARLESKEAALEK 222

Query: 119 RE 120
           + 
Sbjct: 223 QL 224



 Score = 25.8 bits (57), Expect = 8.4
 Identities = 19/52 (36%), Positives = 28/52 (53%), Gaps = 5/52 (9%)

Query: 74  EEEKKKRREDKVEEEEDEKLEEEEMEEQKE-----EKKEKVEEEEVKEERRE 120
           EEE+ +R   +  EE   KLEEE +   +      EK+ ++E E  KEE R+
Sbjct: 186 EEEELERALKEKREELLSKLEEELLARLESKEAALEKQLRLEFEREKEELRK 237


>gnl|CDD|219358 pfam07271, Cytadhesin_P30, Cytadhesin P30/P32.  This family
           consists of several Mycoplasma species specific
           Cytadhesin P32 and P30 proteins. P30 has been found to
           be membrane associated and localised on the tip
           organelle. It is thought that it is important in
           cytadherence and virulence.
          Length = 279

 Score = 27.3 bits (60), Expect = 2.0
 Identities = 10/53 (18%), Positives = 25/53 (47%)

Query: 68  GTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           G   G    K+K +    E+E  E++ E+     ++ +++ +E +  +E   +
Sbjct: 87  GLAIGLPIVKRKEKRLLEEKERQEQMAEQLQRISEQNEQQAIEIDPTEEVNTQ 139


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 27.6 bits (61), Expect = 2.0
 Identities = 13/75 (17%), Positives = 32/75 (42%)

Query: 52  DDNQWHHLSIHRRIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEE 111
           D +       +      +G   EE + K   +   + E E+ E+ E  E+++ ++E    
Sbjct: 276 DGDDEGREEDYISDSSASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLS 335

Query: 112 EEVKEERREGGRGGR 126
           ++ K+ ++  G+   
Sbjct: 336 KKGKKLKKLKGKKNG 350


>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
           eukaryotic snRNP [Transcription].
          Length = 564

 Score = 27.4 bits (61), Expect = 2.0
 Identities = 10/36 (27%), Positives = 22/36 (61%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEK 108
           K   +K   +  +EE  +E +EE  ++ +K+++K+K
Sbjct: 400 KTALEKAEGKKAIEEIREELIEEGLLKSKKKKRKKK 435



 Score = 26.2 bits (58), Expect = 5.6
 Identities = 14/37 (37%), Positives = 25/37 (67%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKV 109
           K E++ K+ E+K+E++EDE  E E+  E+  +K E +
Sbjct: 290 KLEKELKKLENKLEKQEDELEELEKAAEELRQKGELL 326



 Score = 26.2 bits (58), Expect = 6.1
 Identities = 13/49 (26%), Positives = 22/49 (44%)

Query: 72  GKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
            K+ +K K  +  ++ +  E  E     E  +   EK E ++  EE RE
Sbjct: 369 FKKYKKLKGAKVNLDRQLSELKEAIAYYESAKTALEKAEGKKAIEEIRE 417


>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein. 
          Length = 198

 Score = 27.0 bits (60), Expect = 2.1
 Identities = 12/40 (30%), Positives = 22/40 (55%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEE 112
           K++++K   E+K  E +D+K EE E +    E+    +E 
Sbjct: 70  KKDKEKLTEEEKKPESDDDKTEENENDPDNNEESGDSQES 109



 Score = 25.5 bits (56), Expect = 7.5
 Identities = 11/45 (24%), Positives = 24/45 (53%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
             ++ KK+ ++K+ EEE +   +++  E+ E   +  EE    +E
Sbjct: 64  TTKKSKKKDKEKLTEEEKKPESDDDKTEENENDPDNNEESGDSQE 108


>gnl|CDD|233194 TIGR00930, 2a30, K-Cl cotransporter.  [Transport and binding
           proteins, Other].
          Length = 953

 Score = 27.4 bits (61), Expect = 2.1
 Identities = 9/48 (18%), Positives = 19/48 (39%), Gaps = 3/48 (6%)

Query: 75  EEKKKRREDKVE---EEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERR 119
            E  +  E+ +      + EK  E +  +    K  K+ + E++   R
Sbjct: 838 TESMEAFEEMIRPFRLHKTEKDREAKDPKMTWTKPWKITDAELQSNVR 885


>gnl|CDD|234655 PRK00133, metG, methionyl-tRNA synthetase; Reviewed.
          Length = 673

 Score = 27.4 bits (62), Expect = 2.2
 Identities = 10/40 (25%), Positives = 14/40 (35%), Gaps = 1/40 (2%)

Query: 79  KRREDKVEEEEDEKLEEEEMEEQKEEK-KEKVEEEEVKEE 117
            R EDK  E   E  +E    +         + EE + E 
Sbjct: 528 TRIEDKQIEALIEASKEAAAAKAAAAAAAAPLAEEPIAET 567


>gnl|CDD|221408 pfam12072, DUF3552, Domain of unknown function (DUF3552).  This
           presumed domain is functionally uncharacterized. This
           domain is found in bacteria, archaea and eukaryotes.
           This domain is about 200 amino acids in length. This
           domain is found associated with pfam00013, pfam01966.
           This domain has a single completely conserved residue A
           that may be functionally important.
          Length = 201

 Score = 27.1 bits (61), Expect = 2.2
 Identities = 14/44 (31%), Positives = 31/44 (70%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKE 116
           +++E  +++E+ +EE+E E    ++  E+KEE+ E++ EE+ +E
Sbjct: 96  RKDESLEKKEESLEEKEKELAARQQQLEEKEEELEELIEEQQQE 139


>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
          Length = 646

 Score = 27.2 bits (61), Expect = 2.4
 Identities = 12/58 (20%), Positives = 33/58 (56%), Gaps = 5/58 (8%)

Query: 67  EGTGCGKEEEKKKRREDKVEEEEDEKLE--EEEMEEQKEEKKEKVEEE--EVKEERRE 120
                 ++++   +  +++ ++ +E+LE  E+E++E  EE+  ++E E  E+++E   
Sbjct: 69  LNPLREEKKKVSVKSLEELIKDVEEELEKIEKEIKE-LEEEISELENEIKELEQEIER 125


>gnl|CDD|226138 COG3611, DnaB, Replication initiation/membrane attachment protein
           [DNA replication, recombination, and repair].
          Length = 417

 Score = 27.4 bits (61), Expect = 2.4
 Identities = 12/53 (22%), Positives = 21/53 (39%), Gaps = 5/53 (9%)

Query: 76  EKKKRREDKVEEEEDEK-LEEEEMEEQKEEKKEKVE----EEEVKEERREGGR 123
             KK+      +E   K    E  +   +E++ K+E    EE  + E  +  R
Sbjct: 365 LAKKQSSTTKRKENLPKWSNPEYKKTTSQEEQAKLEFIKLEELKRLENGKQKR 417


>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
           (TAF4) is one of several TAFs that bind TBP and is
           involved in forming Transcription Factor IID (TFIID)
           complex.  The TATA Binding Protein (TBP) Associated
           Factor 4 (TAF4) is one of several TAFs that bind TBP and
           are involved in forming the Transcription Factor IID
           (TFIID) complex. TFIID is one of seven General
           Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
           TFIIF, and TFIID) that are involved in accurate
           initiation of transcription by RNA polymerase II in
           eukaryote. TFIID plays an important role in the
           recognition of promoter DNA and assembly of the
           pre-initiation complex. TFIID complex is composed of the
           TBP and at least 13 TAFs. TAFs from various species were
           originally named by their predicted molecular weight or
           their electrophoretic mobility in polyacrylamide gels. A
           new, unified nomenclature for the pol II TAFs has been
           suggested to show the relationship between TAF orthologs
           and paralogs. Several hypotheses are proposed for TAFs
           functions such as serving as activator-binding sites,
           core-promoter recognition or a role in essential
           catalytic activity. Each TAF, with the help of a
           specific activator, is required only for the expression
           of subset of genes and is not universally involved for
           transcription as are GTFs. In yeast and human cells,
           TAFs have been found as components of other complexes
           besides TFIID.   Several TAFs interact via histone-fold
           (HFD) motifs; HFD is the interaction motif involved in
           heterodimerization of the core histones and their
           assembly into nucleosome octamers. The minimal HFD
           contains three alpha-helices linked by two loops and is
           found in core histones, TAFS and many other
           transcription factors. TFIID has a histone octamer-like
           substructure. TAF4 domain interacts with TAF12 and makes
           a novel histone-like heterodimer that binds DNA and has
           a core promoter function of a subset of genes.
          Length = 212

 Score = 26.9 bits (60), Expect = 2.4
 Identities = 15/47 (31%), Positives = 28/47 (59%)

Query: 77  KKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGR 123
           K+ R  +++E EE+EK +EEE E      K + E+  +K++ +E  +
Sbjct: 114 KQLRFLEQLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQK 160



 Score = 26.5 bits (59), Expect = 3.4
 Identities = 12/42 (28%), Positives = 23/42 (54%)

Query: 74  EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVK 115
           EEEK+   E +      +   E+   +QK ++ +K E+EE++
Sbjct: 126 EEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEEMR 167


>gnl|CDD|227701 COG5414, COG5414, TATA-binding protein-associated factor
           [Transcription].
          Length = 392

 Score = 27.4 bits (60), Expect = 2.5
 Identities = 14/69 (20%), Positives = 30/69 (43%)

Query: 52  DDNQWHHLSIHRRIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEE 111
            +      +     +E +   KE+++++    +  +EE +    +E+ E+KEE  E  E 
Sbjct: 273 SEEDLDVGAAEIENKEVSEGDKEQQQEEVENAEAHKEEVQSDRPDEIGEEKEEDDENEEN 332

Query: 112 EEVKEERRE 120
           E   E   +
Sbjct: 333 ERHTELLAD 341


>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
           [Transcription / DNA replication, recombination, and
           repair / Chromatin structure and dynamics].
          Length = 1001

 Score = 27.3 bits (60), Expect = 2.5
 Identities = 14/43 (32%), Positives = 18/43 (41%), Gaps = 2/43 (4%)

Query: 82  EDKVEEEEDEKLEEEEMEE--QKEEKKEKVEEEEVKEERREGG 122
           ED+  +E  E L E+E E     EE  E  +E E K       
Sbjct: 954 EDEESDESSEDLSEDESENDSSDEEDGEDWDELESKAAYDSRP 996


>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207).  This
           family is found in eukaryotes; it has several conserved
           tryptophan residues. The function is not known.
          Length = 261

 Score = 27.0 bits (60), Expect = 2.6
 Identities = 13/42 (30%), Positives = 26/42 (61%), Gaps = 2/42 (4%)

Query: 72  GKEEEKKKRREDKVEE--EEDEKLEEEEMEEQKEEKKEKVEE 111
            K+ +++K+ +  +EE  +++ + E EE E ++   KEK EE
Sbjct: 95  AKQAQRQKKLQKLLEEKQKQEREKEREEAELRQRLAKEKYEE 136



 Score = 25.8 bits (57), Expect = 7.2
 Identities = 10/37 (27%), Positives = 30/37 (81%), Gaps = 1/37 (2%)

Query: 84  KVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           +++E E +KL++++ + ++E +K++ +++E +EER++
Sbjct: 183 RLQEWELKKLKQQQQKREEERRKQRKKQQE-EEERKQ 218



 Score = 25.5 bits (56), Expect = 8.0
 Identities = 13/39 (33%), Positives = 27/39 (69%), Gaps = 1/39 (2%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEE 111
           +E E KK ++ + + EE+ + + ++ +E +EE+K+K EE
Sbjct: 185 QEWELKKLKQQQQKREEERRKQRKKQQE-EEERKQKAEE 222


>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355).  This
           family of proteins is found in bacteria and viruses.
           Proteins in this family are typically between 180 and
           214 amino acids in length.
          Length = 125

 Score = 26.5 bits (59), Expect = 2.6
 Identities = 18/49 (36%), Positives = 31/49 (63%), Gaps = 1/49 (2%)

Query: 73  KEEEKKKRREDKVEEEEDEKL-EEEEMEEQKEEKKEKVEEEEVKEERRE 120
           K + +KK+ E K E E+  K+  EE+ E + E+ ++++EE E +  RRE
Sbjct: 21  KAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEELEAELARRE 69


>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
          Length = 1036

 Score = 27.1 bits (60), Expect = 2.6
 Identities = 23/95 (24%), Positives = 40/95 (42%), Gaps = 11/95 (11%)

Query: 19  YLNVAIKDGGVAINLKLAMGKLDMYIRPNRLR-FDDNQWH--HLSIHRRIQEGTGCGKEE 75
           Y  V + +  + ++   A G       P   R +D+N     H  +   I E     +EE
Sbjct: 379 YAEVVVPERALVLDWVFADGP------PGNARNYDNNGRQDFHAIVPNNIPEELYWVEEE 432

Query: 76  EK--KKRREDKVEEEEDEKLEEEEMEEQKEEKKEK 108
            +  +K +E++   EE  + + E+    K E KEK
Sbjct: 433 HQIYRKLQEERRLREEAIRAKAEKTARMKAEMKEK 467



 Score = 26.8 bits (59), Expect = 3.6
 Identities = 16/50 (32%), Positives = 31/50 (62%), Gaps = 5/50 (10%)

Query: 76  EKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVK-----EERRE 120
           E+K+R  +K+ +EE E+  + E + ++EE+K  +E +  +     E+RRE
Sbjct: 252 EEKRRELEKLAKEEAERERQAEEQRRREEEKAAMEADRAQAKAEVEKRRE 301


>gnl|CDD|224530 COG1614, CdhC, CO dehydrogenase/acetyl-CoA synthase beta subunit
           [Energy production and conversion].
          Length = 470

 Score = 27.1 bits (60), Expect = 2.7
 Identities = 14/31 (45%), Positives = 19/31 (61%)

Query: 91  EKLEEEEMEEQKEEKKEKVEEEEVKEERREG 121
           E+  EEE EE++EE++E  E E   EE   G
Sbjct: 399 ERWAEEEEEEEEEEEEEAAEAEAPMEEPVPG 429



 Score = 27.1 bits (60), Expect = 3.1
 Identities = 17/41 (41%), Positives = 21/41 (51%)

Query: 85  VEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGRGG 125
            EEEE+E+ EEEE  E +   +E V   EV E       GG
Sbjct: 403 EEEEEEEEEEEEEAAEAEAPMEEPVPGFEVPEMPMPAAGGG 443


>gnl|CDD|224236 COG1317, FliH, Flagellar biosynthesis/type III secretory pathway
           protein [Cell motility and secretion / Intracellular
           trafficking and secretion].
          Length = 234

 Score = 27.0 bits (60), Expect = 2.8
 Identities = 15/49 (30%), Positives = 23/49 (46%), Gaps = 1/49 (2%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREG 121
                K   E  V EEE+ +   E  EE+ E   ++  +E ++E  REG
Sbjct: 24  ANAGPKVAPEKAVLEEEELEQALEAKEEELESAAQE-LQEGIEEGAREG 71


>gnl|CDD|218744 pfam05781, MRVI1, MRVI1 protein.  This family consists of mammalian
           MRVI1 proteins which are related to the
           lymphoid-restricted membrane protein (JAW1) and the IP3
           receptor associated cGMP kinase substrates A and B
           (IRAGA and IRAGB). The function of MRVI1 is unknown
           although mutations in the Mrvi1 gene induces myeloid
           leukaemia by altering the expression of a gene important
           for myeloid cell growth and/or differentiation so it has
           been speculated that Mrvi1 is a tumour suppressor gene.
           IRAG is very similar in sequence to MRVI1 and is an
           essential NO/cGKI-dependent regulator of IP3-induced
           calcium release. Activation of cGKI decreases
           IP3-stimulated elevations in intracellular calcium,
           induces smooth muscle relaxation and contributes to the
           antiproliferative and pro-apoptotic effects of NO/cGMP.
           Jaw1 is a member of a class of proteins with
           COOH-terminal hydrophobic membrane anchors and is
           structurally similar to proteins involved in vesicle
           targeting and fusion. This suggests that the function
           and/or the structure of the ER in lymphocytes may be
           modified by lymphoid-restricted resident ER proteins.
          Length = 538

 Score = 27.3 bits (60), Expect = 2.8
 Identities = 16/58 (27%), Positives = 26/58 (44%)

Query: 57  HHLSIHRRIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEV 114
           H  S+ R I   +    EEEKK ++   + E E E+  E   +    E+  + EE + 
Sbjct: 398 HRPSLLRFISTYSWADAEEEKKTKKLQDLREPEGEEAVERTRKPSLSEEVAETEEWDK 455


>gnl|CDD|240327 PTZ00243, PTZ00243, ABC transporter; Provisional.
          Length = 1560

 Score = 27.1 bits (60), Expect = 2.9
 Identities = 11/45 (24%), Positives = 17/45 (37%), Gaps = 1/45 (2%)

Query: 81  REDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGRGG 125
           +E+K  +E D   E  E++          E    K+E    G  G
Sbjct: 882 KENKDSKEGDADAEVAEVDAA-PGGAVDHEPPVAKQEGNAEGGDG 925


>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
           Provisional.
          Length = 695

 Score = 27.2 bits (61), Expect = 2.9
 Identities = 12/44 (27%), Positives = 19/44 (43%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKE 116
           K EE K R E +    E EK   E   ++  E +   +++ V  
Sbjct: 447 KAEEAKARFEARQARLEREKAAREARHKKAAEARAAKDKDAVAA 490


>gnl|CDD|240416 PTZ00432, PTZ00432, falcilysin; Provisional.
          Length = 1119

 Score = 27.1 bits (60), Expect = 2.9
 Identities = 17/63 (26%), Positives = 30/63 (47%)

Query: 54  NQWHHLSIHRRIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEE 113
           N  H +++H    E +   KE  K  + E K       K + +EME+  E+ K++ E ++
Sbjct: 562 NNNHRVTVHLEAVESSKYEKEFNKLVKDELKERLSHLTKEQVDEMEKAYEKFKKEREADD 621

Query: 114 VKE 116
             E
Sbjct: 622 DPE 624


>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
           (DUF2058).  This domain, found in various prokaryotic
           proteins, has no known function.
          Length = 177

 Score = 26.8 bits (60), Expect = 3.0
 Identities = 10/49 (20%), Positives = 26/49 (53%), Gaps = 1/49 (2%)

Query: 73  KEEEKKKRREDKVEEEEDEKLE-EEEMEEQKEEKKEKVEEEEVKEERRE 120
           K +++K+ R+   + +++ K   EE   E+ E  +E   + + + E++ 
Sbjct: 24  KRKQRKQARKGADDGDDELKQAAEEAKAEKAERDRELNRQRQAEAEQKA 72


>gnl|CDD|239286 cd02988, Phd_like_VIAF, Phosducin (Phd)-like family, Viral
           inhibitor of apoptosis (IAP)-associated factor (VIAF)
           subfamily; VIAF is a Phd-like protein that functions in
           caspase activation during apoptosis. It was identified
           as an IAP binding protein through a screen of a human
           B-cell library using a prototype IAP. VIAF lacks a
           consensus IAP binding motif and while it does not
           function as an IAP antagonist, it still plays a
           regulatory role in the complete activation of caspases.
           VIAF itself is a substrate for IAP-mediated
           ubiquitination, suggesting that it may be a target of
           IAPs in the prevention of cell death. The similarity of
           VIAF to Phd points to a potential role distinct from
           apoptosis regulation. Phd functions as a cytosolic
           regulator of G protein by specifically binding to G
           protein betagamma (Gbg)-subunits. The C-terminal domain
           of Phd adopts a thioredoxin fold, but it does not
           contain a CXXC motif. Phd interacts with G protein beta
           mostly through the N-terminal helical domain.
          Length = 192

 Score = 26.8 bits (60), Expect = 3.1
 Identities = 17/43 (39%), Positives = 22/43 (51%), Gaps = 1/43 (2%)

Query: 79  KRREDKVEEEEDEKLEEEEMEEQKEEKKEKVE-EEEVKEERRE 120
           K    K EEEE  +L  +E  E   EKK   E +EE+ EE  +
Sbjct: 16  KPPSPKEEEEEALELAIQEAHENALEKKLLDELDEELDEEEDD 58


>gnl|CDD|233042 TIGR00598, rad14, DNA repair protein.  All proteins in this family
           for which functions are known are used for the
           recognition of DNA damage as part of nucleotide excision
           repair. This family is based on the phylogenomic
           analysis of JA Eisen (1999, Ph.D. Thesis, Stanford
           University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 172

 Score = 26.7 bits (59), Expect = 3.1
 Identities = 17/33 (51%), Positives = 21/33 (63%)

Query: 88  EEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           EE    E+E  EE KEE KEK  E+++KE RR 
Sbjct: 93  EEALDEEKERREESKEEMKEKKFEKKLKELRRA 125


>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
           with the myosin-like domain [Function unknown].
          Length = 499

 Score = 26.9 bits (59), Expect = 3.1
 Identities = 11/47 (23%), Positives = 19/47 (40%)

Query: 77  KKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGR 123
           +  R E    + E    E E    + E +K + E E V++E     +
Sbjct: 84  RALRTELGTAQGEKRAAETEREAARSELQKARQEREAVRQELAAARQ 130


>gnl|CDD|206063 pfam13892, DBINO, DNA-binding domain.  DBINO is a DNA-binding
           domain found on global transcription activator SNF2L1
           proteins and chromatin re-modelling proteins.
          Length = 140

 Score = 26.5 bits (59), Expect = 3.2
 Identities = 13/34 (38%), Positives = 22/34 (64%), Gaps = 1/34 (2%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKK 106
           KEE + ++R +K E  E  K EEE  E +++++K
Sbjct: 83  KEERELRKRAEK-EALEQAKKEEELREAKRQQRK 115


>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P.  This model
          represents the L12P protein of the large (50S) subunit
          of the archaeal ribosome.
          Length = 105

 Score = 26.2 bits (58), Expect = 3.3
 Identities = 10/28 (35%), Positives = 17/28 (60%)

Query: 72 GKEEEKKKRREDKVEEEEDEKLEEEEME 99
               +++  E++ EEEE+E+ EEE M 
Sbjct: 71 AAAAAEEEEEEEEEEEEEEEESEEEAMA 98



 Score = 25.8 bits (57), Expect = 4.2
 Identities = 12/20 (60%), Positives = 17/20 (85%)

Query: 94  EEEEMEEQKEEKKEKVEEEE 113
           EEEE EE++EE++E+  EEE
Sbjct: 76  EEEEEEEEEEEEEEEESEEE 95



 Score = 25.0 bits (55), Expect = 7.4
 Identities = 13/32 (40%), Positives = 18/32 (56%)

Query: 94  EEEEMEEQKEEKKEKVEEEEVKEERREGGRGG 125
                EE++EE++E+ EEEE  EE    G G 
Sbjct: 71  AAAAAEEEEEEEEEEEEEEEESEEEAMAGLGA 102


>gnl|CDD|221756 pfam12757, DUF3812, Protein of unknown function (DUF3812).  This is
           a family of fungal proteins whose function is not known.
          Length = 126

 Score = 26.5 bits (59), Expect = 3.3
 Identities = 12/33 (36%), Positives = 22/33 (66%)

Query: 74  EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKK 106
           E ++ +  E K++EEE ++  EE  E ++E+KK
Sbjct: 94  EAQRARDEEKKLDEEEAKRQHEEAKEREREKKK 126



 Score = 24.9 bits (55), Expect = 9.3
 Identities = 16/38 (42%), Positives = 24/38 (63%), Gaps = 2/38 (5%)

Query: 79  KRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKE 116
           +R E +   +E++KL+EEE + Q EE KE   E E K+
Sbjct: 91  ERAEAQRARDEEKKLDEEEAKRQHEEAKE--REREKKK 126


>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain.  Axonemal
           dynein light chain proteins play a dynamic role in
           flagellar and cilia motility. Eukaryotic cilia and
           flagella are complex organelles consisting of a core
           structure, the axoneme, which is composed of nine
           microtubule doublets forming a cylinder that surrounds a
           pair of central singlet microtubules. This
           ultra-structural arrangement seems to be one of the most
           stable micro-tubular assemblies known and is responsible
           for the flagellar and ciliary movement of a large number
           of organisms ranging from protozoan to mammals. This
           light chain interacts directly with the N-terminal half
           of the heavy chains.
          Length = 189

 Score = 26.4 bits (59), Expect = 3.4
 Identities = 12/38 (31%), Positives = 23/38 (60%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVE 110
           KEE +K+  E + + E  EK EEEE + +++   +++ 
Sbjct: 136 KEELEKRVAELEAKLEAIEKREEEERQIEEKRHADEIA 173



 Score = 26.0 bits (58), Expect = 5.3
 Identities = 15/47 (31%), Positives = 28/47 (59%), Gaps = 3/47 (6%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERR 119
           + E+ K   E ++++ E+EK   EE+E++  E + K+E  E +EE  
Sbjct: 117 QAEQGKSELEQEIKKLEEEK---EELEKRVAELEAKLEAIEKREEEE 160


>gnl|CDD|184064 PRK13461, PRK13461, F0F1 ATP synthase subunit B; Provisional.
          Length = 159

 Score = 26.6 bits (59), Expect = 3.4
 Identities = 19/50 (38%), Positives = 30/50 (60%), Gaps = 3/50 (6%)

Query: 74  EEEKKKRREDKVEEEE---DEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           +E++KK RE K++ E    + K E +++ E+ + K E V EE VKE   E
Sbjct: 49  DEDQKKARELKLKNERELKNAKEEGKKIVEEYKSKAENVYEEIVKEAHEE 98


>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi.  This family includes
           the fifth essential DNA polymerase in yeast EC:2.7.7.7.
           Pol5p is localised exclusively to the nucleolus and
           binds near or at the enhancer region of rRNA-encoding
           DNA repeating units.
          Length = 784

 Score = 26.8 bits (59), Expect = 3.6
 Identities = 15/43 (34%), Positives = 23/43 (53%), Gaps = 2/43 (4%)

Query: 73  KEEEKKKRREDKV--EEEEDEKLEEEEMEEQKEEKKEKVEEEE 113
           K +E K R +     EEE+++ LEE + +E + E  E  E E 
Sbjct: 633 KADENKSRHQQLFEGEEEDEDDLEETDDDEDECEAIEDSESES 675



 Score = 26.4 bits (58), Expect = 4.6
 Identities = 15/55 (27%), Positives = 29/55 (52%)

Query: 67  EGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREG 121
           +    G+EE++    E   +E+E E +E+ E E + + +  + +E+E   E  EG
Sbjct: 642 QQLFEGEEEDEDDLEETDDDEDECEAIEDSESESESDGEDGEEDEQEDDAEANEG 696


>gnl|CDD|227061 COG4717, COG4717, Uncharacterized conserved protein [Function
           unknown].
          Length = 984

 Score = 26.7 bits (59), Expect = 3.6
 Identities = 10/46 (21%), Positives = 24/46 (52%)

Query: 75  EEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           E +++ R+   ++EE+ +  +E  EE+     + +++EEV      
Sbjct: 401 EAEEEARQRLADDEEEVRAGDEAREEKIAANSQVIDKEEVCNLYDR 446


>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 926

 Score = 26.8 bits (59), Expect = 3.7
 Identities = 11/36 (30%), Positives = 19/36 (52%), Gaps = 1/36 (2%)

Query: 77  KKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEE 112
           K+K +E K   +++ K+E   +E + E    K  EE
Sbjct: 798 KQKPKE-KGPNDKEIKIESPSVETEGERCTIKQREE 832


>gnl|CDD|234660 PRK00139, murE,
          UDP-N-acetylmuramoylalanyl-D-glutamate--2,
          6-diaminopimelate ligase; Provisional.
          Length = 460

 Score = 26.6 bits (60), Expect = 3.7
 Identities = 11/51 (21%), Positives = 18/51 (35%), Gaps = 18/51 (35%)

Query: 8  CWLGDGSDGSDYLNVAIKDGGVAI------------------NLKLAMGKL 40
             G   DG D++  AI +G  A+                  +L+ A+  L
Sbjct: 33 ALPGHKVDGRDFIAQAIANGAAAVVAEADGEAGTGVPVIIVPDLRKALALL 83


>gnl|CDD|167284 PRK01833, tatA, twin arginine translocase protein A; Provisional.
          Length = 74

 Score = 25.6 bits (56), Expect = 3.8
 Identities = 12/32 (37%), Positives = 20/32 (62%)

Query: 77  KKKRREDKVEEEEDEKLEEEEMEEQKEEKKEK 108
           KK   +DK ++ E EK+E +E    +++ KEK
Sbjct: 40  KKAMADDKPKDAEFEKVEAKEAASTEQKAKEK 71


>gnl|CDD|227666 COG5374, COG5374, Uncharacterized conserved protein [Function
           unknown].
          Length = 192

 Score = 26.3 bits (58), Expect = 3.9
 Identities = 8/57 (14%), Positives = 25/57 (43%)

Query: 48  RLRFDDNQWHHLSIHRRIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEE 104
             + D  +     +  R+++     +  +K +    K+ ++ +E  E+ + E  K++
Sbjct: 135 GGKIDKMEADSTDLKARLRKAQILLEGLQKNQEELFKLLDKYNELREQVQKESSKKK 191


>gnl|CDD|222160 pfam13476, AAA_23, AAA domain. 
          Length = 204

 Score = 26.4 bits (58), Expect = 3.9
 Identities = 20/45 (44%), Positives = 33/45 (73%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
           K +EKK+R E+  +E E+ + E++ +E+  EEK++K E EE+KEE
Sbjct: 151 KRKEKKERLEELEKELEELEDEKDLLEKLLEEKEKKKELEELKEE 195


>gnl|CDD|235229 PRK04156, gltX, glutamyl-tRNA synthetase; Provisional.
          Length = 567

 Score = 26.7 bits (60), Expect = 4.0
 Identities = 11/31 (35%), Positives = 20/31 (64%)

Query: 86  EEEEDEKLEEEEMEEQKEEKKEKVEEEEVKE 116
            EE+ E+LEE   E  +EE+++K E++ +  
Sbjct: 63  LEEQRERLEELAPELLEEEEEKKEEKKGLPP 93


>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein.  CDC45 is an essential gene
           required for initiation of DNA replication in S.
           cerevisiae, forming a complex with MCM5/CDC46.
           Homologues of CDC45 have been identified in human, mouse
           and smut fungus among others.
          Length = 583

 Score = 26.5 bits (59), Expect = 4.0
 Identities = 11/49 (22%), Positives = 23/49 (46%), Gaps = 4/49 (8%)

Query: 72  GKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
            ++EE  K  +D    E+D+  ++++    +E   E+       EE+R 
Sbjct: 135 EEDEESSKSEDD----EDDDDDDDDDDIATRERSLERRRRRREWEEKRA 179



 Score = 26.1 bits (58), Expect = 6.9
 Identities = 10/51 (19%), Positives = 25/51 (49%)

Query: 74  EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGRG 124
                  R+ + ++++DE+ +EE+ E  K E  E  ++++  ++     R 
Sbjct: 114 PRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATRERS 164



 Score = 25.7 bits (57), Expect = 8.8
 Identities = 8/51 (15%), Positives = 24/51 (47%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGR 123
            E        D  E+++D++  +EE EE  + + ++ ++++  ++      
Sbjct: 112 DEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATRE 162


>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
          Length = 1437

 Score = 26.7 bits (60), Expect = 4.0
 Identities = 12/46 (26%), Positives = 28/46 (60%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEER 118
            + +++  + +  +EEEDEKL +E +E  K+ + EK ++ +  + +
Sbjct: 166 DDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPK 211


>gnl|CDD|241384 cd13230, PH1_SSRP1-like, Structure Specific Recognition protein 1
           (SSRP1) Pleckstrin homology (PH) domain, repeat 1.
           SSRP1 is a component of FACT (facilitator of chromatin
           transcription), an essential chromatin reorganizing
           factor. In yeast FACT (yFACT) is composed of three
           proteins: Spt16/Cdc68, Pob3, and Nhp6. In metazoans the
           Pob3 and Nhp6 orthologs are fused to form SSRP1/T160 in
           human and mouse, respectively. The middle domain of the
           Pob3 subunit (Pob3-M) has an unusual double pleckstrin
           homology (PH) architecture. yFACT interacts in a
           physiologically important way with the central
           single-strand DNA binding factor RPA to promote a step
           in DNA Replication. Coordinated function by yFACT and
           RPA is important during nucleosome deposition. These
           results support the model that the FACT family has an
           essential role in constructing nucleosomes during DNA
           replication, and suggest that RPA contributes to this
           process. Members of this cd are composed of the first
           PH-like repeat. PH domains have diverse functions, but
           in general are involved in targeting proteins to the
           appropriate cellular location or in the interaction with
           a binding partner. They share little sequence
           conservation, but all have a common fold, which is
           electrostatically polarized. Less than 10% of PH domains
           bind phosphoinositide phosphates (PIPs) with high
           affinity and specificity. PH domains are distinguished
           from other PIP-binding domains by their specific
           high-affinity binding to PIPs with two vicinal phosphate
           groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3
           which results in targeting some PH domain proteins to
           the plasma membrane. A few display strong specificity in
           lipid binding. Any specificity is usually determined by
           loop regions or insertions in the N-terminus of the
           domain, which are not conserved across all PH domains.
           PH domains are found in cellular signaling proteins such
           as serine/threonine kinase, tyrosine kinases, regulators
           of G-proteins, endocytotic GTPases, adaptors, as well as
           cytoskeletal associated molecules and in lipid
           associated enzymes.
          Length = 137

 Score = 26.0 bits (58), Expect = 4.1
 Identities = 16/31 (51%), Positives = 19/31 (61%), Gaps = 1/31 (3%)

Query: 83  DKVEEEEDE-KLEEEEMEEQKEEKKEKVEEE 112
           DK EE E E  L EEE+EE+ + K EK  E 
Sbjct: 85  DKDEEIELELNLTEEELEEKYKGKLEKEYEG 115


>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
          Length = 259

 Score = 26.6 bits (59), Expect = 4.2
 Identities = 7/32 (21%), Positives = 20/32 (62%)

Query: 86  EEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
           E+E++      E E+ K++++ + ++++  EE
Sbjct: 75  EDEKELSASSLEAEQAKKKEEAEAKKKKEMEE 106



 Score = 26.6 bits (59), Expect = 4.2
 Identities = 11/36 (30%), Positives = 19/36 (52%)

Query: 85  VEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
            ++ EDEK       E ++ KK++  E + K+E  E
Sbjct: 71  DQKPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEE 106


>gnl|CDD|220112 pfam09110, HAND, HAND.  The HAND domain adopts a secondary
           structure consisting of four alpha helices, three of
           which (H2, H3, H4) form an L-like configuration. Helix
           H2 runs antiparallel to helices H3 and H4, packing
           closely against helix H4, whilst helix H1 reposes in the
           concave surface formed by these three helices and runs
           perpendicular to them. The domain confers DNA and
           nucleosome binding properties to the protein.
          Length = 109

 Score = 25.7 bits (57), Expect = 4.2
 Identities = 14/57 (24%), Positives = 32/57 (56%)

Query: 56  WHHLSIHRRIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEE 112
           ++   I  ++        +EE ++ RE K + E+++    E + E++EE+K+++ EE
Sbjct: 52  YYKKKIGYKVPLDDVGDGDEEDEEEREAKRKLEQEKIDNAEPLTEEEEEEKQRLLEE 108


>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex.  This
           entry is characterized by proteins with alternating
           conserved and low-complexity regions. Bud13 together
           with Snu17p and a newly identified factor,
           Pml1p/Ylr016c, form a novel trimeric complex. called The
           RES complex, pre-mRNA retention and splicing complex.
           Subunits of this complex are not essential for viability
           of yeasts but they are required for efficient splicing
           in vitro and in vivo. Furthermore, inactivation of this
           complex causes pre-mRNA leakage from the nucleus. Bud13
           contains a unique, phylogenetically conserved C-terminal
           region of unknown function.
          Length = 141

 Score = 26.1 bits (58), Expect = 4.3
 Identities = 13/41 (31%), Positives = 30/41 (73%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEE 113
           ++ E+K+R +++ E +E+++ E  +   QKEE+++++EE E
Sbjct: 16  EKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELE 56



 Score = 25.3 bits (56), Expect = 8.1
 Identities = 16/45 (35%), Positives = 28/45 (62%)

Query: 76  EKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           +K  R  D  E+ E+++ E+EE E ++E++KE  +    KEER +
Sbjct: 6   DKSGRIIDIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREK 50


>gnl|CDD|235364 PRK05201, hslU, ATP-dependent protease ATP-binding subunit HslU;
           Provisional.
          Length = 443

 Score = 26.6 bits (60), Expect = 4.3
 Identities = 19/42 (45%), Positives = 26/42 (61%), Gaps = 7/42 (16%)

Query: 77  KKKRREDKVEEEEDEKLEEEE------MEEQKEEKKEKVEEE 112
           KKK+R+ KV+E   + L EEE      MEE K+E  E+VE+ 
Sbjct: 210 KKKKRKLKVKEAR-KILIEEEAAKLIDMEEIKQEAIERVEQN 250


>gnl|CDD|176486 cd08665, APC10-CUL7, APC10-like DOC1 domain of CUL7, subunit of
          the SCF-ROC1-like E3 ubiquitin ligase complex that
          mediates substrate ubiquitination.  This model
          represents the APC10/DOC1 domain present in CUL7, a
          subunit of the SCF-ROC1-like E3 Ubiquitin (Ub) ligase
          complex, which mediates substrate ubiquitination (or
          ubiquitylation), and is a component of the
          ubiquitin-26S proteasome pathway for selective
          proteolytic degradation.  CUL7 is a member of the
          Cullin-RING ligase family and functions as a molecular
          scaffold assembling the SCF-ROC1-like E3 Ub ligase
          complex consisting of the adapter protein Skp1, CUL7,
          the WD40 repeat-containing F-box Fbw8 (also known as
          Fbx29), and ROC1 (RING-box protein 1). CUL7 is a large
          protein with a C-terminal cullin domain that binds ROC1
          and additional domains, including an APC10/DOC1 domain.
          While the Fbw8 protein is responsible for substrate
          protein recognition, the ROC1 RING domain recruits an
          Ub-charged E2 Ub-conjugating enzyme for substrate
          ubiquitination. It remains to be determined how CUL7
          binds to the Skp1-Fbw8 heterodimer. The CUL7 E3 Ub
          ligase has been implicated in the proteasomal
          degradation of the cellular proteins, cyclin D1, an
          important regulator of the G1 to S-phase cell cycle
          progression, and insulin receptor substrate 1, a
          critical component of the signaling pathways downstream
          of the insulin and insulin-like growth factor 1
          receptor. CUL7 appears to be an important regulator of
          placental development. Germ line mutations of CUL7 are
          linked to 3-M syndrome and Yakuts short stature
          syndrome.
          Length = 131

 Score = 26.0 bits (57), Expect = 4.4
 Identities = 13/41 (31%), Positives = 20/41 (48%)

Query: 9  WLGDGSDGSDYLNVAIKDGGVAINLKLAMGKLDMYIRPNRL 49
          W  +GS GS Y+N+ +  G V   L + +   D    P R+
Sbjct: 28 WESNGSTGSHYINIHMHRGVVIRQLYMLVASEDSSYMPARV 68


>gnl|CDD|151491 pfam11044, TMEMspv1-c74-12, Plectrovirus spv1-c74 ORF 12
          transmembrane protein.  This is a family of proteins
          expressed by Plectroviruses. The plectroviruses are
          single-stranded DNA viruses belonging to the
          Inoviridae. Except that it is a putative transmembrane
          protein the function is not known.
          Length = 49

 Score = 24.8 bits (54), Expect = 4.5
 Identities = 10/29 (34%), Positives = 20/29 (68%)

Query: 56 WHHLSIHRRIQEGTGCGKEEEKKKRREDK 84
          W  LSI+++I++  G  K++++ KR+E  
Sbjct: 20 WIGLSIYQKIRQIRGKKKDKKEIKRKESN 48


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 26.6 bits (59), Expect = 4.5
 Identities = 11/48 (22%), Positives = 27/48 (56%)

Query: 75  EEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGG 122
           E+ ++  +++ E+E   + + +E+E  +E+K EK+  E  + +    G
Sbjct: 193 EDHQELLQEEYEKEVKAEKKRQELERVEEKKLEKMAPEASRLDEMSEG 240


>gnl|CDD|129416 TIGR00316, cdhC, CO dehydrogenase/CO-methylating acetyl-CoA
           synthase complex, beta subunit.  Nomenclature follows
           the description for Methanosarcina thermophila. The
           CO-methylating acetyl-CoA synthase is considered the
           defining enzyme of the Wood-Ljungdahl pathway, used for
           acetate catabolism by sulfate reducing bacteria but for
           acetate biosynthesis by acetogenic bacteria such as
           oorella thermoacetica (f. Clostridium thermoaceticum)
           [Energy metabolism, Chemoautotrophy].
          Length = 458

 Score = 26.4 bits (58), Expect = 4.6
 Identities = 10/23 (43%), Positives = 15/23 (65%)

Query: 82  EDKVEEEEDEKLEEEEMEEQKEE 104
           +  V E ++E++EEEE   Q EE
Sbjct: 397 KRVVREVDEEEIEEEEEAMQPEE 419



 Score = 26.0 bits (57), Expect = 6.2
 Identities = 9/27 (33%), Positives = 14/27 (51%)

Query: 77  KKKRREDKVEEEEDEKLEEEEMEEQKE 103
           K+  RE   EE E+E+   +  E + E
Sbjct: 397 KRVVREVDEEEIEEEEEAMQPEEMEME 423


>gnl|CDD|218538 pfam05285, SDA1, SDA1.  This family consists of several SDA1
           protein homologues. SDA1 is a Saccharomyces cerevisiae
           protein which is involved in the control of the actin
           cytoskeleton. The protein is essential for cell
           viability and is localised in the nucleus.
          Length = 317

 Score = 26.5 bits (59), Expect = 4.7
 Identities = 18/39 (46%), Positives = 23/39 (58%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEE 111
           K+E  KK +ED  EE  +E  EE   EE+ E +KEK  E
Sbjct: 138 KDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEKASE 176


>gnl|CDD|235240 PRK04173, PRK04173, glycyl-tRNA synthetase; Provisional.
          Length = 456

 Score = 26.2 bits (59), Expect = 4.9
 Identities = 12/32 (37%), Positives = 15/32 (46%)

Query: 77  KKKRREDKVEEEEDEKLEEEEMEEQKEEKKEK 108
           KK+ R D + EE     E    EE KE  +E 
Sbjct: 93  KKRYRADHLIEELGIDAEGLSNEELKELIREN 124


>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM).  This
           family consists of several Plasmodium falciparum SPAM
           (secreted polymorphic antigen associated with
           merozoites) proteins. Variation among SPAM alleles is
           the result of deletions and amino acid substitutions in
           non-repetitive sequences within and flanking the alanine
           heptad-repeat domain. Heptad repeats in which the a and
           d position contain hydrophobic residues generate
           amphipathic alpha-helices which give rise to helical
           bundles or coiled-coil structures in proteins. SPAM is
           an example of a P. falciparum antigen in which a
           repetitive sequence has features characteristic of a
           well-defined structural element.
          Length = 164

 Score = 26.0 bits (57), Expect = 4.9
 Identities = 19/48 (39%), Positives = 30/48 (62%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           KE E  K  + + +EEE+E+ EEE  E +  E +E++ E+E +EE  E
Sbjct: 38  KENEDVKDEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDE 85



 Score = 25.6 bits (56), Expect = 7.7
 Identities = 14/47 (29%), Positives = 36/47 (76%)

Query: 74  EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           +EE+++  E+++EE ED + EEE +E+++EE++++ +  ++K+  ++
Sbjct: 51  DEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDLKDIEKK 97



 Score = 25.2 bits (55), Expect = 8.7
 Identities = 20/48 (41%), Positives = 33/48 (68%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           KE+  K+  + K E++ED++ EEEE EE+ EE ++  +EEE+ E+  E
Sbjct: 33  KEDIIKENEDVKDEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEE 80


>gnl|CDD|235850 PRK06669, fliH, flagellar assembly protein H; Validated.
          Length = 281

 Score = 26.1 bits (58), Expect = 5.0
 Identities = 14/48 (29%), Positives = 26/48 (54%)

Query: 74  EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREG 121
           +EE  K+ ++     E  +++ E  +E+ EE+ E++ EE   E   EG
Sbjct: 80  KEELLKKTDEASSIIEKLQMQIEREQEEWEEELERLIEEAKAEGYEEG 127


>gnl|CDD|114603 pfam05887, Trypan_PARP, Procyclic acidic repetitive protein (PARP).
            This family consists of several Trypanosoma brucei
           procyclic acidic repetitive protein (PARP) like
           sequences. The procyclic acidic repetitive protein
           (parp) genes of Trypanosoma brucei encode a small family
           of abundant surface proteins whose expression is
           restricted to the procyclic form of the parasite. They
           are found at two unlinked loci, parpA and parpB;
           transcription of both loci is developmentally regulated.
          Length = 145

 Score = 26.1 bits (56), Expect = 5.0
 Identities = 22/59 (37%), Positives = 34/59 (57%), Gaps = 2/59 (3%)

Query: 64  RIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQ--KEEKKEKVEEEEVKEERRE 120
           + ++GT  G ++      +D+ EEEE+ + EEE  EE   +EE +E+ E EE  EE  E
Sbjct: 43  KGEKGTKVGADDTNGTDPDDEPEEEEEPEPEEEGEEEPEPEEEGEEEPEPEETGEEEPE 101


>gnl|CDD|237147 PRK12586, PRK12586, putative monovalent cation/H+ antiporter
           subunit G; Reviewed.
          Length = 145

 Score = 25.9 bits (57), Expect = 5.1
 Identities = 5/39 (12%), Positives = 12/39 (30%), Gaps = 3/39 (7%)

Query: 82  EDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
                      L     +   E  + + ++   +EE R+
Sbjct: 104 RKNDAHTHASILLSSNEQNSTEALQLRAKK---REEHRK 139


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 26.6 bits (59), Expect = 5.1
 Identities = 13/48 (27%), Positives = 27/48 (56%)

Query: 75  EEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGG 122
           EE + + ++  EE  + + + EE++E+ E  + ++EE E + E  E  
Sbjct: 326 EELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESR 373


>gnl|CDD|180707 PRK06811, PRK06811, RNA polymerase factor sigma-70; Validated.
          Length = 189

 Score = 26.1 bits (58), Expect = 5.2
 Identities = 9/35 (25%), Positives = 18/35 (51%)

Query: 77  KKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEE 111
           +K  + ++++  ++  L  EE  E +   KE  EE
Sbjct: 90  RKLTKNNEIDSIDEFILISEESIENEIILKENKEE 124


>gnl|CDD|227447 COG5117, NOC3, Protein involved in the nuclear export of
           pre-ribosomes [Translation, ribosomal structure and
           biogenesis / Intracellular trafficking and secretion].
          Length = 657

 Score = 26.5 bits (58), Expect = 5.5
 Identities = 14/81 (17%), Positives = 36/81 (44%), Gaps = 3/81 (3%)

Query: 41  DMYIRPNRLRFDDNQWHH---LSIHRRIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEE 97
           D  +RP      +N+ ++   +     +      G+E   +   E  +E + D K E+++
Sbjct: 57  DYELRPRVSSSWNNESYNRLPIKTKDNVVADVNNGEEFLSESESEASLEIDSDIKDEKQK 116

Query: 98  MEEQKEEKKEKVEEEEVKEER 118
             E+++   E   ++++  E+
Sbjct: 117 SLEEQKIAPEIPVKQQIDSEK 137


>gnl|CDD|188965 cd09566, SAM_liprin-beta1,2_repeat2, SAM domain of liprin-beta1,2
          proteins repeat 2.  SAM (sterile alpha motif) domain
          repeat 2 of liprin-beta1,2 proteins is a
          protein-protein interaction domain. Liprin-beta
          proteins contain three copies (repeats) of SAM domain.
          They may form heterodimers with liprin-alpha proteins
          through their SAM domains. It was suggested based on
          bioinformatic approaches that the second SAM domain of
          liprin-beta potentially is able to form polymers.
          Liprins were originally identified as LAR (leukocyte
          common antigen-related) transmembrane protein-tyrosine
          phosphatase-interacting proteins. They participate in
          mammary gland development, in axon guidance, and in the
          maintenance of lymphatic vessel integrity.
          Length = 63

 Score = 25.0 bits (55), Expect = 5.5
 Identities = 10/27 (37%), Positives = 12/27 (44%), Gaps = 7/27 (25%)

Query: 47 NRLRFDD-------NQWHHLSIHRRIQ 66
          + L  DD       +  HH SI R IQ
Sbjct: 34 HYLTVDDLLHLKVTSALHHASIRRGIQ 60


>gnl|CDD|116948 pfam08367, M16C_assoc, Peptidase M16C associated.  This domain
           appears in eukaryotes as well as bacteria and tends to
           be found near the C-terminus of the metalloprotease M16C
           (pfam05193).
          Length = 248

 Score = 26.0 bits (58), Expect = 5.7
 Identities = 14/39 (35%), Positives = 25/39 (64%)

Query: 82  EDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           E+K+ EEE EKLE+++    +E+K++ +E     EE + 
Sbjct: 8   EEKLNEEEREKLEKKKSSLSEEDKEKIIERGLELEELQS 46


>gnl|CDD|206687 cd01900, YchF, YchF GTPase.  YchF is a member of the Obg family,
           which includes four other subfamilies of GTPases: Obg,
           DRG, Ygr210, and NOG1. Obg is an essential gene that is
           involved in DNA replication in C. crescentus and
           Streptomyces griseus and is associated with the
           ribosome. Several members of the family, including YchF,
           possess the TGS domain related to the RNA-binding
           proteins. Experimental data and genomic analysis suggest
           that YchF may be part of a nucleoprotein complex and may
           function as a GTP-dependent translational factor.
          Length = 274

 Score = 25.9 bits (58), Expect = 5.8
 Identities = 17/42 (40%), Positives = 26/42 (61%), Gaps = 2/42 (4%)

Query: 76  EKKKRREDKVEEEEDEKLE--EEEMEEQKEEKKEKVEEEEVK 115
           EKK +  DK  +EE E LE  +E +EE K  +  ++ +EE+K
Sbjct: 144 EKKAKSGDKEAKEELELLEKIKEHLEEGKPARTLELTDEEIK 185


>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
          Length = 1068

 Score = 26.2 bits (58), Expect = 5.8
 Identities = 7/28 (25%), Positives = 15/28 (53%)

Query: 99  EEQKEEKKEKVEEEEVKEERREGGRGGR 126
            E+ + +++   + E K ER++  R  R
Sbjct: 583 GEETKPQEQPAPKAEAKPERQQDRRKPR 610



 Score = 26.2 bits (58), Expect = 6.8
 Identities = 12/47 (25%), Positives = 32/47 (68%), Gaps = 1/47 (2%)

Query: 75  EEKKKRREDKVEEEEDEK-LEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           E +++R ++K + +++ K L  EE   Q+ E++E+V++ + + ++R+
Sbjct: 678 ERQRRRNDEKRQAQQEAKALNVEEQSVQETEQEERVQQVQPRRKQRQ 724


>gnl|CDD|240292 PTZ00148, PTZ00148, 40S ribosomal protein S8; Provisional.
          Length = 205

 Score = 25.7 bits (57), Expect = 6.0
 Identities = 11/51 (21%), Positives = 26/51 (50%)

Query: 71  CGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREG 121
            G +  KKK+ + + E ++ +K  E  + + +  KK    ++ + E+ + G
Sbjct: 117 YGIDLGKKKKYKKEEENKKAKKKSESLVAKLEVRKKNHKIDKALLEQFQSG 167


>gnl|CDD|237875 PRK14974, PRK14974, cell division protein FtsY; Provisional.
          Length = 336

 Score = 26.1 bits (58), Expect = 6.0
 Identities = 18/44 (40%), Positives = 30/44 (68%)

Query: 72  GKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVK 115
            K +EK  +  +KVEE+ +E+ EEE  E ++EE++E  EE++ K
Sbjct: 3   DKLKEKLSKFVEKVEEKIEEEEEEEAPEAEEEEEEEDEEEKKEK 46


>gnl|CDD|153125 cd02434, Nodulin-21_like_3, Nodulin-21 and CCC1-related protein
           family.  Nodulin-21_like_3: This is a family of proteins
           closely related to nodulin-21, a plant nodule-specific
           protein that may be involved in symbiotic nitrogen
           fixation. This family is also related to CCC1, a yeast
           vacuole transmembrane protein that functions as an iron
           and manganese transporter. .
          Length = 225

 Score = 25.8 bits (57), Expect = 6.0
 Identities = 15/43 (34%), Positives = 15/43 (34%), Gaps = 2/43 (4%)

Query: 72  GKEEEKKKRREDKVEEEEDEKLEEEEMEEQKE--EKKEKVEEE 112
            K E      E K EE E E   E E  E  E    K   EE 
Sbjct: 57  TKAERDFLHSEKKREEWEIENYPEGEKSEMVEIYSLKGLSEEV 99


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
           subunit.  This is a family of proteins which are
           subunits of the eukaryotic translation initiation factor
           3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
           cerevisiae protein eIF3j (HCR1) has been shown to be
           required for processing of 20S pre-rRNA and binds to 18S
           rRNA and eIF3 subunits Rpg1p and Prt1p.
          Length = 242

 Score = 26.2 bits (58), Expect = 6.0
 Identities = 13/44 (29%), Positives = 25/44 (56%)

Query: 74  EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
           E+E+K+  + KV  +   K   +   E+KE+ K + EE+ ++E 
Sbjct: 41  EDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLREL 84


>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
          Length = 191

 Score = 25.9 bits (57), Expect = 6.0
 Identities = 12/51 (23%), Positives = 21/51 (41%)

Query: 69  TGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERR 119
           T   K+E+++       E  + E+ EE + +   E  K +    EV  E  
Sbjct: 67  TKKKKKEKEEVPELAAEELSDSEENEENDKKVDYELPKVQNTAAEVNHEDV 117


>gnl|CDD|189762 pfam00901, Orbi_VP5, Orbivirus outer capsid protein VP5.
           cryoelectron microscopy indicates that VP5 is a trimer
           implying that there are 360 copies of VP5 per virion.
          Length = 507

 Score = 26.2 bits (58), Expect = 6.4
 Identities = 17/55 (30%), Positives = 32/55 (58%)

Query: 61  IHRRIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVK 115
           + R+++E     KEEE +++   K+ E+  E LEE     + E K E+ EE++++
Sbjct: 89  LQRKLKELEREQKEEEVREKHNKKIIEKFGEDLEEVYKFMKGEAKVEEEEEKQME 143


>gnl|CDD|215521 PLN02967, PLN02967, kinase.
          Length = 581

 Score = 26.2 bits (57), Expect = 6.4
 Identities = 12/58 (20%), Positives = 26/58 (44%), Gaps = 6/58 (10%)

Query: 65  IQEGTGCGKEEEKKKRREDK------VEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKE 116
           +        +E KK  R  +        + E+EK E++  + +K +K ++  E++  E
Sbjct: 99  VVNEDAALDKESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKMDEDVEDQGSE 156


>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
           [Translation, ribosomal structure and biogenesis].
          Length = 109

 Score = 25.4 bits (56), Expect = 6.5
 Identities = 9/31 (29%), Positives = 16/31 (51%)

Query: 95  EEEMEEQKEEKKEKVEEEEVKEERREGGRGG 125
             E   + +E +E+ +EEE +EE  +    G
Sbjct: 76  GAEAAAEADEAEEEEKEEEAEEESDDDMLFG 106


>gnl|CDD|223626 COG0552, FtsY, Signal recognition particle GTPase [Intracellular
           trafficking and secretion].
          Length = 340

 Score = 26.1 bits (58), Expect = 6.6
 Identities = 14/40 (35%), Positives = 21/40 (52%)

Query: 86  EEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGRGG 125
               ++  EE E  E ++EK E+ +EEE KE   E  + G
Sbjct: 1   MGFFEKLEEELEELEAEKEKIEEEDEEEEKEGWFERLKQG 40


>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II).  Bone
           sialoprotein (BSP) is a major structural protein of the
           bone matrix that is specifically expressed by
           fully-differentiated osteoblasts. The expression of bone
           sialoprotein (BSP) is normally restricted to mineralised
           connective tissues of bones and teeth where it has been
           associated with mineral crystal formation. However, it
           has been found that ectopic expression of BSP occurs in
           various lesions, including oral and extraoral
           carcinomas, in which it has been associated with the
           formation of microcrystalline deposits and the
           metastasis of cancer cells to bone.
          Length = 291

 Score = 25.8 bits (56), Expect = 6.6
 Identities = 16/53 (30%), Positives = 26/53 (49%)

Query: 72  GKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGRG 124
                KK  +ED+ +E+E+E+ EEEE E + EE ++            + G G
Sbjct: 122 AGNAGKKATKEDESDEDEEEEEEEEEEEAEVEENEQGTNGTSTNSTEVDHGNG 174


>gnl|CDD|234352 TIGR03779, Bac_Flav_CT_M, Bacteroides conjugative transposon TraM
           protein.  Members of this protein family are designated
           TraM and are found in a proposed transfer region of a
           class of conjugative transposon found in the Bacteroides
           lineage [Cellular processes, DNA transformation].
          Length = 410

 Score = 25.8 bits (57), Expect = 6.7
 Identities = 9/30 (30%), Positives = 15/30 (50%)

Query: 84  KVEEEEDEKLEEEEMEEQKEEKKEKVEEEE 113
           K +EE++   E EE+E +   +     E E
Sbjct: 141 KTDEEKELLREVEELESRLATEPSPAPELE 170



 Score = 25.8 bits (57), Expect = 8.2
 Identities = 9/28 (32%), Positives = 11/28 (39%)

Query: 86  EEEEDEKLEEEEMEEQKEEKKEKVEEEE 113
           EE+E  +  EE       E     E EE
Sbjct: 144 EEKELLREVEELESRLATEPSPAPELEE 171


>gnl|CDD|221122 pfam11490, DNA_pol3_alph_N, DNA polymerase III polC-type
           N-terminus.  This is an N-terminal domain of DNA
           polymerase III polC subunit A that is found only in
           Firmicutes. DNA polymerase polC-type III enzyme
           functions as the 'replicase' in low G + C Gram-positive
           bacteria. Purine asymmetry is a characteristic of
           organisms with a heterodimeric DNA polymerase III
           alpha-subunit constituted by polC which probably plays a
           direct role in the maintenance of strand-biased gene
           distribution; since, among prokaryotic genomes, the
           distribution of genes on the leading and lagging strands
           of the replication fork is known to be biased. The
           domain is associated with DNA_pol3_alpha pfam07733.
          Length = 180

 Score = 25.7 bits (57), Expect = 6.9
 Identities = 15/33 (45%), Positives = 24/33 (72%)

Query: 84  KVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKE 116
           +V+E+E  + E EE EEQKEE++ K+ EE ++ 
Sbjct: 134 EVDEDESSEEEIEEFEEQKEEEEAKLAEEALEA 166


>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572).  Family of
           eukaryotic proteins with undetermined function.
          Length = 321

 Score = 25.9 bits (57), Expect = 6.9
 Identities = 13/43 (30%), Positives = 22/43 (51%), Gaps = 2/43 (4%)

Query: 75  EEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
             +KK  E+  EEEEDE L +      + E+  +  ++E  E+
Sbjct: 178 RREKKEEEE--EEEEDEALIKSLSFGPETEEDRRRADDEDSED 218


>gnl|CDD|224188 COG1269, NtpI, Archaeal/vacuolar-type H+-ATPase subunit I [Energy
           production and conversion].
          Length = 660

 Score = 26.2 bits (58), Expect = 6.9
 Identities = 7/47 (14%), Positives = 19/47 (40%), Gaps = 1/47 (2%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERR 119
              E ++   +  ++E +    E E   +K  ++     E ++ E+ 
Sbjct: 230 LISELEEVIAEI-QDELESLRSELEALAEKIAEELLAVREILEIEKA 275


>gnl|CDD|173534 PTZ00341, PTZ00341, Ring-infected erythrocyte surface antigen;
            Provisional.
          Length = 1136

 Score = 25.9 bits (56), Expect = 7.0
 Identities = 17/46 (36%), Positives = 24/46 (52%), Gaps = 1/46 (2%)

Query: 75   EEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
            EE  +  E+ VEE  +E  EE   EE  EE  E+ ++E  +E   E
Sbjct: 1089 EENVEEIEENVEENVEENAEEN-AEENAEENAEEYDDENPEEHNEE 1133



 Score = 25.9 bits (56), Expect = 8.9
 Identities = 21/64 (32%), Positives = 32/64 (50%), Gaps = 3/64 (4%)

Query: 60   SIHRRIQEGTGCGKEEEKKKRREDKVEEEEDEKLE---EEEMEEQKEEKKEKVEEEEVKE 116
            +I    +E      EE  ++  E+ VEE  +E +E   EE +EE  EE  E+  EE ++E
Sbjct: 946  NIEEDAEENVEEDAEENVEENVEENVEENVEENVEENVEENVEENVEENVEENVEENIEE 1005

Query: 117  ERRE 120
               E
Sbjct: 1006 NVEE 1009



 Score = 25.5 bits (55), Expect = 9.8
 Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)

Query: 75   EEKKKRREDKVEEEEDEKLEE--EEMEEQKEEKKEKVEEEEVKEERRE 120
            EE ++  E+ +EE  +E +EE  EE+EE  EE  E+  EE  +E   E
Sbjct: 1070 EEIEENIEENIEENVEENVEENVEEIEENVEENVEENAEENAEENAEE 1117


>gnl|CDD|150747 pfam10107, Endonuc_Holl, Endonuclease related to archaeal Holliday
           junction resolvase.  This domain is found in various
           predicted bacterial endonucleases which are distantly
           related to archaeal Holliday junction resolvases.
          Length = 153

 Score = 25.4 bits (56), Expect = 7.2
 Identities = 12/36 (33%), Positives = 21/36 (58%)

Query: 74  EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKV 109
           E+ +   RE +   E + K EE + +E+KE +K+ V
Sbjct: 29  EQWRNAEREAQAMREAEAKFEEWQKKEEKEIRKDAV 64


>gnl|CDD|235396 PRK05299, rpsB, 30S ribosomal protein S2; Provisional.
          Length = 258

 Score = 25.5 bits (57), Expect = 7.4
 Identities = 13/34 (38%), Positives = 21/34 (61%)

Query: 79  KRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEE 112
           ++       EE+E+  EEE EE++EE+ E+ E E
Sbjct: 225 RQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258


>gnl|CDD|219882 pfam08524, rRNA_processing, rRNA processing.  This is a family of
           proteins that are involved in rRNA processing. In a
           localisation study they were found to localise to the
           nucleus and nucleolus. The family also includes other
           metazoa members from plants to mammals where the protein
           has been named BR22 and is associated with TTF-1,
           thyroid transcription factor 1. In the lungs, the family
           binds TTF-1 to form a complex which influences the
           expression of the key lung surfactant protein-B (SP-B)
           and -C (SP-C), the small hydrophobic surfactant proteins
           that maintain surface tension in alveoli.
          Length = 150

 Score = 25.6 bits (56), Expect = 7.4
 Identities = 14/48 (29%), Positives = 30/48 (62%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           K E+KKK  E K   ++ ++ + E+   +++++ EK+E  + K++ RE
Sbjct: 67  KFEKKKKLDEKKEIAKQRKREQREKELAKRQKELEKIELSKKKQKERE 114


>gnl|CDD|133433 cd05297, GH4_alpha_glucosidase_galactosidase, Glycoside Hydrolases
           Family 4; Alpha-glucosidases and alpha-galactosidases.
           Glucosidases cleave glycosidic bonds to release glucose
           from oligosaccharides. Alpha-glucosidases and
           alpha-galactosidases release alpha-D-glucose and
           alpha-D-galactose, respectively, via the hydrolysis of
           alpha-glycopyranoside bonds. Some bacteria
           simultaneously translocate and phosphorylate
           disaccharides via the phosphoenolpyruvate-dependent
           phosphotransferase system (PEP-PTS). After
           translocation, these phospho-disaccharides may be
           hydrolyzed by the GH4 glycoside hydrolases such as the
           alpha-glucosidases. Other organsisms (such as archaea
           and Thermotoga maritima) lack the PEP-PTS system, but
           have several enzymes normally associated with the
           PEP-PTS operon. Alpha-glucosidases and
           alpha-galactosidases are part of the NAD(P)-binding
           Rossmann fold superfamily, which includes a wide variety
           of protein families including the NAD(P)-binding domains
           of alcohol dehydrogenases, tyrosine-dependent
           oxidoreductases, glyceraldehyde-3-phosphate
           dehydrogenases, formate/glycerate dehydrogenases,
           siroheme synthases, 6-phosphogluconate dehydrogenases,
           aminoacid dehydrogenases, repressor rex, and NAD-binding
           potassium channel domains, among others.
          Length = 423

 Score = 26.0 bits (58), Expect = 7.4
 Identities = 13/54 (24%), Positives = 19/54 (35%), Gaps = 5/54 (9%)

Query: 73  KEEEKKKRREDKVEE-----EEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREG 121
           KE +K    E   +E     EE      EE  +    + +K E + VK      
Sbjct: 260 KETKKIWYGEFNEDEYGGRDEEQGWEWYEERLKLILAEIDKEELDPVKRSGEYA 313


>gnl|CDD|204467 pfam10376, Mei5, Double-strand recombination repair protein.  Mei5
           is one of a pair of meiosis-specific proteins which
           facilitate the loading of Dmc1 on to Rad51 on DNA at
           double-strand breaks during recombination. Recombination
           is carried out by a large protein complex based around
           the two RecA homologues, Rad51 and Dmc1. This complex
           may play both a catalytic and a structural role in the
           interaction between homologous chromosomes during
           meiosis. Mei5 is seen to contain a coiled-coli region.
          Length = 212

 Score = 25.5 bits (56), Expect = 7.7
 Identities = 15/51 (29%), Positives = 28/51 (54%), Gaps = 4/51 (7%)

Query: 75  EEKKKRREDKVEEEEDE--KLEEEEMEEQKEEKKEKVEEEEVKEERREGGR 123
           E +K++ E +V+EE D+  +L+  E  E K +  E   +E +K+  +   R
Sbjct: 128 EAEKRKLEYQVDEESDDLRRLKLVEKYEIKNDLSEL--QELIKKWEKAKQR 176


>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
           chain; Provisional.
          Length = 1033

 Score = 25.9 bits (57), Expect = 7.8
 Identities = 10/38 (26%), Positives = 23/38 (60%)

Query: 74  EEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEE 111
           +  +K + +D ++  E E+  +    E++EEK++ +EE
Sbjct: 785 QAHQKGQLKDTIDVAEPEEPGDPLTAEEQEEKEQLLEE 822


>gnl|CDD|227594 COG5269, ZUO1, Ribosome-associated chaperone zuotin [Translation,
           ribosomal structure and biogenesis / Posttranslational
           modification, protein turnover, chaperones].
          Length = 379

 Score = 25.8 bits (56), Expect = 7.9
 Identities = 14/40 (35%), Positives = 21/40 (52%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEE 112
           KE+EK+ ++  K E E   +L+     + K E K K E E
Sbjct: 236 KEQEKEMKKIRKWEREAGARLKALAALKGKAEAKNKAEIE 275


>gnl|CDD|235672 PRK06012, flhA, flagellar biosynthesis protein FlhA; Validated.
          Length = 697

 Score = 25.9 bits (58), Expect = 8.0
 Identities = 12/26 (46%), Positives = 16/26 (61%)

Query: 79  KRREDKVEEEEDEKLEEEEMEEQKEE 104
           ++RE K  E   E+ EEEE  E +EE
Sbjct: 327 RKREKKAAELAAEEAEEEEAAEPEEE 352


>gnl|CDD|100796 PRK01156, PRK01156, chromosome segregation protein; Provisional.
          Length = 895

 Score = 26.0 bits (57), Expect = 8.1
 Identities = 18/63 (28%), Positives = 27/63 (42%), Gaps = 6/63 (9%)

Query: 61  IHRRIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQK-----EEKKEKVEEEEVK 115
           I +++        +  KKK R D +  +  E LE  EM+        E  K+K+EE    
Sbjct: 327 IIKKLSVLQKDYNDYIKKKSRYDDLNNQILE-LEGYEMDYNSYLKSIESLKKKIEEYSKN 385

Query: 116 EER 118
            ER
Sbjct: 386 IER 388


>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2.  PPP4R2 (protein phosphatase 4 core
           regulatory subunit R2) is the regulatory subunit of the
           histone H2A phosphatase complex. It has been shown to
           confer resistance to the anticancer drug cisplatin in
           yeast, and may confer resistance in higher eukaryotes.
          Length = 285

 Score = 25.6 bits (56), Expect = 8.3
 Identities = 15/51 (29%), Positives = 30/51 (58%), Gaps = 2/51 (3%)

Query: 67  EGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
              G   + +K    ED   +++ + +EE+E++E +EE++ + EEEE  E+
Sbjct: 236 GPKGSDVKNKKSDDEEDD--DQDGDYVEEKELKEDEEEEETEEEEEEEDED 284


>gnl|CDD|240521 cd13394, Syo1_like, Fungal symportin 1 (syo1) and similar proteins.
            This family of eukaryotic proteins includes
           Saccharomyces cerevisiae Ydl063c and Chaetomium
           thermophilum Syo1, which mediate the co-import of two
           ribosomal proteins, Rpl5 and Rpl11 (which both interact
           with 5S rRNA) into the nucleus. Import precedes their
           association with rRNA and subsequent ribosome assembly
           in the nucleolus. The primary structure of syo1 is a
           mixture of Armadillo- (ARM, N-terminal part of syo1) and
           HEAT-repeats (C-terminal part of syo1).
          Length = 597

 Score = 25.7 bits (57), Expect = 8.3
 Identities = 11/30 (36%), Positives = 19/30 (63%)

Query: 88  EEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
           E D+K +E   +EQ +E +E+ E++E  E 
Sbjct: 304 ELDKKDDEAMEDEQSDEDEEEDEDDEASET 333


>gnl|CDD|184416 PRK13955, mscL, large-conductance mechanosensitive channel;
           Provisional.
          Length = 130

 Score = 25.2 bits (55), Expect = 8.3
 Identities = 12/29 (41%), Positives = 19/29 (65%)

Query: 92  KLEEEEMEEQKEEKKEKVEEEEVKEERRE 120
           KL  ++ EE++EE  E  +EEE+  E R+
Sbjct: 92  KLTSKKEEEKEEEIPEPTKEEELLGEIRD 120


>gnl|CDD|217783 pfam03896, TRAP_alpha, Translocon-associated protein (TRAP), alpha
           subunit.  The alpha-subunit of the TRAP complex (TRAP
           alpha) is a single-spanning membrane protein of the
           endoplasmic reticulum (ER) which is found in proximity
           of nascent polypeptide chains translocating across the
           membrane.
          Length = 281

 Score = 25.5 bits (56), Expect = 8.5
 Identities = 14/34 (41%), Positives = 23/34 (67%), Gaps = 1/34 (2%)

Query: 82  EDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVK 115
           +D V+E+E+++   EE E +  E++E  EE EVK
Sbjct: 42  DDVVDEDEEDEAVVEEDENELTEEEED-EEGEVK 74


>gnl|CDD|211344 cd02867, PseudoU_synth_TruB_4, Pseudouridine synthase homolog 4.
           This group consists of Eukaryotic TruB proteins similar
           to Saccharomyces cerevisiae Pus4. S. cerevisiae Pus4,
           makes psi55 in the T loop of both cytoplasmic and
           mitochondrial tRNAs. Psi55 is almost universally
           conserved.  Pseudouridine synthases catalyze the
           isomerization of specific uridines in an RNA molecule to
           pseudouridines (5-ribosyluracil, psi).
          Length = 312

 Score = 25.5 bits (56), Expect = 8.5
 Identities = 15/43 (34%), Positives = 17/43 (39%)

Query: 75  EEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
           +E K   E        E    EE+E   EE K K E EE   E
Sbjct: 211 KELKTFAEVTELTATAEGDPVEEVEATHEESKRKSEVEEEANE 253


>gnl|CDD|191675 pfam07047, OPA3, Optic atrophy 3 protein (OPA3).  This family
           consists of several optic atrophy 3 (OPA3) proteins.
           OPA3 deficiency causes type III 3-methylglutaconic
           aciduria (MGA) in humans. This disease manifests with
           early bilateral optic atrophy, spasticity,
           extrapyramidal dysfunction, ataxia, and cognitive
           deficits, but normal longevity.
          Length = 134

 Score = 25.3 bits (56), Expect = 8.6
 Identities = 11/28 (39%), Positives = 20/28 (71%)

Query: 91  EKLEEEEMEEQKEEKKEKVEEEEVKEER 118
           E  +EEE++++ EE + +V E E++ ER
Sbjct: 103 EAKKEEELQQELEELEARVGELELEIER 130


>gnl|CDD|236995 PRK11824, PRK11824, polynucleotide phosphorylase/polyadenylase;
           Provisional.
          Length = 693

 Score = 25.8 bits (58), Expect = 8.6
 Identities = 19/60 (31%), Positives = 32/60 (53%), Gaps = 11/60 (18%)

Query: 76  EKKKRRE--DKVEEEEDEKLEEEEMEEQKEEKK----EKVEEEEV-----KEERREGGRG 124
           +K++R    D ++EE  E L  EE EE+ E++     +K+E++ V     +E  R  GR 
Sbjct: 261 DKQEREAALDAIKEEVLEALAAEEEEEEDEKEIKEAFKKLEKKIVRRRILEEGIRIDGRK 320


>gnl|CDD|191249 pfam05279, Asp-B-Hydro_N, Aspartyl beta-hydroxylase N-terminal
           region.  This family includes the N-terminal regions of
           the junctin, junctate and aspartyl beta-hydroxylase
           proteins. Junctate is an integral ER/SR membrane calcium
           binding protein, which comes from an alternatively
           spliced form of the same gene that generates aspartyl
           beta-hydroxylase and junctin. Aspartyl beta-hydroxylase
           catalyzes the post-translational hydroxylation of
           aspartic acid or asparagine residues contained within
           epidermal growth factor (EGF) domains of proteins.
          Length = 240

 Score = 25.7 bits (56), Expect = 8.6
 Identities = 10/56 (17%), Positives = 25/56 (44%)

Query: 63  RRIQEGTGCGKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEER 118
           R+     G   EE +     ++   E+ ++  +E+  EQ+ E  ++  E+  + + 
Sbjct: 146 RQETLEAGKVHEETEDSYHVEETASEQYKQDMKEKASEQENEDSKEPVEKAERTKA 201


>gnl|CDD|224849 COG1938, COG1938, Archaeal enzymes of ATP-grasp superfamily
           [General function prediction only].
          Length = 244

 Score = 25.4 bits (56), Expect = 8.7
 Identities = 16/32 (50%), Positives = 24/32 (75%), Gaps = 2/32 (6%)

Query: 91  EKLEE--EEMEEQKEEKKEKVEEEEVKEERRE 120
           +KLE+  EE+EEQ E+  E++E+EE + ER E
Sbjct: 207 DKLEKEAEEIEEQLEKLAEQLEKEEERVEREE 238


>gnl|CDD|219320 pfam07172, GRP, Glycine rich protein family.  This family of
           proteins includes several glycine rich proteins as well
           as two nodulins 16 and 24. The family also contains
           proteins that are induced in response to various
           stresses.
          Length = 91

 Score = 24.7 bits (54), Expect = 8.8
 Identities = 11/30 (36%), Positives = 16/30 (53%), Gaps = 1/30 (3%)

Query: 97  EMEEQKEEKKEKVE-EEEVKEERREGGRGG 125
           E+        EK E E EV++++  GG GG
Sbjct: 22  EVAAADLSNTEKSESENEVQDDKYGGGGGG 51


>gnl|CDD|221581 pfam12446, DUF3682, Protein of unknown function (DUF3682).  This
           domain family is found in eukaryotes, and is typically
           between 125 and 136 amino acids in length.
          Length = 133

 Score = 25.2 bits (55), Expect = 8.9
 Identities = 12/38 (31%), Positives = 22/38 (57%), Gaps = 2/38 (5%)

Query: 86  EEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGR 123
            +EE+E+ EE E ++Q +E   +V++ +  E   E G 
Sbjct: 95  RQEEEEEEEENEKQQQSDE--AQVQQHQQHEHPAENGE 130


>gnl|CDD|223327 COG0249, MutS, Mismatch repair ATPase (MutS family) [DNA
           replication, recombination, and repair].
          Length = 843

 Score = 25.7 bits (57), Expect = 9.1
 Identities = 9/43 (20%), Positives = 19/43 (44%), Gaps = 1/43 (2%)

Query: 71  CGKEEEKKKRREDKVEE-EEDEKLEEEEMEEQKEEKKEKVEEE 112
            G  EE  +R  + + E E++ +    E+ ++      KV + 
Sbjct: 779 AGLPEEVIERAREILAELEKESRSSNLELNQKDLSLFPKVLKA 821


>gnl|CDD|187751 cd09270, RNase_H2-B, Ribonuclease H2-B is a subunit of the
           eukaryotic RNase H complex which cleaves RNA-DNA
           hybrids.  Ribonuclease H2B is one of the three proteins
           of eukaryotic RNase H2 complex that is required for
           nucleic acid binding and hydrolysis. RNase H is
           classified into two families, type I (prokaryotic RNase
           HI, eukaryotic RNase H1 and viral RNase H) and type II
           (prokaryotic RNase HII and HIII, and eukaryotic RNase
           H2/HII). RNase H endonucleolytically hydrolyzes an RNA
           strand when it is annealed to a complementary DNA strand
           in the presence of divalent cations, in DNA replication
           and repair. The enzyme can be found in bacteria,
           archaea, and eukaryotes. Most prokaryotic and eukaryotic
           genomes contain multiple RNase H genes. Despite a lack
           of evidence for homology from sequence comparisons, type
           I and type II RNase H share a common fold and similar
           steric configurations of the four acidic active-site
           residues, suggesting identical or very similar catalytic
           mechanisms. Eukaryotic RNase HII is active during
           replication and is believed to play a role in removal of
           Okazaki fragment primers and single ribonucleotides in
           DNA-DNA duplexes. Eukaryotic RNase HII is functional
           when it forms a complex with RNase H2B and RNase H2C
           proteins. It is speculated that the two accessory
           subunits are required for correct folding of the
           catalytic subunit of RNase HII. Mutations in the three
           subunits of human RNase HII cause neurological disorder.
          Length = 211

 Score = 25.3 bits (56), Expect = 9.1
 Identities = 12/46 (26%), Positives = 26/46 (56%)

Query: 76  EKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREG 121
           +KK+    + E   D+K      EE+K++KK+  +++++K+    G
Sbjct: 157 KKKELDIKEAEAARDKKKSNNAEEEKKKKKKKSAKKKKLKKVAAVG 202


>gnl|CDD|237205 PRK12792, flhA, flagellar biosynthesis protein FlhA; Reviewed.
          Length = 694

 Score = 25.5 bits (56), Expect = 9.3
 Identities = 8/39 (20%), Positives = 19/39 (48%)

Query: 77  KKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVK 115
           +++      E  + ++ EE    E K+  KE++   E++
Sbjct: 325 RRRAARAAAEAAKVKREEESAQAEAKDSVKEQLRTAEIE 363


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 25.8 bits (57), Expect = 9.4
 Identities = 15/53 (28%), Positives = 28/53 (52%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGRGG 125
           + +E +++R D  E+ +  + E E +  +KEE +E++EE E      E   G 
Sbjct: 834 EIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGD 886


>gnl|CDD|221117 pfam11460, DUF3007, Protein of unknown function (DUF3007).  This is
           a family of uncharacterized proteins found in bacteria
           and eukaryotes.
          Length = 104

 Score = 25.0 bits (55), Expect = 9.4
 Identities = 5/37 (13%), Positives = 19/37 (51%), Gaps = 1/37 (2%)

Query: 76  EKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEE 112
           ++++   +  +   +E+L ++  +    E+ E ++ E
Sbjct: 66  QQRRDYREAYDALTNEEL-QKRFDSLSPEELEALQAE 101


>gnl|CDD|148511 pfam06936, Selenoprotein_S, Selenoprotein S (SelS).  This family
           consists of several mammalian selenoprotein S (SelS)
           sequences. SelS is a plasma membrane protein and is
           present in a variety of tissues and cell types. The
           function of this family is unknown.
          Length = 190

 Score = 25.2 bits (55), Expect = 9.5
 Identities = 16/86 (18%), Positives = 37/86 (43%), Gaps = 15/86 (17%)

Query: 56  WHHLSIHRRIQEGTGCGKEE-----EKKKRREDKVE-------EEED---EKLEEEEMEE 100
              L    R  +     KE      +K  RR++ +E       E+ D    K +E++ + 
Sbjct: 51  IQKLRPRWRSLQKRSSEKEITVFDPDKVVRRQEALEAARLRMQEDLDAKAAKFKEKQKQL 110

Query: 101 QKEEKKEKVEEEEVKEERREGGRGGR 126
           ++E++++K+E  +  +E +      +
Sbjct: 111 EEEKRRQKIEMWDRMQEGKSYKVSAK 136


>gnl|CDD|225664 COG3122, COG3122, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 215

 Score = 25.3 bits (55), Expect = 9.5
 Identities = 9/52 (17%), Positives = 23/52 (44%)

Query: 72  GKEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEERREGGR 123
           G    KK+ +  ++ ++      E      +E K E++E ++   E+++   
Sbjct: 53  GLVTSKKRAKVQRLAKKSRVDAREAYRIAAEENKAEQLERDKQLSEQQKQAA 104


>gnl|CDD|223403 COG0326, HtpG, Molecular chaperone, HSP90 family [Posttranslational
           modification, protein turnover, chaperones].
          Length = 623

 Score = 25.7 bits (57), Expect = 9.6
 Identities = 13/42 (30%), Positives = 23/42 (54%)

Query: 75  EEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKE 116
           E K  +   K + + +   EE+E + ++E+K+ K   E VKE
Sbjct: 472 EGKPFKSITKGDLDLELLEEEDEADSEEEKKEFKPLLERVKE 513


>gnl|CDD|227415 COG5083, SMP2, Uncharacterized protein involved in plasmid
           maintenance [General function prediction only].
          Length = 580

 Score = 25.6 bits (56), Expect = 9.7
 Identities = 10/32 (31%), Positives = 14/32 (43%), Gaps = 2/32 (6%)

Query: 59  LSIHRRIQEGTGCGKEEEKKKRREDKVEEEED 90
           L ++ R QE  G G +  K     D V +E  
Sbjct: 114 LILNEREQESLGSGSDLSKLN--FDIVLKEPK 143


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 728

 Score = 25.4 bits (56), Expect = 9.8
 Identities = 14/45 (31%), Positives = 25/45 (55%)

Query: 73  KEEEKKKRREDKVEEEEDEKLEEEEMEEQKEEKKEKVEEEEVKEE 117
           KE+++    E+  +EEE +  +      ++ EK +K EEEE  +E
Sbjct: 448 KEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDE 492


>gnl|CDD|234425 TIGR03974, rSAM_six_Cys, SCIFF radical SAM maturase.  Members of
           this protein family are predicted radical SAM enzymes
           universally associated with Six Cysteines in Forty-Five
           protein, or SCIFF (family TIGR03973), a predicted
           ribosomal natural product precursor that is nearly
           universal in the class Clostridia. Similarity of this
           family to radical SAM maturases (PqqE and subtilosin A
           maturase) found in the vicinity of other peptide
           precursors suggests this protein is the SCIFF radical
           SAM maturase [Cellular processes, Biosynthesis of
           natural products].
          Length = 451

 Score = 25.3 bits (56), Expect = 9.9
 Identities = 11/27 (40%), Positives = 18/27 (66%)

Query: 90  DEKLEEEEMEEQKEEKKEKVEEEEVKE 116
           +E  ++E  EE KE+ K K  EE+++E
Sbjct: 35  EEGKDKETPEEIKEKLKGKYSEEDIEE 61


>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin.  Nucleoplasmins are also
           known as chromatin decondensation proteins. They bind to
           core histones and transfer DNA to them in a reaction
           that requires ATP. This is thought to play a role in the
           assembly of regular nucleosomal arrays.
          Length = 146

 Score = 25.0 bits (55), Expect = 9.9
 Identities = 11/28 (39%), Positives = 20/28 (71%)

Query: 82  EDKVEEEEDEKLEEEEMEEQKEEKKEKV 109
           +D  E+EE+E  EE++ E++ EE++  V
Sbjct: 115 DDDEEDEEEEDDEEDDDEDESEEEESPV 142


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.308    0.133    0.375 

Gapped
Lambda     K      H
   0.267   0.0812    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 6,766,780
Number of extensions: 661256
Number of successful extensions: 11136
Number of sequences better than 10.0: 1
Number of HSP's gapped: 7337
Number of HSP's successfully gapped: 2320
Length of query: 126
Length of database: 10,937,602
Length adjustment: 85
Effective length of query: 41
Effective length of database: 7,167,512
Effective search space: 293867992
Effective search space used: 293867992
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 53 (24.0 bits)