RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy10650
         (446 letters)



>gnl|CDD|217215 pfam02752, Arrestin_C, Arrestin (or S-antigen), C-terminal domain. 
           Ig-like beta-sandwich fold. Scop reports duplication
           with N-terminal domain.
          Length = 136

 Score =  108 bits (271), Expect = 3e-28
 Identities = 47/136 (34%), Positives = 71/136 (52%), Gaps = 4/136 (2%)

Query: 279 SSGPVLCRVSLDRGGYVPGETIIINATVYNRSKITVKSTKAALTETIQYLARNKVLES-- 336
            SGPV   VSL + GYVPGETI I   + N+S   +K  K +L + + Y A+  +  +  
Sbjct: 1   WSGPVSLSVSLPKKGYVPGETIPITVEIDNQSSKKIKKIKISLVQQVTYKAKTPLRRTKN 60

Query: 337 ETRELASLTRGKIKPGERDDWT-NQQLYVPP-LPPTNLRGCHLIKIQYDVFFIVDPKSIE 394
           E R +A    G ++PG  D +     L +P  LPP++   C +IK+ Y +   VD     
Sbjct: 61  EERVVAKEESGGVEPGSSDKFEKELSLQIPTSLPPSSTTKCKIIKVSYKLKVTVDLSGSH 120

Query: 395 KPVKLQLPIMLATYPF 410
             ++L+LPI + T P 
Sbjct: 121 SELRLELPITIGTSPL 136


>gnl|CDD|214976 smart01017, Arrestin_C, Arrestin (or S-antigen), C-terminal domain.
            Ig-like beta-sandwich fold. Scop reports duplication
           with N-terminal domain. Arrestins comprise a family of
           closely-related proteins that includes beta-arrestin-1
           and -2, which regulate the function of beta-adrenergic
           receptors by binding to their phosphorylated forms,
           impairing their capacity to activate G(S) proteins; Cone
           photoreceptors C-arrestin (arrestin-X). which could bind
           to phosphorylated red/green opsins; and Drosophila
           phosrestins I and II, which undergo light-induced
           phosphorylation, and probably play a role in
           photoreceptor transduction.
          Length = 142

 Score =  100 bits (252), Expect = 2e-25
 Identities = 52/144 (36%), Positives = 74/144 (51%), Gaps = 14/144 (9%)

Query: 279 SSGPVLCRVSLDRGGYVPGETIIINATVYNRSKITVKSTKAALTETIQY-LARNKVLESE 337
            SGP+   VSL + GYVPGETI +   + N SK TVK  K +L +T+ Y  +   V  S 
Sbjct: 1   WSGPLSLEVSLPKKGYVPGETIPVTIKITNLSKKTVKKIKVSLVQTVTYVSSDGPVKRSL 60

Query: 338 TRE-----------LASLTRGKIKPGERDDWTNQQLYVPPLPPTNLRGCHLIKIQYDVFF 386
             +           +  L  G + PG +D +   QL VPPLPPT+ R C LIK++Y +  
Sbjct: 61  AEKSKEKKADRKTLVKELDGGPVLPGNKDKFE-GQLKVPPLPPTS-RTCRLIKVEYKLKV 118

Query: 387 IVDPKSIEKPVKLQLPIMLATYPF 410
            +        ++L+LPI + T P 
Sbjct: 119 KLRLSGKHSELRLELPITIGTVPL 142


>gnl|CDD|215866 pfam00339, Arrestin_N, Arrestin (or S-antigen), N-terminal domain. 
           Ig-like beta-sandwich fold. Scop reports duplication
           with C-terminal domain.
          Length = 148

 Score = 63.1 bits (154), Expect = 5e-12
 Identities = 23/60 (38%), Positives = 33/60 (55%)

Query: 117 LSPGIHSFPFKLGLPLGLPSTFLGKHGWVQYFCKAALREPTGYTHKNQQVFIIMSPIDLN 176
           L  G H+FPF   LP   P +F G+ G ++Y  K  L  P    HK ++VF ++  +DLN
Sbjct: 89  LPAGTHAFPFSFELPPNCPPSFEGQPGGIRYEVKVELDRPWKKDHKRKKVFTVIRKLDLN 148


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 46.6 bits (111), Expect = 4e-06
 Identities = 20/54 (37%), Positives = 31/54 (57%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
           EE E  +KE     + K ++ + +KKK K K KK  KK    E++ E+E ++KL
Sbjct: 69  EEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKL 122



 Score = 35.4 bits (82), Expect = 0.024
 Identities = 19/47 (40%), Positives = 30/47 (63%), Gaps = 10/47 (21%)

Query: 23  KEEETE----------EEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
           K++E            EEK+K K KKKK KKKK  ++++K+++KD+K
Sbjct: 63  KKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDK 109



 Score = 31.6 bits (72), Expect = 0.47
 Identities = 15/33 (45%), Positives = 23/33 (69%)

Query: 24 EEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
           E TE +KKKK+  ++ +K KK YEE++K + K
Sbjct: 55 AEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWK 87



 Score = 31.6 bits (72), Expect = 0.55
 Identities = 11/64 (17%), Positives = 33/64 (51%), Gaps = 6/64 (9%)

Query: 2   NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
            +K  ++K+D++K+       K+++ +++ +KK +K+ + K +   +   +     ++L 
Sbjct: 87  KKKKSKKKKDKDKD------KKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTLSELK 140

Query: 62  HNCY 65
              Y
Sbjct: 141 PRKY 144


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 45.9 bits (109), Expect = 3e-05
 Identities = 22/54 (40%), Positives = 34/54 (62%)

Query: 3  EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
          E+  EEKE++++E    +  +EE  EEE+K++KKKK KK K+   E E   + K
Sbjct: 38 EEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELLNKTK 91



 Score = 44.8 bits (106), Expect = 7e-05
 Identities = 18/50 (36%), Positives = 29/50 (58%)

Query: 1  MNEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEE 50
            E+  E++E +E+E     + +E + EEEK++KKKK KK K+     E 
Sbjct: 37 DEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWEL 86



 Score = 43.2 bits (102), Expect = 2e-04
 Identities = 20/53 (37%), Positives = 33/53 (62%)

Query: 7  EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
          +E+E++EKE       K  + EEE  ++++K++KKKK K+ +E   E E  NK
Sbjct: 37 DEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELLNK 89



 Score = 34.7 bits (80), Expect = 0.091
 Identities = 15/35 (42%), Positives = 25/35 (71%)

Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
          KE   EEE+++K++KK++++K    EEE  EEE+ 
Sbjct: 33 KEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEK 67



 Score = 34.4 bits (79), Expect = 0.13
 Identities = 15/40 (37%), Positives = 30/40 (75%), Gaps = 3/40 (7%)

Query: 23 KEEETE---EEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
          KE E E   EE++++K++KK++++K   +EEE +EE++ +
Sbjct: 29 KEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKE 68


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 45.1 bits (107), Expect = 5e-05
 Identities = 19/59 (32%), Positives = 28/59 (47%), Gaps = 1/59 (1%)

Query: 6   GEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKE-EEKDNKLLHN 63
           GEE+ D E    G     E   +  + K+K K ++ K+K+R E E +  EEK  K    
Sbjct: 249 GEEESDDESAWEGFESEYEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKKLA 307



 Score = 28.5 bits (64), Expect = 6.8
 Identities = 16/53 (30%), Positives = 28/53 (52%), Gaps = 6/53 (11%)

Query: 3  EKGGEEKEDQEKEVGGISRGKEEE------TEEEKKKKKKKKKKKKKKKRYEE 49
          E+  EE+ D+    G ++   +E+      T +E   KKK++KK KK  + +E
Sbjct: 18 EQFLEEQRDERITGGVLAEKPDEDLFVVDTTGDEATIKKKRRKKLKKPLKSDE 70


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 44.1 bits (104), Expect = 1e-04
 Identities = 19/56 (33%), Positives = 32/56 (57%), Gaps = 4/56 (7%)

Query: 4   KGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
           + G  K    K        KE + E  K+++K+K++ K++KK+ +E+ KEE KD K
Sbjct: 81  EKGGSKGPAAKT----KPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRK 132



 Score = 44.1 bits (104), Expect = 1e-04
 Identities = 16/56 (28%), Positives = 36/56 (64%)

Query: 2   NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
           + K  E++++Q KE     + K +E  +++K K++ K+K+  K++ +E+EK+ E+ 
Sbjct: 102 SGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEP 157



 Score = 40.3 bits (94), Expect = 0.002
 Identities = 19/55 (34%), Positives = 36/55 (65%)

Query: 2   NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
           NE G EE++++E+      + KE+  EE K +K K++ K+K+  + +E+EKE++ 
Sbjct: 100 NESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKV 154



 Score = 39.1 bits (91), Expect = 0.004
 Identities = 17/55 (30%), Positives = 38/55 (69%)

Query: 2   NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
             + G+E+E ++++V    + K+E+ +EE K +K K++ K+K+   E+E+++E+K
Sbjct: 99  KNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKK 153



 Score = 38.7 bits (90), Expect = 0.005
 Identities = 14/55 (25%), Positives = 31/55 (56%)

Query: 4   KGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDN 58
           K  +EK  +E +        +E+   ++K+K+K+KK ++ + R EE+++E  +  
Sbjct: 118 KKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAK 172



 Score = 38.3 bits (89), Expect = 0.006
 Identities = 20/58 (34%), Positives = 33/58 (56%), Gaps = 3/58 (5%)

Query: 2   NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
                + K  +E +      GKEEE E+E+ K++KKKKK+K K+  ++ + +EE   K
Sbjct: 86  KGPAAKTKPAKEPKNES---GKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEK 140



 Score = 37.6 bits (87), Expect = 0.014
 Identities = 18/55 (32%), Positives = 27/55 (49%), Gaps = 3/55 (5%)

Query: 2   NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
            EK  EE  D+E+E     R +       KK  KKK   KKK+   EE++++  +
Sbjct: 150 KEKKVEEPRDREEEK---KRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAR 201



 Score = 37.2 bits (86), Expect = 0.016
 Identities = 16/64 (25%), Positives = 36/64 (56%), Gaps = 7/64 (10%)

Query: 3   EKGGEEKEDQEKEVGGIS-------RGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEE 55
           E+  +E+  +EK+            R  +EE +E++  K+K+K+K+KK +   + E+E++
Sbjct: 106 EEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKK 165

Query: 56  KDNK 59
           ++  
Sbjct: 166 RERV 169



 Score = 36.8 bits (85), Expect = 0.023
 Identities = 18/58 (31%), Positives = 30/58 (51%)

Query: 2   NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
             K    KE + +      + KE+  EE+KKKK+K K++ K +K  EE +++     K
Sbjct: 89  AAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEK 146



 Score = 36.4 bits (84), Expect = 0.025
 Identities = 16/53 (30%), Positives = 34/53 (64%), Gaps = 1/53 (1%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
           E+++++EK+V    R +EEE + E+ + K + KK  KKK   ++++  E++ +
Sbjct: 145 EKEKEKEKKVEE-PRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQ 196



 Score = 36.0 bits (83), Expect = 0.038
 Identities = 11/50 (22%), Positives = 27/50 (54%)

Query: 8   EKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
            +  +  +    ++ KE   EE++++  ++  K K ++    EE+E+E+D
Sbjct: 173 SRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREKEED 222



 Score = 35.6 bits (82), Expect = 0.044
 Identities = 18/58 (31%), Positives = 36/58 (62%)

Query: 2   NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
            E   E  +++EKE   +   K+++ E+ K++ K +K K++ K++   +EKE+EK+ K
Sbjct: 96  KEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKK 153



 Score = 35.2 bits (81), Expect = 0.066
 Identities = 19/56 (33%), Positives = 36/56 (64%), Gaps = 1/56 (1%)

Query: 2   NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKK-RYEEEEKEEEK 56
            ++  ++K+++ KE     + KEE  E+   K+K+K+K+KK ++ R  EEEK+ E+
Sbjct: 113 VKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRER 168



 Score = 34.5 bits (79), Expect = 0.13
 Identities = 21/56 (37%), Positives = 33/56 (58%)

Query: 2   NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
             KG   K    KE    S  +EE+ +E+ K++KKKKK+K K++  + + KEE K+
Sbjct: 84  GSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKE 139



 Score = 34.1 bits (78), Expect = 0.15
 Identities = 16/57 (28%), Positives = 28/57 (49%)

Query: 3   EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
           ++  +EKE + +E       K+ E    K + KK  KKK   K+ E  E+E+++   
Sbjct: 144 KEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAA 200



 Score = 33.3 bits (76), Expect = 0.27
 Identities = 13/67 (19%), Positives = 30/67 (44%), Gaps = 10/67 (14%)

Query: 3   EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKK----------KKKRYEEEEK 52
           +K  E+ +++ K+       KE+   +EK+K+K+KK ++           + +     +K
Sbjct: 118 KKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKK 177

Query: 53  EEEKDNK 59
             +K   
Sbjct: 178 PPKKKPP 184



 Score = 32.2 bits (73), Expect = 0.56
 Identities = 14/53 (26%), Positives = 27/53 (50%)

Query: 4   KGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
           K  E K   ++ V  + +G  +    + K  K+ K +  K++  E+E+ +EEK
Sbjct: 65  KCAESKLSSDEAVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEK 117



 Score = 28.7 bits (64), Expect = 8.1
 Identities = 14/55 (25%), Positives = 32/55 (58%)

Query: 3   EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
           E+  E++  +EKE     + +E    EE+KK+++ + K + KK  +++   ++K+
Sbjct: 135 EEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKE 189


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 43.8 bits (104), Expect = 1e-04
 Identities = 22/53 (41%), Positives = 40/53 (75%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
            E+E +EK+    +  K+EE EEE+K+KK+++K++++++  EE+E+EEEK  K
Sbjct: 423 REEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475



 Score = 43.0 bits (102), Expect = 2e-04
 Identities = 18/53 (33%), Positives = 41/53 (77%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
           + +E+++++      GK++E EEE++K+KK+++K+++++  EEE++EEE+  K
Sbjct: 422 KREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKK 474



 Score = 42.2 bits (100), Expect = 4e-04
 Identities = 19/37 (51%), Positives = 26/37 (70%)

Query: 21  RGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
             K E+  EE+KK+KKKK    KKK  EEEE++E+K+
Sbjct: 416 VEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKE 452



 Score = 41.8 bits (99), Expect = 6e-04
 Identities = 22/47 (46%), Positives = 32/47 (68%)

Query: 13  EKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
            K++  I    E++ EEEKK+KKKK    KKK+  EEEEKE++++ K
Sbjct: 409 TKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEK 455



 Score = 39.9 bits (94), Expect = 0.002
 Identities = 20/58 (34%), Positives = 37/58 (63%), Gaps = 4/58 (6%)

Query: 1   MNEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDN 58
             E+  +EK+ +          KEEE EEEK+KK+++K++++++   E+EE+EE+K  
Sbjct: 422 KREEEKKEKKKKAFAGKK----KEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475



 Score = 39.5 bits (93), Expect = 0.003
 Identities = 16/37 (43%), Positives = 27/37 (72%)

Query: 23  KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
           K+ E E+++KKKK    KKK+++  EE+EK+EE+  +
Sbjct: 421 KKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEE 457



 Score = 39.1 bits (92), Expect = 0.004
 Identities = 15/57 (26%), Positives = 40/57 (70%), Gaps = 3/57 (5%)

Query: 3   EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
           EK  +++E+++KE     +      ++E++++++K+KK+++K+  EEE +EE+++ +
Sbjct: 417 EKAEKKREEEKKE---KKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEE 470



 Score = 39.1 bits (92), Expect = 0.004
 Identities = 20/55 (36%), Positives = 36/55 (65%)

Query: 5   GGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
           G ++   + K++   +  K EE ++EKKKK    KKK++++  E+E+KEEEK+ +
Sbjct: 404 GSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEE 458



 Score = 38.0 bits (89), Expect = 0.010
 Identities = 17/57 (29%), Positives = 37/57 (64%)

Query: 3   EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
           +K  +  E  EK+     + K+++    KKK+++++++K+KK+  +EEE+EE ++ K
Sbjct: 410 KKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEK 466



 Score = 36.8 bits (86), Expect = 0.023
 Identities = 14/53 (26%), Positives = 37/53 (69%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
            EK ++++E     + K+    ++K+++++++K+KK++++ EEEE+ EE+  +
Sbjct: 416 VEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEE 468



 Score = 36.4 bits (85), Expect = 0.030
 Identities = 19/59 (32%), Positives = 43/59 (72%)

Query: 3   EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
           +K  EEK++++K+     + +EEE EE++KK+++K++++++ +  +EEE+E++K    L
Sbjct: 421 KKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKKQATL 479



 Score = 34.5 bits (80), Expect = 0.10
 Identities = 14/30 (46%), Positives = 21/30 (70%)

Query: 27  TEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
           T  +K  KK KK  +K +K+ EEE+KE++K
Sbjct: 403 TGSKKATKKIKKIVEKAEKKREEEKKEKKK 432



 Score = 33.7 bits (78), Expect = 0.19
 Identities = 11/50 (22%), Positives = 32/50 (64%)

Query: 10  EDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
           E++ + + G  +  ++  +  +K +KK++++KK+KK+     K++E++ +
Sbjct: 396 EEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEE 445



 Score = 29.1 bits (66), Expect = 5.5
 Identities = 13/48 (27%), Positives = 21/48 (43%)

Query: 9   KEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
            +   +    ++   E   EE +     KK  KK KK  E+ EK+ E+
Sbjct: 378 FKHNPELAARLAAFLELTEEEIEFLTGSKKATKKIKKIVEKAEKKREE 425


>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 142

 Score = 40.4 bits (95), Expect = 3e-04
 Identities = 23/61 (37%), Positives = 41/61 (67%), Gaps = 5/61 (8%)

Query: 1   MNEKGGEEKEDQEKEVGGISRGKEEETEEEK--KKKKKKKKKKKKKKRYEEEEKEEEKDN 58
           M+EK  +E ED+E +     R +++  +EEK  KK+ K++KKK+KKK+ ++ +K  +K+ 
Sbjct: 55  MDEKWKKETEDEEFQ---QKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEE 111

Query: 59  K 59
           K
Sbjct: 112 K 112



 Score = 30.4 bits (69), Expect = 0.84
 Identities = 14/42 (33%), Positives = 26/42 (61%), Gaps = 2/42 (4%)

Query: 20 SRGKEEE--TEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
           R +E E     ++K KK+ + ++ ++KR E++ K+EEK  K
Sbjct: 44 LRRREYERLELMDEKWKKETEDEEFQQKREEKKRKDEEKTAK 85


>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
           RPA34.5.  This is a family of proteins conserved from
           yeasts to human. Subunit A34.5 of RNA polymerase I is a
           non-essential subunit which is thought to help Pol I
           overcome topological constraints imposed on ribosomal
           DNA during the process of transcription.
          Length = 193

 Score = 41.2 bits (97), Expect = 3e-04
 Identities = 21/57 (36%), Positives = 31/57 (54%)

Query: 3   EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
           E    EKE   K        +EE+ E++KKK+ KK+KK+KK K+ +  E +  K  K
Sbjct: 131 ESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKK 187



 Score = 40.9 bits (96), Expect = 4e-04
 Identities = 18/49 (36%), Positives = 33/49 (67%)

Query: 5   GGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKE 53
           G E +  +++    + +  E E EE+K+KKKKK+ KK+KK++ +++EK 
Sbjct: 129 GSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKM 177



 Score = 36.2 bits (84), Expect = 0.017
 Identities = 14/57 (24%), Positives = 32/57 (56%)

Query: 6   GEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLH 62
             E   + +     +  K E+  E ++++KK+KKKKK+ K+ ++E+K++++      
Sbjct: 125 PSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPK 181



 Score = 35.8 bits (83), Expect = 0.022
 Identities = 17/55 (30%), Positives = 32/55 (58%)

Query: 3   EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
             G   +   E E        + E E E ++++KK+KKKKK+ + E++EK+++K+
Sbjct: 121 PDGPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKE 175



 Score = 35.8 bits (83), Expect = 0.024
 Identities = 14/38 (36%), Positives = 23/38 (60%)

Query: 8   EKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKK 45
           EK+ +++        K+++ +  + K  KKKKKKKKKK
Sbjct: 156 EKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKKK 193



 Score = 32.8 bits (75), Expect = 0.19
 Identities = 16/55 (29%), Positives = 34/55 (61%)

Query: 5   GGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
           G    +    E+G  S   E+ET  + +K+ + ++++KK+K+ ++E K+E+K+ K
Sbjct: 117 GYGAPDGPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKK 171



 Score = 32.4 bits (74), Expect = 0.33
 Identities = 14/62 (22%), Positives = 30/62 (48%), Gaps = 3/62 (4%)

Query: 6   GEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKK---KKRYEEEEKEEEKDNKLLH 62
           G   E   +          +  +E + ++++KK+KKKK   KK  +E++ ++EK  +   
Sbjct: 123 GPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKG 182

Query: 63  NC 64
           + 
Sbjct: 183 SK 184



 Score = 31.2 bits (71), Expect = 0.70
 Identities = 18/49 (36%), Positives = 32/49 (65%), Gaps = 8/49 (16%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKK---KKKKKKKRYEEEEK 52
           EE+E +EK+     + KE + E+++KK KK+K    K  KKK+ ++++K
Sbjct: 150 EEEEKKEKK-----KKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKKK 193


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
          subunit.  This is a family of proteins which are
          subunits of the eukaryotic translation initiation
          factor 3 (eIF3). In yeast it is called Hcr1. The
          Saccharomyces cerevisiae protein eIF3j (HCR1) has been
          shown to be required for processing of 20S pre-rRNA and
          binds to 18S rRNA and eIF3 subunits Rpg1p and Prt1p.
          Length = 242

 Score = 40.8 bits (96), Expect = 6e-04
 Identities = 14/54 (25%), Positives = 29/54 (53%), Gaps = 6/54 (11%)

Query: 7  EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
          +E+ED+EKE       ++ +   + K KK  K K ++K++ + E++E+      
Sbjct: 38 DEEEDEEKE------EEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELE 85



 Score = 40.0 bits (94), Expect = 0.001
 Identities = 19/56 (33%), Positives = 30/56 (53%)

Query: 6  GEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
           E+++D  K+       +E+E E+ K   K K KK  K K  E+E+ + EK+ K L
Sbjct: 26 DEDEDDDVKDSWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGL 81



 Score = 35.8 bits (83), Expect = 0.031
 Identities = 15/52 (28%), Positives = 27/52 (51%), Gaps = 7/52 (13%)

Query: 7  EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDN 58
          +E++++EK        K     + KK  K K ++K+K KR +EE+   E + 
Sbjct: 42 DEEKEEEKA-------KVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEE 86



 Score = 34.2 bits (79), Expect = 0.080
 Identities = 16/58 (27%), Positives = 34/58 (58%), Gaps = 3/58 (5%)

Query: 7  EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEE---KEEEKDNKLL 61
          E++E +E++    ++ K ++  + K ++K+K K++K++K   E E    E+E   KL 
Sbjct: 41 EDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKLR 98



 Score = 34.2 bits (79), Expect = 0.10
 Identities = 14/54 (25%), Positives = 27/54 (50%)

Query: 3  EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
          E+  EEK     +       K +  E+EK K++K++K  ++ +    E++  EK
Sbjct: 43 EEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEK 96



 Score = 31.6 bits (72), Expect = 0.62
 Identities = 10/23 (43%), Positives = 12/23 (52%)

Query: 23  KEEETEEEKKKKKKKKKKKKKKK 45
            E+   E+  K  KKKK K K K
Sbjct: 192 NEKLKAEKAAKGGKKKKGKAKAK 214



 Score = 31.2 bits (71), Expect = 0.98
 Identities = 9/18 (50%), Positives = 10/18 (55%)

Query: 29  EEKKKKKKKKKKKKKKKR 46
            EK  K  KKKK K K +
Sbjct: 197 AEKAAKGGKKKKGKAKAK 214


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 41.1 bits (96), Expect = 9e-04
 Identities = 25/73 (34%), Positives = 34/73 (46%), Gaps = 21/73 (28%)

Query: 6   GEEKEDQEKEVGGISRGK---------------------EEETEEEKKKKKKKKKKKKKK 44
           G+E E+ E E GG  RGK                      E+ +EEK KKKKKK  K KK
Sbjct: 194 GDEDEETEGEKGGGGRGKDLKIKDLEGDDEDDGDESDKGGEDGDEEKSKKKKKKLAKNKK 253

Query: 45  KRYEEEEKEEEKD 57
           K  ++++ +   D
Sbjct: 254 KLDDDKKGKRGGD 266



 Score = 38.4 bits (89), Expect = 0.006
 Identities = 15/56 (26%), Positives = 24/56 (42%), Gaps = 2/56 (3%)

Query: 3   EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDN 58
            K   E+++  +E        EEE    KK KK KK K KK    +++    +  +
Sbjct: 310 AKPEIEQDEDSEESE--EEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSGDDSD 363



 Score = 38.4 bits (89), Expect = 0.007
 Identities = 19/55 (34%), Positives = 25/55 (45%), Gaps = 4/55 (7%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEK----KKKKKKKKKKKKKKRYEEEEKEEEKD 57
           E     E E    S   EEE  EE+    KK KK KK K KK   ++++ +   D
Sbjct: 307 EIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSGDD 361



 Score = 35.7 bits (82), Expect = 0.055
 Identities = 13/52 (25%), Positives = 23/52 (44%), Gaps = 6/52 (11%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDN 58
           EE E+++ E  G        +++ KK KK K KK    K   +   + +  +
Sbjct: 321 EESEEEKNEEEG------GLSKKGKKLKKLKGKKNGLDKDDSDSGDDSDDSD 366



 Score = 31.1 bits (70), Expect = 1.2
 Identities = 15/62 (24%), Positives = 29/62 (46%), Gaps = 3/62 (4%)

Query: 1   MNEKGGEEKEDQEKEVGGISRGK---EEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
            +E   ++ +D+ +E   IS       +  E E K   +   K + ++  + EE EEEK+
Sbjct: 269 ADEYDSDDGDDEGREEDYISDSSASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKN 328

Query: 58  NK 59
            +
Sbjct: 329 EE 330



 Score = 29.9 bits (67), Expect = 2.8
 Identities = 17/62 (27%), Positives = 26/62 (41%), Gaps = 7/62 (11%)

Query: 2   NEKGGEEK----EDQEKEVGGISRGKEEETEEEKKKKKKK--KKKKKKKKRYEEEEKEEE 55
           N+    E     E   K          EE+EEEK +++    KK KK KK   ++   ++
Sbjct: 295 NDPEEREDKLSPEIPAKPEIEQD-EDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDK 353

Query: 56  KD 57
            D
Sbjct: 354 DD 355


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 41.1 bits (97), Expect = 0.001
 Identities = 19/50 (38%), Positives = 27/50 (54%), Gaps = 7/50 (14%)

Query: 8   EKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
           E E++E E G +S         E+KK +KK++K +KK   EE EK   K 
Sbjct: 395 EGEEEEGENGNLS-------PAERKKLRKKQRKAEKKAEKEEAEKAAAKK 437



 Score = 33.8 bits (78), Expect = 0.19
 Identities = 16/45 (35%), Positives = 22/45 (48%), Gaps = 5/45 (11%)

Query: 23  KEEETEEEKKKKKKKKKK--KKKKKRYEEEEKEEEKD---NKLLH 62
           K+ E EE +K   KKK +   KK K  + E K+ + D    KL  
Sbjct: 423 KKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDPDPLGEKLAR 467



 Score = 31.1 bits (71), Expect = 1.4
 Identities = 11/35 (31%), Positives = 18/35 (51%)

Query: 21  RGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEE 55
           + ++EE E+   KKK +   KK K    E +K + 
Sbjct: 424 KAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDP 458



 Score = 29.9 bits (68), Expect = 2.7
 Identities = 14/40 (35%), Positives = 23/40 (57%), Gaps = 1/40 (2%)

Query: 22  GKEEETEEEKKKKKKKKK-KKKKKKRYEEEEKEEEKDNKL 60
           G+EEE E       ++KK +KK++K  ++ EKEE +    
Sbjct: 396 GEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAA 435


>gnl|CDD|219901 pfam08555, DUF1754, Eukaryotic family of unknown function
          (DUF1754).  This is a eukaryotic protein family of
          unknown function.
          Length = 90

 Score = 37.8 bits (88), Expect = 0.001
 Identities = 17/29 (58%), Positives = 21/29 (72%)

Query: 31 KKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
          KKKKKKKKKK K K+    E++EEEK + 
Sbjct: 20 KKKKKKKKKKNKSKEEVVTEKEEEEKSSA 48



 Score = 35.1 bits (81), Expect = 0.009
 Identities = 17/27 (62%), Positives = 19/27 (70%)

Query: 30 EKKKKKKKKKKKKKKKRYEEEEKEEEK 56
          + KKKKKKKKKK K K     EKEEE+
Sbjct: 18 DVKKKKKKKKKKNKSKEEVVTEKEEEE 44



 Score = 33.2 bits (76), Expect = 0.041
 Identities = 12/29 (41%), Positives = 18/29 (62%)

Query: 31 KKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
           KK   KKKKKKKKK+ + +E+   +  +
Sbjct: 14 GKKIDVKKKKKKKKKKNKSKEEVVTEKEE 42



 Score = 32.8 bits (75), Expect = 0.057
 Identities = 12/29 (41%), Positives = 18/29 (62%)

Query: 31 KKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
          K KK   KKKKKKKK+  + ++E   + +
Sbjct: 13 KGKKIDVKKKKKKKKKKNKSKEEVVTEKE 41



 Score = 31.6 bits (72), Expect = 0.15
 Identities = 14/29 (48%), Positives = 20/29 (68%)

Query: 31 KKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
          K   KKKKKKKKKK + +EE   E+++ +
Sbjct: 16 KIDVKKKKKKKKKKNKSKEEVVTEKEEEE 44



 Score = 29.7 bits (67), Expect = 0.84
 Identities = 15/39 (38%), Positives = 27/39 (69%)

Query: 22 GKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
          GK+ + +++KKKKKKK K K++    +EEE++   ++ L
Sbjct: 14 GKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDL 52



 Score = 28.5 bits (64), Expect = 1.9
 Identities = 12/30 (40%), Positives = 18/30 (60%)

Query: 31 KKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
          K K KK   KKKKKK+ ++ + +EE   + 
Sbjct: 11 KLKGKKIDVKKKKKKKKKKNKSKEEVVTEK 40


>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355).  This
          family of proteins is found in bacteria and viruses.
          Proteins in this family are typically between 180 and
          214 amino acids in length.
          Length = 125

 Score = 38.0 bits (89), Expect = 0.002
 Identities = 17/55 (30%), Positives = 28/55 (50%), Gaps = 3/55 (5%)

Query: 3  EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
          E   E+     KEV       +E+ + EKK+++KK + +K  K   EE+ E E +
Sbjct: 1  EPEEEKTFTD-KEVDKAI--AKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELE 52



 Score = 33.0 bits (76), Expect = 0.079
 Identities = 12/34 (35%), Positives = 20/34 (58%)

Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
          K+EE + E +K  K   ++K +   E+ EKE E+
Sbjct: 27 KQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEE 60



 Score = 33.0 bits (76), Expect = 0.097
 Identities = 12/37 (32%), Positives = 22/37 (59%)

Query: 24 EEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
          E++ EE+K + +K  K   ++K   E EK E++  +L
Sbjct: 25 EKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEEL 61



 Score = 33.0 bits (76), Expect = 0.097
 Identities = 14/58 (24%), Positives = 27/58 (46%), Gaps = 2/58 (3%)

Query: 2  NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
           EK   EK+ +EK+         + + EEK + + +K +K+ ++   E  + E K   
Sbjct: 19 KEKAKWEKKQEEKKSEAEKL--AKMSAEEKAEYELEKLEKELEELEAELARRELKAEA 74



 Score = 31.5 bits (72), Expect = 0.36
 Identities = 14/39 (35%), Positives = 25/39 (64%), Gaps = 1/39 (2%)

Query: 23 KEEETE-EEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
          +EE+T  +++  K   K+K K +K+ EE++ E EK  K+
Sbjct: 3  EEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKM 41



 Score = 28.4 bits (64), Expect = 3.6
 Identities = 10/35 (28%), Positives = 20/35 (57%), Gaps = 2/35 (5%)

Query: 27 TEEEKKK--KKKKKKKKKKKKRYEEEEKEEEKDNK 59
          T++E  K   K+K K +KK++  + E ++  K + 
Sbjct: 9  TDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSA 43



 Score = 28.0 bits (63), Expect = 4.4
 Identities = 11/35 (31%), Positives = 18/35 (51%)

Query: 25 EETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
          E  EE+    K+  K   K+K   E+++EE+K   
Sbjct: 1  EPEEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEA 35


>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
           This is a family of fungal proteins whose function is
           unknown.
          Length = 130

 Score = 37.3 bits (87), Expect = 0.004
 Identities = 12/32 (37%), Positives = 22/32 (68%)

Query: 21  RGKEEETEEEKKKKKKKKKKKKKKKRYEEEEK 52
           R +  +   EK+K+ +K ++KK K+R +E+EK
Sbjct: 99  RRERTKERAEKEKRTRKNREKKFKRRQKEKEK 130



 Score = 31.9 bits (73), Expect = 0.24
 Identities = 17/44 (38%), Positives = 24/44 (54%), Gaps = 5/44 (11%)

Query: 23  KEEETEEEKKKKKKKKKKKK---KKKRYEE--EEKEEEKDNKLL 61
           KE   E+EKKKK+K+  KK+    + R E   E  E+EK  +  
Sbjct: 73  KELLREKEKKKKRKRPGKKRRIALRLRRERTKERAEKEKRTRKN 116



 Score = 31.1 bits (71), Expect = 0.47
 Identities = 10/39 (25%), Positives = 20/39 (51%)

Query: 18  GISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
            + +    E E++KK+K+  KK++   +   E  KE  +
Sbjct: 70  KVEKELLREKEKKKKRKRPGKKRRIALRLRRERTKERAE 108


>gnl|CDD|218482 pfam05178, Kri1, KRI1-like family.  The yeast member of this
          family (Kri1p) is found to be required for 40S ribosome
          biogenesis in the nucleolus.
          Length = 99

 Score = 36.5 bits (85), Expect = 0.004
 Identities = 13/29 (44%), Positives = 20/29 (68%)

Query: 24 EEETEEEKKKKKKKKKKKKKKKRYEEEEK 52
          +E  EEEK +++++ K+ K  KR E EEK
Sbjct: 1  KERKEEEKAQREEELKRLKNLKREEIEEK 29



 Score = 28.0 bits (63), Expect = 3.0
 Identities = 7/23 (30%), Positives = 17/23 (73%), Gaps = 1/23 (4%)

Query: 23 KEEETEEEKKKKKKKKKKKKKKK 45
          KEEE + +++++ K+ K  K+++
Sbjct: 4  KEEE-KAQREEELKRLKNLKREE 25



 Score = 26.8 bits (60), Expect = 8.7
 Identities = 10/38 (26%), Positives = 22/38 (57%), Gaps = 7/38 (18%)

Query: 7  EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKK 44
          E KE+++ +       +EEE +  K  K+++ ++K +K
Sbjct: 2  ERKEEEKAQ-------REEELKRLKNLKREEIEEKLEK 32


>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain.  This is a family of proteins of
          approximately 300 residues, found in plants and
          vertebrates. They contain a highly conserved DDRGK
          motif.
          Length = 189

 Score = 38.1 bits (89), Expect = 0.004
 Identities = 18/58 (31%), Positives = 36/58 (62%), Gaps = 2/58 (3%)

Query: 1  MNEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDN 58
          +  K  +  + +EK+     R  EEE  EE+KK ++K++ ++K++   EEE+E++K+ 
Sbjct: 3  IGAK--KRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEE 58



 Score = 36.6 bits (85), Expect = 0.010
 Identities = 16/56 (28%), Positives = 39/56 (69%)

Query: 2  NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
           ++  EE+E +E++     R  E + EEE +++++KKK+++++K  EE+ ++E+++
Sbjct: 19 QQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQEE 74



 Score = 36.2 bits (84), Expect = 0.017
 Identities = 19/50 (38%), Positives = 34/50 (68%)

Query: 7  EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
          E K+ +EK  G     +E E E EKKK+++++K+++++ R E+EE E+ K
Sbjct: 30 ERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQEEYEKLK 79



 Score = 32.7 bits (75), Expect = 0.24
 Identities = 12/50 (24%), Positives = 31/50 (62%)

Query: 7  EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
          ++ E++ +         EEE E++K+++++K+++++ +K  EE EK +  
Sbjct: 32 KKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQEEYEKLKSS 81



 Score = 27.7 bits (62), Expect = 8.5
 Identities = 16/61 (26%), Positives = 34/61 (55%), Gaps = 4/61 (6%)

Query: 1  MNEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
            E   +E+E+ E+E       K+EE E ++++++ +K++++ +K       EEE  +KL
Sbjct: 37 KREGERKEEEELEEE----REKKKEEEERKEREEQARKEQEEYEKLKSSFVVEEEGTDKL 92

Query: 61 L 61
           
Sbjct: 93 S 93


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 39.4 bits (91), Expect = 0.005
 Identities = 19/60 (31%), Positives = 35/60 (58%)

Query: 3    EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLH 62
            +K  E K+ + +E       K+ E E + K ++ KK+ ++ KK+ EE +K+EE+  K+ H
Sbjct: 1702 KKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAH 1761



 Score = 39.0 bits (90), Expect = 0.006
 Identities = 18/57 (31%), Positives = 31/57 (54%)

Query: 3    EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
            +   E   D+ +     +   E++ EE KKK    KKK ++KK+ +E +K+ E+D K
Sbjct: 1349 KAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKK 1405



 Score = 37.4 bits (86), Expect = 0.016
 Identities = 17/57 (29%), Positives = 31/57 (54%)

Query: 3    EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
            +K  + K+ + +E       K+ E E + K  ++ KK ++ KK+ EE +K EE + K
Sbjct: 1633 KKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKK 1689



 Score = 36.3 bits (83), Expect = 0.039
 Identities = 11/50 (22%), Positives = 24/50 (48%)

Query: 7    EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
            ++K D  K+    ++   E  + E +    + +  ++K    E++KEE K
Sbjct: 1328 KKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAK 1377



 Score = 36.3 bits (83), Expect = 0.041
 Identities = 17/56 (30%), Positives = 30/56 (53%)

Query: 4    KGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
            K  EE +  E++       K+ E +E+K  +  KK+ ++ KK  E ++KE E+  K
Sbjct: 1662 KAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKK 1717



 Score = 35.9 bits (82), Expect = 0.047
 Identities = 19/53 (35%), Positives = 34/53 (64%)

Query: 7    EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
            EE +  E+E   + + K++E EE+KK ++ KK +++ K +  EE K+ E+D K
Sbjct: 1623 EELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKK 1675



 Score = 35.9 bits (82), Expect = 0.051
 Identities = 20/59 (33%), Positives = 31/59 (52%), Gaps = 2/59 (3%)

Query: 3    EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKK--KKKKKKKKKRYEEEEKEEEKDNK 59
            EK   E+  + +E   I   +E +  EE KKK +  KK ++ +KK  E  +KE E+  K
Sbjct: 1645 EKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKK 1703



 Score = 35.9 bits (82), Expect = 0.056
 Identities = 10/52 (19%), Positives = 26/52 (50%)

Query: 8    EKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
            +K+ +E +    +   E E   ++ +  ++K +  +KK+ E ++K +    K
Sbjct: 1335 KKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKK 1386



 Score = 35.5 bits (81), Expect = 0.060
 Identities = 19/57 (33%), Positives = 34/57 (59%)

Query: 3    EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
            +K  E  + + +E       K++E EE+KK ++ KK +++ K + EE +KE E+D K
Sbjct: 1688 KKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKK 1744



 Score = 35.5 bits (81), Expect = 0.064
 Identities = 19/56 (33%), Positives = 29/56 (51%), Gaps = 2/56 (3%)

Query: 3    EKGGEE--KEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
            +K  EE  K  +  +    +   E E  EEK +  +KKK++ KKK    ++K EEK
Sbjct: 1335 KKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEK 1390



 Score = 35.1 bits (80), Expect = 0.10
 Identities = 20/57 (35%), Positives = 33/57 (57%), Gaps = 4/57 (7%)

Query: 7    EEKEDQEKEVGGISRGKEEET----EEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
            ++KE +EK+     +  EEE      EE KK ++ KKK ++ K+ EE+EK+  +  K
Sbjct: 1639 KKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALK 1695



 Score = 34.7 bits (79), Expect = 0.10
 Identities = 18/63 (28%), Positives = 36/63 (57%), Gaps = 10/63 (15%)

Query: 8    EKEDQEKEVGGISRGKEEET----------EEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
            +K +++K+    ++  EE+            EE KK ++ KKK+ ++K+  EE K+ E++
Sbjct: 1668 KKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEE 1727

Query: 58   NKL 60
            NK+
Sbjct: 1728 NKI 1730



 Score = 34.7 bits (79), Expect = 0.11
 Identities = 19/59 (32%), Positives = 35/59 (59%)

Query: 3    EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
            +K  E K+ +E +    ++  EE+ + ++ KK ++ KK ++KK+ EE +K EE  N  L
Sbjct: 1522 KKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMAL 1580



 Score = 34.3 bits (78), Expect = 0.16
 Identities = 19/57 (33%), Positives = 33/57 (57%), Gaps = 1/57 (1%)

Query: 3    EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
            +K  EE + +  E+   +  K++  +E KKK ++KKK  + KK+ EE +K +E   K
Sbjct: 1397 KKKAEEDKKKADELKKAAAAKKK-ADEAKKKAEEKKKADEAKKKAEEAKKADEAKKK 1452



 Score = 34.3 bits (78), Expect = 0.17
 Identities = 21/63 (33%), Positives = 33/63 (52%), Gaps = 5/63 (7%)

Query: 2    NEKGGEE--KEDQEKEVGGISRGKEEE---TEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
            ++K  +E  K    K+    ++ K EE    +E KKK ++ KK  + KK+ EE +K EE 
Sbjct: 1403 DKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEA 1462

Query: 57   DNK 59
              K
Sbjct: 1463 KKK 1465



 Score = 34.0 bits (77), Expect = 0.19
 Identities = 21/58 (36%), Positives = 38/58 (65%)

Query: 3    EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
            EK  + +E ++ E   I   + ++ EEEKKK ++ KKK+ ++K+  EE K+ E++NK+
Sbjct: 1604 EKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKI 1661



 Score = 34.0 bits (77), Expect = 0.20
 Identities = 20/59 (33%), Positives = 32/59 (54%), Gaps = 5/59 (8%)

Query: 3    EKGGEE--KEDQEKEVGGISRGKEE---ETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
            +K  EE  K D+ K+    ++  +E   + EE KKK  + KK  + KK+ +E +K EE 
Sbjct: 1463 KKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEA 1521



 Score = 33.6 bits (76), Expect = 0.30
 Identities = 17/60 (28%), Positives = 33/60 (55%), Gaps = 4/60 (6%)

Query: 4    KGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKK----KKRYEEEEKEEEKDNK 59
            +  E+K+++ K+    ++ K EE ++  + KKK ++ KKK    KK    ++K +E   K
Sbjct: 1367 EAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKK 1426



 Score = 33.2 bits (75), Expect = 0.31
 Identities = 18/58 (31%), Positives = 31/58 (53%), Gaps = 1/58 (1%)

Query: 3    EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKK-KKRYEEEEKEEEKDNK 59
            +K  E  + + +     +   EE+ E  +KKK++ KKK    KK+ EE++K +E   K
Sbjct: 1342 KKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKK 1399



 Score = 33.2 bits (75), Expect = 0.31
 Identities = 17/51 (33%), Positives = 29/51 (56%)

Query: 9    KEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
            K D+ K+     +  E +  EEKKK  + KKK ++ K+ +E +K+ E+  K
Sbjct: 1279 KADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKK 1329



 Score = 33.2 bits (75), Expect = 0.36
 Identities = 19/55 (34%), Positives = 33/55 (60%), Gaps = 3/55 (5%)

Query: 8    EKEDQEKEVGGISRGKEEE---TEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
            +K+ +EK+    ++ K EE    +E KKK ++ KK ++ KK+ EE +K +E   K
Sbjct: 1424 KKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKK 1478



 Score = 33.2 bits (75), Expect = 0.38
 Identities = 11/57 (19%), Positives = 24/57 (42%)

Query: 3    EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
            +K    K+  E+        K E      + +  ++K +  +K+ EE +K+ +   K
Sbjct: 1329 KKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKK 1385



 Score = 33.2 bits (75), Expect = 0.38
 Identities = 15/59 (25%), Positives = 28/59 (47%)

Query: 1    MNEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
              +K  E K+  E          +E    E+K +  +KKK++ KK+ +  +K+ E+  K
Sbjct: 1334 AKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKK 1392



 Score = 33.2 bits (75), Expect = 0.38
 Identities = 12/57 (21%), Positives = 23/57 (40%)

Query: 3    EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
            +K  E K+  E+        K++  E +K  +  K + +      E  E++ E   K
Sbjct: 1315 KKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEK 1371



 Score = 33.2 bits (75), Expect = 0.39
 Identities = 16/54 (29%), Positives = 28/54 (51%)

Query: 3    EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
            +K  +  + + +E       K++  E++KK  + KK    KKK  E ++K EEK
Sbjct: 1377 KKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEK 1430



 Score = 33.2 bits (75), Expect = 0.39
 Identities = 18/57 (31%), Positives = 31/57 (54%), Gaps = 1/57 (1%)

Query: 3    EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
            +K  EEK+  + E    +   +++ +E KK    KKK  + KK+ EE++K +E   K
Sbjct: 1384 KKKAEEKKKAD-EAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKK 1439



 Score = 32.8 bits (74), Expect = 0.43
 Identities = 8/54 (14%), Positives = 26/54 (48%)

Query: 3    EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
            E+  ++ +  +K+     +  E    E +    + +  ++K +  E++++E +K
Sbjct: 1325 EEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKK 1378



 Score = 32.8 bits (74), Expect = 0.47
 Identities = 20/54 (37%), Positives = 33/54 (61%), Gaps = 1/54 (1%)

Query: 3    EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
            +K  E K+ +E+        K+E  EE+KKK ++ KK +++KK+    +KEEEK
Sbjct: 1716 KKAEELKKAEEENKIKAEEAKKEA-EEDKKKAEEAKKDEEEKKKIAHLKKEEEK 1768



 Score = 32.8 bits (74), Expect = 0.52
 Identities = 10/57 (17%), Positives = 25/57 (43%)

Query: 4    KGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
            K  +E + + +E    +   +++ EE KK  +  K + +      E  +E+ +  + 
Sbjct: 1315 KKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEK 1371



 Score = 32.4 bits (73), Expect = 0.57
 Identities = 18/54 (33%), Positives = 32/54 (59%)

Query: 3    EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
            +K  E K+ +E +    ++  EE  + ++ KK ++KKK  + K+ EE +K EEK
Sbjct: 1510 KKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEK 1563



 Score = 32.4 bits (73), Expect = 0.58
 Identities = 15/51 (29%), Positives = 31/51 (60%)

Query: 9    KEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
            K ++ ++   + + +E++  +E KK ++KKK  + KK+ EE +K +E   K
Sbjct: 1273 KAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKK 1323



 Score = 32.0 bits (72), Expect = 0.74
 Identities = 20/57 (35%), Positives = 32/57 (56%)

Query: 3    EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
            +K  E K+ +E E       K+E  E +K ++ KKK+ ++KKK  E ++ EEE   K
Sbjct: 1675 KKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIK 1731



 Score = 32.0 bits (72), Expect = 0.81
 Identities = 15/56 (26%), Positives = 25/56 (44%)

Query: 4    KGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
            K  +E + + +E       K++  E +KK    KKK ++ KK  E  + E E    
Sbjct: 1302 KKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAAD 1357



 Score = 32.0 bits (72), Expect = 0.84
 Identities = 17/60 (28%), Positives = 29/60 (48%), Gaps = 3/60 (5%)

Query: 3    EKGGEEKEDQEKEVGGISRGKEE---ETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
            E   E+ E  EK+     +  +    + EE+KK  + KKK ++ KK+ +E +K      K
Sbjct: 1360 EAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKK 1419



 Score = 32.0 bits (72), Expect = 0.85
 Identities = 19/58 (32%), Positives = 32/58 (55%), Gaps = 4/58 (6%)

Query: 3    EKGGEE--KEDQEKEVGGISRGKEEETE--EEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
            +K  EE  K D+ K+    ++ K +E +   E KKK  + KK ++ K+ +E +K EE 
Sbjct: 1476 KKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEA 1533



 Score = 32.0 bits (72), Expect = 0.91
 Identities = 10/53 (18%), Positives = 25/53 (47%)

Query: 7    EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
            ++K ++ K+    ++ K EE ++  +  K + +    +    EE+ E  +  K
Sbjct: 1321 KKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKK 1373



 Score = 31.6 bits (71), Expect = 0.92
 Identities = 16/58 (27%), Positives = 29/58 (50%), Gaps = 4/58 (6%)

Query: 3    EKGGEE--KEDQEKEVGGISRGKEEETEE--EKKKKKKKKKKKKKKKRYEEEEKEEEK 56
            +K  EE  K D+ K+    ++ K +  ++  E+ KK  +  K + +   +E E  EEK
Sbjct: 1308 KKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEK 1365



 Score = 31.6 bits (71), Expect = 1.1
 Identities = 19/63 (30%), Positives = 30/63 (47%), Gaps = 4/63 (6%)

Query: 1    MNEKGGEEKEDQEKEVGGISRGKEEE----TEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
              +K  E K+ +E +       K +E     EE KK  + KKK ++ KK+ +E +K  E 
Sbjct: 1449 AKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEA 1508

Query: 57   DNK 59
              K
Sbjct: 1509 KKK 1511



 Score = 31.3 bits (70), Expect = 1.3
 Identities = 16/55 (29%), Positives = 31/55 (56%), Gaps = 3/55 (5%)

Query: 8    EKEDQEKEVGGISRGKEEE---TEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
            +K+ +E +    ++ K EE    +E KKK ++ KKK  + K+  E +K+ ++  K
Sbjct: 1463 KKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKK 1517



 Score = 31.3 bits (70), Expect = 1.5
 Identities = 15/57 (26%), Positives = 27/57 (47%)

Query: 3    EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
            EK   ++  + +E       K++  E +K  + KKK ++ KKK    ++K EE    
Sbjct: 1288 EKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKA 1344



 Score = 30.9 bits (69), Expect = 1.7
 Identities = 15/50 (30%), Positives = 29/50 (58%)

Query: 7    EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
            ++K D+ K+     +  +E  + E+ KK  + KK ++ K+ +E +K EEK
Sbjct: 1496 KKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEK 1545



 Score = 30.5 bits (68), Expect = 2.1
 Identities = 13/56 (23%), Positives = 26/56 (46%), Gaps = 4/56 (7%)

Query: 8    EKEDQEKEVGGISRGKEEET----EEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
            +K+ +E +    ++ K EE     +  KKK ++ KK  +  K   E   +E +  +
Sbjct: 1308 KKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAE 1363



 Score = 30.5 bits (68), Expect = 2.4
 Identities = 14/57 (24%), Positives = 30/57 (52%)

Query: 3    EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
            E+  ++ E+ +K      + +E +  +E KKK ++ KKK  + +   E K++  + K
Sbjct: 1460 EEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAK 1516



 Score = 30.5 bits (68), Expect = 2.4
 Identities = 15/55 (27%), Positives = 31/55 (56%), Gaps = 2/55 (3%)

Query: 3    EKGGEE--KEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEE 55
            +K  +E  K  + K+    ++  EE  + ++ KK ++ KK  + K+ EE++K +E
Sbjct: 1496 KKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADE 1550



 Score = 30.5 bits (68), Expect = 2.5
 Identities = 16/54 (29%), Positives = 26/54 (48%), Gaps = 2/54 (3%)

Query: 4    KGGEEKEDQEKEVGGISRGKEEETE--EEKKKKKKKKKKKKKKKRYEEEEKEEE 55
            +     E  E+  G     K+ ET   EE +K ++ KKK +  ++ EE  K E+
Sbjct: 1085 EDNRADEATEEAFGKAEEAKKTETGKAEEARKAEEAKKKAEDARKAEEARKAED 1138



 Score = 30.1 bits (67), Expect = 2.8
 Identities = 12/53 (22%), Positives = 30/53 (56%)

Query: 3    EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEE 55
            ++  ++ E+ +K+     +  E + + ++ KK ++ KK  + K+ EE +K +E
Sbjct: 1486 DEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADE 1538



 Score = 29.7 bits (66), Expect = 4.1
 Identities = 17/54 (31%), Positives = 31/54 (57%)

Query: 3    EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
            +K  E K+  E +    ++  +E  + E+KKK  + KK ++ K+ EE++K EE 
Sbjct: 1516 KKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEA 1569



 Score = 29.7 bits (66), Expect = 4.2
 Identities = 14/55 (25%), Positives = 29/55 (52%)

Query: 1    MNEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEE 55
              +    +K+  E +    ++  +E  + E+ KK  + KK ++KK+ +E +K EE
Sbjct: 1502 AKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEE 1556



 Score = 29.7 bits (66), Expect = 4.6
 Identities = 13/47 (27%), Positives = 26/47 (55%)

Query: 9    KEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEE 55
            K ++E++     + ++ +  E  KK ++ KK  ++ K+ EEE   EE
Sbjct: 1207 KAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEE 1253



 Score = 29.3 bits (65), Expect = 4.9
 Identities = 19/55 (34%), Positives = 32/55 (58%), Gaps = 3/55 (5%)

Query: 8    EKEDQEKEVGGISRGKEEETE--EEKKKKKKKKKKKKK-KKRYEEEEKEEEKDNK 59
            +K+ +E +    ++ K EE +  EE KKK ++ KK  + KK+ EE +K +E   K
Sbjct: 1437 KKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKK 1491



 Score = 29.3 bits (65), Expect = 5.9
 Identities = 16/53 (30%), Positives = 27/53 (50%)

Query: 4    KGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
            K  +E + + +E       K++  E +KK  + KK  + KKK  E ++ EE K
Sbjct: 1470 KKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAK 1522



 Score = 29.3 bits (65), Expect = 6.0
 Identities = 18/57 (31%), Positives = 32/57 (56%), Gaps = 1/57 (1%)

Query: 4    KGGEEKEDQEKEVGGISRGKEEETE-EEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
            K  E K+ +EK+    ++  EE+ + +E KKK ++ KK  + K+  EE K++    K
Sbjct: 1279 KADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAK 1335



 Score = 29.0 bits (64), Expect = 6.8
 Identities = 17/57 (29%), Positives = 32/57 (56%), Gaps = 4/57 (7%)

Query: 7    EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKK----KRYEEEEKEEEKDNK 59
            ++K +Q K+     + K EE ++ +++ K K  ++ KK    K+  EE K+ E+D K
Sbjct: 1632 KKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEK 1688



 Score = 29.0 bits (64), Expect = 7.3
 Identities = 12/56 (21%), Positives = 24/56 (42%)

Query: 1    MNEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
            + +  G +   ++ +       + +E  EE   K ++ KK +  K  E  + EE K
Sbjct: 1066 VGQDEGLKPSYKDFDFDAKEDNRADEATEEAFGKAEEAKKTETGKAEEARKAEEAK 1121



 Score = 29.0 bits (64), Expect = 7.7
 Identities = 14/60 (23%), Positives = 33/60 (55%), Gaps = 3/60 (5%)

Query: 3    EKGGEEK---EDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
            +K  EE    ++ +K+     + +E + + E+ KK  + KKK ++ +  +E K++ ++ K
Sbjct: 1437 KKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAK 1496



 Score = 28.6 bits (63), Expect = 9.0
 Identities = 18/52 (34%), Positives = 32/52 (61%), Gaps = 4/52 (7%)

Query: 4    KGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEE 55
            K  E K++ E++     +  EE  ++E++KKK    KK+++K+ EE  KE+E
Sbjct: 1731 KAEEAKKEAEED----KKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKE 1778



 Score = 28.6 bits (63), Expect = 9.3
 Identities = 15/54 (27%), Positives = 28/54 (51%)

Query: 3    EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
            E+  EE    E+        ++   + E+ +K  + KK ++KK+ +E +K EEK
Sbjct: 1248 ERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEK 1301


>gnl|CDD|221756 pfam12757, DUF3812, Protein of unknown function (DUF3812).  This is
           a family of fungal proteins whose function is not known.
          Length = 126

 Score = 36.5 bits (85), Expect = 0.006
 Identities = 20/63 (31%), Positives = 33/63 (52%), Gaps = 9/63 (14%)

Query: 5   GGEEKEDQEKEVGGISRGK--------EEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
           GG    DQ  EV  I+R +        +E  E ++ + ++KK  +++ KR  EE KE E+
Sbjct: 64  GGGLFVDQA-EVDAIARSRVQPVLDEIDERAEAQRARDEEKKLDEEEAKRQHEEAKERER 122

Query: 57  DNK 59
           + K
Sbjct: 123 EKK 125



 Score = 28.4 bits (64), Expect = 3.1
 Identities = 9/38 (23%), Positives = 23/38 (60%)

Query: 19  ISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
           I    E +   +++KK  +++ K++ +  +E E+E++K
Sbjct: 89  IDERAEAQRARDEEKKLDEEEAKRQHEEAKEREREKKK 126


>gnl|CDD|221654 pfam12589, WBS_methylT, Methyltransferase involved in
          Williams-Beuren syndrome.  This domain family is found
          in eukaryotes, and is typically between 72 and 83 amino
          acids in length. The family is found in association
          with pfam08241. This family is made up of
          S-adenosylmethionine-dependent methyltransferases. The
          proteins are deleted in Williams-Beuren syndrome (WBS),
          a complex developmental disorder with multisystemic
          manifestations including supravalvular aortic stenosis
          (SVAS) and a specific cognitive phenotype.
          Length = 85

 Score = 35.4 bits (82), Expect = 0.006
 Identities = 17/48 (35%), Positives = 29/48 (60%), Gaps = 2/48 (4%)

Query: 6  GEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKE 53
          GEE E+ ++++   +      ++  ++KKKKKKK KKK K +   +KE
Sbjct: 17 GEEGEEDDEQID--ASKVRRISQRNRRKKKKKKKLKKKSKEWILRKKE 62



 Score = 33.8 bits (78), Expect = 0.021
 Identities = 14/39 (35%), Positives = 21/39 (53%)

Query: 8  EKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKR 46
            E+ E++   I   K     +  ++KKKKKKK KKK +
Sbjct: 16 LGEEGEEDDEQIDASKVRRISQRNRRKKKKKKKLKKKSK 54



 Score = 31.5 bits (72), Expect = 0.15
 Identities = 15/43 (34%), Positives = 22/43 (51%)

Query: 2  NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKK 44
             G  E+ +++ E    S+ +       +KKKKKKK KKK K
Sbjct: 12 LPNGLGEEGEEDDEQIDASKVRRISQRNRRKKKKKKKLKKKSK 54



 Score = 26.9 bits (60), Expect = 5.8
 Identities = 20/61 (32%), Positives = 29/61 (47%), Gaps = 5/61 (8%)

Query: 4  KGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKK-----KKKKKKRYEEEEKEEEKDN 58
           G E +ED E+      R   +    +KKKKKK KK       +KK++     KE + D+
Sbjct: 16 LGEEGEEDDEQIDASKVRRISQRNRRKKKKKKKLKKKSKEWILRKKEQMRRRGKEVKPDS 75

Query: 59 K 59
          K
Sbjct: 76 K 76


>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6.  The surfeit locus
          protein SURF-6 is shown to be a component of the
          nucleolar matrix and has a strong binding capacity for
          nucleic acids.
          Length = 206

 Score = 36.9 bits (86), Expect = 0.009
 Identities = 11/36 (30%), Positives = 27/36 (75%)

Query: 25 EETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
          E   E++++K++++K +KK+KR E ++KE+ + ++ 
Sbjct: 5  EALLEQRRRKREQRKARKKQKRKEAKKKEDAQKSEA 40



 Score = 36.1 bits (84), Expect = 0.016
 Identities = 14/38 (36%), Positives = 25/38 (65%)

Query: 21 RGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDN 58
          R K E+ +  KK+K+K+ KKK+  ++ E EE + E++ 
Sbjct: 12 RRKREQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENK 49



 Score = 35.4 bits (82), Expect = 0.034
 Identities = 13/32 (40%), Positives = 23/32 (71%)

Query: 28 EEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
          E+ K +KK+K+K+ KKK+  ++ E EE K+ +
Sbjct: 16 EQRKARKKQKRKEAKKKEDAQKSEAEEVKNEE 47



 Score = 32.3 bits (74), Expect = 0.34
 Identities = 8/37 (21%), Positives = 22/37 (59%)

Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
                  +++++K++++K +KK+  +E K++E   K
Sbjct: 1  PSSREALLEQRRRKREQRKARKKQKRKEAKKKEDAQK 37



 Score = 31.5 bits (72), Expect = 0.53
 Identities = 12/32 (37%), Positives = 22/32 (68%)

Query: 28 EEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
           E++K +KK+K+K+ KKK   ++ + EE  N+
Sbjct: 15 REQRKARKKQKRKEAKKKEDAQKSEAEEVKNE 46



 Score = 30.7 bits (70), Expect = 1.1
 Identities = 8/37 (21%), Positives = 26/37 (70%)

Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
          +E   E+ ++K++++K +KK+K++  +++++ +K   
Sbjct: 4  REALLEQRRRKREQRKARKKQKRKEAKKKEDAQKSEA 40



 Score = 30.0 bits (68), Expect = 2.2
 Identities = 13/29 (44%), Positives = 20/29 (68%)

Query: 31  KKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
           KK  K+K+K+KKK K+  +E KE+ +  K
Sbjct: 145 KKALKRKEKQKKKSKKEWKERKEKVEKKK 173



 Score = 29.6 bits (67), Expect = 2.3
 Identities = 10/16 (62%), Positives = 13/16 (81%)

Query: 29  EEKKKKKKKKKKKKKK 44
           ++KK KKKKK KKK +
Sbjct: 189 DDKKNKKKKKAKKKGR 204



 Score = 29.6 bits (67), Expect = 2.6
 Identities = 11/17 (64%), Positives = 11/17 (64%)

Query: 31  KKKKKKKKKKKKKKKRY 47
             KK KKKKK KKK R 
Sbjct: 189 DDKKNKKKKKAKKKGRI 205



 Score = 29.6 bits (67), Expect = 2.6
 Identities = 13/23 (56%), Positives = 15/23 (65%)

Query: 24  EEETEEEKKKKKKKKKKKKKKKR 46
           EE  ++ K  KK KKKKK KKK 
Sbjct: 181 EENLKKRKDDKKNKKKKKAKKKG 203



 Score = 29.6 bits (67), Expect = 2.7
 Identities = 23/61 (37%), Positives = 31/61 (50%), Gaps = 3/61 (4%)

Query: 3   EKGGEEKEDQEKEVGGISRGKEE--ETEEEKKKKKKKKKKKKKKKRYEEE-EKEEEKDNK 59
            K  ++    E   G I   K E  + E+ KK  K KKKKKKKK  Y++  +K E +  K
Sbjct: 48  NKSKKKAAPIENAEGNIVFSKVEFADGEQAKKDLKLKKKKKKKKTDYKQLLKKLEARKKK 107

Query: 60  L 60
           L
Sbjct: 108 L 108



 Score = 28.8 bits (65), Expect = 4.1
 Identities = 10/16 (62%), Positives = 12/16 (75%)

Query: 30  EKKKKKKKKKKKKKKK 45
           + KK KKKKK KKK +
Sbjct: 189 DDKKNKKKKKAKKKGR 204



 Score = 28.8 bits (65), Expect = 4.7
 Identities = 12/26 (46%), Positives = 21/26 (80%)

Query: 31  KKKKKKKKKKKKKKKRYEEEEKEEEK 56
           +K+K+KKK KK+ K+R E+ EK++ +
Sbjct: 150 RKEKQKKKSKKEWKERKEKVEKKKAE 175


>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein.  The YqfQ-like protein family
           includes the B. subtilis YqfQ protein, also known as
           VrrA, which is functionally uncharacterized. This family
           of proteins is found in bacteria. Proteins in this
           family are typically between 146 and 237 amino acids in
           length. There are two conserved sequence motifs: QYGP
           and PKLY.
          Length = 155

 Score = 36.3 bits (84), Expect = 0.010
 Identities = 17/58 (29%), Positives = 30/58 (51%), Gaps = 3/58 (5%)

Query: 2   NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
                +E+E+ E+E    +   E+E   E K + K+KKK++  K   E+EK + +  K
Sbjct: 91  LSSSDDEEEETEEES---TDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKK 145


>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
          Length = 434

 Score = 37.6 bits (88), Expect = 0.010
 Identities = 15/54 (27%), Positives = 25/54 (46%), Gaps = 3/54 (5%)

Query: 9   KEDQEKEVGGISR---GKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
           K   EK+ G  S+    K  E +E++K+K K KK+ +  K   +  K      +
Sbjct: 381 KAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434



 Score = 36.5 bits (85), Expect = 0.027
 Identities = 13/48 (27%), Positives = 22/48 (45%)

Query: 8   EKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEE 55
           +     K+V      K+E+ +E+ K KK+ +  K   KR +     EE
Sbjct: 387 KTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434



 Score = 36.1 bits (84), Expect = 0.034
 Identities = 14/33 (42%), Positives = 19/33 (57%)

Query: 27  TEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
           T+   +KK  K  KK   KR E++EKE+EK   
Sbjct: 380 TKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKV 412



 Score = 35.3 bits (82), Expect = 0.063
 Identities = 13/51 (25%), Positives = 23/51 (45%)

Query: 4   KGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEE 54
              +  +  +K +   +  KE+E E+ K KK+ +  K   K+R      EE
Sbjct: 384 SEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434



 Score = 33.0 bits (76), Expect = 0.33
 Identities = 14/53 (26%), Positives = 24/53 (45%), Gaps = 6/53 (11%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
           +E   + K       GK  +    K+ +KK+K+K+K K       K+  +D K
Sbjct: 374 DELRPKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKV------KKRHRDTK 420



 Score = 29.5 bits (67), Expect = 3.9
 Identities = 10/35 (28%), Positives = 18/35 (51%)

Query: 28  EEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLH 62
              +KK  K  KK   K+  ++E+++E+   K  H
Sbjct: 382 APSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRH 416


>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
           This family represents Cwf15/Cwc15 (from
           Schizosaccharomyces pombe and Saccharomyces cerevisiae
           respectively) and their homologues. The function of
           these proteins is unknown, but they form part of the
           spliceosome and are thus thought to be involved in mRNA
           splicing.
          Length = 241

 Score = 37.0 bits (86), Expect = 0.011
 Identities = 13/68 (19%), Positives = 31/68 (45%), Gaps = 11/68 (16%)

Query: 1   MNEKGGEEKEDQEKEVGGISRGKEEETEE-----EKKKKKKKKKKKKKKKRYEEEEKEEE 55
            +       +D + +        E+ET       EK KK++ ++K+++++    EE++  
Sbjct: 124 SSSDSDSSDDDSDDD------DSEDETAALLRELEKIKKERAEEKEREEEEKAAEEEKAR 177

Query: 56  KDNKLLHN 63
           ++  L  N
Sbjct: 178 EEEILTGN 185


>gnl|CDD|217502 pfam03343, SART-1, SART-1 family.  SART-1 is a protein involved in
           cell cycle arrest and pre-mRNA splicing. It has been
           shown to be a component of U4/U6 x U5 tri-snRNP complex
           in human, Schizosaccharomyces pombe and Saccharomyces
           cerevisiae. SART-1 is a known tumour antigen in a range
           of cancers recognised by T cells.
          Length = 603

 Score = 37.4 bits (87), Expect = 0.013
 Identities = 17/34 (50%), Positives = 22/34 (64%)

Query: 22  GKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEE 55
               E  + KK KKKKKKKKK++K  +E+E E E
Sbjct: 267 YDVSEMVKFKKPKKKKKKKKKRRKDLDEDELEPE 300



 Score = 37.4 bits (87), Expect = 0.014
 Identities = 17/33 (51%), Positives = 23/33 (69%)

Query: 25  EETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
           + +E  K KK KKKKKKKKK+R + +E E E +
Sbjct: 268 DVSEMVKFKKPKKKKKKKKKRRKDLDEDELEPE 300



 Score = 34.0 bits (78), Expect = 0.18
 Identities = 15/62 (24%), Positives = 35/62 (56%), Gaps = 6/62 (9%)

Query: 4   KGGEEKEDQEKEVGGISRGKEEETEEE------KKKKKKKKKKKKKKKRYEEEEKEEEKD 57
               EK ++  ++GGI    E++ +++      KK KK++KKK+ ++K+    +++E++ 
Sbjct: 63  AKAREKRERNSKLGGIKTLGEDDDDDDDTKAWLKKSKKRQKKKEAERKKALLLDEKEKER 122

Query: 58  NK 59
             
Sbjct: 123 AA 124



 Score = 32.8 bits (75), Expect = 0.42
 Identities = 14/25 (56%), Positives = 18/25 (72%)

Query: 31  KKKKKKKKKKKKKKKRYEEEEKEEE 55
           K KKKKKKKKK++K   E+E + E 
Sbjct: 277 KPKKKKKKKKKRRKDLDEDELEPEA 301



 Score = 32.8 bits (75), Expect = 0.43
 Identities = 19/71 (26%), Positives = 33/71 (46%), Gaps = 12/71 (16%)

Query: 2   NEKGGEEKEDQEKEVGGISRGKEEETEEEK------------KKKKKKKKKKKKKKRYEE 49
           +E   + +E ++K         ++ TE               KK KKKKKKKKK+++  +
Sbjct: 234 DEAEKKRQEVKKKLKINNVSLDDDSTETPASDYYDVSEMVKFKKPKKKKKKKKKRRKDLD 293

Query: 50  EEKEEEKDNKL 60
           E++ E +   L
Sbjct: 294 EDELEPEAEGL 304



 Score = 31.7 bits (72), Expect = 0.86
 Identities = 14/29 (48%), Positives = 17/29 (58%)

Query: 31  KKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
            KKKKKKKKK++K    +E E E E    
Sbjct: 278 PKKKKKKKKKRRKDLDEDELEPEAEGLGS 306



 Score = 30.1 bits (68), Expect = 2.4
 Identities = 13/51 (25%), Positives = 25/51 (49%), Gaps = 1/51 (1%)

Query: 7   EEKEDQEKEVGGISRGKEEE-TEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
           EE E ++K+   +        T++E +KK+++ KKK K      ++   E 
Sbjct: 211 EEIEGKKKKSDNLFTLDSGGSTDDEAEKKRQEVKKKLKINNVSLDDDSTET 261



 Score = 29.0 bits (65), Expect = 5.6
 Identities = 17/69 (24%), Positives = 27/69 (39%), Gaps = 16/69 (23%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKK----------------KKKKKKKKKRYEEE 50
           E K  +E+    I++ +E+     K    K                KK KK++KK+  E 
Sbjct: 50  EAKRKREELREKIAKAREKRERNSKLGGIKTLGEDDDDDDDTKAWLKKSKKRQKKKEAER 109

Query: 51  EKEEEKDNK 59
           +K    D K
Sbjct: 110 KKALLLDEK 118



 Score = 29.0 bits (65), Expect = 6.1
 Identities = 14/46 (30%), Positives = 24/46 (52%), Gaps = 7/46 (15%)

Query: 24  EEETEE-------EKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLH 62
           E+E +E       EK+K KK  + KKKK  Y+ ++ ++     +L 
Sbjct: 162 EDEGDELENVELVEKEKDKKNLELKKKKPDYDPDDDDKFNKRSILS 207


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
           found at the N terminus of SMC proteins. The SMC
           (structural maintenance of chromosomes) superfamily
           proteins have ATP-binding domains at the N- and
           C-termini, and two extended coiled-coil domains
           separated by a hinge in the middle. The eukaryotic SMC
           proteins form two kind of heterodimers: the SMC1/SMC3
           and the SMC2/SMC4 types. These heterodimers constitute
           an essential part of higher order complexes, which are
           involved in chromatin and DNA dynamics. This family also
           includes the RecF and RecN proteins that are involved in
           DNA metabolism and recombination.
          Length = 1162

 Score = 37.6 bits (87), Expect = 0.014
 Identities = 14/55 (25%), Positives = 26/55 (47%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
           E +E  E + G ++  KEE  ++  K  K K + KK +   +E +  +  +   L
Sbjct: 435 ELEESLETKQGKLTEEKEELEKQALKLLKDKLELKKSEDLLKETKLVKLLEQLEL 489



 Score = 36.1 bits (83), Expect = 0.048
 Identities = 18/60 (30%), Positives = 35/60 (58%), Gaps = 1/60 (1%)

Query: 2   NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKK-KKKKKKKKRYEEEEKEEEKDNKL 60
            +   +++E++E++     + KE   EEEK +K K  ++K++K K  EEE +  E++ K 
Sbjct: 754 EKSRLKKEEEEEEKSELSLKEKELAEEEEKTEKLKVEEEKEEKLKAQEEELRALEEELKE 813



 Score = 34.6 bits (79), Expect = 0.14
 Identities = 12/54 (22%), Positives = 24/54 (44%), Gaps = 2/54 (3%)

Query: 7    EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKK--KKKKKKRYEEEEKEEEKDN 58
             ++E     +  I+  +E+E    K + KK++   +KK+  R   EE  +    
Sbjct: 969  AKEELGNVNLMAIAEFEEKEERYNKDELKKERLEEEKKELLREIIEETCQRFKE 1022



 Score = 34.2 bits (78), Expect = 0.17
 Identities = 18/49 (36%), Positives = 27/49 (55%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEE 55
           E K +  K        +E+  E EK+ KK +K+ KK+K+  EE EKE +
Sbjct: 298 ELKSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKELK 346



 Score = 33.4 bits (76), Expect = 0.30
 Identities = 13/43 (30%), Positives = 25/43 (58%)

Query: 25  EETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLHNCYLY 67
           +  E + ++ K K++ KK  + Y+ +EK E ++  LL+  YL 
Sbjct: 192 DLEELKLQELKLKEQAKKALEYYQLKEKLELEEENLLYLDYLK 234



 Score = 32.2 bits (73), Expect = 0.59
 Identities = 18/54 (33%), Positives = 34/54 (62%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
           +EK+ QE+E+  +++ +EE   E  K +++K   ++K K  E+E K+ EK+ K 
Sbjct: 280 KEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKELKK 333



 Score = 32.2 bits (73), Expect = 0.65
 Identities = 11/57 (19%), Positives = 24/57 (42%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLHN 63
           EE   Q  +       +++  EEE K   K++++ K +    E  K ++++      
Sbjct: 265 EEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESE 321



 Score = 31.9 bits (72), Expect = 0.80
 Identities = 17/63 (26%), Positives = 34/63 (53%), Gaps = 2/63 (3%)

Query: 1   MNEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
                 E+ ++ EKE+  +   KE + E+E+ ++ +K+ K+ + KR  EEE+EE+ +   
Sbjct: 309 RKVDDEEKLKESEKELKKLE--KELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQ 366

Query: 61  LHN 63
              
Sbjct: 367 EKL 369



 Score = 31.5 bits (71), Expect = 1.1
 Identities = 17/66 (25%), Positives = 28/66 (42%), Gaps = 6/66 (9%)

Query: 2   NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKK------KKKKKKKKKKKRYEEEEKEEE 55
            EK   E+ ++E +   I R  EEE EE+ +K        +++   KKK   E      +
Sbjct: 333 KEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLESERLSSAAK 392

Query: 56  KDNKLL 61
              + L
Sbjct: 393 LKEEEL 398



 Score = 31.1 bits (70), Expect = 1.7
 Identities = 12/47 (25%), Positives = 24/47 (51%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKE 53
           EE E+  + +  +   K +E + +++ KK  +  + K+K   EEE  
Sbjct: 181 EETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEENL 227



 Score = 30.7 bits (69), Expect = 1.8
 Identities = 13/58 (22%), Positives = 31/58 (53%)

Query: 1   MNEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDN 58
             E+  +E+ +  +E   +   +E+  EEE ++   + K+++K ++  EEE E  ++ 
Sbjct: 806 ALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEEELERLEEE 863



 Score = 30.7 bits (69), Expect = 1.8
 Identities = 13/56 (23%), Positives = 29/56 (51%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLH 62
           + +  + +        K +E+E+E KK +K+ KK+K++    E+E +E +  +   
Sbjct: 300 KSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKELEIKREAE 355



 Score = 30.7 bits (69), Expect = 2.2
 Identities = 13/61 (21%), Positives = 27/61 (44%), Gaps = 3/61 (4%)

Query: 1   MNEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
             +     KE +  E        E+   EE+K++K K ++++ +   EE ++E E   + 
Sbjct: 765 EEKSELSLKEKELAEEEEK---TEKLKVEEEKEEKLKAQEEELRALEEELKEEAELLEEE 821

Query: 61  L 61
            
Sbjct: 822 Q 822



 Score = 29.9 bits (67), Expect = 3.2
 Identities = 17/49 (34%), Positives = 29/49 (59%)

Query: 8   EKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
            K++ EKE   +++  +E  EEEK+KK ++++ K   K  EE + E  K
Sbjct: 257 SKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLK 305



 Score = 29.9 bits (67), Expect = 3.7
 Identities = 11/56 (19%), Positives = 23/56 (41%), Gaps = 6/56 (10%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLH 62
           E K  + K        ++ +   E  + K+K + +++   Y +  K  E+   LL 
Sbjct: 195 ELKLQELKLK------EQAKKALEYYQLKEKLELEEENLLYLDYLKLNEERIDLLQ 244



 Score = 29.2 bits (65), Expect = 5.1
 Identities = 15/56 (26%), Positives = 23/56 (41%), Gaps = 1/56 (1%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKK-KRYEEEEKEEEKDNKLL 61
           E   ++  E       KEEE E  K+    K++          E E++EE+ NK  
Sbjct: 940 ELLLEEADEKEKEEDNKEEEEERNKRLLLAKEELGNVNLMAIAEFEEKEERYNKDE 995



 Score = 29.2 bits (65), Expect = 5.2
 Identities = 14/59 (23%), Positives = 29/59 (49%), Gaps = 6/59 (10%)

Query: 2   NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
            E   +E+E+ E         +E E EEE   +  K+ K+++K++  +EE+ +    + 
Sbjct: 244 QELLRDEQEEIESSK------QELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEE 296



 Score = 28.8 bits (64), Expect = 7.6
 Identities = 13/54 (24%), Positives = 25/54 (46%), Gaps = 2/54 (3%)

Query: 7    EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
            E +E +E+        K+E  EEEKK+  ++  ++  ++  E  E     +  L
Sbjct: 983  EFEEKEERYNK--DELKKERLEEEKKELLREIIEETCQRFKEFLELFVSINRGL 1034


>gnl|CDD|241486 cd13332, FERM_C_JAK1, Janus kinase 1 FERM domain C-lobe.  JAK1 is a
           tyrosine kinase protein essential in signaling type I
           and type II cytokines. It interacts with the gamma chain
           of type I cytokine receptors to elicit signals from the
           IL-2 receptor family, the IL-4 receptor family, the
           gp130 receptor family, ciliary neurotrophic factor
           receptor (CNTF-R), neurotrophin-1 receptor (NNT-1R) and
           Leptin-R). It also is involved in transducing a signal
           by type I (IFN-alpha/beta) and type II (IFN-gamma)
           interferons, and members of the IL-10 family via type II
           cytokine receptors. JAK (also called Just Another
           Kinase) is a family of intracellular, non-receptor
           tyrosine kinases that transduce cytokine-mediated
           signals via the JAK-STAT pathway. The JAK family in
           mammals consists of 4 members: JAK1, JAK2, JAK3 and
           TYK2. JAKs are composed of seven JAK homology (JH)
           domains (JH1-JH7) . The C-terminal JH1 domain is the
           main catalytic domain, followed by JH2, which is often
           referred to as a pseudokinase domain, followed by
           JH3-JH4 which is homologous to the SH2 domain, and
           lastly JH5-JH7 which is a FERM domain.  Named after
           Janus, the two-faced Roman god of doorways, JAKs possess
           two near-identical phosphate-transferring domains; one
           which displays the kinase activity (JH1), while the
           other negatively regulates the kinase activity of the
           first (JH2). The FERM domain has a cloverleaf tripart
           structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
           C-lobe/A-lobe,A-lobe, B-lobe, C-lobe/F1, F2, F3). The
           C-lobe/F3 within the FERM domain is part of the PH
           domain family. The FERM domain is found in the
           cytoskeletal-associated proteins such as ezrin, moesin,
           radixin, 4.1R, and merlin. These proteins provide a link
           between the membrane and cytoskeleton and are involved
           in signal transduction pathways. The FERM domain is also
           found in protein tyrosine phosphatases (PTPs) , the
           tyrosine kinases FAK and JAK, in addition to other
           proteins involved in signaling. This domain is
           structurally similar to the PH and PTB domains and
           consequently is capable of binding to both peptides and
           phospholipids at different sites.
          Length = 198

 Score = 36.3 bits (84), Expect = 0.017
 Identities = 25/63 (39%), Positives = 33/63 (52%), Gaps = 7/63 (11%)

Query: 18  GIS-RGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLHNCYLYSYYRVTHIC 76
           GIS R K   T  EKKKK K KK K K K+ E+++K  E       N + Y +  +THI 
Sbjct: 77  GISWRRKPATTAVEKKKKGKSKKNKLKGKKDEDKKKARE-----GWNNFSY-FPEITHIV 130

Query: 77  FSD 79
             +
Sbjct: 131 IKE 133


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 36.8 bits (86), Expect = 0.018
 Identities = 20/52 (38%), Positives = 34/52 (65%), Gaps = 5/52 (9%)

Query: 13  EKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRY-----EEEEKEEEKDNK 59
           E+E   I +  EEE +EE ++KK++KKK++++ +      EE+ K EEK+ K
Sbjct: 266 EEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERK 317



 Score = 32.2 bits (74), Expect = 0.51
 Identities = 12/39 (30%), Positives = 25/39 (64%), Gaps = 5/39 (12%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKK 45
           EEK+ +E+E         + + EE++K ++K++KK+ +K
Sbjct: 289 EEKKKEERE-----AKLAKLSPEEQRKLEEKERKKQARK 322



 Score = 31.8 bits (73), Expect = 0.72
 Identities = 12/45 (26%), Positives = 28/45 (62%), Gaps = 3/45 (6%)

Query: 2   NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKR 46
            E+  E+KE+++KE       K  +   E+++K ++K++KK+ ++
Sbjct: 281 QEEAQEKKEEKKKEE---REAKLAKLSPEEQRKLEEKERKKQARK 322



 Score = 28.4 bits (64), Expect = 7.8
 Identities = 11/38 (28%), Positives = 26/38 (68%)

Query: 23  KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
           +EEE +  K  +++++++ ++KK  +++E+ E K  KL
Sbjct: 266 EEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKL 303



 Score = 28.4 bits (64), Expect = 8.0
 Identities = 14/38 (36%), Positives = 27/38 (71%)

Query: 23  KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
           K  E EEEK  K  +++++++ +  +EE+K+EE++ KL
Sbjct: 263 KTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKL 300


>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
          Length = 158

 Score = 35.5 bits (82), Expect = 0.020
 Identities = 11/33 (33%), Positives = 25/33 (75%)

Query: 24  EEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
            EE+E+E+K+  K+ ++K+  K  +E+++++EK
Sbjct: 92  PEESEKEQKEVSKETEEKEAIKAKKEKKEKKEK 124



 Score = 33.6 bits (77), Expect = 0.099
 Identities = 12/39 (30%), Positives = 26/39 (66%)

Query: 22  GKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
            +E E E+++  K+ ++K+  K K+ ++E+KE++   KL
Sbjct: 92  PEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKL 130



 Score = 31.3 bits (71), Expect = 0.53
 Identities = 8/31 (25%), Positives = 20/31 (64%)

Query: 27  TEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
             EE +K++K+  K+ ++K   + +KE+++ 
Sbjct: 91  FPEESEKEQKEVSKETEEKEAIKAKKEKKEK 121



 Score = 31.3 bits (71), Expect = 0.57
 Identities = 19/54 (35%), Positives = 29/54 (53%), Gaps = 6/54 (11%)

Query: 6   GEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
            EE E ++KE        +E  E+E  K KK+KK+KK+KK  E+  K++     
Sbjct: 92  PEESEKEQKE------VSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTT 139



 Score = 30.9 bits (70), Expect = 0.63
 Identities = 11/36 (30%), Positives = 24/36 (66%)

Query: 26  ETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
             E EK++K+  K+ ++K+    ++EK+E+K+ K+ 
Sbjct: 92  PEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVA 127



 Score = 30.1 bits (68), Expect = 1.2
 Identities = 17/54 (31%), Positives = 27/54 (50%)

Query: 3   EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
           EK  +E   + +E   I   KE++ ++EKK  +K  KKK  K      +K  +K
Sbjct: 96  EKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKK 149



 Score = 29.3 bits (66), Expect = 2.0
 Identities = 13/53 (24%), Positives = 29/53 (54%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
            E+ ++E++       ++E  + +K+KK+KK+KK  +K   ++  K  +   K
Sbjct: 92  PEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTK 144


>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
          Length = 1465

 Score = 37.1 bits (86), Expect = 0.022
 Identities = 14/57 (24%), Positives = 28/57 (49%)

Query: 1    MNEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
            ++++  + +E +EK     +RG+    ++  ++  KK   KK  K+  E E  EE  
Sbjct: 1168 LDKEDAKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETTEETY 1224



 Score = 31.0 bits (70), Expect = 1.8
 Identities = 18/58 (31%), Positives = 24/58 (41%)

Query: 2    NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
              K   E E  E+  G  +   E   E  K K +   KKK      E+EE++E  D K
Sbjct: 1210 TTKKASESETTEETYGSSAMETENVAEVVKPKGRAGAKKKAPAAAKEKEEEDEILDLK 1267


>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19. 
           Med19 represents a family of conserved proteins which
           are members of the multi-protein co-activator Mediator
           complex. Mediator is required for activation of RNA
           polymerase II transcription by DNA binding
           transactivators.
          Length = 178

 Score = 35.6 bits (82), Expect = 0.023
 Identities = 19/42 (45%), Positives = 29/42 (69%)

Query: 11  DQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEK 52
           D E   G   + K+++ E++K++KKKKK+KKKKKKR+  E  
Sbjct: 133 DSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKKRHSPEHP 174



 Score = 33.3 bits (76), Expect = 0.14
 Identities = 13/54 (24%), Positives = 29/54 (53%)

Query: 3   EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
            K  + +          S  +  +  E+K KKKK +  K++KK+ +E++K++++
Sbjct: 115 HKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKKR 168


>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 156 to 174 amino acids in length. This domain is
           found associated with pfam07780, pfam01728.
          Length = 154

 Score = 35.3 bits (82), Expect = 0.024
 Identities = 17/55 (30%), Positives = 32/55 (58%)

Query: 3   EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
               +EKE++E+E   +    EEE  +E  +K+  K K++K++  E ++KE  K+
Sbjct: 95  GLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKEILKE 149



 Score = 30.7 bits (70), Expect = 0.89
 Identities = 13/50 (26%), Positives = 29/50 (58%), Gaps = 7/50 (14%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
           E +E  E+E       + +E  E++  K K++K+++ +++ +E  KE+ K
Sbjct: 110 EVEELDEEE-------QIDELLEKELAKLKREKRRENERKQKEILKEQMK 152



 Score = 29.2 bits (66), Expect = 2.7
 Identities = 10/36 (27%), Positives = 22/36 (61%)

Query: 29  EEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLHNC 64
           +     KK+K+++++++   EE  EEE+ ++LL   
Sbjct: 92  KLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKE 127


>gnl|CDD|216652 pfam01698, FLO_LFY, Floricaula / Leafy protein.  This family
           consists of various plant development proteins which are
           homologues of floricaula (FLO) and Leafy (LFY) proteins
           which are floral meristem identity proteins. Mutations
           in the sequences of these proteins affect flower and
           leaf development.
          Length = 382

 Score = 36.5 bits (85), Expect = 0.025
 Identities = 19/63 (30%), Positives = 33/63 (52%), Gaps = 11/63 (17%)

Query: 7   EEKEDQEKEVGGISRGKEEE-----------TEEEKKKKKKKKKKKKKKKRYEEEEKEEE 55
           EE   QEKE  G   G                 E+KK++KK+++K+ K+ R ++++ E+E
Sbjct: 149 EEPVQQEKEAAGSGGGGGGTWGLVGVPGHSSDSEKKKQRKKQRRKRSKELREDDDDDEDE 208

Query: 56  KDN 58
            D+
Sbjct: 209 DDD 211


>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
          Length = 651

 Score = 36.5 bits (85), Expect = 0.026
 Identities = 14/35 (40%), Positives = 23/35 (65%)

Query: 29  EEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLHN 63
           EEK+  K++K+ +K KK+ E+++KE EK  K    
Sbjct: 558 EEKEALKEQKRLRKLKKQEEKKKKELEKLEKAKIP 592



 Score = 34.6 bits (80), Expect = 0.12
 Identities = 11/34 (32%), Positives = 26/34 (76%)

Query: 23  KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
           +E+E +E  K++K+ +K KK++++ ++E ++ EK
Sbjct: 555 REKEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588



 Score = 34.2 bits (79), Expect = 0.13
 Identities = 13/33 (39%), Positives = 24/33 (72%)

Query: 29  EEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
           EE +++K++K+  K++KR  + +K+EEK  K L
Sbjct: 551 EELQREKEEKEALKEQKRLRKLKKQEEKKKKEL 583



 Score = 33.1 bits (76), Expect = 0.37
 Identities = 15/48 (31%), Positives = 32/48 (66%), Gaps = 7/48 (14%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEE 54
           ++KE+ ++E       KEE+   +++K+ +K KK+++KK+ E E+ E+
Sbjct: 548 DDKEELQRE-------KEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588



 Score = 33.1 bits (76), Expect = 0.37
 Identities = 15/51 (29%), Positives = 31/51 (60%)

Query: 10  EDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
           ED+           +EE + EK++K+  K++K+ +K  ++EEK++++  KL
Sbjct: 536 EDKPDGPSVWKLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKL 586


>gnl|CDD|203848 pfam08079, Ribosomal_L30_N, Ribosomal L30 N-terminal domain.
          This presumed domain is found at the N-terminus of
          Ribosomal L30 proteins and has been termed RL30NT or
          NUC018.
          Length = 71

 Score = 33.4 bits (77), Expect = 0.027
 Identities = 14/46 (30%), Positives = 23/46 (50%), Gaps = 9/46 (19%)

Query: 21 RGKEEETEEEKKKKKKKKKKKKKKKR---------YEEEEKEEEKD 57
          R + E+   ++ KK+  KK  +KKKR         Y +E +  E+D
Sbjct: 6  RKRNEKLRAKRAKKRAAKKAARKKKRKLIFKRAEKYVKEYRAAERD 51


>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.
          This domain is found to the N-terminus of bacterial
          signal peptidases of the S49 family (pfam01343).
          Length = 154

 Score = 34.8 bits (81), Expect = 0.030
 Identities = 17/55 (30%), Positives = 29/55 (52%), Gaps = 3/55 (5%)

Query: 6  GEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
           EE +D ++ +      K+E    EK +KK +K K K +K   ++ K+EE   +L
Sbjct: 48 NEEYKDLKESLEAALLDKKELKAWEKAEKKAEKAKAKAEK---KKAKKEEPKPRL 99



 Score = 29.0 bits (66), Expect = 3.0
 Identities = 9/25 (36%), Positives = 16/25 (64%)

Query: 23  KEEETEEEKKKKKKKKKKKKKKKRY 47
           K+ E  + K +KKK KK++ K + +
Sbjct: 76  KKAEKAKAKAEKKKAKKEEPKPRLF 100


>gnl|CDD|200340 TIGR03927, T7SS_EssA_Firm, type VII secretion protein EssA.
          Members of this family are associated with type VII
          secretion of WXG100 family targets in the Firmicutes,
          but not in the Actinobacteria. This highly divergent
          protein family consists largely of a central region of
          highly polar low-complexity sequence containing
          occasional LF motifs in weak repeats about 17 residues
          in length, flanked by hydrophobic N- and C-terminal
          regions [Protein fate, Protein and peptide secretion
          and trafficking].
          Length = 150

 Score = 34.7 bits (80), Expect = 0.031
 Identities = 16/55 (29%), Positives = 32/55 (58%), Gaps = 4/55 (7%)

Query: 7  EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
           EK+D E      +   +EETE +K+    +++KK   ++++E+ ++EE  N+L 
Sbjct: 31 YEKKDIEIN----TDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEELKNQLF 81


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
           family consists of several hypothetical bacterial
           proteins of around 200 residues in length. The function
           of this family is unknown.
          Length = 214

 Score = 35.1 bits (81), Expect = 0.042
 Identities = 11/50 (22%), Positives = 31/50 (62%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
           EE +++EKE       +++   E++ ++ +++ +++ ++  +E EKE E+
Sbjct: 63  EEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEE 112



 Score = 35.1 bits (81), Expect = 0.047
 Identities = 18/53 (33%), Positives = 26/53 (49%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
           E K+  ++E   I   KEEE E    + K+ K   +K+    EEE EEE +  
Sbjct: 50  EAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEES 102



 Score = 33.6 bits (77), Expect = 0.15
 Identities = 17/58 (29%), Positives = 31/58 (53%), Gaps = 3/58 (5%)

Query: 2   NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
            E+  E    ++KE  G +  ++EE+EEE +++ ++   + +K   E EEK E    K
Sbjct: 66  KEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEK---ETEEKTESNVEK 120



 Score = 32.4 bits (74), Expect = 0.35
 Identities = 14/61 (22%), Positives = 30/61 (49%), Gaps = 2/61 (3%)

Query: 1  MNEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKK--KKKKKKKKKRYEEEEKEEEKDN 58
                ++    E+E       +  E EE K+++K+    + K+ K   E+E++E E++N
Sbjct: 36 FPSSPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEEN 95

Query: 59 K 59
          +
Sbjct: 96 E 96



 Score = 28.5 bits (64), Expect = 6.3
 Identities = 18/57 (31%), Positives = 28/57 (49%), Gaps = 4/57 (7%)

Query: 2   NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDN 58
           N +  E+K D EKE        EEE EEE ++   + +K+ ++K     EKE    +
Sbjct: 74  NSEDKEDKGDAEKE----DEESEEENEEEDEESSDENEKETEEKTESNVEKEITNPS 126


>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
          Length = 742

 Score = 36.0 bits (83), Expect = 0.045
 Identities = 13/43 (30%), Positives = 19/43 (44%)

Query: 15  EVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
               + R  + +TE+ KKK    K   KKKK+    + E  K 
Sbjct: 110 LAISLMRKPKPKTEKLKKKITVNKSTNKKKKKVLSSKDELIKY 152


>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
           complex aNOP56 subunit; Provisional.
          Length = 414

 Score = 35.3 bits (82), Expect = 0.053
 Identities = 20/49 (40%), Positives = 28/49 (57%), Gaps = 7/49 (14%)

Query: 5   GGEEKEDQEKEVGGISR-------GKEEETEEEKKKKKKKKKKKKKKKR 46
           G E KE+  K +  I          K EE + +K+KKKKK+KKK KK++
Sbjct: 361 GDELKEELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRK 409



 Score = 32.6 bits (75), Expect = 0.38
 Identities = 11/24 (45%), Positives = 20/24 (83%)

Query: 23  KEEETEEEKKKKKKKKKKKKKKKR 46
           K ++ +++KK+KKK KK+KKK ++
Sbjct: 391 KPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 29.2 bits (66), Expect = 4.5
 Identities = 13/41 (31%), Positives = 26/41 (63%), Gaps = 5/41 (12%)

Query: 24  EEETEEEKKK-----KKKKKKKKKKKKRYEEEEKEEEKDNK 59
            +  EE K+K     KKK+++KK +K++ +++ K++ K  K
Sbjct: 369 NKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRK 409


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 35.5 bits (82), Expect = 0.054
 Identities = 16/55 (29%), Positives = 34/55 (61%), Gaps = 3/55 (5%)

Query: 3   EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
           +K GE K++ +      ++ KEE+ ++E+ +KK+K++  + K   E+ +K+E K 
Sbjct: 383 KKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDK---EKRQKDERKK 434



 Score = 35.2 bits (81), Expect = 0.073
 Identities = 13/52 (25%), Positives = 33/52 (63%)

Query: 8   EKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
           +K  + K+    S   E + +EEK K+++ +KK+K++   ++E++++++  K
Sbjct: 383 KKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434



 Score = 34.0 bits (78), Expect = 0.16
 Identities = 15/56 (26%), Positives = 32/56 (57%), Gaps = 3/56 (5%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKK---KKKKKKRYEEEEKEEEKDNK 59
           ++ +D  K++G +    +   E E K K++K K    +KK+K   +E+KE+ + ++
Sbjct: 376 KKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDE 431



 Score = 30.1 bits (68), Expect = 2.4
 Identities = 10/52 (19%), Positives = 24/52 (46%), Gaps = 7/52 (13%)

Query: 6   GEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
           G+++++  KE         ++ ++  KK  + K +    +  E + KEE+  
Sbjct: 364 GDKRQELLKEY-------NKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLK 408


>gnl|CDD|218771 pfam05835, Synaphin, Synaphin protein.  This family consists of
          several eukaryotic synaphin 1 and 2 proteins.
          Synaphin/complexin is a cytosolic protein that
          preferentially binds to syntaxin within the SNARE
          complex. Synaphin promotes SNAREs to form precomplexes
          that oligomerise into higher order structures. A
          peptide from the central, syntaxin binding domain of
          synaphin competitively inhibits these two proteins from
          interacting and prevents SNARE complexes from
          oligomerising. It is thought that oligomerisation of
          SNARE complexes into a higher order structure creates a
          SNARE scaffold for efficient, regulated fusion of
          synaptic vesicles. Synaphin promotes neuronal
          exocytosis by promoting interaction between the
          complementary syntaxin and synaptobrevin transmembrane
          regions that reside in opposing membranes prior to
          fusion.
          Length = 139

 Score = 34.1 bits (78), Expect = 0.055
 Identities = 11/47 (23%), Positives = 22/47 (46%)

Query: 6  GEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEK 52
                +E E       +E+E  +E  ++ ++++K K +K  EE E 
Sbjct: 17 KGMLGGKEDEGDESDAEEEDEEIQEALREAEEERKAKHRKMEEEREV 63



 Score = 32.9 bits (75), Expect = 0.12
 Identities = 17/52 (32%), Positives = 27/52 (51%), Gaps = 7/52 (13%)

Query: 3  EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEE 54
            GG+E E  E          EEE EE ++  ++ ++++K K R  EEE+E 
Sbjct: 19 MLGGKEDEGDES-------DAEEEDEEIQEALREAEEERKAKHRKMEEEREV 63


>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex.  This
          entry is characterized by proteins with alternating
          conserved and low-complexity regions. Bud13 together
          with Snu17p and a newly identified factor,
          Pml1p/Ylr016c, form a novel trimeric complex. called
          The RES complex, pre-mRNA retention and splicing
          complex. Subunits of this complex are not essential for
          viability of yeasts but they are required for efficient
          splicing in vitro and in vivo. Furthermore,
          inactivation of this complex causes pre-mRNA leakage
          from the nucleus. Bud13 contains a unique,
          phylogenetically conserved C-terminal region of unknown
          function.
          Length = 141

 Score = 33.8 bits (78), Expect = 0.057
 Identities = 19/54 (35%), Positives = 35/54 (64%), Gaps = 8/54 (14%)

Query: 7  EEKEDQEKEVGGISRGKEEETEEEKKKKKKK--KKKKKKKKRYEEEEKEEEKDN 58
          E++E++E+E       +E+E +EEK+K+  K   +K++++KR EE EK + K  
Sbjct: 16 EKREEKEREK------EEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPL 63



 Score = 33.8 bits (78), Expect = 0.058
 Identities = 15/47 (31%), Positives = 29/47 (61%), Gaps = 7/47 (14%)

Query: 21 RGKEEETEEEKKKKKKKKKKKKKKKRY-------EEEEKEEEKDNKL 60
             EE+ EE++++K++K++K++K+K +       EE EK  E+  K 
Sbjct: 12 IDIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKA 58



 Score = 29.2 bits (66), Expect = 2.1
 Identities = 19/59 (32%), Positives = 32/59 (54%), Gaps = 3/59 (5%)

Query: 3  EKGGEEKEDQEKEVGG---ISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDN 58
          ++  E KE++EKE G        +E+  EE +K K K   +    + Y+EE KE+E+ +
Sbjct: 25 KEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPLARYADDEDYDEELKEQERWD 83


>gnl|CDD|233605 TIGR01865, cas_Csn1, CRISPR-associated protein Cas9/Csn1, subtype
           II/NMEMI.  CRISPR loci appear to be mobile elements with
           a wide host range. This model represents a protein found
           only in CRISPR-containing species, near other
           CRISPR-associated proteins (cas), as part of the NMENI
           subtype of CRISPR/Cas locus. The species range so far
           for this protein is animal pathogens and commensals only
           [Mobile and extrachromosomal element functions, Other].
          Length = 805

 Score = 35.5 bits (82), Expect = 0.057
 Identities = 18/65 (27%), Positives = 27/65 (41%), Gaps = 7/65 (10%)

Query: 19  ISRGKEEETEEEKKKKKKKKKKKKKKKRYE-------EEEKEEEKDNKLLHNCYLYSYYR 71
           I   +EE+     K+  K++ KK + K  E       E  KEE  +N   +   L  YY+
Sbjct: 520 IEMAREEQGTNFGKRNSKERYKKNEDKIKEFASALGKEILKEEPTENSSKNILKLRLYYQ 579

Query: 72  VTHIC 76
               C
Sbjct: 580 QNGKC 584


>gnl|CDD|177433 PHA02608, 67, prohead core protein; Provisional.
          Length = 80

 Score = 32.5 bits (74), Expect = 0.058
 Identities = 6/34 (17%), Positives = 15/34 (44%)

Query: 24 EEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
          E E  E+    +       K  + ++++ ++E D
Sbjct: 46 EGEEPEDDDDDEDDDDDDDKDDKDDDDDDDDEDD 79



 Score = 28.6 bits (64), Expect = 1.7
 Identities = 5/32 (15%), Positives = 15/32 (46%)

Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEE 54
          + E+ ++++       K  K     +++E +E
Sbjct: 49 EPEDDDDDEDDDDDDDKDDKDDDDDDDDEDDE 80



 Score = 27.4 bits (61), Expect = 3.7
 Identities = 10/46 (21%), Positives = 16/46 (34%), Gaps = 12/46 (26%)

Query: 6  GEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEE 51
          GEE ED + +            E++     K  K        E++E
Sbjct: 47 GEEPEDDDDD------------EDDDDDDDKDDKDDDDDDDDEDDE 80


>gnl|CDD|191136 pfam04949, Transcrip_act, Transcriptional activator.  This family
          of proteins may act as a transcriptional activator. It
          plays a role in stress response in plants.
          Length = 160

 Score = 34.1 bits (78), Expect = 0.061
 Identities = 13/52 (25%), Positives = 27/52 (51%)

Query: 1  MNEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEK 52
                 +++D+E     +S  K +E E E++K + +++ + +  R EEE K
Sbjct: 15 FGGSPMMDEKDEELSRSALSLFKAKEEEIERRKMEVRERVQAQLGRVEEETK 66


>gnl|CDD|215364 PLN02678, PLN02678, seryl-tRNA synthetase.
          Length = 448

 Score = 35.1 bits (81), Expect = 0.063
 Identities = 16/49 (32%), Positives = 25/49 (51%), Gaps = 2/49 (4%)

Query: 13  EKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
            KEV  +   KE+ TE   + K+ KK+  +K+   E +E +   D KL 
Sbjct: 60  NKEVAKLKIAKEDATELIAETKELKKEITEKEA--EVQEAKAALDAKLK 106


>gnl|CDD|219564 pfam07771, TSGP1, Tick salivary peptide group 1.  This contains a
           group of peptides derived from a salivary gland cDNA
           library of the tick Ixodes scapularis. Also present are
           peptides from a related tick species, Ixodes ricinus.
           They are characterized by a putative signal peptide
           indicative of secretion and conserved cysteine residues.
          Length = 120

 Score = 33.3 bits (76), Expect = 0.064
 Identities = 15/27 (55%), Positives = 19/27 (70%)

Query: 24  EEETEEEKKKKKKKKKKKKKKKRYEEE 50
            E TE+ KKKKKK KK KK KK  +++
Sbjct: 94  PEPTEKPKKKKKKSKKTKKPKKSSKKD 120



 Score = 32.2 bits (73), Expect = 0.16
 Identities = 13/27 (48%), Positives = 18/27 (66%)

Query: 25  EETEEEKKKKKKKKKKKKKKKRYEEEE 51
            E  E+ KKKKKK KK KK K+  +++
Sbjct: 94  PEPTEKPKKKKKKSKKTKKPKKSSKKD 120



 Score = 30.6 bits (69), Expect = 0.54
 Identities = 14/34 (41%), Positives = 20/34 (58%), Gaps = 3/34 (8%)

Query: 24  EEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
           ++   E  +K KKKKKK KK K+    +K  +KD
Sbjct: 90  DDHPPEPTEKPKKKKKKSKKTKK---PKKSSKKD 120


>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR).  This
           family consists of several bovine specific leukaemia
           virus receptors which are thought to function as
           transmembrane proteins, although their exact function is
           unknown.
          Length = 561

 Score = 35.0 bits (80), Expect = 0.073
 Identities = 16/34 (47%), Positives = 26/34 (76%)

Query: 28  EEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
           E++ KK KKK+KK+K+K+R ++++KE E    LL
Sbjct: 196 EKKSKKPKKKEKKEKEKERDKDKKKEVEGFKSLL 229



 Score = 33.1 bits (75), Expect = 0.34
 Identities = 14/35 (40%), Positives = 25/35 (71%)

Query: 23  KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
           K +    EKK KK KKK+KK+K++  +++K++E +
Sbjct: 189 KGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVE 223



 Score = 32.7 bits (74), Expect = 0.45
 Identities = 11/37 (29%), Positives = 25/37 (67%)

Query: 21  RGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
           R   +  E++K++KKK++K+K+ ++R+     E ++D
Sbjct: 88  RRHRQRLEKDKREKKKREKEKRGRRRHHSLGTESDED 124



 Score = 32.3 bits (73), Expect = 0.61
 Identities = 10/33 (30%), Positives = 24/33 (72%)

Query: 28  EEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
           EEE++ +++ +K K++KK+ E+E++   + + L
Sbjct: 85  EEERRHRQRLEKDKREKKKREKEKRGRRRHHSL 117



 Score = 32.0 bits (72), Expect = 0.71
 Identities = 12/32 (37%), Positives = 23/32 (71%)

Query: 26  ETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
           E + +K KKK+KK+K+K++ + +++E E  K 
Sbjct: 196 EKKSKKPKKKEKKEKEKERDKDKKKEVEGFKS 227



 Score = 30.8 bits (69), Expect = 1.4
 Identities = 13/34 (38%), Positives = 23/34 (67%)

Query: 23  KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
           K  E  +    +KK KK KKK+K+ +E+E++++K
Sbjct: 185 KSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDK 218



 Score = 30.8 bits (69), Expect = 1.6
 Identities = 12/37 (32%), Positives = 25/37 (67%)

Query: 24  EEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
           E+++++ KKK+KK+K+K++ K + +E E  +     L
Sbjct: 196 EKKSKKPKKKEKKEKEKERDKDKKKEVEGFKSLLLAL 232



 Score = 30.4 bits (68), Expect = 2.1
 Identities = 14/38 (36%), Positives = 26/38 (68%)

Query: 20  SRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
           S+  E+      +KK KK KKK+KK++ +E +K+++K+
Sbjct: 184 SKSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKE 221



 Score = 29.3 bits (65), Expect = 5.5
 Identities = 18/47 (38%), Positives = 24/47 (51%)

Query: 11  DQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
           D EK      R  E     EK      +KK KK K+ E++EKE+E+D
Sbjct: 169 DSEKLPVQKHRNAETSKSPEKGDVPAVEKKSKKPKKKEKKEKEKERD 215



 Score = 28.5 bits (63), Expect = 7.7
 Identities = 17/35 (48%), Positives = 23/35 (65%), Gaps = 3/35 (8%)

Query: 25  EETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
           E+ +    +KK KK KKK+KK   E+EKE +KD K
Sbjct: 188 EKGDVPAVEKKSKKPKKKEKK---EKEKERDKDKK 219


>gnl|CDD|227492 COG5163, NOP7, Protein required for biogenesis of the 60S ribosomal
           subunit [Translation, ribosomal structure and
           biogenesis].
          Length = 591

 Score = 35.1 bits (80), Expect = 0.076
 Identities = 17/68 (25%), Positives = 27/68 (39%), Gaps = 14/68 (20%)

Query: 8   EKEDQEKEVGGISRGKEEETEEEKK--------------KKKKKKKKKKKKKRYEEEEKE 53
           E    E   G      +EE + +K+              +  K   K K KKR  +EE+E
Sbjct: 481 EDVRYEHVAGEEDDDDDEELQAQKELELEAQGIKYSETSEADKDVNKSKNKKRKVDEEEE 540

Query: 54  EEKDNKLL 61
           E+K   ++
Sbjct: 541 EKKLKMIM 548



 Score = 33.1 bits (75), Expect = 0.34
 Identities = 19/50 (38%), Positives = 26/50 (52%), Gaps = 4/50 (8%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
           + +++ E E  GI   K  ET E  K   K K KK+K    EEEEK+ + 
Sbjct: 501 QAQKELELEAQGI---KYSETSEADKDVNKSKNKKRKVDE-EEEEKKLKM 546



 Score = 29.7 bits (66), Expect = 3.4
 Identities = 21/66 (31%), Positives = 30/66 (45%), Gaps = 8/66 (12%)

Query: 3   EKGG---EEKEDQEKEVGGISRGKEEETEEEKKKKKKK-----KKKKKKKKRYEEEEKEE 54
           E  G    E  + +K+V      K +  EEE++KK K      K+KK  KK      K+E
Sbjct: 509 EAQGIKYSETSEADKDVNKSKNKKRKVDEEEEEKKLKMIMMSNKQKKLYKKMKYSNAKKE 568

Query: 55  EKDNKL 60
           E+   L
Sbjct: 569 EQAENL 574



 Score = 28.5 bits (63), Expect = 9.9
 Identities = 17/59 (28%), Positives = 32/59 (54%), Gaps = 8/59 (13%)

Query: 2   NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKK--KKKRYEEEEKEEEKDN 58
            +K   + ++++++V       EEE E++ K      K+KK  KK +Y   +KEE+ +N
Sbjct: 521 ADKDVNKSKNKKRKV------DEEEEEKKLKMIMMSNKQKKLYKKMKYSNAKKEEQAEN 573


>gnl|CDD|240576 cd12932, RRP7_like, RRP7 domain ribosomal RNA-processing protein
          7 (Rrp7p), ribosomal RNA-processing protein 7 homolog A
          (Rrp7A), and similar proteins.  This CD corresponds to
          the RRP7 domain of Rrp7p and Rrp7A. Rrp7p is encoded by
          YCL031C gene from Saccharomyces cerevisiae. It is an
          essential yeast protein involved in pre-rRNA processing
          and ribosome assembly, and is speculated to be required
          for correct assembly of rpS27 into the pre-ribosomal
          particle. Rrp7A, also termed gastric cancer antigen
          Zg14, is the Rrp7p homolog mainly found in Metazoans.
          The cellular function of Rrp7A remains unclear
          currently. Both Rrp7p and Rrp7A harbor an N-terminal
          RNA recognition motif (RRM), also termed RBD (RNA
          binding domain) or RNP (ribonucleoprotein domain), and
          a C-terminal RRP7 domain.
          Length = 118

 Score = 33.0 bits (76), Expect = 0.079
 Identities = 14/29 (48%), Positives = 18/29 (62%)

Query: 17 GGISRGKEEETEEEKKKKKKKKKKKKKKK 45
          G   +   EE  E K K+K+KKKKKKK+ 
Sbjct: 60 GRKGKTAREEAVEAKAKEKEKKKKKKKEL 88



 Score = 32.2 bits (74), Expect = 0.13
 Identities = 14/30 (46%), Positives = 18/30 (60%)

Query: 16 VGGISRGKEEETEEEKKKKKKKKKKKKKKK 45
            G  +GK    E  + K K+K+KKKKKKK
Sbjct: 57 TRGGRKGKTAREEAVEAKAKEKEKKKKKKK 86



 Score = 31.8 bits (73), Expect = 0.20
 Identities = 11/27 (40%), Positives = 18/27 (66%)

Query: 18 GISRGKEEETEEEKKKKKKKKKKKKKK 44
          G +  +E    + K+K+KKKKKKK+ +
Sbjct: 63 GKTAREEAVEAKAKEKEKKKKKKKELE 89



 Score = 30.7 bits (70), Expect = 0.56
 Identities = 11/27 (40%), Positives = 19/27 (70%)

Query: 17 GGISRGKEEETEEEKKKKKKKKKKKKK 43
          G  +R +  E + ++K+KKKKKKK+ +
Sbjct: 63 GKTAREEAVEAKAKEKEKKKKKKKELE 89



 Score = 29.9 bits (68), Expect = 1.0
 Identities = 16/52 (30%), Positives = 26/52 (50%), Gaps = 15/52 (28%)

Query: 16  VGGISRGKEEETEEEKKKKKKKKKK---------KKKK------KRYEEEEK 52
                  + +  E+EKKKKKKK+ +         KKK+      K++EE++K
Sbjct: 65  TAREEAVEAKAKEKEKKKKKKKELEDFYRFQIREKKKEELAELRKKFEEDKK 116



 Score = 28.4 bits (64), Expect = 3.3
 Identities = 14/29 (48%), Positives = 17/29 (58%)

Query: 17 GGISRGKEEETEEEKKKKKKKKKKKKKKK 45
          GG       E   E K K+K+KKKKKKK+
Sbjct: 59 GGRKGKTAREEAVEAKAKEKEKKKKKKKE 87



 Score = 27.2 bits (61), Expect = 8.2
 Identities = 17/44 (38%), Positives = 22/44 (50%)

Query: 3  EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKR 46
          E   E  ED    V    R  +   EE  + K K+K+KKKKKK+
Sbjct: 43 EARNEPDEDGFVTVTRGGRKGKTAREEAVEAKAKEKEKKKKKKK 86


>gnl|CDD|220376 pfam09745, DUF2040, Coiled-coil domain-containing protein 55
           (DUF2040).  This entry is a conserved domain of
           approximately 130 residues of proteins conserved from
           fungi to humans. The proteins do contain a coiled-coil
           domain, but the function is unknown.
          Length = 128

 Score = 33.2 bits (76), Expect = 0.087
 Identities = 15/40 (37%), Positives = 24/40 (60%), Gaps = 6/40 (15%)

Query: 23  KEEETEEEKKKKKKK------KKKKKKKKRYEEEEKEEEK 56
           KE E E ++   K+K      KK+ ++ ++ EEEEKE E+
Sbjct: 75  KEREKEGDEFADKEKFVTSAYKKQLEENRKLEEEEKEREE 114



 Score = 29.7 bits (67), Expect = 1.4
 Identities = 15/48 (31%), Positives = 26/48 (54%), Gaps = 4/48 (8%)

Query: 2   NEKGGEEKEDQEKEVGGISRGKEEET----EEEKKKKKKKKKKKKKKK 45
            EK G+E  D+EK V    + + EE     EEEK++++ +++    K 
Sbjct: 77  REKEGDEFADKEKFVTSAYKKQLEENRKLEEEEKEREELEEENDVTKG 124


>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
           eukaryotic snRNP [Transcription].
          Length = 564

 Score = 34.7 bits (80), Expect = 0.095
 Identities = 14/42 (33%), Positives = 23/42 (54%), Gaps = 2/42 (4%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYE 48
            EK + +K +  I   +E   E   K KKKK+KKK+  +++ 
Sbjct: 403 LEKAEGKKAIEEIR--EELIEEGLLKSKKKKRKKKEWFEKFR 442


>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein. 
          Length = 198

 Score = 34.0 bits (78), Expect = 0.096
 Identities = 13/28 (46%), Positives = 18/28 (64%)

Query: 31 KKKKKKKKKKKKKKKRYEEEEKEEEKDN 58
          +K    KK KKK K++  EEEK+ E D+
Sbjct: 60 RKPATTKKSKKKDKEKLTEEEKKPESDD 87



 Score = 31.7 bits (72), Expect = 0.47
 Identities = 16/33 (48%), Positives = 19/33 (57%)

Query: 27 TEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
          T   K    KK KKK K+K  EEE+K E  D+K
Sbjct: 57 TTPRKPATTKKSKKKDKEKLTEEEKKPESDDDK 89



 Score = 29.7 bits (67), Expect = 2.2
 Identities = 9/27 (33%), Positives = 16/27 (59%)

Query: 31 KKKKKKKKKKKKKKKRYEEEEKEEEKD 57
            KK KKK K+K  +  ++ E +++K 
Sbjct: 64 TTKKSKKKDKEKLTEEEKKPESDDDKT 90



 Score = 29.0 bits (65), Expect = 4.3
 Identities = 12/36 (33%), Positives = 24/36 (66%)

Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDN 58
          ++  T ++ KKK K+K  +++KK   +++K EE +N
Sbjct: 60 RKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENEN 95


>gnl|CDD|220102 pfam09073, BUD22, BUD22.  BUD22 has been shown in yeast to be a
           nuclear protein involved in bud-site selection. It plays
           a role in positioning the proximal bud pole signal. More
           recently it has been shown to be involved in ribosome
           biogenesis.
          Length = 424

 Score = 34.4 bits (79), Expect = 0.10
 Identities = 15/38 (39%), Positives = 23/38 (60%)

Query: 20  SRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
             G E + ++ K KKK KK KKK+ K   +++ EEE +
Sbjct: 139 ILGIETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESE 176



 Score = 34.4 bits (79), Expect = 0.11
 Identities = 15/65 (23%), Positives = 24/65 (36%), Gaps = 9/65 (13%)

Query: 2   NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKK---------RYEEEEK 52
           NE   E + D+       SR   +  E     KK K+KK                 E+E 
Sbjct: 229 NEDTSESESDESDSEISESRSVSDSEESSPPSKKPKEKKTSSTFLPSLMGGYFSGSEDED 288

Query: 53  EEEKD 57
           ++++D
Sbjct: 289 DDDED 293



 Score = 33.7 bits (77), Expect = 0.22
 Identities = 14/44 (31%), Positives = 27/44 (61%)

Query: 14  KEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
           K +G  ++ K+ + +++ KK KKK+ K+   K  EEE + E++ 
Sbjct: 138 KILGIETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDES 181



 Score = 31.7 bits (72), Expect = 0.81
 Identities = 13/39 (33%), Positives = 20/39 (51%)

Query: 20  SRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDN 58
              K +  ++ KK KKK+ K+   K   EE E E+E  +
Sbjct: 145 KAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKS 183



 Score = 31.0 bits (70), Expect = 1.3
 Identities = 14/31 (45%), Positives = 19/31 (61%)

Query: 29  EEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
           E K KK K KKK KK K+ E +E  ++ D +
Sbjct: 143 ETKAKKGKAKKKTKKSKKKEAKESSDKDDEE 173



 Score = 31.0 bits (70), Expect = 1.3
 Identities = 12/32 (37%), Positives = 20/32 (62%)

Query: 28  EEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
           E + KK K KKK KK KK+  +E  +++ + +
Sbjct: 143 ETKAKKGKAKKKTKKSKKKEAKESSDKDDEEE 174



 Score = 30.2 bits (68), Expect = 2.3
 Identities = 16/53 (30%), Positives = 29/53 (54%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
           E   +   ++ GI    ++   ++K KK KKK+ K+   + +EEE E E ++K
Sbjct: 130 ESIMEGLDKILGIETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESK 182


>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
           envelope biogenesis, outer membrane].
          Length = 387

 Score = 34.5 bits (79), Expect = 0.10
 Identities = 15/53 (28%), Positives = 19/53 (35%)

Query: 4   KGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
           K   EK   E E    +  K E   EEK   +KKK   K K        +  +
Sbjct: 201 KAAAEKAKAEAEAKAKAEKKAEAAAEEKAAAEKKKAAAKAKADKAAAAAKAAE 253



 Score = 29.5 bits (66), Expect = 3.7
 Identities = 11/54 (20%), Positives = 25/54 (46%)

Query: 3   EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
           E+  +  E+ + +    +  K+ E E +   +K K + + K K  ++ E   E+
Sbjct: 174 EEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKAEAAAEE 227



 Score = 28.4 bits (63), Expect = 8.6
 Identities = 15/58 (25%), Positives = 38/58 (65%), Gaps = 3/58 (5%)

Query: 4   KGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKK--KKKKRYEEEEKEEEKDNK 59
           KG ++++ +E++V    +  ++  E+E+ K+ +K++ K  +++K+ EE EK+ + + K
Sbjct: 76  KGEQQRKKKEEQVAEELK-PKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQK 132


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 34.8 bits (81), Expect = 0.11
 Identities = 19/51 (37%), Positives = 32/51 (62%), Gaps = 2/51 (3%)

Query: 9   KEDQEKEVGGISRGKEEETEEEKKK--KKKKKKKKKKKKRYEEEEKEEEKD 57
           KE ++ + GG +  K  E  E +K+  K  +KK+KKKKK+ E++E+ +  D
Sbjct: 591 KELRQLQKGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEELKVGD 641



 Score = 29.8 bits (68), Expect = 3.1
 Identities = 15/46 (32%), Positives = 28/46 (60%), Gaps = 5/46 (10%)

Query: 10  EDQEKEVGGISRGKEEETEEEKKKKKKKKKK-KKKKKRYEEEEKEE 54
           E+ E+E+      K EE E   K+ +K K++ ++KK++ +EEE + 
Sbjct: 526 EELEREL----EQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKL 567



 Score = 29.4 bits (67), Expect = 4.7
 Identities = 22/63 (34%), Positives = 28/63 (44%), Gaps = 15/63 (23%)

Query: 25  EETEEEKKKKKKKKKKKKKK------------KRYEEEEKEEEKDNKLLHNCYLYSYYRV 72
           E+ ++ KKKKKKK K  K K             RYEE     E+ +K L +  L  Y  V
Sbjct: 680 EKIQKPKKKKKKKPKTVKPKPRTVSLELDLRGMRYEE---ALERLDKYLDDALLAGYGEV 736

Query: 73  THI 75
             I
Sbjct: 737 LII 739


>gnl|CDD|221857 pfam12923, RRP7, Ribosomal RNA-processing protein 7 (RRP7).  RRP7
          is an essential protein in yeast that is involved in
          pre-rRNA processing and ribosome assembly. It is
          speculated to be required for correct assembly of rpS27
          into the pre-ribosomal particle.
          Length = 131

 Score = 33.0 bits (76), Expect = 0.11
 Identities = 17/49 (34%), Positives = 22/49 (44%), Gaps = 8/49 (16%)

Query: 3  EKGGEEKED--------QEKEVGGISRGKEEETEEEKKKKKKKKKKKKK 43
          +   E  ED          K   G SR K  E   + K+KKKKKKK+ +
Sbjct: 42 KARSEPDEDGWTTVTRGGRKRKAGASRNKAAEERRKLKEKKKKKKKELE 90


>gnl|CDD|215521 PLN02967, PLN02967, kinase.
          Length = 581

 Score = 34.6 bits (79), Expect = 0.11
 Identities = 15/55 (27%), Positives = 28/55 (50%), Gaps = 4/55 (7%)

Query: 8   EKEDQEKEVGGISRGK----EEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDN 58
             + + K+    +R K      + EEEK +KK +K++K KK   + E++  E + 
Sbjct: 105 ALDKESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKMDEDVEDQGSESEV 159



 Score = 28.9 bits (64), Expect = 7.2
 Identities = 11/53 (20%), Positives = 25/53 (47%)

Query: 4   KGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
           K   E  +  +E   +   ++   ++E KK  ++ ++K      + EE++ EK
Sbjct: 83  KTVVEISEPLEEGSELVVNEDAALDKESKKTPRRTRRKAAAASSDVEEEKTEK 135


>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2.  It is vital for
           effective cell-replication that replication is not
           stalled at any point by, for instance, damaged bases.
           Rtf2 stabilizes the replication fork stalled at the
           site-specific replication barrier RTS1 by preventing
           replication restart until completion of DNA synthesis by
           a converging replication fork initiated at a flanking
           origin. The RTS1 element terminates replication forks
           that are moving in the cen2-distal direction while
           allowing forks moving in the cen2-proximal direction to
           pass through the region. Rtf2 contains a C2HC2 motif
           related to the C3HC4 RING-finger motif, and would appear
           to fold up, creating a RING finger-like structure but
           forming only one functional Zn2+ ion-binding site.
          Length = 254

 Score = 33.9 bits (78), Expect = 0.11
 Identities = 16/29 (55%), Positives = 17/29 (58%)

Query: 28  EEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
           EEE+ KKKKKKKKKK KK        E  
Sbjct: 177 EEERAKKKKKKKKKKTKKNNATGSSAEAT 205



 Score = 33.1 bits (76), Expect = 0.25
 Identities = 14/32 (43%), Positives = 15/32 (46%)

Query: 25  EETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
           EE   +KKKKKKKKK KK        E     
Sbjct: 177 EEERAKKKKKKKKKKTKKNNATGSSAEATVSS 208



 Score = 31.9 bits (73), Expect = 0.53
 Identities = 16/37 (43%), Positives = 17/37 (45%)

Query: 23  KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
           K    EE  KKKKKKKKKK KK        E    + 
Sbjct: 173 KARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSA 209


>gnl|CDD|218538 pfam05285, SDA1, SDA1.  This family consists of several SDA1
           protein homologues. SDA1 is a Saccharomyces cerevisiae
           protein which is involved in the control of the actin
           cytoskeleton. The protein is essential for cell
           viability and is localised in the nucleus.
          Length = 317

 Score = 34.2 bits (79), Expect = 0.12
 Identities = 15/52 (28%), Positives = 27/52 (51%)

Query: 6   GEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
           GE  + +  +    S  ++EE ++E  KK K+   ++  +  EEE  EEE+ 
Sbjct: 116 GEWIDVESDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEA 167



 Score = 32.3 bits (74), Expect = 0.47
 Identities = 15/60 (25%), Positives = 29/60 (48%), Gaps = 2/60 (3%)

Query: 2   NEKGGEEKEDQEKEVGGISRGKEEET--EEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
             +  E+ +D+ + +   S  + E +  E+E++K +  KK K+       EE EEE   +
Sbjct: 105 EVEEDEDSDDEGEWIDVESDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEE 164



 Score = 29.6 bits (67), Expect = 3.4
 Identities = 15/57 (26%), Positives = 29/57 (50%)

Query: 3   EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
           ++G     + +KE+       EEE +E  KK K+   ++  ++  EE  +EEE + +
Sbjct: 114 DEGEWIDVESDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAE 170



 Score = 28.9 bits (65), Expect = 6.1
 Identities = 10/38 (26%), Positives = 23/38 (60%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKK 44
           EEK++  K+    S  +  E +EE+  ++++ + +K+K
Sbjct: 136 EEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEK 173



 Score = 28.5 bits (64), Expect = 6.8
 Identities = 9/42 (21%), Positives = 23/42 (54%)

Query: 8   EKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEE 49
           E+E  E          EE +EE++++  ++++ + +K++  E
Sbjct: 135 EEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEKASE 176


>gnl|CDD|234707 PRK00270, rpsU, 30S ribosomal protein S21; Reviewed.
          Length = 64

 Score = 31.0 bits (71), Expect = 0.12
 Identities = 10/26 (38%), Positives = 17/26 (65%)

Query: 26 ETEEEKKKKKKKKKKKKKKKRYEEEE 51
          E   EK+K+KK   +K+++K+   EE
Sbjct: 39 EKPSEKRKRKKAAARKRRRKKLAREE 64


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 34.0 bits (78), Expect = 0.13
 Identities = 14/54 (25%), Positives = 29/54 (53%), Gaps = 9/54 (16%)

Query: 7   EEKEDQEKEVGGISRGKEE----ETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
           ++ E+ EK+     R  E+    E E+    +K  K+ ++  K+ EE++K+ E+
Sbjct: 75  QQAEEAEKQ-----RAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEE 123



 Score = 33.3 bits (76), Expect = 0.21
 Identities = 14/57 (24%), Positives = 31/57 (54%), Gaps = 7/57 (12%)

Query: 3   EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
           E+  ++ E+++K+        EE   ++  + K K + + +KK  EE +K+ E++ K
Sbjct: 108 EQAAKQAEEKQKQA-------EEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAK 157



 Score = 32.9 bits (75), Expect = 0.29
 Identities = 18/63 (28%), Positives = 24/63 (38%), Gaps = 15/63 (23%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKK----------KKKKKKKKKKKRYEEEEKEEEK 56
           E K  Q  E        + E E EKK K           K K   + KK+  E +K+ E 
Sbjct: 123 EAKAKQAAE-----AKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEA 177

Query: 57  DNK 59
           + K
Sbjct: 178 EAK 180



 Score = 31.7 bits (72), Expect = 0.66
 Identities = 9/55 (16%), Positives = 25/55 (45%), Gaps = 2/55 (3%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEE--EKDNK 59
           E ++ +  E       ++    E+  K+ ++  K+ ++K+ + EE +     + K
Sbjct: 79  EAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAK 133



 Score = 31.7 bits (72), Expect = 0.74
 Identities = 8/37 (21%), Positives = 23/37 (62%)

Query: 23  KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
           + EE E+++  ++ ++K+ +++   E+  K+ E+  K
Sbjct: 76  QAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAK 112



 Score = 31.3 bits (71), Expect = 0.88
 Identities = 15/53 (28%), Positives = 24/53 (45%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
            E E ++K      +  EEE + +   + KKK  + KKK   E + + E   K
Sbjct: 136 AEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAK 188



 Score = 31.0 bits (70), Expect = 1.4
 Identities = 14/53 (26%), Positives = 26/53 (49%)

Query: 3   EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEE 55
           E   + + + EK+    ++ + EE  + K   + KKK  + KK+ E E K + 
Sbjct: 131 EAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKA 183



 Score = 30.6 bits (69), Expect = 1.8
 Identities = 14/57 (24%), Positives = 30/57 (52%), Gaps = 2/57 (3%)

Query: 3   EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
           ++   EK  ++ E    ++  EE+ ++ ++ K K+  + K K   E E+K +E+  K
Sbjct: 96  QRAAAEKAAKQAE--QAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKK 150



 Score = 29.8 bits (67), Expect = 2.7
 Identities = 11/62 (17%), Positives = 29/62 (46%), Gaps = 3/62 (4%)

Query: 1   MNEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKK---KKKKKRYEEEEKEEEKD 57
             ++   E+  Q++     +  K  +  E+  K+ ++K+K   + K K+  E + + E +
Sbjct: 80  AEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAE 139

Query: 58  NK 59
            +
Sbjct: 140 AE 141



 Score = 29.4 bits (66), Expect = 4.3
 Identities = 10/52 (19%), Positives = 21/52 (40%)

Query: 3   EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEE 54
           E   +   + +K+     +  E E + + + K K K ++ K K    + K  
Sbjct: 155 EAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKAKAEAAKAKAA 206



 Score = 28.6 bits (64), Expect = 7.7
 Identities = 13/52 (25%), Positives = 22/52 (42%)

Query: 8   EKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
           E+E + K      +   E  ++ + + K K + K K K  E + K E    K
Sbjct: 153 EEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKAKAEAAKAK 204


>gnl|CDD|220838 pfam10659, Trypan_glycop_C, Trypanosome variant surface
          glycoprotein C-terminal domain.  The trypanosome
          parasite expresses these proteins to evade the immune
          response.
          Length = 98

 Score = 32.0 bits (73), Expect = 0.14
 Identities = 17/56 (30%), Positives = 22/56 (39%), Gaps = 4/56 (7%)

Query: 8  EKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLHN 63
          E + ++           E T E+ K KK KK  KK  K     E    KD+  L N
Sbjct: 31 EGKAKKNGAPVTQTAGTETTTEKCKGKKDKKDCKKGCKW----EGNTCKDSSFLVN 82



 Score = 29.3 bits (66), Expect = 1.2
 Identities = 10/37 (27%), Positives = 16/37 (43%)

Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
          K ++ E++ K K K+ K KK      +    E    K
Sbjct: 17 KWDKKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEK 53


>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
          Length = 1832

 Score = 34.3 bits (79), Expect = 0.14
 Identities = 18/35 (51%), Positives = 24/35 (68%)

Query: 22  GKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
           GK+ E +     ++K KKKKKK+K+ EEE K EEK
Sbjct: 730 GKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEK 764



 Score = 33.9 bits (78), Expect = 0.21
 Identities = 16/51 (31%), Positives = 24/51 (47%), Gaps = 4/51 (7%)

Query: 4    KGGEEKEDQEKEVGGIS----RGKEEETEEEKKKKKKKKKKKKKKKRYEEE 50
              GE + D+EK+    S    + K  E +  +   KK+K KK+ K   E E
Sbjct: 1532 GQGELESDKEKKGNLESVLSNQEKNIEEDYAESDIKKRKNKKQYKSNTEAE 1582



 Score = 32.0 bits (73), Expect = 0.85
 Identities = 14/30 (46%), Positives = 23/30 (76%)

Query: 25  EETEEEKKKKKKKKKKKKKKKRYEEEEKEE 54
           +  EE+ KKKKKK+KKK+++ + EE+ + E
Sbjct: 739 DSVEEKTKKKKKKEKKKEEEYKREEKARIE 768



 Score = 31.6 bits (72), Expect = 1.1
 Identities = 16/35 (45%), Positives = 26/35 (74%), Gaps = 2/35 (5%)

Query: 19  ISRGKEEETEEEKKKKKKKKK--KKKKKKRYEEEE 51
           IS   EE+T+++KKK+KKK++  K+++K R E  E
Sbjct: 737 ISDSVEEKTKKKKKKEKKKEEEYKREEKARIEIAE 771



 Score = 30.8 bits (70), Expect = 2.0
 Identities = 14/34 (41%), Positives = 22/34 (64%)

Query: 23  KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
             E    +  ++K KKKKKK+KK+ EE ++EE+ 
Sbjct: 732 DAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKA 765


>gnl|CDD|219913 pfam08576, DUF1764, Eukaryotic protein of unknown function
          (DUF1764).  This is a family of eukaryotic proteins of
          unknown function. This family contains many
          hypothetical proteins.
          Length = 98

 Score = 32.1 bits (73), Expect = 0.14
 Identities = 17/63 (26%), Positives = 27/63 (42%), Gaps = 5/63 (7%)

Query: 2  NEKGGEEKEDQEK-----EVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
           E+   EK D+             + K++ T +  + K  KK +KK KK+ E  E  EE 
Sbjct: 1  KEEKKNEKTDKRDIDDIFSNIKKRKKKKKRTAKTARPKATKKGQKKDKKKDEFPEFPEES 60

Query: 57 DNK 59
            +
Sbjct: 61 KRR 63


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 34.2 bits (78), Expect = 0.14
 Identities = 21/59 (35%), Positives = 41/59 (69%), Gaps = 5/59 (8%)

Query: 3   EKGGEEKEDQEK-EVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
           E  GE K+D++  + GG S G + E EEE+++++++++++++    EEEE+EEE +  L
Sbjct: 840 ENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEE----EEEEEEEENEEPL 894


>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
          Length = 211

 Score = 33.6 bits (77), Expect = 0.15
 Identities = 16/63 (25%), Positives = 35/63 (55%)

Query: 1  MNEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
          M E   +E E++E++ G     + EE E+E+  +  ++  + K +  ++E  + +++NK 
Sbjct: 13 MEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDENNKLKEENKK 72

Query: 61 LHN 63
          L N
Sbjct: 73 LEN 75



 Score = 32.0 bits (73), Expect = 0.42
 Identities = 13/59 (22%), Positives = 28/59 (47%)

Query: 3  EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
           K    +ED  KE       K +E + E ++ +K++  +  ++  E + +E + +N  L
Sbjct: 8  AKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDENNKL 66



 Score = 30.1 bits (68), Expect = 1.9
 Identities = 22/71 (30%), Positives = 32/71 (45%), Gaps = 3/71 (4%)

Query: 1  MNEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
            ++   E E+ EKE       +  E + E+ K +  K K++ KK   E E E  KD +L
Sbjct: 28 KGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDENNKLKEENKKL--ENELEALKD-RL 84

Query: 61 LHNCYLYSYYR 71
          L     Y  YR
Sbjct: 85 LRTVAEYDNYR 95


>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin.  Trichoplein
           or mitostatin, was first defined as a meiosis-specific
           nuclear structural protein. It has since been linked
           with mitochondrial movement. It is associated with the
           mitochondrial outer membrane, and over-expression leads
           to reduction in mitochondrial motility whereas lack of
           it enhances mitochondrial movement. The activity appears
           to be mediated through binding the mitochondria to the
           actin intermediate filaments (IFs).
          Length = 349

 Score = 33.7 bits (78), Expect = 0.16
 Identities = 18/68 (26%), Positives = 36/68 (52%), Gaps = 13/68 (19%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRY-------------EEEEKE 53
           EE E++ +E   +    E   EE++ + ++K++K+KK +               EE+E+E
Sbjct: 89  EEYEERLQEREQMDEIIERIQEEDEAEAQEKREKQKKLREEIDEFNEERIERKEEEKERE 148

Query: 54  EEKDNKLL 61
            E++ K+L
Sbjct: 149 REEELKIL 156



 Score = 31.8 bits (73), Expect = 0.63
 Identities = 12/35 (34%), Positives = 28/35 (80%)

Query: 23  KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
           +EEE E E++++K++K+++  + R ++EE E+E++
Sbjct: 165 REEEREAERRERKEEKEREVARLRAQQEEAEDERE 199



 Score = 31.0 bits (71), Expect = 1.1
 Identities = 14/37 (37%), Positives = 25/37 (67%), Gaps = 2/37 (5%)

Query: 23  KEEETEEEKKKKKKKKKKKKKKKRYEEE--EKEEEKD 57
           KE E EEE K  + +++K ++++  E E  E++EEK+
Sbjct: 145 KEREREEELKILEYQREKAEREEEREAERRERKEEKE 181



 Score = 30.3 bits (69), Expect = 2.3
 Identities = 10/48 (20%), Positives = 28/48 (58%), Gaps = 13/48 (27%)

Query: 23 KEEETEEEKKKKKKKKKKKK-------------KKKRYEEEEKEEEKD 57
           +   EEE++++K+K+++++             K+++ E EE+ +E++
Sbjct: 52 LKALAEEEERERKRKEERREGRAVLQEQIEEREKRRQEEYEERLQERE 99



 Score = 30.3 bits (69), Expect = 2.3
 Identities = 13/52 (25%), Positives = 28/52 (53%), Gaps = 6/52 (11%)

Query: 12 QEKEVGGISRGKEEETEEEKKKKKK-KKKKKKKKKRYEEEEKEEEKDNKLLH 62
          +EK+     R K EE EEE++  +  ++++ K     EE E++ +++ +   
Sbjct: 27 EEKK-----RIKAEEKEEERRIDEMMEEERLKALAEEEERERKRKEERREGR 73



 Score = 29.9 bits (68), Expect = 2.6
 Identities = 11/41 (26%), Positives = 26/41 (63%), Gaps = 7/41 (17%)

Query: 25  EETEEEKKKKKKKKKKKKKK-------KRYEEEEKEEEKDN 58
            E EEE++ +++++K++K++       ++ E E++ EE D 
Sbjct: 163 AEREEEREAERRERKEEKEREVARLRAQQEEAEDEREELDE 203



 Score = 29.9 bits (68), Expect = 3.0
 Identities = 7/32 (21%), Positives = 21/32 (65%)

Query: 23  KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEE 54
           +  E EEE ++ ++ ++++ +++   EEE++ 
Sbjct: 304 RAAEREEELEEGERLREEEAERQARIEEERQR 335



 Score = 28.3 bits (64), Expect = 7.5
 Identities = 8/38 (21%), Positives = 22/38 (57%)

Query: 23  KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
           KEE    E++++ ++ ++ ++++   +   EEE+   L
Sbjct: 300 KEERRAAEREEELEEGERLREEEAERQARIEEERQRLL 337



 Score = 28.3 bits (64), Expect = 9.5
 Identities = 12/60 (20%), Positives = 32/60 (53%), Gaps = 11/60 (18%)

Query: 7   EEKEDQEKEVGGISRGKE------EETEEEKKKKKKKKKKKKKK-----KRYEEEEKEEE 55
           EE+E + K       G+       EE E+ ++++ +++ +++++     +R +EE++ E 
Sbjct: 57  EEEERERKRKEERREGRAVLQEQIEEREKRRQEEYEERLQEREQMDEIIERIQEEDEAEA 116


>gnl|CDD|184724 PRK14520, rpsP, 30S ribosomal protein S16; Provisional.
          Length = 155

 Score = 32.7 bits (75), Expect = 0.16
 Identities = 7/36 (19%), Positives = 9/36 (25%)

Query: 22  GKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
           G   E    KKKK   +    +      E       
Sbjct: 116 GPTAEATTPKKKKAAAEAAAAEAAAPAAEAAAAAAA 151



 Score = 32.0 bits (73), Expect = 0.29
 Identities = 8/51 (15%), Positives = 12/51 (23%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
           E K  + +         +     E    KKKK   +           E   
Sbjct: 97  EPKPSKLELFNAALAEADGGPTAEATTPKKKKAAAEAAAAEAAAPAAEAAA 147


>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
          Length = 135

 Score = 32.4 bits (74), Expect = 0.18
 Identities = 17/32 (53%), Positives = 22/32 (68%), Gaps = 3/32 (9%)

Query: 28  EEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
           E+ K+KK+KKKKKKKKKK   +  K+  K  K
Sbjct: 107 EKAKQKKQKKKKKKKKKK---KTSKKAAKKKK 135



 Score = 32.0 bits (73), Expect = 0.25
 Identities = 12/23 (52%), Positives = 18/23 (78%)

Query: 23  KEEETEEEKKKKKKKKKKKKKKK 45
           K+++ +++KKKKK  KK  KKKK
Sbjct: 113 KQKKKKKKKKKKKTSKKAAKKKK 135



 Score = 32.0 bits (73), Expect = 0.26
 Identities = 13/33 (39%), Positives = 22/33 (66%)

Query: 14  KEVGGISRGKEEETEEEKKKKKKKKKKKKKKKR 46
           K+V      ++++ +++KKKKKKK  KK  KK+
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKK 134



 Score = 31.6 bits (72), Expect = 0.35
 Identities = 15/28 (53%), Positives = 22/28 (78%)

Query: 29  EEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
           +EK K+KK+KKKKKKKK+ +  +K  +K
Sbjct: 106 KEKAKQKKQKKKKKKKKKKKTSKKAAKK 133



 Score = 29.7 bits (67), Expect = 1.3
 Identities = 13/36 (36%), Positives = 23/36 (63%), Gaps = 5/36 (13%)

Query: 29  EEKK-----KKKKKKKKKKKKKRYEEEEKEEEKDNK 59
           EE+K      K+K K+KK+KKK+ ++++K+  K   
Sbjct: 96  EEQKIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKAA 131



 Score = 28.9 bits (65), Expect = 2.6
 Identities = 15/38 (39%), Positives = 25/38 (65%), Gaps = 5/38 (13%)

Query: 27  TEEEK-----KKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
            EE+K      K+K K+KK+KKKK+ ++++K  +K  K
Sbjct: 95  VEEQKIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKAAK 132



 Score = 27.7 bits (62), Expect = 7.2
 Identities = 11/32 (34%), Positives = 23/32 (71%)

Query: 25  EETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
           EE +  K+  K+K K+KK+KK+ ++++K++  
Sbjct: 96  EEQKIVKQVLKEKAKQKKQKKKKKKKKKKKTS 127


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
           the C subunit of DNA polymerase delta. It carries the
           essential residues for binding to the Pol1 subunit of
           polymerase alpha, from residues 293-332, which are
           characterized by the motif D--G--VT, referred to as the
           DPIM motif. The first 160 residues of the protein form
           the minimal domain for binding to the B subunit, Cdc1,
           of polymerase delta, the final 10 C-terminal residues,
           362-372, being the DNA sliding clamp, PCNA, binding
           motif.
          Length = 427

 Score = 33.7 bits (77), Expect = 0.18
 Identities = 16/52 (30%), Positives = 30/52 (57%)

Query: 6   GEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
               ++ E E      G+  ++EEE ++K+K+K+K+ KK   +E+E EE + 
Sbjct: 262 PTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKRKRLKKMMEDEDEDEEMEI 313



 Score = 31.0 bits (70), Expect = 1.3
 Identities = 18/66 (27%), Positives = 33/66 (50%), Gaps = 11/66 (16%)

Query: 3   EKGGEEKEDQEKEVGGISRGKEEETEEEKKK-----------KKKKKKKKKKKKRYEEEE 51
           E+ G+     E E    +   E+E E+E K            ++K+K+K+K+ K+  E+E
Sbjct: 246 EESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKRKRLKKMMEDE 305

Query: 52  KEEEKD 57
            E+E+ 
Sbjct: 306 DEDEEM 311



 Score = 29.0 bits (65), Expect = 5.1
 Identities = 13/30 (43%), Positives = 19/30 (63%)

Query: 30  EKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
            KKK K+KK+KK+  +   +EE EEE   +
Sbjct: 222 FKKKTKEKKEKKEASESTVKEESEEESGKR 251


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 33.8 bits (78), Expect = 0.19
 Identities = 15/37 (40%), Positives = 26/37 (70%)

Query: 24  EEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
           EE  +EE ++ KK + ++ ++ R EEE+ EEE+D+K 
Sbjct: 261 EELAKEEAERLKKLEAERLRRMRGEEEDDEEEEDSKE 297



 Score = 30.0 bits (68), Expect = 3.5
 Identities = 15/65 (23%), Positives = 28/65 (43%), Gaps = 8/65 (12%)

Query: 2   NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKK--------KKKKKKKKKRYEEEEKE 53
           +       +      GG    ++EE + E+KK KK        K K  K +++  +EE E
Sbjct: 142 DLGDLASDDRAAHFGGGEDDEEDEEEQPERKKSKKEVMKEVIAKSKFYKAERQKAKEEDE 201

Query: 54  EEKDN 58
           + ++ 
Sbjct: 202 DLREE 206



 Score = 29.6 bits (67), Expect = 4.1
 Identities = 13/41 (31%), Positives = 25/41 (60%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRY 47
           +  +++E E    S  +++E EEE++K+KKKKK  +  +  
Sbjct: 350 DLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSE 390



 Score = 28.8 bits (65), Expect = 7.0
 Identities = 13/48 (27%), Positives = 28/48 (58%)

Query: 3   EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEE 50
           E+  E+ +  ++E        ++E +EE+++++K+KKKKK  +    E
Sbjct: 343 EEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSE 390


>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM).
          This family consists of several Plasmodium falciparum
          SPAM (secreted polymorphic antigen associated with
          merozoites) proteins. Variation among SPAM alleles is
          the result of deletions and amino acid substitutions in
          non-repetitive sequences within and flanking the
          alanine heptad-repeat domain. Heptad repeats in which
          the a and d position contain hydrophobic residues
          generate amphipathic alpha-helices which give rise to
          helical bundles or coiled-coil structures in proteins.
          SPAM is an example of a P. falciparum antigen in which
          a repetitive sequence has features characteristic of a
          well-defined structural element.
          Length = 164

 Score = 32.5 bits (74), Expect = 0.20
 Identities = 15/57 (26%), Positives = 29/57 (50%), Gaps = 11/57 (19%)

Query: 3  EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
          EK  +++E++E++   I   ++ E EEE  + ++           EEEE EE+  + 
Sbjct: 46 EKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEE-----------EEEEDEEDNVDL 91



 Score = 32.1 bits (73), Expect = 0.26
 Identities = 16/61 (26%), Positives = 26/61 (42%), Gaps = 6/61 (9%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEE--EEKEEEKDNKLLHNC 64
           EE ED E E        E+E EEE+ ++     K  +KK   +     +++    L+   
Sbjct: 63  EEPEDIEDE----EEIVEDEEEEEEDEEDNVDLKDIEKKNINDIFNSTQDDNAQNLISKN 118

Query: 65  Y 65
           Y
Sbjct: 119 Y 119


>gnl|CDD|240341 PTZ00272, PTZ00272, heat shock protein 83 kDa (Hsp83); Provisional.
          Length = 701

 Score = 33.9 bits (77), Expect = 0.20
 Identities = 25/62 (40%), Positives = 32/62 (51%), Gaps = 6/62 (9%)

Query: 1   MNEKGGEEK---EDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
           M EK  E++   ED+E        G+E + EE K+  + KKKK KK K   E  KE E  
Sbjct: 206 MVEKTTEKEVTDEDEEDTKKADEDGEEPKVEEVKEGDEGKKKKTKKVK---EVTKEYEVQ 262

Query: 58  NK 59
           NK
Sbjct: 263 NK 264


>gnl|CDD|221012 pfam11169, DUF2956, Protein of unknown function (DUF2956).  This
          family of proteins with unknown function appears to be
          restricted to Gammaproteobacteria.
          Length = 103

 Score = 31.5 bits (72), Expect = 0.21
 Identities = 19/60 (31%), Positives = 31/60 (51%), Gaps = 6/60 (10%)

Query: 4  KGGEEKEDQEKEVG-GISRGKEEETEEEKKKK----KKKKKKKKKKKRYEEEEKEEEKDN 58
          K G+ KE Q K +  GI +G  E  +++K K     K +K++ K K+R    + EE+   
Sbjct: 17 KPGQTKE-QTKLIAQGIEKGIAEYKKQQKAKAREADKARKQQLKAKQRQAANDDEEDTIE 75


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 728

 Score = 33.9 bits (78), Expect = 0.21
 Identities = 17/49 (34%), Positives = 29/49 (59%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEE 55
           +E +++ KE       +E E EEE K +K   K  K+ ++ ++EE+EEE
Sbjct: 441 KENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEE 489



 Score = 33.5 bits (77), Expect = 0.24
 Identities = 17/62 (27%), Positives = 30/62 (48%), Gaps = 1/62 (1%)

Query: 3   EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEE-EKDNKLL 61
            +  EE + + +E+     G+EE  EEE ++  KK   ++K      E++ E +K  K  
Sbjct: 384 ARKKEENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKEN 443

Query: 62  HN 63
            N
Sbjct: 444 KN 445



 Score = 33.1 bits (76), Expect = 0.34
 Identities = 14/30 (46%), Positives = 22/30 (73%)

Query: 31  KKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
           KK+KKK+K+K++   + E  +KE+ KD KL
Sbjct: 627 KKRKKKRKRKRRFLTKIEGVKKEKRKDKKL 656



 Score = 32.7 bits (75), Expect = 0.38
 Identities = 17/57 (29%), Positives = 30/57 (52%)

Query: 3   EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
            +   +K    ++ G  +  KE E+++ KK+ K + K+KK+    EE E EEE   +
Sbjct: 412 NEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVE 468



 Score = 32.7 bits (75), Expect = 0.41
 Identities = 17/55 (30%), Positives = 30/55 (54%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
            + E +EK+        E+E E + +K   K  K+ +K + EEEE+E +++N  L
Sbjct: 443 NKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDEENPWL 497



 Score = 30.4 bits (69), Expect = 2.1
 Identities = 20/54 (37%), Positives = 28/54 (51%), Gaps = 4/54 (7%)

Query: 2   NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEE 55
             +  E+KE  E+E        EEE + EK   K  K+ +K +K  EEEE +EE
Sbjct: 444 KNEFKEKKESDEEE----ELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDEE 493



 Score = 28.9 bits (65), Expect = 6.9
 Identities = 17/38 (44%), Positives = 21/38 (55%), Gaps = 7/38 (18%)

Query: 28  EEEKKKKKKKKKKKK----KKKRYEEEEKEEEKDNKLL 61
           EE K K+ KK K KK     KK   E+ KEE K+ + L
Sbjct: 216 EEAKAKRIKKIKSKKYHRVHKK---EKLKEELKEFEEL 250


>gnl|CDD|227458 COG5129, MAK16, Nuclear protein with HMG-like acidic region
           [General function prediction only].
          Length = 303

 Score = 33.5 bits (76), Expect = 0.21
 Identities = 19/59 (32%), Positives = 30/59 (50%), Gaps = 4/59 (6%)

Query: 1   MNEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
           M     EE+E  E E    S   E+E  + K +K+K    KK +K +   E E+E++N+
Sbjct: 241 METSESEEEESSESE----SDEDEDEDNKGKIRKRKTDDAKKSRKPHIHIEYEQERENE 295


>gnl|CDD|219761 pfam08243, SPT2, SPT2 chromatin protein.  This family includes the
           Saccharomyces cerevisiae protein SPT2 which is a
           chromatin protein involved in transcriptional
           regulation.
          Length = 116

 Score = 31.8 bits (72), Expect = 0.24
 Identities = 11/39 (28%), Positives = 23/39 (58%)

Query: 8   EKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKR 46
             E Q++E       + E+  E  ++++++K+KKKKK +
Sbjct: 78  FMEIQKEERRSARMARLEDERELAREEEEEKRKKKKKNK 116



 Score = 31.0 bits (70), Expect = 0.43
 Identities = 12/42 (28%), Positives = 25/42 (59%)

Query: 3   EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKK 44
           E    E + +E+    ++R ++E     +++++K+KKKKK K
Sbjct: 75  EATFMEIQKEERRSARMARLEDERELAREEEEEKRKKKKKNK 116


>gnl|CDD|176682 cd07261, Glo_EDI_BRP_like_11, This conserved domain belongs to a
           superfamily including the bleomycin resistance protein,
           glyoxalase I, and type I ring-cleaving dioxygenases.
           This protein family belongs to a conserved domain
           superfamily that is found in a variety of structurally
           related metalloproteins, including the bleomycin
           resistance protein, glyoxalase I, and type I
           ring-cleaving dioxygenases. A bound metal ion is
           required for protein activities for the members of this
           superfamily. A variety of metal ions have been found in
           the catalytic centers of these proteins including
           Fe(II), Mn(II), Zn(II), Ni(II) and Mg(II). The protein
           superfamily contains members with or without domain
           swapping.
          Length = 114

 Score = 31.4 bits (72), Expect = 0.24
 Identities = 13/28 (46%), Positives = 15/28 (53%)

Query: 107 STFLGQAPVLLSPGIHSFPFKLGLPLGL 134
           S  LG+ PV LSP    F    G+ LGL
Sbjct: 17  SELLGREPVELSPTFALFVLGSGVKLGL 44



 Score = 29.1 bits (66), Expect = 1.8
 Identities = 11/24 (45%), Positives = 13/24 (54%)

Query: 82  GQAPVLLSPGIHSFPFKLGLPLGL 105
           G+ PV LSP    F    G+ LGL
Sbjct: 21  GREPVELSPTFALFVLGSGVKLGL 44


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 33.2 bits (76), Expect = 0.28
 Identities = 13/53 (24%), Positives = 29/53 (54%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
           + +E Q+K+     R K+ E E    +++KK+ ++  K+   ++++ EE   K
Sbjct: 88  QAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAK 140



 Score = 30.9 bits (70), Expect = 1.5
 Identities = 8/54 (14%), Positives = 27/54 (50%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
           +++  +++ +  + + +    E++K+ ++  K+   K+K+ EE   +     K 
Sbjct: 94  QKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKA 147



 Score = 29.0 bits (65), Expect = 5.8
 Identities = 5/57 (8%), Positives = 32/57 (56%)

Query: 3   EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
           +K  +++ ++ ++     + + ++ E+E+   +++KK+ ++  +    ++++ ++  
Sbjct: 82  KKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAA 138



 Score = 29.0 bits (65), Expect = 5.9
 Identities = 12/55 (21%), Positives = 24/55 (43%), Gaps = 6/55 (10%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKK--KKKKKKKKKRYEEEEKEEEKDNK 59
           ++ E+  K+    +  K+++ EE   K     K K + + KR     K+   + K
Sbjct: 118 KQAEEAAKQ----AALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAK 168


>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457).  This
          is a family of uncharacterized proteins.
          Length = 449

 Score = 33.0 bits (75), Expect = 0.29
 Identities = 9/27 (33%), Positives = 17/27 (62%)

Query: 31 KKKKKKKKKKKKKKKRYEEEEKEEEKD 57
          KK+   +K  K+ ++   EEE ++E+D
Sbjct: 33 KKENAIRKLGKEAEEEAMEEEDDDEED 59



 Score = 31.1 bits (70), Expect = 1.3
 Identities = 16/56 (28%), Positives = 26/56 (46%), Gaps = 1/56 (1%)

Query: 3  EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDN 58
          E    E ED +  V G    KE+ T  +  KK+   +K  K+   E  E+E++ + 
Sbjct: 4  EFASSELEDDD-WVRGSLDYKEKLTLNDTMKKENAIRKLGKEAEEEAMEEEDDDEE 58



 Score = 31.1 bits (70), Expect = 1.3
 Identities = 9/28 (32%), Positives = 18/28 (64%)

Query: 31 KKKKKKKKKKKKKKKRYEEEEKEEEKDN 58
          K+   +K  K+ +++  EEE+ +EE D+
Sbjct: 34 KENAIRKLGKEAEEEAMEEEDDDEEDDD 61



 Score = 28.8 bits (64), Expect = 5.8
 Identities = 6/27 (22%), Positives = 17/27 (62%)

Query: 32 KKKKKKKKKKKKKKRYEEEEKEEEKDN 58
          KK+   +K  K+ +    EE+++++++
Sbjct: 33 KKENAIRKLGKEAEEEAMEEEDDDEED 59


>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992).  This
           bacterial family of proteins has no known function.
           However, the cis-regulatory yjdF motif, just upstream
           from the gene encoding the proteins for this family, is
           a small non-coding RNA, Rfam:RF01764. The yjdF motif is
           found in many Firmicutes, including Bacillus subtilis.
           In most cases, it resides in potential 5' UTRs of
           homologues of the yjdF gene whose function is unknown.
           However, in Streptococcus thermophilus, a yjdF RNA motif
           is associated with an operon whose protein products
           synthesise nicotinamide adenine dinucleotide (NAD+).
           Also, the S. thermophilus yjdF RNA lacks typical yjdF
           motif consensus features downstream of and including the
           P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
           S. thermophilus RNAs might sense a distinct compound
           that structurally resembles the ligand bound by other
           yjdF RNAs. On the ohter hand, perhaps these RNAs have an
           alternative solution forming a similar binding site, as
           is observed with some SAM riboswitches.
          Length = 132

 Score = 31.5 bits (72), Expect = 0.31
 Identities = 13/32 (40%), Positives = 21/32 (65%)

Query: 28  EEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
            E ++ K++KKK+ K+KK  E+E K + K  K
Sbjct: 93  LEHERNKQEKKKRSKEKKEEEKERKRQLKQQK 124



 Score = 30.7 bits (70), Expect = 0.54
 Identities = 14/40 (35%), Positives = 27/40 (67%)

Query: 13  EKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEK 52
           E +  GIS   ++  + E ++ K++KKK+ K+K+ EE+E+
Sbjct: 77  EVKKPGISTKAQQALKLEHERNKQEKKKRSKEKKEEEKER 116



 Score = 29.9 bits (68), Expect = 1.1
 Identities = 16/39 (41%), Positives = 27/39 (69%), Gaps = 4/39 (10%)

Query: 8   EKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKR 46
           E+  QEK+     R KE++ EE+++K++ K++KKK K R
Sbjct: 96  ERNKQEKK----KRSKEKKEEEKERKRQLKQQKKKAKHR 130


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 33.1 bits (76), Expect = 0.31
 Identities = 17/50 (34%), Positives = 29/50 (58%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
           +     E+E+ GI    +E  E+E++ ++ KKK K+ +KR EE E+  E 
Sbjct: 314 KRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEERHEL 363



 Score = 33.1 bits (76), Expect = 0.39
 Identities = 20/58 (34%), Positives = 28/58 (48%), Gaps = 4/58 (6%)

Query: 3   EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKK----KKKKKKKKKRYEEEEKEEEK 56
              G   E  EKE+  + + KEE  EE  K   +    KK+ K+ KK  EE +K + K
Sbjct: 380 RLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIEELKKAKGK 437



 Score = 32.3 bits (74), Expect = 0.59
 Identities = 16/54 (29%), Positives = 31/54 (57%)

Query: 3   EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
           EK  +E ++ E+    I   ++E    E  K+K ++K ++ ++R EE +KE E+
Sbjct: 224 EKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEE 277



 Score = 32.0 bits (73), Expect = 0.76
 Identities = 17/57 (29%), Positives = 30/57 (52%), Gaps = 3/57 (5%)

Query: 3   EKGGEEKEDQEKEVGGIS---RGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
           EK  EE E  ++++  +    +  ++E E+ ++ KKK  + +KK    EEE  E  K
Sbjct: 521 EKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAELLK 577



 Score = 31.6 bits (72), Expect = 0.94
 Identities = 14/67 (20%), Positives = 27/67 (40%), Gaps = 10/67 (14%)

Query: 7   EEKEDQEKEVGGIS----------RGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
           +E E  E     +              ++E EE ++K K+ K+ K+K + Y +  +  E+
Sbjct: 245 KELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEE 304

Query: 57  DNKLLHN 63
               L  
Sbjct: 305 YLDELRE 311



 Score = 31.6 bits (72), Expect = 1.2
 Identities = 16/54 (29%), Positives = 33/54 (61%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
            E+++ +K    + +  EE  E EK+ ++ +K+ ++ +K+Y EEE EE ++  L
Sbjct: 616 REEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEEYEELREEYL 669



 Score = 30.8 bits (70), Expect = 1.9
 Identities = 16/58 (27%), Positives = 33/58 (56%), Gaps = 3/58 (5%)

Query: 7   EEKEDQEKEVGGIS---RGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
           +E  + EK +  +     G EE  +E ++K+++ ++ KKK K  E+  +E E+ ++L 
Sbjct: 307 DELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEERHELY 364



 Score = 30.4 bits (69), Expect = 2.2
 Identities = 18/58 (31%), Positives = 33/58 (56%), Gaps = 4/58 (6%)

Query: 7   EEKEDQEKEVGGISRGKEE---ETEEEKKKKKKKKKK-KKKKKRYEEEEKEEEKDNKL 60
           EE E + + + G  R  EE   E EE  ++ KK+ ++ ++K K  +E +++ E+  KL
Sbjct: 241 EELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKL 298



 Score = 30.0 bits (68), Expect = 3.4
 Identities = 12/50 (24%), Positives = 26/50 (52%), Gaps = 2/50 (4%)

Query: 7   EEKEDQEKEVGGISRGKE--EETEEEKKKKKKKKKKKKKKKRYEEEEKEE 54
            E  +  +E+  + +  +  EE +EE ++ +K+ +  +  KR  EE+  E
Sbjct: 214 SELPELREELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRE 263



 Score = 29.6 bits (67), Expect = 4.6
 Identities = 17/57 (29%), Positives = 32/57 (56%), Gaps = 2/57 (3%)

Query: 5   GGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
           G E  E+  KE+  +     E    EK+ K+ ++K++K +K   E EK  +K+++L+
Sbjct: 442 GRELTEEHRKEL--LEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESELI 496



 Score = 28.9 bits (65), Expect = 6.4
 Identities = 15/54 (27%), Positives = 29/54 (53%), Gaps = 4/54 (7%)

Query: 3   EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
           E+  EE + + +E+      K +E +E K+K ++  K  +  + Y +E +E EK
Sbjct: 265 EERIEELKKEIEEL----EEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEK 314



 Score = 28.9 bits (65), Expect = 7.2
 Identities = 18/54 (33%), Positives = 29/54 (53%)

Query: 6   GEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
            +E E+  +E+  IS    E  EE +K +K+ K+ ++ K+  EE EKE E    
Sbjct: 199 EKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEELEKELESLEG 252



 Score = 28.5 bits (64), Expect = 8.6
 Identities = 13/52 (25%), Positives = 28/52 (53%), Gaps = 1/52 (1%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDN 58
           ++ ++ EK +  +   + E  EE K KK++ ++ KK+      E+ E+E + 
Sbjct: 345 KKLKELEKRLEEL-EERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEE 395


>gnl|CDD|219621 pfam07890, Rrp15p, Rrp15p.  Rrp15p is required for the formation
          of 60S ribosomal subunits.
          Length = 132

 Score = 31.6 bits (72), Expect = 0.32
 Identities = 12/36 (33%), Positives = 22/36 (61%)

Query: 19 ISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEE 54
          +SR K+    ++K K +K +KK K++ R E+ +  E
Sbjct: 24 LSRSKKLLKAKKKLKSEKLEKKAKRQLRAEKRQALE 59



 Score = 30.4 bits (69), Expect = 0.82
 Identities = 11/28 (39%), Positives = 16/28 (57%)

Query: 30 EKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
          + KKK K +K +KK KR    EK +  +
Sbjct: 32 KAKKKLKSEKLEKKAKRQLRAEKRQALE 59



 Score = 27.7 bits (62), Expect = 6.8
 Identities = 6/33 (18%), Positives = 19/33 (57%)

Query: 14 KEVGGISRGKEEETEEEKKKKKKKKKKKKKKKR 46
          K++    +  + E  E+K K++ + +K++  ++
Sbjct: 28 KKLLKAKKKLKSEKLEKKAKRQLRAEKRQALEK 60


>gnl|CDD|227674 COG5384, Mpp10, U3 small nucleolar ribonucleoprotein component
           [Translation, ribosomal structure and biogenesis].
          Length = 569

 Score = 33.1 bits (75), Expect = 0.33
 Identities = 14/54 (25%), Positives = 29/54 (53%)

Query: 5   GGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDN 58
           G  EK+ + +   G++  KEE T E+K + ++  K+K+ K       K  ++++
Sbjct: 476 GKAEKDGEIRLKNGVAMSKEELTREDKNRLRRALKRKRSKANLPNVNKRSKRND 529



 Score = 29.7 bits (66), Expect = 4.1
 Identities = 10/32 (31%), Positives = 14/32 (43%)

Query: 30  EKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
           E  KK    K  K+ +  +EEE +   D   L
Sbjct: 220 EPVKKHSDVKDPKEDEELDEEEHDSAMDKVKL 251


>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
           domain.  This domain is found in a number of different
           types of plant proteins including NAM-like proteins.
          Length = 147

 Score = 31.6 bits (72), Expect = 0.34
 Identities = 17/57 (29%), Positives = 30/57 (52%), Gaps = 8/57 (14%)

Query: 2   NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKK-KKKRYEEEEKEEEKD 57
            +K   +KE+ EKE       KE+E    K   + +K++ + +KK+ E +  +EEK 
Sbjct: 74  RDKLKAKKEEAEKE-------KEKEERFMKALAEAEKERAELEKKKAEAKLMKEEKK 123



 Score = 30.8 bits (70), Expect = 0.61
 Identities = 14/56 (25%), Positives = 30/56 (53%)

Query: 2  NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
          N   G    ++ ++    S  + +  E  KK K+K ++ K K K+ E E+++E+++
Sbjct: 37 NSSPGSTSNEENEDEDDESTAESKRPEGRKKAKEKLRRDKLKAKKEEAEKEKEKEE 92



 Score = 30.0 bits (68), Expect = 1.3
 Identities = 17/41 (41%), Positives = 24/41 (58%), Gaps = 5/41 (12%)

Query: 25 EETEEEK-----KKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
          E T E K     KK K+K ++ K K + EE EKE+EK+ + 
Sbjct: 54 ESTAESKRPEGRKKAKEKLRRDKLKAKKEEAEKEKEKEERF 94


>gnl|CDD|218899 pfam06102, DUF947, Domain of unknown function (DUF947).  Family of
           eukaryotic proteins with unknown function.
          Length = 168

 Score = 31.9 bits (73), Expect = 0.34
 Identities = 15/53 (28%), Positives = 30/53 (56%)

Query: 8   EKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
           + +D+E+E+    + +E+E  +E KK    KK + KK   +++  E +K  +L
Sbjct: 99  KNKDREREILKEHKKQEKELIKEGKKPYYLKKSEIKKLVLKKKFDELKKSKQL 151


>gnl|CDD|202101 pfam02050, FliJ, Flagellar FliJ protein. 
          Length = 122

 Score = 31.1 bits (71), Expect = 0.35
 Identities = 14/57 (24%), Positives = 31/57 (54%), Gaps = 5/57 (8%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKK-----KKKKKKRYEEEEKEEEKDN 58
           E    Q++E+    +  E+  E+ ++  +++KK     +++KKK  +EE + E+K  
Sbjct: 59  EAIAQQQQELEQAEKQVEQAREQWQEANQERKKLEKLLEREKKKEQKEENRREQKLL 115


>gnl|CDD|227935 COG5648, NHP6B, Chromatin-associated proteins containing the HMG
           domain [Chromatin structure and dynamics].
          Length = 211

 Score = 32.1 bits (73), Expect = 0.36
 Identities = 15/53 (28%), Positives = 20/53 (37%), Gaps = 1/53 (1%)

Query: 2   NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEE 54
            +  G   +    E   I   K     +E KKKK   K KK K+ Y+    E 
Sbjct: 160 PKVEGPSPDKALVEETKI-ISKAWSELDESKKKKYIDKYKKLKEEYDSFYPEA 211



 Score = 27.9 bits (62), Expect = 9.1
 Identities = 15/55 (27%), Positives = 21/55 (38%), Gaps = 5/55 (9%)

Query: 2   NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
             +G    +   +E   IS+   E  E      KKKK   K KK  EE +    +
Sbjct: 161 KVEGPSPDKALVEETKIISKAWSELDE-----SKKKKYIDKYKKLKEEYDSFYPE 210


>gnl|CDD|205206 pfam13025, DUF3886, Protein of unknown function (DUF3886).  This
          family of proteins is functionally uncharacterized.
          This family of proteins is found in bacteria. Proteins
          in this family are approximately 90 amino acids in
          length. There are two completely conserved L residues
          that may be functionally important.
          Length = 70

 Score = 30.0 bits (68), Expect = 0.37
 Identities = 13/36 (36%), Positives = 22/36 (61%), Gaps = 7/36 (19%)

Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDN 58
          KE + EEEK++++++ +K        EE KE EK+ 
Sbjct: 27 KELKAEEEKREEEEEARK-------REERKEREKNK 55



 Score = 29.2 bits (66), Expect = 0.63
 Identities = 14/36 (38%), Positives = 27/36 (75%), Gaps = 1/36 (2%)

Query: 23 KEEETEEEKKKKKKKKKKKKKK-KRYEEEEKEEEKD 57
          +EE+ EEE++ +K++++K+++K K +EE   E E D
Sbjct: 32 EEEKREEEEEARKREERKEREKNKSFEELLNESELD 67



 Score = 29.2 bits (66), Expect = 0.72
 Identities = 9/31 (29%), Positives = 23/31 (74%)

Query: 27 TEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
           +E K +++K++++++ +KR E +E+E+ K 
Sbjct: 26 KKELKAEEEKREEEEEARKREERKEREKNKS 56



 Score = 29.2 bits (66), Expect = 0.75
 Identities = 8/30 (26%), Positives = 23/30 (76%)

Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEK 52
          K+E   EE+K++++++ +K+++++  E+ K
Sbjct: 26 KKELKAEEEKREEEEEARKREERKEREKNK 55



 Score = 27.3 bits (61), Expect = 2.9
 Identities = 7/31 (22%), Positives = 23/31 (74%)

Query: 25 EETEEEKKKKKKKKKKKKKKKRYEEEEKEEE 55
          +  EE+++++++ +K++++K+R + +  EE 
Sbjct: 30 KAEEEKREEEEEARKREERKEREKNKSFEEL 60


>gnl|CDD|220245 pfam09444, MRC1, MRC1-like domain.  This putative domain is found
          to be the most conserved region in mediator of
          replication checkpoint protein 1.
          Length = 145

 Score = 31.6 bits (72), Expect = 0.39
 Identities = 19/69 (27%), Positives = 36/69 (52%), Gaps = 15/69 (21%)

Query: 7  EEKEDQEKEVGGISRGKEEETEEE-----------KKKKKKKKKKKKKKKRYEEEEKEEE 55
          EE ED+   +GG     +E+++++           +  K+K +  +   KR EEE +++E
Sbjct: 15 EESEDEWAGLGGAD---DEDSDDDDDSELEEMIDDEYSKEKAENDELAAKRAEEEREDDE 71

Query: 56 KD-NKLLHN 63
          K+  KLL +
Sbjct: 72 KEVEKLLDD 80



 Score = 28.9 bits (65), Expect = 2.8
 Identities = 16/68 (23%), Positives = 32/68 (47%), Gaps = 19/68 (27%)

Query: 7   EEKEDQEKEV---------GGISRGK----------EEETEEEKKKKKKKKKKKKKKKRY 47
           EE+ED EKEV         GG+ R +          + E E  ++++ K+++    ++R 
Sbjct: 65  EEREDDEKEVEKLLDDITTGGLRRRRGKNGLDLELSDSEDELLQRRRLKRRELALMRQRL 124

Query: 48  EEEEKEEE 55
            E+   ++
Sbjct: 125 LEDGDLDK 132


>gnl|CDD|218215 pfam04696, Pinin_SDK_memA, pinin/SDK/memA/ protein conserved
           region.  Members of this family have very varied
           localisations within the eukaryotic cell. pinin is known
           to localise at the desmosomes and is implicated in
           anchoring intermediate filaments to the desmosomal
           plaque. SDK2/3 is a dynamically localised nuclear
           protein thought to be involved in modulation of
           alternative pre-mRNA splicing. memA is a tumour marker
           preferentially expressed in human melanoma cell lines. A
           common feature of the members of this family is that
           they may all participate in regulating protein-protein
           interactions.
          Length = 131

 Score = 31.3 bits (71), Expect = 0.40
 Identities = 17/77 (22%), Positives = 39/77 (50%), Gaps = 9/77 (11%)

Query: 4   KGGEEKEDQEKEVGGISRGKEEETEEEKKKKKK-------KKKKKKKKKRYEEEEKEEEK 56
           +       +EK    I +  EE+ ++E+++ +K       ++++K+ + R  E++ E+EK
Sbjct: 25  QEESRLTSKEKRRAEIEQKLEEQEKQEREELRKEKRELFEERRRKQLELRKLEQKMEDEK 84

Query: 57  DNKL--LHNCYLYSYYR 71
             +    HN  L ++ R
Sbjct: 85  LQETWHEHNLALANFIR 101



 Score = 28.6 bits (64), Expect = 3.0
 Identities = 15/54 (27%), Positives = 33/54 (61%), Gaps = 1/54 (1%)

Query: 7  EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEE-EKDNK 59
          EEK+   +  G +    ++ ++EE +   K+K++ + +++ EE+EK+E E+  K
Sbjct: 4  EEKKRNRRMFGLLLGTLQKFSQEESRLTSKEKRRAEIEQKLEEQEKQEREELRK 57


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 32.8 bits (75), Expect = 0.40
 Identities = 17/37 (45%), Positives = 24/37 (64%)

Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
          K+E+   E KKK +KKKKKKK+K+  + E E +   K
Sbjct: 58 KKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFK 94



 Score = 32.0 bits (73), Expect = 0.69
 Identities = 17/53 (32%), Positives = 26/53 (49%)

Query: 7  EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
          EE +          +  +    ++K +KKKKKKK+KK+ + E E K   K  K
Sbjct: 45 EENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPK 97



 Score = 30.8 bits (70), Expect = 1.6
 Identities = 17/69 (24%), Positives = 27/69 (39%), Gaps = 9/69 (13%)

Query: 2   NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKK---------RYEEEEK 52
             K   EK+ ++K+     + + E     K  KK KK KKK  K          + +  +
Sbjct: 65  ESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAFNKIAE 124

Query: 53  EEEKDNKLL 61
             EK N  +
Sbjct: 125 LAEKSNVYI 133



 Score = 29.7 bits (67), Expect = 3.4
 Identities = 13/44 (29%), Positives = 21/44 (47%), Gaps = 7/44 (15%)

Query: 23 KEEETEEEKKKKKKKKKKK-------KKKKRYEEEEKEEEKDNK 59
             E E +      KK KK       KKK   ++++K+E+K+ K
Sbjct: 41 TFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPK 84



 Score = 29.3 bits (66), Expect = 4.8
 Identities = 16/61 (26%), Positives = 30/61 (49%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLHNCYL 66
           ++K+ ++++    S G+ +   +  KK KK KKK  K K  E+ +    K  +L     +
Sbjct: 72  KKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAFNKIAELAEKSNV 131

Query: 67  Y 67
           Y
Sbjct: 132 Y 132



 Score = 28.5 bits (64), Expect = 7.4
 Identities = 15/35 (42%), Positives = 25/35 (71%)

Query: 26 ETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
          + +E+K  + KKK +KKKKK+ E++E + E + KL
Sbjct: 57 DKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKL 91


>gnl|CDD|192481 pfam10197, Cir_N, N-terminal domain of CBF1 interacting
          co-repressor CIR.  This is a 45 residue conserved
          region at the N-terminal end of a family of proteins
          referred to as CIRs (CBF1-interacting co-repressors).
          CBF1 (centromere-binding factor 1) acts as a
          transcription factor that causes repression by binding
          specifically to GTGGGAA motifs in responsive promoters,
          and it requires CIR as a co-repressor. CIR binds to
          histone deacetylase and to SAP30 and serves as a linker
          between CBF1 and the histone deacetylase complex.
          Length = 37

 Score = 29.1 bits (66), Expect = 0.40
 Identities = 9/23 (39%), Positives = 16/23 (69%)

Query: 28 EEEKKKKKKKKKKKKKKKRYEEE 50
          E E+K  +++KK ++ +K  EEE
Sbjct: 15 EAEQKALEEQKKIEELRKEIEEE 37


>gnl|CDD|129694 TIGR00606, rad50, rad50.  All proteins in this family for which
           functions are known are involvedin recombination,
           recombinational repair, and/or non-homologous end
           joining.They are components of an exonuclease complex
           with MRE11 homologs. This family is distantly related to
           the SbcC family of bacterial proteins.This family is
           based on the phylogenomic analysis of JA Eisen (1999,
           Ph.D. Thesis, Stanford University).
          Length = 1311

 Score = 33.1 bits (75), Expect = 0.41
 Identities = 16/60 (26%), Positives = 26/60 (43%), Gaps = 7/60 (11%)

Query: 9   KEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEE-------EKEEEKDNKLL 61
           KE Q +    I +  E E +  K + K+ +    K  + + E       +K+ EKDN  L
Sbjct: 229 KEAQLESSREIVKSYENELDPLKNRLKEIEHNLSKIMKLDNEIKALKSRKKQMEKDNSEL 288


>gnl|CDD|240577 cd12950, RRP7_Rrp7p, RRP7 domain ribosomal RNA-processing protein 7
           (Rrp7p) and similar proteins.  This CD corresponds to
           the RRP7 domain of Rrp7p. Rrp7p is encoded by YCL031C
           gene from Saccharomyces cerevisiae. It is an essential
           yeast protein involved in pre-rRNA processing and
           ribosome assembly. Rrp7p contains an N-terminal RNA
           recognition motif (RRM), also termed RBD (RNA binding
           domain) or RNP (ribonucleoprotein domain), and a
           C-terminal RRP7 domain.
          Length = 128

 Score = 31.1 bits (71), Expect = 0.42
 Identities = 19/47 (40%), Positives = 29/47 (61%), Gaps = 7/47 (14%)

Query: 16  VGGISRGKEEETEEEKKKKKKKKKKKKK-----KKRYEEEEKEEEKD 57
             G   GK  E E+++K+KKKKKKK+ +     + R  E++KEE+ D
Sbjct: 65  AAGEEAGKAAEEEKKEKEKKKKKKKELEDFYRFQLR--EKKKEEQAD 109


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 32.8 bits (75), Expect = 0.44
 Identities = 10/48 (20%), Positives = 23/48 (47%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEE 54
           E+ E  ++E+       EE  +   + ++ K++ ++K     EE +E 
Sbjct: 330 EKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEEL 377



 Score = 32.8 bits (75), Expect = 0.47
 Identities = 13/53 (24%), Positives = 24/53 (45%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
           +EK +  KE         EE E+   + ++ K++ ++K     EE EE  +  
Sbjct: 329 KEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEAL 381



 Score = 29.3 bits (66), Expect = 5.4
 Identities = 16/55 (29%), Positives = 31/55 (56%), Gaps = 3/55 (5%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
           EE E+ E E   IS  +E   E E + ++ +++ ++ K++ E  ++E E+   LL
Sbjct: 295 EEIEELEGE---ISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLL 346



 Score = 28.9 bits (65), Expect = 6.2
 Identities = 18/61 (29%), Positives = 33/61 (54%), Gaps = 5/61 (8%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEE-----EKEEEKDNKLL 61
           EE E+ EKE+  +    EE  EE ++ +++  + K++ +  E E     E+ EE +N+L 
Sbjct: 260 EELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLRERLEELENELE 319

Query: 62  H 62
            
Sbjct: 320 E 320


>gnl|CDD|217834 pfam03998, Utp11, Utp11 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 239

 Score = 32.0 bits (73), Expect = 0.46
 Identities = 18/54 (33%), Positives = 31/54 (57%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
            E  D+ +    IS+ ++    +EK+KKK  KKK+K  K  +E ++ E+K  K+
Sbjct: 147 PELLDRRENRPRISQLEKTSLVDEKQKKKSAKKKRKLYKELKERKEREKKLKKV 200


>gnl|CDD|219547 pfam07741, BRF1, Brf1-like TBP-binding domain.  This region
          covers both the Brf homology II and III regions. This
          region is involved in binding TATA binding protein.
          Length = 95

 Score = 30.3 bits (69), Expect = 0.49
 Identities = 16/41 (39%), Positives = 23/41 (56%), Gaps = 7/41 (17%)

Query: 9  KEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEE 49
          +E +EKE        +++ +E     KKKKK+K KKKR E 
Sbjct: 30 EEQEEKE-------LKQKADEGNNSGKKKKKRKAKKKRDEA 63



 Score = 28.8 bits (65), Expect = 1.6
 Identities = 10/33 (30%), Positives = 17/33 (51%)

Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEE 55
          KE + + ++     KKKKK+K K+  +E     
Sbjct: 35 KELKQKADEGNNSGKKKKKRKAKKKRDEAGPAS 67



 Score = 27.2 bits (61), Expect = 5.0
 Identities = 11/35 (31%), Positives = 21/35 (60%)

Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
          ++EE E ++K  +     KKKKKR  +++++E   
Sbjct: 31 EQEEKELKQKADEGNNSGKKKKKRKAKKKRDEAGP 65


>gnl|CDD|146486 pfam03879, Cgr1, Cgr1 family.  Members of this family are
          coiled-coil proteins that are involved in pre-rRNA
          processing.
          Length = 105

 Score = 30.4 bits (69), Expect = 0.49
 Identities = 13/47 (27%), Positives = 32/47 (68%), Gaps = 1/47 (2%)

Query: 7  EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKE 53
          E++ ++  E   I + +E+E ++EK+ +++++ +  K++R  +EEKE
Sbjct: 31 EKRMEKRLEQQAI-KAREKELKDEKEAERQRRIQAIKERRAAKEEKE 76



 Score = 27.8 bits (62), Expect = 5.1
 Identities = 12/40 (30%), Positives = 24/40 (60%), Gaps = 1/40 (2%)

Query: 21 RGKEEETEEEKKKKKKKKKKKKKKKRYE-EEEKEEEKDNK 59
          R K + T  EK+ +K+ +++  K +  E ++EKE E+  +
Sbjct: 22 RPKSKLTSWEKRMEKRLEQQAIKAREKELKDEKEAERQRR 61


>gnl|CDD|219528 pfam07703, A2M_N_2, Alpha-2-macroglobulin family N-terminal region.
            This family includes a region of the
           alpha-2-macroglobulin family.
          Length = 136

 Score = 31.0 bits (71), Expect = 0.50
 Identities = 16/57 (28%), Positives = 23/57 (40%), Gaps = 10/57 (17%)

Query: 287 VSLDRGGYVPGETIIINATVYNRSKITVKSTKAALTETIQYLARNKVLESETRELAS 343
           +SLD+  Y  GET           K+TV S     T      +R K+L S   ++  
Sbjct: 3   LSLDKKSYKVGETA----------KVTVTSPFPIGTALYTVESRGKILSSGVVDVKG 49


>gnl|CDD|236335 PRK08724, fliD, flagellar capping protein; Validated.
          Length = 673

 Score = 32.5 bits (74), Expect = 0.51
 Identities = 17/62 (27%), Positives = 31/62 (50%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLHNCYL 66
           E+K  QEK     +  K E T E+ K+  + K + ++++R E+ +K +    +      L
Sbjct: 369 EQKLAQEKAQLDAAVEKGELTPEQAKQIARAKLEPEERERLEKIDKAQAALKQAQSAFDL 428

Query: 67  YS 68
           YS
Sbjct: 429 YS 430


>gnl|CDD|240578 cd12951, RRP7_Rrp7A, RRP7 domain ribosomal RNA-processing protein
          7 homolog A (Rrp7A) and similar proteins.  The family
          corresponds to the RRP7 domain of Rrp7A, also termed
          gastric cancer antigen Zg14, and similar proteins which
          are yeast ribosomal RNA-processing protein 7 (Rrp7p)
          homologs mainly found in Metazoans. The cellular
          function of Rrp7A remains unclear currently. Rrp7A
          harbors an N-terminal RNA recognition motif (RRM), also
          termed RBD (RNA binding domain) or RNP
          (ribonucleoprotein domain), and a C-terminal RRP7
          domain.
          Length = 129

 Score = 30.7 bits (70), Expect = 0.52
 Identities = 19/49 (38%), Positives = 24/49 (48%), Gaps = 6/49 (12%)

Query: 3  EKGGEEKEDQEKEVGGI------SRGKEEETEEEKKKKKKKKKKKKKKK 45
          EK  +EKE +  E G +       R K    E    K  +K+KKKKKKK
Sbjct: 37 EKEEKEKEAEPDEDGWVTVTKKGRRPKTARKESVAAKAAEKEKKKKKKK 85



 Score = 29.6 bits (67), Expect = 1.5
 Identities = 16/66 (24%), Positives = 33/66 (50%), Gaps = 12/66 (18%)

Query: 7  EEKEDQEKEVGGISRGKEEETEEE-------KKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
          +++E++EKE       ++E   +E       KK ++ K  +K+       E+++++K  K
Sbjct: 31 DKEEEEEKE-----EKEKEAEPDEDGWVTVTKKGRRPKTARKESVAAKAAEKEKKKKKKK 85

Query: 60 LLHNCY 65
           L N Y
Sbjct: 86 ELKNFY 91



 Score = 27.2 bits (61), Expect = 9.9
 Identities = 13/36 (36%), Positives = 22/36 (61%)

Query: 23  KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDN 58
           KE    +  +K+KKKKKKK+ K  Y  + +E +++ 
Sbjct: 67  KESVAAKAAEKEKKKKKKKELKNFYRFQIRESKRNE 102


>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
           complex component).  RNases H are enzymes that
           specifically hydrolyse RNA when annealed to a
           complementary DNA and are present in all living
           organisms. In yeast RNase H2 is composed of a complex of
           three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
           family represents the homologues of Ydr279p. It is not
           known whether non yeast proteins in this family fulfil
           the same function.
          Length = 287

 Score = 31.9 bits (73), Expect = 0.53
 Identities = 10/27 (37%), Positives = 16/27 (59%)

Query: 19  ISRGKEEETEEEKKKKKKKKKKKKKKK 45
            SR +++   +E+ KKKK K+ K  K 
Sbjct: 245 ESRAEKKRKSKEEIKKKKPKESKGVKA 271



 Score = 30.4 bits (69), Expect = 1.6
 Identities = 13/39 (33%), Positives = 19/39 (48%), Gaps = 5/39 (12%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKK 45
           E+ E  E         K+ +++EE KKKK K+ K  K  
Sbjct: 239 EDVEAAESRAE-----KKRKSKEEIKKKKPKESKGVKAL 272


>gnl|CDD|238686 cd01390, HMGB-UBF_HMG-box, HMGB-UBF_HMG-box, class II and III
          members of the HMG-box superfamily of DNA-binding
          proteins. These proteins bind the minor groove of DNA
          in a non-sequence specific fashion and contain two or
          more tandem HMG boxes. Class II members include
          non-histone chromosomal proteins, HMG1 and HMG2, which
          bind to bent or distorted DNA such as four-way DNA
          junctions, synthetic DNA cruciforms, kinked
          cisplatin-modified DNA, DNA bulges, cross-overs in
          supercoiled DNA, and can cause looping of linear DNA.
          Class III members include nucleolar and mitochondrial
          transcription factors, UBF and mtTF1, which bind
          four-way DNA junctions.
          Length = 66

 Score = 29.5 bits (67), Expect = 0.54
 Identities = 11/21 (52%), Positives = 18/21 (85%)

Query: 30 EKKKKKKKKKKKKKKKRYEEE 50
          E++KKK ++K +K K+RYE+E
Sbjct: 44 EEEKKKYEEKAEKDKERYEKE 64


>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein.  The proteins in this family
           are designated YL1. These proteins have been shown to be
           DNA-binding and may be a transcription factor.
          Length = 238

 Score = 32.0 bits (73), Expect = 0.56
 Identities = 13/36 (36%), Positives = 23/36 (63%)

Query: 24  EEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
           E E E +++++ KKKK+ K K   E  +K+++KD  
Sbjct: 73  EGEKELQREERLKKKKRVKTKAYKEPTKKKKKKDPT 108



 Score = 31.6 bits (72), Expect = 0.57
 Identities = 18/58 (31%), Positives = 25/58 (43%), Gaps = 14/58 (24%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKK-------KKKKKRYEEEEKEEEKD 57
           E  +++E E       KE + EE  KKKK+ K K       KKKKK     +  +   
Sbjct: 67  ESDDEEEGE-------KELQREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAA 117



 Score = 29.3 bits (66), Expect = 4.0
 Identities = 14/36 (38%), Positives = 23/36 (63%), Gaps = 3/36 (8%)

Query: 23  KEEETEEEKKK---KKKKKKKKKKKKRYEEEEKEEE 55
           + +E E  +KK   K +K+K+KKK+K   +EE+  E
Sbjct: 156 RLKEREIRRKKIQAKARKRKEKKKEKELTQEERLAE 191


>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
           fragment found in eukaryotic DNA topoisomerase (topo) IB
           proteins similar to the monomeric yeast and human topo I
           and heterodimeric topo I from Leishmania donvanni. Topo
           I enzymes are divided into:  topo type IA (bacterial)
           and type IB (eukaryotic). Topo I relaxes superhelical
           tension in duplex DNA by creating a single-strand nick,
           the broken strand can then rotate around the unbroken
           strand to remove DNA supercoils and, the nick is
           religated, liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit re-ligation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  In addition to differences in
           structure and some biochemical properties,
           Trypanosomatid parasite topo I differ from human topo I
           in their sensitivity to CPTs and other classical topo I
           inhibitors. Trypanosomatid topos I play putative roles
           in organizing the kinetoplast DNA network unique to
           these parasites.  This family may represent more than
           one structural domain.
          Length = 215

 Score = 31.5 bits (72), Expect = 0.57
 Identities = 15/28 (53%), Positives = 20/28 (71%)

Query: 29  EEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
           EE+K+KKK   K++KK   EE+EK EE 
Sbjct: 95  EEEKEKKKAMSKEEKKAIKEEKEKLEEP 122



 Score = 30.7 bits (70), Expect = 1.3
 Identities = 11/28 (39%), Positives = 19/28 (67%)

Query: 25  EETEEEKKKKKKKKKKKKKKKRYEEEEK 52
           EE +E+KK   K++KK  K+++ + EE 
Sbjct: 95  EEEKEKKKAMSKEEKKAIKEEKEKLEEP 122


>gnl|CDD|221733 pfam12720, DUF3807, Protein of unknown function (DUF3807).  This is
           a family of conserved fungal proteins of unknown
           function.
          Length = 169

 Score = 31.3 bits (71), Expect = 0.57
 Identities = 10/62 (16%), Positives = 25/62 (40%), Gaps = 3/62 (4%)

Query: 1   MNEKGGEEKEDQEKEVGGISR---GKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
             E   E + D   + G ++       +E E++   +K+K  + K++K  ++     +  
Sbjct: 74  EAEAEEEGEVDASPDAGAVAGESSADRKEAEQQGAAQKRKSCRDKERKSAKDPRGGTQDV 133

Query: 58  NK 59
             
Sbjct: 134 VD 135


>gnl|CDD|203462 pfam06476, DUF1090, Protein of unknown function (DUF1090).  This
           family consists of several bacterial proteins of unknown
           function and is known as YqjC in E. coli.
          Length = 115

 Score = 30.3 bits (69), Expect = 0.62
 Identities = 14/49 (28%), Positives = 28/49 (57%), Gaps = 4/49 (8%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEE 55
            + E++E+EV      +E+E +E + K    K +KK++K  E + + +E
Sbjct: 70  AKVEEKEQEV----AEREQELKEAQAKGDADKIEKKQRKLAEAQAELQE 114


>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
          Length = 470

 Score = 32.0 bits (73), Expect = 0.64
 Identities = 13/36 (36%), Positives = 19/36 (52%)

Query: 24 EEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
            E +E + K+  KKKKKKKKK+ ++   E      
Sbjct: 44 ISENQEAENKQNNKKKKKKKKKKKKKNLGEAYDLAY 79



 Score = 31.6 bits (72), Expect = 0.81
 Identities = 21/65 (32%), Positives = 30/65 (46%), Gaps = 3/65 (4%)

Query: 2  NEKGGEEKEDQEKEVGGISRGKEEETEEEKK---KKKKKKKKKKKKKRYEEEEKEEEKDN 58
           +K  + K+D + +   ++    E  E E K   KKKKKKKKKKKKK   E         
Sbjct: 23 KKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKKKKKNLGEAYDLAYDLP 82

Query: 59 KLLHN 63
           +  +
Sbjct: 83 VVWSS 87


>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
            biogenesis [Translation, ribosomal structure and
            biogenesis].
          Length = 1077

 Score = 32.4 bits (73), Expect = 0.64
 Identities = 11/54 (20%), Positives = 26/54 (48%), Gaps = 2/54 (3%)

Query: 5    GGEEKEDQE--KEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
               E  ++   K+     R K++E +E  +  ++ K+++  KK  E E++  + 
Sbjct: 1007 VPPECREKHEIKDRIVKERIKDQEEKERMESLQRAKEEEIGKKEKEREQRIRKT 1060



 Score = 30.1 bits (67), Expect = 2.8
 Identities = 12/62 (19%), Positives = 28/62 (45%), Gaps = 5/62 (8%)

Query: 2   NEKGGEEKEDQEKEVGGISRGKEEETEE-----EKKKKKKKKKKKKKKKRYEEEEKEEEK 56
            E      +++ +E  G S   E E        E ++++  +KK++ +  +E EE+ + +
Sbjct: 607 LEDEENSSDNEMEESRGSSVTAENEESADEVDYETEREENARKKEELRGNFELEERGDPE 666

Query: 57  DN 58
             
Sbjct: 667 KK 668


>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase;
          Provisional.
          Length = 330

 Score = 31.7 bits (73), Expect = 0.66
 Identities = 9/33 (27%), Positives = 18/33 (54%)

Query: 24 EEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
          +    ++K+ K   K +KKK+K+  +  K + K
Sbjct: 57 KAALLDKKELKAWHKAQKKKEKQEAKAAKAKSK 89



 Score = 30.6 bits (70), Expect = 1.5
 Identities = 11/40 (27%), Positives = 22/40 (55%)

Query: 6  GEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKK 45
           E+ ++ ++E+      K+E     K +KKK+K++ K  K
Sbjct: 46 NEQYKEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAK 85



 Score = 29.0 bits (66), Expect = 5.2
 Identities = 8/23 (34%), Positives = 13/23 (56%)

Query: 23 KEEETEEEKKKKKKKKKKKKKKK 45
          K     ++KK+K++ K  K K K
Sbjct: 67 KAWHKAQKKKEKQEAKAAKAKSK 89



 Score = 28.3 bits (64), Expect = 9.0
 Identities = 9/27 (33%), Positives = 16/27 (59%)

Query: 33 KKKKKKKKKKKKKRYEEEEKEEEKDNK 59
          KK+ K   K +KK+ ++E K  +  +K
Sbjct: 63 KKELKAWHKAQKKKEKQEAKAAKAKSK 89



 Score = 28.3 bits (64), Expect = 9.9
 Identities = 8/29 (27%), Positives = 16/29 (55%), Gaps = 3/29 (10%)

Query: 27 TEEEKK---KKKKKKKKKKKKKRYEEEEK 52
           ++E K   K +KKK+K++ K    + + 
Sbjct: 62 DKKELKAWHKAQKKKEKQEAKAAKAKSKP 90


>gnl|CDD|221177 pfam11708, Slu7, Pre-mRNA splicing Prp18-interacting factor.  The
           spliceosome, an assembly of snRNAs (U1, U2, U4/U6, and
           U5) and proteins, catalyzes the excision of introns from
           pre-mRNAs in two successive trans-esterification
           reactions. Step 2 depends upon integral spliceosome
           constituents such as U5 snRNA and Prp8 and
           non-spliceosomal proteins Prp16, Slu7, Prp18, and Prp22.
           ATP hydrolysis by the DEAH-box enzyme Prp16 promotes a
           conformational change in the spliceosome that leads to
           protection of the 3'ss from targeted RNase H cleavage.
           This change, which probably reflects binding of the 3'ss
           PyAG in the catalytic centre of the spliceosome,
           requires the ordered recruitment of Slu7, Prp18, and
           Prp22 to the spliceosome. There is a close functional
           relationship between Prp8, Prp18, and Slu7, and Prp18
           interacts with Slu7, so that together they recruit Prp22
           to the spliceosome. Most members of the family carry a
           zinc-finger of the CCHC-type upstream of this domain.
          Length = 236

 Score = 31.6 bits (72), Expect = 0.67
 Identities = 12/35 (34%), Positives = 22/35 (62%)

Query: 27  TEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
            E+EKK++ K +KK+   ++Y  EE  ++   +LL
Sbjct: 164 KEKEKKEQLKIQKKQSLLEKYGGEEHLDKPPKELL 198



 Score = 28.9 bits (65), Expect = 4.3
 Identities = 13/37 (35%), Positives = 20/37 (54%)

Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
          + E+ EE +KK K KK+       YE+E++E   D  
Sbjct: 25 EYEKLEELRKKLKAKKEDDSDDTDYEDEDEELYLDES 61



 Score = 28.6 bits (64), Expect = 6.0
 Identities = 8/32 (25%), Positives = 20/32 (62%)

Query: 26 ETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
          E  E+ ++ +KK K KK+    + + ++E+++
Sbjct: 24 EEYEKLEELRKKLKAKKEDDSDDTDYEDEDEE 55



 Score = 28.2 bits (63), Expect = 9.2
 Identities = 12/45 (26%), Positives = 22/45 (48%)

Query: 26  ETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLHNCYLYSYY 70
           + +E+K++ K +KK+   +K   EE  ++     LL     Y  Y
Sbjct: 164 KEKEKKEQLKIQKKQSLLEKYGGEEHLDKPPKELLLGQSEDYVEY 208


>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
          Length = 1066

 Score = 32.2 bits (73), Expect = 0.67
 Identities = 15/38 (39%), Positives = 26/38 (68%)

Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
          K+  TEEE ++KKKK++K K+K+  + +  ++E   KL
Sbjct: 10 KKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKL 47



 Score = 31.4 bits (71), Expect = 1.1
 Identities = 21/78 (26%), Positives = 31/78 (39%), Gaps = 23/78 (29%)

Query: 1  MNEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKK---------------------KK 39
            E   E+K   E+E+    + K+EE  +EK+ KK K                      K
Sbjct: 3  RTESEAEKKILTEEELE--RKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPK 60

Query: 40 KKKKKKRYEEEEKEEEKD 57
          K +KK R  + E E  +D
Sbjct: 61 KSEKKSRKRDVEDENPED 78


>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
          Length = 259

 Score = 31.6 bits (72), Expect = 0.69
 Identities = 8/34 (23%), Positives = 20/34 (58%)

Query: 23  KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
           ++E+       + ++ KKK++ +  +++E EE K
Sbjct: 75  EDEKELSASSLEAEQAKKKEEAEAKKKKEMEELK 108



 Score = 31.6 bits (72), Expect = 0.75
 Identities = 11/39 (28%), Positives = 23/39 (58%)

Query: 25  EETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLHN 63
             + E ++ KKK++ + KKKK  EE +  ++K ++ +  
Sbjct: 82  ASSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKE 120



 Score = 28.2 bits (63), Expect = 9.3
 Identities = 9/33 (27%), Positives = 19/33 (57%)

Query: 23  KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEE 55
           K E+ +E      + ++ KKK++   +++KE E
Sbjct: 73  KPEDEKELSASSLEAEQAKKKEEAEAKKKKEME 105


>gnl|CDD|203444 pfam06424, PRP1_N, PRP1 splicing factor, N-terminal.  This domain
           is specific to the N-terminal part of the prp1 splicing
           factor, which is involved in mRNA splicing (and possibly
           also poly(A)+ RNA nuclear export and cell cycle
           progression). This domain is specific to the N terminus
           of the RNA splicing factor encoded by prp1. It is
           involved in mRNA splicing and possibly also poly(A)and
           RNA nuclear export and cell cycle progression.
          Length = 131

 Score = 30.3 bits (69), Expect = 0.73
 Identities = 11/73 (15%), Positives = 38/73 (52%), Gaps = 11/73 (15%)

Query: 1   MNEKGGEEKED---QEKEVGGISRGK--EEETEEEKK------KKKKKKKKKKKKKRYEE 49
           ++    E+++    Q+ +  G+      ++E EE  +      ++  +++KK+++++ +E
Sbjct: 29  VDIDDEEDEDPKRYQDGDNEGLFSDGKYDDEDEEADRIYESIDERMDERRKKRREQKEKE 88

Query: 50  EEKEEEKDNKLLH 62
           E ++  ++N  + 
Sbjct: 89  EIEKYREENPKIQ 101


>gnl|CDD|220600 pfam10147, CR6_interact, Growth arrest and DNA-damage-inducible
           proteins-interacting protein 1.  Members of this family
           of proteins act as negative regulators of G1 to S cell
           cycle phase progression by inhibiting cyclin-dependent
           kinases. Inhibitory effects are additive with GADD45
           proteins but occur also in the absence of GADD45
           proteins. Furthermore, they act as a repressor of the
           orphan nuclear receptor NR4A1 by inhibiting AB
           domain-mediated transcriptional activity.
          Length = 217

 Score = 31.3 bits (71), Expect = 0.76
 Identities = 10/26 (38%), Positives = 20/26 (76%)

Query: 24  EEETEEEKKKKKKKKKKKKKKKRYEE 49
           +++ +EEKKK K+ K+++K++KR   
Sbjct: 185 QQKEKEEKKKVKEAKRREKEEKRMAA 210



 Score = 28.2 bits (63), Expect = 6.6
 Identities = 13/53 (24%), Positives = 24/53 (45%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
           EE  +Q+KE     + +E E  +   K  +     + +KR  E++    K+ K
Sbjct: 106 EENREQQKEKEARRQAREAEIAKNMAKMPQMIADWRAQKRKREQKARAAKERK 158


>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62. 
          Length = 217

 Score = 31.3 bits (71), Expect = 0.77
 Identities = 11/54 (20%), Positives = 25/54 (46%)

Query: 7  EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
           EK    K+ G      +  ++++  +K K   K +  +R ++   +E+K+ K 
Sbjct: 18 SEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKK 71



 Score = 30.1 bits (68), Expect = 1.9
 Identities = 15/40 (37%), Positives = 22/40 (55%), Gaps = 5/40 (12%)

Query: 2  NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKK 41
           EK     + Q  E     R K+  ++E+K++KKK KKKK
Sbjct: 43 IEKFKLLIKAQMAE-----RVKKLHSQEKKEEKKKPKKKK 77


>gnl|CDD|221952 pfam13166, AAA_13, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           conjugative transfer proteins. This family includes the
           PrrC protein that is thought to be the active component
           of the anticodon nuclease.
          Length = 713

 Score = 31.9 bits (73), Expect = 0.81
 Identities = 17/69 (24%), Positives = 31/69 (44%), Gaps = 3/69 (4%)

Query: 8   EKEDQEKEVGGISRGKEE---ETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLHNC 64
           E E Q +E+    +  EE   + E E +KK+++ +K K K   +  +K  +K +  L   
Sbjct: 92  EIEAQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKYDSNLSEA 151

Query: 65  YLYSYYRVT 73
                Y+  
Sbjct: 152 LKGLNYKKN 160



 Score = 30.0 bits (68), Expect = 2.7
 Identities = 14/42 (33%), Positives = 24/42 (57%)

Query: 23  KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLHNC 64
            E E + E+ KK+ KK ++K ++   E EK+EE+  K  +  
Sbjct: 91  IEIEAQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKF 132


>gnl|CDD|218148 pfam04557, tRNA_synt_1c_R2, Glutaminyl-tRNA synthetase,
          non-specific RNA binding region part 2.  This is a
          region found N terminal to the catalytic domain of
          glutaminyl-tRNA synthetase (EC 6.1.1.18) in eukaryotes
          but not in Escherichia coli. This region is thought to
          bind RNA in a non-specific manner, enhancing
          interactions between the tRNA and enzyme, but is not
          essential for enzyme function.
          Length = 83

 Score = 29.3 bits (66), Expect = 0.81
 Identities = 12/37 (32%), Positives = 18/37 (48%)

Query: 27 TEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLHN 63
          TE +  KKKKKKKKKK +      + ++     +   
Sbjct: 18 TEADLVKKKKKKKKKKAEDTAATAKAKKATAEDVSEG 54



 Score = 28.5 bits (64), Expect = 1.9
 Identities = 12/37 (32%), Positives = 19/37 (51%)

Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
          K E    +KKKKKKKKK +      + ++   E  ++
Sbjct: 17 KTEADLVKKKKKKKKKKAEDTAATAKAKKATAEDVSE 53



 Score = 26.6 bits (59), Expect = 8.3
 Identities = 8/30 (26%), Positives = 15/30 (50%)

Query: 30 EKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
          +KKKKKKK +      + ++   E+  +  
Sbjct: 26 KKKKKKKKAEDTAATAKAKKATAEDVSEGA 55


>gnl|CDD|206228 pfam14058, PcfK, PcfK-like protein.  The PcfK-like protein family
           includes the Enterococcus faecalis PcfK protein, which
           is functionally uncharacterized. This family of proteins
           is found in bacteria and viruses. Proteins in this
           family are typically between 137 and 257 amino acids in
           length. There are two completely conserved residues (D
           and L) that may be functionally important.
          Length = 136

 Score = 30.4 bits (69), Expect = 0.85
 Identities = 12/46 (26%), Positives = 22/46 (47%), Gaps = 7/46 (15%)

Query: 6   GEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEE 51
            EEK +  KE         +  ++E+ +K +K+ KK KK    + +
Sbjct: 92  EEEKAEARKE-------ALKAYQQEELRKIQKRSKKSKKAEPVQGQ 130


>gnl|CDD|197700 smart00398, HMG, high mobility group. 
          Length = 70

 Score = 28.8 bits (65), Expect = 0.85
 Identities = 12/29 (41%), Positives = 18/29 (62%)

Query: 27 TEEEKKKKKKKKKKKKKKKRYEEEEKEEE 55
             E++K   ++K KK K+RYEEE  E +
Sbjct: 42 LLSEEEKAPYEEKAKKDKERYEEEMPEYK 70


>gnl|CDD|222592 pfam14204, Ribosomal_L18_c, Ribosomal L18 C-terminal region.
          This domain is the C-terminal end of ribosomal L18/L5
          proteins.
          Length = 93

 Score = 29.4 bits (67), Expect = 0.88
 Identities = 9/27 (33%), Positives = 16/27 (59%)

Query: 31 KKKKKKKKKKKKKKKRYEEEEKEEEKD 57
          KKK+KK+ K + K+   ++   E+ K 
Sbjct: 62 KKKEKKEVKAESKRYNAKKLTLEQRKA 88



 Score = 28.7 bits (65), Expect = 1.6
 Identities = 9/24 (37%), Positives = 13/24 (54%)

Query: 30 EKKKKKKKKKKKKKKKRYEEEEKE 53
          EKK+ K + K+   KK   E+ K 
Sbjct: 65 EKKEVKAESKRYNAKKLTLEQRKA 88


>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon. 
          Length = 431

 Score = 31.5 bits (71), Expect = 0.94
 Identities = 22/62 (35%), Positives = 32/62 (51%), Gaps = 6/62 (9%)

Query: 1   MNEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKK------KKKKKKRYEEEEKEE 54
           M E  G E E  E+E    SR + EE EE +   K ++K      ++ +K+  E E +EE
Sbjct: 100 MQEDSGAENETVEEEEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKEPEPEEE 159

Query: 55  EK 56
           EK
Sbjct: 160 EK 161



 Score = 31.2 bits (70), Expect = 1.1
 Identities = 18/54 (33%), Positives = 30/54 (55%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
           E  E++EKE     R + EETE   K ++K   +  ++ + EE+E E E++ K 
Sbjct: 109 ETVEEEEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKEPEPEEEEKP 162


>gnl|CDD|185618 PTZ00438, PTZ00438, gamete antigen 27/25-like protein; Provisional.
          Length = 374

 Score = 31.6 bits (71), Expect = 0.95
 Identities = 16/64 (25%), Positives = 33/64 (51%), Gaps = 6/64 (9%)

Query: 3   EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLH 62
           E+G +++E+++++V  I   +E E  EE+    +  +K       +E+E + E D   L 
Sbjct: 102 ERGTQKEEEEDEDVEEIEEVEEVEVVEEEYDDDEDSEKD------DEKESDAEGDENELA 155

Query: 63  NCYL 66
             Y+
Sbjct: 156 GEYI 159


>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
           binding fragment.  Topoisomerase I promotes the
           relaxation of DNA superhelical tension by introducing a
           transient single-stranded break in duplex DNA and are
           vital for the processes of replication, transcription,
           and recombination. This family may be more than one
           structural domain.
          Length = 215

 Score = 31.0 bits (71), Expect = 0.95
 Identities = 13/28 (46%), Positives = 19/28 (67%)

Query: 29  EEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
           E +K+KKK   K++KK   EE++K EE 
Sbjct: 96  EAEKEKKKAMSKEEKKAIKEEKDKLEEP 123


>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
          includes archaeal L12p, the protein that is
          functionally equivalent to L7/L12 in bacteria and the
          P1 and P2 proteins in eukaryotes. L12p is homologous to
          P1 and P2 but is not homologous to bacterial L7/L12. It
          is located in the L12 stalk, with proteins L10, L11,
          and 23S rRNA. L12p is the only protein in the ribosome
          to occur as multimers, always appearing as sets of
          dimers. Recent data indicate that most archaeal species
          contain six copies of L12p (three homodimers), while
          eukaryotes have four copies (two heterodimers), and
          bacteria may have four or six copies (two or three
          homodimers), depending on the species. The organization
          of proteins within the stalk has been characterized
          primarily in bacteria, where L7/L12 forms either two or
          three homodimers and each homodimer binds to the
          extended C-terminal helix of L10. L7/L12 is attached to
          the ribosome through L10 and is the only ribosomal
          protein that does not directly interact with rRNA.
          Archaeal L12p is believed to function in a similar
          fashion. However, hybrid ribosomes containing the large
          subunit from E. coli with an archaeal stalk are able to
          bind archaeal and eukaryotic elongation factors but not
          bacterial elongation factors. In several mesophilic and
          thermophilic archaeal species, the binding of 23S rRNA
          to protein L11 and to the L10/L12p pentameric complex
          was found to be temperature-dependent and cooperative.
          Length = 106

 Score = 29.8 bits (67), Expect = 0.96
 Identities = 10/26 (38%), Positives = 21/26 (80%)

Query: 32 KKKKKKKKKKKKKKRYEEEEKEEEKD 57
            ++K ++K+++KK+ EE+E+EEE+ 
Sbjct: 73 AAEEKAEEKEEEKKKEEEKEEEEEEA 98



 Score = 29.4 bits (66), Expect = 1.2
 Identities = 11/27 (40%), Positives = 20/27 (74%)

Query: 29 EEKKKKKKKKKKKKKKKRYEEEEKEEE 55
              ++K ++K+++KKK  E+EE+EEE
Sbjct: 71 AAAAEEKAEEKEEEKKKEEEKEEEEEE 97



 Score = 29.4 bits (66), Expect = 1.2
 Identities = 12/24 (50%), Positives = 20/24 (83%)

Query: 28 EEEKKKKKKKKKKKKKKKRYEEEE 51
           EEK ++K+++KKK+++K  EEEE
Sbjct: 74 AEEKAEEKEEEKKKEEEKEEEEEE 97



 Score = 29.0 bits (65), Expect = 1.6
 Identities = 10/27 (37%), Positives = 20/27 (74%)

Query: 24 EEETEEEKKKKKKKKKKKKKKKRYEEE 50
              EE+ ++K+++KKK+++K+  EEE
Sbjct: 71 AAAAEEKAEEKEEEKKKEEEKEEEEEE 97


>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
           N-terminal.  SRP is a complex of six distinct
           polypeptides and a 7S RNA that is essential for
           transferring nascent polypeptide chains that are
           destined for export from the cell to the translocation
           apparatus of the endoplasmic reticulum (ER) membrane.
           SRP binds hydrophobic signal sequences as they emerge
           from the ribosome, and arrests translation.
          Length = 272

 Score = 31.3 bits (71), Expect = 0.96
 Identities = 11/40 (27%), Positives = 17/40 (42%)

Query: 21  RGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
                  +E+  K K   KK  KK R  + + +E+ D  L
Sbjct: 188 SSTASSGDEKSPKSKAAPKKAGKKMRKWDLDGDEDDDAVL 227


>gnl|CDD|193475 pfam13002, LDB19, Arrestin_N terminal like.  This is a family of
           proteins related to the Arrestin_N terminal family.
          Length = 183

 Score = 30.6 bits (69), Expect = 1.0
 Identities = 18/65 (27%), Positives = 27/65 (41%), Gaps = 3/65 (4%)

Query: 114 PVLLSPGIHSFPFKLGLPLGLPSTFLGKHGWVQYFCKAALREPT---GYTHKNQQVFIIM 170
              LS G HS+PF    P  LP++       V+Y   A +   T   G +    Q+  + 
Sbjct: 38  TTDLSVGSHSYPFSYLFPGSLPASTSNSETQVKYELIATVTYKTPRRGISPSKPQLLKLN 97

Query: 171 SPIDL 175
            PI +
Sbjct: 98  MPIAV 102


>gnl|CDD|223403 COG0326, HtpG, Molecular chaperone, HSP90 family [Posttranslational
           modification, protein turnover, chaperones].
          Length = 623

 Score = 31.5 bits (72), Expect = 1.0
 Identities = 11/45 (24%), Positives = 22/45 (48%)

Query: 9   KEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKE 53
            E + K    I++G  +    E++ +   +++KK+ K   E  KE
Sbjct: 469 PEFEGKPFKSITKGDLDLELLEEEDEADSEEEKKEFKPLLERVKE 513


>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota).  DNA
           Topoisomerase I (eukaryota), DNA topoisomerase V,
           Vaccina virus topoisomerase, Variola virus
           topoisomerase, Shope fibroma virus topoisomeras.
          Length = 391

 Score = 31.2 bits (71), Expect = 1.0
 Identities = 15/64 (23%), Positives = 28/64 (43%), Gaps = 3/64 (4%)

Query: 1   MNEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKK---KKKKKKKRYEEEEKEEEKD 57
           + EK    K   ++    I   +     + K K K ++   K   + K  ++E+K+EEK 
Sbjct: 282 LQEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKK 341

Query: 58  NKLL 61
            K +
Sbjct: 342 KKQI 345



 Score = 30.4 bits (69), Expect = 2.0
 Identities = 20/57 (35%), Positives = 30/57 (52%), Gaps = 1/57 (1%)

Query: 7   EEKEDQEKE-VGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLH 62
           E   D +++      R  E+   E K+KKK+KKK++KKKK+ E  E+  EK      
Sbjct: 304 EMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEERIEKLEVQAT 360


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 31.6 bits (72), Expect = 1.1
 Identities = 12/40 (30%), Positives = 21/40 (52%)

Query: 20   SRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
                EE+   ++++ K K K K  K R  + +K+E+K  K
Sbjct: 1144 QEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKK 1183


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 31.7 bits (72), Expect = 1.1
 Identities = 16/62 (25%), Positives = 31/62 (50%), Gaps = 7/62 (11%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKK-------KKKKKKKKKRYEEEEKEEEKDNK 59
           EEKE  E+    +   +  E E  K ++++        ++ ++K +R EE E+E E+  +
Sbjct: 246 EEKERLEELKARLLEIESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELEE 305

Query: 60  LL 61
            L
Sbjct: 306 EL 307



 Score = 29.3 bits (66), Expect = 5.5
 Identities = 13/58 (22%), Positives = 27/58 (46%), Gaps = 4/58 (6%)

Query: 10  EDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKK----KKRYEEEEKEEEKDNKLLHN 63
            + E       + +EEE  E ++  ++ ++K ++    ++  EE E+E E    LL  
Sbjct: 259 LEIESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELEEELEGLRALLEE 316



 Score = 29.0 bits (65), Expect = 6.5
 Identities = 14/51 (27%), Positives = 26/51 (50%), Gaps = 4/51 (7%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
           E  ED E  +       EEE +E KK ++ ++++++++   E E  EE   
Sbjct: 196 ELLEDIEDLL----EALEEELKELKKLEEIQEEQEEEELEQEIEALEERLA 242



 Score = 28.6 bits (64), Expect = 8.3
 Identities = 10/53 (18%), Positives = 24/53 (45%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
            E E+ E+E+         + E E+  +   ++ ++K +  E E + E +  +
Sbjct: 626 NELEEAEEELESELEKLNLQAELEELLQAALEELEEKVEELEAEIRRELQRIE 678


>gnl|CDD|149438 pfam08374, Protocadherin, Protocadherin.  The structure of
           protocadherins is similar to that of classic cadherins
           (pfam00028), but particularly on the cytoplasmic domains
           they also have some unique features. They are expressed
           in a variety of organisms and are found in high
           concentrations in the brain where they seem to be
           localised mainly at cell-cell contact sites. Their
           expression seems to be developmentally regulated.
          Length = 223

 Score = 31.0 bits (70), Expect = 1.1
 Identities = 17/40 (42%), Positives = 21/40 (52%), Gaps = 6/40 (15%)

Query: 12  QEKEVGGISRGKEEETE------EEKKKKKKKKKKKKKKK 45
           Q +   G   GK+E  +      E K+KKKKK KKKK  K
Sbjct: 65  QAERKKGYQAGKKETEDWFSPNQENKQKKKKKDKKKKSPK 104


>gnl|CDD|146145 pfam03357, Snf7, Snf7.  This family of proteins are involved in
          protein sorting and transport from the endosome to the
          vacuole/lysosome in eukaryotic cells.
          Vacuoles/lysosomes play an important role in the
          degradation of both lipids and cellular proteins. In
          order to perform this degradative function,
          vacuoles/lysosomes contain numerous hydrolases which
          have been transported in the form of inactive
          precursors via the biosynthetic pathway and are
          proteolytically activated upon delivery to the
          vacuole/lysosome. The delivery of transmembrane
          proteins, such as activated cell surface receptors to
          the lumen of the vacuole/lysosome, either for
          degradation/downregulation, or in the case of
          hydrolases, for proper localisation, requires the
          formation of multivesicular bodies (MVBs). These late
          endosomal structures are formed by invaginating and
          budding of the limiting membrane into the lumen of the
          compartment. During this process, a subset of the
          endosomal membrane proteins is sorted into the forming
          vesicles. Mature MVBs fuse with the vacuole/lysosome,
          thereby releasing cargo containing vesicles into its
          hydrolytic lumen for degradation. Endosomal proteins
          that are not sorted into the intralumenal MVB vesicles
          are either recycled back to the plasma membrane or
          Golgi complex, or remain in the limiting membrane of
          the MVB and are thereby transported to the limiting
          membrane of the vacuole/lysosome as a consequence of
          fusion. Therefore, the MVB sorting pathway plays a
          critical role in the decision between recycling and
          degradation of membrane proteins. A few archaeal
          sequences are also present within this family.
          Length = 169

 Score = 30.3 bits (69), Expect = 1.1
 Identities = 13/35 (37%), Positives = 19/35 (54%), Gaps = 7/35 (20%)

Query: 23 KEEETEEEKKKKKKKKKKK-------KKKKRYEEE 50
          K+ +  E + KK  KK  K       K+KKRYE++
Sbjct: 22 KKIKKLEAEIKKLAKKGNKDAALILLKQKKRYEKQ 56


>gnl|CDD|239570 cd03488, Topoisomer_IB_N_htopoI_like, Topoisomer_IB_N_htopoI_like :
           N-terminal DNA binding fragment found in eukaryotic DNA
           topoisomerase (topo) IB proteins similar to the
           monomeric yeast and human topo I.  Topo I enzymes are
           divided into:  topo type IA (bacterial) and type IB
           (eukaryotic). Topo I relaxes superhelical tension in
           duplex DNA by creating a single-strand nick, the broken
           strand can then rotate around the unbroken strand to
           remove DNA supercoils and, the nick is religated,
           liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit religation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  This family may represent more than
           one structural domain.
          Length = 215

 Score = 30.8 bits (70), Expect = 1.2
 Identities = 11/28 (39%), Positives = 19/28 (67%)

Query: 29  EEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
           + +K++KK   K++KK    E+EK EE+
Sbjct: 95  KAQKEEKKAMSKEEKKAIKAEKEKLEEE 122


>gnl|CDD|219882 pfam08524, rRNA_processing, rRNA processing.  This is a family of
           proteins that are involved in rRNA processing. In a
           localisation study they were found to localise to the
           nucleus and nucleolus. The family also includes other
           metazoa members from plants to mammals where the protein
           has been named BR22 and is associated with TTF-1,
           thyroid transcription factor 1. In the lungs, the family
           binds TTF-1 to form a complex which influences the
           expression of the key lung surfactant protein-B (SP-B)
           and -C (SP-C), the small hydrophobic surfactant proteins
           that maintain surface tension in alveoli.
          Length = 150

 Score = 30.2 bits (68), Expect = 1.2
 Identities = 19/53 (35%), Positives = 32/53 (60%), Gaps = 3/53 (5%)

Query: 8   EKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
           EKE  EK+V      KE+   E+KKK  +KK+  K++KR + E++  ++  +L
Sbjct: 51  EKESAEKQVKSS---KEDRKFEKKKKLDEKKEIAKQRKREQREKELAKRQKEL 100



 Score = 27.5 bits (61), Expect = 9.7
 Identities = 15/50 (30%), Positives = 24/50 (48%)

Query: 7  EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
          E     EKE         +E  + +KKKK  +KK+  K+R  E+ ++E  
Sbjct: 45 EGYAVPEKESAEKQVKSSKEDRKFEKKKKLDEKKEIAKQRKREQREKELA 94


>gnl|CDD|221466 pfam12220, U1snRNP70_N, U1 small nuclear ribonucleoprotein of
          70kDa MW N terminal.  This domain is found in
          eukaryotes. This domain is about 90 amino acids in
          length. This domain is found associated with pfam00076.
          This domain is part of U1 snRNP, which is the pre-mRNA
          binding protein of the penta-snRNP spliceosome complex.
          It extends over a distance of 180 A from its RNA
          binding domain, wraps around the core domain of U1
          snRNP consisting of the seven Sm proteins and finally
          contacts U1-C, which is crucial for 5'-splice-site
          recognition.
          Length = 94

 Score = 29.2 bits (66), Expect = 1.4
 Identities = 14/28 (50%), Positives = 23/28 (82%)

Query: 26 ETEEEKKKKKKKKKKKKKKKRYEEEEKE 53
          ET  EK++++K++KK+K +K+ EEE KE
Sbjct: 60 ETWLEKREREKREKKEKLEKKLEEELKE 87


>gnl|CDD|214818 smart00784, SPT2, SPT2 chromatin protein.  This entry includes the
           Saccharomyces cerevisiae protein SPT2 which is a
           chromatin protein involved in transcriptional
           regulation.
          Length = 106

 Score = 29.3 bits (66), Expect = 1.4
 Identities = 11/42 (26%), Positives = 29/42 (69%)

Query: 3   EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKK 44
           E GG + +++E+    ++R ++ E E  +K+++++K+ +K+K
Sbjct: 65  EAGGADIQEEERRSARLARLEDREEERLEKEEEREKRARKRK 106


>gnl|CDD|219978 pfam08701, GN3L_Grn1, GNL3L/Grn1 putative GTPase.  Grn1 (yeast)
          and GNL3L (human) are putative GTPases which are
          required for growth and play a role in processing of
          nucleolar pre-rRNA. This family contains a potential
          nuclear localisation signal.
          Length = 80

 Score = 28.8 bits (65), Expect = 1.4
 Identities = 10/31 (32%), Positives = 23/31 (74%)

Query: 25 EETEEEKKKKKKKKKKKKKKKRYEEEEKEEE 55
          EE EE+K+K++++K+++K+ ++ E  E  + 
Sbjct: 49 EEIEEKKRKQEEEKERRKEARKAERAEARKR 79



 Score = 27.6 bits (62), Expect = 3.2
 Identities = 9/29 (31%), Positives = 21/29 (72%)

Query: 29 EEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
          EEKK+K++++K+++K+ R  E  +  ++ 
Sbjct: 52 EEKKRKQEEEKERRKEARKAERAEARKRG 80


>gnl|CDD|187774 cd09643, Csn1, CRISPR/Cas system-associated protein Cas9.  CRISPR
           (Clustered Regularly Interspaced Short Palindromic
           Repeats) and associated Cas proteins comprise a system
           for heritable host defense by prokaryotic cells against
           phage and other foreign DNA; Very large protein
           containing McrA/HNH-nuclease related domain and a
           RuvC-like nuclease domain; signature gene for type II.
          Length = 799

 Score = 31.2 bits (71), Expect = 1.4
 Identities = 18/63 (28%), Positives = 28/63 (44%), Gaps = 5/63 (7%)

Query: 19  ISRGKEEETEEEKKKKKKKKKK-----KKKKKRYEEEEKEEEKDNKLLHNCYLYSYYRVT 73
           I   +E  T +  K +KK++KK     K+     E++ KE   D K  +   L  YY+  
Sbjct: 518 IEMARENGTNKGTKNRKKRQKKNEDNIKEAASALEQKLKELPLDIKSKNILKLRLYYQQN 577

Query: 74  HIC 76
             C
Sbjct: 578 GKC 580


>gnl|CDD|218684 pfam05672, MAP7, MAP7 (E-MAP-115) family.  The organisation of
           microtubules varies with the cell type and is presumably
           controlled by tissue-specific microtubule-associated
           proteins (MAPs). The 115-kDa epithelial MAP
           (E-MAP-115/MAP7) has been identified as a
           microtubule-stabilising protein predominantly expressed
           in cell lines of epithelial origin. The binding of this
           microtubule associated protein is nucleotide
           independent.
          Length = 171

 Score = 30.1 bits (67), Expect = 1.5
 Identities = 17/54 (31%), Positives = 36/54 (66%), Gaps = 4/54 (7%)

Query: 3   EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
            +  EE+  +E+E    +R +EEE   EK++K K+K ++++K+  EE+E+ +++
Sbjct: 62  RRAAEERLRREEE----ARRQEEERAREKEEKAKRKAEEEEKQEQEEQERIQKQ 111



 Score = 28.9 bits (64), Expect = 3.3
 Identities = 17/60 (28%), Positives = 37/60 (61%), Gaps = 3/60 (5%)

Query: 3   EKGGEEKEDQEKEVGGISRGKEEET---EEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
           E+   E+E+ ++        +EEE    EEE+ ++K++K K+K ++  ++E++E+E+  K
Sbjct: 51  EQDRLEREELKRRAAEERLRREEEARRQEEERAREKEEKAKRKAEEEEKQEQEEQERIQK 110



 Score = 27.8 bits (61), Expect = 9.1
 Identities = 15/54 (27%), Positives = 30/54 (55%), Gaps = 1/54 (1%)

Query: 3   EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
           E+   EKE++ K        K+E+ E+E+ +K+K++ + + ++  E    E EK
Sbjct: 80  EERAREKEEKAKRKAEE-EEKQEQEEQERIQKQKEEAEARAREEAERMRLEREK 132


>gnl|CDD|219953 pfam08648, DUF1777, Protein of unknown function (DUF1777).  This is
           a family of eukaryotic proteins of unknown function.
           Some of the proteins in this family are putative nucleic
           acid binding proteins.
          Length = 158

 Score = 29.8 bits (67), Expect = 1.6
 Identities = 10/40 (25%), Positives = 23/40 (57%)

Query: 20  SRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
           SR +      ++K+++ K  ++ KK+  ++  KEE+ + K
Sbjct: 61  SRSRSPSRRRDRKRERDKDAREPKKRERQKLIKEEDLEGK 100


>gnl|CDD|227925 COG5638, COG5638, Uncharacterized conserved protein [Function
           unknown].
          Length = 622

 Score = 30.9 bits (69), Expect = 1.6
 Identities = 12/55 (21%), Positives = 26/55 (47%)

Query: 5   GGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
              E E  +K    I +   +  E ++ +K+++ KK K     + ++K+E  + K
Sbjct: 420 ASNESESDDKLETTIEKLDRKLRERQENRKERQLKKTKDDSDVDLKDKKESINKK 474


>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207).  This
           family is found in eukaryotes; it has several conserved
           tryptophan residues. The function is not known.
          Length = 261

 Score = 30.5 bits (69), Expect = 1.6
 Identities = 10/53 (18%), Positives = 31/53 (58%), Gaps = 2/53 (3%)

Query: 9   KEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEE--EEKEEEKDNK 59
           K+ Q ++       ++++ E EK++++ + +++  K++YEE   +K ++   +
Sbjct: 96  KQAQRQKKLQKLLEEKQKQEREKEREEAELRQRLAKEKYEEWCRQKAQQAAKQ 148



 Score = 30.1 bits (68), Expect = 1.9
 Identities = 8/29 (27%), Positives = 26/29 (89%)

Query: 25  EETEEEKKKKKKKKKKKKKKKRYEEEEKE 53
           ++ +++++K++++++K++KK++ EEE K+
Sbjct: 190 KKLKQQQQKREEERRKQRKKQQEEEERKQ 218



 Score = 29.7 bits (67), Expect = 2.7
 Identities = 9/32 (28%), Positives = 28/32 (87%)

Query: 25  EETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
           +E E +K K++++K++++++K+ +++++EEE+
Sbjct: 185 QEWELKKLKQQQQKREEERRKQRKKQQEEEER 216



 Score = 29.3 bits (66), Expect = 3.2
 Identities = 11/37 (29%), Positives = 28/37 (75%)

Query: 23  KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
           K+   E E KK K++++K+++++R + ++++EE++ K
Sbjct: 181 KKRLQEWELKKLKQQQQKREEERRKQRKKQQEEEERK 217


>gnl|CDD|240709 cd12263, RRM_ABT1_like, RNA recognition motif found in activator
          of basal transcription 1 (ABT1) and similar proteins.
          This subfamily corresponds to the RRM of novel nuclear
          proteins termed ABT1 and its homologous counterpart,
          pre-rRNA-processing protein ESF2 (eighteen S factor 2),
          from yeast. ABT1 associates with the TATA-binding
          protein (TBP) and enhances basal transcription activity
          of class II promoters. Meanwhile, ABT1 could be a
          transcription cofactor that can bind to DNA in a
          sequence-independent manner. The yeast ABT1 homolog,
          ESF2, is a component of 90S preribosomes and 5'
          ETS-based RNPs. It is previously identified as a
          putative partner of the TATA-element binding protein.
          However, it is primarily localized to the nucleolus and
          physically associates with pre-rRNA processing factors.
          ESF2 may play a role in ribosome biogenesis. It is
          required for normal pre-rRNA processing, as well as for
          SSU processome assembly and function. Both ABT1 and
          ESF2 contain an RNA recognition motif (RRM), also
          termed RBD (RNA binding domain) or RNP
          (ribonucleoprotein domain). .
          Length = 98

 Score = 28.7 bits (65), Expect = 1.6
 Identities = 10/22 (45%), Positives = 16/22 (72%)

Query: 28 EEEKKKKKKKKKKKKKKKRYEE 49
          E+  K+K++KKK   KKK++ E
Sbjct: 35 EDPAKRKRRKKKGGNKKKKFTE 56


>gnl|CDD|234402 TIGR03928, T7_EssCb_Firm, type VII secretion protein EssC,
           C-terminal domain.  This model describes the C-terminal
           domain, or longer subunit, of the Firmicutes type VII
           secretion protein EssC. This protein (homologous to EccC
           in Actinobacteria) and the WXG100 target proteins are
           the only homologous parts of type VII secretion between
           Firmicutes and Actinobacteria [Protein fate, Protein and
           peptide secretion and trafficking].
          Length = 1296

 Score = 30.7 bits (70), Expect = 1.7
 Identities = 12/43 (27%), Positives = 19/43 (44%), Gaps = 8/43 (18%)

Query: 31  KKKKKKKKKKKKKKKRYE--------EEEKEEEKDNKLLHNCY 65
           ++KKK KK  +K+ + Y         E +   EK   +LH   
Sbjct: 84  REKKKYKKDVEKRNRSYRLYLDKKRKELQALSEKQRHVLHYHN 126


>gnl|CDD|236295 PRK08570, rpl19e, 50S ribosomal protein L19e; Reviewed.
          Length = 150

 Score = 29.4 bits (67), Expect = 1.9
 Identities = 15/44 (34%), Positives = 26/44 (59%), Gaps = 3/44 (6%)

Query: 12 QEKEVGGISRGKEEETEEEKKKKKKK---KKKKKKKKRYEEEEK 52
          + K   GISRG+  E  E++KK +++    +K KK  R  ++E+
Sbjct: 51 KAKPKKGISRGRARERHEKRKKGRRRGPGSRKGKKGARTPKKER 94


>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
           [General function prediction only].
          Length = 239

 Score = 30.0 bits (68), Expect = 1.9
 Identities = 15/62 (24%), Positives = 27/62 (43%), Gaps = 9/62 (14%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKK--KKKKKKKKKKRYE----EEEKEEEKDNKL 60
            E ED E +V   S+ + E  E  ++ K+  +K    K ++       E +  +E+ N L
Sbjct: 52  IELEDLENQV---SQLESEIQEIRERIKRAEEKLSAVKDERELRALNIEIQIAKERINSL 108

Query: 61  LH 62
             
Sbjct: 109 ED 110


>gnl|CDD|237744 PRK14521, rpsP, 30S ribosomal protein S16; Provisional.
          Length = 186

 Score = 29.7 bits (67), Expect = 1.9
 Identities = 10/56 (17%), Positives = 20/56 (35%), Gaps = 8/56 (14%)

Query: 9   KEDQEKEVGGISRGKEEETEEEKKKK--------KKKKKKKKKKKRYEEEEKEEEK 56
           KE++E +V        +  +  KK          + + +   +KK  E      E+
Sbjct: 107 KEEKEGKVNAKKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAAAVAAEE 162



 Score = 28.6 bits (64), Expect = 4.5
 Identities = 14/47 (29%), Positives = 23/47 (48%), Gaps = 3/47 (6%)

Query: 17  GGISRGKEEETEEEKK---KKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
           GG+++G   E + E K    K++K+ K   KK    + K+  K   L
Sbjct: 87  GGVAKGAFTEAQAEAKFEAWKEEKEGKVNAKKDKLSKAKKAAKKAAL 133


>gnl|CDD|151885 pfam11446, DUF2897, Protein of unknown function (DUF2897).  This
          is a bacterial family of uncharacterized proteins.
          Length = 55

 Score = 27.6 bits (62), Expect = 1.9
 Identities = 10/26 (38%), Positives = 15/26 (57%)

Query: 32 KKKKKKKKKKKKKKRYEEEEKEEEKD 57
          K  +  K+KKKK KR  E++ +   D
Sbjct: 30 KFPQFDKQKKKKAKRNAEQDAKNSAD 55


>gnl|CDD|227594 COG5269, ZUO1, Ribosome-associated chaperone zuotin [Translation,
           ribosomal structure and biogenesis / Posttranslational
           modification, protein turnover, chaperones].
          Length = 379

 Score = 30.4 bits (68), Expect = 1.9
 Identities = 13/43 (30%), Positives = 19/43 (44%)

Query: 10  EDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEK 52
           ++QEKE+  I + + E     K     K K + K K   E E 
Sbjct: 236 KEQEKEMKKIRKWEREAGARLKALAALKGKAEAKNKAEIEAEA 278


>gnl|CDD|151665 pfam11223, DUF3020, Protein of unknown function (DUF3020).  This
          family of fungal proteins is conserved towards the
          C-terminus of HMG domains. The function is not known.
          Length = 49

 Score = 27.6 bits (61), Expect = 2.0
 Identities = 11/35 (31%), Positives = 16/35 (45%)

Query: 24 EEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDN 58
           E   E  K    + + KKK K+   +E  +EKD 
Sbjct: 9  REANSERNKDNDLRSRVKKKAKKKFGKEDSKEKDA 43



 Score = 27.2 bits (60), Expect = 2.2
 Identities = 14/45 (31%), Positives = 22/45 (48%), Gaps = 6/45 (13%)

Query: 7  EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEE 51
          E   ++ K+    SR K+      K KKK  K+  K+K  Y ++E
Sbjct: 10 EANSERNKDNDLRSRVKK------KAKKKFGKEDSKEKDAYIDDE 48



 Score = 25.6 bits (56), Expect = 8.8
 Identities = 9/31 (29%), Positives = 17/31 (54%)

Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKE 53
            E  ++   + + KKK KKK  + + +EK+
Sbjct: 12 NSERNKDNDLRSRVKKKAKKKFGKEDSKEKD 42


>gnl|CDD|218652 pfam05602, CLPTM1, Cleft lip and palate transmembrane protein 1
           (CLPTM1).  This family consists of several eukaryotic
           cleft lip and palate transmembrane protein 1 sequences.
           Cleft lip with or without cleft palate is a common birth
           defect that is genetically complex. The nonsyndromic
           forms have been studied genetically using linkage and
           candidate-gene association studies with only partial
           success in defining the loci responsible for orofacial
           clefting. CLPTM1 encodes a transmembrane protein and has
           strong homology to two Caenorhabditis elegans genes,
           suggesting that CLPTM1 may belong to a new gene family.
           This family also contains the human cisplatin resistance
           related protein CRR9p which is associated with
           CDDP-induced apoptosis.
          Length = 437

 Score = 30.3 bits (69), Expect = 2.0
 Identities = 12/37 (32%), Positives = 18/37 (48%)

Query: 35  KKKKKKKKKKKRYEEEEKEEEKDNKLLHNCYLYSYYR 71
             KKK KKK     + EKEE ++ K      + S++ 
Sbjct: 148 LPKKKVKKKNLLGGKSEKEEPEEEKTPAPDKIVSHWH 184


>gnl|CDD|238037 cd00084, HMG-box, High Mobility Group (HMG)-box is found in a
          variety of eukaryotic chromosomal proteins and
          transcription factors. HMGs bind to the minor groove of
          DNA and have been classified by DNA binding
          preferences. Two phylogenically distinct groups of
          Class I proteins bind DNA in a sequence specific
          fashion and contain a single HMG box. One group
          (SOX-TCF) includes transcription factors, TCF-1, -3,
          -4; and also SRY and LEF-1, which bind four-way DNA
          junctions and duplex DNA targets. The second group
          (MATA) includes fungal mating type gene products MC,
          MATA1 and Ste11. Class II and III proteins (HMGB-UBF)
          bind DNA in a non-sequence specific fashion and contain
          two or more tandem HMG boxes. Class II members include
          non-histone chromosomal proteins, HMG1 and HMG2, which
          bind to bent or distorted DNA such as four-way DNA
          junctions, synthetic DNA cruciforms, kinked
          cisplatin-modified DNA, DNA bulges, cross-overs in
          supercoiled DNA, and can cause looping of linear DNA.
          Class III members include nucleolar and mitochondrial
          transcription factors, UBF and mtTF1, which bind
          four-way DNA junctions.
          Length = 66

 Score = 28.0 bits (63), Expect = 2.0
 Identities = 17/46 (36%), Positives = 29/46 (63%), Gaps = 2/46 (4%)

Query: 9  KEDQEKEVGGISR--GKEEETEEEKKKKKKKKKKKKKKKRYEEEEK 52
           E+    VG IS+  G+  ++  E++KKK ++K +K K+RYE+E  
Sbjct: 21 AENPGLSVGEISKILGEMWKSLSEEEKKKYEEKAEKDKERYEKEMP 66


>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
          (DUF2058).  This domain, found in various prokaryotic
          proteins, has no known function.
          Length = 177

 Score = 29.5 bits (67), Expect = 2.2
 Identities = 9/27 (33%), Positives = 19/27 (70%)

Query: 31 KKKKKKKKKKKKKKKRYEEEEKEEEKD 57
          KK KK+K+K++K+ ++  ++  +E K 
Sbjct: 18 KKAKKEKRKQRKQARKGADDGDDELKQ 44



 Score = 27.6 bits (62), Expect = 9.3
 Identities = 8/30 (26%), Positives = 20/30 (66%)

Query: 30 EKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
          +K+K+K++K+ +K     ++E K+  ++ K
Sbjct: 21 KKEKRKQRKQARKGADDGDDELKQAAEEAK 50


>gnl|CDD|203738 pfam07716, bZIP_2, Basic region leucine zipper. 
          Length = 54

 Score = 27.6 bits (62), Expect = 2.2
 Identities = 13/46 (28%), Positives = 27/46 (58%), Gaps = 8/46 (17%)

Query: 26 ETEEEKKKKKK-----KKKKKKKKKRYEEEE---KEEEKDNKLLHN 63
          + +E + ++++     ++ ++KKK+R EE E   KE E++N  L  
Sbjct: 1  KDDEYRDRRRRNNEAARRSREKKKQREEELEERVKELEEENAQLRQ 46


>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain.
          This family represents the C-terminus (approximately
          300 residues) of proteins that are involved as binding
          partners for Prp19 as part of the nuclear pore complex.
          The family in Drosophila is necessary for pre-mRNA
          splicing, and the human protein has been found in
          purifications of the spliceosome. In the past this
          family was thought, erroneously, to be associated with
          microfibrillin.
          Length = 277

 Score = 29.9 bits (67), Expect = 2.2
 Identities = 16/48 (33%), Positives = 28/48 (58%), Gaps = 4/48 (8%)

Query: 12 QEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
          QE+E        +E+  EE+ K+K +++K++  K  EEE K+E +  K
Sbjct: 51 QERER----EAAKEKALEEEAKRKAEERKRETLKIVEEEVKKELELKK 94


>gnl|CDD|215590 PLN03123, PLN03123, poly [ADP-ribose] polymerase; Provisional.
          Length = 981

 Score = 30.5 bits (69), Expect = 2.2
 Identities = 11/56 (19%), Positives = 24/56 (42%), Gaps = 3/56 (5%)

Query: 5   GGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
              E+  QE + G     ++++   + K KK K  +         ++K  + ++KL
Sbjct: 193 EKAEERKQESKKG---AKRKKDASGDDKSKKAKTDRDVSTSTAASQKKSSDLESKL 245


>gnl|CDD|224530 COG1614, CdhC, CO dehydrogenase/acetyl-CoA synthase beta subunit
           [Energy production and conversion].
          Length = 470

 Score = 30.2 bits (68), Expect = 2.3
 Identities = 13/54 (24%), Positives = 27/54 (50%)

Query: 27  TEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLHNCYLYSYYRVTHICFSDP 80
           T+E K ++  +  ++  KKR E  +   ++D  + + C +   +  TH+C   P
Sbjct: 153 TDEAKVEEFLETAREIYKKRDERTKALSDEDVDVFYGCTMCQSFAPTHVCVVTP 206


>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain.  Axonemal
           dynein light chain proteins play a dynamic role in
           flagellar and cilia motility. Eukaryotic cilia and
           flagella are complex organelles consisting of a core
           structure, the axoneme, which is composed of nine
           microtubule doublets forming a cylinder that surrounds a
           pair of central singlet microtubules. This
           ultra-structural arrangement seems to be one of the most
           stable micro-tubular assemblies known and is responsible
           for the flagellar and ciliary movement of a large number
           of organisms ranging from protozoan to mammals. This
           light chain interacts directly with the N-terminal half
           of the heavy chains.
          Length = 189

 Score = 29.5 bits (67), Expect = 2.6
 Identities = 18/53 (33%), Positives = 31/53 (58%), Gaps = 3/53 (5%)

Query: 4   KGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
           +  + K + E+E   I + +EE+ E EK+  + + K +  +KR EEE + EEK
Sbjct: 117 QAEQGKSELEQE---IKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEK 166


>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
          Length = 213

 Score = 29.5 bits (67), Expect = 2.6
 Identities = 7/34 (20%), Positives = 19/34 (55%)

Query: 23  KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
           +E   E+EK  ++++K K K  ++  +   ++ +
Sbjct: 122 REAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPE 155



 Score = 29.5 bits (67), Expect = 3.1
 Identities = 8/34 (23%), Positives = 19/34 (55%)

Query: 23  KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
           K E   E++K  ++++K K K  R + + + ++ 
Sbjct: 121 KREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKP 154



 Score = 28.7 bits (65), Expect = 4.5
 Identities = 9/37 (24%), Positives = 20/37 (54%)

Query: 21  RGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
           + K+ E   EK+K  ++++K K K   ++ +   +K 
Sbjct: 118 QAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKP 154



 Score = 28.0 bits (63), Expect = 7.6
 Identities = 13/57 (22%), Positives = 26/57 (45%), Gaps = 4/57 (7%)

Query: 3   EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
            K   ++ +Q+ +     R    E E+  ++++K K K  +KKR    +K E +   
Sbjct: 108 AKVQAQRAEQQAK----KREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTP 160


>gnl|CDD|215038 PLN00040, PLN00040, Protein MAK16 homolog; Provisional.
          Length = 233

 Score = 29.7 bits (67), Expect = 2.7
 Identities = 10/37 (27%), Positives = 15/37 (40%)

Query: 23  KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
           + EE   +  K    KKK +     E E++ E K   
Sbjct: 196 EAEEELPKSDKNPNSKKKSRVHVEIEYEDEIEYKSLM 232



 Score = 28.6 bits (64), Expect = 6.2
 Identities = 13/43 (30%), Positives = 18/43 (41%)

Query: 20  SRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLH 62
               EE  EE  K  K    KKK +   E E ++E +   L+ 
Sbjct: 191 MEEVEEAEEELPKSDKNPNSKKKSRVHVEIEYEDEIEYKSLMS 233



 Score = 28.6 bits (64), Expect = 6.5
 Identities = 10/40 (25%), Positives = 20/40 (50%)

Query: 11  DQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEE 50
           ++  E+  +   +EE  + +K    KKK +   +  YE+E
Sbjct: 186 NKVLEMEEVEEAEEELPKSDKNPNSKKKSRVHVEIEYEDE 225


>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
           (TAF4) is one of several TAFs that bind TBP and is
           involved in forming Transcription Factor IID (TFIID)
           complex.  The TATA Binding Protein (TBP) Associated
           Factor 4 (TAF4) is one of several TAFs that bind TBP and
           are involved in forming the Transcription Factor IID
           (TFIID) complex. TFIID is one of seven General
           Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
           TFIIF, and TFIID) that are involved in accurate
           initiation of transcription by RNA polymerase II in
           eukaryote. TFIID plays an important role in the
           recognition of promoter DNA and assembly of the
           pre-initiation complex. TFIID complex is composed of the
           TBP and at least 13 TAFs. TAFs from various species were
           originally named by their predicted molecular weight or
           their electrophoretic mobility in polyacrylamide gels. A
           new, unified nomenclature for the pol II TAFs has been
           suggested to show the relationship between TAF orthologs
           and paralogs. Several hypotheses are proposed for TAFs
           functions such as serving as activator-binding sites,
           core-promoter recognition or a role in essential
           catalytic activity. Each TAF, with the help of a
           specific activator, is required only for the expression
           of subset of genes and is not universally involved for
           transcription as are GTFs. In yeast and human cells,
           TAFs have been found as components of other complexes
           besides TFIID.   Several TAFs interact via histone-fold
           (HFD) motifs; HFD is the interaction motif involved in
           heterodimerization of the core histones and their
           assembly into nucleosome octamers. The minimal HFD
           contains three alpha-helices linked by two loops and is
           found in core histones, TAFS and many other
           transcription factors. TFIID has a histone octamer-like
           substructure. TAF4 domain interacts with TAF12 and makes
           a novel histone-like heterodimer that binds DNA and has
           a core promoter function of a subset of genes.
          Length = 212

 Score = 29.6 bits (67), Expect = 2.7
 Identities = 12/56 (21%), Positives = 28/56 (50%), Gaps = 8/56 (14%)

Query: 1   MNEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
           + +    E+E++EK        +E E      K + ++ + K+K +  ++E++EE 
Sbjct: 119 LEQL---EREEEEK-----RDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEEM 166


>gnl|CDD|218391 pfam05029, TIMELESS_C, Timeless protein C terminal region.  The
           timeless (tim) gene is essential for circadian function
           in Drosophila. Putative homologues of Drosophila tim
           have been identified in both mice and humans (mTim and
           hTIM, respectively). Mammalian TIM is not the true
           orthologue of Drosophila TIM, but is the likely
           orthologue of a fly gene, timeout (also called tim-2).
           mTim has been shown to be essential for embryonic
           development, but does not have substantiated circadian
           function. Some family members contain a SANT domain in
           this region.
          Length = 507

 Score = 30.1 bits (67), Expect = 2.7
 Identities = 19/101 (18%), Positives = 34/101 (33%), Gaps = 13/101 (12%)

Query: 6   GEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL----- 60
           GEE+E +++    +   + +E+E E    +K  +++K      EEE   E++ K      
Sbjct: 380 GEEEEPEDELKDDVDGEQADESEHETLALRKNARQRKAGLASPEEEALGEEEQKAPPKKK 439

Query: 61  --------LHNCYLYSYYRVTHICFSDPPGQAPVLLSPGIH 93
                          S          +    A     P IH
Sbjct: 440 QLNQKNKQQTGSGTNSDEERDDTSLDEDRDLADDGGLPRIH 480


>gnl|CDD|192292 pfam09429, Wbp11, WW domain binding protein 11.  The WW domain is
          a small protein module with a triple-stranded
          beta-sheet fold. This is a family of WW domain binding
          proteins.
          Length = 78

 Score = 27.6 bits (62), Expect = 2.7
 Identities = 9/20 (45%), Positives = 15/20 (75%)

Query: 27 TEEEKKKKKKKKKKKKKKKR 46
          T+  +K++KKK+ KK K +R
Sbjct: 7  TDAYRKEQKKKELKKNKAER 26



 Score = 26.9 bits (60), Expect = 5.7
 Identities = 8/20 (40%), Positives = 14/20 (70%)

Query: 29 EEKKKKKKKKKKKKKKKRYE 48
          E+KKK+ KK K +++ +R  
Sbjct: 13 EQKKKELKKNKAERQARREA 32


>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P.  This model
          represents the L12P protein of the large (50S) subunit
          of the archaeal ribosome.
          Length = 105

 Score = 28.5 bits (64), Expect = 2.7
 Identities = 4/27 (14%), Positives = 19/27 (70%), Gaps = 2/27 (7%)

Query: 25 EETEEEKKKKKKKKKKKKKKKRYEEEE 51
               E+++++++++++++++   EEE
Sbjct: 71 AAAAAEEEEEEEEEEEEEEEE--SEEE 95


>gnl|CDD|222649 pfam14284, PcfJ, PcfJ-like protein.  The PcfJ-like protein family
          includes the E. faecalis PcfJ protein, which is
          functionally uncharacterized. It is found in bacteria
          and viruses, and is typically between 159 and 170 amino
          acids in length. There is a conserved HCV sequence
          motif.
          Length = 169

 Score = 29.2 bits (66), Expect = 2.8
 Identities = 11/35 (31%), Positives = 19/35 (54%)

Query: 25 EETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
          +    + + KK+K+K +K +KRY E EK   +   
Sbjct: 32 DRAVAKYQAKKEKEKAEKMRKRYPELEKIFRELKS 66


>gnl|CDD|181091 PRK07720, fliJ, flagellar biosynthesis chaperone; Validated.
          Length = 146

 Score = 28.9 bits (65), Expect = 2.8
 Identities = 10/22 (45%), Positives = 15/22 (68%)

Query: 34  KKKKKKKKKKKKRYEEEEKEEE 55
           KK +K K+KK++ +  EEK  E
Sbjct: 109 KKYEKMKEKKQEMFALEEKAAE 130


>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
          Length = 619

 Score = 30.1 bits (69), Expect = 2.8
 Identities = 7/48 (14%), Positives = 17/48 (35%)

Query: 2   NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEE 49
                  +ED       +    ++E EEE++ +        + +  E+
Sbjct: 172 GFVDPNAEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADESELPEK 219


>gnl|CDD|221179 pfam11711, Tim54, Inner membrane protein import complex subunit
           Tim54.  Mitochondrial function depends on the import of
           hundreds of different proteins synthesised in the
           cytosol. Protein import is a multi-step pathway which
           includes the binding of precursor proteins to surface
           receptors, translocation of the precursor across one or
           both mitochondrial membranes, and folding and assembly
           of the imported protein inside the mitochondrion. Most
           precursor proteins carry amino-terminal targeting
           signals, called pre-sequences, and are imported into
           mitochondria via import complexes located in both the
           outer and the inner membrane (IM). The IM complex, TIM,
           is made up of at least two proteins which mediate
           translocation of proteins into the matrix by removing
           their signal peptide and another pair of proteins, Tim54
           and Tim22, that insert the polytopic proteins, that
           carry internal targetting information, into the inner
           membrane.
          Length = 377

 Score = 29.7 bits (67), Expect = 2.8
 Identities = 17/84 (20%), Positives = 30/84 (35%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLHNCYL 66
           +  E  E  V   +   E E     +   +  ++  +    E E+  EE++NK +     
Sbjct: 196 DPPEPPEPTVDEAAPETEVEATPAAESPAEPAEETAETTPEETEDAPEEENNKPVKPPVP 255

Query: 67  YSYYRVTHICFSDPPGQAPVLLSP 90
             Y        +  P + P LL P
Sbjct: 256 KPYISPDEYPSAPLPPELPQLLQP 279


>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34.  This family represents
           herpes virus protein U79 and cytomegalovirus early
           phosphoprotein P34 (UL112).
          Length = 238

 Score = 29.5 bits (66), Expect = 2.9
 Identities = 13/49 (26%), Positives = 31/49 (63%), Gaps = 3/49 (6%)

Query: 13  EKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEK---EEEKDN 58
           EKE   +SR K ++   ++  K+K+K++ +  ++++E+ +   EE++ N
Sbjct: 144 EKECRALSRKKSDDEHRKRSGKQKEKRRVEDSQKHKEDRRKKQEEKRRN 192



 Score = 29.1 bits (65), Expect = 4.0
 Identities = 9/42 (21%), Positives = 25/42 (59%)

Query: 14  KEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEE 55
           K+     R +  + +E+++ +  +K K+ ++K+ EE+ + +E
Sbjct: 153 KKSDDEHRKRSGKQKEKRRVEDSQKHKEDRRKKQEEKRRNDE 194


>gnl|CDD|115706 pfam07070, Spo0M, SpoOM protein.  This family consists of several
           bacterial SpoOM proteins which are thought to control
           sporulation in Bacillus subtilis.Spo0M exerts certain
           negative effects on sporulation and its gene expression
           is controlled by sigmaH.
          Length = 218

 Score = 29.6 bits (67), Expect = 2.9
 Identities = 12/30 (40%), Positives = 13/30 (43%), Gaps = 1/30 (3%)

Query: 117 LSPG-IHSFPFKLGLPLGLPSTFLGKHGWV 145
           + PG    FPF L LP   P T      WV
Sbjct: 82  IQPGEERVFPFTLTLPWETPITIGDAKVWV 111


>gnl|CDD|236278 PRK08506, PRK08506, replicative DNA helicase; Provisional.
          Length = 472

 Score = 30.0 bits (68), Expect = 2.9
 Identities = 12/28 (42%), Positives = 20/28 (71%)

Query: 29  EEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
           EEK+K+KK KK+ K+++R   + K  E+
Sbjct: 394 EEKEKEKKAKKEGKEERRIHFQNKSIEE 421



 Score = 28.8 bits (65), Expect = 5.8
 Identities = 11/30 (36%), Positives = 21/30 (70%)

Query: 25  EETEEEKKKKKKKKKKKKKKKRYEEEEKEE 54
           E  E+EK+KK KK+ K++++  ++ +  EE
Sbjct: 392 EREEKEKEKKAKKEGKEERRIHFQNKSIEE 421



 Score = 28.4 bits (64), Expect = 9.5
 Identities = 11/32 (34%), Positives = 21/32 (65%)

Query: 23  KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEE 54
           + EE E+EKK KK+ K++++   + +  E+ E
Sbjct: 392 EREEKEKEKKAKKEGKEERRIHFQNKSIEEAE 423


>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein.  This family includes proteins
           related to Mpp10 (M phase phosphoprotein 10). The U3
           small nucleolar ribonucleoprotein (snoRNP) is required
           for three cleavage events that generate the mature 18S
           rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
           depletion of Mpp10, a U3 snoRNP-specific protein, halts
           18S rRNA production and impairs cleavage at the three U3
           snoRNP-dependent sites.
          Length = 613

 Score = 30.0 bits (67), Expect = 3.0
 Identities = 15/56 (26%), Positives = 29/56 (51%), Gaps = 3/56 (5%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKK---KKKKKKKKKRYEEEEKEEEKDNK 59
           ++ ED + +    S   EE+ EE   + +     K+K KKK   ++ E E+++ +K
Sbjct: 227 DDSEDGKDDEDFGSGEDEEDDEEGNIEYEDFFDPKEKDKKKDAGDDAELEDDEPDK 282



 Score = 29.2 bits (65), Expect = 5.1
 Identities = 14/43 (32%), Positives = 22/43 (51%)

Query: 3   EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKK 45
           E   E+  D E E     + +EEE  EEK+  ++  ++KK  K
Sbjct: 145 ESSLEDLSDDETEDDEEKKMEEEEAGEEKESVEQATREKKFDK 187


>gnl|CDD|235602 PRK05776, PRK05776, DNA topoisomerase I; Provisional.
          Length = 670

 Score = 30.0 bits (68), Expect = 3.0
 Identities = 15/50 (30%), Positives = 24/50 (48%), Gaps = 2/50 (4%)

Query: 325 IQYLARNKVLESETRELASLTRGKIKP--GERDDWTNQQLYVPPLPPTNL 372
           ++ L+R+         L   T+G +KP  G +DD  +  +Y    PP NL
Sbjct: 331 LKGLSRSPQYRPLVNLLLKETKGVLKPVQGPKDDPAHPAIYPTGEPPKNL 380


>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 438

 Score = 29.8 bits (67), Expect = 3.0
 Identities = 10/42 (23%), Positives = 20/42 (47%)

Query: 20  SRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
            + KE E  E++  +  K+ +K+ +   E  EK+ E +    
Sbjct: 111 EKDKELELLEKELDELSKELQKQLQNTAEIIEKKRENNKNEE 152


>gnl|CDD|237631 PRK14162, PRK14162, heat shock protein GrpE; Provisional.
          Length = 194

 Score = 29.4 bits (66), Expect = 3.1
 Identities = 8/38 (21%), Positives = 23/38 (60%)

Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
          +E  +E++  +++   +  KK+ +   +E+++EK N +
Sbjct: 4  EEFPSEKDLPQEETTDEAPKKEAKEAPKEEDQEKQNPV 41



 Score = 27.9 bits (62), Expect = 8.4
 Identities = 9/34 (26%), Positives = 17/34 (50%)

Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
          K+   EE   +  KK+ K+  K+  +E++   E 
Sbjct: 10 KDLPQEETTDEAPKKEAKEAPKEEDQEKQNPVED 43


>gnl|CDD|198139 smart01071, CDC37_N, Cdc37 N terminal kinase binding.  Cdc37 is a
           molecular chaperone required for the activity of
           numerous eukaryotic protein kinases. This domain
           corresponds to the N terminal domain which binds
           predominantly to protein kinases.and is found N terminal
           to the Hsp (Heat shocked protein) 90-binding domain.
           Expression of a construct consisting of only the
           N-terminal domain of Saccharomyces pombe Cdc37 results
           in cellular viability. This indicates that interactions
           with the cochaperone Hsp90 may not be essential for
           Cdc37 function.
          Length = 154

 Score = 28.9 bits (65), Expect = 3.2
 Identities = 18/55 (32%), Positives = 32/55 (58%)

Query: 2   NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
           NE   E ++  +KE+   +   E   EE KK + K KK++K+ ++  +E ++EEK
Sbjct: 85  NEMLAELQDQLKKELEEANGDSEGLLEELKKHRDKLKKEQKELRKKLDELEKEEK 139


>gnl|CDD|226776 COG4326, Spo0M, Sporulation control protein [General function
           prediction only].
          Length = 270

 Score = 29.4 bits (66), Expect = 3.3
 Identities = 20/73 (27%), Positives = 31/73 (42%), Gaps = 16/73 (21%)

Query: 49  EEEKEEEKDNKLLHNCYLYSYYRVTHICFSDPPGQAPVLLSPGIHSFPFKLGLPLGLPST 108
           E + E  +    L N      +R+ +  F+  PG+          +FPF+L LP   P T
Sbjct: 79  EVDDERGQQQGTLAN------WRLPY-AFTIQPGEE--------RNFPFELSLPWNTPVT 123

Query: 109 FLGQAPVLLSPGI 121
             G A V +  G+
Sbjct: 124 I-GDAKVWVETGL 135


>gnl|CDD|237875 PRK14974, PRK14974, cell division protein FtsY; Provisional.
          Length = 336

 Score = 29.6 bits (67), Expect = 3.3
 Identities = 18/52 (34%), Positives = 31/52 (59%)

Query: 3  EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEE 54
          EK  E+ E++E+E    +  +EEE +EE+KK+K     K K    +E++ E+
Sbjct: 14 EKVEEKIEEEEEEEAPEAEEEEEEEDEEEKKEKPGFFDKAKITEIKEKDIED 65


>gnl|CDD|184287 PRK13735, PRK13735, conjugal transfer mating pair stabilization
           protein TraG; Provisional.
          Length = 942

 Score = 30.1 bits (68), Expect = 3.3
 Identities = 14/43 (32%), Positives = 19/43 (44%), Gaps = 7/43 (16%)

Query: 2   NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKK 44
           N K  EEK  QE       R    ++ EE  K+ K+ + K K 
Sbjct: 907 NNKYNEEKSAQE-------RMPGADSPEELMKRAKEYQDKHKG 942


>gnl|CDD|150786 pfam10153, DUF2361, Uncharacterized conserved protein (DUF2361). 
          This is a region of 120 amino acids that is conserved
          in a family of proteins found from plants to fungi. The
          function is not known.
          Length = 114

 Score = 28.3 bits (64), Expect = 3.3
 Identities = 10/35 (28%), Positives = 18/35 (51%)

Query: 30 EKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLHNC 64
          E+KK  ++ K+ +KK     +EKE +   + L   
Sbjct: 60 ERKKATRRLKQLEKKLEETSDEKEIKDLKEQLSKA 94


>gnl|CDD|216833 pfam01991, vATP-synt_E, ATP synthase (E/31 kDa) subunit.  This
          family includes the vacuolar ATP synthase E subunit, as
          well as the archaebacterial ATP synthase E subunit.
          Length = 195

 Score = 29.2 bits (66), Expect = 3.3
 Identities = 15/47 (31%), Positives = 29/47 (61%), Gaps = 1/47 (2%)

Query: 9  KEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEE 55
          +++ E++   I    EEE E EK +  ++ +KK ++  YE++EK+ E
Sbjct: 3  RQEAEEKAEEIRAEAEEEFEIEKAEAVEEAEKKIEE-IYEKKEKQAE 48


>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
           homolog [Translation, ribosomal structure and
           biogenesis].
          Length = 395

 Score = 29.7 bits (67), Expect = 3.4
 Identities = 10/53 (18%), Positives = 26/53 (49%), Gaps = 4/53 (7%)

Query: 9   KEDQEKEVGGISRGKEEET----EEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
           +E+ EK +  +     +       E  KK++  + ++KKK++  + E+   ++
Sbjct: 343 REELEKRIEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQN 395


>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
           Genome duplication is precisely regulated by
           cyclin-dependent kinases CDKs, which bring about the
           onset of S phase by activating replication origins and
           then prevent relicensing of origins until mitosis is
           completed. The optimum sequence motif for CDK
           phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
           to have at least 11 potential phosphorylation sites.
           Drc1 is required for DNA synthesis and S-M replication
           checkpoint control. Drc1 associates with Cdc2 and is
           phosphorylated at the onset of S phase when Cdc2 is
           activated. Thus Cdc2 promotes DNA replication by
           phosphorylating Drc1 and regulating its association with
           Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
           substrates required for DNA replication.
          Length = 397

 Score = 29.8 bits (67), Expect = 3.4
 Identities = 15/45 (33%), Positives = 24/45 (53%), Gaps = 4/45 (8%)

Query: 2   NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKR 46
           +E  G      E +        +EE++EE +KK+K KKK +K+K 
Sbjct: 329 SEFLGYMGGIDEDD----EDEDDEESKEEVEKKQKVKKKPRKRKV 369



 Score = 28.6 bits (64), Expect = 6.6
 Identities = 15/44 (34%), Positives = 27/44 (61%), Gaps = 7/44 (15%)

Query: 1   MNEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKK 44
           +   GG +++D++++        EE  EE +KK+K KKK +K+K
Sbjct: 332 LGYMGGIDEDDEDED-------DEESKEEVEKKQKVKKKPRKRK 368



 Score = 28.6 bits (64), Expect = 6.7
 Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 8/39 (20%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKK 45
           E+ ED++ E        E + E EKK+K KKK +K+K  
Sbjct: 340 EDDEDEDDE--------ESKEEVEKKQKVKKKPRKRKVN 370


>gnl|CDD|217020 pfam02403, Seryl_tRNA_N, Seryl-tRNA synthetase N-terminal domain.
           This domain is found associated with the Pfam tRNA
           synthetase class II domain (pfam00587) and represents
           the N-terminal domain of seryl-tRNA synthetase.
          Length = 108

 Score = 27.9 bits (63), Expect = 3.4
 Identities = 16/48 (33%), Positives = 24/48 (50%), Gaps = 2/48 (4%)

Query: 14  KEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
           KE+G   + K+E+ E    + K+ K + K  +  E  E E E D  LL
Sbjct: 57  KEIGKAKK-KKEDAEALIAEVKELKDELKALEA-ELRELEAELDKLLL 102


>gnl|CDD|191249 pfam05279, Asp-B-Hydro_N, Aspartyl beta-hydroxylase N-terminal
           region.  This family includes the N-terminal regions of
           the junctin, junctate and aspartyl beta-hydroxylase
           proteins. Junctate is an integral ER/SR membrane calcium
           binding protein, which comes from an alternatively
           spliced form of the same gene that generates aspartyl
           beta-hydroxylase and junctin. Aspartyl beta-hydroxylase
           catalyzes the post-translational hydroxylation of
           aspartic acid or asparagine residues contained within
           epidermal growth factor (EGF) domains of proteins.
          Length = 240

 Score = 29.5 bits (66), Expect = 3.5
 Identities = 11/58 (18%), Positives = 24/58 (41%)

Query: 3   EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
            +   E+E Q        + +EE  E+ +   +K    K+++    +E + +E    L
Sbjct: 82  AEPHAEEEGQLAVRKTKQKVEEEVKEQLQSLLEKIVVSKQEEDGPGKEPQLDEDKFLL 139


>gnl|CDD|225368 COG2811, NtpF, Archaeal/vacuolar-type H+-ATPase subunit H [Energy
          production and conversion].
          Length = 108

 Score = 28.2 bits (63), Expect = 3.5
 Identities = 14/50 (28%), Positives = 29/50 (58%), Gaps = 1/50 (2%)

Query: 7  EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
          E KE+ E+ +   +R +  E  EE +++ +K  ++  ++  EE E+E E+
Sbjct: 25 EAKEEAEQIIKE-AREEAREIIEEAEEEAEKLAQEILEEAREEAEEEAEE 73



 Score = 27.8 bits (62), Expect = 4.3
 Identities = 12/32 (37%), Positives = 21/32 (65%)

Query: 24 EEETEEEKKKKKKKKKKKKKKKRYEEEEKEEE 55
          +EE EE K++ ++  K+ +++ R   EE EEE
Sbjct: 20 DEEIEEAKEEAEQIIKEAREEAREIIEEAEEE 51


>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987).  A
           family of uncharacterized proteins found by clustering
           human gut metagenomic sequences.
          Length = 379

 Score = 29.5 bits (67), Expect = 3.5
 Identities = 20/51 (39%), Positives = 27/51 (52%), Gaps = 4/51 (7%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
           EE E++ KE       ++E  E EKK  +KK KK  KK + EE   EE  +
Sbjct: 72  EEYEEELKE----YEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEELLE 118


>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
           partitioning / Cytoskeleton].
          Length = 373

 Score = 29.6 bits (67), Expect = 3.5
 Identities = 15/57 (26%), Positives = 31/57 (54%), Gaps = 1/57 (1%)

Query: 1   MNEKGGEEKEDQEKEV-GGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
           +NE+  E K+   +++     R +E E    +++K+   K ++ +K+ E+ EK  EK
Sbjct: 311 LNEEERELKKKFTEKIREKEKRLEELEQNLIEERKELNSKLEEIQKKLEDLEKRLEK 367


>gnl|CDD|129416 TIGR00316, cdhC, CO dehydrogenase/CO-methylating acetyl-CoA
           synthase complex, beta subunit.  Nomenclature follows
           the description for Methanosarcina thermophila. The
           CO-methylating acetyl-CoA synthase is considered the
           defining enzyme of the Wood-Ljungdahl pathway, used for
           acetate catabolism by sulfate reducing bacteria but for
           acetate biosynthesis by acetogenic bacteria such as
           oorella thermoacetica (f. Clostridium thermoaceticum)
           [Energy metabolism, Chemoautotrophy].
          Length = 458

 Score = 29.8 bits (67), Expect = 3.5
 Identities = 13/54 (24%), Positives = 29/54 (53%)

Query: 27  TEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLHNCYLYSYYRVTHICFSDP 80
           T+ +K K++ +K ++  +KR E  +   ++D  + + C +   +  TH+C   P
Sbjct: 151 TDPDKVKEELEKAREIYEKRDERTKALSDEDVDVFYGCVMCQSFAPTHVCIVTP 204


>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
           and chromosome partitioning].
          Length = 420

 Score = 29.7 bits (67), Expect = 3.6
 Identities = 6/50 (12%), Positives = 22/50 (44%), Gaps = 3/50 (6%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
            E+ +    +   S  + ++ +  +  +++KK   +       ++K+ E+
Sbjct: 182 AEQAELTTLL---SEQRAQQAKLAQLLEERKKTLAQLNSELSADQKKLEE 228


>gnl|CDD|116948 pfam08367, M16C_assoc, Peptidase M16C associated.  This domain
          appears in eukaryotes as well as bacteria and tends to
          be found near the C-terminus of the metalloprotease
          M16C (pfam05193).
          Length = 248

 Score = 29.1 bits (66), Expect = 3.6
 Identities = 14/38 (36%), Positives = 24/38 (63%), Gaps = 1/38 (2%)

Query: 25 EETEEEKKKKKKKKKKKKKKKRYEEEEKEE-EKDNKLL 61
          +ET EEK  +++++K +KKK    EE+KE+  +    L
Sbjct: 4  DETLEEKLNEEEREKLEKKKSSLSEEDKEKIIERGLEL 41


>gnl|CDD|223898 COG0828, RpsU, Ribosomal protein S21 [Translation, ribosomal
          structure and biogenesis].
          Length = 67

 Score = 27.3 bits (61), Expect = 3.6
 Identities = 9/25 (36%), Positives = 19/25 (76%)

Query: 29 EEKKKKKKKKKKKKKKKRYEEEEKE 53
          E++K+KK   +K+K K+  +E+++E
Sbjct: 43 EKRKRKKAAARKRKFKRLRKEQQRE 67



 Score = 26.5 bits (59), Expect = 6.9
 Identities = 10/21 (47%), Positives = 14/21 (66%)

Query: 26 ETEEEKKKKKKKKKKKKKKKR 46
          E   EK+K+KK   +K+K KR
Sbjct: 39 EKPSEKRKRKKAAARKRKFKR 59


>gnl|CDD|165026 PHA02644, PHA02644, hypothetical protein; Provisional.
          Length = 112

 Score = 28.0 bits (61), Expect = 3.7
 Identities = 15/41 (36%), Positives = 25/41 (60%)

Query: 19 ISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
          IS   E +T  E  KK+ + +KK +K   E+E+K E+ +N+
Sbjct: 46 ISNEDEMKTTHEHIKKENEDEKKPEKPENEDEKKPEKPENE 86


>gnl|CDD|191634 pfam06886, TPX2, Targeting protein for Xklp2 (TPX2).  This family
          represents a conserved region approximately 60 residues
          long within the eukaryotic targeting protein for Xklp2
          (TPX2). Xklp2 is a kinesin-like protein localised on
          centrosomes throughout the cell cycle and on spindle
          pole microtubules during metaphase. In Xenopus, it has
          been shown that Xklp2 protein is required for
          centrosome separation and maintenance of spindle
          bi-polarity. TPX2 is a microtubule-associated protein
          that mediates the binding of the C-terminal domain of
          Xklp2 to microtubules. It is phosphorylated during
          mitosis in a microtubule-dependent way.
          Length = 57

 Score = 27.0 bits (60), Expect = 3.7
 Identities = 12/33 (36%), Positives = 22/33 (66%)

Query: 24 EEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
          E+  E +KK ++K+K  + +K+  E  +KEEE+
Sbjct: 8  EKRAEFDKKLEEKEKALEAEKEEAEARQKEEEE 40


>gnl|CDD|204935 pfam12474, PKK, Polo kinase kinase.  This domain family is found in
           eukaryotes, and is approximately 140 amino acids in
           length. The family is found in association with
           pfam00069. Polo-like kinase 1 (Plx1) is essential during
           mitosis for the activation of Cdc25C, for spindle
           assembly, and for cyclin B degradation. This family is
           Polo kinase kinase (PKK) which phosphorylates Polo
           kinase and Polo-like kinase to activate them. PKK is a
           serine/threonine kinase.
          Length = 142

 Score = 28.4 bits (64), Expect = 3.8
 Identities = 13/45 (28%), Positives = 26/45 (57%), Gaps = 1/45 (2%)

Query: 21  RGKEEETEEEKKKKKKKKKKKKKKKRYEE-EEKEEEKDNKLLHNC 64
             KE + E EK  + ++++KK+ K   EE E+K ++++ + L   
Sbjct: 75  EKKELKQEVEKLPRFQEQEKKRMKAEKEEQEQKHQKQEREFLAKQ 119


>gnl|CDD|205311 pfam13130, DUF3952, Domain of unknown function (DUF3952).  This
           presumed domain is functionally uncharacterized. This
           domain family is found in bacteria, and is approximately
           110 amino acids in length. There is a conserved VMSAS
           sequence motif.
          Length = 107

 Score = 28.1 bits (63), Expect = 3.9
 Identities = 18/56 (32%), Positives = 25/56 (44%), Gaps = 10/56 (17%)

Query: 7   EEKEDQEKEV------GGISRGKEE----ETEEEKKKKKKKKKKKKKKKRYEEEEK 52
           EEKED E          GI   K++    ET +E       +K+K K   Y+EE+ 
Sbjct: 52  EEKEDGEHSKTIYQTTEGIYNTKDKSLYGETTQEITTDIDNEKEKDKNNNYKEEKV 107


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 29.4 bits (67), Expect = 4.0
 Identities = 12/37 (32%), Positives = 22/37 (59%)

Query: 24 EEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
          ++E E  KK+   + K++  K R E E++  E+ N+L
Sbjct: 48 KKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNEL 84



 Score = 29.4 bits (67), Expect = 4.1
 Identities = 14/56 (25%), Positives = 37/56 (66%), Gaps = 1/56 (1%)

Query: 2   NEKGGEEKEDQ-EKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
            EK   +KE+  ++++  + + +EE  ++EK+ ++K+++ +KK++  EE  +E+ +
Sbjct: 87  LEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQ 142



 Score = 29.0 bits (66), Expect = 5.8
 Identities = 19/56 (33%), Positives = 32/56 (57%), Gaps = 4/56 (7%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLH 62
           EE  D++ E       K EE  E+K+K+ ++K+++ +KK  E EE  EE+  +L  
Sbjct: 95  EENLDRKLE----LLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQELER 146


>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97
          family.  This model family represents the major capsid
          protein component of the heads (capsids) of
          bacteriophage HK97, phi-105, P27, and related phage.
          This model represents one of several analogous families
          lacking detectable sequence similarity. The gene
          encoding this component is typically located in an
          operon encoding the small and large terminase subunits,
          the portal protein and the prohead or maturation
          protease [Mobile and extrachromosomal element
          functions, Prophage functions].
          Length = 384

 Score = 29.2 bits (66), Expect = 4.0
 Identities = 9/55 (16%), Positives = 23/55 (41%), Gaps = 2/55 (3%)

Query: 7  EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKK--KRYEEEEKEEEKDNK 59
          E   +  KE+  ++  ++    EE+K +    K++  K     +  E+  ++   
Sbjct: 6  EALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEA 60


>gnl|CDD|223250 COG0172, SerS, Seryl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 429

 Score = 29.5 bits (67), Expect = 4.0
 Identities = 16/48 (33%), Positives = 26/48 (54%), Gaps = 1/48 (2%)

Query: 14  KEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
           KE+G   +  E++ EE   + K+ K+K K+ +    +E E E D  LL
Sbjct: 57  KEIGRALKRGEDDAEELIAEVKELKEKLKELEA-ALDELEAELDTLLL 103


>gnl|CDD|215525 PLN02972, PLN02972, Histidyl-tRNA synthetase.
          Length = 763

 Score = 29.5 bits (66), Expect = 4.1
 Identities = 14/31 (45%), Positives = 16/31 (51%)

Query: 15  EVGGISRGKEEETEEEKKKKKKKKKKKKKKK 45
           E G +    + E E E  K  KK KK KKKK
Sbjct: 228 EGGELIEKPQAEAENEAGKSDKKDKKSKKKK 258


>gnl|CDD|221324 pfam11933, DUF3451, Domain of unknown function (DUF3451).  This
          presumed domain is functionally uncharacterized. This
          domain is found in eukaryotes. This domain is typically
          between 199 to 238 amino acids in length. This domain
          is found associated with pfam06512, pfam00520. This
          domain has a conserved ADD sequence motif.
          Length = 222

 Score = 29.0 bits (65), Expect = 4.1
 Identities = 9/29 (31%), Positives = 16/29 (55%)

Query: 28 EEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
          E  K+++ +  K KK++     EE + EK
Sbjct: 9  ESAKERRNRNDKNKKEEHSIGSEEGDSEK 37


>gnl|CDD|217787 pfam03910, Adeno_PV, Adenovirus minor core protein PV. 
          Length = 336

 Score = 29.4 bits (66), Expect = 4.3
 Identities = 11/26 (42%), Positives = 20/26 (76%)

Query: 30 EKKKKKKKKKKKKKKKRYEEEEKEEE 55
          ++ K +K K+ KK+KKR E++E ++E
Sbjct: 26 DEAKPRKIKRVKKRKKREEKDELDDE 51



 Score = 28.3 bits (63), Expect = 8.5
 Identities = 10/26 (38%), Positives = 18/26 (69%)

Query: 29 EEKKKKKKKKKKKKKKKRYEEEEKEE 54
          +E K +K K+ KK+KK+  ++E  +E
Sbjct: 26 DEAKPRKIKRVKKRKKREEKDELDDE 51


>gnl|CDD|236081 PRK07735, PRK07735, NADH dehydrogenase subunit C; Validated.
          Length = 430

 Score = 29.2 bits (65), Expect = 4.3
 Identities = 16/63 (25%), Positives = 24/63 (38%), Gaps = 12/63 (19%)

Query: 9   KEDQEKEVGGISRGKEEETEEEKKKKKKKK------------KKKKKKKRYEEEEKEEEK 56
               +++  G     EEE E +K+K K K             K+K  +     EE  EE+
Sbjct: 140 AALAKQKREGTEEVTEEEEETDKEKAKAKAAAAAKAKAAALAKQKAAEAGEGTEEVTEEE 199

Query: 57  DNK 59
             K
Sbjct: 200 KAK 202


>gnl|CDD|220624 pfam10187, Nefa_Nip30_N, N-terminal domain of NEFA-interacting
          nuclear protein NIP30.  This is a the N-terminal 100
          amino acids of a family of proteins conserved from
          plants to humans. The full-length protein has
          putatively been called NEFA-interacting nuclear protein
          NIP30, however no reference could be found to confirm
          this.
          Length = 99

 Score = 27.7 bits (62), Expect = 4.4
 Identities = 12/31 (38%), Positives = 20/31 (64%)

Query: 24 EEETEEEKKKKKKKKKKKKKKKRYEEEEKEE 54
          E+E E   + ++ ++  +KK+KR E EE EE
Sbjct: 67 EDEVEFLDEVEESRRAAEKKRKREEAEELEE 97


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
           Rpc31.  RNA polymerase III contains seventeen subunits
           in yeasts and in human cells. Twelve of these are akin
           to RNA polymerase I or II and the other five are RNA pol
           III-specific, and form the functionally distinct groups
           (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
           Rpc34 and Rpc82 form a cluster of enzyme-specific
           subunits that contribute to transcription initiation in
           S.cerevisiae and H.sapiens. There is evidence that these
           subunits are anchored at or near the N-terminal Zn-fold
           of Rpc1, itself prolonged by a highly conserved but RNA
           polymerase III-specific domain.
          Length = 221

 Score = 29.0 bits (65), Expect = 4.4
 Identities = 16/56 (28%), Positives = 27/56 (48%), Gaps = 3/56 (5%)

Query: 6   GEEKEDQEKEVGGISRGK---EEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDN 58
           G  K+  +K      + K     E EE+  +K    +KK K+   E+ ++E+EKD 
Sbjct: 121 GINKKAGKKLALSKFKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDE 176



 Score = 28.2 bits (63), Expect = 6.7
 Identities = 16/63 (25%), Positives = 34/63 (53%), Gaps = 6/63 (9%)

Query: 2   NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKK------KKKKKRYEEEEKEEE 55
            + G +    + K   G+   +EE+ +E+    +KK K+       ++ ++ EEEE+EEE
Sbjct: 124 KKAGKKLALSKFKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEE 183

Query: 56  KDN 58
           +++
Sbjct: 184 EED 186



 Score = 28.2 bits (63), Expect = 8.1
 Identities = 15/56 (26%), Positives = 29/56 (51%), Gaps = 4/56 (7%)

Query: 2   NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
             K G   E++E     +S  +++  E E +   ++ +K ++    EEEE+EEE +
Sbjct: 136 KRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEE----EEEEEEEEDE 187


>gnl|CDD|223232 COG0154, GatA, Asp-tRNAAsn/Glu-tRNAGln amidotransferase A subunit
           and related amidases [Translation, ribosomal structure
           and biogenesis].
          Length = 475

 Score = 29.2 bits (66), Expect = 4.4
 Identities = 18/76 (23%), Positives = 25/76 (32%), Gaps = 27/76 (35%)

Query: 86  VLLSPGIHSFPFKLG---------------LPLGLPSTFLGQAPVLLSPGIHSFPF---K 127
           VLL+P   +   K+G                   +P+   G       P I S P     
Sbjct: 385 VLLTPTTPTPAPKIGESESDGDDPLEMYLLDVFTVPANLAGL------PAI-SVPAGFTA 437

Query: 128 LGLPLGLPSTFLGKHG 143
            GLP+GL    +G   
Sbjct: 438 DGLPVGL--QLIGPAF 451


>gnl|CDD|153294 cd07610, FCH_F-BAR, The Extended FES-CIP4 Homology (FCH) or F-BAR
           (FCH and Bin/Amphiphysin/Rvs) domain, a dimerization
           module that binds and bends membranes.  F-BAR domains
           are dimerization modules that bind and bend membranes
           and are found in proteins involved in membrane dynamics
           and actin reorganization. F-BAR domain containing
           proteins, also known as Pombe Cdc15 homology (PCH)
           family proteins, include Fes and Fer tyrosine kinases,
           PACSINs/Syndapins, FCHO, PSTPIP, CIP4-like proteins and
           srGAPs. Many members also contain an SH3 domain and play
           roles in endocytosis. F-BAR domains form banana-shaped
           dimers with a positively-charged concave surface that
           binds to negatively-charged lipid membranes. They can
           induce membrane deformation in the form of long tubules.
           These tubules have diameters larger than those observed
           with N-BARs. The F-BAR domains of some members such as
           NOSTRIN and Rgd1 are important for the subcellular
           localization of the protein.
          Length = 191

 Score = 28.8 bits (65), Expect = 4.4
 Identities = 18/60 (30%), Positives = 26/60 (43%), Gaps = 11/60 (18%)

Query: 9   KEDQEKEVGGISRGKEEETEEEKKKKK-----KKKKKKKKKKRYEEEEKEEEKDNKLLHN 63
           KED+E+        K+E  E EK KKK      K  KK  ++  E+ EK     ++    
Sbjct: 92  KEDKEQA------RKKELAEGEKLKKKLQELWAKLAKKADEEYREQVEKLNPAQSEYEEE 145


>gnl|CDD|218747 pfam05786, Cnd2, Condensin complex subunit 2.  This family consists
           of several Barren protein homologues from several
           eukaryotic organisms. In Drosophila Barren (barr) is
           required for sister-chromatid segregation in mitosis.
           barr encodes a novel protein that is present in
           proliferating cells and has homologues in yeast and
           human. Mitotic defects in barr embryos become apparent
           during cycle 16, resulting in a loss of PNS and CNS
           neurons. Centromeres move apart at the
           metaphase-anaphase transition and Cyclin B is degraded,
           but sister chromatids remain connected, resulting in
           chromatin bridging. Barren protein localises to
           chromatin throughout mitosis. Colocalisation and
           biochemical experiments indicate that Barren associates
           with Topoisomerase II throughout mitosis and alters the
           activity of Topoisomerase II. It has been suggested that
           this association is required for proper chromosomal
           segregation by facilitating the decatenation of
           chromatids at anaphase. This family forms one of the
           three non-structural maintenance of chromosomes (SMC)
           subunits of the mitotic condensation complex along with
           Cnd1 and Cnd3.
          Length = 719

 Score = 29.3 bits (66), Expect = 4.5
 Identities = 13/39 (33%), Positives = 23/39 (58%), Gaps = 6/39 (15%)

Query: 6   GEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKK 44
           G++ ED ++E      G + E +  KKK KKK+++K + 
Sbjct: 164 GDDGEDDDEE------GSDGEEDGAKKKAKKKRQRKPES 196



 Score = 28.2 bits (63), Expect = 9.8
 Identities = 14/40 (35%), Positives = 20/40 (50%), Gaps = 4/40 (10%)

Query: 8   EKEDQEKEVGGISRGKEEE----TEEEKKKKKKKKKKKKK 43
            KE  E    G     ++E     EE+  KKK KKK+++K
Sbjct: 154 NKEAPETGREGDDGEDDDEEGSDGEEDGAKKKAKKKRQRK 193


>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6.  RPL6
           contains KOW motif that has an extra ribosomal role as
           an oncogenic. KOW domain is known as an RNA-binding
           motif that is shared so far among some families of
           ribosomal proteins, the essential bacterial
           transcriptional elongation factor NusG, the eukaryotic
           chromatin elongation factor Spt5, the higher eukaryotic
           KIN17 proteins and Mtr4. .
          Length = 152

 Score = 28.3 bits (64), Expect = 4.5
 Identities = 13/26 (50%), Positives = 20/26 (76%)

Query: 32  KKKKKKKKKKKKKKRYEEEEKEEEKD 57
           K+KKKKKKKKK+ + +EE++K+    
Sbjct: 82  KRKKKKKKKKKEGEFFEEKKKKYVVS 107



 Score = 27.5 bits (62), Expect = 9.6
 Identities = 12/29 (41%), Positives = 20/29 (68%)

Query: 31  KKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
           K+KKKKKKKKK+ +   E+++K    + +
Sbjct: 82  KRKKKKKKKKKEGEFFEEKKKKYVVSEER 110


>gnl|CDD|221544 pfam12353, eIF3g, Eukaryotic translation initiation factor 3
           subunit G.  This domain family is found in eukaryotes,
           and is approximately 130 amino acids in length. The
           family is found in association with pfam00076. This
           family is subunit G of the eukaryotic translation
           initiation factor 3. Subunit G is required for eIF3
           integrity.
          Length = 126

 Score = 28.0 bits (63), Expect = 4.5
 Identities = 13/29 (44%), Positives = 18/29 (62%)

Query: 19  ISRGKEEETEEEKKKKKKKKKKKKKKKRY 47
           + R  +EE EEE+++K KK  KK  K R 
Sbjct: 81  LIRNWKEEEEEEEEEKAKKSGKKLVKCRI 109


>gnl|CDD|234368 TIGR03835, termin_org_DnaJ, terminal organelle assembly protein
           TopJ.  This model describes TopJ (MG_200, CbpA), a DnaJ
           homolog and probable assembly protein of the Mycoplasma
           terminal organelle. The terminal organelle is involved
           in both cytadherence and gliding motility [Cellular
           processes, Chemotaxis and motility].
          Length = 871

 Score = 29.4 bits (65), Expect = 4.6
 Identities = 14/38 (36%), Positives = 21/38 (55%)

Query: 19  ISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
           I++  E +   +     KKKKKKKKK  +  + K+E K
Sbjct: 91  INKSGEFDNITDDDTPSKKKKKKKKKGWFWAKSKQESK 128


>gnl|CDD|232965 TIGR00414, serS, seryl-tRNA synthetase.  This model represents
          the seryl-tRNA synthetase found in most organisms. This
          protein is a class II tRNA synthetase, and is
          recognized by the pfam model tRNA-synt_2b. The
          seryl-tRNA synthetases of two archaeal species,
          Methanococcus jannaschii and Methanobacterium
          thermoautotrophicum, differ considerably and are
          included in a different model [Protein synthesis, tRNA
          aminoacylation].
          Length = 418

 Score = 29.3 bits (66), Expect = 4.6
 Identities = 20/64 (31%), Positives = 29/64 (45%), Gaps = 6/64 (9%)

Query: 2  NEKGGEEKEDQEKEVGGISRGKEEETE-EEKKKK-----KKKKKKKKKKKRYEEEEKEEE 55
            +G     D EK +      K+  +E EE + K     K+  K K +KK   EE K+E 
Sbjct: 19 KARGLSVDIDLEKLIALDDERKKLLSEIEELQAKRNELSKQIGKAKGQKKDKIEEIKKEL 78

Query: 56 KDNK 59
          K+ K
Sbjct: 79 KELK 82


>gnl|CDD|217450 pfam03247, Prothymosin, Prothymosin/parathymosin family.
           Prothymosin alpha and parathymosin are two ubiquitous
           small acidic nuclear proteins that are thought to be
           involved in cell cycle progression, proliferation, and
           cell differentiation.
          Length = 106

 Score = 27.6 bits (61), Expect = 4.7
 Identities = 12/37 (32%), Positives = 22/37 (59%)

Query: 2   NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKK 38
             +G EE+ ++E+E  G +  +  E EE+  + KK+K
Sbjct: 65  EGEGEEEEGEEEEETEGATGKRAAEDEEDDAETKKQK 101



 Score = 27.2 bits (60), Expect = 7.8
 Identities = 15/53 (28%), Positives = 27/53 (50%), Gaps = 7/53 (13%)

Query: 7  EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
          EE+E  E +      G+EEE EEE++ +    K+         E++E++ + K
Sbjct: 53 EEEEVDEDDEEEEGEGEEEEGEEEEETEGATGKRAA-------EDEEDDAETK 98


>gnl|CDD|233042 TIGR00598, rad14, DNA repair protein.  All proteins in this family
           for which functions are known are used for the
           recognition of DNA damage as part of nucleotide excision
           repair. This family is based on the phylogenomic
           analysis of JA Eisen (1999, Ph.D. Thesis, Stanford
           University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 172

 Score = 28.6 bits (64), Expect = 4.8
 Identities = 10/29 (34%), Positives = 24/29 (82%)

Query: 25  EETEEEKKKKKKKKKKKKKKKRYEEEEKE 53
           EE  +E+K+++++ K++ K+K++E++ KE
Sbjct: 93  EEALDEEKERREESKEEMKEKKFEKKLKE 121



 Score = 27.4 bits (61), Expect = 9.6
 Identities = 18/64 (28%), Positives = 29/64 (45%), Gaps = 11/64 (17%)

Query: 5   GGEEKEDQEKEVGGISRGKEEETEEEKKKKK-----------KKKKKKKKKKRYEEEEKE 53
           G EE  D+EKE    S+ + +E + EKK K+            KK+ +  +  +  E   
Sbjct: 91  GSEEALDEEKERREESKEEMKEKKFEKKLKELRRAVRSSEYTNKKEGRVHEHEFGPETNG 150

Query: 54  EEKD 57
            E+D
Sbjct: 151 VEED 154


>gnl|CDD|180505 PRK06278, PRK06278, cobyrinic acid a,c-diamide synthase; Validated.
          Length = 476

 Score = 29.2 bits (66), Expect = 4.8
 Identities = 14/37 (37%), Positives = 20/37 (54%)

Query: 23  KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
           KE+E EE  KK K  K+K K +       K+EE++  
Sbjct: 202 KEDEKEEIFKKNKILKEKLKSRSGISTNNKKEERNKP 238


>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
           [Transcription / Chromatin structure and dynamics].
          Length = 279

 Score = 29.2 bits (65), Expect = 4.8
 Identities = 21/65 (32%), Positives = 31/65 (47%), Gaps = 9/65 (13%)

Query: 3   EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKK---------KKKKKKKKRYEEEEKE 53
             G EE+ED+E        G  E  EEE+++ +           + ++  KK+ EEEE E
Sbjct: 185 SDGREEEEDEEVGSDSYGEGNRELNEEEEEEAEGSDDGEDVVDYEGERIDKKQGEEEEME 244

Query: 54  EEKDN 58
           EE  N
Sbjct: 245 EEVIN 249


>gnl|CDD|220818 pfam10595, UPF0564, Uncharacterized protein family UPF0564.  This
          family of proteins has no known function. However, one
          of the members is annotated as an EF-hand family
          protein.
          Length = 349

 Score = 29.0 bits (65), Expect = 4.8
 Identities = 11/31 (35%), Positives = 20/31 (64%)

Query: 30 EKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
           K++KKK++KK  ++ + EEE  + EK  + 
Sbjct: 12 TKREKKKREKKSIRQSKLEEELNKLEKKEEE 42



 Score = 28.6 bits (64), Expect = 7.8
 Identities = 12/49 (24%), Positives = 23/49 (46%), Gaps = 3/49 (6%)

Query: 10  EDQEKEVGGISRGKEEETEEEKKKKKKKKK---KKKKKKRYEEEEKEEE 55
            ++E++   I   K   +  E++ KK K K   K       +++ +EEE
Sbjct: 95  YEREEQKKAILPRKLRSSTSEREPKKFKAKPVPKSIYIPLLKDKMQEEE 143



 Score = 28.2 bits (63), Expect = 8.6
 Identities = 12/59 (20%), Positives = 29/59 (49%)

Query: 1   MNEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
           ++++     E+  K          ++ E   K  + ++K+K+ K++ E++E E+ K  K
Sbjct: 245 LDQENISAGEENLKPTRRKLPPSTKKWESLVKFLRTERKEKEAKEQQEKKELEQRKKKK 303


>gnl|CDD|130673 TIGR01612, 235kDa-fam, reticulocyte binding/rhoptry protein.  This
            model represents a group of paralogous families in
            plasmodium species alternately annotated as reticulocyte
            binding protein, 235-kDa family protein and rhoptry
            protein. Rhoptry protein is localized on the cell surface
            and is extremely large (although apparently lacking in
            repeat structure) and is important for the process of
            invasion of the RBCs by the parasite. These proteins are
            found in P. falciparum, P. vivax and P. yoelii.
          Length = 2757

 Score = 29.6 bits (66), Expect = 5.0
 Identities = 16/53 (30%), Positives = 30/53 (56%)

Query: 7    EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
            + K+D E  +  I    ++   E +K ++K K+ KK+K R E++  + +K NK
Sbjct: 1541 KTKKDSEIIIKEIKDAHKKFILEAEKSEQKIKEIKKEKFRIEDDAAKNDKSNK 1593


>gnl|CDD|240597 cd12933, eIF3G, eIF3G domain found in eukaryotic translation
          initiation factor 3 subunit G (eIF-3G) and similar
          proteins.  eIF-3G, also termed eIF-3 subunit 4, or
          eIF-3-delta, or eIF3-p42, or eIF3-p44, is the
          RNA-binding subunit of eIF3. eIF3 is a large
          multi-subunit complex that plays a central role in the
          initiation of translation by binding to the 40 S
          ribosomal subunit and promoting the binding of
          methionyl-tRNAi and mRNA. eIF-3G binds 18 S rRNA and
          beta-globin mRNA, and therefore appears to be a
          nonspecific RNA-binding protein. Besides, eIF-3G is one
          of the cytosolic targets; it interacts with mature
          apoptosis-inducing factor (AIF). This family also
          includes yeast eIF3-p33, a homolog of vertebrate
          eIF-3G; it plays an important role in the initiation
          phase of protein synthesis in yeast. It binds both mRNA
          and rRNA fragments due to an RNA recognition motif near
          its C-terminus.
          Length = 114

 Score = 27.6 bits (62), Expect = 5.0
 Identities = 15/24 (62%), Positives = 18/24 (75%)

Query: 19 ISRGKEEETEEEKKKKKKKKKKKK 42
          + R KEE  EEE++ KKKKKK KK
Sbjct: 71 LIRNKEELEEEEEEAKKKKKKGKK 94


>gnl|CDD|148072 pfam06244, DUF1014, Protein of unknown function (DUF1014).  This
          family consists of several hypothetical eukaryotic
          proteins of unknown function.
          Length = 122

 Score = 27.8 bits (62), Expect = 5.1
 Identities = 11/36 (30%), Positives = 20/36 (55%)

Query: 20 SRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEE 55
          +R + EE  E++++K +  ++K KK    E   EE 
Sbjct: 1  TRAQIEERLEKEQRKLEAPEEKSKKNAAREVPLEEN 36



 Score = 27.8 bits (62), Expect = 5.2
 Identities = 8/35 (22%), Positives = 19/35 (54%)

Query: 29 EEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLHN 63
          EE+ +K+++K +  ++K  +   +E   +  L  N
Sbjct: 6  EERLEKEQRKLEAPEEKSKKNAAREVPLEENLNVN 40


>gnl|CDD|225871 COG3334, COG3334, Uncharacterized conserved protein [Function
          unknown].
          Length = 192

 Score = 28.6 bits (64), Expect = 5.1
 Identities = 10/56 (17%), Positives = 17/56 (30%)

Query: 4  KGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
            G E ED   E+       + E E+           +    + E  EK ++    
Sbjct: 34 PVGAEAEDAAAELAEKKAAAQSEIEKFCANIADAAADQLYALQKELLEKLKDLAEV 89


>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
           [Function unknown].
          Length = 294

 Score = 28.9 bits (65), Expect = 5.2
 Identities = 13/61 (21%), Positives = 29/61 (47%)

Query: 3   EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLH 62
            +  ++ ++  +E   +     E  ++ K  + K+K  K+++KR E +E+ EE   K   
Sbjct: 217 VELSKKIDELHEEFRNLQNELRELEKKIKALRAKEKAAKRREKREELKERAEEIYEKFKR 276

Query: 63  N 63
            
Sbjct: 277 G 277


>gnl|CDD|146016 pfam03179, V-ATPase_G, Vacuolar (H+)-ATPase G subunit.  This
          family represents the eukaryotic vacuolar (H+)-ATPase
          (V-ATPase) G subunit. V-ATPases generate an acidic
          environment in several intracellular compartments.
          Correspondingly, they are found as membrane-attached
          proteins in several organelles. They are also found in
          the plasma membranes of some specialised cells.
          V-ATPases consist of peripheral (V1) and membrane
          integral (V0) heteromultimeric complexes. The G subunit
          is part of the V1 subunit, but is also thought to be
          strongly attached to the V0 complex. It may be involved
          in the coupling of ATP degradation to H+ translocation.
          Length = 105

 Score = 27.5 bits (62), Expect = 5.3
 Identities = 10/36 (27%), Positives = 21/36 (58%), Gaps = 2/36 (5%)

Query: 24 EEETEEEKKKKKKKKKKKKKKK--RYEEEEKEEEKD 57
          E+E EE + +++ + K+ + +      E EK+ EK+
Sbjct: 39 EKEIEEYRAQREAEFKEFEAEHSGSRGELEKKIEKE 74


>gnl|CDD|235505 PRK05563, PRK05563, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 559

 Score = 29.1 bits (66), Expect = 5.4
 Identities = 13/51 (25%), Positives = 29/51 (56%)

Query: 8   EKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDN 58
           E+E ++ +   +    E++ ++++KKK KKKK K  + +  +  KE  + +
Sbjct: 382 EQELKQLKAQPVGVAPEQKEKKKEKKKNKKKKYKVPRGKIYKVLKEATRQD 432


>gnl|CDD|206402 pfam14234, DUF4336, Domain of unknown function (DUF4336). 
          Length = 314

 Score = 28.7 bits (65), Expect = 5.5
 Identities = 13/27 (48%), Positives = 17/27 (62%), Gaps = 1/27 (3%)

Query: 83  QAPVLLSPGIHSFPFKLGLP-LGLPST 108
           +A V ++PG  SFP  L L  LG+P T
Sbjct: 84  KAQVWVAPGQWSFPLNLPLSWLGIPRT 110



 Score = 28.7 bits (65), Expect = 5.5
 Identities = 13/27 (48%), Positives = 17/27 (62%), Gaps = 1/27 (3%)

Query: 112 QAPVLLSPGIHSFPFKLGLP-LGLPST 137
           +A V ++PG  SFP  L L  LG+P T
Sbjct: 84  KAQVWVAPGQWSFPLNLPLSWLGIPRT 110


>gnl|CDD|220297 pfam09581, Spore_III_AF, Stage III sporulation protein AF
           (Spore_III_AF).  This family represents the stage III
           sporulation protein AF (Spore_III_AF) of the bacterial
           endospore formation program, which exists in some but
           not all members of the Firmicutes (formerly called
           low-GC Gram-positives). The C-terminal region of these
           proteins is poorly conserved.
          Length = 185

 Score = 28.4 bits (64), Expect = 5.5
 Identities = 9/55 (16%), Positives = 23/55 (41%)

Query: 1   MNEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEE 55
           ++E       D ++    +    +E+ + + +      +  K K+  EE E+ E+
Sbjct: 111 IDEDLESNNFDIKEVNVTLKEESKEKQKSKVEPVVIDTQTSKPKEEEEESEEAEK 165


>gnl|CDD|217940 pfam04177, TAP42, TAP42-like family.  The TOR signalling pathway
           activates a cell-growth program in response to
           nutrients. TIP41 (pfam04176) interacts with TAP42 and
           negatively regulates the TOR signaling pathway.
          Length = 335

 Score = 28.8 bits (65), Expect = 5.5
 Identities = 8/51 (15%), Positives = 23/51 (45%), Gaps = 7/51 (13%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKD 57
           +E  +QE + GG+        +   +     ++++  +   E++++E  K 
Sbjct: 281 DEYLEQEMKRGGV-------PQGGGEAAASAEEEEDDEDDEEDDDEETLKA 324


>gnl|CDD|218660 pfam05620, DUF788, Protein of unknown function (DUF788).  This
           family consists of several eukaryotic proteins of
           unknown function.
          Length = 166

 Score = 28.5 bits (64), Expect = 5.6
 Identities = 8/21 (38%), Positives = 14/21 (66%)

Query: 24  EEETEEEKKKKKKKKKKKKKK 44
             ET E K K+++K +K+ +K
Sbjct: 146 GAETNETKSKRQEKLEKRGEK 166



 Score = 28.1 bits (63), Expect = 6.4
 Identities = 6/26 (23%), Positives = 13/26 (50%)

Query: 20  SRGKEEETEEEKKKKKKKKKKKKKKK 45
               +     E K K+++K +K+ +K
Sbjct: 141 LPSSQGAETNETKSKRQEKLEKRGEK 166


>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
          Length = 1094

 Score = 29.1 bits (65), Expect = 5.6
 Identities = 14/23 (60%), Positives = 18/23 (78%)

Query: 31  KKKKKKKKKKKKKKKRYEEEEKE 53
           +KKKKKKK K+KKKK+  E  K+
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQ 706


>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
           Mitofilin controls mitochondrial cristae morphology.
           Mitofilin is enriched in the narrow space between the
           inner boundary and the outer membranes, where it forms a
           homotypic interaction and assembles into a large
           multimeric protein complex. The first 78 amino acids
           contain a typical amino-terminal-cleavable mitochondrial
           presequence rich in positive-charged and hydroxylated
           residues and a membrane anchor domain. In addition, it
           has three centrally located coiled coil domains.
          Length = 493

 Score = 29.2 bits (66), Expect = 5.6
 Identities = 16/63 (25%), Positives = 37/63 (58%), Gaps = 3/63 (4%)

Query: 1   MNEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKK-KKRYEEEEKEEEKDNK 59
           + EK  E     E+E+  ++R + +E   EK+ + + +++K++ +K+YEE+ ++E +   
Sbjct: 194 LKEKREELLSKLEEEL--LARLESKEAALEKQLRLEFEREKEELRKKYEEKLRQELERQA 251

Query: 60  LLH 62
             H
Sbjct: 252 EAH 254


>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like).  This
          family includes outer membrane proteins such as OmpH
          among others. Skp (OmpH) has been characterized as a
          molecular chaperone that interacts with unfolded
          proteins as they emerge in the periplasm from the Sec
          translocation machinery.
          Length = 140

 Score = 27.9 bits (63), Expect = 5.6
 Identities = 11/40 (27%), Positives = 25/40 (62%)

Query: 22 GKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
          GK  + + EK+ KK++ + +K +K  ++ +++ +KD   L
Sbjct: 16 GKAAQKQLEKEFKKRQAELEKLEKELQKLKEKLQKDAATL 55


>gnl|CDD|107059 PHA01746, PHA01746, hypothetical protein.
          Length = 131

 Score = 28.0 bits (62), Expect = 5.7
 Identities = 11/27 (40%), Positives = 21/27 (77%)

Query: 24  EEETEEEKKKKKKKKKKKKKKKRYEEE 50
           EE+  E++++K++K ++K +KK  EEE
Sbjct: 104 EEDDMEQQEEKEEKGREKGRKKNVEEE 130



 Score = 27.2 bits (60), Expect = 9.2
 Identities = 11/29 (37%), Positives = 23/29 (79%)

Query: 23  KEEETEEEKKKKKKKKKKKKKKKRYEEEE 51
           +EEE + E++++K++K ++K +K+  EEE
Sbjct: 102 QEEEDDMEQQEEKEEKGREKGRKKNVEEE 130


>gnl|CDD|205480 pfam13300, DUF4078, Domain of unknown function (DUF4078).  This
          family is found from fungi to humans, but its exact
          function is not known.
          Length = 88

 Score = 27.2 bits (61), Expect = 5.7
 Identities = 8/37 (21%), Positives = 25/37 (67%), Gaps = 1/37 (2%)

Query: 21 RGKE-EETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
          R ++ EE E+ +++ ++++K+++++K   +   EE +
Sbjct: 45 RKEQMEELEKAREETERERKEREERKEKRKRAIEERR 81


>gnl|CDD|220535 pfam10037, MRP-S27, Mitochondrial 28S ribosomal protein S27.
           Members of this family of small ribosomal proteins
           possess one of three conserved blocks of sequence found
           in proteins that stimulate the dissociation of guanine
           nucleotides from G-proteins, leaving open the
           possibility that MRP-S27 might be a functional partner
           of GTP-binding ribosomal proteins.
          Length = 417

 Score = 29.0 bits (65), Expect = 5.7
 Identities = 11/50 (22%), Positives = 24/50 (48%), Gaps = 2/50 (4%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
           E     EKE   +   +++    E +K  ++KKK +++   +EE ++   
Sbjct: 366 ERLPTIEKEDLELYEQRQQLWFFENRKLWQRKKKLREQA--DEEYQQRHA 413


>gnl|CDD|206063 pfam13892, DBINO, DNA-binding domain.  DBINO is a DNA-binding
           domain found on global transcription activator SNF2L1
           proteins and chromatin re-modelling proteins.
          Length = 140

 Score = 28.0 bits (63), Expect = 5.8
 Identities = 18/61 (29%), Positives = 33/61 (54%), Gaps = 7/61 (11%)

Query: 14  KEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEE----KDNKLLHNCYLYSY 69
           +E+    +  E+E  E +K+ +K+  ++ KK   EEE +E +    K N L+    LYS+
Sbjct: 72  REMLLFWKKNEKEERELRKRAEKEALEQAKK---EEELREAKRQQRKLNFLITQTELYSH 128

Query: 70  Y 70
           +
Sbjct: 129 F 129


>gnl|CDD|238706 cd01418, Ribosomal_L19e_A, Ribosomal protein L19e, archaeal.
          L19e is found in the large ribosomal subunit of
          eukaryotes and archaea. L19e is distinct from the
          ribosomal subunit L19, which is found in prokaryotes.
          It consists of two small globular domains connected by
          an extended segment. It is located toward the surface
          of the large subunit, with one exposed end involved in
          forming the intersubunit bridge with the small subunit.
           The other exposed end is involved in forming the
          translocon binding site, along with L22, L23, L24, L29,
          and L31e subunits.
          Length = 145

 Score = 28.1 bits (63), Expect = 5.8
 Identities = 15/46 (32%), Positives = 27/46 (58%), Gaps = 3/46 (6%)

Query: 18 GISRGKEEETEEEKKKKKKKK---KKKKKKKRYEEEEKEEEKDNKL 60
          GISRG+ +E  E++KK +++    +K KK  R  ++E+  +    L
Sbjct: 54 GISRGRLKERHEKRKKGRRRGPGSRKGKKGARTPKKERWIKTIRAL 99


>gnl|CDD|135898 PRK06397, PRK06397, V-type ATP synthase subunit H; Validated.
          Length = 111

 Score = 27.6 bits (61), Expect = 5.8
 Identities = 13/51 (25%), Positives = 23/51 (45%)

Query: 9  KEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
          K +QE E+       EE+ ++ +++          + R E E+K  E  NK
Sbjct: 30 KNEQENEIKEAKSKYEEKAKKTEEESLNMYNAALMEARKEAEKKAVEIINK 80


>gnl|CDD|227577 COG5252, COG5252, Uncharacterized conserved protein, contains
          CCCH-type Zn-finger protein [General function
          prediction only].
          Length = 299

 Score = 28.9 bits (64), Expect = 5.8
 Identities = 11/30 (36%), Positives = 21/30 (70%)

Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEK 52
          K+ ET   K++ +KK+K + ++KR E E++
Sbjct: 46 KQIETLNLKEQLEKKEKMRMEEKRREPEKQ 75


>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4;
          Provisional.
          Length = 191

 Score = 28.2 bits (63), Expect = 6.0
 Identities = 16/49 (32%), Positives = 24/49 (48%)

Query: 10 EDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDN 58
          +D   E   I   K  + ++   +KKK  KK KKKK+ +EE  E   + 
Sbjct: 36 DDIFPEDVEIPSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEE 84



 Score = 28.2 bits (63), Expect = 6.9
 Identities = 12/33 (36%), Positives = 18/33 (54%)

Query: 31 KKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLHN 63
          KKK   +KKK  KK + +++EKEE  +      
Sbjct: 53 KKKTTPRKKKTTKKTKKKKKEKEEVPELAAEEL 85



 Score = 28.2 bits (63), Expect = 7.0
 Identities = 13/40 (32%), Positives = 22/40 (55%), Gaps = 4/40 (10%)

Query: 24 EEETEEEKKKKKKKKKKKKK----KKRYEEEEKEEEKDNK 59
          E  +++  KKKK   +KKK     KK+ +E+E+  E   +
Sbjct: 44 EIPSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAE 83


>gnl|CDD|221643 pfam12572, DUF3752, Protein of unknown function (DUF3752).  This
           domain family is found in eukaryotes, and is typically
           between 140 and 163 amino acids in length.
          Length = 148

 Score = 28.1 bits (63), Expect = 6.0
 Identities = 14/59 (23%), Positives = 26/59 (44%), Gaps = 7/59 (11%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKK-------KRYEEEEKEEEKDN 58
              E         S    +  +E  +K K+  +KK+ K       K+ ++++KEEE D+
Sbjct: 51  LGVEAGASAPAAASAKASKRDKEMARKVKEYNEKKRGKSLVEQHQKKQKKKKKEEENDD 109


>gnl|CDD|237276 PRK13024, PRK13024, bifunctional preprotein translocase subunit
           SecD/SecF; Reviewed.
          Length = 755

 Score = 29.0 bits (66), Expect = 6.0
 Identities = 12/23 (52%), Positives = 17/23 (73%)

Query: 32  KKKKKKKKKKKKKKRYEEEEKEE 54
           +K++ KKKKK+KK +  E EK E
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEKIE 755


>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572).  Family of
           eukaryotic proteins with undetermined function.
          Length = 321

 Score = 28.9 bits (65), Expect = 6.1
 Identities = 12/53 (22%), Positives = 29/53 (54%), Gaps = 8/53 (15%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
            EK+++E+E        EEE E   K      + ++ ++R ++E+ E+++++ 
Sbjct: 179 REKKEEEEE--------EEEDEALIKSLSFGPETEEDRRRADDEDSEDDEEDN 223


>gnl|CDD|227238 COG4901, COG4901, Ribosomal protein S25 [Translation, ribosomal
          structure and biogenesis].
          Length = 107

 Score = 27.6 bits (61), Expect = 6.1
 Identities = 16/39 (41%), Positives = 22/39 (56%)

Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
          KE++ E+ K    K KKK  KKK+ EE  +    D +LL
Sbjct: 9  KEKKAEKAKAGTAKDKKKWSKKKKKEEARRAVTVDEELL 47


>gnl|CDD|222417 pfam13851, GAS, Growth-arrest specific micro-tubule binding.  This
           family is the highly conserved central region of a
           number of metazoan proteins referred to as growth-arrest
           proteins. In mouse, Gas8 is predominantly a testicular
           protein, whose expression is developmentally regulated
           during puberty and spermatogenesis. In humans, it is
           absent in infertile males who lack the ability to
           generate gametes. The localisation of Gas8 in the
           motility apparatus of post-meiotic gametocytes and
           mature spermatozoa, together with the detection of Gas8
           also in cilia at the apical surfaces of epithelial cells
           lining the pulmonary bronchi and Fallopian tubes
           suggests that the Gas8 protein may have a role in the
           functioning of motile cellular appendages. Gas8 is a
           microtubule-binding protein localised to regions of
           dynein regulation in mammalian cells.
          Length = 201

 Score = 28.3 bits (64), Expect = 6.3
 Identities = 16/65 (24%), Positives = 32/65 (49%), Gaps = 4/65 (6%)

Query: 2   NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYE--EEEKEE--EKD 57
            E+  ++ +D EK+   +   K    E EK+ K  K + +  ++R+E  E E++E  +K 
Sbjct: 71  VEELRKKLKDYEKDKQSLKNLKARLKELEKELKNLKWESEVLEQRFEKVERERDELYDKF 130

Query: 58  NKLLH 62
              + 
Sbjct: 131 EAAIQ 135


>gnl|CDD|218688 pfam05680, ATP-synt_E, ATP synthase E chain.  This family
          consists of several ATP synthase E chain sequences
          which are components of the CF(0) subunit.
          Length = 83

 Score = 27.1 bits (60), Expect = 6.4
 Identities = 11/38 (28%), Positives = 19/38 (50%)

Query: 23 KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
          K  E E ++K  +K K    KKK+ ++E K   + +  
Sbjct: 38 KIREYEAQEKLIEKAKAAYAKKKQAKQEAKSLAEGSSA 75


>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like).  This family
          includes outer membrane proteins such as OmpH among
          others. Skp (OmpH) has been characterized as a
          molecular chaperone that interacts with unfolded
          proteins as they emerge in the periplasm from the Sec
          translocation machinery.
          Length = 157

 Score = 28.0 bits (63), Expect = 6.5
 Identities = 13/40 (32%), Positives = 25/40 (62%)

Query: 22 GKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
          GK  + + EK+ KK + + +KK+K  ++EE++ +K    L
Sbjct: 33 GKAAQKQLEKEFKKLQAELQKKEKELQKEEQKLQKQAATL 72


>gnl|CDD|232861 TIGR00181, pepF, oligoendopeptidase F.  This family represents
          the oligoendopeptidase F clade of the family of larger
          M3 or thimet (for thiol-dependent metallopeptidase)
          oligopeptidase family. Lactococcus lactis PepF
          hydrolyzed peptides of 7 and 17 amino acids with fairly
          broad specificity. The homolog of lactococcal PepF in
          group B Streptococcus was named PepB (PMID:8757883),
          with the name difference reflecting a difference in
          species of origin rather activity; substrate profiles
          were quite similar. Differences in substrate
          specificity should be expected in other species. The
          gene is duplicated in Lactococcus lactis on the plasmid
          that bears it. A shortened second copy is found in
          Bacillus subtilis [Protein fate, Degradation of
          proteins, peptides, and glycopeptides].
          Length = 591

 Score = 28.8 bits (65), Expect = 6.6
 Identities = 18/63 (28%), Positives = 27/63 (42%), Gaps = 7/63 (11%)

Query: 7  EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLHNCYL 66
          + KE+ E  +       E   E+ K+ K  KK     K+ + E    EEK   LL+  Y 
Sbjct: 17 KNKEEWELFL-------EALEEDIKEIKAFKKGLLHSKETFLEALALEEKILILLNRLYN 69

Query: 67 YSY 69
          Y+ 
Sbjct: 70 YAS 72


>gnl|CDD|221339 pfam11957, efThoc1, THO complex subunit 1 transcription elongation
           factor.  The THO complex plays a role in coupling
           transcription elongation to mRNA export. It is composed
           of subunits THP2, HPR1, THO2 and MFT1. The THO complex
           is a nuclear complex that is required for transcription
           elongation through genes containing tandemly repeated
           DNA sequences. The THO complex is also part of the TREX
           (TRanscription EXport) complex that is involved in
           coupling transcription to export of mRNAs to the
           cytoplasm.
          Length = 494

 Score = 28.9 bits (65), Expect = 6.6
 Identities = 12/50 (24%), Positives = 17/50 (34%), Gaps = 2/50 (4%)

Query: 23  KEEETEEEKKKKKKKKKK--KKKKKRYEEEEKEEEKDNKLLHNCYLYSYY 70
            E E   + K K  + K   +   KR       EE  +   +  YL S  
Sbjct: 185 IETELYNKIKSKTAEIKVDSRALNKRKRNPLDGEELASSFFNPKYLTSRD 234


>gnl|CDD|177779 PLN00185, PLN00185, 60S ribosomal protein L4-1; Provisional.
          Length = 405

 Score = 28.6 bits (64), Expect = 6.6
 Identities = 11/34 (32%), Positives = 17/34 (50%)

Query: 23  KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
           +     E  + K KK+K  KK+K+  +EE    K
Sbjct: 342 RMALLAEAARVKAKKEKLAKKRKQVSKEEAAAIK 375


>gnl|CDD|235245 PRK04183, PRK04183, glutamyl-tRNA(Gln) amidotransferase subunit
          D; Validated.
          Length = 419

 Score = 28.7 bits (65), Expect = 6.6
 Identities = 9/36 (25%), Positives = 15/36 (41%)

Query: 18 GISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKE 53
          GI   K  E E  +K +K K++   K+   +     
Sbjct: 42 GIDIDKIAEIELLEKGEKPKQEPPPKEIEKDPGLPN 77


>gnl|CDD|220839 pfam10661, EssA, WXG100 protein secretion system (Wss), protein
          EssA.  The WXG100 protein secretion system (Wss) is
          responsible for the secretion of WXG100 proteins
          (pfam06013) such as ESAT-6 and CFP-10 in Mycobacterium
          tuberculosis or EsxA and EsxB in Staphylococcus aureus.
          In S. aureus, the Wss seems to be encoded by a locus of
          eight CDS, called ess (eSAT-6 secretion system). This
          locus encodes, amongst several other proteins, EssA, a
          protein predicted to possess one transmembrane domain.
          Due to its predicted membrane location and its absolute
          requirement for WXG100 protein secretion, it has been
          speculated that EssA could form a secretion apparatus
          in conjunction with the polytopic membrane protein
          EsaA, YukC (pfam10140) and YukAB, which is a
          membrane-bound ATPase containing Ftsk/SpoIIIE domains
          (pfam01580) called EssC in S. aureus and Snm1/Snm2 in
          Mycobacterium tuberculosis. Proteins homologous to
          EssA, YukC, EsaA and YukD seem absent from
          mycobacteria.
          Length = 145

 Score = 27.9 bits (62), Expect = 6.7
 Identities = 20/72 (27%), Positives = 31/72 (43%), Gaps = 1/72 (1%)

Query: 3  EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKK-KKRYEEEEKEEEKDNKLL 61
          EK  ++  + E +  GI     E  EE  KKK  ++K+ +  +     E+KE     K  
Sbjct: 28 EKNEKKLRETELDKLGIELFTTETEEEINKKKNAEQKEMEDIENSLFSEDKEGNVAVKET 87

Query: 62 HNCYLYSYYRVT 73
           +    S Y VT
Sbjct: 88 KDSLFSSEYEVT 99


>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein.  Function of MutS2 is
           unknown. It should not be considered a DNA mismatch
           repair protein. It is likely a DNA mismatch binding
           protein of unknown cellular function [DNA metabolism,
           Other].
          Length = 771

 Score = 29.0 bits (65), Expect = 6.9
 Identities = 18/68 (26%), Positives = 31/68 (45%), Gaps = 3/68 (4%)

Query: 2   NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
               GE KE+    +  +S     E E E+K +  +K  K+++K  +E E+E E+  +  
Sbjct: 503 KTFYGEFKEEINVLIEKLS---ALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERE 559

Query: 62  HNCYLYSY 69
            N  L   
Sbjct: 560 RNKKLELE 567


>gnl|CDD|216295 pfam01093, Clusterin, Clusterin. 
          Length = 434

 Score = 28.9 bits (65), Expect = 6.9
 Identities = 23/61 (37%), Positives = 33/61 (54%), Gaps = 10/61 (16%)

Query: 6  GEEKEDQE--KEVGGISRGKE--EETEEEKKK-----KKKKKKKKKKKKRYEE-EEKEEE 55
          GE+  D+E    + G+ + K   E TEEE K      +K KK+K++  K   E EEK EE
Sbjct: 10 GEKYVDEEVKNALIGVKQMKTLMERTEEEHKNLMSTLEKTKKEKEEALKLANEVEEKLEE 69

Query: 56 K 56
          +
Sbjct: 70 E 70


>gnl|CDD|240292 PTZ00148, PTZ00148, 40S ribosomal protein S8; Provisional.
          Length = 205

 Score = 28.4 bits (64), Expect = 6.9
 Identities = 15/44 (34%), Positives = 26/44 (59%)

Query: 18  GISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
           GI  GK+++ ++E++ KK KKK +    + E  +K  + D  LL
Sbjct: 118 GIDLGKKKKYKKEEENKKAKKKSESLVAKLEVRKKNHKIDKALL 161


>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
          Length = 562

 Score = 28.8 bits (65), Expect = 6.9
 Identities = 7/41 (17%), Positives = 18/41 (43%)

Query: 8   EKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYE 48
           +   +E +   +   +E    +++  K  K K +  K++Y 
Sbjct: 364 KAAIEELQAEFVDNAEELAKLQDELDKIVKTKSELVKEKYH 404


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
           Fas-binding protein) is thought to play a role in
           apoptosis, but precise role played by Daxx remains to be
           determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 28.7 bits (64), Expect = 7.1
 Identities = 8/32 (25%), Positives = 22/32 (68%)

Query: 23  KEEETEEEKKKKKKKKKKKKKKKRYEEEEKEE 54
           K+++TEEE+++K+++++++    R  +  K  
Sbjct: 394 KQDDTEEEERRKRQERERQGTSSRSSDPSKAS 425



 Score = 28.3 bits (63), Expect = 8.8
 Identities = 14/36 (38%), Positives = 28/36 (77%)

Query: 24  EEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
           EEE EEE+++++++++ ++++   EEEE+E E DN 
Sbjct: 446 EEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNG 481


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 28.9 bits (65), Expect = 7.1
 Identities = 11/50 (22%), Positives = 23/50 (46%), Gaps = 4/50 (8%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
           EE E++ +E+         + E      KK++ + + + R  E + EE +
Sbjct: 864 EELEEELEEL----EAALRDLESRLGDLKKERDELEAQLRELERKIEELE 909



 Score = 28.5 bits (64), Expect = 9.8
 Identities = 13/58 (22%), Positives = 23/58 (39%), Gaps = 4/58 (6%)

Query: 3   EKGGEEKEDQEKEVGGI----SRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
           EK   EKE QE +   I         E+  E    KK++ +++ ++      + E   
Sbjct: 827 EKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRL 884


>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
           Transcription of the anti-viral guanylate-binding
           protein (GBP) is induced by interferon-gamma during
           macrophage induction. This family contains GBP1 and
           GPB2, both GTPases capable of binding GTP, GDP and GMP.
          Length = 297

 Score = 28.4 bits (64), Expect = 7.3
 Identities = 9/32 (28%), Positives = 19/32 (59%), Gaps = 3/32 (9%)

Query: 27  TEEEKKKKKKKKKK---KKKKKRYEEEEKEEE 55
           T +EK  + ++ K    + +++   E++KEEE
Sbjct: 200 TAKEKAIEAERAKAEAAEAEQELLREKQKEEE 231


>gnl|CDD|236012 PRK07406, PRK07406, RNA polymerase sigma factor RpoD; Validated.
          Length = 373

 Score = 28.6 bits (64), Expect = 7.3
 Identities = 12/50 (24%), Positives = 22/50 (44%)

Query: 1  MNEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEE 50
            +    +  D +++    S   +E +E+E K K   + K   KK Y E+
Sbjct: 14 DRQPAATDDLDSDEDEESSSENADELSEKEAKAKALARIKIGPKKVYTED 63


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
           consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 28.8 bits (64), Expect = 7.5
 Identities = 16/55 (29%), Positives = 25/55 (45%), Gaps = 3/55 (5%)

Query: 10  EDQEKEVGGISRGKEEETEEEKKKKKKKK---KKKKKKKRYEEEEKEEEKDNKLL 61
           E+Q++E+       ++  EE  K K  K    K++ K    E E+KE E   K  
Sbjct: 277 ENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKRE 331


>gnl|CDD|235302 PRK04456, PRK04456, acetyl-CoA decarbonylase/synthase complex
           subunit beta; Reviewed.
          Length = 463

 Score = 28.5 bits (64), Expect = 7.5
 Identities = 17/54 (31%), Positives = 29/54 (53%)

Query: 27  TEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLHNCYLYSYYRVTHICFSDP 80
           T+EEK +++ +K ++  KKR E      E+D  + + C L   +  TH+C   P
Sbjct: 154 TDEEKVEEELEKAREIYKKRDERARDLHEEDVDVFYGCTLCQSFAPTHVCVITP 207


>gnl|CDD|235778 PRK06319, PRK06319, DNA topoisomerase I/SWI domain fusion protein;
           Validated.
          Length = 860

 Score = 28.6 bits (64), Expect = 7.6
 Identities = 11/27 (40%), Positives = 14/27 (51%)

Query: 20  SRGKEEETEEEKKKKKKKKKKKKKKKR 46
           S   + +  +  KKK K K KK  KKR
Sbjct: 757 SASTKGKAAKTVKKKSKAKSKKTTKKR 783


>gnl|CDD|237622 PRK14140, PRK14140, heat shock protein GrpE; Provisional.
          Length = 191

 Score = 28.0 bits (63), Expect = 7.8
 Identities = 18/60 (30%), Positives = 29/60 (48%), Gaps = 3/60 (5%)

Query: 2  NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
          NE+  EE E+ E E       +EE  EEE + +   +++ K     E E K +E + + L
Sbjct: 5  NEQVEEEVEETEVEEAVEDEVEEETVEEESEAELLDEEQAKIA---ELEAKLDELEERYL 61


>gnl|CDD|218769 pfam05833, FbpA, Fibronectin-binding protein A N-terminus (FbpA).
           This family consists of the N-terminal region of the
           prokaryotic fibronectin-binding protein. Fibronectin
           binding is considered to be an important virulence
           factor in streptococcal infections. Fibronectin is a
           dimeric glycoprotein that is present in a soluble form
           in plasma and extracellular fluids; it is also present
           in a fibrillar form on cell surfaces. Both the soluble
           and cellular forms of fibronectin may be incorporated
           into the extracellular tissue matrix. While fibronectin
           has critical roles in eukaryotic cellular processes,
           such as adhesion, migration and differentiation, it is
           also a substrate for the attachment of bacteria. The
           binding of pathogenic Streptococcus pyogenes and
           Staphylococcus aureus to epithelial cells via
           fibronectin facilitates their internalisation and
           systemic spread within the host.
          Length = 447

 Score = 28.4 bits (64), Expect = 7.8
 Identities = 14/23 (60%), Positives = 18/23 (78%)

Query: 23  KEEETEEEKKKKKKKKKKKKKKK 45
           +EE  E+   K+KK+KKKKKKKK
Sbjct: 425 REELIEQGYLKEKKRKKKKKKKK 447


>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
          Length = 1437

 Score = 28.7 bits (65), Expect = 7.9
 Identities = 10/50 (20%), Positives = 25/50 (50%)

Query: 11  DQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
           D  KE       ++EE +E+  K+  +  KK + ++ ++ +  + K+  +
Sbjct: 166 DDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPV 215


>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
           subunit G; Reviewed.
          Length = 197

 Score = 28.1 bits (62), Expect = 7.9
 Identities = 15/54 (27%), Positives = 31/54 (57%), Gaps = 4/54 (7%)

Query: 3   EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
           E+  +E+EDQE++     R +EE+T EE+    + +  ++ +   E ++ + EK
Sbjct: 148 EEKIDEREDQEEQ----EREREEQTIEEQSDDSEHEIIEQDESETESDDDKTEK 197


>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein.  CDC45 is an essential gene
           required for initiation of DNA replication in S.
           cerevisiae, forming a complex with MCM5/CDC46.
           Homologues of CDC45 have been identified in human, mouse
           and smut fungus among others.
          Length = 583

 Score = 28.8 bits (65), Expect = 7.9
 Identities = 10/69 (14%), Positives = 29/69 (42%), Gaps = 10/69 (14%)

Query: 2   NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
           +++  EE    E +        +++    ++  ++++++++           EEK  +L 
Sbjct: 133 SDEEDEESSKSEDDEDDDDDDDDDDIATRERSLERRRRRREW----------EEKRAELE 182

Query: 62  HNCYLYSYY 70
              Y  +YY
Sbjct: 183 FYYYQGTYY 191


>gnl|CDD|153331 cd07647, F-BAR_PSTPIP, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Proline-Serine-Threonine
           Phosphatase-Interacting Proteins.  F-BAR domains are
           dimerization modules that bind and bend membranes and
           are found in proteins involved in membrane dynamics and
           actin reorganization. Vetebrates contain two
           Proline-Serine-Threonine Phosphatase-Interacting
           Proteins (PSTPIPs), PSTPIP1 and PSTPIP2. PSTPIPs are
           mainly expressed in hematopoietic cells and are involved
           in the regulation of cell adhesion and motility.
           Mutations in PSTPIPs have been shown to cause
           autoinflammatory disorders. PSTPIP1 contains an
           N-terminal F-BAR domain, PEST motifs, and a C-terminal
           SH3 domain, while PSTPIP2 contains only the N-terminal
           F-BAR domain. F-BAR domains form banana-shaped dimers
           with a positively-charged concave surface that binds to
           negatively-charged lipid membranes. They can induce
           membrane deformation in the form of long tubules.
          Length = 239

 Score = 28.2 bits (63), Expect = 8.0
 Identities = 21/56 (37%), Positives = 33/56 (58%), Gaps = 7/56 (12%)

Query: 3   EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKK--KKKKKKKKRYEEEEKEEEK 56
           EK  E +E Q++E     R K E+  +  +K KK+  KK  K KK YE++ +E++K
Sbjct: 88  EKLEEFREKQKEE-----RKKTEDIMKRSQKNKKELYKKTMKAKKSYEQKCREKDK 138


>gnl|CDD|204055 pfam08764, Coagulase, Staphylococcus aureus coagulase.
           Staphylococcus aureus secretes a cofactor called
           coagulase. Coagulase is an extracellular protein that
           forms a complex with human prothrombin, and activates it
           without the usual proteolytic cleavages. The resulting
           complex directly initiates blood clotting.
          Length = 282

 Score = 28.2 bits (63), Expect = 8.1
 Identities = 11/40 (27%), Positives = 22/40 (55%)

Query: 34  KKKKKKKKKKKKRYEEEEKEEEKDNKLLHNCYLYSYYRVT 73
           +KKK + +K KK Y++ ++E       +   + Y+ Y +T
Sbjct: 73  EKKKAEYEKYKKWYKKYKEENPNSKLKMKTFHKYNLYDLT 112


>gnl|CDD|147685 pfam05663, DUF809, Protein of unknown function (DUF809).  This
           family consists of several proteins of unknown function
           Raphanus sativus (Radish) and Brassica napus (Rape).
          Length = 138

 Score = 27.4 bits (60), Expect = 8.3
 Identities = 17/43 (39%), Positives = 28/43 (65%)

Query: 14  KEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
           K+V  I +G+ E  EE+K+ K + + K++KK+   E E +EEK
Sbjct: 87  KKVSPIIKGEIEGKEEKKEGKGEIEGKEEKKEGKGEIEGKEEK 129


>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
           membranes [Cell envelope biogenesis, outer membrane].
          Length = 244

 Score = 28.2 bits (63), Expect = 8.3
 Identities = 14/57 (24%), Positives = 25/57 (43%)

Query: 3   EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK 59
            K   E E   +      + K +  ++ KK K K K K K K + + + K ++  +K
Sbjct: 67  PKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKKPPSK 123


>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family.  This family of
           proteins contain a band 4.1 domain (pfam00373), at their
           amino terminus. This family represents the rest of these
           proteins.
          Length = 244

 Score = 28.2 bits (63), Expect = 8.3
 Identities = 11/57 (19%), Positives = 27/57 (47%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKLLHN 63
           EE+E  E EV   +    +  EE +KK+ + ++ +++ +  +E  +   ++      
Sbjct: 75  EERERLEAEVDEATAEVAKLEEEREKKEAETRQLQQELREAQEAHERARQELLEAAA 131


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 28.4 bits (64), Expect = 8.4
 Identities = 9/59 (15%), Positives = 21/59 (35%)

Query: 2   NEKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
            +   +  +      G +    +    +  KKK K   K    K   +++ ++E D+  
Sbjct: 46  KKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSK 104


>gnl|CDD|206050 pfam13879, KIAA1430, KIAA1430 homologue.  This is a family of
          KIAA1430 homologues. The function is not known.
          Length = 98

 Score = 26.9 bits (60), Expect = 8.4
 Identities = 11/33 (33%), Positives = 18/33 (54%), Gaps = 2/33 (6%)

Query: 31 KKKKKKKKKKKKKKKRYEEEE--KEEEKDNKLL 61
          K  ++      K KK  +EEE   E E++N++L
Sbjct: 16 KPPRRPAHLLLKLKKLQQEEERLAEIERENQIL 48


>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO.  Bacillus
           spores are protected by a protein shell consisting of
           over 50 different polypeptides, known as the coat. This
           family of proteins has an important morphogenetic role
           in coat assembly, it is involved in the assembly of at
           least 5 different coat proteins including CotB, CotG,
           CotS, CotSA and CotW. It is likely to act at a late
           stage of coat assembly.
          Length = 185

 Score = 27.9 bits (62), Expect = 8.4
 Identities = 18/47 (38%), Positives = 24/47 (51%)

Query: 6   GEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEK 52
            +  E QEKE       KEEE E+ K+++    K+KK  K    EEK
Sbjct: 76  EDIAEQQEKEEIAQEEEKEEEAEDVKQQEVFSFKRKKPFKEMNLEEK 122


>gnl|CDD|220129 pfam09159, Ydc2-catalyt, Mitochondrial resolvase Ydc2 / RNA
           splicing MRS1.  Members of this family adopt a secondary
           structure consisting of two beta sheets and one alpha
           helix, arranged as a beta-alpha-beta motif. Each beta
           sheet has five strands, arranged in a 32145 order, with
           the second strand being antiparallel to the rest.
           Mitochondrial resolvase Ydc2 is capable of resolving
           Holliday junctions and cleaves DNA after 5'-CT-3' and
           5'-TT-3' sequences. This family also contains the
           mitochondrial RNA-splicing protein MRS1 which is
           involved in the excision of group I introns.
          Length = 254

 Score = 28.1 bits (63), Expect = 8.5
 Identities = 10/22 (45%), Positives = 11/22 (50%)

Query: 24  EEETEEEKKKKKKKKKKKKKKK 45
             E   EKKK + KKK  K  K
Sbjct: 148 RTEILAEKKKPRSKKKSSKNSK 169



 Score = 28.1 bits (63), Expect = 9.7
 Identities = 10/26 (38%), Positives = 13/26 (50%)

Query: 21  RGKEEETEEEKKKKKKKKKKKKKKKR 46
              E   E++K + KKK  K  KK R
Sbjct: 147 ERTEILAEKKKPRSKKKSSKNSKKLR 172


>gnl|CDD|227666 COG5374, COG5374, Uncharacterized conserved protein [Function
           unknown].
          Length = 192

 Score = 27.8 bits (62), Expect = 8.5
 Identities = 13/51 (25%), Positives = 27/51 (52%)

Query: 10  EDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
           E+  K+ G I + + + T+ + + +K +   +  +K  EE  K  +K N+L
Sbjct: 129 EENAKKGGKIDKMEADSTDLKARLRKAQILLEGLQKNQEELFKLLDKYNEL 179


>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1.  All
           proteins in this family for which functions are known
           are cyclin dependent protein kinases that are components
           of TFIIH, a complex that is involved in nucleotide
           excision repair and transcription initiation. Also known
           as MAT1 (menage a trois 1). This family is based on the
           phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
           Stanford University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 309

 Score = 28.2 bits (63), Expect = 8.6
 Identities = 12/34 (35%), Positives = 19/34 (55%)

Query: 7   EEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKK 40
           EE  + EKE     R   ++ EEE++  K+K K+
Sbjct: 148 EEALEFEKEEEEQRRLLLQKEEEEQQMNKRKNKQ 181


>gnl|CDD|220121 pfam09135, Alb1, Alb1.  Alb1 is a nuclear shuttling factor
          involved in ribosome biogenesis.
          Length = 112

 Score = 27.1 bits (60), Expect = 8.6
 Identities = 11/42 (26%), Positives = 25/42 (59%)

Query: 9  KEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEE 50
          K    K  G ++  K E   ++K++KKK K+  +K+++ +++
Sbjct: 28 KRTDRKRPGILAASKNEGVSKKKQRKKKPKRLTRKQRKRQKK 69


>gnl|CDD|227474 COG5145, RAD14, DNA excision repair protein [DNA replication,
           recombination, and repair].
          Length = 292

 Score = 28.0 bits (62), Expect = 8.7
 Identities = 13/32 (40%), Positives = 22/32 (68%), Gaps = 1/32 (3%)

Query: 22  GKEEETEEEKKKKKKKKKKKKKKKRYEEEEKE 53
           G  EE + EK++++K K  +K+KK  E++ KE
Sbjct: 209 GSLEELDREKQRREKMKDDRKEKK-LEKKIKE 239


>gnl|CDD|219589 pfam07808, RED_N, RED-like protein N-terminal region.  This
          family contains sequences that are similar to the
          N-terminal region of Red protein. This and related
          proteins contain a RED repeat which consists of a
          number of RE and RD sequence elements. The region in
          question has several conserved NLS sequences and a
          putative trimeric coiled-coil region, suggesting that
          these proteins are expressed in the nucleus. The
          function of Red protein is unknown, but efficient
          sequestration to nuclear bodies suggests that its
          expression may be tightly regulated of that the protein
          self-aggregates extremely efficiently.
          Length = 238

 Score = 27.9 bits (62), Expect = 8.8
 Identities = 9/22 (40%), Positives = 13/22 (59%)

Query: 38 KKKKKKKKRYEEEEKEEEKDNK 59
          KKKKK     ++EE  E++ N 
Sbjct: 1  KKKKKYAYLRKQEENAEKEINP 22


>gnl|CDD|239572 cd03490, Topoisomer_IB_N_1, Topoisomer_IB_N_1: A subgroup of the
           N-terminal DNA binding fragment found in eukaryotic DNA
           topoisomerase (topo) IB. Topo IB proteins include the
           monomeric yeast and human topo I and heterodimeric topo
           I from Leishmania donvanni. Topo I enzymes are divided
           into:  topo type IA (bacterial) and type IB
           (eukaryotic). Topo I relaxes superhelical tension in
           duplex DNA by creating a single-strand nick, the broken
           strand can then rotate around the unbroken strand to
           remove DNA supercoils and, the nick is religated,
           liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit religation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  In addition to differences in
           structure and some biochemical properties,
           Trypanosomatid parasite topos I differ from human topo I
           in their sensitivity to CPTs and other classical topo I
           inhibitors. Trypanosomatid topos I have putative roles
           in organizing the kinetoplast DNA network unique to
           these parasites.  This family may represent more than
           one structural domain.
          Length = 217

 Score = 27.9 bits (62), Expect = 8.9
 Identities = 12/26 (46%), Positives = 19/26 (73%)

Query: 25  EETEEEKKKKKKKKKKKKKKKRYEEE 50
           EE +E+KK   K++K+ KKK+R + E
Sbjct: 94  EEEKEKKKNLNKEEKEAKKKERAKRE 119


>gnl|CDD|215565 PLN03083, PLN03083, E3 UFM1-protein ligase 1 homolog; Provisional.
          Length = 803

 Score = 28.6 bits (64), Expect = 8.9
 Identities = 13/43 (30%), Positives = 23/43 (53%)

Query: 3   EKGGEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKK 45
           +KG ++K+ +     G +     + EE+  KK KK +KK + K
Sbjct: 417 DKGSKKKKGKSTSTKGGTAESIPDDEEDAPKKGKKNQKKGRDK 459


>gnl|CDD|237496 PRK13766, PRK13766, Hef nuclease; Provisional.
          Length = 773

 Score = 28.7 bits (65), Expect = 8.9
 Identities = 17/42 (40%), Positives = 26/42 (61%)

Query: 19  ISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
            SR KE++ +EE K  K    KK ++   E++ +EEEKD +L
Sbjct: 488 SSRRKEKKMKEELKNLKGILNKKLQELDEEQKGEEEEKDEQL 529


>gnl|CDD|218328 pfam04921, XAP5, XAP5, circadian clock regulator.  This protein
          is found in a wide range of eukaryotes. It is a nuclear
          protein and is suggested to be DNA binding. In plants,
          this family is essential for correct circadian clock
          functioning by acting as a light-quality regulator
          coordinating the activities of blue and red light
          signalling pathways during plant growth - inhibiting
          growth in red light but promoting growth in blue light.
          Length = 233

 Score = 28.1 bits (63), Expect = 9.0
 Identities = 12/23 (52%), Positives = 19/23 (82%)

Query: 35 KKKKKKKKKKKRYEEEEKEEEKD 57
          KKKKKKKK K  + ++++EE++D
Sbjct: 1  KKKKKKKKSKLSFGDDDEEEDED 23


>gnl|CDD|235638 PRK05896, PRK05896, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 605

 Score = 28.3 bits (63), Expect = 9.0
 Identities = 11/39 (28%), Positives = 19/39 (48%), Gaps = 1/39 (2%)

Query: 23  KEEETEEEKKKKKKKKKKKKKKKRYEEE-EKEEEKDNKL 60
              +T+    KKK  +  K++  + EE  EKE  K ++ 
Sbjct: 410 TLYKTKIFYHKKKINQNNKEQDIKKEELLEKEFVKKSEK 448


>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
          Length = 746

 Score = 28.7 bits (65), Expect = 9.1
 Identities = 9/30 (30%), Positives = 17/30 (56%)

Query: 27 TEEEKKKKKKKKKKKKKKKRYEEEEKEEEK 56
           EEE +K++ K++ +++ K   EE    E 
Sbjct: 39 EEEEARKEEAKREAEEEAKAEAEEAAAAEA 68


>gnl|CDD|227441 COG5110, RPN1, 26S proteasome regulatory complex component
          [Posttranslational modification, protein turnover,
          chaperones].
          Length = 881

 Score = 28.4 bits (63), Expect = 9.2
 Identities = 14/42 (33%), Positives = 24/42 (57%)

Query: 19 ISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNKL 60
           S  K++  +E+ +   +K+   KK K+ EEEE+  E+D  L
Sbjct: 4  ESDKKQQTIDEQSQISPEKQTPNKKDKKKEEEEQLSEEDAML 45


>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3.  This family
           of proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). Shr3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of Shr3, AAPs are retained in the ER.
          Length = 196

 Score = 27.7 bits (62), Expect = 9.6
 Identities = 13/40 (32%), Positives = 18/40 (45%)

Query: 6   GEEKEDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKK 45
            E K+ ++KE       KE      KK    +K + KKKK
Sbjct: 157 AERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196


>gnl|CDD|212164 cd11650, AT4G37440_like, Uncharacterized protein domain conserved
           in plants.  This domain contains an extensive protein
           sequence fragment that appears conserved in a number of
           plant proteins, including the gene product of
           Arabidopsis thaliana locus AT4G37440, which has been
           identified in transcriptional profiling as expressed at
           different levels in white cabbage cultivars.
          Length = 253

 Score = 27.8 bits (62), Expect = 9.7
 Identities = 14/50 (28%), Positives = 24/50 (48%), Gaps = 13/50 (26%)

Query: 13  EKEVGGISRGKE-----EETEEEKKK--------KKKKKKKKKKKKRYEE 49
           ++E+  I + KE      + E E  K         ++K  K++K+KR EE
Sbjct: 114 DRELALIEQEKELQLEGTKAEGENSKSAPFSGERHERKVMKRRKRKRVEE 163


>gnl|CDD|233048 TIGR00605, rad4, DNA repair protein rad4.  All proteins in this
          family for which functions are known are involved in
          targeting nucleotide excision repair to specific
          regions of the genome.This family is based on the
          phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
          Stanford University) [DNA metabolism, DNA replication,
          recombination, and repair].
          Length = 713

 Score = 28.3 bits (63), Expect = 9.9
 Identities = 21/82 (25%), Positives = 34/82 (41%), Gaps = 7/82 (8%)

Query: 10 EDQEKEVGGISRGKEEETEEEKKKKKKKKKKKKKKKRYEEEEKEEEKDNK--LLHN---- 63
          ED  KE   + R  E E E EK+ K +++K +++ +      K+  K     L H     
Sbjct: 3  EDLAKEYFELIRKVENEKEAEKQPKSRRRKVRRENEPSLRRRKKRFKTGLNELPHEVVLM 62

Query: 64 CYLYSYYRVTHICFSDPPGQAP 85
          C L S +    +  S P   + 
Sbjct: 63 CNLDSTHSDDRVV-SVPDSLSV 83


>gnl|CDD|215349 PLN02647, PLN02647, acyl-CoA thioesterase.
          Length = 437

 Score = 28.2 bits (63), Expect = 9.9
 Identities = 16/42 (38%), Positives = 24/42 (57%), Gaps = 6/42 (14%)

Query: 26  ETEEEK------KKKKKKKKKKKKKKRYEEEEKEEEKDNKLL 61
           ETEEEK      + + K +KKK+ +++ E E  E E+   LL
Sbjct: 225 ETEEEKLLFEEAEARNKLRKKKRGEQKREFENGEAERLEALL 266


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.317    0.137    0.409 

Gapped
Lambda     K      H
   0.267   0.0677    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 22,926,567
Number of extensions: 2282164
Number of successful extensions: 14838
Number of sequences better than 10.0: 1
Number of HSP's gapped: 11304
Number of HSP's successfully gapped: 1122
Length of query: 446
Length of database: 10,937,602
Length adjustment: 100
Effective length of query: 346
Effective length of database: 6,502,202
Effective search space: 2249761892
Effective search space used: 2249761892
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 61 (27.4 bits)