RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy9536
         (287 letters)



>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 37.8 bits (88), Expect = 0.005
 Identities = 17/49 (34%), Positives = 30/49 (61%)

Query: 236 EREVELDLEWIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
           E+EV  + E   +EE +E+ E     E+ + +E+E ++KKKK KK ++ 
Sbjct: 32  EKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKET 80



 Score = 34.7 bits (80), Expect = 0.049
 Identities = 22/59 (37%), Positives = 36/59 (61%), Gaps = 9/59 (15%)

Query: 233 VWGEREVELDLEWIRREEDEEKTETRKKDEDVLGK------EKENKKKKKKKKKKEKKN 285
           +W E+EVE +   +  EE+EE+ E +K++E+          E+E K++KKKK KK K+ 
Sbjct: 25  LWVEKEVEKE---VPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKET 80


>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
          Length = 434

 Score = 36.5 bits (85), Expect = 0.012
 Identities = 13/31 (41%), Positives = 21/31 (67%)

Query: 256 ETRKKDEDVLGKEKENKKKKKKKKKKEKKNR 286
           +T K  + VL K  E K+K+K+K K +K++R
Sbjct: 387 KTGKPSKKVLAKRAEKKEKEKEKPKVKKRHR 417



 Score = 32.6 bits (75), Expect = 0.21
 Identities = 11/32 (34%), Positives = 19/32 (59%), Gaps = 4/32 (12%)

Query: 254 KTETRKKDEDVLGKEKENKKKKKKKKKKEKKN 285
           K   ++ ++    KEKE +K K KK+ ++ KN
Sbjct: 394 KVLAKRAEK----KEKEKEKPKVKKRHRDTKN 421



 Score = 31.5 bits (72), Expect = 0.48
 Identities = 12/40 (30%), Positives = 20/40 (50%)

Query: 248 REEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
           +   E+KT    K       EK+ K+K+K K KK  ++ +
Sbjct: 381 KAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTK 420



 Score = 28.4 bits (64), Expect = 4.8
 Identities = 17/53 (32%), Positives = 30/53 (56%), Gaps = 4/53 (7%)

Query: 236 EREVELDLEWIRREEDEEKTETRKKDEDVLGK--EKENKKKKKKKKKKEKKNR 286
           ER +E  L+   R  DE + +T+   E   GK  +K   K+ +KK+K+++K +
Sbjct: 361 ERYIEEPLK--ARVIDELRPKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPK 411


>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 142

 Score = 34.3 bits (79), Expect = 0.022
 Identities = 22/55 (40%), Positives = 33/55 (60%), Gaps = 9/55 (16%)

Query: 236 EREVELDLEWIRREEDEE----KTETRKKDEDVLGKEKENKKKKKKKKKKEKKNR 286
           ER   +D +W +  EDEE    + E ++KDE     EK  KK+ K++KKK+KK +
Sbjct: 50  ERLELMDEKWKKETEDEEFQQKREEKKRKDE-----EKTAKKRAKRQKKKQKKKK 99



 Score = 33.5 bits (77), Expect = 0.042
 Identities = 17/38 (44%), Positives = 24/38 (63%), Gaps = 3/38 (7%)

Query: 247 RREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
           ++ +DEEKT  ++       K+K+ KKKKKK KK  KK
Sbjct: 75  KKRKDEEKTAKKRAKRQ---KKKQKKKKKKKAKKGNKK 109


>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6.  The surfeit locus
           protein SURF-6 is shown to be a component of the
           nucleolar matrix and has a strong binding capacity for
           nucleic acids.
          Length = 206

 Score = 33.8 bits (78), Expect = 0.066
 Identities = 22/43 (51%), Positives = 28/43 (65%), Gaps = 2/43 (4%)

Query: 244 EWIRREEDEEKT--ETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
           EW  R+E  EK   E +KK E+ L K K++KK KKKKK K+K 
Sbjct: 161 EWKERKEKVEKKKAERQKKREENLKKRKDDKKNKKKKKAKKKG 203


>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
           RPA34.5.  This is a family of proteins conserved from
           yeasts to human. Subunit A34.5 of RNA polymerase I is a
           non-essential subunit which is thought to help Pol I
           overcome topological constraints imposed on ribosomal
           DNA during the process of transcription.
          Length = 193

 Score = 33.2 bits (76), Expect = 0.082
 Identities = 14/41 (34%), Positives = 25/41 (60%)

Query: 247 RREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
           +  E EE+ +  KK +  + KEK+ KK KK+K  + K +++
Sbjct: 145 KEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKK 185



 Score = 32.8 bits (75), Expect = 0.13
 Identities = 16/38 (42%), Positives = 25/38 (65%), Gaps = 2/38 (5%)

Query: 247 RREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
           +++++ +K +  KKD+    K  E K  KKKKKKK+KK
Sbjct: 158 KKKKEVKKEKKEKKDKKE--KMVEPKGSKKKKKKKKKK 193



 Score = 32.4 bits (74), Expect = 0.19
 Identities = 18/42 (42%), Positives = 29/42 (69%), Gaps = 2/42 (4%)

Query: 248 REEDEEKTETRKKDEDVLGKEKENKKKKK--KKKKKEKKNRR 287
              ++E T   +K+ +V  +EK+ KKKKK  KK+KKEKK+++
Sbjct: 133 ETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKK 174



 Score = 30.1 bits (68), Expect = 1.1
 Identities = 14/38 (36%), Positives = 20/38 (52%), Gaps = 1/38 (2%)

Query: 248 REEDEEKTETRKKDEDVL-GKEKENKKKKKKKKKKEKK 284
               E +T  ++    V    E E ++KK+KKKKKE K
Sbjct: 127 ELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVK 164



 Score = 28.1 bits (63), Expect = 4.0
 Identities = 21/45 (46%), Positives = 29/45 (64%), Gaps = 8/45 (17%)

Query: 250 EDEEKTETRKKDEDVLGKEKENKKKKKK-------KKKKEKKNRR 287
           E+EEK E +KK E V  ++KE K KK+K       KKKK+KK ++
Sbjct: 150 EEEEKKEKKKKKE-VKKEKKEKKDKKEKMVEPKGSKKKKKKKKKK 193



 Score = 27.4 bits (61), Expect = 8.0
 Identities = 17/40 (42%), Positives = 24/40 (60%), Gaps = 2/40 (5%)

Query: 248 REEDEEKTETRKKDEDVLGKEKENKKK-KKKKK-KKEKKN 285
            +E   K E   + E+   KEK+ KK+ KK+KK KK+KK 
Sbjct: 136 EKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKE 175


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 33.1 bits (76), Expect = 0.14
 Identities = 10/38 (26%), Positives = 20/38 (52%)

Query: 249 EEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNR 286
            +DE   E  + + + + K    K+K K ++ KEK+ +
Sbjct: 253 SDDESAWEGFESEYEPINKPVRPKRKTKAQRNKEKRRK 290



 Score = 28.9 bits (65), Expect = 2.9
 Identities = 19/49 (38%), Positives = 22/49 (44%), Gaps = 5/49 (10%)

Query: 238 EVELDLEWIRREEDEEKTETRKKDE-----DVLGKEKENKKKKKKKKKK 281
           +VE  LE  R E         K DE     D  G E   KKK++KK KK
Sbjct: 16  DVEQFLEEQRDERITGGVLAEKPDEDLFVVDTTGDEATIKKKRRKKLKK 64



 Score = 27.4 bits (61), Expect = 9.6
 Identities = 12/50 (24%), Positives = 24/50 (48%), Gaps = 3/50 (6%)

Query: 235 GEREVELDLEWIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
           GE E + +  W   E + E      + +    K K  + K+K++K+ E++
Sbjct: 249 GEEESDDESAWEGFESEYEPINKPVRPKR---KTKAQRNKEKRRKELERE 295


>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
           complex aNOP56 subunit; Provisional.
          Length = 414

 Score = 33.0 bits (76), Expect = 0.16
 Identities = 14/30 (46%), Positives = 22/30 (73%)

Query: 258 RKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
           +K++E    K K+ KK+KKK KK++KK R+
Sbjct: 385 KKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 31.5 bits (72), Expect = 0.56
 Identities = 14/31 (45%), Positives = 21/31 (67%)

Query: 257 TRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
            +KK E+   ++++ KKK+KKK KK KK  R
Sbjct: 383 PKKKREEKKPQKRKKKKKRKKKGKKRKKKGR 413



 Score = 28.0 bits (63), Expect = 6.5
 Identities = 15/37 (40%), Positives = 23/37 (62%), Gaps = 6/37 (16%)

Query: 247 RREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEK 283
           +++ +E+K + RKK      K+K  KK KK+KKK  K
Sbjct: 384 KKKREEKKPQKRKK------KKKRKKKGKKRKKKGRK 414



 Score = 27.6 bits (62), Expect = 7.6
 Identities = 12/31 (38%), Positives = 19/31 (61%), Gaps = 2/31 (6%)

Query: 254 KTETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
           K    KK +    K+K+ +KKK KK+KK+ +
Sbjct: 385 KKREEKKPQK--RKKKKKRKKKGKKRKKKGR 413


>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR).  This
           family consists of several bovine specific leukaemia
           virus receptors which are thought to function as
           transmembrane proteins, although their exact function is
           unknown.
          Length = 561

 Score = 33.1 bits (75), Expect = 0.18
 Identities = 21/56 (37%), Positives = 31/56 (55%), Gaps = 10/56 (17%)

Query: 241 LDLEWIRREEDEEK--------TETRK--KDEDVLGKEKENKKKKKKKKKKEKKNR 286
           LD++  +   D EK         ET K  +  DV   EK++KK KKK+KK+++K R
Sbjct: 159 LDIDLDKPLADSEKLPVQKHRNAETSKSPEKGDVPAVEKKSKKPKKKEKKEKEKER 214



 Score = 31.2 bits (70), Expect = 0.75
 Identities = 21/78 (26%), Positives = 35/78 (44%)

Query: 210 KTAIELSKLLQFRNGETSAPTHIVWGEREVELDLEWIRREEDEEKTETRKKDEDVLGKEK 269
           K  +E  K L     ++ A    V    E  L         D E  E +  + +   K  
Sbjct: 219 KKEVEGFKSLLLALDDSPASAASVAEADEASLANTVSGTAPDSEPDEPKDAEAEETKKSP 278

Query: 270 ENKKKKKKKKKKEKKNRR 287
           ++KKKK++K+K+EKK ++
Sbjct: 279 KHKKKKQRKEKEEKKKKK 296



 Score = 28.9 bits (64), Expect = 4.1
 Identities = 12/30 (40%), Positives = 22/30 (73%), Gaps = 1/30 (3%)

Query: 259 KKDEDVLGKEK-ENKKKKKKKKKKEKKNRR 287
           K +E+   +++ E  K++KKK++KEK+ RR
Sbjct: 83  KLEEERRHRQRLEKDKREKKKREKEKRGRR 112


>gnl|CDD|218435 pfam05104, Rib_recp_KP_reg, Ribosome receptor lysine/proline rich
           region.  This highly conserved region is found towards
           the C-terminus of the transmembrane domain. The function
           is unclear.
          Length = 151

 Score = 31.9 bits (72), Expect = 0.21
 Identities = 16/39 (41%), Positives = 27/39 (69%), Gaps = 2/39 (5%)

Query: 247 RREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKN 285
           ++ ++  KT+++K D+    KEK ++KK K KKK+EK N
Sbjct: 12  KQRKESGKTQSQKSDKK--KKEKVSEKKGKSKKKEEKPN 48


>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.  This
           domain is found to the N-terminus of bacterial signal
           peptidases of the S49 family (pfam01343).
          Length = 154

 Score = 31.7 bits (73), Expect = 0.22
 Identities = 10/19 (52%), Positives = 11/19 (57%)

Query: 268 EKENKKKKKKKKKKEKKNR 286
           EK  KK +K K K EKK  
Sbjct: 72  EKAEKKAEKAKAKAEKKKA 90



 Score = 28.3 bits (64), Expect = 3.3
 Identities = 12/35 (34%), Positives = 18/35 (51%)

Query: 252 EEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNR 286
           EE  + ++  E  L  +KE K  +K +KK EK   
Sbjct: 49  EEYKDLKESLEAALLDKKELKAWEKAEKKAEKAKA 83



 Score = 27.1 bits (61), Expect = 7.9
 Identities = 10/21 (47%), Positives = 12/21 (57%)

Query: 267 KEKENKKKKKKKKKKEKKNRR 287
            EK   K +KKK KKE+   R
Sbjct: 78  AEKAKAKAEKKKAKKEEPKPR 98


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 31.6 bits (72), Expect = 0.26
 Identities = 13/49 (26%), Positives = 27/49 (55%), Gaps = 2/49 (4%)

Query: 238 EVELDLEWIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNR 286
           E E   +W  +++  +K + + KD+     +K++K +KK +K+ E K  
Sbjct: 77  EYEEKQKWKWKKKKSKKKKDKDKDKK--DDKKDDKSEKKDEKEAEDKLE 123



 Score = 29.7 bits (67), Expect = 1.1
 Identities = 18/41 (43%), Positives = 24/41 (58%), Gaps = 6/41 (14%)

Query: 251 DEEKTETRKKDEDV------LGKEKENKKKKKKKKKKEKKN 285
           D E TE +KK +++      + KE E K+K K KKKK KK 
Sbjct: 54  DAEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKK 94



 Score = 27.7 bits (62), Expect = 5.9
 Identities = 16/38 (42%), Positives = 23/38 (60%)

Query: 247 RREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
           +++E  E+ E  KK+ +   K K  KKK KKKK K+K 
Sbjct: 63  KKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKD 100



 Score = 27.0 bits (60), Expect = 8.4
 Identities = 15/48 (31%), Positives = 24/48 (50%)

Query: 238 EVELDLEWIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKN 285
           E +   + +  E ++ K E  +K +    K+K  KKK K K KK+ K 
Sbjct: 59  EAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKK 106


>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19. 
           Med19 represents a family of conserved proteins which
           are members of the multi-protein co-activator Mediator
           complex. Mediator is required for activation of RNA
           polymerase II transcription by DNA binding
           transactivators.
          Length = 178

 Score = 31.7 bits (72), Expect = 0.27
 Identities = 18/38 (47%), Positives = 25/38 (65%), Gaps = 4/38 (10%)

Query: 249 EEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNR 286
            +  EK   +KK ED    +KE KKKKK+KKKK+K++ 
Sbjct: 137 LKGHEKKHKKKKHED----DKERKKKKKEKKKKKKRHS 170



 Score = 28.6 bits (64), Expect = 2.7
 Identities = 20/46 (43%), Positives = 26/46 (56%), Gaps = 6/46 (13%)

Query: 248 REEDEEKTETRKKDEDVLGKEKENKKKKK------KKKKKEKKNRR 287
           R +D    ET    E + G EK++KKKK       KKKKKEKK ++
Sbjct: 121 RTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKK 166


>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
          Length = 1066

 Score = 32.6 bits (74), Expect = 0.31
 Identities = 15/38 (39%), Positives = 23/38 (60%), Gaps = 1/38 (2%)

Query: 247 RREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
           ++   EE+ E RKK ++   KEKE KK K  +K+ + K
Sbjct: 10  KKILTEEELE-RKKKKEEKAKEKELKKLKAAQKEAKAK 46


>gnl|CDD|218148 pfam04557, tRNA_synt_1c_R2, Glutaminyl-tRNA synthetase,
           non-specific RNA binding region part 2.  This is a
           region found N terminal to the catalytic domain of
           glutaminyl-tRNA synthetase (EC 6.1.1.18) in eukaryotes
           but not in Escherichia coli. This region is thought to
           bind RNA in a non-specific manner, enhancing
           interactions between the tRNA and enzyme, but is not
           essential for enzyme function.
          Length = 83

 Score = 30.0 bits (68), Expect = 0.32
 Identities = 9/27 (33%), Positives = 15/27 (55%)

Query: 259 KKDEDVLGKEKENKKKKKKKKKKEKKN 285
           K + D++ K+K+ KKKK +      K 
Sbjct: 17  KTEADLVKKKKKKKKKKAEDTAATAKA 43


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
           subunit.  This is a family of proteins which are
           subunits of the eukaryotic translation initiation factor
           3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
           cerevisiae protein eIF3j (HCR1) has been shown to be
           required for processing of 20S pre-rRNA and binds to 18S
           rRNA and eIF3 subunits Rpg1p and Prt1p.
          Length = 242

 Score = 31.6 bits (72), Expect = 0.38
 Identities = 12/36 (33%), Positives = 20/36 (55%)

Query: 249 EEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
           EE++E+ E  K       K K+  K K ++K+K K+
Sbjct: 39  EEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKR 74



 Score = 31.2 bits (71), Expect = 0.48
 Identities = 13/37 (35%), Positives = 19/37 (51%)

Query: 250 EDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNR 286
           ++EE  E  ++   V  K K  K  K K ++KEK  R
Sbjct: 38  DEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKR 74



 Score = 29.2 bits (66), Expect = 2.4
 Identities = 12/40 (30%), Positives = 22/40 (55%)

Query: 245 WIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
           W   +ED++  ++  ++ED   +E++ K   K K KK  K
Sbjct: 24  WDDEDEDDDVKDSWDEEEDEEKEEEKAKVAAKAKAKKALK 63



 Score = 28.9 bits (65), Expect = 2.5
 Identities = 17/44 (38%), Positives = 23/44 (52%), Gaps = 5/44 (11%)

Query: 249 EEDEEKTETR-----KKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
           EEDEEK E +     K       K K  +K+K K++K+EK  R 
Sbjct: 40  EEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRE 83



 Score = 28.5 bits (64), Expect = 4.0
 Identities = 11/52 (21%), Positives = 25/52 (48%)

Query: 236 EREVELDLEWIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
           + + ++   W   E++E++ E  K       K+    K ++K+K K +K  +
Sbjct: 28  DEDDDVKDSWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEK 79



 Score = 28.5 bits (64), Expect = 4.0
 Identities = 9/18 (50%), Positives = 10/18 (55%)

Query: 267 KEKENKKKKKKKKKKEKK 284
            EK  K  KKKK K + K
Sbjct: 197 AEKAAKGGKKKKGKAKAK 214


>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein.  The YqfQ-like protein family
           includes the B. subtilis YqfQ protein, also known as
           VrrA, which is functionally uncharacterized. This family
           of proteins is found in bacteria. Proteins in this
           family are typically between 146 and 237 amino acids in
           length. There are two conserved sequence motifs: QYGP
           and PKLY.
          Length = 155

 Score = 30.9 bits (70), Expect = 0.40
 Identities = 11/41 (26%), Positives = 19/41 (46%)

Query: 246 IRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNR 286
           I RE      E  + +E+   + ++    + K + KEKK R
Sbjct: 87  IFRELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKR 127



 Score = 30.5 bits (69), Expect = 0.51
 Identities = 12/38 (31%), Positives = 20/38 (52%)

Query: 250 EDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
           E+E   ET ++D      E + KKK++  K K +K + 
Sbjct: 102 EEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKP 139



 Score = 27.0 bits (60), Expect = 6.8
 Identities = 9/39 (23%), Positives = 20/39 (51%)

Query: 249 EEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
           EE  ++TE     E     +++ K++  K K +++K + 
Sbjct: 103 EESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKT 141


>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355).  This
           family of proteins is found in bacteria and viruses.
           Proteins in this family are typically between 180 and
           214 amino acids in length.
          Length = 125

 Score = 30.3 bits (69), Expect = 0.41
 Identities = 15/36 (41%), Positives = 24/36 (66%)

Query: 249 EEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
           E +EEKT T K+ +  + KEK   +KK+++KK E +
Sbjct: 1   EPEEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAE 36


>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase; Provisional.
          Length = 330

 Score = 31.3 bits (72), Expect = 0.48
 Identities = 7/21 (33%), Positives = 11/21 (52%)

Query: 267 KEKENKKKKKKKKKKEKKNRR 287
             K  KKK+K++ K  K   +
Sbjct: 69  WHKAQKKKEKQEAKAAKAKSK 89



 Score = 29.0 bits (66), Expect = 2.5
 Identities = 12/36 (33%), Positives = 21/36 (58%)

Query: 252 EEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
           E+  E +++ +  L  +KE K   K +KKKEK+  +
Sbjct: 47  EQYKEMKEELKAALLDKKELKAWHKAQKKKEKQEAK 82


>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 156 to 174 amino acids in length. This domain is
           found associated with pfam07780, pfam01728.
          Length = 154

 Score = 30.7 bits (70), Expect = 0.49
 Identities = 10/39 (25%), Positives = 23/39 (58%)

Query: 249 EEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
           EE E +    ++  D L +++  K K++K+++ E+K + 
Sbjct: 107 EEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKE 145


>gnl|CDD|219547 pfam07741, BRF1, Brf1-like TBP-binding domain.  This region covers
           both the Brf homology II and III regions. This region is
           involved in binding TATA binding protein.
          Length = 95

 Score = 29.5 bits (67), Expect = 0.52
 Identities = 21/55 (38%), Positives = 28/55 (50%), Gaps = 13/55 (23%)

Query: 232 IVWGEREVELDLEWIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNR 286
            +W     EL+ +++  EE EEK   +K DE        N   KKKKK+K KK R
Sbjct: 19  RIW----DELNKDYL--EEQEEKELKQKADE-------GNNSGKKKKKRKAKKKR 60


>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3.  This family
           of proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). Shr3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of Shr3, AAPs are retained in the ER.
          Length = 196

 Score = 30.8 bits (70), Expect = 0.57
 Identities = 13/43 (30%), Positives = 16/43 (37%), Gaps = 1/43 (2%)

Query: 233 VWGEREVELDLEWIRREEDEEKTETRKKDEDVLGKEKENKKKK 275
            + ER+     E    EE +E      K      K  E KKKK
Sbjct: 155 WYAERKDAKQKEEFAAEERKEALAAAAKKSATPQK-VETKKKK 196


>gnl|CDD|218337 pfam04939, RRS1, Ribosome biogenesis regulatory protein (RRS1).
           This family consists of several eukaryotic ribosome
           biogenesis regulatory (RRS1) proteins. RRS1 is a nuclear
           protein that is essential for the maturation of 25 S
           rRNA and the 60 S ribosomal subunit assembly in
           Saccharomyces cerevisiae.
          Length = 164

 Score = 30.3 bits (69), Expect = 0.67
 Identities = 18/57 (31%), Positives = 27/57 (47%), Gaps = 8/57 (14%)

Query: 234 WGEREVELDLE--WIRREEDEEKTETRKKDEDVLGKEKENKKKKKKK-KKKEKKNRR 287
           WG +    DLE  WI      E        ED   K++  KK++  K +K+E KN++
Sbjct: 113 WGYKRANKDLEKDWII-----EVKPNDDPKEDPFAKKRREKKERVAKNEKRELKNKK 164


>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62. 
          Length = 217

 Score = 30.5 bits (69), Expect = 0.67
 Identities = 12/39 (30%), Positives = 22/39 (56%), Gaps = 2/39 (5%)

Query: 248 REEDEEKTETRKKDEDVLGKEKE--NKKKKKKKKKKEKK 284
           +++  EK +   K +     +K    +KK++KKK K+KK
Sbjct: 39  QDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKKKK 77


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 30.9 bits (70), Expect = 0.79
 Identities = 13/48 (27%), Positives = 24/48 (50%), Gaps = 3/48 (6%)

Query: 243 LEWIRREEDEEKTETRKKDEDVLG---KEKENKKKKKKKKKKEKKNRR 287
           L  ++ E D  +    K  E+ L     EK+ K++  + K+K +K+ R
Sbjct: 385 LGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDER 432



 Score = 30.5 bits (69), Expect = 1.00
 Identities = 11/36 (30%), Positives = 23/36 (63%), Gaps = 4/36 (11%)

Query: 249 EEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
           +E++ K E  +K +    KE+ ++ K+K++K + KK
Sbjct: 403 KEEKLKQEENEKKQ----KEQADEDKEKRQKDERKK 434



 Score = 30.5 bits (69), Expect = 1.1
 Identities = 15/58 (25%), Positives = 28/58 (48%), Gaps = 8/58 (13%)

Query: 237 REVELDLEWIRREEDEEKTETRK--------KDEDVLGKEKENKKKKKKKKKKEKKNR 286
           +E    L+   ++  E K ET          K+E +  +E E K+K++  + KEK+ +
Sbjct: 372 KEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQK 429



 Score = 27.5 bits (61), Expect = 9.5
 Identities = 8/33 (24%), Positives = 20/33 (60%)

Query: 249 EEDEEKTETRKKDEDVLGKEKENKKKKKKKKKK 281
           +  EEK +  + ++    +  E+K+K++K ++K
Sbjct: 401 KAKEEKLKQEENEKKQKEQADEDKEKRQKDERK 433


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 31.3 bits (70), Expect = 0.79
 Identities = 13/42 (30%), Positives = 23/42 (54%)

Query: 246  IRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
            I+  E+ +K E  KK  +   K +E++KK  +  KKE +  +
Sbjct: 1661 IKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAK 1702



 Score = 30.5 bits (68), Expect = 1.2
 Identities = 16/61 (26%), Positives = 34/61 (55%), Gaps = 9/61 (14%)

Query: 236  EREVELDLEWIRREEDEEKT---------ETRKKDEDVLGKEKENKKKKKKKKKKEKKNR 286
              E ++  E +++ E+E+K          E +KK E++   E+ENK K  ++ KK ++++
Sbjct: 1615 AEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDK 1674

Query: 287  R 287
            +
Sbjct: 1675 K 1675



 Score = 30.1 bits (67), Expect = 1.6
 Identities = 15/41 (36%), Positives = 24/41 (58%)

Query: 247  RREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
            + E  E+K E  KK  D   K+ E KKK  + KKK +++++
Sbjct: 1365 KAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKK 1405



 Score = 29.7 bits (66), Expect = 2.4
 Identities = 18/81 (22%), Positives = 34/81 (41%), Gaps = 1/81 (1%)

Query: 208  QSKTAIELSKLLQFRNGETSAPTHIVWGEREVELDLEWIRR-EEDEEKTETRKKDEDVLG 266
            +++ A EL K  + +  + +          E +   E  ++ +E ++K E  KK  D   
Sbjct: 1276 EARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAK 1335

Query: 267  KEKENKKKKKKKKKKEKKNRR 287
            K+ E  KK  +  K E +   
Sbjct: 1336 KKAEEAKKAAEAAKAEAEAAA 1356



 Score = 29.3 bits (65), Expect = 2.9
 Identities = 20/77 (25%), Positives = 36/77 (46%), Gaps = 1/77 (1%)

Query: 208  QSKTAIELSKLLQFRNGETSAPTHIVWGEREVELDLEWIRREEDEEKTETRKKDEDVLGK 267
             ++ A E+ K  + R  E +        + E E   E  R+ ED +K E  KK E+    
Sbjct: 1180 AARKAEEVRKAEELRKAEDARKAEAA-RKAEEERKAEEARKAEDAKKAEAVKKAEEAKKD 1238

Query: 268  EKENKKKKKKKKKKEKK 284
             +E KK ++++  +E +
Sbjct: 1239 AEEAKKAEEERNNEEIR 1255



 Score = 29.3 bits (65), Expect = 3.2
 Identities = 23/84 (27%), Positives = 39/84 (46%), Gaps = 2/84 (2%)

Query: 204  VALIQSKTAIELSKLLQFRNGETSAPTHIVWGEREVELDLEWIRREEDEEKTETRKKDED 263
            V  ++ K A E  K  + +  E           ++ E D +  + EE ++  E  KK  +
Sbjct: 1635 VEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKK--KAEEAKKAEEDEKKAAE 1692

Query: 264  VLGKEKENKKKKKKKKKKEKKNRR 287
             L KE E  KK ++ KKKE + ++
Sbjct: 1693 ALKKEAEEAKKAEELKKKEAEEKK 1716



 Score = 28.6 bits (63), Expect = 5.7
 Identities = 17/52 (32%), Positives = 32/52 (61%), Gaps = 4/52 (7%)

Query: 236  EREVELDLEWIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
            E E     E ++++E EEK    KK E++   E+ENK K ++ KK+ +++++
Sbjct: 1697 EAEEAKKAEELKKKEAEEK----KKAEELKKAEEENKIKAEEAKKEAEEDKK 1744



 Score = 28.2 bits (62), Expect = 6.5
 Identities = 6/38 (15%), Positives = 17/38 (44%)

Query: 247  RREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
            + +  ++K E  KK  +    E E    + +  +++ +
Sbjct: 1330 KADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAE 1367



 Score = 28.2 bits (62), Expect = 6.7
 Identities = 14/40 (35%), Positives = 23/40 (57%)

Query: 247  RREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNR 286
            + +E ++K E  KK E+   K +E KK  + KKK E+  +
Sbjct: 1445 KADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKK 1484



 Score = 27.8 bits (61), Expect = 7.9
 Identities = 14/41 (34%), Positives = 23/41 (56%)

Query: 247  RREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
            + +E ++K E  KK  D L K    KKK  + KKK ++ ++
Sbjct: 1392 KADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKK 1432



 Score = 27.8 bits (61), Expect = 8.5
 Identities = 14/41 (34%), Positives = 24/41 (58%)

Query: 247  RREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
            + EED++K +  KK      K  E KKK ++KKK ++  ++
Sbjct: 1399 KAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKK 1439



 Score = 27.8 bits (61), Expect = 8.9
 Identities = 14/40 (35%), Positives = 21/40 (52%)

Query: 247  RREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNR 286
            + +E ++K E  KK  D   K  E KKK  + KK E+  +
Sbjct: 1484 KADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKK 1523



 Score = 27.8 bits (61), Expect = 9.4
 Identities = 12/41 (29%), Positives = 23/41 (56%)

Query: 247  RREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
            + +E ++K E  KK ++   K +E KKK  + KK  +  ++
Sbjct: 1471 KADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKK 1511


>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family.  This
           family includes gbp a protein from Soybean that binds to
           GAGA element dinucleotide repeat DNA. It seems likely
           that the this domain mediates DNA binding. This putative
           domain contains several conserved cysteines and a
           histidine suggesting this may be a zinc-binding DNA
           interaction domain.
          Length = 301

 Score = 30.6 bits (69), Expect = 0.90
 Identities = 21/95 (22%), Positives = 29/95 (30%), Gaps = 6/95 (6%)

Query: 193 IYHPTTSQYGSVALIQSKTAIELSKLLQFRNGETSAPTHIVWGEREVELDLEWIRREEDE 252
                +   G   L  +K    L  L   + G+    T      RE+             
Sbjct: 89  QPVENSLPPGCGILPGTKGIHHLQMLHHPQLGDNPYGT------REMHHLEVPPISTAPP 142

Query: 253 EKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
           E  E +K  +    K  +  K KK KKK    NR 
Sbjct: 143 EAKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRS 177


>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2.  It is vital for
           effective cell-replication that replication is not
           stalled at any point by, for instance, damaged bases.
           Rtf2 stabilizes the replication fork stalled at the
           site-specific replication barrier RTS1 by preventing
           replication restart until completion of DNA synthesis by
           a converging replication fork initiated at a flanking
           origin. The RTS1 element terminates replication forks
           that are moving in the cen2-distal direction while
           allowing forks moving in the cen2-proximal direction to
           pass through the region. Rtf2 contains a C2HC2 motif
           related to the C3HC4 RING-finger motif, and would appear
           to fold up, creating a RING finger-like structure but
           forming only one functional Zn2+ ion-binding site.
          Length = 254

 Score = 30.4 bits (69), Expect = 0.95
 Identities = 12/36 (33%), Positives = 19/36 (52%)

Query: 251 DEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNR 286
           +EE    + + E+   K+K+ KKKKK KK     + 
Sbjct: 166 EEEVELLKARLEEERAKKKKKKKKKKTKKNNATGSS 201


>gnl|CDD|221654 pfam12589, WBS_methylT, Methyltransferase involved in
           Williams-Beuren syndrome.  This domain family is found
           in eukaryotes, and is typically between 72 and 83 amino
           acids in length. The family is found in association with
           pfam08241. This family is made up of
           S-adenosylmethionine-dependent methyltransferases. The
           proteins are deleted in Williams-Beuren syndrome (WBS),
           a complex developmental disorder with multisystemic
           manifestations including supravalvular aortic stenosis
           (SVAS) and a specific cognitive phenotype.
          Length = 85

 Score = 28.4 bits (64), Expect = 0.97
 Identities = 15/38 (39%), Positives = 24/38 (63%), Gaps = 4/38 (10%)

Query: 249 EEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNR 286
           EED+E+ +  K         + N++KKKKKKK +KK++
Sbjct: 21  EEDDEQIDASKVRRI----SQRNRRKKKKKKKLKKKSK 54



 Score = 26.5 bits (59), Expect = 5.5
 Identities = 14/36 (38%), Positives = 19/36 (52%)

Query: 249 EEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
           EE EE  E     +     ++  +KKKKKKK K+K 
Sbjct: 18  EEGEEDDEQIDASKVRRISQRNRRKKKKKKKLKKKS 53


>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
          Length = 158

 Score = 29.7 bits (67), Expect = 1.1
 Identities = 11/34 (32%), Positives = 21/34 (61%)

Query: 250 EDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEK 283
            +E + E ++  ++   KE    KK+KK+KK++K
Sbjct: 92  PEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKK 125



 Score = 29.0 bits (65), Expect = 1.6
 Identities = 12/32 (37%), Positives = 22/32 (68%)

Query: 256 ETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
           E+ K+ ++V  + +E +  K KK+KKEKK ++
Sbjct: 94  ESEKEQKEVSKETEEKEAIKAKKEKKEKKEKK 125



 Score = 27.8 bits (62), Expect = 4.1
 Identities = 12/36 (33%), Positives = 23/36 (63%)

Query: 249 EEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
           E ++E+ E  K+ E+    + + +KK+KK+KK  +K
Sbjct: 94  ESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEK 129


>gnl|CDD|219901 pfam08555, DUF1754, Eukaryotic family of unknown function
           (DUF1754).  This is a eukaryotic protein family of
           unknown function.
          Length = 90

 Score = 28.5 bits (64), Expect = 1.1
 Identities = 13/19 (68%), Positives = 15/19 (78%)

Query: 266 GKEKENKKKKKKKKKKEKK 284
           GK+ + KKKKKKKKKK K 
Sbjct: 14  GKKIDVKKKKKKKKKKNKS 32



 Score = 26.6 bits (59), Expect = 4.8
 Identities = 11/19 (57%), Positives = 13/19 (68%)

Query: 267 KEKENKKKKKKKKKKEKKN 285
           K K+   KKKKKKKK+K  
Sbjct: 13  KGKKIDVKKKKKKKKKKNK 31



 Score = 26.6 bits (59), Expect = 5.2
 Identities = 11/20 (55%), Positives = 12/20 (60%)

Query: 266 GKEKENKKKKKKKKKKEKKN 285
            K    KKKKKKKKK + K 
Sbjct: 15  KKIDVKKKKKKKKKKNKSKE 34



 Score = 25.8 bits (57), Expect = 9.9
 Identities = 12/20 (60%), Positives = 15/20 (75%)

Query: 265 LGKEKENKKKKKKKKKKEKK 284
           L  +K + KKKKKKKKK+ K
Sbjct: 12  LKGKKIDVKKKKKKKKKKNK 31


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 30.6 bits (69), Expect = 1.1
 Identities = 13/38 (34%), Positives = 23/38 (60%)

Query: 247 RREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
            +++ +EK +   KD     + KE +  K+K+K+KEKK
Sbjct: 116 EKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKK 153



 Score = 29.5 bits (66), Expect = 2.4
 Identities = 10/39 (25%), Positives = 20/39 (51%)

Query: 248 REEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNR 286
           +E+++EK     +D +   K +  + K + KK  +KK  
Sbjct: 146 KEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPP 184



 Score = 28.7 bits (64), Expect = 4.2
 Identities = 13/41 (31%), Positives = 25/41 (60%)

Query: 247 RREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
           ++E+ +E+ + RK  E+   K    +K+K+K+KK E+   R
Sbjct: 120 KKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDR 160



 Score = 28.3 bits (63), Expect = 5.4
 Identities = 14/41 (34%), Positives = 24/41 (58%)

Query: 247 RREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
              ++EEK + + K+E    KEK  ++ K +K K+E K +R
Sbjct: 101 ESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKR 141



 Score = 28.3 bits (63), Expect = 5.8
 Identities = 8/40 (20%), Positives = 23/40 (57%)

Query: 248 REEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
             ++ +    ++++++    ++E KKKK+K K++ K  + 
Sbjct: 94  PAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKP 133



 Score = 27.9 bits (62), Expect = 6.2
 Identities = 9/41 (21%), Positives = 26/41 (63%)

Query: 247 RREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
           +++E  ++    +K ++   +++  K+K+K+K+KK ++ R 
Sbjct: 119 KKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRD 159


>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
           (TAF4) is one of several TAFs that bind TBP and is
           involved in forming Transcription Factor IID (TFIID)
           complex.  The TATA Binding Protein (TBP) Associated
           Factor 4 (TAF4) is one of several TAFs that bind TBP and
           are involved in forming the Transcription Factor IID
           (TFIID) complex. TFIID is one of seven General
           Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
           TFIIF, and TFIID) that are involved in accurate
           initiation of transcription by RNA polymerase II in
           eukaryote. TFIID plays an important role in the
           recognition of promoter DNA and assembly of the
           pre-initiation complex. TFIID complex is composed of the
           TBP and at least 13 TAFs. TAFs from various species were
           originally named by their predicted molecular weight or
           their electrophoretic mobility in polyacrylamide gels. A
           new, unified nomenclature for the pol II TAFs has been
           suggested to show the relationship between TAF orthologs
           and paralogs. Several hypotheses are proposed for TAFs
           functions such as serving as activator-binding sites,
           core-promoter recognition or a role in essential
           catalytic activity. Each TAF, with the help of a
           specific activator, is required only for the expression
           of subset of genes and is not universally involved for
           transcription as are GTFs. In yeast and human cells,
           TAFs have been found as components of other complexes
           besides TFIID.   Several TAFs interact via histone-fold
           (HFD) motifs; HFD is the interaction motif involved in
           heterodimerization of the core histones and their
           assembly into nucleosome octamers. The minimal HFD
           contains three alpha-helices linked by two loops and is
           found in core histones, TAFS and many other
           transcription factors. TFIID has a histone octamer-like
           substructure. TAF4 domain interacts with TAF12 and makes
           a novel histone-like heterodimer that binds DNA and has
           a core promoter function of a subset of genes.
          Length = 212

 Score = 30.0 bits (68), Expect = 1.1
 Identities = 10/44 (22%), Positives = 20/44 (45%)

Query: 244 EWIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
           E  +R+E+E +   R           + K K+ +K++ E+   R
Sbjct: 126 EEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEEMRHR 169


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 30.3 bits (69), Expect = 1.1
 Identities = 10/48 (20%), Positives = 26/48 (54%), Gaps = 1/48 (2%)

Query: 240 ELDLEWIR-REEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNR 286
           E ++E++   ++  +K +   +  +   +E++ +KKKK    K+K+  
Sbjct: 396 EEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEE 443



 Score = 30.3 bits (69), Expect = 1.2
 Identities = 12/38 (31%), Positives = 27/38 (71%)

Query: 247 RREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
            ++ +EEK E +KK      KE+E +++K+KK++++++
Sbjct: 420 EKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEE 457



 Score = 29.9 bits (68), Expect = 1.7
 Identities = 12/36 (33%), Positives = 25/36 (69%)

Query: 248 REEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEK 283
            EE+EEK +  ++ E+   + +E K+++++KKKK+ 
Sbjct: 442 EEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKKQA 477



 Score = 29.5 bits (67), Expect = 2.0
 Identities = 14/39 (35%), Positives = 23/39 (58%), Gaps = 1/39 (2%)

Query: 249 EEDEEKTET-RKKDEDVLGKEKENKKKKKKKKKKEKKNR 286
            E+E +  T  KK    + K  E  +KK++++KKEKK +
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKK 433



 Score = 29.5 bits (67), Expect = 2.4
 Identities = 10/39 (25%), Positives = 29/39 (74%)

Query: 247 RREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKN 285
            ++++EE+ E ++K E+   +E+E  +++K++++++KK 
Sbjct: 437 GKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475



 Score = 28.7 bits (65), Expect = 4.0
 Identities = 10/38 (26%), Positives = 27/38 (71%)

Query: 247 RREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
           +REE++++ + +        +E+E +K+KK+++K+E++
Sbjct: 422 KREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEE 459


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 30.5 bits (70), Expect = 1.2
 Identities = 19/54 (35%), Positives = 35/54 (64%), Gaps = 5/54 (9%)

Query: 236 EREVEL-DLE--WIRREE--DEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
           ER  EL  LE   +++EE  D +     K++E++  KEKE ++K+++ +KKE++
Sbjct: 79  ERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEE 132


>gnl|CDD|180007 PRK05309, PRK05309, 30S ribosomal protein S11; Validated.
          Length = 128

 Score = 29.0 bits (66), Expect = 1.2
 Identities = 10/14 (71%), Positives = 10/14 (71%)

Query: 272 KKKKKKKKKKEKKN 285
            KKKK  KKK KKN
Sbjct: 1   AKKKKTVKKKVKKN 14


>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
          Length = 470

 Score = 30.5 bits (69), Expect = 1.2
 Identities = 19/52 (36%), Positives = 27/52 (51%), Gaps = 14/52 (26%)

Query: 248 REEDEEKTETRKKDEDVL--------------GKEKENKKKKKKKKKKEKKN 285
           ++  ++K +  KKD D                  ++ NKKKKKKKKKK+KKN
Sbjct: 19  QKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKKKKKN 70


>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
           Genome duplication is precisely regulated by
           cyclin-dependent kinases CDKs, which bring about the
           onset of S phase by activating replication origins and
           then prevent relicensing of origins until mitosis is
           completed. The optimum sequence motif for CDK
           phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
           to have at least 11 potential phosphorylation sites.
           Drc1 is required for DNA synthesis and S-M replication
           checkpoint control. Drc1 associates with Cdc2 and is
           phosphorylated at the onset of S phase when Cdc2 is
           activated. Thus Cdc2 promotes DNA replication by
           phosphorylating Drc1 and regulating its association with
           Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
           substrates required for DNA replication.
          Length = 397

 Score = 30.1 bits (68), Expect = 1.2
 Identities = 14/48 (29%), Positives = 22/48 (45%)

Query: 240 ELDLEWIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
           E        +ED+E  +  +  E+V  K+K  KK +K+K      N R
Sbjct: 330 EFLGYMGGIDEDDEDEDDEESKEEVEKKQKVKKKPRKRKVNPVSNNFR 377


>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
           complex component).  RNases H are enzymes that
           specifically hydrolyse RNA when annealed to a
           complementary DNA and are present in all living
           organisms. In yeast RNase H2 is composed of a complex of
           three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
           family represents the homologues of Ydr279p. It is not
           known whether non yeast proteins in this family fulfil
           the same function.
          Length = 287

 Score = 30.0 bits (68), Expect = 1.3
 Identities = 20/78 (25%), Positives = 31/78 (39%), Gaps = 10/78 (12%)

Query: 206 LIQSKTAIELSKLLQFRNGETSAPTHIVWGEREVELDLEWIRREEDEEKTETRKKDEDVL 265
           L++S    E   L ++              +RE E D+E     E   + + + K+E   
Sbjct: 211 LLKSLLIPEFKPLDKYLKESKKK-------KRETEEDVE---AAESRAEKKRKSKEEIKK 260

Query: 266 GKEKENKKKKKKKKKKEK 283
            K KE+K  K  KK   K
Sbjct: 261 KKPKESKGVKALKKVVAK 278



 Score = 28.8 bits (65), Expect = 2.7
 Identities = 13/38 (34%), Positives = 21/38 (55%)

Query: 249 EEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNR 286
              E K + R+ +EDV   E   +KK+K K++ +KK  
Sbjct: 226 YLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKP 263



 Score = 27.3 bits (61), Expect = 8.2
 Identities = 14/37 (37%), Positives = 20/37 (54%), Gaps = 2/37 (5%)

Query: 248 REEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
            EED E  E+R + +     ++E KKKK K+ K  K 
Sbjct: 237 TEEDVEAAESRAEKK--RKSKEEIKKKKPKESKGVKA 271


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 30.4 bits (69), Expect = 1.3
 Identities = 15/48 (31%), Positives = 25/48 (52%)

Query: 238  EVELDLEWIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKN 285
            + E  LE     E++E  + ++      GK  + +K K KKK+K+KK 
Sbjct: 1136 KFEEALEEQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKK 1183


>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
          Length = 135

 Score = 28.9 bits (65), Expect = 1.4
 Identities = 13/19 (68%), Positives = 16/19 (84%)

Query: 267 KEKENKKKKKKKKKKEKKN 285
           K K+ K+KKKKKKKK+KK 
Sbjct: 108 KAKQKKQKKKKKKKKKKKT 126



 Score = 28.5 bits (64), Expect = 2.0
 Identities = 12/18 (66%), Positives = 15/18 (83%)

Query: 267 KEKENKKKKKKKKKKEKK 284
           ++K+ KKKKKKKKKK  K
Sbjct: 111 QKKQKKKKKKKKKKKTSK 128



 Score = 28.1 bits (63), Expect = 2.6
 Identities = 13/18 (72%), Positives = 16/18 (88%)

Query: 267 KEKENKKKKKKKKKKEKK 284
           KEK  +KK+KKKKKK+KK
Sbjct: 106 KEKAKQKKQKKKKKKKKK 123



 Score = 27.7 bits (62), Expect = 4.2
 Identities = 15/34 (44%), Positives = 21/34 (61%)

Query: 251 DEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
           +E+K   +   E    K+++ KKKKKKKKK  KK
Sbjct: 96  EEQKIVKQVLKEKAKQKKQKKKKKKKKKKKTSKK 129



 Score = 27.7 bits (62), Expect = 4.2
 Identities = 12/29 (41%), Positives = 17/29 (58%)

Query: 256 ETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
           E  K+ +    K+K+ KKK  KK  K+KK
Sbjct: 107 EKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135



 Score = 27.0 bits (60), Expect = 6.2
 Identities = 13/33 (39%), Positives = 19/33 (57%), Gaps = 4/33 (12%)

Query: 253 EKTETRKKDEDVLGKEKENKKKKKKKKKKEKKN 285
           ++   +KK +    K+K+ KKKKK  KK  KK 
Sbjct: 106 KEKAKQKKQK----KKKKKKKKKKTSKKAAKKK 134



 Score = 26.6 bits (59), Expect = 8.9
 Identities = 11/30 (36%), Positives = 18/30 (60%)

Query: 252 EEKTETRKKDEDVLGKEKENKKKKKKKKKK 281
           +EK + +K+ +    K+K+   KK  KKKK
Sbjct: 106 KEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135


>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein.  CDC45 is an essential gene
           required for initiation of DNA replication in S.
           cerevisiae, forming a complex with MCM5/CDC46.
           Homologues of CDC45 have been identified in human, mouse
           and smut fungus among others.
          Length = 583

 Score = 30.0 bits (68), Expect = 1.5
 Identities = 8/38 (21%), Positives = 24/38 (63%)

Query: 249 EEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNR 286
            ED+E  +    D+D+  +E+  ++++++++ +EK+  
Sbjct: 143 SEDDEDDDDDDDDDDIATRERSLERRRRRREWEEKRAE 180


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 29.9 bits (67), Expect = 1.5
 Identities = 14/37 (37%), Positives = 21/37 (56%), Gaps = 1/37 (2%)

Query: 249 EEDEEKTETRKKDEDVLGKE-KENKKKKKKKKKKEKK 284
           E+ EE  E + ++E  L K+ K+ KK K KK   +K 
Sbjct: 318 EDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKD 354



 Score = 28.8 bits (64), Expect = 4.0
 Identities = 16/52 (30%), Positives = 26/52 (50%)

Query: 236 EREVELDLEWIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
           ERE +L  E   + E E+  ++ + +E+   +E    KK KK KK + K   
Sbjct: 299 EREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNG 350



 Score = 28.0 bits (62), Expect = 7.3
 Identities = 14/40 (35%), Positives = 22/40 (55%), Gaps = 2/40 (5%)

Query: 246 IRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKN 285
           I ++ED E++E  K +E+  G   +  KK KK K K+   
Sbjct: 314 IEQDEDSEESEEEKNEEE--GGLSKKGKKLKKLKGKKNGL 351



 Score = 28.0 bits (62), Expect = 7.4
 Identities = 18/51 (35%), Positives = 32/51 (62%), Gaps = 2/51 (3%)

Query: 235 GEREVELDLEWIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKN 285
           G R  +L ++ +  +++++  E+ K  ED  G E+++KKKKKK  K +KK 
Sbjct: 207 GGRGKDLKIKDLEGDDEDDGDESDKGGED--GDEEKSKKKKKKLAKNKKKL 255


>gnl|CDD|153289 cd07605, I-BAR_IMD, Inverse (I)-BAR, also known as the IRSp53/MIM
           homology Domain (IMD), a dimerization module that binds
           and bends membranes.  Inverse (I)-BAR (or IMD) is a
           member of the Bin/Amphiphysin/Rvs (BAR) domain family.
           It is a dimerization and lipid-binding module that bends
           membranes and induces membrane protrusions in the
           opposite direction compared to classical BAR and F-BAR
           domains, which produce membrane invaginations. IMD
           domains are found in Insulin Receptor tyrosine kinase
           Substrate p53 (IRSp53), Missing in Metastasis (MIM), and
           Brain-specific Angiogenesis Inhibitor 1-Associated
           Protein 2-like (BAIAP2L) proteins. These are
           multi-domain proteins that act as scaffolding proteins
           and transducers of a variety of signaling pathways that
           link membrane dynamics and the underlying actin
           cytoskeleton. Most members contain an N-terminal IMD, an
           SH3 domain, and a WASP homology 2 (WH2) actin-binding
           motif at the C-terminus, exccept for MIM which does not
           carry an SH3 domain. Some members contain additional
           domains and motifs. The IMD domain binds and bundles
           actin filaments, binds membranes and produces membrane
           protrusions, and interacts with the small GTPase Rac.
          Length = 223

 Score = 29.6 bits (67), Expect = 1.5
 Identities = 17/46 (36%), Positives = 28/46 (60%)

Query: 236 EREVELDLEWIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKK 281
           E+++ELD + I + E + K E ++K ED+     E KK +KK +K 
Sbjct: 100 EKKLELDQKVINKFEKDYKKEYKQKREDLDKARSELKKLQKKSQKS 145


>gnl|CDD|149438 pfam08374, Protocadherin, Protocadherin.  The structure of
           protocadherins is similar to that of classic cadherins
           (pfam00028), but particularly on the cytoplasmic domains
           they also have some unique features. They are expressed
           in a variety of organisms and are found in high
           concentrations in the brain where they seem to be
           localised mainly at cell-cell contact sites. Their
           expression seems to be developmentally regulated.
          Length = 223

 Score = 29.4 bits (66), Expect = 1.5
 Identities = 14/29 (48%), Positives = 20/29 (68%)

Query: 258 RKKDEDVLGKEKENKKKKKKKKKKEKKNR 286
           +K+ ED     +ENK+KKKKK KK+K  +
Sbjct: 76  KKETEDWFSPNQENKQKKKKKDKKKKSPK 104


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 29.9 bits (68), Expect = 1.5
 Identities = 17/54 (31%), Positives = 32/54 (59%), Gaps = 2/54 (3%)

Query: 236 EREVELDLEWIRREEDEEKTETRKK--DEDVLGKEKENKKKKKKKKKKEKKNRR 287
           E ++    E  R+EE +EK E +KK   E  L K    +++K ++K+++K+ R+
Sbjct: 269 EEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQARK 322



 Score = 28.4 bits (64), Expect = 4.3
 Identities = 12/42 (28%), Positives = 26/42 (61%)

Query: 246 IRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
           + +  +EE+ +  K  E+   +E + KK++KKK+++E K  +
Sbjct: 261 VDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAK 302


>gnl|CDD|220274 pfam09510, Rtt102p, Rtt102p-like transcription regulator protein.
           This protein is found in fungi. The family includes
           Rtt102p, a transcription regulator protein which appears
           to be integrally associated with both the Swi-Snf and
           the RSC chromatin remodelling complexes.
          Length = 130

 Score = 28.8 bits (64), Expect = 1.7
 Identities = 15/41 (36%), Positives = 21/41 (51%), Gaps = 1/41 (2%)

Query: 245 WIRREEDEEKTETRKKDEDVLG-KEKENKKKKKKKKKKEKK 284
           W +  +DE      +  +DVL   E +  KK KKKKK + K
Sbjct: 60  WQKDPDDEPLDFKDESTDDVLDLDEFDRTKKSKKKKKVQVK 100


>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain.  This is a family of proteins of
           approximately 300 residues, found in plants and
           vertebrates. They contain a highly conserved DDRGK
           motif.
          Length = 189

 Score = 29.3 bits (66), Expect = 1.7
 Identities = 16/54 (29%), Positives = 38/54 (70%), Gaps = 1/54 (1%)

Query: 234 WGEREVELDLEWIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
             +RE E +    R++ +E++   RK++E+ L +E+E KK+++++K++E++ R+
Sbjct: 18  RQQREAEEEEREERKKLEEKREGERKEEEE-LEEEREKKKEEEERKEREEQARK 70


>gnl|CDD|182080 PRK09795, PRK09795, aminopeptidase; Provisional.
          Length = 361

 Score = 29.5 bits (66), Expect = 1.8
 Identities = 19/58 (32%), Positives = 26/58 (44%), Gaps = 8/58 (13%)

Query: 205 ALIQSKTAIELSKL-LQFRNGETSAPTHIV------WGEREVELDLEWIRREEDEEKT 255
            L Q KT  E+ K+ L     +  A  HI         ERE+  +LEW  R++  EK 
Sbjct: 123 VLRQIKTPEEVEKIRLACGIADRGA-EHIRRFIQAGMSEREIAAELEWFMRQQGAEKA 179


>gnl|CDD|192292 pfam09429, Wbp11, WW domain binding protein 11.  The WW domain is a
           small protein module with a triple-stranded beta-sheet
           fold. This is a family of WW domain binding proteins.
          Length = 78

 Score = 27.6 bits (62), Expect = 1.8
 Identities = 11/19 (57%), Positives = 14/19 (73%)

Query: 269 KENKKKKKKKKKKEKKNRR 287
           KE KKK+ KK K E++ RR
Sbjct: 12  KEQKKKELKKNKAERQARR 30


>gnl|CDD|217834 pfam03998, Utp11, Utp11 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 239

 Score = 29.3 bits (66), Expect = 2.0
 Identities = 23/85 (27%), Positives = 38/85 (44%), Gaps = 11/85 (12%)

Query: 214 ELSKLLQFRNGETSAPTHIVWGEREVEL-----------DLEWIRREEDEEKTETRKKDE 262
           +L + L   +  ++ P HIV+ + E E              E + R E+  +    +K  
Sbjct: 107 KLKEELHLTDSGSAGPKHIVFVDDEEEQKSFDPAEYFDTTPELLDRRENRPRISQLEKTS 166

Query: 263 DVLGKEKENKKKKKKKKKKEKKNRR 287
            V  K+K+   KKK+K  KE K R+
Sbjct: 167 LVDEKQKKKSAKKKRKLYKELKERK 191


>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
           homolog [Translation, ribosomal structure and
           biogenesis].
          Length = 395

 Score = 29.3 bits (66), Expect = 2.2
 Identities = 15/54 (27%), Positives = 27/54 (50%), Gaps = 4/54 (7%)

Query: 236 EREVELDLEWIRREEDEEKTETRKKDEDVLGK--EKENKKKKKKKKKKEKKNRR 287
             E+E  +E ++  E   K  T+ K E    +   +  +KKK+KK K E++  +
Sbjct: 343 REELEKRIEKLK--EKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQ 394


>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
          Length = 651

 Score = 29.2 bits (66), Expect = 2.7
 Identities = 14/41 (34%), Positives = 26/41 (63%)

Query: 241 LDLEWIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKK 281
            D E ++RE++E++    +K    L K++E KKK+ +K +K
Sbjct: 548 DDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 29.2 bits (65), Expect = 2.8
 Identities = 12/48 (25%), Positives = 24/48 (50%)

Query: 235 GEREVELDLEWIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKE 282
           GE E E ++   R+ E E + E   K+ D  G+ +  + + + + + E
Sbjct: 677 GENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAE 724


>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
           N-terminal.  SRP is a complex of six distinct
           polypeptides and a 7S RNA that is essential for
           transferring nascent polypeptide chains that are
           destined for export from the cell to the translocation
           apparatus of the endoplasmic reticulum (ER) membrane.
           SRP binds hydrophobic signal sequences as they emerge
           from the ribosome, and arrests translation.
          Length = 272

 Score = 28.9 bits (65), Expect = 2.9
 Identities = 10/41 (24%), Positives = 17/41 (41%), Gaps = 5/41 (12%)

Query: 247 RREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
           RR +   K  +          ++++ K K   KK  KK R+
Sbjct: 179 RRAKKAAKLSSTAS-----SGDEKSPKSKAAPKKAGKKMRK 214


>gnl|CDD|227550 COG5225, RRS1, Uncharacterized protein involved in ribosome
           biogenesis [Translation, ribosomal structure and
           biogenesis].
          Length = 172

 Score = 28.5 bits (63), Expect = 3.2
 Identities = 12/48 (25%), Positives = 20/48 (41%), Gaps = 7/48 (14%)

Query: 240 ELDLEWIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
            L  +W+   E + KT  R+       + K N KK +  ++     RR
Sbjct: 132 RLLKQWLVEGEVKLKTLLRR-------QRKNNIKKNEANRRGNISRRR 172


>gnl|CDD|131269 TIGR02214, spoVD_pbp, stage V sporulation protein D.  This model
           describes the spoVD subfamily of homologs of the cell
           division protein FtsI, a penicillin binding protein.
           This subfamily is restricted to Bacillus subtilis and
           related Gram-positive species with known or suspected
           endospore formation capability. In these species, the
           functional equivalent of FtsI is desginated PBP-2B, a
           paralog of spoVD [Cell envelope, Biosynthesis and
           degradation of murein sacculus and peptidoglycan,
           Cellular processes, Sporulation and germination].
          Length = 636

 Score = 29.0 bits (65), Expect = 3.2
 Identities = 18/58 (31%), Positives = 28/58 (48%), Gaps = 3/58 (5%)

Query: 222 RNGETSAPTHIVWGEREVELDLEWIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKK 279
           RNG   A +  V     V+LD   IR+ ++ ++ E   +   VL  +KE   K+  KK
Sbjct: 55  RNGNELAVSVSVD---RVDLDPATIRKNKEPKQREIANELARVLEMDKEKVFKRITKK 109


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 29.0 bits (65), Expect = 3.2
 Identities = 6/39 (15%), Positives = 18/39 (46%)

Query: 248 REEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNR 286
            ++ EE  + R  ++    + ++    +K  K+ E+  +
Sbjct: 74  EQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAK 112


>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon. 
          Length = 431

 Score = 28.8 bits (64), Expect = 3.3
 Identities = 17/58 (29%), Positives = 32/58 (55%), Gaps = 7/58 (12%)

Query: 234 WGEREVELDLEWIRREED----EEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
             E  +EL+    +REE     EE+ + RK++E      K  ++++K++ K+E + RR
Sbjct: 209 QQEAALELEELKKKREERRKVLEEEEQRRKQEE---ADRKSREEEEKRRLKEEIERRR 263



 Score = 27.7 bits (61), Expect = 8.9
 Identities = 12/47 (25%), Positives = 23/47 (48%)

Query: 237 REVELDLEWIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEK 283
            E E +     REE EE     K ++    ++ E  +K++K+ + E+
Sbjct: 112 EEEEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKEPEPEE 158


>gnl|CDD|200340 TIGR03927, T7SS_EssA_Firm, type VII secretion protein EssA.
           Members of this family are associated with type VII
           secretion of WXG100 family targets in the Firmicutes,
           but not in the Actinobacteria. This highly divergent
           protein family consists largely of a central region of
           highly polar low-complexity sequence containing
           occasional LF motifs in weak repeats about 17 residues
           in length, flanked by hydrophobic N- and C-terminal
           regions [Protein fate, Protein and peptide secretion and
           trafficking].
          Length = 150

 Score = 28.1 bits (63), Expect = 3.4
 Identities = 9/37 (24%), Positives = 18/37 (48%)

Query: 249 EEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKN 285
           ++D E      ++E  L KE    +++KK   ++ K 
Sbjct: 33  KKDIEINTDYLQEETELDKELFTPEEQKKITFQKHKE 69


>gnl|CDD|132849 cd07210, Pat_hypo_W_succinogenes_WS1459_like, Hypothetical patatin
           similar to WS1459 of Wolinella succinogenes.
           Patatin-like phospholipase. This family predominantly
           consists of bacterial patatin glycoproteins. The patatin
           protein accounts for up to 40% of the total soluble
           protein in potato tubers. Patatin is a storage protein,
           but it also has the enzymatic activity of a lipid acyl
           hydrolase, catalyzing the cleavage of fatty acids from
           membrane lipids. Members of this family have also been
           found in vertebrates.
          Length = 221

 Score = 28.5 bits (64), Expect = 3.5
 Identities = 9/56 (16%), Positives = 20/56 (35%), Gaps = 2/56 (3%)

Query: 145 IRCDDTPVVYTHVITTPEGEFFCHNHAGDELRHPFDPSKIYMSLDTGRIYHPTTSQ 200
           +R +   ++Y H +      +   N  G   R    P +  +++ +G I       
Sbjct: 162 LRPEIERILY-HHVAPRRP-WERLNIIGKISRALIRPFRWNLAILSGVIEDVLEPL 215


>gnl|CDD|240334 PTZ00258, PTZ00258, GTP-binding protein; Provisional.
          Length = 390

 Score = 28.8 bits (65), Expect = 3.6
 Identities = 15/33 (45%), Positives = 22/33 (66%), Gaps = 2/33 (6%)

Query: 255 TETRKKDEDVLGK--EKENKKKKKKKKKKEKKN 285
           +E   KD + + K  ++  KK+KKKKKKKE+K 
Sbjct: 149 SELILKDLEFVEKRLDELTKKRKKKKKKKEEKV 181


>gnl|CDD|240576 cd12932, RRP7_like, RRP7 domain ribosomal RNA-processing protein 7
           (Rrp7p), ribosomal RNA-processing protein 7 homolog A
           (Rrp7A), and similar proteins.  This CD corresponds to
           the RRP7 domain of Rrp7p and Rrp7A. Rrp7p is encoded by
           YCL031C gene from Saccharomyces cerevisiae. It is an
           essential yeast protein involved in pre-rRNA processing
           and ribosome assembly, and is speculated to be required
           for correct assembly of rpS27 into the pre-ribosomal
           particle. Rrp7A, also termed gastric cancer antigen
           Zg14, is the Rrp7p homolog mainly found in Metazoans.
           The cellular function of Rrp7A remains unclear
           currently. Both Rrp7p and Rrp7A harbor an N-terminal RNA
           recognition motif (RRM), also termed RBD (RNA binding
           domain) or RNP (ribonucleoprotein domain), and a
           C-terminal RRP7 domain.
          Length = 118

 Score = 27.6 bits (62), Expect = 3.9
 Identities = 10/19 (52%), Positives = 15/19 (78%)

Query: 267 KEKENKKKKKKKKKKEKKN 285
           + K  +K+KKKKKKKE ++
Sbjct: 72  EAKAKEKEKKKKKKKELED 90


>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin.  Nucleoplasmins are also
           known as chromatin decondensation proteins. They bind to
           core histones and transfer DNA to them in a reaction
           that requires ATP. This is thought to play a role in the
           assembly of regular nucleosomal arrays.
          Length = 146

 Score = 27.7 bits (62), Expect = 4.0
 Identities = 20/66 (30%), Positives = 33/66 (50%), Gaps = 1/66 (1%)

Query: 213 IELSKLLQFRNGETSAPTHIVWGEREVELDLEWIRREEDEEKTETRKKDEDVLGKEKENK 272
            EL+  + FR    S P HI  G+  V  + +    +E++E+ E  ++D+D    E+E  
Sbjct: 82  FELTPPVTFRLKSGSGPVHIS-GQHLVASEEDESDDDEEDEEEEDDEEDDDEDESEEEES 140

Query: 273 KKKKKK 278
             KK K
Sbjct: 141 PVKKVK 146


>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
            biogenesis [Translation, ribosomal structure and
            biogenesis].
          Length = 1077

 Score = 28.6 bits (63), Expect = 4.5
 Identities = 7/34 (20%), Positives = 20/34 (58%)

Query: 249  EEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKE 282
            EE E     ++  E+ +GK+++ ++++ +K   +
Sbjct: 1030 EEKERMESLQRAKEEEIGKKEKEREQRIRKTIHD 1063


>gnl|CDD|235778 PRK06319, PRK06319, DNA topoisomerase I/SWI domain fusion protein;
           Validated.
          Length = 860

 Score = 28.6 bits (64), Expect = 4.5
 Identities = 14/36 (38%), Positives = 20/36 (55%)

Query: 252 EEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
           E+KT+ +KK     GK  +  KKK K K K+   +R
Sbjct: 748 EKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKKR 783


>gnl|CDD|227238 COG4901, COG4901, Ribosomal protein S25 [Translation, ribosomal
           structure and biogenesis].
          Length = 107

 Score = 27.2 bits (60), Expect = 4.6
 Identities = 13/31 (41%), Positives = 16/31 (51%)

Query: 256 ETRKKDEDVLGKEKENKKKKKKKKKKEKKNR 286
              KK E       ++KKK  KKKKKE+  R
Sbjct: 8   SKEKKAEKAKAGTAKDKKKWSKKKKKEEARR 38


>gnl|CDD|220838 pfam10659, Trypan_glycop_C, Trypanosome variant surface
           glycoprotein C-terminal domain.  The trypanosome
           parasite expresses these proteins to evade the immune
           response.
          Length = 98

 Score = 27.0 bits (60), Expect = 4.7
 Identities = 10/36 (27%), Positives = 13/36 (36%)

Query: 250 EDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKN 285
           E + K       +    +    K K KK KK  KK 
Sbjct: 31  EGKAKKNGAPVTQTAGTETTTEKCKGKKDKKDCKKG 66


>gnl|CDD|240577 cd12950, RRP7_Rrp7p, RRP7 domain ribosomal RNA-processing protein 7
           (Rrp7p) and similar proteins.  This CD corresponds to
           the RRP7 domain of Rrp7p. Rrp7p is encoded by YCL031C
           gene from Saccharomyces cerevisiae. It is an essential
           yeast protein involved in pre-rRNA processing and
           ribosome assembly. Rrp7p contains an N-terminal RNA
           recognition motif (RRM), also termed RBD (RNA binding
           domain) or RNP (ribonucleoprotein domain), and a
           C-terminal RRP7 domain.
          Length = 128

 Score = 27.2 bits (61), Expect = 4.8
 Identities = 11/32 (34%), Positives = 20/32 (62%)

Query: 254 KTETRKKDEDVLGKEKENKKKKKKKKKKEKKN 285
           K     ++     +E++ +K+KKKKKKKE ++
Sbjct: 62  KGPAAGEEAGKAAEEEKKEKEKKKKKKKELED 93


>gnl|CDD|218538 pfam05285, SDA1, SDA1.  This family consists of several SDA1
           protein homologues. SDA1 is a Saccharomyces cerevisiae
           protein which is involved in the control of the actin
           cytoskeleton. The protein is essential for cell
           viability and is localised in the nucleus.
          Length = 317

 Score = 28.1 bits (63), Expect = 5.1
 Identities = 10/42 (23%), Positives = 20/42 (47%)

Query: 244 EWIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKN 285
           EWI  E D+E   +  +DE+   +  +  K+   ++  E+  
Sbjct: 117 EWIDVESDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDE 158


>gnl|CDD|227478 COG5149, TOA1, Transcription initiation factor IIA, large chain
           [Transcription].
          Length = 293

 Score = 28.1 bits (62), Expect = 5.1
 Identities = 16/92 (17%), Positives = 28/92 (30%), Gaps = 12/92 (13%)

Query: 190 TGRIYHPTTSQYGSVALIQSKTAIELSKLLQFRNGETSAPTHIVWGEREVELDLEWIRRE 249
            G I     S          +   ELS          +   +  W ER     ++  + E
Sbjct: 137 YGDIIAEVISL----PNRLEQVEDELS----IGKSAITTLRNTDWRER----LIDDTQSE 184

Query: 250 EDEEKTETRKKDEDVLGKEKENKKKKKKKKKK 281
            D E+   R   + +   E+ ++      K K
Sbjct: 185 WDGERMRRRDGKQGIHQYERLSEGPAHAFKGK 216


>gnl|CDD|233352 TIGR01310, L7, 60S ribosomal protein L7, eukaryotic.  This model
           describes the eukaryotic 60S (cytosolic) ribosomal
           protein L7 and paralogs that may or may not also be L7.
           Human, Drosophila, and Arabidopsis all have both a
           typical L7 and an L7-related paralog. This family is
           designated subfamily rather than equivalog to reflect
           these uncharacterized paralogs. Members of this family
           average ~ 250 residues in length, somewhat longer than
           the archaeal L30P/L7E homolog (~ 155 residues) and much
           longer than the related bacterial/organellar form (~ 60
           residues).
          Length = 235

 Score = 28.1 bits (63), Expect = 5.2
 Identities = 11/33 (33%), Positives = 18/33 (54%)

Query: 254 KTETRKKDEDVLGKEKENKKKKKKKKKKEKKNR 286
           K + R++    + K+ + KKK  KKK+K    R
Sbjct: 6   KRKARQELAVQVAKQAKAKKKANKKKRKIYFKR 38


>gnl|CDD|216337 pfam01159, Ribosomal_L6e, Ribosomal protein L6e. 
          Length = 108

 Score = 26.8 bits (60), Expect = 5.2
 Identities = 11/33 (33%), Positives = 20/33 (60%)

Query: 253 EKTETRKKDEDVLGKEKENKKKKKKKKKKEKKN 285
           EK + +KK E     EK+ KK+  +++K ++K 
Sbjct: 40  EKKKKKKKSEGEFFAEKKEKKEVSEQRKADQKA 72


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 28.1 bits (63), Expect = 5.5
 Identities = 13/36 (36%), Positives = 19/36 (52%)

Query: 249 EEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
            E ++K +  KK+    G+ K   K  KK KK +KK
Sbjct: 70  SEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKK 105



 Score = 28.1 bits (63), Expect = 6.4
 Identities = 16/40 (40%), Positives = 21/40 (52%)

Query: 246 IRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKN 285
              E     T T+K  ++    E + K +KKKKKKKEKK 
Sbjct: 43  SEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKE 82


>gnl|CDD|219947 pfam08639, SLD3, DNA replication regulator SLD3.  The SLD3 DNA
           replication regulator is required for loading and
           maintenance of Cdc45 on chromatin during DNA
           replication.
          Length = 437

 Score = 28.2 bits (63), Expect = 5.6
 Identities = 10/44 (22%), Positives = 19/44 (43%)

Query: 242 DLEWIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKN 285
                  + ++E  +   K    L + K  K++K+KK KK+   
Sbjct: 118 LELDRFTKFEKEYKKKLLKRSQNLDRSKRRKRRKRKKNKKQDLP 161


>gnl|CDD|217502 pfam03343, SART-1, SART-1 family.  SART-1 is a protein involved in
           cell cycle arrest and pre-mRNA splicing. It has been
           shown to be a component of U4/U6 x U5 tri-snRNP complex
           in human, Schizosaccharomyces pombe and Saccharomyces
           cerevisiae. SART-1 is a known tumour antigen in a range
           of cancers recognised by T cells.
          Length = 603

 Score = 28.2 bits (63), Expect = 5.8
 Identities = 12/21 (57%), Positives = 14/21 (66%)

Query: 267 KEKENKKKKKKKKKKEKKNRR 287
            E    KK KKKKKK+KK R+
Sbjct: 270 SEMVKFKKPKKKKKKKKKRRK 290



 Score = 27.4 bits (61), Expect = 10.0
 Identities = 11/22 (50%), Positives = 13/22 (59%)

Query: 266 GKEKENKKKKKKKKKKEKKNRR 287
               E  K KK KKKK+KK +R
Sbjct: 267 YDVSEMVKFKKPKKKKKKKKKR 288


>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
          Length = 1832

 Score = 28.1 bits (63), Expect = 6.0
 Identities = 15/50 (30%), Positives = 22/50 (44%), Gaps = 8/50 (16%)

Query: 244  EWIRREEDEEKTETRKKDEDVLGKEKEN--------KKKKKKKKKKEKKN 285
            E   + E E   E +   E VL  +++N          KK+K KK+ K N
Sbjct: 1529 EPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAESDIKKRKNKKQYKSN 1578


>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
          Length = 213

 Score = 27.6 bits (62), Expect = 6.1
 Identities = 11/41 (26%), Positives = 21/41 (51%)

Query: 247 RREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
           R E+  +K E   + E    +E++ K K  +KK+K +  + 
Sbjct: 114 RAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKP 154


>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992).  This
           bacterial family of proteins has no known function.
           However, the cis-regulatory yjdF motif, just upstream
           from the gene encoding the proteins for this family, is
           a small non-coding RNA, Rfam:RF01764. The yjdF motif is
           found in many Firmicutes, including Bacillus subtilis.
           In most cases, it resides in potential 5' UTRs of
           homologues of the yjdF gene whose function is unknown.
           However, in Streptococcus thermophilus, a yjdF RNA motif
           is associated with an operon whose protein products
           synthesise nicotinamide adenine dinucleotide (NAD+).
           Also, the S. thermophilus yjdF RNA lacks typical yjdF
           motif consensus features downstream of and including the
           P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
           S. thermophilus RNAs might sense a distinct compound
           that structurally resembles the ligand bound by other
           yjdF RNAs. On the ohter hand, perhaps these RNAs have an
           alternative solution forming a similar binding site, as
           is observed with some SAM riboswitches.
          Length = 132

 Score = 27.2 bits (61), Expect = 6.1
 Identities = 14/40 (35%), Positives = 20/40 (50%), Gaps = 2/40 (5%)

Query: 246 IRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKN 285
             +E  +    T  K +  L  E E  K++KKK+ KEKK 
Sbjct: 74  AAKEVKKPGIST--KAQQALKLEHERNKQEKKKRSKEKKE 111


>gnl|CDD|233371 TIGR01354, cyt_deam_tetra, cytidine deaminase, homotetrameric.
           This small, homotetrameric zinc metalloprotein is found
           in humans and most bacteria. A related, homodimeric form
           with a much larger subunit is found in E. coli and in
           Arabidopsis. Both types may act on deoxycytidine as well
           as cytidine [Purines, pyrimidines, nucleosides, and
           nucleotides, Salvage of nucleosides and nucleotides].
          Length = 127

 Score = 27.2 bits (61), Expect = 6.1
 Identities = 15/59 (25%), Positives = 25/59 (42%), Gaps = 8/59 (13%)

Query: 127 LYEDEFPFISPCGRERNFIR---CDDTPVVYTHVITTPEGEFFCHNHAGDELRHPFDPS 182
           + +     +SPCG  R  +      DTP+  T+     +G +  +   G+ L   F PS
Sbjct: 72  VADSADDSVSPCGACRQVLAEFCGPDTPIYMTN----NDGTYKVYT-VGELLPFGFGPS 125


>gnl|CDD|221919 pfam13094, CENP-Q, CENP-A-nucleosome distal (CAD) centromere
           subunit.  CENP-Q is one of the components that assembles
           onto the CENP-A-nucleosome distal (CAD) centromere. The
           centromere, which is the basic element of chromosome
           inheritance, is epigenetically determined in mammals.
           CENP-A, the centromere-specific histone H3 variant,
           assembles an array of nucleosomes and it is this that
           seems to be the prime candidate for specifying
           centromere identity. CENP-A nucleosomes directly recruit
           a proximal CENP-A nucleosome associated complex (NAC)
           comprised of CENP-M, CENP-N and CENP-T, CENP-U(50),
           CENP-C and CENP-H. Assembly of the CENP-A NAC at
           centromeres is dependent on CENP-M, CENP-N and CENP-T.
           Additionally, there are seven other subunits which make
           up the CENP-A-nucleosome distal (CAD) centromere,
           CENP-K, CENP-L, CENP-O, CENP-P, CENP-Q, CENP-R and
           CENP-S, also assembling on the CENP-A NAC. Fta7 is the
           equivalent component of the fission yeast Sim4 complex.
          Length = 159

 Score = 27.3 bits (61), Expect = 6.3
 Identities = 13/54 (24%), Positives = 30/54 (55%), Gaps = 2/54 (3%)

Query: 236 EREVELDLEWIR--REEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
           ER++  +L+ +    EE   +    +KD + L + ++N K  ++++++E KN  
Sbjct: 33  ERQLAAELKQLELLEEEIRREEAELEKDLEYLQELEKNAKALEREREEESKNLH 86


>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota).  DNA
           Topoisomerase I (eukaryota), DNA topoisomerase V,
           Vaccina virus topoisomerase, Variola virus
           topoisomerase, Shope fibroma virus topoisomeras.
          Length = 391

 Score = 28.1 bits (63), Expect = 6.4
 Identities = 13/39 (33%), Positives = 21/39 (53%)

Query: 249 EEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
              + K + + K E    K     K+KKK+KKKE+K ++
Sbjct: 305 MISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKK 343


>gnl|CDD|240578 cd12951, RRP7_Rrp7A, RRP7 domain ribosomal RNA-processing protein 7
           homolog A (Rrp7A) and similar proteins.  The family
           corresponds to the RRP7 domain of Rrp7A, also termed
           gastric cancer antigen Zg14, and similar proteins which
           are yeast ribosomal RNA-processing protein 7 (Rrp7p)
           homologs mainly found in Metazoans. The cellular
           function of Rrp7A remains unclear currently. Rrp7A
           harbors an N-terminal RNA recognition motif (RRM), also
           termed RBD (RNA binding domain) or RNP
           (ribonucleoprotein domain), and a C-terminal RRP7
           domain.
          Length = 129

 Score = 26.9 bits (60), Expect = 6.4
 Identities = 18/44 (40%), Positives = 22/44 (50%), Gaps = 7/44 (15%)

Query: 249 EEDEE--KTETRKK-----DEDVLGKEKENKKKKKKKKKKEKKN 285
           E DE+   T T+K              K  +K+KKKKKKKE KN
Sbjct: 46  EPDEDGWVTVTKKGRRPKTARKESVAAKAAEKEKKKKKKKELKN 89


>gnl|CDD|220600 pfam10147, CR6_interact, Growth arrest and DNA-damage-inducible
           proteins-interacting protein 1.  Members of this family
           of proteins act as negative regulators of G1 to S cell
           cycle phase progression by inhibiting cyclin-dependent
           kinases. Inhibitory effects are additive with GADD45
           proteins but occur also in the absence of GADD45
           proteins. Furthermore, they act as a repressor of the
           orphan nuclear receptor NR4A1 by inhibiting AB
           domain-mediated transcriptional activity.
          Length = 217

 Score = 27.5 bits (61), Expect = 6.6
 Identities = 12/21 (57%), Positives = 17/21 (80%)

Query: 267 KEKENKKKKKKKKKKEKKNRR 287
           KEKE KKK K+ K++EK+ +R
Sbjct: 187 KEKEEKKKVKEAKRREKEEKR 207


>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987).  A
           family of uncharacterized proteins found by clustering
           human gut metagenomic sequences.
          Length = 379

 Score = 28.0 bits (63), Expect = 6.6
 Identities = 11/48 (22%), Positives = 24/48 (50%), Gaps = 6/48 (12%)

Query: 243 LEWIRREEDEEKTETR------KKDEDVLGKEKENKKKKKKKKKKEKK 284
           ++ +   E+E + E        + ++++   EK+  +KK KK  K+ K
Sbjct: 60  MKPLEEIEEELREEYEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGK 107


>gnl|CDD|235855 PRK06707, PRK06707, amidase; Provisional.
          Length = 536

 Score = 28.0 bits (62), Expect = 6.8
 Identities = 11/35 (31%), Positives = 18/35 (51%)

Query: 249 EEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEK 283
           E  E+   T+ + E V+   +EN KK+  +   EK
Sbjct: 432 EASEKSAITKDEFEKVVQTSQENAKKELDRYLVEK 466


>gnl|CDD|236254 PRK08384, PRK08384, thiamine biosynthesis protein ThiI;
           Provisional.
          Length = 381

 Score = 28.0 bits (62), Expect = 7.0
 Identities = 32/134 (23%), Positives = 59/134 (44%), Gaps = 15/134 (11%)

Query: 163 GEFFCHNHAGDELRHPFD--------PSKIYMSLDTGRIYH--PTTSQYGSVALIQSKTA 212
           GE+   N   +   H +D          K Y+ +D  + +   P  +Q   VAL+     
Sbjct: 133 GEYILENEESEVDLHNYDIEVGVELMEGKAYVFVDKVKAWGGLPIGTQGKVVALLSGGID 192

Query: 213 IELSKLLQFRNGETSAPTHIVWGEREVELDLEWIRREEDE-EKTETRKKDEDVLGKEKEN 271
             ++  L  + G    P HI  GE+     LE +R+  ++ +K     K E ++ K +E 
Sbjct: 193 SPVAAFLMMKRGVEVIPVHIYMGEKT----LEKVRKIWNQLKKYHYGGKAELIVVKPQER 248

Query: 272 KKKKKKKKKKEKKN 285
           ++  +K K+ +K+N
Sbjct: 249 ERIIQKLKELKKEN 262


>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
          Length = 742

 Score = 27.9 bits (62), Expect = 7.1
 Identities = 10/55 (18%), Positives = 19/55 (34%), Gaps = 5/55 (9%)

Query: 238 EVELDLEWIRREEDEEKTETRKKDEDVLGK-----EKENKKKKKKKKKKEKKNRR 287
            ++ D +     ++  K         ++ K     EK  KK    K   +KK + 
Sbjct: 88  HIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTEKLKKKITVNKSTNKKKKKV 142


>gnl|CDD|177283 PHA00451, PHA00451, protein kinase.
          Length = 362

 Score = 27.8 bits (62), Expect = 7.1
 Identities = 12/45 (26%), Positives = 23/45 (51%)

Query: 242 DLEWIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNR 286
           ++E I  +   E+ + RK   D  G  +  +K   K++K+ +K R
Sbjct: 236 EVEAIANQRMIERCKNRKAKRDPNGTFQVARKAAMKRRKRNRKLR 280


>gnl|CDD|218437 pfam05107, DUF694, Family of unknown function (DUF694).  Family of
           hypothetical bacterial proteins.
          Length = 278

 Score = 27.6 bits (62), Expect = 7.5
 Identities = 13/38 (34%), Positives = 23/38 (60%)

Query: 247 RREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
           +  ++  +   R    + L KEK+ KKKKKKKK++ ++
Sbjct: 67  KLNDEHGELYERADRAEELLKEKKKKKKKKKKKEELRE 104


>gnl|CDD|221857 pfam12923, RRP7, Ribosomal RNA-processing protein 7 (RRP7).  RRP7
           is an essential protein in yeast that is involved in
           pre-rRNA processing and ribosome assembly. It is
           speculated to be required for correct assembly of rpS27
           into the pre-ribosomal particle.
          Length = 131

 Score = 26.8 bits (60), Expect = 7.7
 Identities = 18/46 (39%), Positives = 25/46 (54%), Gaps = 9/46 (19%)

Query: 249 EEDEE--KTETR--KKDEDVLGK-----EKENKKKKKKKKKKEKKN 285
           E DE+   T TR  +K +    +     E+   K+KKKKKKKE +N
Sbjct: 46  EPDEDGWTTVTRGGRKRKAGASRNKAAEERRKLKEKKKKKKKELEN 91


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
           family consists of several hypothetical bacterial
           proteins of around 200 residues in length. The function
           of this family is unknown.
          Length = 214

 Score = 27.4 bits (61), Expect = 7.7
 Identities = 10/44 (22%), Positives = 24/44 (54%)

Query: 242 DLEWIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKN 285
           ++E ++ EE E      K+D+    KE E  +++ +++ +E  +
Sbjct: 61  EIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSD 104



 Score = 27.0 bits (60), Expect = 9.8
 Identities = 10/37 (27%), Positives = 18/37 (48%), Gaps = 3/37 (8%)

Query: 249 EEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKN 285
           ++ E       K+E+   KE  N + K+ K   EK++
Sbjct: 55  DDQETAEIEEVKEEE---KEAANSEDKEDKGDAEKED 88


>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein.  The proteins in this family
           are designated YL1. These proteins have been shown to be
           DNA-binding and may be a transcription factor.
          Length = 238

 Score = 27.3 bits (61), Expect = 7.9
 Identities = 13/49 (26%), Positives = 23/49 (46%)

Query: 236 EREVELDLEWIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
           E E + + E  +  + EE+ + +K+ +    KE   KKKKK     +  
Sbjct: 65  EPESDDEEEGEKELQREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSP 113


>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
           eukaryotic snRNP [Transcription].
          Length = 564

 Score = 27.7 bits (62), Expect = 8.0
 Identities = 16/41 (39%), Positives = 22/41 (53%), Gaps = 3/41 (7%)

Query: 247 RREEDEEKTETRKKDEDV---LGKEKENKKKKKKKKKKEKK 284
             +   EK E +K  E++   L +E   K KKKK+KKKE  
Sbjct: 398 SAKTALEKAEGKKAIEEIREELIEEGLLKSKKKKRKKKEWF 438


>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex.  This
           entry is characterized by proteins with alternating
           conserved and low-complexity regions. Bud13 together
           with Snu17p and a newly identified factor,
           Pml1p/Ylr016c, form a novel trimeric complex. called The
           RES complex, pre-mRNA retention and splicing complex.
           Subunits of this complex are not essential for viability
           of yeasts but they are required for efficient splicing
           in vitro and in vivo. Furthermore, inactivation of this
           complex causes pre-mRNA leakage from the nucleus. Bud13
           contains a unique, phylogenetically conserved C-terminal
           region of unknown function.
          Length = 141

 Score = 26.9 bits (60), Expect = 8.2
 Identities = 14/46 (30%), Positives = 27/46 (58%), Gaps = 2/46 (4%)

Query: 242 DLEWIRREEDEEKTETRKKDE--DVLGKEKENKKKKKKKKKKEKKN 285
           D+E  R E++ EK E  +K+E     GK    K++++K+ ++ +K 
Sbjct: 13  DIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKA 58


>gnl|CDD|221250 pfam11831, Myb_Cef, pre-mRNA splicing factor component.  This
           family is a region of the Myb-Related Cdc5p/Cef1
           proteins, in fungi, and is part of the pre-mRNA splicing
           factor complex.
          Length = 363

 Score = 27.7 bits (62), Expect = 8.2
 Identities = 14/52 (26%), Positives = 26/52 (50%), Gaps = 4/52 (7%)

Query: 236 EREVELDLEWIRREEDEEKTETRKKD-EDVLGKEKENKKKKKKKKKKEKKNR 286
           + E EL+L     EE EE  E  ++D  D    +   +  ++ K+++E + R
Sbjct: 142 KNEFELELPEEEEEEPEEMEEELEEDAADR---DARKRAAEEAKEQEELRRR 190


>gnl|CDD|236277 PRK08493, PRK08493, NADH dehydrogenase subunit G; Validated.
          Length = 819

 Score = 27.7 bits (62), Expect = 8.3
 Identities = 14/65 (21%), Positives = 27/65 (41%), Gaps = 2/65 (3%)

Query: 218 LLQFRNGETSAPTHIVWGEREVELDLEWIRREEDEEKTETRKKDEDVLGKEKENKKKKKK 277
           LL+    E +    +   E +  +  E      +E + +  ++ E     ++E KK+  K
Sbjct: 438 LLKKFLEEEAILKSLE--EFKQSIVKEAALSILEEIREKVLEQAEQGCENQEEVKKEVPK 495

Query: 278 KKKKE 282
           K KK 
Sbjct: 496 KVKKI 500


>gnl|CDD|215521 PLN02967, PLN02967, kinase.
          Length = 581

 Score = 27.7 bits (61), Expect = 8.4
 Identities = 12/41 (29%), Positives = 21/41 (51%), Gaps = 4/41 (9%)

Query: 249 EEDEEKTETRKK----DEDVLGKEKENKKKKKKKKKKEKKN 285
           E  +    TR+K      DV  ++ E K +K++K KK  ++
Sbjct: 109 ESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKMDED 149


>gnl|CDD|218517 pfam05236, TAF4, Transcription initiation factor TFIID component
           TAF4 family.  This region of similarity is found in
           Transcription initiation factor TFIID component TAF4.
          Length = 255

 Score = 27.4 bits (61), Expect = 8.6
 Identities = 22/75 (29%), Positives = 38/75 (50%), Gaps = 7/75 (9%)

Query: 212 AIELSKLLQFRNGETSAPTHIVWGEREVEL-DLEWIRREEDEEKTETRKKDEDVLGKEKE 270
            I LS+    R+G  S P + +  +   +L  L   ++EE+E + E R+     LG E  
Sbjct: 88  LIVLSR--HRRDGIKSDPNYEIRSDVRRQLRFLAQKQKEEEERRVERRR----ELGLEDP 141

Query: 271 NKKKKKKKKKKEKKN 285
            + + K+K K+E+K 
Sbjct: 142 EQLRLKQKAKEEQKA 156


>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413).  This is
           a family of proteins conserved in fungi. The function is
           not known.
          Length = 436

 Score = 27.4 bits (61), Expect = 8.6
 Identities = 11/37 (29%), Positives = 20/37 (54%), Gaps = 2/37 (5%)

Query: 252 EEKTETRKKDEDVLG--KEKENKKKKKKKKKKEKKNR 286
           +   ++ + DED+L    E E  +K K  KK ++ +R
Sbjct: 22  DASKDSTEDDEDILEFLDELEQSEKAKPPKKPKEASR 58


>gnl|CDD|235366 PRK05218, PRK05218, heat shock protein 90; Provisional.
          Length = 613

 Score = 27.8 bits (63), Expect = 8.8
 Identities = 14/46 (30%), Positives = 22/46 (47%)

Query: 239 VELDLEWIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
            +LDL     EE EEK E  ++ + +L + KE    K K  +   +
Sbjct: 472 GDLDLGKEDEEEKEEKEEAEEEFKPLLERLKEALGDKVKDVRLSHR 517


>gnl|CDD|204061 pfam08779, SARS_X4, SARS coronavirus X4 like.  The structure of the
           coronavirus X4 protein (also known as 7a and U122) shows
           similarities to the immunoglobulin like fold and
           suggests a binding activity to integrin I domains. In
           SARS-CoV- infected cells, the X4 protein is expressed
           and retained intra-cellularly within the Golgi network.
           X4 has been implicated to function during the
           replication cycle of SARS-CoV.
          Length = 83

 Score = 25.9 bits (56), Expect = 8.9
 Identities = 10/31 (32%), Positives = 12/31 (38%)

Query: 125 TGLYEDEFPFISPCGRERNFIRCDDTPVVYT 155
           +G YE   PF  PC      I C  T   + 
Sbjct: 22  SGTYEGNSPFAIPCADNAFAITCTSTHFAFA 52


>gnl|CDD|233042 TIGR00598, rad14, DNA repair protein.  All proteins in this family
           for which functions are known are used for the
           recognition of DNA damage as part of nucleotide excision
           repair. This family is based on the phylogenomic
           analysis of JA Eisen (1999, Ph.D. Thesis, Stanford
           University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 172

 Score = 27.1 bits (60), Expect = 8.9
 Identities = 15/52 (28%), Positives = 27/52 (51%), Gaps = 1/52 (1%)

Query: 233 VWGEREVELDLEWIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
           VWG  E  LD E  RREE +E+ + +K ++ +    +  +  +   KK+ + 
Sbjct: 89  VWGSEE-ALDEEKERREESKEEMKEKKFEKKLKELRRAVRSSEYTNKKEGRV 139


>gnl|CDD|153331 cd07647, F-BAR_PSTPIP, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Proline-Serine-Threonine
           Phosphatase-Interacting Proteins.  F-BAR domains are
           dimerization modules that bind and bend membranes and
           are found in proteins involved in membrane dynamics and
           actin reorganization. Vetebrates contain two
           Proline-Serine-Threonine Phosphatase-Interacting
           Proteins (PSTPIPs), PSTPIP1 and PSTPIP2. PSTPIPs are
           mainly expressed in hematopoietic cells and are involved
           in the regulation of cell adhesion and motility.
           Mutations in PSTPIPs have been shown to cause
           autoinflammatory disorders. PSTPIP1 contains an
           N-terminal F-BAR domain, PEST motifs, and a C-terminal
           SH3 domain, while PSTPIP2 contains only the N-terminal
           F-BAR domain. F-BAR domains form banana-shaped dimers
           with a positively-charged concave surface that binds to
           negatively-charged lipid membranes. They can induce
           membrane deformation in the form of long tubules.
          Length = 239

 Score = 27.1 bits (60), Expect = 9.3
 Identities = 16/38 (42%), Positives = 25/38 (65%)

Query: 248 REEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKN 285
            E  E++ E RKK ED++ + ++NKK+  KK  K KK+
Sbjct: 91  EEFREKQKEERKKTEDIMKRSQKNKKELYKKTMKAKKS 128


>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
           This is a family of fungal proteins whose function is
           unknown.
          Length = 130

 Score = 26.5 bits (59), Expect = 9.5
 Identities = 12/41 (29%), Positives = 21/41 (51%), Gaps = 6/41 (14%)

Query: 246 IRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNR 286
           I      E+T+ R +      KEK  +K ++KK K+ +K +
Sbjct: 94  IALRLRRERTKERAE------KEKRTRKNREKKFKRRQKEK 128


>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
          Length = 1094

 Score = 27.5 bits (61), Expect = 10.0
 Identities = 12/18 (66%), Positives = 16/18 (88%)

Query: 269 KENKKKKKKKKKKEKKNR 286
           ++ KKKKK K+KK+KKNR
Sbjct: 684 RKKKKKKKSKEKKKKKNR 701


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.320    0.138    0.419 

Gapped
Lambda     K      H
   0.267   0.0694    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 15,162,478
Number of extensions: 1492431
Number of successful extensions: 5599
Number of sequences better than 10.0: 1
Number of HSP's gapped: 4563
Number of HSP's successfully gapped: 432
Length of query: 287
Length of database: 10,937,602
Length adjustment: 96
Effective length of query: 191
Effective length of database: 6,679,618
Effective search space: 1275807038
Effective search space used: 1275807038
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 58 (26.2 bits)