RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy12225
         (396 letters)



>gnl|CDD|227509 COG5182, CUS1, Splicing factor 3b, subunit 2 [RNA processing and
           modification].
          Length = 429

 Score =  158 bits (401), Expect = 3e-44
 Identities = 90/289 (31%), Positives = 130/289 (44%), Gaps = 76/289 (26%)

Query: 2   EMRASLQDKEEQKTLKSKMRERVRPKLGKIDIDYQKLHDAFFKWQTKPRLSIHGDLYYEG 61
           +MR  ++++E   +L+ ++RERV+PK+G +D+DY+KLHDAFF    KP LS  G+ Y E 
Sbjct: 209 QMRRMMREREAGMSLRERIRERVQPKMGGLDVDYRKLHDAFFSLGPKPYLSKFGEFYEEV 268

Query: 62  KEFETKLKEKKPGDLSAELRTALGMPVGPTAHKTFSRETVIPPPWLIAMQRYGPPPSYPN 121
                 +K+K+PG +SAELR ALG+  G             PPPWL  MQ++G PPSYP+
Sbjct: 269 DNDYRFVKKKRPGAISAELREALGIDSGT------------PPPWLFNMQKHGMPPSYPD 316

Query: 122 LKIPGLN-APIP-QGCSFGYHAGGWGKPPVDETGKPLYGDVFGTGGWGKPPVDETGKPLY 179
           LKIPGLN APIP +G  +GY   GW +P      +                         
Sbjct: 317 LKIPGLNWAPIPLEGDVYGYQPPGWHEP----LFEV------------------------ 348

Query: 180 GDVFGTSAETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEE--EEGEDESGLITP 237
                                +TA GEL     EE   D+E+      E  + E  +   
Sbjct: 349 -------------------GPETAEGELLQFEAEEIYNDEEQYTRPVYENEDFEERVDDV 389

Query: 238 AEGLMTPSGITSVPAGLETPDSIELRKKKIESDMETNEAP---QLFQVL 283
           + G                P ++E+  K+     +  +A     L+ VL
Sbjct: 390 SNG----------GLAEFAPAAVEVAPKEDAKVTKRGKAKREKLLYSVL 428


>gnl|CDD|112836 pfam04037, DUF382, Domain of unknown function (DUF382).  This
           domain is specific to the human splicing factor 3b
           subunit 2 and it's orthologues. Splicing factor 3b
           subunit 2 or SAP145 is a suppressor of U2 snRNA
           mutations. Pre-mRNA splicing is catalyzed by a large
           ribonucleoprotein complex called the spliceosome.
           Spliceosomes are multi-component enzymes that catalyze
           pre-mRNA splicing and form step-wise by the ordered
           interaction of UsnRNPs and non-snRNP proteins with short
           conserved regions of the pre-mRNA at the 5' and 3'
           splice sites and branch site.
          Length = 129

 Score =  124 bits (314), Expect = 8e-35
 Identities = 46/64 (71%), Positives = 56/64 (87%)

Query: 1   MEMRASLQDKEEQKTLKSKMRERVRPKLGKIDIDYQKLHDAFFKWQTKPRLSIHGDLYYE 60
            EMR +L +KE +KTLK K RERV+PK+GK+DIDYQKLHDAFFK+QTKP L+ +GD+YYE
Sbjct: 66  AEMRDALLEKEAEKTLKQKQRERVQPKMGKLDIDYQKLHDAFFKYQTKPDLTKYGDVYYE 125

Query: 61  GKEF 64
           GKEF
Sbjct: 126 GKEF 129


>gnl|CDD|202864 pfam04046, PSP, PSP.  Proline rich domain found in numerous
           spliceosome associated proteins.
          Length = 48

 Score = 84.1 bits (209), Expect = 9e-21
 Identities = 37/60 (61%), Positives = 39/60 (65%), Gaps = 12/60 (20%)

Query: 72  KPGDLSAELRTALGMPVGPTAHKTFSRETVIPPPWLIAMQRYGPPPSYPNLKIPGLNAPI 131
           KPG LS ELR ALGMP G             PPPWL  M+R G PPSYP+LKIPGLNAPI
Sbjct: 1   KPGVLSDELREALGMPEGD------------PPPWLYRMRRLGYPPSYPDLKIPGLNAPI 48


>gnl|CDD|128850 smart00581, PSP, proline-rich domain in spliceosome associated
           proteins. 
          Length = 54

 Score = 82.8 bits (205), Expect = 4e-20
 Identities = 36/65 (55%), Positives = 39/65 (60%), Gaps = 12/65 (18%)

Query: 68  LKEKKPGDLSAELRTALGMPVGPTAHKTFSRETVIPPPWLIAMQRYGPPPSYPNLKIPGL 127
            K  KPG +S ELR ALG+P G             PPPWL  M+R G PP YP LKIPGL
Sbjct: 1   FKHFKPGRISDELREALGLPPG------------QPPPWLYRMRRLGYPPGYPRLKIPGL 48

Query: 128 NAPIP 132
           NAPIP
Sbjct: 49  NAPIP 53


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 44.6 bits (105), Expect = 8e-05
 Identities = 33/106 (31%), Positives = 43/106 (40%), Gaps = 14/106 (13%)

Query: 140 HAGGWGKPPVDETGKPLYGDVFGTGGWGKPPVDETGKPLYGDVFGTSAETKTAEVNEEDI 199
            AG  G+   DE G     +  G    G+    E       D      ET   E+N E+ 
Sbjct: 784 QAGEDGEMKGDE-GAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQ 842

Query: 200 EKTA-------------WGELESESEEESEEDDEEEEEEEEGEDES 232
            +                G+ E E EEE EE++EEEEEEEE E+E 
Sbjct: 843 GEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEE 888



 Score = 34.6 bits (79), Expect = 0.11
 Identities = 34/105 (32%), Positives = 44/105 (41%), Gaps = 9/105 (8%)

Query: 163 TGGWGKPPVDETGKPLYGDVFGTSA------ETKTAEVNEEDIEKTAWGELESESEEESE 216
           TG  G+ P +  G+   G+  G  A      ETK    +E +I     GE E E E E++
Sbjct: 645 TGEEGERPTEAEGEN--GEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAK 702

Query: 217 EDDEEEEEEEEGEDESGLITPAEGLMTPSGITSVPAGLETPDSIE 261
           E D + E E E   E    T AEG      I +   G E  D  E
Sbjct: 703 EADHKGETEAEEV-EHEGETEAEGTEDEGEIETGEEGEEVEDEGE 746



 Score = 30.3 bits (68), Expect = 2.1
 Identities = 15/31 (48%), Positives = 20/31 (64%)

Query: 195 NEEDIEKTAWGELESESEEESEEDDEEEEEE 225
           + E+ E+    E E E EEE EE++EEE EE
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892


>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein.  CDC45 is an essential gene
           required for initiation of DNA replication in S.
           cerevisiae, forming a complex with MCM5/CDC46.
           Homologues of CDC45 have been identified in human, mouse
           and smut fungus among others.
          Length = 583

 Score = 40.0 bits (94), Expect = 0.002
 Identities = 12/41 (29%), Positives = 24/41 (58%)

Query: 193 EVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESG 233
           E+ +E     A+ +LE + +++ E D+E+EE  +  +DE  
Sbjct: 109 ELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDD 149


>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
           Provisional.
          Length = 2849

 Score = 38.1 bits (88), Expect = 0.010
 Identities = 15/38 (39%), Positives = 26/38 (68%)

Query: 194 VNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDE 231
           + EED+       ++ + E+E E+DD+EE++EEE E+E
Sbjct: 140 LAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEE 177



 Score = 30.8 bits (69), Expect = 1.5
 Identities = 12/22 (54%), Positives = 18/22 (81%)

Query: 206 ELESESEEESEEDDEEEEEEEE 227
           + + + +++ EEDDEEEEEEEE
Sbjct: 157 DEDEDEDDDDEEDDEEEEEEEE 178



 Score = 30.0 bits (67), Expect = 3.2
 Identities = 16/33 (48%), Positives = 21/33 (63%), Gaps = 5/33 (15%)

Query: 206 ELESESEEESEEDDEEEEEEEE-----GEDESG 233
           E E E +++ E+D+EEEEEEEE      EDE  
Sbjct: 158 EDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEED 190



 Score = 29.6 bits (66), Expect = 3.5
 Identities = 18/53 (33%), Positives = 28/53 (52%), Gaps = 9/53 (16%)

Query: 181 DVFGTSAETKTAEVNEEDIEKTAWGELESESEEESEED---DEEEEEEEEGED 230
           D F    + +  + +++D       E + E EEE EE    D+E+EE+E GED
Sbjct: 149 DNFVIDDDDEDEDEDDDD------EEDDEEEEEEEEEIKGFDDEDEEDEGGED 195



 Score = 29.2 bits (65), Expect = 4.8
 Identities = 14/34 (41%), Positives = 22/34 (64%), Gaps = 5/34 (14%)

Query: 205 GELESESEEESEEDDEEEEE-----EEEGEDESG 233
            + + + EE+ EE++EEEEE     +E+ EDE G
Sbjct: 160 EDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGG 193


>gnl|CDD|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region.  The domain
           is found in the primary vegetative sigma factor. The
           function of this domain is unclear and can be removed
           without loss of function.
          Length = 211

 Score = 36.0 bits (84), Expect = 0.018
 Identities = 11/44 (25%), Positives = 20/44 (45%)

Query: 196 EEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLITPAE 239
                  A      E + E ++DD+E+E+E++ E+      P E
Sbjct: 36  AAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEADLGPDPEE 79



 Score = 33.7 bits (78), Expect = 0.095
 Identities = 15/44 (34%), Positives = 26/44 (59%), Gaps = 5/44 (11%)

Query: 187 AETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGED 230
           A T  A  +E D E     +LE + +++ +ED+++EEE + G D
Sbjct: 38  AATAAAIESELDEE-----DLEDDDDDDEDEDEDDEEEADLGPD 76



 Score = 33.3 bits (77), Expect = 0.12
 Identities = 12/39 (30%), Positives = 21/39 (53%)

Query: 195 NEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESG 233
           N      TA        EE+ E+DD+++E+E+E ++E  
Sbjct: 33  NAAAAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEA 71



 Score = 30.6 bits (70), Expect = 0.95
 Identities = 6/37 (16%), Positives = 21/37 (56%)

Query: 196 EEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDES 232
           + +    A      ESE + E+ +++++++E+ +++ 
Sbjct: 31  DPNAAAAAATAAAIESELDEEDLEDDDDDDEDEDEDD 67



 Score = 30.6 bits (70), Expect = 1.0
 Identities = 9/37 (24%), Positives = 20/37 (54%)

Query: 191 TAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEE 227
            A        ++   E + E +++ +ED++E++EEE 
Sbjct: 35  AAAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEA 71



 Score = 29.8 bits (68), Expect = 1.8
 Identities = 7/42 (16%), Positives = 20/42 (47%)

Query: 191 TAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDES 232
           +  ++       A         +E + +D+++++E+E ED+ 
Sbjct: 27  SGFIDPNAAAAAATAAAIESELDEEDLEDDDDDDEDEDEDDE 68



 Score = 28.3 bits (64), Expect = 5.7
 Identities = 6/35 (17%), Positives = 19/35 (54%)

Query: 206 ELESESEEESEEDDEEEEEEEEGEDESGLITPAEG 240
              +  E E +E+D E++++++ +++      A+ 
Sbjct: 39  ATAAAIESELDEEDLEDDDDDDEDEDEDDEEEADL 73


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
           Rpc31.  RNA polymerase III contains seventeen subunits
           in yeasts and in human cells. Twelve of these are akin
           to RNA polymerase I or II and the other five are RNA pol
           III-specific, and form the functionally distinct groups
           (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
           Rpc34 and Rpc82 form a cluster of enzyme-specific
           subunits that contribute to transcription initiation in
           S.cerevisiae and H.sapiens. There is evidence that these
           subunits are anchored at or near the N-terminal Zn-fold
           of Rpc1, itself prolonged by a highly conserved but RNA
           polymerase III-specific domain.
          Length = 221

 Score = 35.9 bits (83), Expect = 0.019
 Identities = 13/43 (30%), Positives = 28/43 (65%)

Query: 188 ETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGED 230
           E K  E+  ED+++    + E E EEE E++D +++++++ +D
Sbjct: 157 EKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDDDDD 199



 Score = 34.0 bits (78), Expect = 0.098
 Identities = 17/63 (26%), Positives = 29/63 (46%)

Query: 170 PVDETGKPLYGDVFGTSAETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGE 229
              + GK L    F       T E  + D + +   +   E E E  ++++E++EEEE E
Sbjct: 122 INKKAGKKLALSKFKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEE 181

Query: 230 DES 232
           +E 
Sbjct: 182 EEE 184



 Score = 30.1 bits (68), Expect = 1.8
 Identities = 12/40 (30%), Positives = 22/40 (55%)

Query: 187 AETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEE 226
            + K  E  + D E     E E E EEE E+ D+++++++
Sbjct: 158 KKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDDD 197


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 36.9 bits (86), Expect = 0.020
 Identities = 16/66 (24%), Positives = 31/66 (46%)

Query: 196 EEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLITPAEGLMTPSGITSVPAGLE 255
           ++D +     E    S+EE +E+DE+ ++E++ E+E       +     S  + +P    
Sbjct: 337 DDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSELPFTFP 396

Query: 256 TPDSIE 261
            P S E
Sbjct: 397 CPKSHE 402



 Score = 30.7 bits (70), Expect = 1.7
 Identities = 11/37 (29%), Positives = 19/37 (51%)

Query: 195 NEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDE 231
            E D +         E EEE E+  ++E+EE++ +D 
Sbjct: 306 FEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDDDDL 342


>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
           subunit 1; Provisional.
          Length = 319

 Score = 36.2 bits (84), Expect = 0.025
 Identities = 18/42 (42%), Positives = 26/42 (61%)

Query: 192 AEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESG 233
            E  EE +EK    E E +  E  +ED+E+E+EEEE +D+ G
Sbjct: 276 EEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEG 317



 Score = 33.9 bits (78), Expect = 0.12
 Identities = 12/37 (32%), Positives = 22/37 (59%)

Query: 195 NEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDE 231
            +E+  +    + E E EE+   + E+E+EE+E E+E
Sbjct: 274 GDEEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEE 310



 Score = 29.6 bits (67), Expect = 3.1
 Identities = 11/34 (32%), Positives = 17/34 (50%)

Query: 206 ELESESEEESEEDDEEEEEEEEGEDESGLITPAE 239
           E   E  EE EE+D+  E E+E E++       +
Sbjct: 280 EELLEKAEEEEEEDDYSESEDEDEEDEDEEEEED 313



 Score = 28.9 bits (65), Expect = 4.6
 Identities = 11/38 (28%), Positives = 21/38 (55%)

Query: 194 VNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDE 231
           V  ++ +     E   E EEE +  + E+E+EE+ ++E
Sbjct: 272 VGGDEEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEE 309



 Score = 28.1 bits (63), Expect = 8.0
 Identities = 12/29 (41%), Positives = 17/29 (58%)

Query: 205 GELESESEEESEEDDEEEEEEEEGEDESG 233
           G  E + EE  E+ +EEEEE++  E E  
Sbjct: 273 GGDEEDLEELLEKAEEEEEEDDYSESEDE 301


>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin.  Nucleoplasmins are also
           known as chromatin decondensation proteins. They bind to
           core histones and transfer DNA to them in a reaction
           that requires ATP. This is thought to play a role in the
           assembly of regular nucleosomal arrays.
          Length = 146

 Score = 34.6 bits (80), Expect = 0.029
 Identities = 13/29 (44%), Positives = 22/29 (75%)

Query: 203 AWGELESESEEESEEDDEEEEEEEEGEDE 231
           A  E ES+ +EE EE++++EE+++E E E
Sbjct: 108 ASEEDESDDDEEDEEEEDDEEDDDEDESE 136



 Score = 34.6 bits (80), Expect = 0.030
 Identities = 13/28 (46%), Positives = 21/28 (75%)

Query: 205 GELESESEEESEEDDEEEEEEEEGEDES 232
            E + + E+E EEDDEE+++E+E E+E 
Sbjct: 112 DESDDDEEDEEEEDDEEDDDEDESEEEE 139



 Score = 33.5 bits (77), Expect = 0.076
 Identities = 12/28 (42%), Positives = 22/28 (78%)

Query: 205 GELESESEEESEEDDEEEEEEEEGEDES 232
              E ES+++ E+++EE++EE++ EDES
Sbjct: 108 ASEEDESDDDEEDEEEEDDEEDDDEDES 135



 Score = 33.5 bits (77), Expect = 0.079
 Identities = 11/23 (47%), Positives = 19/23 (82%)

Query: 210 ESEEESEEDDEEEEEEEEGEDES 232
           E +EE E+D+E+++E+E  E+ES
Sbjct: 118 EEDEEEEDDEEDDDEDESEEEES 140



 Score = 30.0 bits (68), Expect = 1.2
 Identities = 12/27 (44%), Positives = 21/27 (77%)

Query: 206 ELESESEEESEEDDEEEEEEEEGEDES 232
            L +  E+ES++D+E+EEEE++ ED+ 
Sbjct: 105 HLVASEEDESDDDEEDEEEEDDEEDDD 131



 Score = 29.6 bits (67), Expect = 1.6
 Identities = 13/34 (38%), Positives = 20/34 (58%)

Query: 193 EVNEEDIEKTAWGELESESEEESEEDDEEEEEEE 226
            V  E+ E     E E E ++E ++D++E EEEE
Sbjct: 106 LVASEEDESDDDEEDEEEEDDEEDDDEDESEEEE 139


>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi.  This family includes
           the fifth essential DNA polymerase in yeast EC:2.7.7.7.
           Pol5p is localised exclusively to the nucleolus and
           binds near or at the enhancer region of rRNA-encoding
           DNA repeating units.
          Length = 784

 Score = 35.2 bits (81), Expect = 0.056
 Identities = 20/71 (28%), Positives = 35/71 (49%), Gaps = 2/71 (2%)

Query: 172 DETGKPLYGDVFGTSAETKTAEVNEEDIEKTAWGELESESEEESE-EDDEEEEEEEEGED 230
           DE          G   +    E  ++D ++    E +SESE ES+ ED EE+E+E++ E 
Sbjct: 635 DENKSRHQQLFEGEEEDEDDLEETDDDEDECEAIE-DSESESESDGEDGEEDEQEDDAEA 693

Query: 231 ESGLITPAEGL 241
             G++   + +
Sbjct: 694 NEGVVPIDKAV 704



 Score = 29.1 bits (65), Expect = 5.7
 Identities = 16/42 (38%), Positives = 24/42 (57%), Gaps = 1/42 (2%)

Query: 192 AEVNEEDIEKTAWGELESESEEESEEDDEEEEEE-EEGEDES 232
           A+ N+   ++   GE E E + E  +DDE+E E  E+ E ES
Sbjct: 634 ADENKSRHQQLFEGEEEDEDDLEETDDDEDECEAIEDSESES 675


>gnl|CDD|237063 PRK12329, nusA, transcription elongation factor NusA; Provisional.
          Length = 449

 Score = 34.8 bits (80), Expect = 0.073
 Identities = 16/45 (35%), Positives = 22/45 (48%)

Query: 187 AETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDE 231
           AE  +    EE +++ A   LE+E  E +EED    E     EDE
Sbjct: 386 AELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDE 430



 Score = 32.1 bits (73), Expect = 0.59
 Identities = 17/44 (38%), Positives = 22/44 (50%)

Query: 188 ETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDE 231
           E   AE  E   E     EL    E+E E++DE EE + E E+E
Sbjct: 404 ERLEAEQAERAEEDARLRELYPLPEDEFEDEDELEEAQPEEEEE 447



 Score = 28.6 bits (64), Expect = 7.2
 Identities = 14/42 (33%), Positives = 20/42 (47%)

Query: 188 ETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGE 229
           E + AE  EED        L  +  E+ +E +E + EEEE  
Sbjct: 407 EAEQAERAEEDARLRELYPLPEDEFEDEDELEEAQPEEEEEA 448


>gnl|CDD|215971 pfam00524, PPV_E1_N, E1 Protein, N terminal domain. 
          Length = 131

 Score = 33.0 bits (76), Expect = 0.079
 Identities = 10/41 (24%), Positives = 20/41 (48%)

Query: 195 NEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLI 235
           + E      W  +E+   +++  +DE+E+  + G D S  I
Sbjct: 8   DGEGGGCGGWFFVEAICSDDNISEDEDEDLFDTGSDLSDFI 48


>gnl|CDD|111875 pfam03032, Brevenin, Brevenin/esculentin/gaegurin/rugosin family.
           This family contains a number of defence peptides
           secreted from the skin of amphibians, including the
           opiate-like dermorphins and deltorphins, and the
           antimicrobial dermoseptins and temporins. The alignment
           for this family includes the signal peptide.
          Length = 46

 Score = 31.2 bits (71), Expect = 0.080
 Identities = 14/21 (66%), Positives = 18/21 (85%)

Query: 212 EEESEEDDEEEEEEEEGEDES 232
           EEE  ED+EE E+EEEGE++S
Sbjct: 22  EEEKREDEEENEDEEEGEEQS 42


>gnl|CDD|221333 pfam11942, Spt5_N, Spt5 transcription elongation factor, acidic
           N-terminal.  This is the very acidic N-terminal region
           of the early transcription elongation factor Spt5. The
           Spt5-Spt4 complex regulates early transcription
           elongation by RNA polymerase II and has an imputed role
           in pre-mRNA processing via its physical association with
           mRNA capping enzymes. The actual function of this
           N-terminal domain is not known although it is
           dispensable for binding to Spt4.
          Length = 92

 Score = 32.4 bits (74), Expect = 0.085
 Identities = 15/31 (48%), Positives = 19/31 (61%)

Query: 212 EEESEEDDEEEEEEEEGEDESGLITPAEGLM 242
           + E+E DDEEEEEEEE +D   L    E + 
Sbjct: 3   DTEAEVDDEEEEEEEEEDDLEDLSDEDEFID 33



 Score = 28.6 bits (64), Expect = 1.9
 Identities = 14/32 (43%), Positives = 22/32 (68%)

Query: 208 ESESEEESEEDDEEEEEEEEGEDESGLITPAE 239
           E+E ++E EE++EEE++ E+  DE   I  AE
Sbjct: 5   EAEVDDEEEEEEEEEDDLEDLSDEDEFIDEAE 36


>gnl|CDD|117592 pfam09026, Cenp-B_dimeris, Centromere protein B dimerisation
           domain.  The centromere protein B (CENP-B) dimerisation
           domain is composed of two alpha-helices, which are
           folded into an antiparallel configuration. Dimerisation
           of CENP-B is mediated by this domain, in which monomers
           dimerise to form a symmetrical, antiparallel, four-helix
           bundle structure with a large hydrophobic patch in which
           23 residues of one monomer form van der Waals contacts
           with the other monomer. This CENP-B dimer configuration
           may be suitable for capturing two distant CENP-B boxes
           during centromeric heterochromatin formation.
          Length = 101

 Score = 32.4 bits (73), Expect = 0.093
 Identities = 13/27 (48%), Positives = 23/27 (85%)

Query: 205 GELESESEEESEEDDEEEEEEEEGEDE 231
           GE +S+S+ + EEDD++E+EE++ ED+
Sbjct: 8   GEEDSDSDSDEEEDDDDEDEEDDDEDD 34



 Score = 32.0 bits (72), Expect = 0.12
 Identities = 13/30 (43%), Positives = 23/30 (76%)

Query: 208 ESESEEESEEDDEEEEEEEEGEDESGLITP 237
           +S+S+EE ++DDE+EE+++E +DE     P
Sbjct: 13  DSDSDEEEDDDDEDEEDDDEDDDEDDDEVP 42



 Score = 30.9 bits (69), Expect = 0.27
 Identities = 11/35 (31%), Positives = 26/35 (74%)

Query: 206 ELESESEEESEEDDEEEEEEEEGEDESGLITPAEG 240
           + +S+ EE+ +++DEE+++E++ ED+  +  P+ G
Sbjct: 13  DSDSDEEEDDDDEDEEDDDEDDDEDDDEVPVPSFG 47



 Score = 29.4 bits (65), Expect = 1.1
 Identities = 10/26 (38%), Positives = 22/26 (84%)

Query: 206 ELESESEEESEEDDEEEEEEEEGEDE 231
           E E +S+ +S+E++++++E+EE +DE
Sbjct: 7   EGEEDSDSDSDEEEDDDDEDEEDDDE 32



 Score = 27.4 bits (60), Expect = 5.6
 Identities = 10/25 (40%), Positives = 20/25 (80%)

Query: 207 LESESEEESEEDDEEEEEEEEGEDE 231
           LE E + +S+ D+EE++++E+ ED+
Sbjct: 6   LEGEEDSDSDSDEEEDDDDEDEEDD 30



 Score = 27.4 bits (60), Expect = 5.9
 Identities = 9/32 (28%), Positives = 23/32 (71%)

Query: 196 EEDIEKTAWGELESESEEESEEDDEEEEEEEE 227
           EED +  +  E + + E+E ++D++++E+++E
Sbjct: 9   EEDSDSDSDEEEDDDDEDEEDDDEDDDEDDDE 40


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 34.4 bits (79), Expect = 0.096
 Identities = 15/42 (35%), Positives = 23/42 (54%)

Query: 189 TKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGED 230
            K  E    D E+    E + E EE++ + +EE +EEEE E+
Sbjct: 28  EKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEE 69



 Score = 28.6 bits (64), Expect = 7.8
 Identities = 13/29 (44%), Positives = 21/29 (72%)

Query: 204 WGELESESEEESEEDDEEEEEEEEGEDES 232
           W E E E E   EE++EE+EE++E E+++
Sbjct: 26  WVEKEVEKEVPDEEEEEEKEEKKEEEEKT 54



 Score = 28.2 bits (63), Expect = 8.8
 Identities = 14/28 (50%), Positives = 16/28 (57%)

Query: 206 ELESESEEESEEDDEEEEEEEEGEDESG 233
            L  E E E E  DEEEEEE+E + E  
Sbjct: 24  YLWVEKEVEKEVPDEEEEEEKEEKKEEE 51


>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
          Length = 330

 Score = 34.1 bits (79), Expect = 0.10
 Identities = 15/34 (44%), Positives = 19/34 (55%), Gaps = 4/34 (11%)

Query: 208 ESESEEESEEDDEEEEEEEEGEDESGLITPAEGL 241
              +  E EE++EEEEEEEE  +E      A GL
Sbjct: 296 AQAAAAEEEEEEEEEEEEEEPSEEEA----AAGL 325


>gnl|CDD|221490 pfam12253, CAF1A, Chromatin assembly factor 1 subunit A.  The CAF-1
           or chromatin assembly factor-1 consists of three
           subunits, and this is the first, or A. The A domain is
           uniquely required for the progression of S phase in
           mouse cells, independent of its ability to promote
           histone deposition but dependent on its ability to
           interact with HP1 - heterochromatin protein 1-rich
           heterochromatin domains next to centromeres that are
           crucial for chromosome segregation during mitosis. This
           HP1-CAF-1 interaction module functions as a built-in
           replication control for heterochromatin, which, like a
           control barrier, has an impact on S-phase progression in
           addition to DNA-based checkpoints.
          Length = 76

 Score = 31.0 bits (71), Expect = 0.13
 Identities = 13/28 (46%), Positives = 20/28 (71%), Gaps = 2/28 (7%)

Query: 206 ELESESEEESE--EDDEEEEEEEEGEDE 231
           + E E EEE E  E ++EE+EEE+ +D+
Sbjct: 45  DAEWEEEEEGEDLESEDEEDEEEDDDDD 72



 Score = 27.6 bits (62), Expect = 3.1
 Identities = 11/28 (39%), Positives = 18/28 (64%), Gaps = 6/28 (21%)

Query: 211 SEEESEEDDE------EEEEEEEGEDES 232
           S+ E EE++E      E+EE+EE +D+ 
Sbjct: 44  SDAEWEEEEEGEDLESEDEEDEEEDDDD 71


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
           Fas-binding protein) is thought to play a role in
           apoptosis, but precise role played by Daxx remains to be
           determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 34.1 bits (78), Expect = 0.15
 Identities = 18/78 (23%), Positives = 32/78 (41%)

Query: 167 GKPPVDETGKPLYGDVFGTSAETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEE 226
             P    +       +    +E + +   EE+ E+    E +   EEE E+++EEEE E 
Sbjct: 419 SDPSKASSTSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEA 478

Query: 227 EGEDESGLITPAEGLMTP 244
           +   E  +   +EG    
Sbjct: 479 DNGSEEEMEGSSEGDGDG 496



 Score = 29.9 bits (67), Expect = 2.7
 Identities = 18/50 (36%), Positives = 26/50 (52%)

Query: 184 GTSAETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESG 233
            +     ++   E     +   E E   EEE EE++EEEEEE+E E+E G
Sbjct: 418 SSDPSKASSTSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEG 467



 Score = 29.5 bits (66), Expect = 3.7
 Identities = 24/112 (21%), Positives = 33/112 (29%), Gaps = 4/112 (3%)

Query: 171 VDETGKPLYGDVFGTSAETKTAEVNEEDIEKTAWGELESESEEESEEDDEEE----EEEE 226
                 P          E+   E  EE+ E+    E E E  E+ EE++E E     EEE
Sbjct: 426 STSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEE 485

Query: 227 EGEDESGLITPAEGLMTPSGITSVPAGLETPDSIELRKKKIESDMETNEAPQ 278
                 G     E         S  AG+      +  +          E P 
Sbjct: 486 MEGSSEGDGDGEEPEEDAERRNSEMAGISRMSEGQQPRGSSVQPESPQEEPL 537


>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein.  This is a family of
           fungal and plant proteins and contains many hypothetical
           proteins. VID27 is a cytoplasmic protein that plays a
           potential role in vacuolar protein degradation.
          Length = 794

 Score = 34.0 bits (78), Expect = 0.16
 Identities = 22/49 (44%), Positives = 26/49 (53%), Gaps = 6/49 (12%)

Query: 189 TKTAEVNEEDIEKTAWGELESESEE-----ESEEDDEEEEEEEEGEDES 232
           TK  E  E+D    A+  LE E        E EED+EEEEEE+E E  S
Sbjct: 362 TKAKE-TEQDYILDAFSALEIEDANTERDDEEEEDEEEEEEEDEDEGPS 409



 Score = 32.8 bits (75), Expect = 0.32
 Identities = 21/67 (31%), Positives = 25/67 (37%), Gaps = 11/67 (16%)

Query: 166 WGKPPVDETGKPLYGDVFGTSAETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEE 225
           W K    E    L       SA        E D E       E E EEE EE+DE+E   
Sbjct: 361 WTKAKETEQDYILDA----FSALEIEDANTERDDE-------EEEDEEEEEEEDEDEGPS 409

Query: 226 EEGEDES 232
           +E  D+ 
Sbjct: 410 KEHSDDE 416



 Score = 32.8 bits (75), Expect = 0.33
 Identities = 11/28 (39%), Positives = 19/28 (67%)

Query: 206 ELESESEEESEEDDEEEEEEEEGEDESG 233
           E + E EE+ EE++EE+E+E   ++ S 
Sbjct: 387 ERDDEEEEDEEEEEEEDEDEGPSKEHSD 414


>gnl|CDD|217049 pfam02459, Adeno_terminal, Adenoviral DNA terminal protein.  This
           protein is covalently attached to the terminii of
           replicating DNA in vivo.
          Length = 548

 Score = 33.9 bits (78), Expect = 0.17
 Identities = 15/22 (68%), Positives = 16/22 (72%)

Query: 208 ESESEEESEEDDEEEEEEEEGE 229
           E E EEE E  +EEEEEEEE E
Sbjct: 306 EEEEEEEEEVPEEEEEEEEEEE 327



 Score = 32.7 bits (75), Expect = 0.37
 Identities = 16/27 (59%), Positives = 18/27 (66%)

Query: 208 ESESEEESEEDDEEEEEEEEGEDESGL 234
           E E EEE EE+  EEEEEEE E+E   
Sbjct: 304 EPEEEEEEEEEVPEEEEEEEEEEERTF 330



 Score = 32.3 bits (74), Expect = 0.42
 Identities = 15/27 (55%), Positives = 17/27 (62%)

Query: 206 ELESESEEESEEDDEEEEEEEEGEDES 232
               E EEE EE  EEEEEEEE E+ +
Sbjct: 303 PEPEEEEEEEEEVPEEEEEEEEEEERT 329



 Score = 31.5 bits (72), Expect = 0.78
 Identities = 14/26 (53%), Positives = 17/26 (65%)

Query: 206 ELESESEEESEEDDEEEEEEEEGEDE 231
           E E E E   EE++EEEEEE   E+E
Sbjct: 308 EEEEEEEVPEEEEEEEEEEERTFEEE 333



 Score = 31.5 bits (72), Expect = 0.87
 Identities = 14/25 (56%), Positives = 17/25 (68%)

Query: 208 ESESEEESEEDDEEEEEEEEGEDES 232
             E EEE EE++E  EEEEE E+E 
Sbjct: 302 PPEPEEEEEEEEEVPEEEEEEEEEE 326



 Score = 30.8 bits (70), Expect = 1.4
 Identities = 12/24 (50%), Positives = 15/24 (62%)

Query: 208 ESESEEESEEDDEEEEEEEEGEDE 231
              S  E EE++EEEEE  E E+E
Sbjct: 298 PPPSPPEPEEEEEEEEEVPEEEEE 321



 Score = 30.0 bits (68), Expect = 2.2
 Identities = 13/26 (50%), Positives = 16/26 (61%)

Query: 208 ESESEEESEEDDEEEEEEEEGEDESG 233
               EEE EE++  EEEEEE E+E  
Sbjct: 303 PEPEEEEEEEEEVPEEEEEEEEEEER 328



 Score = 30.0 bits (68), Expect = 2.5
 Identities = 13/25 (52%), Positives = 16/25 (64%)

Query: 208 ESESEEESEEDDEEEEEEEEGEDES 232
            S  E E EE++EEE  EEE E+E 
Sbjct: 300 PSPPEPEEEEEEEEEVPEEEEEEEE 324



 Score = 28.1 bits (63), Expect = 8.9
 Identities = 13/29 (44%), Positives = 16/29 (55%)

Query: 208 ESESEEESEEDDEEEEEEEEGEDESGLIT 236
                EE EE++EE  EEEE E+E    T
Sbjct: 301 SPPEPEEEEEEEEEVPEEEEEEEEEEERT 329


>gnl|CDD|197548 smart00157, PRP, Major prion protein.  The prion protein is a major
           component of scrapie-associated fibrils in
           Creutzfeldt-Jakob disease, kuru, Gerstmann-Straussler
           syndrome and bovine spongiform encephalopathy.
          Length = 218

 Score = 32.5 bits (74), Expect = 0.25
 Identities = 23/64 (35%), Positives = 26/64 (40%), Gaps = 13/64 (20%)

Query: 119 YPNLKIPGLNAPIPQGCSFGY---------HAGGWGKPPVDETGKPLYGDVFGTGG---- 165
           YP    PG N   PQG  +G          H GGWG+P     G+P  G     GG    
Sbjct: 17  YPGQGSPGGNRYPPQGGGWGQPHGGGWGQPHGGGWGQPHGGGWGQPHGGGWGQGGGTHNQ 76

Query: 166 WGKP 169
           W KP
Sbjct: 77  WNKP 80


>gnl|CDD|221620 pfam12527, DUF3727, Protein of unknown function (DUF3727).  This
           domain family is found in bacteria and eukaryotes, and
           is approximately 100 amino acids in length.
          Length = 100

 Score = 31.0 bits (71), Expect = 0.26
 Identities = 12/30 (40%), Positives = 20/30 (66%)

Query: 202 TAWGELESESEEESEEDDEEEEEEEEGEDE 231
           T  GEL    E+E  E +EE++++E+ E+E
Sbjct: 13  TVRGELPEPEEDEILELEEEDDDDEDEEEE 42


>gnl|CDD|220972 pfam11081, DUF2890, Protein of unknown function (DUF2890).  This
           family is conserved in dsDNA adenoviruses of
           vertebrates. The function is not known.
          Length = 172

 Score = 32.2 bits (73), Expect = 0.26
 Identities = 19/39 (48%), Positives = 23/39 (58%), Gaps = 2/39 (5%)

Query: 196 EEDIEKTAWGELESESEEESEED--DEEEEEEEEGEDES 232
           EED +  A    E E E E  ED  DEE+EE EE E+E+
Sbjct: 21  EEDWDSQAEEVEEDEEEMEDWEDSLDEEDEEAEEVEEET 59


>gnl|CDD|218538 pfam05285, SDA1, SDA1.  This family consists of several SDA1
           protein homologues. SDA1 is a Saccharomyces cerevisiae
           protein which is involved in the control of the actin
           cytoskeleton. The protein is essential for cell
           viability and is localised in the nucleus.
          Length = 317

 Score = 32.7 bits (75), Expect = 0.26
 Identities = 14/41 (34%), Positives = 24/41 (58%)

Query: 193 EVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESG 233
           E  +E+  K    E   ES+++ +E++E E EE+E  D+ G
Sbjct: 76  EKWKEEERKKKEAEQGLESDDDDDEEEEWEVEEDEDSDDEG 116



 Score = 32.7 bits (75), Expect = 0.28
 Identities = 13/41 (31%), Positives = 23/41 (56%)

Query: 192 AEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDES 232
            E+ E+  E+    +   +  E  ++DDEEEE E E +++S
Sbjct: 72  LELLEKWKEEERKKKEAEQGLESDDDDDEEEEWEVEEDEDS 112



 Score = 30.8 bits (70), Expect = 1.2
 Identities = 20/55 (36%), Positives = 29/55 (52%), Gaps = 2/55 (3%)

Query: 193 EVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLITPAE--GLMTPS 245
             +EE+ ++ A    E   EE SEED+EE  EEEE E E    +      ++TP+
Sbjct: 132 SEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEKASELATTRILTPA 186



 Score = 29.6 bits (67), Expect = 2.7
 Identities = 13/45 (28%), Positives = 25/45 (55%), Gaps = 3/45 (6%)

Query: 188 ETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDES 232
             +  E +++D E+  W   E E +E+S+++ E  + E + E ES
Sbjct: 88  AEQGLESDDDDDEEEEW---EVEEDEDSDDEGEWIDVESDKEIES 129


>gnl|CDD|185618 PTZ00438, PTZ00438, gamete antigen 27/25-like protein; Provisional.
          Length = 374

 Score = 33.1 bits (75), Expect = 0.27
 Identities = 14/45 (31%), Positives = 26/45 (57%)

Query: 189 TKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESG 233
           T+  E  +ED+E+    E     EEE ++D++ E+++E+  D  G
Sbjct: 105 TQKEEEEDEDVEEIEEVEEVEVVEEEYDDDEDSEKDDEKESDAEG 149


>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
           [Intracellular trafficking and secretion].
          Length = 776

 Score = 33.0 bits (75), Expect = 0.27
 Identities = 13/43 (30%), Positives = 23/43 (53%)

Query: 193 EVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLI 235
           E   +DI      E+E+ + EESE ++E E+ E+E +    + 
Sbjct: 361 EKQFDDILYFEKMEIENRNPEESEHEEEVEDYEDENDHSKRIC 403



 Score = 30.0 bits (67), Expect = 2.5
 Identities = 17/73 (23%), Positives = 28/73 (38%), Gaps = 3/73 (4%)

Query: 166 WGKPPVDETGKPLYGDVFGTSAETKTAEVNE-EDIEKTAWGELESESEEESEEDDEEEEE 224
           WG    +   K    D      E +  ++   E +E       ESE EEE E+ ++E + 
Sbjct: 341 WGN--EEAERKDYILDSSSVPLEKQFDDILYFEKMEIENRNPEESEHEEEVEDYEDENDH 398

Query: 225 EEEGEDESGLITP 237
            +   D+  L   
Sbjct: 399 SKRICDDDELENH 411


>gnl|CDD|235302 PRK04456, PRK04456, acetyl-CoA decarbonylase/synthase complex
           subunit beta; Reviewed.
          Length = 463

 Score = 32.7 bits (75), Expect = 0.29
 Identities = 16/41 (39%), Positives = 22/41 (53%)

Query: 210 ESEEESEEDDEEEEEEEEGEDESGLITPAEGLMTPSGITSV 250
            +EEE EE++EEEEEEE   +   +  P   L    GI  +
Sbjct: 403 AAEEEEEEEEEEEEEEEPVAEVMMMPAPEMQLPASGGIKII 443


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
           family consists of several hypothetical bacterial
           proteins of around 200 residues in length. The function
           of this family is unknown.
          Length = 214

 Score = 32.4 bits (74), Expect = 0.31
 Identities = 20/49 (40%), Positives = 29/49 (59%), Gaps = 1/49 (2%)

Query: 185 TSAETKTAEVNEEDIE-KTAWGELESESEEESEEDDEEEEEEEEGEDES 232
            S + +TAE+ E   E K A    + E + ++E++DEE EEE E EDE 
Sbjct: 53  KSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEE 101



 Score = 30.9 bits (70), Expect = 0.95
 Identities = 19/46 (41%), Positives = 27/46 (58%)

Query: 188 ETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESG 233
           E K A  +E+  +K    + + ESEEE+EE+DEE  +E E E E  
Sbjct: 68  EEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEK 113


>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
          Length = 211

 Score = 32.0 bits (73), Expect = 0.32
 Identities = 15/40 (37%), Positives = 18/40 (45%)

Query: 193 EVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDES 232
           E  EED  K    E E   +EE  ED EE  E +  E + 
Sbjct: 22  ENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKD 61



 Score = 32.0 bits (73), Expect = 0.35
 Identities = 12/45 (26%), Positives = 25/45 (55%), Gaps = 2/45 (4%)

Query: 188 ETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDES 232
           E +  +   E++E+      E+E++EE +  +E+ E EE  ++E 
Sbjct: 2   EKECKDAKHENMEEDC--CKENENKEEDKGKEEDLEFEEIEKEEI 44



 Score = 29.3 bits (66), Expect = 2.4
 Identities = 14/44 (31%), Positives = 17/44 (38%)

Query: 188 ETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDE 231
           E    + NE   E     E     E E EE  E+ EE  E + E
Sbjct: 14  EEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIE 57



 Score = 28.6 bits (64), Expect = 4.7
 Identities = 16/55 (29%), Positives = 24/55 (43%), Gaps = 1/55 (1%)

Query: 187 AETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLITPAEGL 241
            + K   + EED  K    + E + +EE  E +E E+EE   + E       E L
Sbjct: 6   KDAKHENM-EEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEEL 59


>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2.  PPP4R2 (protein phosphatase 4 core
           regulatory subunit R2) is the regulatory subunit of the
           histone H2A phosphatase complex. It has been shown to
           confer resistance to the anticancer drug cisplatin in
           yeast, and may confer resistance in higher eukaryotes.
          Length = 285

 Score = 32.5 bits (74), Expect = 0.34
 Identities = 18/36 (50%), Positives = 24/36 (66%)

Query: 196 EEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDE 231
           EED ++      E E +E+ EE++ EEEEEEE EDE
Sbjct: 250 EEDDDQDGDYVEEKELKEDEEEEETEEEEEEEDEDE 285



 Score = 29.4 bits (66), Expect = 3.3
 Identities = 16/47 (34%), Positives = 29/47 (61%)

Query: 184 GTSAETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGED 230
           G+  + K ++  E+D +   + E +   E+E EE+ EEEEEEE+ ++
Sbjct: 239 GSDVKNKKSDDEEDDDQDGDYVEEKELKEDEEEEETEEEEEEEDEDE 285



 Score = 28.3 bits (63), Expect = 7.9
 Identities = 14/38 (36%), Positives = 24/38 (63%)

Query: 195 NEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDES 232
           ++++ +    G+   E E + +E++EE EEEEE EDE 
Sbjct: 247 SDDEEDDDQDGDYVEEKELKEDEEEEETEEEEEEEDED 284


>gnl|CDD|219311 pfam07149, Pes-10, Pes-10.  This family consists of several
           Caenorhabditis elegans pes-10 and related proteins.
           Members of this family are typically around 400 residues
           in length. The function of this family is unknown.
          Length = 370

 Score = 32.5 bits (74), Expect = 0.37
 Identities = 14/42 (33%), Positives = 21/42 (50%)

Query: 198 DIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLITPAE 239
            +++    E+  E E+E E D   +  EE  E ESG+ T  E
Sbjct: 157 KVDQLLKDEVLEEDEQEEEADKLLDNLEEHSETESGIFTDEE 198


>gnl|CDD|216269 pfam01056, Myc_N, Myc amino-terminal region.  The myc family
           belongs to the basic helix-loop-helix leucine zipper
           class of transcription factors, see pfam00010. Myc forms
           a heterodimer with Max, and this complex regulates cell
           growth through direct activation of genes involved in
           cell replication. Mutations in the C-terminal 20
           residues of this domain cause unique changes in the
           induction of apoptosis, transformation, and G2 arrest.
          Length = 329

 Score = 32.2 bits (73), Expect = 0.40
 Identities = 13/20 (65%), Positives = 17/20 (85%)

Query: 208 ESESEEESEEDDEEEEEEEE 227
            S+SE E +E++EEEEEEEE
Sbjct: 223 GSDSESEEDEEEEEEEEEEE 242



 Score = 31.1 bits (70), Expect = 1.0
 Identities = 14/19 (73%), Positives = 18/19 (94%)

Query: 208 ESESEEESEEDDEEEEEEE 226
           +SESEE+ EE++EEEEEEE
Sbjct: 225 DSESEEDEEEEEEEEEEEE 243



 Score = 30.7 bits (69), Expect = 1.2
 Identities = 16/22 (72%), Positives = 19/22 (86%)

Query: 209 SESEEESEEDDEEEEEEEEGED 230
           S S+ ESEED+EEEEEEEE E+
Sbjct: 222 SGSDSESEEDEEEEEEEEEEEE 243



 Score = 30.3 bits (68), Expect = 1.8
 Identities = 13/23 (56%), Positives = 18/23 (78%)

Query: 209 SESEEESEEDDEEEEEEEEGEDE 231
           S S  +SE +++EEEEEEE E+E
Sbjct: 220 SSSGSDSESEEDEEEEEEEEEEE 242


>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
           (nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
           phosphoribosyltransferase) [Coenzyme metabolism].
          Length = 620

 Score = 32.5 bits (74), Expect = 0.40
 Identities = 15/41 (36%), Positives = 22/41 (53%)

Query: 192 AEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDES 232
            E  ++ IE+ A  E   + + ++ ED E   EE EG DES
Sbjct: 220 EETGDDGIEEDADEEDGDDDQPDNNEDSEAGREESEGSDES 260



 Score = 30.2 bits (68), Expect = 2.1
 Identities = 19/63 (30%), Positives = 31/63 (49%), Gaps = 3/63 (4%)

Query: 172 DETGKPLYGDVFGTSAETKTAEVNEEDI-EKTAWGELESESEEESEEDDEE--EEEEEEG 228
            +  +    D     A+ +  + ++ D  E +  G  ESE  +ESEED+ E  + E EEG
Sbjct: 216 MDMAEETGDDGIEEDADEEDGDDDQPDNNEDSEAGREESEGSDESEEDEAEATDGEGEEG 275

Query: 229 EDE 231
           E +
Sbjct: 276 EMD 278



 Score = 29.4 bits (66), Expect = 4.1
 Identities = 16/54 (29%), Positives = 27/54 (50%)

Query: 187 AETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLITPAEG 240
           AE    +  EED ++    + + ++ E+SE   EE E  +E E++    T  EG
Sbjct: 219 AEETGDDGIEEDADEEDGDDDQPDNNEDSEAGREESEGSDESEEDEAEATDGEG 272


>gnl|CDD|222440 pfam13897, GOLD_2, Golgi-dynamics membrane-trafficking.  Sec14-like
           Golgi-trafficking domain The GOLD domain is always found
           combined with lipid- or membrane-association domains.
          Length = 136

 Score = 31.2 bits (71), Expect = 0.42
 Identities = 14/25 (56%), Positives = 19/25 (76%)

Query: 209 SESEEESEEDDEEEEEEEEGEDESG 233
           SES +E EE++ EEEE E G+ E+G
Sbjct: 52  SESSDEEEEEEAEEEEAETGDVEAG 76


>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
          Length = 106

 Score = 30.3 bits (69), Expect = 0.44
 Identities = 12/27 (44%), Positives = 19/27 (70%)

Query: 208 ESESEEESEEDDEEEEEEEEGEDESGL 234
            +  E++ EE++EEE+EE E E  +GL
Sbjct: 75  AAAEEKKEEEEEEEEKEESEEEAAAGL 101



 Score = 29.2 bits (66), Expect = 1.4
 Identities = 12/27 (44%), Positives = 16/27 (59%)

Query: 208 ESESEEESEEDDEEEEEEEEGEDESGL 234
           E + EEE EE+++EE EEE       L
Sbjct: 78  EEKKEEEEEEEEKEESEEEAAAGLGAL 104


>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein.  This
           family includes the radial spoke head proteins RSP4 and
           RSP6 from Chlamydomonas reinhardtii, and several
           eukaryotic homologues, including mammalian RSHL1, the
           protein product of a familial ciliary dyskinesia
           candidate gene.
          Length = 481

 Score = 32.3 bits (74), Expect = 0.50
 Identities = 18/34 (52%), Positives = 24/34 (70%), Gaps = 2/34 (5%)

Query: 206 ELESESEEESEEDDEEEEEEEEGEDESG--LITP 237
           + + E E+E EE++EEEEE EE E E G  L+TP
Sbjct: 346 QKDEEEEQEDEEEEEEEEEPEEPEPEEGPPLLTP 379



 Score = 31.6 bits (72), Expect = 0.71
 Identities = 17/39 (43%), Positives = 23/39 (58%), Gaps = 3/39 (7%)

Query: 206 ELESESEEESEEDDEEEEEEEEGEDESGLITPAEGLMTP 244
             + + EEE E+++EEEEEEE  E E     P   L+TP
Sbjct: 344 PEQKDEEEEQEDEEEEEEEEEPEEPEPEEGPP---LLTP 379


>gnl|CDD|219922 pfam08595, RXT2_N, RXT2-like, N-terminal.  The family represents
           the N-terminal region of RXT2-like proteins. In S.
           cerevisiae, RXT2 has been demonstrated to be involved in
           conjugation with cellular fusion (mating) and invasive
           growth. A high throughput localisation study has
           localised RXT2 to the nucleus.
          Length = 141

 Score = 30.8 bits (70), Expect = 0.56
 Identities = 26/88 (29%), Positives = 41/88 (46%), Gaps = 10/88 (11%)

Query: 206 ELESESEEESEEDDEEEEEEEEGEDESGLITPAE---GLMTPSGITSVPAGLETPDSIEL 262
             + + ++E +EDDEE + E++ E+   LI   E    L  PS + + PA   T  S  L
Sbjct: 51  GGDIDDDDEDDEDDEEADAEDDDENPYKLIRLEEILAPLTHPSDLPTHPAISRTFKSKTL 110

Query: 263 RK------KKIESDMET-NEAPQLFQVL 283
            +        IE +    N+  +L QVL
Sbjct: 111 SELALEAIALIEKEQNNLNKLNKLLQVL 138


>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
           chain; Provisional.
          Length = 1033

 Score = 32.1 bits (73), Expect = 0.63
 Identities = 14/47 (29%), Positives = 24/47 (51%)

Query: 187 AETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESG 233
                 E +EE++E  A        ++E   +DE+E+EE++ E ES 
Sbjct: 6   NTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESP 52


>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
          Length = 1036

 Score = 31.8 bits (72), Expect = 0.68
 Identities = 35/151 (23%), Positives = 59/151 (39%), Gaps = 18/151 (11%)

Query: 127 LNAPIPQGCSFGYHAGGWGKPPVDETGKPLYGDVFGTGGWGKPPVDETGKPLYGDVFGTS 186
           ++   P+G +    A    +  V+ + K +  +     G  +     T   L       S
Sbjct: 1   MSTSKPKGSAPRGFAP---RTTVESSQKRIQQN----NGDKEDSSTSTSS-LSV-----S 47

Query: 187 AETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLITPAEGLMTPSG 246
           A  KT+   +E+I+       ES  EE   ED+ E E+ +    +S  I   E + T   
Sbjct: 48  AVEKTSNA-KEEIQVDFQHNSESAVEEVEAEDEIEVEQNQSDVLKSSSIVKEESISTDMD 106

Query: 247 IT---SVPAGLETPDSIELRKKKIESDMETN 274
                S+   L+  +   LRK++IE   E N
Sbjct: 107 GIDDDSLDRKLKL-ERENLRKREIEELAEEN 136


>gnl|CDD|237284 PRK13108, PRK13108, prolipoprotein diacylglyceryl transferase;
           Reviewed.
          Length = 460

 Score = 31.5 bits (71), Expect = 0.77
 Identities = 20/107 (18%), Positives = 37/107 (34%), Gaps = 9/107 (8%)

Query: 165 GWGKPPVDETG-----KPLYGDVFGTSAETKTAEVNEEDIEKTAWGELESESEEESEEDD 219
           G   P V+ET      +   GD+ G +      +             L SE+ +E+E + 
Sbjct: 357 GESTPAVEETSEADIEREQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPEV 416

Query: 220 EEEEEEEEGEDESGLITPAEGLMTPSGITSVPAGLETPDSIELRKKK 266
            E+        +   +  A     P+     P G+   D    R+++
Sbjct: 417 PEKAAPIPDPAKPDELAVAGPGDDPAE----PDGIRRQDDFSSRRRR 459



 Score = 31.5 bits (71), Expect = 0.80
 Identities = 14/52 (26%), Positives = 20/52 (38%), Gaps = 2/52 (3%)

Query: 181 DVFGTSAETKTAEVNEEDIEKTAWGELESESEEESEEDDEEE-EEEEEGEDE 231
           D  G S      E +E DIE+   G+L  ++    + D E      EE    
Sbjct: 354 DRDGEST-PAVEETSEADIEREQPGDLAGQAPAAHQVDAEAASAAPEEPAAL 404


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 31.4 bits (72), Expect = 0.86
 Identities = 17/53 (32%), Positives = 27/53 (50%)

Query: 187 AETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLITPAE 239
            + +  E  E+  +  A  + E E EEE E+ +EE+EEEEE  +E       +
Sbjct: 420 EKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEK 472



 Score = 30.3 bits (69), Expect = 1.8
 Identities = 22/54 (40%), Positives = 28/54 (51%)

Query: 187 AETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLITPAEG 240
           AE K  E  +E  +K   G+ + E EEE +E  EEE+EEEE E E       E 
Sbjct: 419 AEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEK 472



 Score = 30.3 bits (69), Expect = 2.1
 Identities = 20/66 (30%), Positives = 27/66 (40%)

Query: 174 TGKPLYGDVFGTSAETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESG 233
           TG            E    +  EE  EK        + EEE EE+ E++EEE+E E+E  
Sbjct: 403 TGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEA 462

Query: 234 LITPAE 239
                E
Sbjct: 463 EEEKEE 468


>gnl|CDD|218556 pfam05327, RRN3, RNA polymerase I specific transcription initiation
           factor RRN3.  This family consists of several eukaryotic
           proteins which are homologous to the yeast RRN3 protein.
           RRN3 is one of the RRN genes specifically required for
           the transcription of rDNA by RNA polymerase I (Pol I) in
           Saccharomyces cerevisiae.
          Length = 554

 Score = 31.5 bits (72), Expect = 0.90
 Identities = 8/40 (20%), Positives = 23/40 (57%)

Query: 193 EVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDES 232
           +  +++ E+    + + + E++  + D+++EEE + E E 
Sbjct: 223 DDIDDEEEERVLADEDDDDEDDMFDMDDDDEEESDPEVER 262


>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
           [Translation, ribosomal structure and biogenesis].
          Length = 109

 Score = 29.7 bits (67), Expect = 0.90
 Identities = 13/31 (41%), Positives = 18/31 (58%)

Query: 200 EKTAWGELESESEEESEEDDEEEEEEEEGED 230
              A  E  +E++E  EE+ EEE EEE  +D
Sbjct: 72  AAAAGAEAAAEADEAEEEEKEEEAEEESDDD 102



 Score = 27.8 bits (62), Expect = 3.7
 Identities = 11/29 (37%), Positives = 19/29 (65%)

Query: 205 GELESESEEESEEDDEEEEEEEEGEDESG 233
           G   +   +E+EE+++EEE EEE +D+  
Sbjct: 76  GAEAAAEADEAEEEEKEEEAEEESDDDML 104



 Score = 27.4 bits (61), Expect = 5.6
 Identities = 11/27 (40%), Positives = 18/27 (66%)

Query: 208 ESESEEESEEDDEEEEEEEEGEDESGL 234
           E+ +E +  E++E+EEE EE  D+  L
Sbjct: 78  EAAAEADEAEEEEKEEEAEEESDDDML 104



 Score = 27.4 bits (61), Expect = 6.3
 Identities = 11/27 (40%), Positives = 14/27 (51%)

Query: 208 ESESEEESEEDDEEEEEEEEGEDESGL 234
               E E EE +EE EEE + +   GL
Sbjct: 81  AEADEAEEEEKEEEAEEESDDDMLFGL 107


>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P.  This model
           represents the L12P protein of the large (50S) subunit
           of the archaeal ribosome.
          Length = 105

 Score = 29.6 bits (67), Expect = 0.91
 Identities = 16/30 (53%), Positives = 20/30 (66%)

Query: 205 GELESESEEESEEDDEEEEEEEEGEDESGL 234
               +E EEE EE++EEEEEE E E  +GL
Sbjct: 71  AAAAAEEEEEEEEEEEEEEEESEEEAMAGL 100



 Score = 27.7 bits (62), Expect = 3.8
 Identities = 14/27 (51%), Positives = 16/27 (59%)

Query: 208 ESESEEESEEDDEEEEEEEEGEDESGL 234
           E E EEE EE++EEE EEE       L
Sbjct: 77  EEEEEEEEEEEEEEESEEEAMAGLGAL 103


>gnl|CDD|184885 PRK14891, PRK14891, 50S ribosomal protein L24e/unknown domain
           fusion protein; Provisional.
          Length = 131

 Score = 29.9 bits (67), Expect = 0.92
 Identities = 14/57 (24%), Positives = 24/57 (42%)

Query: 184 GTSAETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLITPAEG 240
                   A    E  E+    + + +++E +E D  +E +EEE  DE+   T  E 
Sbjct: 58  AEKGPAAAATAAAEAAEEAEAADADEDADEAAEADAADEADEEEETDEAVDETADEA 114



 Score = 29.5 bits (66), Expect = 1.3
 Identities = 13/46 (28%), Positives = 23/46 (50%)

Query: 184 GTSAETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGE 229
              A    A    ++ E+T     E+  E ++E ++ +EEE+EE E
Sbjct: 85  ADEAAEADAADEADEEEETDEAVDETADEADAEAEEADEEEDEEAE 130



 Score = 28.8 bits (64), Expect = 2.3
 Identities = 13/46 (28%), Positives = 20/46 (43%)

Query: 187 AETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDES 232
              + AE +  D         E+  E   E D E EE +EE ++E+
Sbjct: 84  DADEAAEADAADEADEEEETDEAVDETADEADAEAEEADEEEDEEA 129


>gnl|CDD|217861 pfam04050, Upf2, Up-frameshift suppressor 2.  Transcripts
           harbouring premature signals for translation termination
           are recognised and rapidly degraded by eukaryotic cells
           through a pathway known as nonsense-mediated mRNA decay.
           In Saccharomyces cerevisiae, three trans-acting factors
           (Upf1 to Upf3) are required for nonsense-mediated mRNA
           decay.
          Length = 171

 Score = 30.4 bits (69), Expect = 0.94
 Identities = 14/51 (27%), Positives = 29/51 (56%)

Query: 186 SAETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLIT 236
            +E+++ +  E++       + ES  EEE +  D+E++EE + E+E   +T
Sbjct: 2   GSESESDDGEEDEELPEEDEDDESSDEEEVDLPDDEQDEESDSEEEQIFVT 52


>gnl|CDD|218555 pfam05320, Pox_RNA_Pol_19, Poxvirus DNA-directed RNA polymerase 19
           kDa subunit.  This family contains several DNA-directed
           RNA polymerase 19 kDa polypeptides. The Poxvirus
           DNA-directed RNA polymerase (EC: 2.7.7.6) catalyzes
           DNA-template-directed extension of the 3'-end of an RNA
           strand by one nucleotide at a time.
          Length = 167

 Score = 30.4 bits (69), Expect = 0.96
 Identities = 16/40 (40%), Positives = 21/40 (52%), Gaps = 1/40 (2%)

Query: 195 NEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGL 234
           + +DI      + +SE  EE EE DEE+ E  E  D S L
Sbjct: 3   DSDDIIDYESDDDDSEEYEEEEE-DEEDAESLESSDVSSL 41



 Score = 29.3 bits (66), Expect = 2.4
 Identities = 11/29 (37%), Positives = 17/29 (58%)

Query: 211 SEEESEEDDEEEEEEEEGEDESGLITPAE 239
            + ES++DD EE EEEE ++E      + 
Sbjct: 8   IDYESDDDDSEEYEEEEEDEEDAESLESS 36


>gnl|CDD|221668 pfam12619, MCM2_N, Mini-chromosome maintenance protein 2.  This
           domain family is found in eukaryotes, and is typically
           between 138 and 153 amino acids in length. The family is
           found in association with pfam00493. Mini-chromosome
           maintenance (MCM) proteins are essential for DNA
           replication. These proteins use ATPase activity to
           perform this function.
          Length = 145

 Score = 30.0 bits (68), Expect = 0.97
 Identities = 17/29 (58%), Positives = 20/29 (68%), Gaps = 4/29 (13%)

Query: 206 ELESESEEESEED----DEEEEEEEEGED 230
           E E + EEE E+D    DEE EEEE+GED
Sbjct: 1   EDEPDEEEELEDDIDDLDEEAEEEEDGED 29


>gnl|CDD|220102 pfam09073, BUD22, BUD22.  BUD22 has been shown in yeast to be a
           nuclear protein involved in bud-site selection. It plays
           a role in positioning the proximal bud pole signal. More
           recently it has been shown to be involved in ribosome
           biogenesis.
          Length = 424

 Score = 31.0 bits (70), Expect = 1.0
 Identities = 16/48 (33%), Positives = 31/48 (64%)

Query: 184 GTSAETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDE 231
             S + +  E +++D E+ +  E ES+SEE +E+D ++EEEE+   ++
Sbjct: 156 KKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSED 203


>gnl|CDD|240327 PTZ00243, PTZ00243, ABC transporter; Provisional.
          Length = 1560

 Score = 31.3 bits (71), Expect = 1.0
 Identities = 16/67 (23%), Positives = 25/67 (37%), Gaps = 1/67 (1%)

Query: 177 PLYGDVFGTSAETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLIT 236
            LY  +     E K ++  + D E     +       + E    ++E   EG D + L  
Sbjct: 872 SLYATLAAELKENKDSKEGDADAEVAE-VDAAPGGAVDHEPPVAKQEGNAEGGDGAALDA 930

Query: 237 PAEGLMT 243
            A  LMT
Sbjct: 931 AAGRLMT 937


>gnl|CDD|237622 PRK14140, PRK14140, heat shock protein GrpE; Provisional.
          Length = 191

 Score = 30.4 bits (69), Expect = 1.0
 Identities = 14/55 (25%), Positives = 25/55 (45%)

Query: 188 ETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLITPAEGLM 242
           E    EV E ++E+    E+E E+ EE  E +  +EE+ +  +    +   E   
Sbjct: 6   EQVEEEVEETEVEEAVEDEVEEETVEEESEAELLDEEQAKIAELEAKLDELEERY 60



 Score = 27.7 bits (62), Expect = 9.8
 Identities = 11/47 (23%), Positives = 17/47 (36%)

Query: 188 ETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGL 234
           E       E +  +      +   EE  EE+ E E  +EE    + L
Sbjct: 3   EKNEQVEEEVEETEVEEAVEDEVEEETVEEESEAELLDEEQAKIAEL 49


>gnl|CDD|191365 pfam05756, S-antigen, S-antigen protein.  S-antigens are heat
           stable proteins that are found in the blood of
           individuals infected with malaria.
          Length = 94

 Score = 29.1 bits (65), Expect = 1.1
 Identities = 13/60 (21%), Positives = 33/60 (55%)

Query: 181 DVFGTSAETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLITPAEG 240
           +++GT    ++   N ++ +   + +L+ E E E++++++   EE   ++E+GLI     
Sbjct: 34  NIYGTKYYLRSGFFNSKNGKGQKYEDLQEEGEGENDDEEDSNSEESNNDEENGLIEGQGE 93



 Score = 27.6 bits (61), Expect = 3.5
 Identities = 14/37 (37%), Positives = 22/37 (59%), Gaps = 1/37 (2%)

Query: 197 EDIEKTAWGELESESEEESEEDDEEEEEE-EEGEDES 232
           ED+++   GE + E +  SEE + +EE    EG+ ES
Sbjct: 58  EDLQEEGEGENDDEEDSNSEESNNDEENGLIEGQGES 94


>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM).  This
           family consists of several Plasmodium falciparum SPAM
           (secreted polymorphic antigen associated with
           merozoites) proteins. Variation among SPAM alleles is
           the result of deletions and amino acid substitutions in
           non-repetitive sequences within and flanking the alanine
           heptad-repeat domain. Heptad repeats in which the a and
           d position contain hydrophobic residues generate
           amphipathic alpha-helices which give rise to helical
           bundles or coiled-coil structures in proteins. SPAM is
           an example of a P. falciparum antigen in which a
           repetitive sequence has features characteristic of a
           well-defined structural element.
          Length = 164

 Score = 30.2 bits (68), Expect = 1.1
 Identities = 15/44 (34%), Positives = 27/44 (61%)

Query: 188 ETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDE 231
           + +  E +EE+IE+    E E E  E+ EE++E+EE+  + +D 
Sbjct: 51  DEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDLKDI 94



 Score = 28.7 bits (64), Expect = 3.9
 Identities = 21/83 (25%), Positives = 37/83 (44%)

Query: 193 EVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLITPAEGLMTPSGITSVPA 252
           + ++E+ E+    E+E   + E EE+  E+EEEEE ++E  +          + I +   
Sbjct: 48  QEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDLKDIEKKNINDIFNSTQ 107

Query: 253 GLETPDSIELRKKKIESDMETNE 275
                + I    KK E   +T E
Sbjct: 108 DDNAQNLISKNYKKNEKSKKTAE 130


>gnl|CDD|218003 pfam04281, Tom22, Mitochondrial import receptor subunit Tom22.  The
           mitochondrial protein translocase family, which is
           responsible for movement of nuclear encoded pre-proteins
           into mitochondria, is very complex with at least 19
           components. These proteins include several chaperone
           proteins, four proteins of the outer membrane
           translocase (Tom) import receptor, five proteins of the
           Tom channel complex, five proteins of the inner membrane
           translocase (Tim) and three "motor" proteins. This
           family represents the Tom22 proteins. The N terminal
           region of Tom22 has been shown to have chaperone-like
           activity, and the C terminal region faces the
           intermembrane face.
          Length = 136

 Score = 29.9 bits (68), Expect = 1.1
 Identities = 8/40 (20%), Positives = 20/40 (50%)

Query: 193 EVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDES 232
           EV +E  ++          EE  ++D+++ + + +  D+S
Sbjct: 6   EVEDETFQEKPAAPKNLAQEESDDDDEDDTDTDSDISDDS 45


>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
           includes archaeal L12p, the protein that is functionally
           equivalent to L7/L12 in bacteria and the P1 and P2
           proteins in eukaryotes. L12p is homologous to P1 and P2
           but is not homologous to bacterial L7/L12. It is located
           in the L12 stalk, with proteins L10, L11, and 23S rRNA.
           L12p is the only protein in the ribosome to occur as
           multimers, always appearing as sets of dimers. Recent
           data indicate that most archaeal species contain six
           copies of L12p (three homodimers), while eukaryotes have
           four copies (two heterodimers), and bacteria may have
           four or six copies (two or three homodimers), depending
           on the species. The organization of proteins within the
           stalk has been characterized primarily in bacteria,
           where L7/L12 forms either two or three homodimers and
           each homodimer binds to the extended C-terminal helix of
           L10. L7/L12 is attached to the ribosome through L10 and
           is the only ribosomal protein that does not directly
           interact with rRNA. Archaeal L12p is believed to
           function in a similar fashion. However, hybrid ribosomes
           containing the large subunit from E. coli with an
           archaeal stalk are able to bind archaeal and eukaryotic
           elongation factors but not bacterial elongation factors.
           In several mesophilic and thermophilic archaeal species,
           the binding of 23S rRNA to protein L11 and to the
           L10/L12p pentameric complex was found to be
           temperature-dependent and cooperative.
          Length = 106

 Score = 29.4 bits (66), Expect = 1.1
 Identities = 14/27 (51%), Positives = 18/27 (66%)

Query: 208 ESESEEESEEDDEEEEEEEEGEDESGL 234
           E   E+E E+  EEE+EEEE E  +GL
Sbjct: 76  EKAEEKEEEKKKEEEKEEEEEEALAGL 102



 Score = 27.1 bits (60), Expect = 7.9
 Identities = 12/26 (46%), Positives = 18/26 (69%)

Query: 208 ESESEEESEEDDEEEEEEEEGEDESG 233
             E  EE EE+ ++EEE+EE E+E+ 
Sbjct: 74  AEEKAEEKEEEKKKEEEKEEEEEEAL 99



 Score = 26.7 bits (59), Expect = 9.7
 Identities = 11/27 (40%), Positives = 17/27 (62%)

Query: 208 ESESEEESEEDDEEEEEEEEGEDESGL 234
           E + EE+ +E+++EEEEEE       L
Sbjct: 79  EEKEEEKKKEEEKEEEEEEALAGLGAL 105


>gnl|CDD|222792 PHA00435, PHA00435, capsid assembly protein.
          Length = 306

 Score = 30.6 bits (69), Expect = 1.2
 Identities = 21/88 (23%), Positives = 35/88 (39%), Gaps = 6/88 (6%)

Query: 195 NEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGL-ITPAEGLMTPSGITSVPAG 253
           ++E   +    E   E E E  E+DEEEE EEE E+   L  TP E       +     G
Sbjct: 66  DDEGRIEVRISEDGEEEEVEEGEEDEEEEGEEESEEFEPLGDTPEELTEASEQLEEHEEG 125

Query: 254 LETPDSIELRKKKIESDMETNEAPQLFQ 281
            +      + ++ +E  +      ++  
Sbjct: 126 FQ-----AMVEQAVERGLSAETITRIQA 148



 Score = 29.8 bits (67), Expect = 2.4
 Identities = 19/60 (31%), Positives = 26/60 (43%), Gaps = 3/60 (5%)

Query: 196 EEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLITPAEGLMTPSGITSVPAGLE 255
            ED E      +  + EEE  E+ EE+EEEE  E+        +   TP  +T     LE
Sbjct: 64  GEDDEGRIEVRISEDGEEEEVEEGEEDEEEEGEEESEEFEPLGD---TPEELTEASEQLE 120


>gnl|CDD|224530 COG1614, CdhC, CO dehydrogenase/acetyl-CoA synthase beta subunit
           [Energy production and conversion].
          Length = 470

 Score = 30.9 bits (70), Expect = 1.2
 Identities = 28/90 (31%), Positives = 41/90 (45%), Gaps = 3/90 (3%)

Query: 181 DVFGTSAETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLITPAEG 240
           D   T  +  T +   E +++     +E  +EEE EE  EEEEEEE  E E+ +  P  G
Sbjct: 372 DKIATEEDATTIDELREFLKEKGHPVVERWAEEEEEE--EEEEEEEAAEAEAPMEEPVPG 429

Query: 241 LMTPSGITSVPAGLETPDSIELRKKKIESD 270
              P  +    AG      I L+  KI ++
Sbjct: 430 FEVPE-MPMPAAGGGGGIKIVLKNAKITAE 458


>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
           [Transcription / DNA replication, recombination, and
           repair / Chromatin structure and dynamics].
          Length = 1001

 Score = 31.1 bits (70), Expect = 1.2
 Identities = 12/53 (22%), Positives = 26/53 (49%), Gaps = 3/53 (5%)

Query: 181 DVFGTSAETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESG 233
           D    + E + ++ + ED+ +    E E++S +E + +D +E E +   D   
Sbjct: 947 DESDETDEDEESDESSEDLSED---ESENDSSDEEDGEDWDELESKAAYDSRP 996



 Score = 30.0 bits (67), Expect = 2.8
 Identities = 18/76 (23%), Positives = 32/76 (42%), Gaps = 13/76 (17%)

Query: 161 FGTGGWG---KPPVDETGKPLYGDVFGTSAETKTAEVNEEDIEKTAWGELESESEEESEE 217
           F  GGW        DE             +E + +E      +++   + + ES+E SE+
Sbjct: 915 FEDGGWSFLMVGSDDE----------SDESEEEVSEYEASSDDESDETDEDEESDESSED 964

Query: 218 DDEEEEEEEEGEDESG 233
             E+E E +  ++E G
Sbjct: 965 LSEDESENDSSDEEDG 980


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 31.2 bits (71), Expect = 1.3
 Identities = 11/27 (40%), Positives = 20/27 (74%)

Query: 206  ELESESEEESEEDDEEEEEEEEGEDES 232
            +  SE +++SE DD E+E++E+ ED+ 
Sbjct: 1362 DSSSEDDDDSEVDDSEDEDDEDDEDDD 1388



 Score = 28.9 bits (65), Expect = 7.1
 Identities = 6/26 (23%), Positives = 16/26 (61%)

Query: 208  ESESEEESEEDDEEEEEEEEGEDESG 233
             S  +++  E D+ E+E++E +++  
Sbjct: 1363 SSSEDDDDSEVDDSEDEDDEDDEDDD 1388


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 30.7 bits (70), Expect = 1.4
 Identities = 12/46 (26%), Positives = 27/46 (58%)

Query: 187 AETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDES 232
            +    +++++DI+     E + E +++ + DDE+EE++E  E E 
Sbjct: 135 DDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEK 180



 Score = 28.4 bits (64), Expect = 8.4
 Identities = 7/48 (14%), Positives = 29/48 (60%)

Query: 185 TSAETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDES 232
             A+    + +++D++     + + + +++ ++DD++ ++E+E + E+
Sbjct: 128 NQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEA 175


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 31.1 bits (70), Expect = 1.4
 Identities = 18/85 (21%), Positives = 38/85 (44%), Gaps = 3/85 (3%)

Query: 190  KTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLITPAEGLMTPSGITS 249
            ++A  NE D+          E ++  E++DEEE  ++ G D+   I P            
Sbjct: 3926 QSAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSDDVGIDDE--IQPDIQENNSQPPPE 3983

Query: 250  VPAGLETPDSIELRKKKIESDMETN 274
                L+ P+ ++L +K+ +   +++
Sbjct: 3984 NED-LDLPEDLKLDEKEGDVSKDSD 4007



 Score = 29.2 bits (65), Expect = 5.8
 Identities = 13/49 (26%), Positives = 21/49 (42%), Gaps = 6/49 (12%)

Query: 188  ETKTAEVNEED------IEKTAWGELESESEEESEEDDEEEEEEEEGED 230
            E  TA  ++ D      +E    G  +     E+E  D EEE ++  E+
Sbjct: 3843 EEDTANQSDLDESEARELESDMNGVTKDSVVSENENSDSEEENQDLDEE 3891



 Score = 28.8 bits (64), Expect = 7.5
 Identities = 16/49 (32%), Positives = 25/49 (51%), Gaps = 5/49 (10%)

Query: 198  DIEKTAWGELESE-----SEEESEEDDEEEEEEEEGEDESGLITPAEGL 241
            DI++T    L +E       E+  E  E+E+ E+E  D +  ITPA  +
Sbjct: 4268 DIKQTGEDTLPTEFGSINQSEKVFELSEDEDIEDELPDYNVKITPAMPI 4316



 Score = 28.4 bits (63), Expect = 8.2
 Identities = 27/106 (25%), Positives = 44/106 (41%), Gaps = 12/106 (11%)

Query: 185  TSAETKTAEVNEEDIEKTAWGELESESEEESEEDD----EEEEEEEEGEDESGLITPAEG 240
            ++A  ++  V++ED  K    +   E E+E E  D    ++E + +  E+ S      E 
Sbjct: 3927 SAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSDDVGIDDEIQPDIQENNSQPPPENED 3986

Query: 241  LMTPS--------GITSVPAGLETPDSIELRKKKIESDMETNEAPQ 278
            L  P         G  S  + LE  D     + K E+D E +E  Q
Sbjct: 3987 LDLPEDLKLDEKEGDVSKDSDLEDMDMEAADENKEEADAEKDEPMQ 4032


>gnl|CDD|218598 pfam05470, eIF-3c_N, Eukaryotic translation initiation factor 3
           subunit 8 N-terminus.  The largest of the mammalian
           translation initiation factors, eIF3, consists of at
           least eight subunits ranging in mass from 35 to 170 kDa.
           eIF3 binds to the 40 S ribosome in an early step of
           translation initiation and promotes the binding of
           methionyl-tRNAi and mRNA.
          Length = 593

 Score = 30.5 bits (69), Expect = 1.5
 Identities = 19/72 (26%), Positives = 36/72 (50%)

Query: 206 ELESESEEESEEDDEEEEEEEEGEDESGLITPAEGLMTPSGITSVPAGLETPDSIELRKK 265
           E E E EEE E+DD++  ++E+ +++    T      + SG+  V    E  +  +L KK
Sbjct: 136 ESEDEEEEEDEDDDDDGSDDEDEDEDGVGATEEVAASSESGVDRVKEDDEEDEDADLSKK 195

Query: 266 KIESDMETNEAP 277
            +  + +  + P
Sbjct: 196 DVLEEPKMFKKP 207


>gnl|CDD|219355 pfam07267, Nucleo_P87, Nucleopolyhedrovirus capsid protein P87.
           This family consists of several Nucleopolyhedrovirus
           capsid protein P87 sequences. P87 is expressed late in
           infection and concentrated in infected cell nuclei.
          Length = 606

 Score = 30.6 bits (69), Expect = 1.6
 Identities = 27/155 (17%), Positives = 51/155 (32%), Gaps = 26/155 (16%)

Query: 94  KTFSRETVIPPPWLIAMQRYGPPPSYP--------NLKIPGLNAPI--PQGCSFGYHAGG 143
           KT  R+   P    +  + +  P   P          +I     P+  P   S  Y A  
Sbjct: 258 KTVWRQKNTPAQTPMPERSWQTPAQTPARRISTPMTEEIKSWQTPLQTPAMYSSDYQAP- 316

Query: 144 WGKPPVDETGKPLYGDVFGTGGWGKPPVDETGKPLYGDVFGTSAETKTA----EVNEEDI 199
             KP    T + L  + F +  +    + ++             + +      E + ++ 
Sbjct: 317 --KPEPIYTWEELLRERFPSDLFAISSLPDSDSEASDSGPTRKRKRRRVPPLPEYSSDED 374

Query: 200 EKTAWGELESESEEESEEDDEEEEEEEEGEDESGL 234
           E         +  +E E D E+E +    ED++ L
Sbjct: 375 E---------DDSDEDEVDYEKERKRRREEDKNFL 400


>gnl|CDD|218258 pfam04774, HABP4_PAI-RBP1, Hyaluronan / mRNA binding family.  This
           family includes the HABP4 family of hyaluronan-binding
           proteins, and the PAI-1 mRNA-binding protein, PAI-RBP1.
           HABP4 has been observed to bind hyaluronan (a
           glucosaminoglycan), but it is not known whether this is
           its primary role in vivo. It has also been observed to
           bind RNA, but with a lower affinity than that for
           hyaluronan. PAI-1 mRNA-binding protein specifically
           binds the mRNA of type-1 plasminogen activator inhibitor
           (PAI-1), and is thought to be involved in regulation of
           mRNA stability. However, in both cases, the sequence
           motifs predicted to be important for ligand binding are
           not conserved throughout the family, so it is not known
           whether members of this family share a common function.
          Length = 106

 Score = 28.9 bits (65), Expect = 1.7
 Identities = 16/40 (40%), Positives = 22/40 (55%)

Query: 193 EVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDES 232
           E   E+ EK    E ++  EE ++E   EEEE EE ED+ 
Sbjct: 39  EEQGEEEEKNEVEEKQAVEEEANKEGVVEEEEVEEEEDKE 78


>gnl|CDD|217829 pfam03985, Paf1, Paf1.  Members of this family are components of
           the RNA polymerase II associated Paf1 complex. The Paf1
           complex functions during the elongation phase of
           transcription in conjunction with Spt4-Spt5 and
           Spt16-Pob3i.
          Length = 431

 Score = 30.5 bits (69), Expect = 1.7
 Identities = 13/41 (31%), Positives = 22/41 (53%)

Query: 192 AEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDES 232
            EV+E++ E+      E E EE  + ++E  +  E+G  ES
Sbjct: 368 EEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSES 408



 Score = 30.1 bits (68), Expect = 2.2
 Identities = 18/62 (29%), Positives = 29/62 (46%)

Query: 209 SESEEESEEDDEEEEEEEEGEDESGLITPAEGLMTPSGITSVPAGLETPDSIELRKKKIE 268
              E + +ED+EEE+  +E E+E G  +  EG  +    +S  +     DS     K+  
Sbjct: 366 DFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESA 425

Query: 269 SD 270
           SD
Sbjct: 426 SD 427



 Score = 30.1 bits (68), Expect = 2.5
 Identities = 13/26 (50%), Positives = 15/26 (57%)

Query: 208 ESESEEESEEDDEEEEEEEEGEDESG 233
             E E+E EE   +E EEEEGED   
Sbjct: 370 VDEDEDEEEEQRSDEHEEEEGEDSEE 395



 Score = 28.9 bits (65), Expect = 4.6
 Identities = 14/26 (53%), Positives = 16/26 (61%)

Query: 208 ESESEEESEEDDEEEEEEEEGEDESG 233
           E E EEE +  DE EEEE E  +E G
Sbjct: 372 EDEDEEEEQRSDEHEEEEGEDSEEEG 397



 Score = 28.9 bits (65), Expect = 5.5
 Identities = 12/39 (30%), Positives = 18/39 (46%)

Query: 195 NEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESG 233
            E  +       L+    EE +ED++EEEE+   E E  
Sbjct: 350 KESKMRDKRRARLDPIDFEEVDEDEDEEEEQRSDEHEEE 388



 Score = 28.5 bits (64), Expect = 6.5
 Identities = 13/37 (35%), Positives = 20/37 (54%)

Query: 197 EDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESG 233
           + I+     E E E EE+  ++ EEEE E+  E+ S 
Sbjct: 363 DPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQ 399


>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II).  Bone
           sialoprotein (BSP) is a major structural protein of the
           bone matrix that is specifically expressed by
           fully-differentiated osteoblasts. The expression of bone
           sialoprotein (BSP) is normally restricted to mineralised
           connective tissues of bones and teeth where it has been
           associated with mineral crystal formation. However, it
           has been found that ectopic expression of BSP occurs in
           various lesions, including oral and extraoral
           carcinomas, in which it has been associated with the
           formation of microcrystalline deposits and the
           metastasis of cancer cells to bone.
          Length = 291

 Score = 30.0 bits (67), Expect = 1.8
 Identities = 18/55 (32%), Positives = 25/55 (45%)

Query: 198 DIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLITPAEGLMTPSGITSVPA 252
           +  K A  E ES+ +EE EE++EEEE E E  ++    T         G  S   
Sbjct: 124 NAGKKATKEDESDEDEEEEEEEEEEEAEVEENEQGTNGTSTNSTEVDHGNGSSGG 178



 Score = 29.3 bits (65), Expect = 3.2
 Identities = 16/31 (51%), Positives = 21/31 (67%)

Query: 201 KTAWGELESESEEESEEDDEEEEEEEEGEDE 231
           K      ++  E+ES+ED+EEEEEEEE E E
Sbjct: 121 KAGNAGKKATKEDESDEDEEEEEEEEEEEAE 151


>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
          Length = 619

 Score = 30.5 bits (70), Expect = 1.9
 Identities = 11/38 (28%), Positives = 17/38 (44%)

Query: 195 NEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDES 232
              D              EE ++D++EEEEE+E +D  
Sbjct: 172 GFVDPNAEEDPAHVGSELEELDDDEDEEEEEDENDDSL 209



 Score = 29.8 bits (68), Expect = 3.0
 Identities = 7/26 (26%), Positives = 16/26 (61%)

Query: 208 ESESEEESEEDDEEEEEEEEGEDESG 233
           E  +   SE ++ +++E+EE E++  
Sbjct: 180 EDPAHVGSELEELDDDEDEEEEEDEN 205



 Score = 29.4 bits (67), Expect = 3.8
 Identities = 13/41 (31%), Positives = 22/41 (53%)

Query: 192 AEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDES 232
            + N E+       ELE   ++E EE++E+E ++    DES
Sbjct: 174 VDPNAEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADES 214



 Score = 29.4 bits (67), Expect = 4.2
 Identities = 11/42 (26%), Positives = 21/42 (50%)

Query: 196 EEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLITP 237
            +   +     + SE EE  +++DEEEEE+E  +  +   + 
Sbjct: 174 VDPNAEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADESE 215



 Score = 28.6 bits (65), Expect = 7.7
 Identities = 10/37 (27%), Positives = 17/37 (45%)

Query: 198 DIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGL 234
           D       E +        E+ +++E+EEE EDE+  
Sbjct: 171 DGFVDPNAEEDPAHVGSELEELDDDEDEEEEEDENDD 207


>gnl|CDD|178439 PLN02847, PLN02847, triacylglycerol lipase.
          Length = 633

 Score = 30.2 bits (68), Expect = 2.0
 Identities = 29/96 (30%), Positives = 40/96 (41%), Gaps = 18/96 (18%)

Query: 185 TSAETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLITPAEGLMTP 244
           TS+    + V E    K+   + ES S + S  DDEEEEE    ED           +T 
Sbjct: 438 TSSVNSESLVTEVKTTKSVEHKSESSSSDGSGHDDEEEEEPLLSEDRV---------ITS 488

Query: 245 SGITSVPAG-----LETPDSIELRKKKIESDMETNE 275
           S    V  G     LE     EL++++ E D +  E
Sbjct: 489 SVEEEVTEGELWYELEK----ELQRQETEVDAQAQE 520



 Score = 29.8 bits (67), Expect = 2.9
 Identities = 21/64 (32%), Positives = 30/64 (46%), Gaps = 6/64 (9%)

Query: 188 ETKTAEVNEEDIEKTAWGELESE-SEEESEEDDEEEEEE-----EEGEDESGLITPAEGL 241
              T+ V EE  E   W ELE E   +E+E D + +EEE     E  E+E+ L    E  
Sbjct: 484 RVITSSVEEEVTEGELWYELEKELQRQETEVDAQAQEEEAAAAKEITEEENVLAKAVESK 543

Query: 242 MTPS 245
            + +
Sbjct: 544 TSIT 547


>gnl|CDD|129574 TIGR00483, EF-1_alpha, translation elongation factor EF-1 alpha.
           This model represents the counterpart of bacterial EF-Tu
           for the Archaea (aEF-1 alpha) and Eukaryotes (eEF-1
           alpha). The trusted cutoff is set fairly high so that
           incomplete sequences will score between suggested and
           trusted cutoff levels [Protein synthesis, Translation
           factors].
          Length = 426

 Score = 30.2 bits (68), Expect = 2.0
 Identities = 26/71 (36%), Positives = 33/71 (46%), Gaps = 10/71 (14%)

Query: 146 KPPVDETGKPL---YGDVFGTGGWGKPPVD--ETGKPLYGDVF-----GTSAETKTAEVN 195
           +PP   T KPL     DV+   G G  PV   ETG    GD       G S E K+ E++
Sbjct: 218 EPPEKPTDKPLRIPIQDVYSITGVGTVPVGRVETGVLKPGDKVVFEPAGVSGEVKSIEMH 277

Query: 196 EEDIEKTAWGE 206
            E IE+   G+
Sbjct: 278 HEQIEQAEPGD 288


>gnl|CDD|227701 COG5414, COG5414, TATA-binding protein-associated factor
           [Transcription].
          Length = 392

 Score = 30.1 bits (67), Expect = 2.2
 Identities = 17/54 (31%), Positives = 28/54 (51%), Gaps = 2/54 (3%)

Query: 181 DVFGTSAETKTAEVNEEDIEKTAWGEL--ESESEEESEEDDEEEEEEEEGEDES 232
           D+   +AE +  EV+E D E+        E+  EE   +  +E  EE+E +DE+
Sbjct: 276 DLDVGAAEIENKEVSEGDKEQQQEEVENAEAHKEEVQSDRPDEIGEEKEEDDEN 329


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 30.1 bits (68), Expect = 2.2
 Identities = 16/82 (19%), Positives = 35/82 (42%), Gaps = 6/82 (7%)

Query: 188 ETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLITPAEGLMTPSGI 247
              T+   ++  +K      + +SE++ ++  E++E + EGE + G  TP +   T    
Sbjct: 49  VATTSTKKDKKEDKNNE--SKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKT---- 102

Query: 248 TSVPAGLETPDSIELRKKKIES 269
              P   +  + ++    KI  
Sbjct: 103 KKKPPKPKPNEDVDNAFNKIAE 124


>gnl|CDD|238836 cd01673, dNK, Deoxyribonucleoside kinase (dNK) catalyzes the
           phosphorylation of deoxyribonucleosides to yield
           corresponding monophosphates (dNMPs). This family
           consists of various deoxynucleoside kinases including
           deoxyribo- cytidine (EC 2.7.1.74), guanosine (EC
           2.7.1.113), adenosine (EC 2.7.1.76), and thymidine (EC
           2.7.1.21) kinases. They are key enzymes in the salvage
           of deoxyribonucleosides originating from extra- or
           intracellular breakdown of DNA.
          Length = 193

 Score = 29.5 bits (67), Expect = 2.4
 Identities = 12/32 (37%), Positives = 21/32 (65%), Gaps = 1/32 (3%)

Query: 15  TLKSKMRERVRPKLGKIDIDY-QKLHDAFFKW 45
           T   ++++R RP+   I +DY + LH+A+ KW
Sbjct: 137 TCLKRIKKRGRPEEQGIPLDYLEDLHEAYEKW 168


>gnl|CDD|218397 pfam05044, Prox1, Homeobox prospero-like protein (PROX1).  The
           homeobox gene Prox1 is expressed in a subpopulation of
           endothelial cells that, after budding from veins, gives
           rise to the mammalian lymphatic system. Prox1 has been
           found to be an early specific marker for the developing
           liver and pancreas in the mammalian foregut endoderm.
           This family contains an atypical homeobox domain.
          Length = 908

 Score = 30.0 bits (67), Expect = 2.6
 Identities = 36/182 (19%), Positives = 57/182 (31%), Gaps = 22/182 (12%)

Query: 220 EEEEEEEEGEDESGLITPAEGLMTPSGITSVPAGLETPDSIELRKKKIESDMETNEAPQL 279
             +  +       G     EG        S PA   +    E+ ++ I S+   +  P+ 
Sbjct: 130 ISQLLKNNMAKNGGGPDRGEGDRGDREAPSFPASALSNTKPEVPQEDICSNSSRDSPPE- 188

Query: 280 FQVLTEKRPDKLGPGMMASTH--------------MYEVPGAGASAPGAPV----KRTLP 321
             +    RP       +   H              M   P AG S   A V    KR L 
Sbjct: 189 -CMSPFGRPPSSSTERLNDEHMRAKRARVENIVRGMSHSPSAGISEGQAQVNGCKKRKLY 247

Query: 322 SSVELALDPSELDLVDTDAMAARYEQQMREQTSGLQKEDLSDMLAEHVARQKNKRKRQQT 381
              +    P+E        + +R  +  R+Q    Q++   D L      QK + +RQ  
Sbjct: 248 QPQQHETSPAESYGAAAAGLNSR--ENKRKQMLPQQQQQSFDQLVSPRKEQKREERRQLK 305

Query: 382 TQ 383
            Q
Sbjct: 306 QQ 307


>gnl|CDD|220377 pfam09747, DUF2052, Coiled-coil domain containing protein
           (DUF2052).  This entry is of sequences of two conserved
           domains separated by a region of low complexity,
           spanning some 200 residues. The function is unknown.
          Length = 178

 Score = 28.9 bits (65), Expect = 2.8
 Identities = 22/73 (30%), Positives = 33/73 (45%), Gaps = 1/73 (1%)

Query: 200 EKTAWGELESESEEESEEDDEEEEEEEEGEDESGLITPAEGLMTPSGITSVPAGLETPDS 259
           E+     LE + EEES E++EE+ EEE  E+E        G             L TP+ 
Sbjct: 67  EEARLELLEQQEEEESAEEEEEDPEEENEEEEEEYQRGPFGEEEEE-DGDSYDELPTPEE 125

Query: 260 IELRKKKIESDME 272
            E  +++  S+M 
Sbjct: 126 REELREEFISEMH 138


>gnl|CDD|220605 pfam10156, Med17, Subunit 17 of Mediator complex.  This Mediator
           complex subunit was formerly known as Srb4 in yeasts or
           Trap80 in Drosophila and human. The Med17 subunit is
           located within the head domain and is essential for cell
           viability to the extent that a mutant strain of
           cerevisiae lacking it shows all RNA polymerase
           II-dependent transcription ceasing at non-permissive
           temperatures.
          Length = 454

 Score = 29.6 bits (67), Expect = 3.1
 Identities = 11/48 (22%), Positives = 24/48 (50%), Gaps = 8/48 (16%)

Query: 192 AEVNEEDIE--------KTAWGELESESEEESEEDDEEEEEEEEGEDE 231
            ++ EE +         K  + E   E E++ E+D+++ EE ++  +E
Sbjct: 47  RDLTEESLREEIAKEAAKIDFSEESDEEEDDEEDDNDDSEENKDTVEE 94


>gnl|CDD|165442 PHA03171, PHA03171, UL37 tegument protein; Provisional.
          Length = 499

 Score = 29.6 bits (66), Expect = 3.2
 Identities = 21/74 (28%), Positives = 35/74 (47%), Gaps = 10/74 (13%)

Query: 179 YGDVFGTSAETKTAEVNEEDIEKTAWGELESESEEESEE----DDEEEEEEEEGEDESGL 234
           +G + G ++ET+       D+    + E   E+EEE  +    D E EEE+EE E E+  
Sbjct: 58  HGVIDGENSETERPR----DLTAALFAEAGEEAEEEDNDRECPDTEAEEEDEEEEIEAP- 112

Query: 235 ITPAEGLMTPSGIT 248
             P    +   G++
Sbjct: 113 -DPEVNPLDAEGLS 125


>gnl|CDD|240285 PTZ00135, PTZ00135, 60S acidic ribosomal protein P0; Provisional.
          Length = 310

 Score = 29.2 bits (66), Expect = 3.3
 Identities = 9/26 (34%), Positives = 13/26 (50%)

Query: 209 SESEEESEEDDEEEEEEEEGEDESGL 234
           + +   +     EEEEEEE +   GL
Sbjct: 283 AAAAAAAAAAPAEEEEEEEDDMGFGL 308


>gnl|CDD|220777 pfam10486, PI3K_1B_p101, Phosphoinositide 3-kinase gamma adapter
           protein p101 subunit.  Class I PI3Ks are dual-specific
           lipid and protein kinases involved in numerous
           intracellular signaling pathways. Class IB PI3K,
           p110gamma, is mainly activated by seven-transmembrane
           G-protein-coupled receptors (GPCRs), through its
           regulatory subunit p101 and G-protein beta-gamma
           subunits.
          Length = 856

 Score = 29.6 bits (66), Expect = 3.4
 Identities = 17/49 (34%), Positives = 25/49 (51%)

Query: 212 EEESEEDDEEEEEEEEGEDESGLITPAEGLMTPSGITSVPAGLETPDSI 260
           E    ED+E EEEEEE ++  GL    + L++ S + S  +     DS 
Sbjct: 309 EPVLSEDEEVEEEEEEDDETDGLSPERDSLLSNSSVYSNDSSDSKEDSS 357


>gnl|CDD|218303 pfam04874, Mak16, Mak16 protein C-terminal region.  The precise
           function of this eukaryotic protein family is unknown.
           The yeast orthologues have been implicated in cell cycle
           progression and biogenesis of 60S ribosomal subunits.
           The Schistosoma mansoni Mak16 has been shown to target
           protein transport to the nucleolus.
          Length = 97

 Score = 27.9 bits (62), Expect = 3.5
 Identities = 12/26 (46%), Positives = 16/26 (61%)

Query: 206 ELESESEEESEEDDEEEEEEEEGEDE 231
           E E   EEE EE++EE+E E E   +
Sbjct: 59  ESEENDEEEEEEEEEEDEGEIEYVSD 84



 Score = 27.1 bits (60), Expect = 5.4
 Identities = 12/42 (28%), Positives = 20/42 (47%)

Query: 193 EVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGL 234
            +  E+ E+    E E E EE+  E +   ++EE  E+   L
Sbjct: 54  ALEAEESEENDEEEEEEEEEEDEGEIEYVSDDEELEEEIEDL 95


>gnl|CDD|219009 pfam06404, PSK, Phytosulfokine precursor protein (PSK).  This
           family consists of several plant specific phytosulfokine
           precursor proteins. Phytosulfokines, are active as
           either a pentapeptide or a C-terminally truncated
           tetrapeptide. These compounds were first isolated
           because of their ability to stimulate cell division in
           somatic embryo cultures of Asparagus officinalis.
          Length = 74

 Score = 27.3 bits (61), Expect = 3.6
 Identities = 9/27 (33%), Positives = 16/27 (59%)

Query: 205 GELESESEEESEEDDEEEEEEEEGEDE 231
            ++E+    + +ED+   E E EGE+E
Sbjct: 27  SKIEAAESSKVDEDEVVSECEGEGEEE 53


>gnl|CDD|235030 PRK02315, PRK02315, adaptor protein; Provisional.
          Length = 233

 Score = 29.0 bits (66), Expect = 3.9
 Identities = 10/52 (19%), Positives = 25/52 (48%)

Query: 181 DVFGTSAETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDES 232
           +VF T ++       E+ I+  +  ++E    E+  +  E+  E+++  D +
Sbjct: 72  EVFVTKSKLDLPLNFEDLIDLPSDEDIEDLFPEDFFKLLEQSLEDKDDLDAT 123


>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
           This family represents Cwf15/Cwc15 (from
           Schizosaccharomyces pombe and Saccharomyces cerevisiae
           respectively) and their homologues. The function of
           these proteins is unknown, but they form part of the
           spliceosome and are thus thought to be involved in mRNA
           splicing.
          Length = 241

 Score = 28.9 bits (65), Expect = 4.2
 Identities = 7/35 (20%), Positives = 16/35 (45%)

Query: 192 AEVNEEDIEKTAWGELESESEEESEEDDEEEEEEE 226
            +    D + +        S+++S++DD E+E   
Sbjct: 112 EDARNSDADDSDSSSDSDSSDDDSDDDDSEDETAA 146


>gnl|CDD|206170 pfam14000, Packaging_FI, DNA packaging protein FI.  This family
           includes the lambda phage DNA-packaging protein FI.
           Proteins in this family are typically between 124 and
           140 amino acids in length. There is a conserved EEE
           sequence motif.
          Length = 125

 Score = 28.1 bits (63), Expect = 4.2
 Identities = 16/60 (26%), Positives = 26/60 (43%), Gaps = 1/60 (1%)

Query: 196 EEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLITPAEGLMTPSGITSVPAGLE 255
           E D+  +A  EL     E  EE D+  E +    DES L    + +++  G T +    +
Sbjct: 19  EPDVSGSA-AELAQRVAEWEEELDQVSEADTASSDESALTGHEDEVVSDQGETVIGDEGD 77


>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein.  This
           family includes archaebacterial L12, eukaryotic P0, P1
           and P2.
          Length = 88

 Score = 27.2 bits (61), Expect = 4.2
 Identities = 11/26 (42%), Positives = 18/26 (69%)

Query: 208 ESESEEESEEDDEEEEEEEEGEDESG 233
            + +   +EE+ +EEEEEEE +D+ G
Sbjct: 59  AAAAAAAAEEEKKEEEEEEEEDDDMG 84


>gnl|CDD|225880 COG3343, RpoE, DNA-directed RNA polymerase, delta subunit
           [Transcription].
          Length = 175

 Score = 28.6 bits (64), Expect = 4.4
 Identities = 14/53 (26%), Positives = 24/53 (45%)

Query: 188 ETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLITPAEG 240
           E   A   ++DI+         E  +E E D +E++EEEE ++   L    + 
Sbjct: 89  EEIQAMTEKKDIKAKDKEVDAFEEGDEDELDYDEDKEEEEDDEVDSLDDENDD 141


>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain.  This
           family represents the C-terminus (approximately 300
           residues) of proteins that are involved as binding
           partners for Prp19 as part of the nuclear pore complex.
           The family in Drosophila is necessary for pre-mRNA
           splicing, and the human protein has been found in
           purifications of the spliceosome. In the past this
           family was thought, erroneously, to be associated with
           microfibrillin.
          Length = 277

 Score = 28.7 bits (64), Expect = 4.5
 Identities = 15/32 (46%), Positives = 19/32 (59%)

Query: 206 ELESESEEESEEDDEEEEEEEEGEDESGLITP 237
           E E ES EE EE+ EEEEE +  +D    + P
Sbjct: 8   EEEDESGEEEEEESEEEEETDSEDDMEPRLKP 39


>gnl|CDD|234493 TIGR04189, surface_SprA, cell surface protein SprA.  SprA is a cell
            surface protein widely distributed in the Bacteroidetes
            lineage. In Flavobacterium johnsoniae, a species that
            shows gliding motility, mutation disrupts gliding.
          Length = 2315

 Score = 29.3 bits (66), Expect = 4.6
 Identities = 25/137 (18%), Positives = 41/137 (29%), Gaps = 13/137 (9%)

Query: 230  DESGLITPAEGLMTPSGITSVPAGLETPDSIELRKKKIESDMETNEAPQLFQVLTEKRP- 288
            ++ G +         +     P   + PD  +  K  I           +F+ LT  R  
Sbjct: 1816 NKLGFLKQRSQDNQSARKKPDPRTEKPPDKPKQNKTIIPGP-RPKILSAVFRFLTMLRNI 1874

Query: 289  --------DKLGPGMMASTHMYEVPGAGASAPGAPVKRTLPSSVELALDPSELDLVDTDA 340
                      + PG + S   + V   G  APG      L S            L    +
Sbjct: 1875 DVNYSEQKGTVLPGFLPSVKFFGVDVLGGLAPGWGFVFGLQSDDFRDEAARNGWLTTFQS 1934

Query: 341  MAARYEQQMREQTSGLQ 357
              ++Y Q+    +  LQ
Sbjct: 1935 FNSQYTQR---FSQNLQ 1948


>gnl|CDD|183731 PRK12766, PRK12766, 50S ribosomal protein L32e; Provisional.
          Length = 232

 Score = 28.7 bits (64), Expect = 4.8
 Identities = 13/27 (48%), Positives = 17/27 (62%)

Query: 205 GELESESEEESEEDDEEEEEEEEGEDE 231
           G LE   E E+E +DE  EEEEE + +
Sbjct: 57  GGLEVSEETEAEVEDEGGEEEEEEDAD 83


>gnl|CDD|239523 cd03431, DNA_Glycosylase_C, DNA glycosylase (MutY in bacteria and
           hMYH in humans) is responsible for repairing misread
           A*oxoG residues to C*G by removing the inappropriately
           paired adenine base from the DNA backbone. It belongs to
           the Nudix hydrolase superfamily and is important for the
           repair of various genotoxic lesions. Enzymes belonging
           to this superfamily requires a divalent cation, such as
           Mg2+ or Mn2+ for their activity. They are also
           recognized by a highly conserved 23-residue nudix motif
           (GX5EX7REUXEEXGU, where U = I, L or V). However, DNA
           glycosylase does not seem to contain this signature
           motif. DNA glycosylase consists of 2 domains: the
           N-terminal domain contains the catalytic properties of
           the enzyme and the C-terminal domain affects substrate
           (oxoG) binding and enzymatic turnover. The C-terminal
           domain is highly similar to MutT, based on secondary
           structure and topology, despite low sequence identity.
           MutT sanitizes the nucleotide precursor pool by
           hydrolyzing oxo-dGTP to oxo-dGMO and inorganic
           pyrophosphate. The similarity strongly suggests that the
           two proteins share a common evolutionary origin.
          Length = 118

 Score = 27.7 bits (62), Expect = 5.0
 Identities = 11/35 (31%), Positives = 17/35 (48%), Gaps = 5/35 (14%)

Query: 281 QVLTEKRPDKLGPGMMASTHMYEVPGAGASAPGAP 315
           +VL EKRP+K   G++A   ++E P          
Sbjct: 15  RVLLEKRPEK---GLLAG--LWEFPSVEWEEEADG 44


>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein.  The proteins in this family
           are designated YL1. These proteins have been shown to be
           DNA-binding and may be a transcription factor.
          Length = 238

 Score = 28.5 bits (64), Expect = 5.1
 Identities = 9/24 (37%), Positives = 16/24 (66%)

Query: 208 ESESEEESEEDDEEEEEEEEGEDE 231
           +SE +E   +D+EE E+E + E+ 
Sbjct: 60  DSEDDEPESDDEEEGEKELQREER 83


>gnl|CDD|237799 PRK14715, PRK14715, DNA polymerase II large subunit; Provisional.
          Length = 1627

 Score = 29.0 bits (65), Expect = 5.1
 Identities = 13/26 (50%), Positives = 17/26 (65%)

Query: 208 ESESEEESEEDDEEEEEEEEGEDESG 233
           E + EE+SEE   EE +EE  E+E G
Sbjct: 284 EEKDEEKSEEVKTEEVDEEFEEEEKG 309



 Score = 28.3 bits (63), Expect = 10.0
 Identities = 11/26 (42%), Positives = 17/26 (65%)

Query: 206 ELESESEEESEEDDEEEEEEEEGEDE 231
           E + E +EE  E+ + EE +EE E+E
Sbjct: 281 EKKEEKDEEKSEEVKTEEVDEEFEEE 306


>gnl|CDD|206355 pfam14187, DUF4310, Domain of unknown function (DUF4310).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 214 and 231 amino
           acids in length.
          Length = 210

 Score = 28.5 bits (64), Expect = 5.2
 Identities = 11/19 (57%), Positives = 12/19 (63%)

Query: 291 LGPGMMASTHMYEVPGAGA 309
           L  G+ A THMY V G GA
Sbjct: 20  LSAGIFAGTHMYVVYGVGA 38


>gnl|CDD|220098 pfam09057, Smac_DIABLO, Second Mitochondria-derived Activator of
           Caspases.  Second Mitochondria-derived Activator of
           Caspases promotes apoptosis by activating caspases in
           the cytochrome c/Apaf-1/caspase-9 pathway, and by
           opposing the inhibitory activity of inhibitor of
           apoptosis proteins (XIAP-BIR3). The protein assumes an
           elongated three-helix bundle structure, and forms a
           dimer in solution.
          Length = 234

 Score = 28.7 bits (64), Expect = 5.2
 Identities = 12/44 (27%), Positives = 18/44 (40%), Gaps = 5/44 (11%)

Query: 187 AETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGED 230
           AE K AE   E+I++ A       +      + E+  E    ED
Sbjct: 196 AEKKLAESKAEEIQRMA-----EYASSIDLSELEDIPEAYLRED 234


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 28.8 bits (64), Expect = 5.3
 Identities = 25/129 (19%), Positives = 45/129 (34%), Gaps = 10/129 (7%)

Query: 195 NEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLITPAEGLMTPSGITSVPAGL 254
           N+ +  +           E  +++D EE EEE+ E+E GL    + L    G  +     
Sbjct: 295 NDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKD 354

Query: 255 ETPDSIELRKKKIESDMETNEAPQLFQVLTEKRPDKLGPGMMASTHMYEVPGAGASAPGA 314
           ++           +SD++  ++  L     +K P K  P     +        G S P  
Sbjct: 355 DSDSG----DDSDDSDIDGEDSVSLVTAKKQKEPKKEEPVDSNPS------SPGNSGPAR 404

Query: 315 PVKRTLPSS 323
           P   +    
Sbjct: 405 PSPESKDKG 413


>gnl|CDD|177537 PHA03134, PHA03134, thymidine kinase; Provisional.
          Length = 340

 Score = 28.6 bits (64), Expect = 5.3
 Identities = 13/37 (35%), Positives = 21/37 (56%)

Query: 335 LVDTDAMAARYEQQMREQTSGLQKEDLSDMLAEHVAR 371
           L +TD ++  Y+ Q R+Q   L  ED + + A + AR
Sbjct: 56  LFETDVVSGIYDAQNRKQQGSLAAEDAAGITAHYQAR 92


>gnl|CDD|235785 PRK06365, PRK06365, acetyl-CoA acetyltransferase; Provisional.
          Length = 430

 Score = 28.7 bits (64), Expect = 5.3
 Identities = 22/67 (32%), Positives = 29/67 (43%), Gaps = 11/67 (16%)

Query: 32  DIDYQKLHDAFFKWQTKPRLSIHGDL----YYEGKEFETKLKEKKPGDLSAELRTAL--- 84
           D+D  +LHDA+    T   +  + DL    Y EG +F    K + PG L       L   
Sbjct: 314 DLDLIELHDAY----TSSEIQTYEDLGLCKYGEGGQFIESGKPELPGKLPVNPSGGLLAA 369

Query: 85  GMPVGPT 91
           G  VG T
Sbjct: 370 GHAVGAT 376


>gnl|CDD|215628 PLN03196, PLN03196, MOC1-like protein; Provisional.
          Length = 487

 Score = 28.9 bits (65), Expect = 5.4
 Identities = 9/46 (19%), Positives = 21/46 (45%)

Query: 181 DVFGTSAETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEE 226
           D F         E  E +   +  G+LE    E   +++++++++E
Sbjct: 433 DKFEQRMSGDFIEGEEMEPLFSMGGKLEMPGSESVSDEEDDDDDDE 478


>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
           Transcription initiation factor IIA (TFIIA) is a
           heterotrimer, the three subunits being known as alpha,
           beta, and gamma, in order of molecular weight. The N and
           C-terminal domains of the gamma subunit are represented
           in pfam02268 and pfam02751, respectively. This family
           represents the precursor that yields both the alpha and
           beta subunits. The TFIIA heterotrimer is an essential
           general transcription initiation factor for the
           expression of genes transcribed by RNA polymerase II.
           Together with TFIID, TFIIA binds to the promoter region;
           this is the first step in the formation of a
           pre-initiation complex (PIC). Binding of the rest of the
           transcription machinery follows this step. After
           initiation, the PIC does not completely dissociate from
           the promoter. Some components, including TFIIA, remain
           attached and re-initiate a subsequent round of
           transcription.
          Length = 332

 Score = 28.6 bits (64), Expect = 5.5
 Identities = 7/28 (25%), Positives = 14/28 (50%), Gaps = 3/28 (10%)

Query: 208 ESESEEESEEDDEEEEEEE---EGEDES 232
           + E +   ++DDE+  E +     +D S
Sbjct: 250 DDEGDGSDDDDDEDAIESDLDDSDDDVS 277


>gnl|CDD|182789 PRK10863, PRK10863, anti-RNA polymerase sigma factor SigE;
           Provisional.
          Length = 216

 Score = 28.5 bits (64), Expect = 5.5
 Identities = 18/56 (32%), Positives = 23/56 (41%), Gaps = 12/56 (21%)

Query: 302 YEVPGAGASAPGAPVKRTLP-----SSVELALDPSELDLVDTDAMAARYEQQMREQ 352
           Y      +  P  PV  TLP     S V L + PSE       A     +QQ++EQ
Sbjct: 121 YNGQSETSQQPETPVFNTLPMMGKASPVSLGV-PSE------AAAGNGQQQQVQEQ 169


>gnl|CDD|217783 pfam03896, TRAP_alpha, Translocon-associated protein (TRAP), alpha
           subunit.  The alpha-subunit of the TRAP complex (TRAP
           alpha) is a single-spanning membrane protein of the
           endoplasmic reticulum (ER) which is found in proximity
           of nascent polypeptide chains translocating across the
           membrane.
          Length = 281

 Score = 28.6 bits (64), Expect = 5.6
 Identities = 11/38 (28%), Positives = 21/38 (55%)

Query: 187 AETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEE 224
            E + AE +  D ++     +E +  E +EE+++EE E
Sbjct: 35  TEDEEAEDDVVDEDEEDEAVVEEDENELTEEEEDEEGE 72



 Score = 28.2 bits (63), Expect = 6.6
 Identities = 10/32 (31%), Positives = 19/32 (59%)

Query: 206 ELESESEEESEEDDEEEEEEEEGEDESGLITP 237
            ++ + E+E+  +++E E  EE EDE G +  
Sbjct: 44  VVDEDEEDEAVVEEDENELTEEEEDEEGEVKA 75


>gnl|CDD|227563 COG5238, RNA1, Ran GTPase-activating protein (RanGAP) involved in
           mRNA processing and transport [Signal transduction
           mechanisms / RNA processing and modification].
          Length = 388

 Score = 28.7 bits (64), Expect = 5.8
 Identities = 16/57 (28%), Positives = 23/57 (40%), Gaps = 6/57 (10%)

Query: 178 LYGDVFGTSAETKTAEVNEEDIEKTAWGELESESEEESEEDDEE---EEEEEEGEDE 231
            Y D+F      +  E +    E T   + ESE E E +E   E   E E  E + +
Sbjct: 324 YYEDIFEVVEVVEKQEGDVVTEEST---DEESEDEVEIDESVIEEVAEMELLEVQVD 377


>gnl|CDD|235033 PRK02363, PRK02363, DNA-directed RNA polymerase subunit delta;
           Reviewed.
          Length = 129

 Score = 27.7 bits (62), Expect = 6.2
 Identities = 12/37 (32%), Positives = 23/37 (62%)

Query: 190 KTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEE 226
           K  + +++ I+     + + + E+  EEDDE+EE+EE
Sbjct: 93  KFMDGDDDIIDDDILPDDDFDEEDLDEEDDEDEEDEE 129



 Score = 27.3 bits (61), Expect = 7.2
 Identities = 16/58 (27%), Positives = 28/58 (48%)

Query: 170 PVDETGKPLYGDVFGTSAETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEE 227
           PVDE  + +         + K     ++DI        +   EE+ +E+D+E+EE+EE
Sbjct: 72  PVDEIDEEIIPLEEKFDKKKKKFMDGDDDIIDDDILPDDDFDEEDLDEEDDEDEEDEE 129


>gnl|CDD|221173 pfam11702, DUF3295, Protein of unknown function (DUF3295).  This
           family is conserved in fungi but the function is not
           known.
          Length = 509

 Score = 28.8 bits (64), Expect = 6.3
 Identities = 23/90 (25%), Positives = 37/90 (41%), Gaps = 2/90 (2%)

Query: 154 KPLYGDVFGTGGWGKPPVDETGKPLYGDVFGTSAETKT-AEVNEEDIEKTAWGELESESE 212
           KP    + G+   GK              F     T+T  E   +D E     E E + +
Sbjct: 241 KPKMFQLGGSDELGKSLPSLMSPRKKTASFKEQVVTRTFPERTSDDDEDAIETE-EDDVD 299

Query: 213 EESEEDDEEEEEEEEGEDESGLITPAEGLM 242
           E + EDD+++ + E+  +ESG  +  E  M
Sbjct: 300 ESAIEDDDDDSDWEDSVEESGRSSVDEKTM 329


>gnl|CDD|173534 PTZ00341, PTZ00341, Ring-infected erythrocyte surface antigen;
           Provisional.
          Length = 1136

 Score = 29.0 bits (64), Expect = 6.3
 Identities = 14/34 (41%), Positives = 22/34 (64%)

Query: 198 DIEKTAWGELESESEEESEEDDEEEEEEEEGEDE 231
           D+EK  + ++   SE+ES ED+EEE   +  E+E
Sbjct: 418 DLEKQKYMDMLDGSEDESVEDNEEEHSGDANEEE 451


>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa).  Members of this
           family of proteins are part of the yeast nuclear pore
           complex-associated pre-60S ribosomal subunit. The family
           functions as a highly conserved exonuclease that is
           required for the 5'-end maturation of 5.8S and 25S
           rRNAs, demonstrating that 5'-end processing also has a
           redundant pathway. Nop25 binds late pre-60S ribosomes,
           accompanying them from the nucleolus to the nuclear
           periphery; and there is evidence for both physical and
           functional links between late 60S subunit processing and
           export.
          Length = 134

 Score = 27.7 bits (62), Expect = 6.3
 Identities = 12/45 (26%), Positives = 19/45 (42%)

Query: 192 AEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLIT 236
            E ++E+  +T   E   + E E   +    + EEE  DE    T
Sbjct: 81  EENDDEEDAETEDTEDVEDDEWEGFPEPTVTDYEEEYIDEDKYKT 125


>gnl|CDD|237192 PRK12758, PRK12758, DNA topoisomerase IV subunit A; Provisional.
          Length = 869

 Score = 28.8 bits (65), Expect = 6.4
 Identities = 13/28 (46%), Positives = 15/28 (53%)

Query: 206 ELESESEEESEEDDEEEEEEEEGEDESG 233
           ELE    EE EE+  EE EE + ED   
Sbjct: 827 ELEPLPYEEPEEETAEEPEEVDPEDVKS 854


>gnl|CDD|218421 pfam05086, Dicty_REP, Dictyostelium (Slime Mold) REP protein.  This
           family consists of REP proteins from Dictyostelium
           (Slime molds). REP protein is likely involved in
           transcription regulation and control of DNA replication,
           specifically amplification of plasmid at low copy
           numbers. The formation of homomultimers may be required
           for their regulatory activity.
          Length = 910

 Score = 28.6 bits (64), Expect = 6.6
 Identities = 14/36 (38%), Positives = 24/36 (66%), Gaps = 2/36 (5%)

Query: 196 EEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDE 231
           EE +  + + E E +  +E E+ DE+E+E+E+ EDE
Sbjct: 877 EEYL-ISQFQENEDDDADEDEDQDEDEDEDED-EDE 910


>gnl|CDD|215094 PLN00188, PLN00188, enhanced disease resistance protein (EDR2);
           Provisional.
          Length = 719

 Score = 28.6 bits (64), Expect = 6.7
 Identities = 13/60 (21%), Positives = 27/60 (45%), Gaps = 4/60 (6%)

Query: 177 PLYGDVFGTSAETKTAEVNEEDIEKTAWGELESES----EEESEEDDEEEEEEEEGEDES 232
           P+  ++   S  +K  +  +E           S +    +E+S++D+E +  E E E E+
Sbjct: 403 PVMVNMASASVSSKKNQKPQESSPSLDQTNAASRNSVMMDEDSDDDEEFQIPESEQEPET 462


>gnl|CDD|226077 COG3547, COG3547, Transposase and inactivated derivatives [DNA
           replication, recombination, and repair].
          Length = 303

 Score = 28.4 bits (63), Expect = 6.7
 Identities = 18/101 (17%), Positives = 33/101 (32%), Gaps = 6/101 (5%)

Query: 295 MMASTHMYEVPGAGASAPGAPVKRTLPSSVELALDPSELDLVDTDAMAARYEQQMREQTS 354
            + +T   E       A G PV+   P  V+             DA+A        +   
Sbjct: 40  GIEATGYSENLARYLRAEGYPVRLINPRQVKKKARGKRAKTDKADALAIAVAALRPK--- 96

Query: 355 GLQKEDLSDMLAEHVARQKNKRKR--QQTTQDNKQAKKYKE 393
            L+   +       +   K +R+R  +  TQ + + +   E
Sbjct: 97  -LEPVSVPSEEQRALRELKRRRRRLLRLRTQLDNRIRGLLE 136


>gnl|CDD|181506 PRK08622, PRK08622, galactose-6-phosphate isomerase subunit LacB;
           Reviewed.
          Length = 171

 Score = 27.7 bits (62), Expect = 6.9
 Identities = 17/51 (33%), Positives = 26/51 (50%), Gaps = 5/51 (9%)

Query: 181 DVFGTSAETKTAEVNEEDIEKTAWGELESESEEESEED--DEEEEEEEEGE 229
           D F  +AE K  E N++ IEK     LE  + +++     DE  E+ + GE
Sbjct: 121 DAF-INAEYKPTEENKKLIEKID--HLEKPNADQANPHFFDEFLEKWDRGE 168


>gnl|CDD|215313 PLN02575, PLN02575, haloacid dehalogenase-like hydrolase.
          Length = 381

 Score = 28.3 bits (63), Expect = 6.9
 Identities = 13/26 (50%), Positives = 14/26 (53%)

Query: 195 NEEDIEKTAWGELESESEEESEEDDE 220
           N  DIE   +G  E E E E EED E
Sbjct: 344 NLADIESPEFGPPEPELEMEKEEDRE 369


>gnl|CDD|222927 PHA02774, PHA02774, E1; Provisional.
          Length = 613

 Score = 28.7 bits (65), Expect = 6.9
 Identities = 10/26 (38%), Positives = 14/26 (53%)

Query: 210 ESEEESEEDDEEEEEEEEGEDESGLI 235
              E S+ +DE E+  + G D S LI
Sbjct: 21  AEAECSDGEDELEDLFDTGSDISDLI 46


>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 156 to 174 amino acids in length. This domain is
           found associated with pfam07780, pfam01728.
          Length = 154

 Score = 27.6 bits (62), Expect = 7.0
 Identities = 14/27 (51%), Positives = 16/27 (59%)

Query: 206 ELESESEEESEEDDEEEEEEEEGEDES 232
           E E E E E EE DEEE+ +E  E E 
Sbjct: 102 EEEEEEEVEVEELDEEEQIDELLEKEL 128


>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT.  This family
           consists of several bacterial cobalamin biosynthesis
           (CobT) proteins. CobT is involved in the transformation
           of precorrin-3 into cobyrinic acid.
          Length = 282

 Score = 28.2 bits (63), Expect = 7.2
 Identities = 12/39 (30%), Positives = 18/39 (46%)

Query: 195 NEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESG 233
            E++ +     E    S+  SE+ D   EE E GE E+ 
Sbjct: 231 KEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEMEAA 269


>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457).  This is
           a family of uncharacterized proteins.
          Length = 449

 Score = 28.4 bits (63), Expect = 7.5
 Identities = 12/40 (30%), Positives = 27/40 (67%)

Query: 193 EVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDES 232
           E  EE +E+    E + + +++ +EDD++++++E+ EDE 
Sbjct: 44  EAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDED 83


>gnl|CDD|216868 pfam02084, Bindin, Bindin. 
          Length = 239

 Score = 27.9 bits (62), Expect = 7.9
 Identities = 27/96 (28%), Positives = 32/96 (33%), Gaps = 10/96 (10%)

Query: 113 YGPPPSYPNLKIP---GLNAPIP-QGCSFGYHAGGWGKP------PVDETGKPLYGDVFG 162
           YG   +YP    P   G N P P Q    GY   G G P      P    G P      G
Sbjct: 1   YGNMQNYPQAMNPQMGGGNYPAPGQPAQQGYANQGMGGPVGGGGGPGAGGGAPGGPVGGG 60

Query: 163 TGGWGKPPVDETGKPLYGDVFGTSAETKTAEVNEED 198
            GG G PP          D      +  ++ + E D
Sbjct: 61  GGGSGGPPGGGEVAGEAEDAMSEFDDYSSSSIEEGD 96


>gnl|CDD|219069 pfam06512, Na_trans_assoc, Sodium ion transport-associated.
           Members of this family contain a region found
           exclusively in eukaryotic sodium channels or their
           subunits, many of which are voltage-gated. Members very
           often also contain between one and four copies of
           pfam00520 and, less often, one copy of pfam00612.
          Length = 230

 Score = 27.7 bits (62), Expect = 8.4
 Identities = 18/58 (31%), Positives = 25/58 (43%), Gaps = 6/58 (10%)

Query: 188 ETKTAEVNEEDIEKTAWG----ELESESEEESEED--DEEEEEEEEGEDESGLITPAE 239
              +    EE  + +A      E +S S E S  D    EEEEEE  E+E  +  P +
Sbjct: 124 SESSYGFKEESKKGSAETLKLEEKDSSSSEGSTVDLEPPEEEEEEIAEEEEEVKEPED 181


>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
           [Transcription / Chromatin structure and dynamics].
          Length = 279

 Score = 28.0 bits (62), Expect = 9.1
 Identities = 12/26 (46%), Positives = 14/26 (53%)

Query: 208 ESESEEESEEDDEEEEEEEEGEDESG 233
           E E   E  +  EEEE+EE G D  G
Sbjct: 177 EEEERLEESDGREEEEDEEVGSDSYG 202


>gnl|CDD|235378 PRK05256, PRK05256, condesin subunit E; Provisional.
          Length = 238

 Score = 27.6 bits (62), Expect = 9.1
 Identities = 10/25 (40%), Positives = 18/25 (72%)

Query: 207 LESESEEESEEDDEEEEEEEEGEDE 231
           +E+  ++ES  +DE EEE+ + E+E
Sbjct: 213 IENHLQQESSLNDETEEEQPDEEEE 237


>gnl|CDD|222648 pfam14283, DUF4366, Domain of unknown function (DUF4366).  This
           family of proteins is found in bacteria and eukaryotes.
           Proteins in this family are typically between 227 and
           387 amino acids in length.
          Length = 213

 Score = 27.7 bits (62), Expect = 9.2
 Identities = 15/39 (38%), Positives = 18/39 (46%), Gaps = 1/39 (2%)

Query: 205 GELESESEEESEEDDEEEEEEEEGEDESGLITPAEGLMT 243
           GE E E  +    D  E   E   E+E G +TP EG  T
Sbjct: 25  GEPEPEVPQTEATDPSEVVVETVTEEEGGPLTP-EGNAT 62


>gnl|CDD|115196 pfam06524, NOA36, NOA36 protein.  This family consists of several
           NOA36 proteins which contain 29 highly conserved
           cysteine residues. The function of this protein is
           unknown.
          Length = 314

 Score = 28.0 bits (62), Expect = 9.3
 Identities = 9/35 (25%), Positives = 18/35 (51%)

Query: 198 DIEKTAWGELESESEEESEEDDEEEEEEEEGEDES 232
           D +   +G    + E  S  D +EEE+ ++ ++E 
Sbjct: 263 DFKGVNYGYESDDDEGSSSNDYDEEEDGDDDDNED 297


>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen.  This
           family consists of several Theileria P67 surface
           antigens. A stage specific surface antigen of Theileria
           parva, p67, is the basis for the development of an
           anti-sporozoite vaccine for the control of East Coast
           fever (ECF) in cattle. The antigen has been shown to
           contain five distinct linear peptide sequences
           recognised by sporozoite-neutralising murine monoclonal
           antibodies.
          Length = 727

 Score = 28.1 bits (62), Expect = 9.7
 Identities = 15/52 (28%), Positives = 27/52 (51%)

Query: 208 ESESEEESEEDDEEEEEEEEGEDESGLITPAEGLMTPSGITSVPAGLETPDS 259
           +++SEE+ ++ +EE+ +    +D  G      G+ T SG T+    L T  S
Sbjct: 113 KTDSEEDDDDSEEEDNKSTSSKDGKGSKKTQPGVSTSSGSTTSGTDLNTKQS 164


>gnl|CDD|223003 PHA03169, PHA03169, hypothetical protein; Provisional.
          Length = 413

 Score = 28.0 bits (62), Expect = 10.0
 Identities = 14/76 (18%), Positives = 22/76 (28%)

Query: 162 GTGGWGKPPVDETGKPLYGDVFGTSAETKTAEVNEEDIEKTAWGELESESEEESEEDDEE 221
                 +P  D  G P       +       +   E    T        +++  E +DE 
Sbjct: 178 PEPPTSEPEPDSPGPPQSETPTSSPPPQSPPDEPGEPQSPTPQQAPSPNTQQAVEHEDEP 237

Query: 222 EEEEEEGEDESGLITP 237
            E E EG    G  + 
Sbjct: 238 TEPEREGPPFPGHRSH 253


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.308    0.130    0.376 

Gapped
Lambda     K      H
   0.267   0.0685    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 20,668,806
Number of extensions: 2059186
Number of successful extensions: 7490
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5337
Number of HSP's successfully gapped: 567
Length of query: 396
Length of database: 10,937,602
Length adjustment: 99
Effective length of query: 297
Effective length of database: 6,546,556
Effective search space: 1944327132
Effective search space used: 1944327132
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 60 (27.0 bits)