RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy7791
         (551 letters)



>gnl|CDD|217253 pfam02854, MIF4G, MIF4G domain.  MIF4G is named after Middle domain
           of eukaryotic initiation factor 4G (eIF4G). Also occurs
           in NMD2p and CBP80. The domain is rich in alpha-helices
           and may contain multiple alpha-helical repeats. In
           eIF4G, this domain binds eIF4A, eIF3, RNA and DNA.
          Length = 198

 Score =  102 bits (257), Expect = 3e-25
 Identities = 54/205 (26%), Positives = 92/205 (44%), Gaps = 18/205 (8%)

Query: 59  KSIHGSVNKVNTGNIGIIARKLFQENII--RGRGLLTRTILQAQAASPTFTNVYAALVDI 116
           K + G +NK++  N   I  +L + N+        +   I +     P F   YA L   
Sbjct: 2   KKVKGLLNKLSPSNFEKIIDELLKLNMSDKNLLKYILELIFEKAVEEPNFIPAYARLCAG 61

Query: 117 INSKFPSIGELLLNRCIQQFKR---SFKRNDKALCINSVTFIAHLVNQQVAHEIIVLEIL 173
           +NSK P  GELLLNR  ++F++     +R ++   +  V F+  L N +V  E I+LE L
Sbjct: 62  LNSKNPDFGELLLNRLQEEFEKGVEEEERGNRKRRLGLVRFLGELFNFKVLTEKIILECL 121

Query: 174 TLLVETPTN---DSDFSLEPYSKHKLHHLRSLMCMPLWWISSIQSFQALESFFS--TDVY 228
             L+E+ T      +FSLE         L  L       +   +  + ++          
Sbjct: 122 KELLESLTEEDPRDEFSLEIL-------LLLLTTCG-KLLDREKLKKLMDEILERIQKYL 173

Query: 229 NSSKGKLEKRVQYMIEVLAQVRKDN 253
            S+  +L  R+++M+E L ++RK+ 
Sbjct: 174 LSADPELSSRLRFMLEDLIELRKNK 198


>gnl|CDD|214713 smart00543, MIF4G, Middle domain of eukaryotic initiation factor 4G
           (eIF4G).  Also occurs in NMD2p and CBP80. The domain is
           rich in alpha-helices and may contain multiple
           alpha-helical repeats. In eIF4G, this domain binds
           eIF4A, eIF3, RNA and DNA. Ponting (TiBS) "Novel eIF4G
           domain homologues (in press).
          Length = 200

 Score = 95.9 bits (239), Expect = 8e-23
 Identities = 55/211 (26%), Positives = 93/211 (44%), Gaps = 28/211 (13%)

Query: 59  KSIHGSVNKVNTGNIGIIARKLFQENIIRG--RGLLTRTILQAQAASPTFTNVYAALVDI 116
           K + G +NK++  N   I ++L + N      R  +   I +     P F   YA L  +
Sbjct: 2   KKVKGLINKLSPSNFESIIKELLKLNNSDKNLRKYILELIFEKAVEEPNFIPAYARLCAL 61

Query: 117 INSKFPSIGELLLNRCIQQFKR---SFKRNDKALCINSVTFIAHLVNQQVAHEIIVLEIL 173
           +N+K P  G LLL R  ++F++   S + +DK   +  V F+  L N QV    I+LE+L
Sbjct: 62  LNAKNPDFGSLLLERLQEEFEKGLESEEESDKQRRLGLVRFLGELYNFQVLTSKIILELL 121

Query: 174 TLLVETPTN----DSDFSLEPYSKHKLHHLRSLMCMPLWWI----SSIQSFQALESFFS- 224
             L+   T      SDFS+E             +   L          +S + L+     
Sbjct: 122 KELLNDLTKLDPPRSDFSVE------------CLLSLLPTCGKDLEREKSPKLLDEILER 169

Query: 225 --TDVYNSSKGKLEKRVQYMIEVLAQVRKDN 253
               +    K +L  R+++M+E+L ++RK+ 
Sbjct: 170 LQDYLLKKDKTELSSRLRFMLELLIELRKNK 200


>gnl|CDD|214714 smart00544, MA3, Domain in DAP-5, eIF4G, MA-3 and other proteins.
           Highly alpha-helical. May contain repeats and/or regions
           similar to MIF4G domains Ponting (TIBS) "Novel eIF4G
           domain homologues" in press.
          Length = 113

 Score = 60.7 bits (148), Expect = 2e-11
 Identities = 25/101 (24%), Positives = 45/101 (44%), Gaps = 10/101 (9%)

Query: 358 LRRTIYLTIHSSL---DFEECAHKLLRMQLKPGQESELCHIVHIDTSLWGLFGDQLRTYE 414
           L++ I+L I   L   D +E  H LL ++L P Q  E+  ++           ++ RTY 
Sbjct: 1   LKKKIFLIIEEYLSSGDTDEAVHCLLELKL-PEQHHEVVKVLLTCAL------EEKRTYR 53

Query: 415 KFFGLLAQRFCQINKMYVAPLEQIFRESYSTVHRLDINKLR 455
           + + +L  R CQ N +     E+ F      +  L+++   
Sbjct: 54  EMYSVLLSRLCQANVISTKQFEKGFWRLLEDIEDLELDIPN 94


>gnl|CDD|111714 pfam02847, MA3, MA3 domain.  Domain in DAP-5, eIF4G, MA-3 and other
           proteins. Highly alpha-helical. May contain repeats
           and/or regions similar to MIF4G domains.
          Length = 113

 Score = 57.3 bits (139), Expect = 3e-10
 Identities = 23/101 (22%), Positives = 47/101 (46%), Gaps = 10/101 (9%)

Query: 358 LRRTIYLTIHSSL---DFEECAHKLLRMQLKPGQESELCHIVHIDTSLWGLFGDQLRTYE 414
           L++ I+L +   L   D++E A  LL + L P Q  E+  ++ +         ++ +TY 
Sbjct: 1   LKKKIFLILEEYLSSGDYDEAARCLLELGL-PFQHHEVVKVLLVCAL------EEKKTYR 53

Query: 415 KFFGLLAQRFCQINKMYVAPLEQIFRESYSTVHRLDINKLR 455
           + +G+L  R CQ   +     E+ F     ++  L+++   
Sbjct: 54  EMYGVLLSRLCQFGLISTKQFEKGFSRLLESLEDLELDIPD 94


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 43.8 bits (103), Expect = 2e-04
 Identities = 22/52 (42%), Positives = 37/52 (71%), Gaps = 4/52 (7%)

Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENKETI-IDNTETNLVALRRTIYL 364
           GD +E++E+ ++EE+E+EEE E EE E+EN+E + ++  ET     ++ IYL
Sbjct: 861 GDSEEEEEEEEEEEEEEEEEEEEEEEEEENEEPLSLEWPETR---QKQAIYL 909



 Score = 41.9 bits (98), Expect = 0.001
 Identities = 15/32 (46%), Positives = 24/32 (75%)

Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
           G + E++E+ ++EE+E+EEE E EE E+E  E
Sbjct: 860 GGDSEEEEEEEEEEEEEEEEEEEEEEEEEENE 891



 Score = 40.7 bits (95), Expect = 0.002
 Identities = 25/76 (32%), Positives = 37/76 (48%), Gaps = 8/76 (10%)

Query: 278 TLDGVKDTQDILNVFQYDPEYLMNEEKYTTLRREILGDEDEDDE--------DGDDEEDE 329
           T  G KD  +  +  Q D   + +E     L  E  G+  +D++        DG D E+E
Sbjct: 807 TEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEE 866

Query: 330 DEEESEAEEGEKENKE 345
           +EEE E EE E+E +E
Sbjct: 867 EEEEEEEEEEEEEEEE 882



 Score = 37.7 bits (87), Expect = 0.017
 Identities = 15/32 (46%), Positives = 23/32 (71%)

Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
           G +  D E+ ++EE+E+EEE E EE E+E +E
Sbjct: 857 GSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEE 888



 Score = 36.5 bits (84), Expect = 0.043
 Identities = 14/32 (43%), Positives = 23/32 (71%)

Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
            D  + +E+ ++EE+E+EEE E EE E+E +E
Sbjct: 858 SDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEE 889



 Score = 32.7 bits (74), Expect = 0.66
 Identities = 21/60 (35%), Positives = 26/60 (43%), Gaps = 8/60 (13%)

Query: 302 EEKYTTLRREILGDEDEDD----EDGDDEEDEDEEESEAEEGEKENKETIIDNTETNLVA 357
           EE       E  G EDE +    E+G++ EDE E E+E     K   ET  D  ET    
Sbjct: 713 EEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEG----KHEVETEGDRKETEHEG 768



 Score = 29.6 bits (66), Expect = 6.1
 Identities = 13/41 (31%), Positives = 19/41 (46%)

Query: 302 EEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKE 342
           E K    + E   +E E + + + E  EDE E E  E  +E
Sbjct: 700 EAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEE 740


>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin.  Nucleoplasmins are also
           known as chromatin decondensation proteins. They bind to
           core histones and transfer DNA to them in a reaction
           that requires ATP. This is thought to play a role in the
           assembly of regular nucleosomal arrays.
          Length = 146

 Score = 40.4 bits (95), Expect = 4e-04
 Identities = 13/35 (37%), Positives = 23/35 (65%)

Query: 312 ILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKET 346
              DE +DDE+ ++EED++E++ E E  E+E+   
Sbjct: 109 SEEDESDDDEEDEEEEDDEEDDDEDESEEEESPVK 143



 Score = 38.5 bits (90), Expect = 0.002
 Identities = 11/31 (35%), Positives = 22/31 (70%)

Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENK 344
            DE++++E+ D+E+D+++E  E E   K+ K
Sbjct: 116 DDEEDEEEEDDEEDDDEDESEEEESPVKKVK 146



 Score = 37.7 bits (88), Expect = 0.003
 Identities = 14/36 (38%), Positives = 26/36 (72%), Gaps = 1/36 (2%)

Query: 311 EILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKET 346
           E   D+DE+DE+ +++++ED++E E+EE E   K+ 
Sbjct: 111 EDESDDDEEDEE-EEDDEEDDDEDESEEEESPVKKV 145



 Score = 35.4 bits (82), Expect = 0.023
 Identities = 9/37 (24%), Positives = 27/37 (72%)

Query: 310 REILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKET 346
           + ++  E+++ +D +++E+E+++E + +E E E +E+
Sbjct: 104 QHLVASEEDESDDDEEDEEEEDDEEDDDEDESEEEES 140


>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II).  Bone
           sialoprotein (BSP) is a major structural protein of the
           bone matrix that is specifically expressed by
           fully-differentiated osteoblasts. The expression of bone
           sialoprotein (BSP) is normally restricted to mineralised
           connective tissues of bones and teeth where it has been
           associated with mineral crystal formation. However, it
           has been found that ectopic expression of BSP occurs in
           various lesions, including oral and extraoral
           carcinomas, in which it has been associated with the
           formation of microcrystalline deposits and the
           metastasis of cancer cells to bone.
          Length = 291

 Score = 40.4 bits (94), Expect = 0.001
 Identities = 19/38 (50%), Positives = 27/38 (71%)

Query: 316 EDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTET 353
           EDE DED ++EE+E+EEE+E EE E+    T  ++TE 
Sbjct: 132 EDESDEDEEEEEEEEEEEAEVEENEQGTNGTSTNSTEV 169



 Score = 38.1 bits (88), Expect = 0.007
 Identities = 17/34 (50%), Positives = 21/34 (61%)

Query: 321 EDGDDEEDEDEEESEAEEGEKENKETIIDNTETN 354
           ED  DE++E+EEE E EE E E  E   + T TN
Sbjct: 132 EDESDEDEEEEEEEEEEEAEVEENEQGTNGTSTN 165



 Score = 37.4 bits (86), Expect = 0.015
 Identities = 21/60 (35%), Positives = 32/60 (53%), Gaps = 5/60 (8%)

Query: 291 VFQYDPEYLMNEEK--YTTLRREIL---GDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
           VF+Y P Y + +    Y  L+R  +    D  E++ DGD  E+E EEE+  EE   E+ +
Sbjct: 20  VFKYRPRYYLYKHAYFYPPLKRFSVQSGSDSSEENGDGDSSEEEGEEETSNEEENNEDSD 79



 Score = 28.9 bits (64), Expect = 7.1
 Identities = 10/25 (40%), Positives = 17/25 (68%)

Query: 326 EEDEDEEESEAEEGEKENKETIIDN 350
           +EDE +E+ E EE E+E +  + +N
Sbjct: 131 KEDESDEDEEEEEEEEEEEAEVEEN 155


>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein.  The proteins in this family
           are designated YL1. These proteins have been shown to be
           DNA-binding and may be a transcription factor.
          Length = 238

 Score = 40.1 bits (94), Expect = 0.002
 Identities = 20/62 (32%), Positives = 31/62 (50%), Gaps = 10/62 (16%)

Query: 284 DTQDILNVFQYDPEYLMNEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKEN 343
            T  +    + D E+ + EE+          +E+E D D DD ED++ E  + EEGEKE 
Sbjct: 29  WTYLLFEEEEDDEEFEIEEEE----------EEEEVDSDFDDSEDDEPESDDEEEGEKEL 78

Query: 344 KE 345
           + 
Sbjct: 79  QR 80



 Score = 31.6 bits (72), Expect = 0.79
 Identities = 16/35 (45%), Positives = 21/35 (60%), Gaps = 4/35 (11%)

Query: 313 LGDEDEDDEDGDD----EEDEDEEESEAEEGEKEN 343
           L +E E+DE        EE+ED+EE E EE E+E 
Sbjct: 18  LEEELEEDEFFWTYLLFEEEEDDEEFEIEEEEEEE 52



 Score = 31.6 bits (72), Expect = 0.80
 Identities = 14/30 (46%), Positives = 20/30 (66%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENK 344
           D+ EDDE   D+E+E E+E + EE  K+ K
Sbjct: 59  DDSEDDEPESDDEEEGEKELQREERLKKKK 88


>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa).  Members of this
           family of proteins are part of the yeast nuclear pore
           complex-associated pre-60S ribosomal subunit. The family
           functions as a highly conserved exonuclease that is
           required for the 5'-end maturation of 5.8S and 25S
           rRNAs, demonstrating that 5'-end processing also has a
           redundant pathway. Nop25 binds late pre-60S ribosomes,
           accompanying them from the nucleolus to the nuclear
           periphery; and there is evidence for both physical and
           functional links between late 60S subunit processing and
           export.
          Length = 134

 Score = 38.5 bits (90), Expect = 0.002
 Identities = 16/45 (35%), Positives = 26/45 (57%)

Query: 309 RREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTET 353
           R+E L   +E+++D +D E ED E+ E +E E   + T+ D  E 
Sbjct: 72  RKEALKLLEEENDDEEDAETEDTEDVEDDEWEGFPEPTVTDYEEE 116



 Score = 29.2 bits (66), Expect = 2.4
 Identities = 9/36 (25%), Positives = 19/36 (52%)

Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIID 349
            DE++ + +  ++ ++DE E   E    + +E  ID
Sbjct: 84  DDEEDAETEDTEDVEDDEWEGFPEPTVTDYEEEYID 119



 Score = 27.7 bits (62), Expect = 8.0
 Identities = 9/38 (23%), Positives = 17/38 (44%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTE 352
           ++DE+D + +D ED +++E E                E
Sbjct: 83  NDDEEDAETEDTEDVEDDEWEGFPEPTVTDYEEEYIDE 120


>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
           subunit 1; Provisional.
          Length = 319

 Score = 39.6 bits (93), Expect = 0.003
 Identities = 13/53 (24%), Positives = 31/53 (58%)

Query: 293 QYDPEYLMNEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
           + +PE +  +E+      E   +E+E+D+  + E++++E+E E EE + +  +
Sbjct: 266 KGEPEVVGGDEEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGD 318



 Score = 35.0 bits (81), Expect = 0.091
 Identities = 11/27 (40%), Positives = 21/27 (77%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEK 341
           D+  + ED D+E++++EEE + +EG+K
Sbjct: 293 DDYSESEDEDEEDEDEEEEEDDDEGDK 319



 Score = 33.9 bits (78), Expect = 0.17
 Identities = 17/49 (34%), Positives = 31/49 (63%), Gaps = 5/49 (10%)

Query: 311 EILGDEDEDDED-----GDDEEDEDEEESEAEEGEKENKETIIDNTETN 354
           E++G ++ED E+      ++EE++D  ESE E+ E E++E   D+ E +
Sbjct: 270 EVVGGDEEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGD 318



 Score = 28.5 bits (64), Expect = 9.7
 Identities = 11/36 (30%), Positives = 22/36 (61%)

Query: 303 EKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEE 338
           EK      E    E ED+++ D++E+E+E++ E ++
Sbjct: 284 EKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGDK 319


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 38.8 bits (91), Expect = 0.007
 Identities = 18/34 (52%), Positives = 27/34 (79%)

Query: 313 LGDEDEDDEDGDDEEDEDEEESEAEEGEKENKET 346
             +EDE+DED DDE+DE+EEE E E+ +K++ E+
Sbjct: 353 DEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAES 386



 Score = 37.7 bits (88), Expect = 0.015
 Identities = 14/32 (43%), Positives = 23/32 (71%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKET 346
           D  +++ED +DE+ +DE++ E EE EKE K+ 
Sbjct: 350 DLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKK 381



 Score = 36.9 bits (86), Expect = 0.033
 Identities = 15/36 (41%), Positives = 27/36 (75%)

Query: 311 EILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKET 346
           ++   ++E+DE+ +D +DED+EE E EE EK+ K++
Sbjct: 348 DVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKS 383



 Score = 36.9 bits (86), Expect = 0.034
 Identities = 16/38 (42%), Positives = 26/38 (68%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTE 352
           DE+ED+ED D ++++DEEE E E+ +K+ K      +E
Sbjct: 353 DEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSE 390



 Score = 36.5 bits (85), Expect = 0.037
 Identities = 12/33 (36%), Positives = 24/33 (72%)

Query: 313 LGDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
            G +DED+ED DD+ +E+EE+ +  + E++ ++
Sbjct: 328 DGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEED 360



 Score = 35.4 bits (82), Expect = 0.089
 Identities = 12/39 (30%), Positives = 25/39 (64%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTET 353
           D D  DE+ D+E+++ ++E + EE E+E ++    + E+
Sbjct: 348 DVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAES 386



 Score = 35.0 bits (81), Expect = 0.13
 Identities = 12/40 (30%), Positives = 23/40 (57%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETN 354
            +  D+E+ +++ED D+E+ E EE E++ K+       T 
Sbjct: 349 VDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTR 388



 Score = 34.6 bits (80), Expect = 0.15
 Identities = 10/36 (27%), Positives = 25/36 (69%)

Query: 311 EILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKET 346
           ++  +E++++++  D+ED++EEE E +E +K+    
Sbjct: 350 DLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAE 385



 Score = 33.8 bits (78), Expect = 0.28
 Identities = 20/76 (26%), Positives = 42/76 (55%), Gaps = 3/76 (3%)

Query: 280 DGVKDTQDILNVFQYDPE---YLMNEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEA 336
           D  +   D+ + F+ D +    L   E+      + + DEDE+D+D D EE+E++ +   
Sbjct: 294 DSKESADDLDDEFEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSD 353

Query: 337 EEGEKENKETIIDNTE 352
           EE ++E++++  ++ E
Sbjct: 354 EEEDEEDEDSDDEDDE 369



 Score = 33.4 bits (77), Expect = 0.38
 Identities = 12/40 (30%), Positives = 25/40 (62%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETN 354
           +E+E+D D  DEE+++E+E   +E ++E +E   +  +  
Sbjct: 343 EEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKK 382



 Score = 33.0 bits (76), Expect = 0.41
 Identities = 9/49 (18%), Positives = 25/49 (51%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETNLVALRRTIY 363
           +E++ D   ++E++EDE+  + ++ E+E +E      ++         +
Sbjct: 345 EEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSELPF 393



 Score = 33.0 bits (76), Expect = 0.50
 Identities = 10/35 (28%), Positives = 23/35 (65%)

Query: 311 EILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
           E   D+ E++E+  D  DE+E+E + +  +++++E
Sbjct: 336 EDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEE 370



 Score = 32.3 bits (74), Expect = 0.82
 Identities = 11/31 (35%), Positives = 21/31 (67%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
           D +E++ED D  ++E++EE E  + E + +E
Sbjct: 341 DLEEEEEDVDLSDEEEDEEDEDSDDEDDEEE 371



 Score = 31.5 bits (72), Expect = 1.5
 Identities = 12/46 (26%), Positives = 30/46 (65%)

Query: 310 REILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETNL 355
            E +   DE++++ D++ D++++E E EE +++ K+   ++T + L
Sbjct: 346 EEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSEL 391



 Score = 30.7 bits (70), Expect = 2.3
 Identities = 13/36 (36%), Positives = 24/36 (66%), Gaps = 1/36 (2%)

Query: 308 LRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKEN 343
           LRR + G+E++D+E+ D +E  D+ + E E  + +N
Sbjct: 279 LRR-MRGEEEDDEEEEDSKESADDLDDEFEPDDDDN 313



 Score = 30.0 bits (68), Expect = 4.1
 Identities = 12/43 (27%), Positives = 24/43 (55%)

Query: 313 LGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETNL 355
             DE+E+++  DDE++ED+++   EE E  +     ++ E   
Sbjct: 320 EEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDED 362



 Score = 29.6 bits (67), Expect = 5.1
 Identities = 12/48 (25%), Positives = 23/48 (47%), Gaps = 10/48 (20%)

Query: 313 LGDEDEDDEDGDDEEDE----------DEEESEAEEGEKENKETIIDN 350
             D+ +DD+ GD   D+          DEE+ E +   K++K+ ++  
Sbjct: 134 DDDDFDDDDLGDLASDDRAAHFGGGEDDEEDEEEQPERKKSKKEVMKE 181



 Score = 29.2 bits (66), Expect = 6.5
 Identities = 15/49 (30%), Positives = 24/49 (48%), Gaps = 1/49 (2%)

Query: 302 EEKYTTLRREILGDEDEDDEDGD-DEEDEDEEESEAEEGEKENKETIID 349
           EE+ +    + L DE E D+D +      +E+E E E+G  +  E   D
Sbjct: 291 EEEDSKESADDLDDEFEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDD 339


>gnl|CDD|221177 pfam11708, Slu7, Pre-mRNA splicing Prp18-interacting factor.  The
           spliceosome, an assembly of snRNAs (U1, U2, U4/U6, and
           U5) and proteins, catalyzes the excision of introns from
           pre-mRNAs in two successive trans-esterification
           reactions. Step 2 depends upon integral spliceosome
           constituents such as U5 snRNA and Prp8 and
           non-spliceosomal proteins Prp16, Slu7, Prp18, and Prp22.
           ATP hydrolysis by the DEAH-box enzyme Prp16 promotes a
           conformational change in the spliceosome that leads to
           protection of the 3'ss from targeted RNase H cleavage.
           This change, which probably reflects binding of the 3'ss
           PyAG in the catalytic centre of the spliceosome,
           requires the ordered recruitment of Slu7, Prp18, and
           Prp22 to the spliceosome. There is a close functional
           relationship between Prp8, Prp18, and Slu7, and Prp18
           interacts with Slu7, so that together they recruit Prp22
           to the spliceosome. Most members of the family carry a
           zinc-finger of the CCHC-type upstream of this domain.
          Length = 236

 Score = 37.8 bits (88), Expect = 0.008
 Identities = 25/65 (38%), Positives = 33/65 (50%), Gaps = 4/65 (6%)

Query: 294 YDP-EYLMNEEKYTTLR--REILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDN 350
           YDP EY    E+Y  L   R+ L  + EDD D  D EDEDEE    +E +  + ++    
Sbjct: 14  YDPSEYKEVVEEYEKLEELRKKLKAKKEDDSDDTDYEDEDEE-LYLDESDMGDSDSKTRT 72

Query: 351 TETNL 355
           T  NL
Sbjct: 73  TVRNL 77


>gnl|CDD|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region.  The domain
           is found in the primary vegetative sigma factor. The
           function of this domain is unclear and can be removed
           without loss of function.
          Length = 211

 Score = 37.6 bits (88), Expect = 0.008
 Identities = 11/26 (42%), Positives = 21/26 (80%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGE 340
           +ED +D+D DDE++++++E EA+ G 
Sbjct: 50  EEDLEDDDDDDEDEDEDDEEEADLGP 75



 Score = 37.6 bits (88), Expect = 0.008
 Identities = 10/31 (32%), Positives = 20/31 (64%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
            ++ED ED DD++++++E+ E E     + E
Sbjct: 48  LDEEDLEDDDDDDEDEDEDDEEEADLGPDPE 78



 Score = 36.4 bits (85), Expect = 0.019
 Identities = 12/28 (42%), Positives = 20/28 (71%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKE 342
           + DE+D + DD++DEDE+E + EE +  
Sbjct: 47  ELDEEDLEDDDDDDEDEDEDDEEEADLG 74



 Score = 36.4 bits (85), Expect = 0.021
 Identities = 11/29 (37%), Positives = 21/29 (72%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKEN 343
           + + D+ED +D++D+DE+E E +E E + 
Sbjct: 45  ESELDEEDLEDDDDDDEDEDEDDEEEADL 73



 Score = 31.8 bits (73), Expect = 0.63
 Identities = 9/34 (26%), Positives = 20/34 (58%)

Query: 316 EDEDDEDGDDEEDEDEEESEAEEGEKENKETIID 349
             E + D +D ED+D+++ + +E ++E  +   D
Sbjct: 43  AIESELDEEDLEDDDDDDEDEDEDDEEEADLGPD 76



 Score = 30.2 bits (69), Expect = 1.9
 Identities = 11/28 (39%), Positives = 19/28 (67%)

Query: 311 EILGDEDEDDEDGDDEEDEDEEESEAEE 338
           E L D+D+DDED D++++E+ +     E
Sbjct: 51  EDLEDDDDDDEDEDEDDEEEADLGPDPE 78


>gnl|CDD|221490 pfam12253, CAF1A, Chromatin assembly factor 1 subunit A.  The CAF-1
           or chromatin assembly factor-1 consists of three
           subunits, and this is the first, or A. The A domain is
           uniquely required for the progression of S phase in
           mouse cells, independent of its ability to promote
           histone deposition but dependent on its ability to
           interact with HP1 - heterochromatin protein 1-rich
           heterochromatin domains next to centromeres that are
           crucial for chromosome segregation during mitosis. This
           HP1-CAF-1 interaction module functions as a built-in
           replication control for heterochromatin, which, like a
           control barrier, has an impact on S-phase progression in
           addition to DNA-based checkpoints.
          Length = 76

 Score = 34.9 bits (81), Expect = 0.009
 Identities = 10/24 (41%), Positives = 19/24 (79%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEE 338
           +E+E+ ED + E++EDEEE + ++
Sbjct: 49  EEEEEGEDLESEDEEDEEEDDDDD 72



 Score = 33.0 bits (76), Expect = 0.050
 Identities = 12/29 (41%), Positives = 20/29 (68%), Gaps = 1/29 (3%)

Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKE 342
             E E++E+G+D E EDEE+ E E+ + +
Sbjct: 45  DAEWEEEEEGEDLESEDEEDEE-EDDDDD 72



 Score = 31.8 bits (73), Expect = 0.12
 Identities = 11/20 (55%), Positives = 15/20 (75%)

Query: 311 EILGDEDEDDEDGDDEEDED 330
           E L  EDE+DE+ DD++D D
Sbjct: 55  EDLESEDEEDEEEDDDDDMD 74



 Score = 31.4 bits (72), Expect = 0.16
 Identities = 7/23 (30%), Positives = 17/23 (73%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAE 337
           +E ED E  D+E++E++++ + +
Sbjct: 52  EEGEDLESEDEEDEEEDDDDDMD 74



 Score = 28.3 bits (64), Expect = 2.0
 Identities = 7/20 (35%), Positives = 16/20 (80%)

Query: 314 GDEDEDDEDGDDEEDEDEEE 333
           G++ E +++ D+EED+D++ 
Sbjct: 54  GEDLESEDEEDEEEDDDDDM 73



 Score = 26.8 bits (60), Expect = 6.7
 Identities = 6/20 (30%), Positives = 15/20 (75%)

Query: 314 GDEDEDDEDGDDEEDEDEEE 333
            D + +DE+ ++E+D+D+ +
Sbjct: 55  EDLESEDEEDEEEDDDDDMD 74


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
           Rpc31.  RNA polymerase III contains seventeen subunits
           in yeasts and in human cells. Twelve of these are akin
           to RNA polymerase I or II and the other five are RNA pol
           III-specific, and form the functionally distinct groups
           (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
           Rpc34 and Rpc82 form a cluster of enzyme-specific
           subunits that contribute to transcription initiation in
           S.cerevisiae and H.sapiens. There is evidence that these
           subunits are anchored at or near the N-terminal Zn-fold
           of Rpc1, itself prolonged by a highly conserved but RNA
           polymerase III-specific domain.
          Length = 221

 Score = 37.4 bits (87), Expect = 0.010
 Identities = 14/44 (31%), Positives = 30/44 (68%)

Query: 300 MNEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKEN 343
           M E+K   L  E + +EDE DE+ ++EE+E++E+ + ++ + ++
Sbjct: 155 MLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDDDD 198



 Score = 36.3 bits (84), Expect = 0.024
 Identities = 14/31 (45%), Positives = 22/31 (70%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
            ED D+ED  DEE+E+EEE E E+ + ++ +
Sbjct: 165 AEDVDEEDEKDEEEEEEEEEEDEDFDDDDDD 195


>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein.  This
           family includes the radial spoke head proteins RSP4 and
           RSP6 from Chlamydomonas reinhardtii, and several
           eukaryotic homologues, including mammalian RSHL1, the
           protein product of a familial ciliary dyskinesia
           candidate gene.
          Length = 481

 Score = 38.1 bits (89), Expect = 0.010
 Identities = 16/58 (27%), Positives = 37/58 (63%)

Query: 282 VKDTQDILNVFQYDPEYLMNEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEG 339
           V++  D L+ + +  ++++ + + T +  E   +E+E +++ ++EE+E+ EE E EEG
Sbjct: 316 VRELADSLDNWVHHVQHILPQGRCTWVNPEQKDEEEEQEDEEEEEEEEEPEEPEPEEG 373



 Score = 33.5 bits (77), Expect = 0.27
 Identities = 13/28 (46%), Positives = 20/28 (71%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKE 342
            E +D+E+  ++E+E+EEE E EE E E
Sbjct: 344 PEQKDEEEEQEDEEEEEEEEEPEEPEPE 371



 Score = 30.0 bits (68), Expect = 4.1
 Identities = 13/28 (46%), Positives = 20/28 (71%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKE 342
            +DE++E  D+EE+E+EEE E  E E+ 
Sbjct: 346 QKDEEEEQEDEEEEEEEEEPEEPEPEEG 373



 Score = 29.6 bits (67), Expect = 4.2
 Identities = 11/28 (39%), Positives = 17/28 (60%)

Query: 318 EDDEDGDDEEDEDEEESEAEEGEKENKE 345
             ++  ++EE EDEEE E EE  +E + 
Sbjct: 343 NPEQKDEEEEQEDEEEEEEEEEPEEPEP 370



 Score = 29.6 bits (67), Expect = 5.2
 Identities = 8/24 (33%), Positives = 14/24 (58%)

Query: 314 GDEDEDDEDGDDEEDEDEEESEAE 337
             EDE++E+ ++E +E E E    
Sbjct: 352 EQEDEEEEEEEEEPEEPEPEEGPP 375



 Score = 29.3 bits (66), Expect = 5.9
 Identities = 11/27 (40%), Positives = 17/27 (62%)

Query: 316 EDEDDEDGDDEEDEDEEESEAEEGEKE 342
             E  ++ +++EDE+EEE E E  E E
Sbjct: 343 NPEQKDEEEEQEDEEEEEEEEEPEEPE 369


>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
           [Transcription / Chromatin structure and dynamics].
          Length = 279

 Score = 37.7 bits (87), Expect = 0.011
 Identities = 24/103 (23%), Positives = 40/103 (38%), Gaps = 11/103 (10%)

Query: 252 DNFKDFPDVIEDLDLVPEEDKFTHLMTLDGVKDTQDILNVFQYDPEYLMNEEKYTTLRRE 311
           ++  + P    D+D   EE         DG ++ +D         E   N E       E
Sbjct: 162 EDNDEAPPAQPDVDNEEEERLEES----DGREEEEDEEVGSDSYGE--GNREL-----NE 210

Query: 312 ILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETN 354
              +E E  +DG+D  D + E  + ++GE+E  E  + N    
Sbjct: 211 EEEEEAEGSDDGEDVVDYEGERIDKKQGEEEEMEEEVINLFEI 253



 Score = 35.4 bits (81), Expect = 0.054
 Identities = 13/41 (31%), Positives = 19/41 (46%), Gaps = 3/41 (7%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETNL 355
           +E+ED+E G D   E   E   EE E+       D+ E  +
Sbjct: 189 EEEEDEEVGSDSYGEGNRELNEEEEEEAEGS---DDGEDVV 226



 Score = 30.3 bits (68), Expect = 2.2
 Identities = 20/59 (33%), Positives = 29/59 (49%), Gaps = 7/59 (11%)

Query: 303 EKYTTLRREILGDEDEDDE-------DGDDEEDEDEEESEAEEGEKENKETIIDNTETN 354
           EK    R  I+ D DED++       D D+EE+E  EES+  E E++ +       E N
Sbjct: 147 EKPRVTRFNIVWDNDEDNDEAPPAQPDVDNEEEERLEESDGREEEEDEEVGSDSYGEGN 205



 Score = 30.3 bits (68), Expect = 2.7
 Identities = 14/41 (34%), Positives = 24/41 (58%), Gaps = 1/41 (2%)

Query: 314 GDEDEDDED-GDDEEDEDEEESEAEEGEKENKETIIDNTET 353
           G+E+E +E+  +  E E EEES +EE  + N+E+     + 
Sbjct: 238 GEEEEMEEEVINLFEIEWEEESPSEEVPRNNEESPAKKQKV 278


>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT.  This family
           consists of several bacterial cobalamin biosynthesis
           (CobT) proteins. CobT is involved in the transformation
           of precorrin-3 into cobyrinic acid.
          Length = 282

 Score = 37.1 bits (86), Expect = 0.015
 Identities = 13/36 (36%), Positives = 23/36 (63%)

Query: 310 REILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
           R++L   D  +E GD+ E  D E++E E+  KE+++
Sbjct: 200 RDMLSSMDMAEELGDEPESADSEDNEDEDDPKEDED 235



 Score = 36.7 bits (85), Expect = 0.024
 Identities = 17/66 (25%), Positives = 32/66 (48%), Gaps = 7/66 (10%)

Query: 285 TQDILNVFQYDPEYLMNEEKYTTLRREILGDEDEDDEDGDDEEDEDEEE----SEAEEGE 340
            +D+L+      E     ++  +   E   DED+  ED DD++ E+EE     S +E+ +
Sbjct: 199 VRDMLSSMDMAEEL---GDEPESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSD 255

Query: 341 KENKET 346
             ++E 
Sbjct: 256 ASSEEM 261



 Score = 31.7 bits (72), Expect = 0.87
 Identities = 11/35 (31%), Positives = 18/35 (51%), Gaps = 2/35 (5%)

Query: 314 GDEDEDDEDGDDEEDEDE--EESEAEEGEKENKET 346
            DED+D  + ++    D   E+S+A   E E+ E 
Sbjct: 232 EDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEM 266


>gnl|CDD|218538 pfam05285, SDA1, SDA1.  This family consists of several SDA1
           protein homologues. SDA1 is a Saccharomyces cerevisiae
           protein which is involved in the control of the actin
           cytoskeleton. The protein is essential for cell
           viability and is localised in the nucleus.
          Length = 317

 Score = 37.3 bits (87), Expect = 0.016
 Identities = 12/32 (37%), Positives = 21/32 (65%)

Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
           G E +DD+D ++E + +E+E   +EGE  + E
Sbjct: 91  GLESDDDDDEEEEWEVEEDEDSDDEGEWIDVE 122



 Score = 34.2 bits (79), Expect = 0.16
 Identities = 10/31 (32%), Positives = 18/31 (58%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
           D++ +  D +DEE++DE   +A+E   E   
Sbjct: 124 DKEIESSDSEDEEEKDEAAKKAKEDSDEELS 154



 Score = 32.3 bits (74), Expect = 0.60
 Identities = 13/40 (32%), Positives = 21/40 (52%)

Query: 303 EKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKE 342
           EK+    R+    E   + D DD+E+E+ E  E E+ + E
Sbjct: 76  EKWKEEERKKKEAEQGLESDDDDDEEEEWEVEEDEDSDDE 115



 Score = 32.3 bits (74), Expect = 0.61
 Identities = 14/51 (27%), Positives = 25/51 (49%), Gaps = 12/51 (23%)

Query: 314 GDEDEDDEDG------------DDEEDEDEEESEAEEGEKENKETIIDNTE 352
            DED DDE               D EDE+E++  A++ ++++ E + +  E
Sbjct: 108 EDEDSDDEGEWIDVESDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDE 158



 Score = 31.5 bits (72), Expect = 0.92
 Identities = 16/38 (42%), Positives = 22/38 (57%), Gaps = 7/38 (18%)

Query: 315 DEDEDDE-------DGDDEEDEDEEESEAEEGEKENKE 345
           DE+E DE       D D+E  E++EE  AEE E E ++
Sbjct: 134 DEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEK 171



 Score = 30.0 bits (68), Expect = 3.2
 Identities = 20/90 (22%), Positives = 37/90 (41%), Gaps = 9/90 (10%)

Query: 311 EILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETNLVALRRTIYLT-IHSS 369
           EI   + ED+E+ D+   + +E+S+ E  E++ +E   +            +  T I + 
Sbjct: 126 EIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEKASELATTRILTP 185

Query: 370 LDF--------EECAHKLLRMQLKPGQESE 391
            DF        E+   K L  +LK   +  
Sbjct: 186 ADFAKIQELRLEKGVDKALGGKLKRRDKDA 215



 Score = 29.2 bits (66), Expect = 4.8
 Identities = 8/29 (27%), Positives = 15/29 (51%)

Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKE 342
            +E+ + E+ +D +DE E      + E E
Sbjct: 100 EEEEWEVEEDEDSDDEGEWIDVESDKEIE 128



 Score = 28.9 bits (65), Expect = 6.6
 Identities = 9/39 (23%), Positives = 23/39 (58%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTET 353
           + D++ E  D E++E+++E+  +  E  ++E   ++ E 
Sbjct: 122 ESDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEE 160


>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457).  This is
           a family of uncharacterized proteins.
          Length = 449

 Score = 37.3 bits (86), Expect = 0.018
 Identities = 16/41 (39%), Positives = 29/41 (70%)

Query: 311 EILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNT 351
           E + +ED+D+ED DD++DEDE++ + ++ E +  E   D+T
Sbjct: 48  EAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDST 88



 Score = 36.1 bits (83), Expect = 0.041
 Identities = 16/52 (30%), Positives = 33/52 (63%)

Query: 302 EEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTET 353
           E +   +  E   +ED+DD+D +DE+D+D+++ E +E E ++  T+ D++  
Sbjct: 44  EAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLHDDSSA 95



 Score = 34.6 bits (79), Expect = 0.13
 Identities = 31/104 (29%), Positives = 49/104 (47%), Gaps = 11/104 (10%)

Query: 254 FKDF-PDVIEDLDLVPEEDKFTHLMTL-DGVKDTQDILNVFQYDPEYLMNEEKYTTLRRE 311
           F +F    +ED D V     +   +TL D +K    I  + +   E  M EE       +
Sbjct: 2   FHEFASSELEDDDWVRGSLDYKEKLTLNDTMKKENAIRKLGKEAEEEAMEEEDDDEEDDD 61

Query: 312 ILGDEDEDDEDGDDEEDEDEEE---------SEAEEGEKENKET 346
              DEDEDD+D DD+ED+++E+         S A++G + + E 
Sbjct: 62  DDDDEDEDDDDDDDDEDDEDEDDDDSTLHDDSSADDGNETDNEA 105



 Score = 33.0 bits (75), Expect = 0.42
 Identities = 16/53 (30%), Positives = 27/53 (50%)

Query: 302 EEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETN 354
           EE       +   D+D+DDED DD++D+D+E+ E E+ +           + N
Sbjct: 47  EEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLHDDSSADDGN 99



 Score = 32.7 bits (74), Expect = 0.49
 Identities = 14/39 (35%), Positives = 26/39 (66%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTET 353
           D+DED++D DD++DED+E+ + ++    +  +  D  ET
Sbjct: 63  DDDEDEDDDDDDDDEDDEDEDDDDSTLHDDSSADDGNET 101



 Score = 30.3 bits (68), Expect = 3.0
 Identities = 9/41 (21%), Positives = 27/41 (65%)

Query: 316 EDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETNLV 356
            +E+D+D +D++D+D+E+ + ++ + +  +   D+ ++ L 
Sbjct: 50  MEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLH 90


>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein.  This family includes proteins
           related to Mpp10 (M phase phosphoprotein 10). The U3
           small nucleolar ribonucleoprotein (snoRNP) is required
           for three cleavage events that generate the mature 18S
           rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
           depletion of Mpp10, a U3 snoRNP-specific protein, halts
           18S rRNA production and impairs cleavage at the three U3
           snoRNP-dependent sites.
          Length = 613

 Score = 37.3 bits (86), Expect = 0.019
 Identities = 23/68 (33%), Positives = 31/68 (45%), Gaps = 2/68 (2%)

Query: 293 QYDPEYLMNEEKYTTLRREILGDEDEDDEDGDD--EEDEDEEESEAEEGEKENKETIIDN 350
           Q   E+L+N+E    L +    +  ED  DG D   ED  ++E E EE E    E I D 
Sbjct: 76  QKYVEFLINKEHIRVLAKLQDSESHEDGSDGSDMDSEDSADDEEEEEEDESLEDEMIDDE 135

Query: 351 TETNLVAL 358
            E +L   
Sbjct: 136 DEADLFNE 143



 Score = 31.1 bits (70), Expect = 1.7
 Identities = 13/38 (34%), Positives = 20/38 (52%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTE 352
            E   ++  DDE ++DEE+   EE   E KE++   T 
Sbjct: 144 SESSLEDLSDDETEDDEEKKMEEEEAGEEKESVEQATR 181


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 728

 Score = 37.3 bits (87), Expect = 0.020
 Identities = 16/50 (32%), Positives = 28/50 (56%), Gaps = 5/50 (10%)

Query: 299 LMNEEKYTTLRREILG-DEDEDDEDGDDEEDEDEEESEAEEGEKENKETI 347
              EE    LRR+I G    E+DED D + +E++E+ + ++ + EN   +
Sbjct: 309 RQGEE----LRRKIEGKSVSEEDEDEDSDSEEEDEDDDEDDDDGENPWML 354


>gnl|CDD|217450 pfam03247, Prothymosin, Prothymosin/parathymosin family.
           Prothymosin alpha and parathymosin are two ubiquitous
           small acidic nuclear proteins that are thought to be
           involved in cell cycle progression, proliferation, and
           cell differentiation.
          Length = 106

 Score = 34.9 bits (80), Expect = 0.021
 Identities = 18/40 (45%), Positives = 25/40 (62%), Gaps = 1/40 (2%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEG-EKENKETIIDNTET 353
           DED+++E+G+ EE+E EEE E E    K   E   D+ ET
Sbjct: 58  DEDDEEEEGEGEEEEGEEEEETEGATGKRAAEDEEDDAET 97



 Score = 31.9 bits (72), Expect = 0.25
 Identities = 12/44 (27%), Positives = 22/44 (50%), Gaps = 7/44 (15%)

Query: 314 GDEDEDDEDGDDEEDEDEEES-------EAEEGEKENKETIIDN 350
            +E+E + + ++ E+E+E E        E EE + E K+   D 
Sbjct: 61  DEEEEGEGEEEEGEEEEETEGATGKRAAEDEEDDAETKKQKTDE 104



 Score = 29.9 bits (67), Expect = 0.95
 Identities = 14/31 (45%), Positives = 24/31 (77%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
           DE E++E+ D++++E+E E E EEGE+E + 
Sbjct: 49  DEMEEEEEVDEDDEEEEGEGEEEEGEEEEET 79



 Score = 29.2 bits (65), Expect = 1.7
 Identities = 12/32 (37%), Positives = 21/32 (65%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKET 346
           +E+E+ ++ D+EE+ + EE E EE E+    T
Sbjct: 52  EEEEEVDEDDEEEEGEGEEEEGEEEEETEGAT 83



 Score = 28.8 bits (64), Expect = 2.4
 Identities = 13/32 (40%), Positives = 22/32 (68%)

Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
           G ++ DDE  ++EE ++++E E  EGE+E  E
Sbjct: 43  GAQEGDDEMEEEEEVDEDDEEEEGEGEEEEGE 74



 Score = 28.4 bits (63), Expect = 3.1
 Identities = 12/40 (30%), Positives = 21/40 (52%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETN 354
            E E++E  ++EE E      A E E+++ ET    T+ +
Sbjct: 66  GEGEEEEGEEEEETEGATGKRAAEDEEDDAETKKQKTDED 105



 Score = 27.2 bits (60), Expect = 8.5
 Identities = 12/31 (38%), Positives = 20/31 (64%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
           +E+   ++GDDE +E+EE  E +E E+   E
Sbjct: 39  NEENGAQEGDDEMEEEEEVDEDDEEEEGEGE 69


>gnl|CDD|221333 pfam11942, Spt5_N, Spt5 transcription elongation factor, acidic
           N-terminal.  This is the very acidic N-terminal region
           of the early transcription elongation factor Spt5. The
           Spt5-Spt4 complex regulates early transcription
           elongation by RNA polymerase II and has an imputed role
           in pre-mRNA processing via its physical association with
           mRNA capping enzymes. The actual function of this
           N-terminal domain is not known although it is
           dispensable for binding to Spt4.
          Length = 92

 Score = 34.3 bits (79), Expect = 0.025
 Identities = 12/35 (34%), Positives = 20/35 (57%)

Query: 319 DDEDGDDEEDEDEEESEAEEGEKENKETIIDNTET 353
           D E   D+E+E+EEE E +  +  +++  ID  E 
Sbjct: 3   DTEAEVDDEEEEEEEEEDDLEDLSDEDEFIDEAEA 37



 Score = 30.1 bits (68), Expect = 0.76
 Identities = 11/29 (37%), Positives = 20/29 (68%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKEN 343
           +E+E++E+ DD ED  +E+   +E E E+
Sbjct: 11  EEEEEEEEEDDLEDLSDEDEFIDEAEAED 39



 Score = 28.9 bits (65), Expect = 1.5
 Identities = 9/32 (28%), Positives = 21/32 (65%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKET 346
           + D+++E+ ++EED+ E+ S+ +E   E +  
Sbjct: 7   EVDDEEEEEEEEEDDLEDLSDEDEFIDEAEAE 38



 Score = 28.2 bits (63), Expect = 3.1
 Identities = 7/25 (28%), Positives = 17/25 (68%)

Query: 314 GDEDEDDEDGDDEEDEDEEESEAEE 338
            DE+E++E+ +D+ ++  +E E  +
Sbjct: 9   DDEEEEEEEEEDDLEDLSDEDEFID 33



 Score = 27.0 bits (60), Expect = 8.5
 Identities = 10/34 (29%), Positives = 19/34 (55%), Gaps = 2/34 (5%)

Query: 315 DEDE--DDEDGDDEEDEDEEESEAEEGEKENKET 346
            E E  D+E+ ++EE++D E+   E+   +  E 
Sbjct: 4   TEAEVDDEEEEEEEEEDDLEDLSDEDEFIDEAEA 37


>gnl|CDD|227701 COG5414, COG5414, TATA-binding protein-associated factor
           [Transcription].
          Length = 392

 Score = 36.6 bits (84), Expect = 0.028
 Identities = 19/95 (20%), Positives = 37/95 (38%), Gaps = 3/95 (3%)

Query: 252 DNFKDFPDVIEDLDLVPEEDKFTHLMTLDGVKDTQ-DILNVFQYDPEYLMNEEKYTTLRR 310
           +NFK+ P        + +E +       +G+ +   D+           ++E      + 
Sbjct: 242 ENFKEEPGEPLSRPALKKEKQGAEEEGEEGMSEEDLDVGAAEI--ENKEVSEGDKEQQQE 299

Query: 311 EILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
           E+   E   +E   D  DE  EE E ++  +EN+ 
Sbjct: 300 EVENAEAHKEEVQSDRPDEIGEEKEEDDENEENER 334



 Score = 31.6 bits (71), Expect = 1.2
 Identities = 12/47 (25%), Positives = 20/47 (42%)

Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETNLVALRR 360
            +   D  D   EE E+++E+E  E   E     ++  E  +   RR
Sbjct: 309 EEVQSDRPDEIGEEKEEDDENEENERHTELLADELNELEKGIEEKRR 355


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
           Fas-binding protein) is thought to play a role in
           apoptosis, but precise role played by Daxx remains to be
           determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 36.8 bits (85), Expect = 0.031
 Identities = 15/30 (50%), Positives = 24/30 (80%)

Query: 316 EDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
             E++E+ ++EE+E+E+ESE EEGE E +E
Sbjct: 444 SVEEEEEEEEEEEEEEQESEEEEGEDEEEE 473



 Score = 33.3 bits (76), Expect = 0.35
 Identities = 14/32 (43%), Positives = 25/32 (78%)

Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
            +E+  +E+ ++EE+E+EEE E+EE E E++E
Sbjct: 440 EEEESVEEEEEEEEEEEEEEQESEEEEGEDEE 471



 Score = 32.2 bits (73), Expect = 0.78
 Identities = 15/44 (34%), Positives = 24/44 (54%), Gaps = 2/44 (4%)

Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETNLVA 357
             E E++E  ++EE+E+EEE E E+  +E +    D  E   V 
Sbjct: 436 SQESEEEESVEEEEEEEEEEEEEEQESEEEEGE--DEEEEEEVE 477


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 36.9 bits (85), Expect = 0.031
 Identities = 12/30 (40%), Positives = 17/30 (56%)

Query: 317 DEDDEDGDDEEDEDEEESEAEEGEKENKET 346
            E     + E+DED EESE E+ E+E   +
Sbjct: 306 PEIPAKPEIEQDEDSEESEEEKNEEEGGLS 335



 Score = 33.4 bits (76), Expect = 0.35
 Identities = 12/38 (31%), Positives = 18/38 (47%), Gaps = 1/38 (2%)

Query: 309 RREILGDEDED-DEDGDDEEDEDEEESEAEEGEKENKE 345
           R + L  E     E   DE+ E+ EE + EE    +K+
Sbjct: 300 REDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKK 337



 Score = 32.6 bits (74), Expect = 0.51
 Identities = 15/39 (38%), Positives = 21/39 (53%), Gaps = 8/39 (20%)

Query: 314 GDEDEDD---EDGDDEEDE-----DEEESEAEEGEKENK 344
           GD+D D+   +DGDDE  E     D   S  +  E+E+K
Sbjct: 265 GDDDADEYDSDDGDDEGREEDYISDSSASGNDPEEREDK 303



 Score = 30.3 bits (68), Expect = 3.1
 Identities = 9/29 (31%), Positives = 18/29 (62%)

Query: 316 EDEDDEDGDDEEDEDEEESEAEEGEKENK 344
           E +D++DGD+ +   E+  E +  +K+ K
Sbjct: 219 EGDDEDDGDESDKGGEDGDEEKSKKKKKK 247



 Score = 29.9 bits (67), Expect = 3.6
 Identities = 11/43 (25%), Positives = 21/43 (48%)

Query: 302 EEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENK 344
           EE+   L  EI    + + ++  +E +E++ E E    +K  K
Sbjct: 298 EEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKK 340



 Score = 29.9 bits (67), Expect = 3.8
 Identities = 8/31 (25%), Positives = 19/31 (61%)

Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENK 344
            + ++D++  + EE+++EEE    +  K+ K
Sbjct: 312 PEIEQDEDSEESEEEKNEEEGGLSKKGKKLK 342


>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
           Provisional.
          Length = 2849

 Score = 36.9 bits (85), Expect = 0.032
 Identities = 16/39 (41%), Positives = 27/39 (69%)

Query: 300 MNEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEE 338
           + EE  +     ++ D+DED+++ DD+E++DEEE E EE
Sbjct: 140 LAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEE 178



 Score = 35.8 bits (82), Expect = 0.082
 Identities = 14/28 (50%), Positives = 23/28 (82%)

Query: 317 DEDDEDGDDEEDEDEEESEAEEGEKENK 344
           D+DDED D+++D++E++ E EE E+E K
Sbjct: 154 DDDDEDEDEDDDDEEDDEEEEEEEEEIK 181



 Score = 35.4 bits (81), Expect = 0.090
 Identities = 17/39 (43%), Positives = 27/39 (69%)

Query: 302 EEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGE 340
            E+  + R   + D+D++DED DD+++ED+EE E EE E
Sbjct: 141 AEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEE 179



 Score = 33.9 bits (77), Expect = 0.26
 Identities = 11/38 (28%), Positives = 23/38 (60%)

Query: 308 LRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
           L  E +   D    D DDE+++++++ E ++ E+E +E
Sbjct: 140 LAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEE 177



 Score = 32.7 bits (74), Expect = 0.59
 Identities = 16/31 (51%), Positives = 25/31 (80%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
           DEDEDD+D +D+E+E+EEE E +  + E++E
Sbjct: 159 DEDEDDDDEEDDEEEEEEEEEIKGFDDEDEE 189



 Score = 31.6 bits (71), Expect = 1.6
 Identities = 12/31 (38%), Positives = 22/31 (70%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
           D    D+D +DE+++D++E + EE E+E +E
Sbjct: 149 DNFVIDDDDEDEDEDDDDEEDDEEEEEEEEE 179



 Score = 30.0 bits (67), Expect = 4.7
 Identities = 17/32 (53%), Positives = 22/32 (68%), Gaps = 5/32 (15%)

Query: 314 GDEDEDDEDGDDEEDEDEEE-----SEAEEGE 340
            DED+DDE+ D+EE+E+EEE      E EE E
Sbjct: 160 EDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDE 191



 Score = 29.2 bits (65), Expect = 8.3
 Identities = 11/37 (29%), Positives = 23/37 (62%), Gaps = 3/37 (8%)

Query: 313 LGDEDEDDEDG---DDEEDEDEEESEAEEGEKENKET 346
           L +ED    D    DD++++++E+ + EE ++E +E 
Sbjct: 140 LAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEE 176


>gnl|CDD|184885 PRK14891, PRK14891, 50S ribosomal protein L24e/unknown domain
           fusion protein; Provisional.
          Length = 131

 Score = 34.9 bits (80), Expect = 0.033
 Identities = 9/39 (23%), Positives = 19/39 (48%)

Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTE 352
            +E E  +  +D ++  E ++  E  E+E  +  +D T 
Sbjct: 73  AEEAEAADADEDADEAAEADAADEADEEEETDEAVDETA 111



 Score = 33.4 bits (76), Expect = 0.10
 Identities = 8/22 (36%), Positives = 16/22 (72%)

Query: 315 DEDEDDEDGDDEEDEDEEESEA 336
             DE D + ++ ++E++EE+EA
Sbjct: 110 TADEADAEAEEADEEEDEEAEA 131



 Score = 32.6 bits (74), Expect = 0.16
 Identities = 11/24 (45%), Positives = 16/24 (66%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEE 338
           DE  D+ D + EE ++EE+ EAE 
Sbjct: 108 DETADEADAEAEEADEEEDEEAEA 131



 Score = 31.8 bits (72), Expect = 0.33
 Identities = 16/53 (30%), Positives = 23/53 (43%), Gaps = 1/53 (1%)

Query: 302 EEKYTTLRREILGDEDEDDEDGDDEEDEDEE-ESEAEEGEKENKETIIDNTET 353
            EK          +  E+ E  D +ED DE  E++A +   E +ET     ET
Sbjct: 58  AEKGPAAAATAAAEAAEEAEAADADEDADEAAEADAADEADEEEETDEAVDET 110



 Score = 30.3 bits (68), Expect = 1.1
 Identities = 10/28 (35%), Positives = 18/28 (64%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKE 342
           +E+E DE  D+  DE + E+E  + E++
Sbjct: 99  EEEETDEAVDETADEADAEAEEADEEED 126



 Score = 30.3 bits (68), Expect = 1.2
 Identities = 11/32 (34%), Positives = 21/32 (65%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKET 346
           DE +++E+ D+  DE  +E++AE  E + +E 
Sbjct: 95  DEADEEEETDEAVDETADEADAEAEEADEEED 126



 Score = 29.1 bits (65), Expect = 2.7
 Identities = 13/42 (30%), Positives = 20/42 (47%), Gaps = 3/42 (7%)

Query: 315 DEDEDD---EDGDDEEDEDEEESEAEEGEKENKETIIDNTET 353
           DED D+    D  DE DE+EE  EA +   +  +   +  + 
Sbjct: 82  DEDADEAAEADAADEADEEEETDEAVDETADEADAEAEEADE 123



 Score = 28.8 bits (64), Expect = 3.7
 Identities = 11/25 (44%), Positives = 15/25 (60%), Gaps = 1/25 (4%)

Query: 316 EDEDDEDGDDEEDEDEEESEAEEGE 340
            DE  ++ D E +E +EE E EE E
Sbjct: 107 VDETADEADAEAEEADEE-EDEEAE 130



 Score = 28.8 bits (64), Expect = 3.9
 Identities = 12/43 (27%), Positives = 20/43 (46%), Gaps = 8/43 (18%)

Query: 311 EILGDEDEDDEDGDDE--------EDEDEEESEAEEGEKENKE 345
                E +  ++ D+E        E  DE ++EAEE ++E  E
Sbjct: 85  ADEAAEADAADEADEEEETDEAVDETADEADAEAEEADEEEDE 127


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 36.5 bits (85), Expect = 0.035
 Identities = 19/45 (42%), Positives = 30/45 (66%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETNLVALR 359
           ++D++D+D DD +DEDEE+ EA+E EK + +      E +  ALR
Sbjct: 154 EDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALR 198



 Score = 35.0 bits (81), Expect = 0.095
 Identities = 13/30 (43%), Positives = 21/30 (70%)

Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKEN 343
            D+++DDED DD++ +DE+E + E  E E 
Sbjct: 151 DDDEDDDEDDDDDDVDDEDEEKKEAKELEK 180



 Score = 33.4 bits (77), Expect = 0.35
 Identities = 17/51 (33%), Positives = 28/51 (54%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETNLVALRRTIYLT 365
           DED+DD+D DDE++E +E  E E+   ++     ++    L   R+   LT
Sbjct: 157 DEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQARKDAKLT 207



 Score = 31.9 bits (73), Expect = 0.86
 Identities = 11/38 (28%), Positives = 23/38 (60%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTE 352
           D+ +DD+  DD++DED++E + ++   +  E   +  E
Sbjct: 140 DDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKE 177



 Score = 31.1 bits (71), Expect = 1.7
 Identities = 16/107 (14%), Positives = 40/107 (37%), Gaps = 6/107 (5%)

Query: 247 AQVRKDNFKDFPDVIEDLDLVPEEDKFTHLMTLDGVKDTQDILNVFQYDPEYLMNEEKYT 306
            +    +  D  + ++D+D++ + D        D + D     +    D +   +++   
Sbjct: 107 EKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDE--DDDDDD 164

Query: 307 TLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTET 353
               +    E ++ E   D++D   +E ++E   +  K    D   T
Sbjct: 165 VDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQARK----DAKLT 207


>gnl|CDD|227563 COG5238, RNA1, Ran GTPase-activating protein (RanGAP) involved in
           mRNA processing and transport [Signal transduction
           mechanisms / RNA processing and modification].
          Length = 388

 Score = 36.4 bits (84), Expect = 0.036
 Identities = 15/58 (25%), Positives = 25/58 (43%), Gaps = 1/58 (1%)

Query: 302 EEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETNLVALR 359
           E+ +  +      + D   E+  DEE EDE E   E   +E  E  +   + + +A R
Sbjct: 326 EDIFEVVEVVEKQEGDVVTEESTDEESEDEVE-IDESVIEEVAEMELLEVQVDDLAER 382


>gnl|CDD|115196 pfam06524, NOA36, NOA36 protein.  This family consists of several
           NOA36 proteins which contain 29 highly conserved
           cysteine residues. The function of this protein is
           unknown.
          Length = 314

 Score = 36.1 bits (83), Expect = 0.036
 Identities = 9/30 (30%), Positives = 17/30 (56%)

Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKEN 343
            D+DE     D +E+ED ++ + E+ +  N
Sbjct: 273 SDDDEGSSSNDYDEEEDGDDDDNEDNDDTN 302



 Score = 33.8 bits (77), Expect = 0.21
 Identities = 10/29 (34%), Positives = 20/29 (68%), Gaps = 1/29 (3%)

Query: 315 DEDEDDE-DGDDEEDEDEEESEAEEGEKE 342
             D D+E DGDD+++ED +++    G+++
Sbjct: 281 SNDYDEEEDGDDDDNEDNDDTNTNHGDED 309



 Score = 32.2 bits (73), Expect = 0.67
 Identities = 10/38 (26%), Positives = 21/38 (55%)

Query: 316 EDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTET 353
            + DD++G    D DEEE   ++  ++N +T  ++ + 
Sbjct: 271 YESDDDEGSSSNDYDEEEDGDDDDNEDNDDTNTNHGDE 308



 Score = 30.7 bits (69), Expect = 1.9
 Identities = 11/61 (18%), Positives = 26/61 (42%), Gaps = 6/61 (9%)

Query: 301 NEEKYTTLRREILGDEDED------DEDGDDEEDEDEEESEAEEGEKENKETIIDNTETN 354
           +++ Y   ++ +  ++  D        + DD+E     + + EE   ++     D+T TN
Sbjct: 245 DDQGYGRYKQNLSSNKYGDFKGVNYGYESDDDEGSSSNDYDEEEDGDDDDNEDNDDTNTN 304

Query: 355 L 355
            
Sbjct: 305 H 305


>gnl|CDD|217059 pfam02480, Herpes_gE, Alphaherpesvirus glycoprotein E.
           Glycoprotein E (gE) of Alphaherpesvirus forms a complex
           with glycoprotein I (gI) (pfam01688), functioning as an
           immunoglobulin G (IgG) Fc binding protein. gE is
           involved in virus spread but is not essential for
           propagation.
          Length = 437

 Score = 36.2 bits (84), Expect = 0.042
 Identities = 12/50 (24%), Positives = 19/50 (38%)

Query: 293 QYDPEYLMNEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKE 342
            Y          YT+L    L ++   D + DDEE + + + E     K 
Sbjct: 383 PYAVILNPFSPVYTSLPTNELFEDVSSDSESDDEEFDSDSDREPSGPSKS 432


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 35.8 bits (83), Expect = 0.047
 Identities = 12/49 (24%), Positives = 17/49 (34%)

Query: 298 YLMNEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKET 346
             M  E            E+ DD+  ++ +DE   E    E E  NK  
Sbjct: 225 EKMAPEASRLDEMSEGLLEESDDDGEEESDDESAWEGFESEYEPINKPV 273



 Score = 32.4 bits (74), Expect = 0.54
 Identities = 10/30 (33%), Positives = 16/30 (53%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENK 344
            +D+ +E+ DDE   +  ESE E   K  +
Sbjct: 245 SDDDGEEESDDESAWEGFESEYEPINKPVR 274


>gnl|CDD|130712 TIGR01651, CobT, cobaltochelatase, CobT subunit.  This model
           describes Pseudomonas denitrificans CobT gene product,
           which is a cobalt chelatase subunit that functions in
           cobalamin biosynthesis. Cobalamin (vitamin B12) can be
           synthesized via several pathways, including an aerobic
           pathway (found in Pseudomonas denitrificans) and an
           anaerobic pathway (found in P. shermanii and Salmonella
           typhimurium). These pathways differ in the point of
           cobalt insertion during corrin ring formation. There are
           apparently a number of variations on these two pathways,
           where the major differences seem to be concerned with
           the process of ring contraction. Confusion regarding the
           functions of enzymes found in the aerobic vs. anaerobic
           pathways has arisen because nonhomologous genes in these
           different pathways were given the same gene symbols.
           Thus, cobT in the aerobic pathway (P. denitrificans) is
           not a homolog of cobT in the anaerobic pathway (S.
           typhimurium). It should be noted that E. coli
           synthesizes cobalamin only when it is supplied with the
           precursor cobinamide, which is a complex intermediate.
           Additionally, all E. coli cobalamin synthesis genes
           (cobU, cobS and cobT) were named after their Salmonella
           typhimurium homologs which function in the anaerobic
           cobalamin synthesis pathway. This model describes the
           aerobic cobalamin pathway Pseudomonas denitrificans CobT
           gene product, which is a cobalt chelatase subunit, with
           a MW ~70 kDa. The aerobic pathway cobalt chelatase is a
           heterotrimeric, ATP-dependent enzyme that catalyzes
           cobalt insertion during cobalamin biosynthesis. The
           other two subunits are the P. denitrificans CobS
           (TIGR01650) and CobN (pfam02514 CobN/Magnesium
           Chelatase) proteins. To avoid potential confusion with
           the nonhomologous Salmonella typhimurium/E.coli cobT
           gene product, the P. denitrificans gene symbol is not
           used in the name of this model [Biosynthesis of
           cofactors, prosthetic groups, and carriers, Heme,
           porphyrin, and cobalamin].
          Length = 600

 Score = 36.1 bits (83), Expect = 0.051
 Identities = 16/51 (31%), Positives = 27/51 (52%), Gaps = 9/51 (17%)

Query: 301 NEEKYTTLRREILGDEDEDDEDGDDEEDEDEEE---------SEAEEGEKE 342
           +++ +  + RE+L   +  +E GDD E EDEE+          + E+GE E
Sbjct: 182 DQQAFARVVREMLRSMELAEEMGDDTESEDEEDGDDDQPTENEQEEQGEGE 232



 Score = 30.3 bits (68), Expect = 3.1
 Identities = 14/44 (31%), Positives = 22/44 (50%), Gaps = 1/44 (2%)

Query: 302 EEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAE-EGEKENK 344
           +E   T R    G+E+    D DD  DE +++SE   EG +  +
Sbjct: 242 QESEATDRESESGEEEMVQSDQDDLPDESDDDSETPGEGARPAR 285



 Score = 28.8 bits (64), Expect = 9.1
 Identities = 14/42 (33%), Positives = 22/42 (52%), Gaps = 9/42 (21%)

Query: 314 GDEDEDDEDGDDEEDEDE---------EESEAEEGEKENKET 346
           GD+D+  E+  +E+ E E         +ESEA + E E+ E 
Sbjct: 215 GDDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRESESGEE 256


>gnl|CDD|222792 PHA00435, PHA00435, capsid assembly protein.
          Length = 306

 Score = 35.2 bits (81), Expect = 0.062
 Identities = 18/43 (41%), Positives = 26/43 (60%), Gaps = 1/43 (2%)

Query: 316 EDEDDEDGDDEEDEDEEESEAEEGEKENKE-TIIDNTETNLVA 357
           E    EDG++EE E+ EE E EEGE+E++E   + +T   L  
Sbjct: 72  EVRISEDGEEEEVEEGEEDEEEEGEEESEEFEPLGDTPEELTE 114



 Score = 31.7 bits (72), Expect = 0.87
 Identities = 11/39 (28%), Positives = 22/39 (56%), Gaps = 1/39 (2%)

Query: 301 NEEKYTTLR-REILGDEDEDDEDGDDEEDEDEEESEAEE 338
           ++E    +R  E   +E+ ++ + D+EE+ +EE  E E 
Sbjct: 66  DDEGRIEVRISEDGEEEEVEEGEEDEEEEGEEESEEFEP 104



 Score = 30.6 bits (69), Expect = 2.2
 Identities = 13/34 (38%), Positives = 22/34 (64%)

Query: 309 RREILGDEDEDDEDGDDEEDEDEEESEAEEGEKE 342
           R E+   ED ++E+ ++ E+++EEE E E  E E
Sbjct: 70  RIEVRISEDGEEEEVEEGEEDEEEEGEEESEEFE 103



 Score = 29.8 bits (67), Expect = 3.7
 Identities = 14/61 (22%), Positives = 29/61 (47%)

Query: 280 DGVKDTQDILNVFQYDPEYLMNEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEG 339
           D ++  +   +   Y       E+    +   I  D +E++ +  +E++E+E E E+EE 
Sbjct: 43  DAIELAEPETSDDPYGNPDPFGEDDEGRIEVRISEDGEEEEVEEGEEDEEEEGEEESEEF 102

Query: 340 E 340
           E
Sbjct: 103 E 103


>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
          Length = 191

 Score = 34.8 bits (80), Expect = 0.063
 Identities = 10/47 (21%), Positives = 20/47 (42%), Gaps = 3/47 (6%)

Query: 313 LGDEDEDDEDGDDEEDEDEEE---SEAEEGEKENKETIIDNTETNLV 356
           L  E+  D + ++E D+  +           + N E +ID ++  L 
Sbjct: 80  LAAEELSDSEENEENDKKVDYELPKVQNTAAEVNHEDVIDLSDLKLA 126


>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
          Length = 211

 Score = 34.7 bits (80), Expect = 0.065
 Identities = 14/56 (25%), Positives = 27/56 (48%)

Query: 297 EYLMNEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTE 352
           E    + K+  +  +   + +  +ED   EED + EE E EE  ++++E+     E
Sbjct: 2   EKECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIE 57



 Score = 30.5 bits (69), Expect = 1.9
 Identities = 14/63 (22%), Positives = 29/63 (46%)

Query: 283 KDTQDILNVFQYDPEYLMNEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKE 342
           K+ +D  +    +     NE K     +E   + +E +++   E+ E+  E + EE + E
Sbjct: 3   KECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDE 62

Query: 343 NKE 345
           N +
Sbjct: 63  NNK 65



 Score = 28.6 bits (64), Expect = 7.7
 Identities = 10/59 (16%), Positives = 28/59 (47%)

Query: 300 MNEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETNLVAL 358
           M +E        +  D  +++E+ ++++ ++E+    E  ++E  E   ++ E  +  L
Sbjct: 1   MEKECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEEL 59


>gnl|CDD|221668 pfam12619, MCM2_N, Mini-chromosome maintenance protein 2.  This
           domain family is found in eukaryotes, and is typically
           between 138 and 153 amino acids in length. The family is
           found in association with pfam00493. Mini-chromosome
           maintenance (MCM) proteins are essential for DNA
           replication. These proteins use ATPase activity to
           perform this function.
          Length = 145

 Score = 33.8 bits (78), Expect = 0.084
 Identities = 13/26 (50%), Positives = 18/26 (69%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGE 340
           + DE++E  DD +D DEE  E E+GE
Sbjct: 3   EPDEEEELEDDIDDLDEEAEEEEDGE 28



 Score = 28.1 bits (63), Expect = 7.6
 Identities = 12/37 (32%), Positives = 15/37 (40%)

Query: 309 RREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
           RR    DED D +D D +  E+E   E     K    
Sbjct: 109 RRRRQYDEDRDADDEDIDPLEEELSLEDLSDVKAASI 145


>gnl|CDD|219922 pfam08595, RXT2_N, RXT2-like, N-terminal.  The family represents
           the N-terminal region of RXT2-like proteins. In S.
           cerevisiae, RXT2 has been demonstrated to be involved in
           conjugation with cellular fusion (mating) and invasive
           growth. A high throughput localisation study has
           localised RXT2 to the nucleus.
          Length = 141

 Score = 33.5 bits (77), Expect = 0.084
 Identities = 15/26 (57%), Positives = 21/26 (80%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGE 340
           DED  D D DDE+DED+EE++AE+ +
Sbjct: 48  DEDGGDIDDDDEDDEDDEEADAEDDD 73



 Score = 30.4 bits (69), Expect = 1.2
 Identities = 10/40 (25%), Positives = 21/40 (52%)

Query: 299 LMNEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEE 338
           L  +         I  D  + D+D +D+ED++E ++E ++
Sbjct: 34  LSRKRIKFNNPPRIDEDGGDIDDDDEDDEDDEEADAEDDD 73



 Score = 30.1 bits (68), Expect = 1.5
 Identities = 10/27 (37%), Positives = 19/27 (70%)

Query: 317 DEDDEDGDDEEDEDEEESEAEEGEKEN 343
           DED  D DD++++DE++ EA+  + + 
Sbjct: 48  DEDGGDIDDDDEDDEDDEEADAEDDDE 74


>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
           (nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
           phosphoribosyltransferase) [Coenzyme metabolism].
          Length = 620

 Score = 35.2 bits (81), Expect = 0.084
 Identities = 17/41 (41%), Positives = 24/41 (58%), Gaps = 4/41 (9%)

Query: 316 EDEDDEDGDD---EEDEDEEE-SEAEEGEKENKETIIDNTE 352
           ED D+EDGDD   + +ED E   E  EG  E++E   + T+
Sbjct: 229 EDADEEDGDDDQPDNNEDSEAGREESEGSDESEEDEAEATD 269



 Score = 33.7 bits (77), Expect = 0.28
 Identities = 13/37 (35%), Positives = 23/37 (62%)

Query: 310 REILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKET 346
           R++LG  D  +E GDD  +ED +E + ++ + +N E 
Sbjct: 210 RDMLGSMDMAEETGDDGIEEDADEEDGDDDQPDNNED 246



 Score = 30.6 bits (69), Expect = 2.9
 Identities = 12/33 (36%), Positives = 17/33 (51%)

Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENKET 346
           GD+D+ D + D E   +E E   E  E E + T
Sbjct: 236 GDDDQPDNNEDSEAGREESEGSDESEEDEAEAT 268



 Score = 29.4 bits (66), Expect = 5.4
 Identities = 12/39 (30%), Positives = 20/39 (51%)

Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTE 352
            +ED +    + E  ++ EE EAE  + E +E  +D  E
Sbjct: 243 NNEDSEAGREESEGSDESEEDEAEATDGEGEEGEMDAAE 281


>gnl|CDD|235033 PRK02363, PRK02363, DNA-directed RNA polymerase subunit delta;
           Reviewed.
          Length = 129

 Score = 33.4 bits (77), Expect = 0.091
 Identities = 15/25 (60%), Positives = 21/25 (84%)

Query: 311 EILGDEDEDDEDGDDEEDEDEEESE 335
           +IL D+D D+ED D+E+DEDEE+ E
Sbjct: 105 DILPDDDFDEEDLDEEDDEDEEDEE 129



 Score = 31.5 bits (72), Expect = 0.41
 Identities = 13/32 (40%), Positives = 20/32 (62%)

Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
           GD+D  D+D   ++D DEE+ + E+ E E  E
Sbjct: 97  GDDDIIDDDILPDDDFDEEDLDEEDDEDEEDE 128



 Score = 28.8 bits (65), Expect = 3.8
 Identities = 9/28 (32%), Positives = 17/28 (60%)

Query: 311 EILGDEDEDDEDGDDEEDEDEEESEAEE 338
            I  D   DD+  +++ DE+++E E +E
Sbjct: 101 IIDDDILPDDDFDEEDLDEEDDEDEEDE 128



 Score = 27.7 bits (62), Expect = 8.7
 Identities = 11/37 (29%), Positives = 22/37 (59%)

Query: 309 RREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
           +++   D D+D  D D   D+D +E + +E + E++E
Sbjct: 90  KKKKFMDGDDDIIDDDILPDDDFDEEDLDEEDDEDEE 126


>gnl|CDD|165026 PHA02644, PHA02644, hypothetical protein; Provisional.
          Length = 112

 Score = 33.0 bits (74), Expect = 0.094
 Identities = 19/60 (31%), Positives = 33/60 (55%), Gaps = 1/60 (1%)

Query: 293 QYDPEYLMNEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTE 352
           +YD  ++ NE++  T    I   E+ED++  +  E+EDE++ E  E E E K    +N +
Sbjct: 40  KYDKIFISNEDEMKTTHEHI-KKENEDEKKPEKPENEDEKKPEKPENEDEKKPEKPENED 98


>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
           This family represents Cwf15/Cwc15 (from
           Schizosaccharomyces pombe and Saccharomyces cerevisiae
           respectively) and their homologues. The function of
           these proteins is unknown, but they form part of the
           spliceosome and are thus thought to be involved in mRNA
           splicing.
          Length = 241

 Score = 34.3 bits (79), Expect = 0.10
 Identities = 14/31 (45%), Positives = 18/31 (58%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
           D D  D+D DD++ EDE  +   E EK  KE
Sbjct: 127 DSDSSDDDSDDDDSEDETAALLRELEKIKKE 157



 Score = 32.4 bits (74), Expect = 0.43
 Identities = 15/44 (34%), Positives = 19/44 (43%), Gaps = 9/44 (20%)

Query: 311 EILGDEDEDDEDGDDEEDEDEE---------ESEAEEGEKENKE 345
               D  +DD D DD EDE            +  AEE E+E +E
Sbjct: 125 SSDSDSSDDDSDDDDSEDETAALLRELEKIKKERAEEKEREEEE 168



 Score = 32.4 bits (74), Expect = 0.47
 Identities = 8/31 (25%), Positives = 16/31 (51%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
           + D DD D   + D  +++S+ ++ E E   
Sbjct: 116 NSDADDSDSSSDSDSSDDDSDDDDSEDETAA 146



 Score = 30.8 bits (70), Expect = 1.6
 Identities = 9/44 (20%), Positives = 20/44 (45%)

Query: 302 EEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
           EE     +R      + D +D D   D D  + ++++ + E++ 
Sbjct: 101 EEDEIKRKRIEEDARNSDADDSDSSSDSDSSDDDSDDDDSEDET 144



 Score = 28.2 bits (63), Expect = 9.5
 Identities = 6/32 (18%), Positives = 14/32 (43%)

Query: 309 RREILGDEDEDDEDGDDEEDEDEEESEAEEGE 340
            R    D+ +   D D  +D+ +++   +E  
Sbjct: 114 ARNSDADDSDSSSDSDSSDDDSDDDDSEDETA 145


>gnl|CDD|177433 PHA02608, 67, prohead core protein; Provisional.
          Length = 80

 Score = 32.1 bits (73), Expect = 0.11
 Identities = 10/34 (29%), Positives = 27/34 (79%)

Query: 312 ILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
           I G+E EDD+D +D++D+D+++ + ++ + ++++
Sbjct: 45  IEGEEPEDDDDDEDDDDDDDKDDKDDDDDDDDED 78



 Score = 31.7 bits (72), Expect = 0.15
 Identities = 9/31 (29%), Positives = 23/31 (74%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
            ED+DD++ DD++D+ +++ + ++ + E+ E
Sbjct: 50  PEDDDDDEDDDDDDDKDDKDDDDDDDDEDDE 80



 Score = 27.1 bits (60), Expect = 6.3
 Identities = 8/24 (33%), Positives = 18/24 (75%)

Query: 314 GDEDEDDEDGDDEEDEDEEESEAE 337
            D+D+DD+  D ++D+D+++ + E
Sbjct: 57  EDDDDDDDKDDKDDDDDDDDEDDE 80



 Score = 26.7 bits (59), Expect = 9.6
 Identities = 8/29 (27%), Positives = 22/29 (75%)

Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKE 342
            D+D++D+D DD++D+ +++ + ++ + E
Sbjct: 52  DDDDDEDDDDDDDKDDKDDDDDDDDEDDE 80


>gnl|CDD|233222 TIGR00986, 3a0801s05tom22, mitochondrial import receptor subunit
           Tom22.  The mitochondrial protein translocase (MPT)
           family, which brings nuclearly encoded preproteins into
           mitochondria, is very complex with 19 currently
           identified protein constituents.These proteins include
           several chaperone proteins, four proteins of the outer
           membrane translocase (Tom) import receptor, five
           proteins of the Tom channel complex, five proteins of
           the inner membrane translocase (Tim) and three "motor"
           proteins. This family is specific for the Tom22 proteins
           [Transport and binding proteins, Amino acids, peptides
           and amines].
          Length = 145

 Score = 33.4 bits (76), Expect = 0.12
 Identities = 16/50 (32%), Positives = 27/50 (54%), Gaps = 7/50 (14%)

Query: 310 REILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETNLVALR 359
           ++ + + +EDDED  D + ED  +S+ E  E   +ET  D     + AL+
Sbjct: 17  QQTVVEPEEDDEDFTDVDSEDSVDSDFESLE---EETFTD----RIYALK 59


>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM).  This
           family consists of several Plasmodium falciparum SPAM
           (secreted polymorphic antigen associated with
           merozoites) proteins. Variation among SPAM alleles is
           the result of deletions and amino acid substitutions in
           non-repetitive sequences within and flanking the alanine
           heptad-repeat domain. Heptad repeats in which the a and
           d position contain hydrophobic residues generate
           amphipathic alpha-helices which give rise to helical
           bundles or coiled-coil structures in proteins. SPAM is
           an example of a P. falciparum antigen in which a
           repetitive sequence has features characteristic of a
           well-defined structural element.
          Length = 164

 Score = 33.7 bits (77), Expect = 0.13
 Identities = 20/74 (27%), Positives = 39/74 (52%), Gaps = 3/74 (4%)

Query: 280 DGVKDTQDILNVFQYDPEYLMNEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEG 339
           D +K+ +D+ +  Q D E    EE+      E   D ++++E  +DEE+E+E+E +  + 
Sbjct: 35  DIIKENEDVKDEKQEDDE--EEEEEDEEEIEEP-EDIEDEEEIVEDEEEEEEDEEDNVDL 91

Query: 340 EKENKETIIDNTET 353
           +   K+ I D   +
Sbjct: 92  KDIEKKNINDIFNS 105



 Score = 33.3 bits (76), Expect = 0.13
 Identities = 18/51 (35%), Positives = 28/51 (54%)

Query: 297 EYLMNEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETI 347
           E L+   K T+  +E +  E+ED +D   E+DE+EEE + EE E+      
Sbjct: 20  ENLLEHVKITSWDKEDIIKENEDVKDEKQEDDEEEEEEDEEEIEEPEDIED 70



 Score = 29.8 bits (67), Expect = 2.0
 Identities = 20/72 (27%), Positives = 40/72 (55%), Gaps = 8/72 (11%)

Query: 289 LNVFQYDPEYLMNEEKYTTLRREILGDEDEDDEDGDD-----EEDEDEEE-SEAEEGEKE 342
           + +  +D E ++ E +    + E   D++E++E+ ++     E+ EDEEE  E EE E+E
Sbjct: 26  VKITSWDKEDIIKENEDV--KDEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEE 83

Query: 343 NKETIIDNTETN 354
           ++E  +D  +  
Sbjct: 84  DEEDNVDLKDIE 95


>gnl|CDD|216269 pfam01056, Myc_N, Myc amino-terminal region.  The myc family
           belongs to the basic helix-loop-helix leucine zipper
           class of transcription factors, see pfam00010. Myc forms
           a heterodimer with Max, and this complex regulates cell
           growth through direct activation of genes involved in
           cell replication. Mutations in the C-terminal 20
           residues of this domain cause unique changes in the
           induction of apoptosis, transformation, and G2 arrest.
          Length = 329

 Score = 34.1 bits (78), Expect = 0.18
 Identities = 20/68 (29%), Positives = 29/68 (42%), Gaps = 14/68 (20%)

Query: 314 GDEDEDDEDGDDEEDEDEEES------EAEEGEKENKETIIDNTETNLVALRRTIYLTIH 367
           G + E +ED ++EE+E+EEE       E        K      +E+  V  RR      H
Sbjct: 223 GSDSESEEDEEEEEEEEEEEEIDVVTVEKRRSSSNRKA---STSESITVPSRRH-----H 274

Query: 368 SSLDFEEC 375
           S L  + C
Sbjct: 275 SPLVLKRC 282


>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein.  This is a family of
           fungal and plant proteins and contains many hypothetical
           proteins. VID27 is a cytoplasmic protein that plays a
           potential role in vacuolar protein degradation.
          Length = 794

 Score = 34.3 bits (79), Expect = 0.18
 Identities = 21/77 (27%), Positives = 38/77 (49%), Gaps = 23/77 (29%)

Query: 301 NEEKYTTL-------------------RREILGDEDEDDEDGDDEEDEDE----EESEAE 337
           N++K+T                           DE+E+DE+ ++EEDEDE    E S+ E
Sbjct: 357 NKQKWTKAKETEQDYILDAFSALEIEDANTERDDEEEEDEEEEEEEDEDEGPSKEHSDDE 416

Query: 338 EGEKENKETIIDNTETN 354
           E E+++ E+  ++++ N
Sbjct: 417 EFEEDDVESKYEDSDGN 433


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 34.0 bits (78), Expect = 0.19
 Identities = 18/61 (29%), Positives = 28/61 (45%), Gaps = 3/61 (4%)

Query: 296 PEYLMNE---EKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTE 352
           P YL  E   EK      E    E++ +E+    + E+E + E E+ EK+ K   +  T 
Sbjct: 22  PIYLWVEKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETT 81

Query: 353 T 353
           T
Sbjct: 82  T 82


>gnl|CDD|220284 pfam09538, FYDLN_acid, Protein of unknown function (FYDLN_acid).
           Members of this family are bacterial proteins with a
           conserved motif [KR]FYDLN, sometimes flanked by a pair
           of CXXC motifs, followed by a long region of low
           complexity sequence in which roughly half the residues
           are Asp and Glu, including multiple runs of five or more
           acidic residues. The function of members of this family
           is unknown.
          Length = 104

 Score = 31.9 bits (73), Expect = 0.20
 Identities = 9/30 (30%), Positives = 22/30 (73%)

Query: 311 EILGDEDEDDEDGDDEEDEDEEESEAEEGE 340
           +++ D+D+DD+D DD  D D+++ + ++ +
Sbjct: 65  DVVLDDDDDDDDDDDLPDLDDDDVDLDDDD 94



 Score = 30.0 bits (68), Expect = 0.90
 Identities = 6/29 (20%), Positives = 21/29 (72%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKEN 343
           DE+++D+   D++D+D+++ +  + + ++
Sbjct: 59  DEEDEDDVVLDDDDDDDDDDDLPDLDDDD 87



 Score = 30.0 bits (68), Expect = 1.0
 Identities = 9/23 (39%), Positives = 15/23 (65%)

Query: 311 EILGDEDEDDEDGDDEEDEDEEE 333
             L D+D D +D DD+  ED+++
Sbjct: 81  PDLDDDDVDLDDDDDDFLEDDDD 103



 Score = 29.2 bits (66), Expect = 1.8
 Identities = 7/37 (18%), Positives = 17/37 (45%)

Query: 302 EEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEE 338
            E       +   ++D   +D DD++D+D+     ++
Sbjct: 50  AEDAAKKDEDEEDEDDVVLDDDDDDDDDDDLPDLDDD 86



 Score = 29.2 bits (66), Expect = 1.9
 Identities = 9/42 (21%), Positives = 24/42 (57%)

Query: 317 DEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETNLVAL 358
           D +D    DE++EDE++   ++ + ++ +  + + + + V L
Sbjct: 49  DAEDAAKKDEDEEDEDDVVLDDDDDDDDDDDLPDLDDDDVDL 90



 Score = 28.8 bits (65), Expect = 2.4
 Identities = 8/37 (21%), Positives = 18/37 (48%)

Query: 306 TTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKE 342
               R    D ++  +  +DEEDED+   + ++ + +
Sbjct: 40  VAKSRAPAADAEDAAKKDEDEEDEDDVVLDDDDDDDD 76



 Score = 27.7 bits (62), Expect = 5.3
 Identities = 8/38 (21%), Positives = 21/38 (55%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTE 352
           D  + DED +DE+D   ++ + ++ + +  +   D+ +
Sbjct: 52  DAAKKDEDEEDEDDVVLDDDDDDDDDDDLPDLDDDDVD 89



 Score = 27.7 bits (62), Expect = 6.0
 Identities = 7/24 (29%), Positives = 15/24 (62%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEE 338
             D DD+D D ++D+D+   + ++
Sbjct: 80  LPDLDDDDVDLDDDDDDFLEDDDD 103



 Score = 27.3 bits (61), Expect = 9.4
 Identities = 8/26 (30%), Positives = 16/26 (61%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGE 340
           D  + D+D  D +D+D++  E ++ E
Sbjct: 79  DLPDLDDDDVDLDDDDDDFLEDDDDE 104



 Score = 26.9 bits (60), Expect = 9.9
 Identities = 7/41 (17%), Positives = 24/41 (58%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETNL 355
           +++ED++D   ++D+D+++ +      ++   + D+ +  L
Sbjct: 58  EDEEDEDDVVLDDDDDDDDDDDLPDLDDDDVDLDDDDDDFL 98


>gnl|CDD|217829 pfam03985, Paf1, Paf1.  Members of this family are components of
           the RNA polymerase II associated Paf1 complex. The Paf1
           complex functions during the elongation phase of
           transcription in conjunction with Spt4-Spt5 and
           Spt16-Pob3i.
          Length = 431

 Score = 33.9 bits (78), Expect = 0.22
 Identities = 9/45 (20%), Positives = 20/45 (44%)

Query: 301 NEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
                   R + +  E+ D+++ ++EE   +E  E E  + E + 
Sbjct: 353 KMRDKRRARLDPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEG 397



 Score = 31.6 bits (72), Expect = 1.2
 Identities = 13/28 (46%), Positives = 18/28 (64%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKE 342
           DEDE++E   DE +E+E E   EEG + 
Sbjct: 373 DEDEEEEQRSDEHEEEEGEDSEEEGSQS 400



 Score = 30.1 bits (68), Expect = 3.3
 Identities = 12/38 (31%), Positives = 18/38 (47%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTE 352
           DEDED+E+    ++ +EEE E  E E         +  
Sbjct: 371 DEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSES 408



 Score = 29.7 bits (67), Expect = 4.6
 Identities = 9/33 (27%), Positives = 21/33 (63%)

Query: 313 LGDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
             + DED+++ +++  ++ EE E E+ E+E  +
Sbjct: 367 FEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQ 399



 Score = 29.3 bits (66), Expect = 5.1
 Identities = 9/32 (28%), Positives = 18/32 (56%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKET 346
           D  + +E  +DE++E+E+ S+  E E+     
Sbjct: 363 DPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSE 394



 Score = 28.9 bits (65), Expect = 8.2
 Identities = 12/52 (23%), Positives = 21/52 (40%)

Query: 302 EEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTET 353
           EE+  +   E    ED ++E     ED   E S     + E+K      +++
Sbjct: 377 EEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESASDS 428


>gnl|CDD|217861 pfam04050, Upf2, Up-frameshift suppressor 2.  Transcripts
           harbouring premature signals for translation termination
           are recognised and rapidly degraded by eukaryotic cells
           through a pathway known as nonsense-mediated mRNA decay.
           In Saccharomyces cerevisiae, three trans-acting factors
           (Upf1 to Upf3) are required for nonsense-mediated mRNA
           decay.
          Length = 171

 Score = 32.7 bits (75), Expect = 0.24
 Identities = 10/31 (32%), Positives = 18/31 (58%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
           DE+  +ED DDE  ++EE    ++ + E  +
Sbjct: 13  DEELPEEDEDDESSDEEEVDLPDDEQDEESD 43



 Score = 32.0 bits (73), Expect = 0.40
 Identities = 14/39 (35%), Positives = 24/39 (61%)

Query: 311 EILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIID 349
           E L +EDEDDE  D+EE +  ++ + EE + E ++  + 
Sbjct: 14  EELPEEDEDDESSDEEEVDLPDDEQDEESDSEEEQIFVT 52



 Score = 31.6 bits (72), Expect = 0.59
 Identities = 8/39 (20%), Positives = 24/39 (61%)

Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTE 352
           G+EDE+  + D++++  +EE      +++++E+  +  +
Sbjct: 10  GEEDEELPEEDEDDESSDEEEVDLPDDEQDEESDSEEEQ 48



 Score = 30.8 bits (70), Expect = 0.99
 Identities = 11/39 (28%), Positives = 20/39 (51%)

Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTE 352
              DE++ D  D+E ++E +SE E+     +E  +D   
Sbjct: 24  ESSDEEEVDLPDDEQDEESDSEEEQIFVTRQEEEVDPEA 62



 Score = 30.8 bits (70), Expect = 1.1
 Identities = 9/24 (37%), Positives = 16/24 (66%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEE 338
           D+ E+DE+  +E+++DE   E E 
Sbjct: 8   DDGEEDEELPEEDEDDESSDEEEV 31



 Score = 30.8 bits (70), Expect = 1.2
 Identities = 11/32 (34%), Positives = 20/32 (62%)

Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
             +D ++++   EEDED+E S+ EE +  + E
Sbjct: 6   ESDDGEEDEELPEEDEDDESSDEEEVDLPDDE 37



 Score = 28.9 bits (65), Expect = 4.2
 Identities = 7/32 (21%), Positives = 19/32 (59%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKET 346
           D +ED+E  +++ED++  + E  +   + ++ 
Sbjct: 9   DGEEDEELPEEDEDDESSDEEEVDLPDDEQDE 40



 Score = 28.5 bits (64), Expect = 6.7
 Identities = 10/40 (25%), Positives = 22/40 (55%), Gaps = 1/40 (2%)

Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTET 353
             E +D E+ D+E  E++E+ E+ + E+ +      + E+
Sbjct: 4   ESESDDGEE-DEELPEEDEDDESSDEEEVDLPDDEQDEES 42


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
           subunit.  This is a family of proteins which are
           subunits of the eukaryotic translation initiation factor
           3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
           cerevisiae protein eIF3j (HCR1) has been shown to be
           required for processing of 20S pre-rRNA and binds to 18S
           rRNA and eIF3 subunits Rpg1p and Prt1p.
          Length = 242

 Score = 33.1 bits (76), Expect = 0.25
 Identities = 13/30 (43%), Positives = 20/30 (66%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENK 344
           DEDEDD+  D  ++E++EE E E+ +   K
Sbjct: 26  DEDEDDDVKDSWDEEEDEEKEEEKAKVAAK 55



 Score = 31.6 bits (72), Expect = 0.85
 Identities = 11/30 (36%), Positives = 18/30 (60%), Gaps = 1/30 (3%)

Query: 316 EDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
           +DED++D   +  ++EE+ E EE EK    
Sbjct: 25  DDEDEDDDVKDSWDEEEDEEKEE-EKAKVA 53


>gnl|CDD|225880 COG3343, RpoE, DNA-directed RNA polymerase, delta subunit
           [Transcription].
          Length = 175

 Score = 32.8 bits (75), Expect = 0.25
 Identities = 29/102 (28%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 235 LEKRVQYMIEVLAQVRKDNFKD-FPDVIEDLDLVPEEDKFTHLMTLDGVKDTQDILNVFQ 293
           +++ +Q M E      KD   D F +  ED             +  D  K+ ++   V  
Sbjct: 87  IDEEIQAMTEKKDIKAKDKEVDAFEEGDEDE------------LDYDEDKEEEEDDEVDS 134

Query: 294 YDPEYLMNEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESE 335
            D +   +E++      EIL ++DE DED DD+EDE++EE +
Sbjct: 135 LD-DENDDEDEDDDEIVEILIEDDEVDEDEDDDEDEEDEEDK 175



 Score = 29.3 bits (66), Expect = 3.7
 Identities = 17/69 (24%), Positives = 31/69 (44%), Gaps = 1/69 (1%)

Query: 285 TQDILNVFQYDPEYLMNEEKYTTLRREILGDEDEDDEDGDDE-EDEDEEESEAEEGEKEN 343
            +DI    +    +   +E       +   +ED++ +  DDE +DEDE++ E  E   E+
Sbjct: 97  KKDIKAKDKEVDAFEEGDEDELDYDEDKEEEEDDEVDSLDDENDDEDEDDDEIVEILIED 156

Query: 344 KETIIDNTE 352
            E   D  +
Sbjct: 157 DEVDEDEDD 165


>gnl|CDD|185638 PTZ00459, PTZ00459, mucin-associated surface protein (MASP);
           Provisional.
          Length = 291

 Score = 33.2 bits (75), Expect = 0.27
 Identities = 19/59 (32%), Positives = 32/59 (54%), Gaps = 4/59 (6%)

Query: 315 DEDEDDEDGDDE----EDEDEEESEAEEGEKENKETIIDNTETNLVALRRTIYLTIHSS 369
           D D+D E+GDD+    EDE+EE+   + G++        +TE  L+   +   L+I S+
Sbjct: 80  DVDDDSEEGDDDDGGAEDEEEEKVRGQSGQEGTVALGSGSTEKKLIGSEKQTELSISSA 138


>gnl|CDD|220223 pfam09405, Btz, CASC3/Barentsz eIF4AIII binding.  This domain is
           found on CASC3 (cancer susceptibility candidate gene 3
           protein) which is also known as Barentsz (Btz). CASC3 is
           a component of the EJC (exon junction complex) which is
           a complex that is involved in post-transcriptional
           regulation of mRNA in metazoa. The complex is formed by
           the association of four proteins (eIF4AIII, Barentsz,
           Mago, and Y14), mRNA, and ATP. This domain wraps around
           eIF4AIII and stacks against the 5' nucleotide.
          Length = 116

 Score = 31.6 bits (72), Expect = 0.29
 Identities = 7/32 (21%), Positives = 15/32 (46%)

Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
                  E  + +ED++ +  E E+ + E+ E
Sbjct: 8   SGRTPSAEPTEPKEDKERKRREHEKYDDEDDE 39


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
           family consists of several hypothetical bacterial
           proteins of around 200 residues in length. The function
           of this family is unknown.
          Length = 214

 Score = 32.8 bits (75), Expect = 0.29
 Identities = 18/52 (34%), Positives = 26/52 (50%)

Query: 301 NEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTE 352
           +E+K      E   +E E++ + +DEE  DE E E EE  + N E  I N  
Sbjct: 75  SEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTESNVEKEITNPS 126



 Score = 32.0 bits (73), Expect = 0.49
 Identities = 17/41 (41%), Positives = 25/41 (60%)

Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETN 354
             ED+ D + +DEE E+E E E EE   EN++   + TE+N
Sbjct: 77  DKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTESN 117



 Score = 31.6 bits (72), Expect = 0.83
 Identities = 10/39 (25%), Positives = 22/39 (56%)

Query: 316 EDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETN 354
           ++E+ E  + E+ ED+ ++E E+ E E +    D   ++
Sbjct: 66  KEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSD 104



 Score = 30.9 bits (70), Expect = 1.2
 Identities = 13/40 (32%), Positives = 28/40 (70%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETN 354
           +E+++  + +D+ED+ + E E EE E+EN+E   ++++ N
Sbjct: 67  EEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDEN 106



 Score = 28.9 bits (65), Expect = 4.9
 Identities = 12/40 (30%), Positives = 22/40 (55%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETN 354
           ++ ED  D + E++E EEE+E E+ E  ++       +T 
Sbjct: 76  EDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTE 115



 Score = 28.5 bits (64), Expect = 6.7
 Identities = 16/39 (41%), Positives = 21/39 (53%), Gaps = 1/39 (2%)

Query: 316 EDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETN 354
             ED ED  D E EDE ESE E  E++ + +  +  ET 
Sbjct: 74  NSEDKEDKGDAEKEDE-ESEEENEEEDEESSDENEKETE 111


>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein.  CDC45 is an essential gene
           required for initiation of DNA replication in S.
           cerevisiae, forming a complex with MCM5/CDC46.
           Homologues of CDC45 have been identified in human, mouse
           and smut fungus among others.
          Length = 583

 Score = 33.4 bits (77), Expect = 0.29
 Identities = 11/28 (39%), Positives = 21/28 (75%)

Query: 316 EDEDDEDGDDEEDEDEEESEAEEGEKEN 343
           + E+D+D D+E DE++EES   E ++++
Sbjct: 122 DLEEDDDDDEESDEEDEESSKSEDDEDD 149



 Score = 32.3 bits (74), Expect = 0.77
 Identities = 7/40 (17%), Positives = 21/40 (52%)

Query: 309 RREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETII 348
             E   D+DE+ ++ D+E  + E++ + ++ + ++     
Sbjct: 122 DLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATR 161



 Score = 32.3 bits (74), Expect = 0.77
 Identities = 12/31 (38%), Positives = 22/31 (70%)

Query: 310 REILGDEDEDDEDGDDEEDEDEEESEAEEGE 340
            +   D +EDD+D ++ ++EDEE S++E+ E
Sbjct: 117 DDAYRDLEEDDDDDEESDEEDEESSKSEDDE 147



 Score = 31.5 bits (72), Expect = 1.3
 Identities = 11/40 (27%), Positives = 25/40 (62%)

Query: 313 LGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTE 352
           L ++D+DDE+ D+E++E  +  + E+ + ++ +  I   E
Sbjct: 123 LEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATRE 162



 Score = 29.6 bits (67), Expect = 5.5
 Identities = 11/50 (22%), Positives = 26/50 (52%), Gaps = 3/50 (6%)

Query: 302 EEKYTTLRREILGDEDEDDED---GDDEEDEDEEESEAEEGEKENKETII 348
           ++ Y  L  +   DE+ D+ED      E+DED+++ + ++     + ++ 
Sbjct: 117 DDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATRERSLE 166



 Score = 29.6 bits (67), Expect = 5.7
 Identities = 15/63 (23%), Positives = 29/63 (46%), Gaps = 14/63 (22%)

Query: 312 ILGDEDEDDE--------------DGDDEEDEDEEESEAEEGEKENKETIIDNTETNLVA 357
           I  D D ++E              + DD++DE+ +E + E  + E+ E   D+ + + +A
Sbjct: 100 IFDDGDIEEELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIA 159

Query: 358 LRR 360
            R 
Sbjct: 160 TRE 162


>gnl|CDD|221620 pfam12527, DUF3727, Protein of unknown function (DUF3727).  This
           domain family is found in bacteria and eukaryotes, and
           is approximately 100 amino acids in length.
          Length = 100

 Score = 31.4 bits (72), Expect = 0.30
 Identities = 12/26 (46%), Positives = 18/26 (69%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGE 340
            E E+DE  + EE++D++E E EE E
Sbjct: 19  PEPEEDEILELEEEDDDDEDEEEEYE 44



 Score = 27.6 bits (62), Expect = 7.0
 Identities = 9/24 (37%), Positives = 17/24 (70%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEE 338
           +EDE  E  ++++D+++EE E E 
Sbjct: 22  EEDEILELEEEDDDDEDEEEEYEL 45


>gnl|CDD|217783 pfam03896, TRAP_alpha, Translocon-associated protein (TRAP), alpha
           subunit.  The alpha-subunit of the TRAP complex (TRAP
           alpha) is a single-spanning membrane protein of the
           endoplasmic reticulum (ER) which is found in proximity
           of nascent polypeptide chains translocating across the
           membrane.
          Length = 281

 Score = 33.2 bits (76), Expect = 0.31
 Identities = 21/53 (39%), Positives = 25/53 (47%), Gaps = 1/53 (1%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETNLVALRRTIYLTIH 367
           D  ED+E  DD  DEDEE+ EA   E EN+ T  +  E   V        TI 
Sbjct: 33  DLTEDEEAEDDVVDEDEED-EAVVEEDENELTEEEEDEEGEVKASPDADTTIL 84



 Score = 29.7 bits (67), Expect = 3.2
 Identities = 11/36 (30%), Positives = 18/36 (50%)

Query: 311 EILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKET 346
           E   D+  D+++ D+   E++E    EE E E  E 
Sbjct: 38  EEAEDDVVDEDEEDEAVVEEDENELTEEEEDEEGEV 73


>gnl|CDD|237047 PRK12298, obgE, GTPase CgtA; Reviewed.
          Length = 390

 Score = 33.3 bits (77), Expect = 0.33
 Identities = 25/102 (24%), Positives = 40/102 (39%), Gaps = 28/102 (27%)

Query: 260 VIEDLDLVPEEDKFTHLMTL------------------DGVKD-TQDILNVFQYDPEYLM 300
           V   +DL+ EE+       +                   GVK+   D++   + +P    
Sbjct: 281 VFNKIDLLDEEEAEERAKAIVEALGWEGPVYLISAASGLGVKELCWDLMTFIEENPREEA 340

Query: 301 NEEKYT--------TLRREIL-GDEDEDDEDGDDEEDEDEEE 333
            E +             RE L   E+EDD+D DD+ DED++E
Sbjct: 341 EEAEAPEKVEFMWDDYHREQLEEVEEEDDDDWDDDWDEDDDE 382



 Score = 29.1 bits (66), Expect = 5.9
 Identities = 10/38 (26%), Positives = 20/38 (52%)

Query: 295 DPEYLMNEEKYTTLRREILGDEDEDDEDGDDEEDEDEE 332
             E++ ++     L      D+D+ D+D D+++DE  E
Sbjct: 348 KVEFMWDDYHREQLEEVEEEDDDDWDDDWDEDDDEGVE 385


>gnl|CDD|151714 pfam11273, DUF3073, Protein of unknown function (DUF3073).  This
           family of proteins with unknown function appears to be
           restricted to Actinobacteria.
          Length = 64

 Score = 30.3 bits (69), Expect = 0.36
 Identities = 10/38 (26%), Positives = 18/38 (47%), Gaps = 2/38 (5%)

Query: 305 YTTLRREILG--DEDEDDEDGDDEEDEDEEESEAEEGE 340
              L+RE+ G       D+D   E+D D+ +  A+  +
Sbjct: 25  LDALQRELSGGSSRSSADDDDQYEDDYDDYDKYADYDD 62


>gnl|CDD|117592 pfam09026, Cenp-B_dimeris, Centromere protein B dimerisation
           domain.  The centromere protein B (CENP-B) dimerisation
           domain is composed of two alpha-helices, which are
           folded into an antiparallel configuration. Dimerisation
           of CENP-B is mediated by this domain, in which monomers
           dimerise to form a symmetrical, antiparallel, four-helix
           bundle structure with a large hydrophobic patch in which
           23 residues of one monomer form van der Waals contacts
           with the other monomer. This CENP-B dimer configuration
           may be suitable for capturing two distant CENP-B boxes
           during centromeric heterochromatin formation.
          Length = 101

 Score = 31.3 bits (70), Expect = 0.36
 Identities = 11/28 (39%), Positives = 21/28 (75%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKE 342
           D D D+E+ DD+EDE++++ + +E + E
Sbjct: 13  DSDSDEEEDDDDEDEEDDDEDDDEDDDE 40



 Score = 31.3 bits (70), Expect = 0.37
 Identities = 11/34 (32%), Positives = 24/34 (70%)

Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENKETI 347
           G+ED D +  ++E+D+DE+E + +E + E+ + +
Sbjct: 8   GEEDSDSDSDEEEDDDDEDEEDDDEDDDEDDDEV 41



 Score = 30.1 bits (67), Expect = 0.90
 Identities = 15/36 (41%), Positives = 24/36 (66%)

Query: 298 YLMNEEKYTTLRREILGDEDEDDEDGDDEEDEDEEE 333
           +L  EE   +   E   D+DED+ED D+++DED++E
Sbjct: 5   FLEGEEDSDSDSDEEEDDDDEDEEDDDEDDDEDDDE 40



 Score = 30.1 bits (67), Expect = 0.95
 Identities = 10/30 (33%), Positives = 22/30 (73%)

Query: 316 EDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
           E E+D D D +E+ED+++ + E+ ++++ E
Sbjct: 7   EGEEDSDSDSDEEEDDDDEDEEDDDEDDDE 36



 Score = 29.0 bits (64), Expect = 2.1
 Identities = 11/30 (36%), Positives = 21/30 (70%)

Query: 313 LGDEDEDDEDGDDEEDEDEEESEAEEGEKE 342
           L  E++ D D D+EED+D+E+ E ++ + +
Sbjct: 6   LEGEEDSDSDSDEEEDDDDEDEEDDDEDDD 35


>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P.  This model
           represents the L12P protein of the large (50S) subunit
           of the archaeal ribosome.
          Length = 105

 Score = 31.2 bits (71), Expect = 0.41
 Identities = 11/25 (44%), Positives = 18/25 (72%)

Query: 316 EDEDDEDGDDEEDEDEEESEAEEGE 340
             E++E+ ++EE+E+EEESE E   
Sbjct: 74  AAEEEEEEEEEEEEEEEESEEEAMA 98



 Score = 29.2 bits (66), Expect = 1.7
 Identities = 11/24 (45%), Positives = 19/24 (79%)

Query: 316 EDEDDEDGDDEEDEDEEESEAEEG 339
           E+E++E+ ++EE+E+E E EA  G
Sbjct: 76  EEEEEEEEEEEEEEEESEEEAMAG 99


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 33.4 bits (76), Expect = 0.42
 Identities = 27/115 (23%), Positives = 51/115 (44%), Gaps = 15/115 (13%)

Query: 244  EVLAQVRKDNFKDFPDVIEDLDLVPEEDKF-THLMTLDGVKDTQDI-LNVFQYDPEYLMN 301
            E+   ++++N +  P+  EDLDL PE+ K       +    D +D+ +     + E    
Sbjct: 3968 EIQPDIQENNSQPPPE-NEDLDL-PEDLKLDEKEGDVSKDSDLEDMDMEAADENKEEADA 4025

Query: 302  E-----------EKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
            E           E+  TL  +I  D+  D  + D++ +ED  E   +E E+  ++
Sbjct: 4026 EKDEPMQDEDPLEENNTLDEDIQQDDFSDLAEDDEKMNEDGFEENVQENEESTED 4080



 Score = 32.7 bits (74), Expect = 0.70
 Identities = 24/97 (24%), Positives = 45/97 (46%), Gaps = 4/97 (4%)

Query: 259  DVIEDLDLVPEEDKFTHLMTLDGVKDTQDILNVFQYDPEYLMNEEKYTTLRREILGDEDE 318
            D+ E+++ +PE+   +    L    + +D+L   Q   E      +   + +E   D+++
Sbjct: 3887 DLDEEVNDIPEDLSNSLNEKLWDEPNEEDLLETEQKSNEQSAANNESDLVSKE---DDNK 3943

Query: 319  DDEDGDDEEDEDEEESEAEEG-EKENKETIIDNTETN 354
              ED D +E EDEEE   + G + E +  I +N    
Sbjct: 3944 ALEDKDRQEKEDEEEMSDDVGIDDEIQPDIQENNSQP 3980



 Score = 29.6 bits (66), Expect = 6.9
 Identities = 17/41 (41%), Positives = 23/41 (56%), Gaps = 4/41 (9%)

Query: 315  DEDEDDEDGDDE-EDEDEEESEAEEGEKENKETIIDNTETN 354
             +D D ED D E  DE++EE++AE+ E    E   D  E N
Sbjct: 4003 SKDSDLEDMDMEAADENKEEADAEKDEPMQDE---DPLEEN 4040


>gnl|CDD|235548 PRK05657, PRK05657, RNA polymerase sigma factor RpoS; Validated.
          Length = 325

 Score = 32.6 bits (75), Expect = 0.43
 Identities = 13/56 (23%), Positives = 26/56 (46%), Gaps = 9/56 (16%)

Query: 306 TTLRREILGDEDEDDEDG---------DDEEDEDEEESEAEEGEKENKETIIDNTE 352
            TL+   L ++ + DE+G          +EE  D + +E E   +   + ++D T+
Sbjct: 1   MTLKVHDLNEDADFDENGVEVFDEKALVEEEPSDNDLAEEELLSQGATQRVLDATQ 56


>gnl|CDD|220972 pfam11081, DUF2890, Protein of unknown function (DUF2890).  This
           family is conserved in dsDNA adenoviruses of
           vertebrates. The function is not known.
          Length = 172

 Score = 31.8 bits (72), Expect = 0.53
 Identities = 15/28 (53%), Positives = 17/28 (60%)

Query: 311 EILGDEDEDDEDGDDEEDEDEEESEAEE 338
           E   +E ED ED  DEEDE+ EE E E 
Sbjct: 32  EEDEEEMEDWEDSLDEEDEEAEEVEEET 59



 Score = 28.0 bits (62), Expect = 9.4
 Identities = 14/37 (37%), Positives = 22/37 (59%)

Query: 302 EEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEE 338
           EE + +   E+  DE+E ++  D  ++EDEE  E EE
Sbjct: 21  EEDWDSQAEEVEEDEEEMEDWEDSLDEEDEEAEEVEE 57


>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
          Length = 619

 Score = 32.5 bits (75), Expect = 0.58
 Identities = 17/61 (27%), Positives = 27/61 (44%), Gaps = 8/61 (13%)

Query: 286 QDILNVFQYDPEYLMNEEKYTTLRREIL-GDEDEDDEDGDDEEDEDEEESEAEEGEKENK 344
             IL    YD   L N E      RE++ G  D + E+       + EE + +E E+E +
Sbjct: 150 DAILE--WYDR--LENGE---RRLRELIDGFVDPNAEEDPAHVGSELEELDDDEDEEEEE 202

Query: 345 E 345
           +
Sbjct: 203 D 203



 Score = 32.1 bits (74), Expect = 0.89
 Identities = 19/79 (24%), Positives = 33/79 (41%), Gaps = 4/79 (5%)

Query: 310 REILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETII---DNTETNLVALRRTIYLTI 366
             +  + +E D+D D+EE+EDE +      E E  E ++            LR+     +
Sbjct: 183 AHVGSELEELDDDEDEEEEEDENDDSLAADESELPEKVLEKFKALAKQYKKLRKAQEKKV 242

Query: 367 HSSLDFEECAHKLLRMQLK 385
              L   +  +  LR +LK
Sbjct: 243 EGRLAQHK-KYAKLREKLK 260



 Score = 29.4 bits (67), Expect = 6.3
 Identities = 7/38 (18%), Positives = 15/38 (39%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTE 352
            +D    + + +   DE+  E       + E+ I  T+
Sbjct: 66  ADDLLLAENEADAQTDEDAEEEAAAALSSVESEIGRTD 103


>gnl|CDD|218379 pfam05009, EBV-NA3, Epstein-Barr virus nuclear antigen 3 (EBNA-3). 
           This family contains EBNA-3A, -3B, and -3C which are
           latent infection nuclear proteins important for
           Epstein-Barr virus (EBV)-induced B-cell immortalisation
           and the immune response to EBV infection.
          Length = 254

 Score = 32.0 bits (73), Expect = 0.61
 Identities = 13/46 (28%), Positives = 19/46 (41%), Gaps = 1/46 (2%)

Query: 308 LRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTET 353
           L    L    E D+D   E   + E+SE+ E + E    I+   E 
Sbjct: 199 LGLSALRTPGEPDDDATVETSSESEDSES-ESDDEELPYIVPRMEP 243


>gnl|CDD|152814 pfam12379, DUF3655, Protein of unknown function (DUF3655).  This
           domain family is found in viruses, and is approximately
           70 amino acids in length. The family is found in
           association with pfam08716, pfam01661, pfam05409,
           pfam06471, pfam08717, pfam06478, pfam09401, pfam06460,
           pfam08715, pfam08710.
          Length = 70

 Score = 29.6 bits (66), Expect = 0.62
 Identities = 13/29 (44%), Positives = 20/29 (68%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKEN 343
           DE+ED E+ + EE+E EE  E E G +++
Sbjct: 9   DEEEDCEEYECEEEEIEETCEHEYGTEDD 37


>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
            [Transcription / DNA replication, recombination, and
            repair / Chromatin structure and dynamics].
          Length = 1001

 Score = 32.7 bits (74), Expect = 0.64
 Identities = 7/30 (23%), Positives = 16/30 (53%)

Query: 315  DEDEDDEDGDDEEDEDEEESEAEEGEKENK 344
            ++  D+EDG+D ++ + + +      K  K
Sbjct: 972  NDSSDEEDGEDWDELESKAAYDSRPGKRRK 1001



 Score = 30.0 bits (67), Expect = 4.2
 Identities = 8/37 (21%), Positives = 19/37 (51%)

Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDN 350
            DE  +D   D+ E++  +E + E+ ++   +   D+
Sbjct: 958 SDESSEDLSEDESENDSSDEEDGEDWDELESKAAYDS 994



 Score = 29.6 bits (66), Expect = 5.4
 Identities = 10/48 (20%), Positives = 25/48 (52%)

Query: 295 DPEYLMNEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKE 342
           D     +E++ +    E L +++ +++  D+E+ ED +E E++     
Sbjct: 947 DESDETDEDEESDESSEDLSEDESENDSSDEEDGEDWDELESKAAYDS 994



 Score = 29.2 bits (65), Expect = 8.2
 Identities = 13/32 (40%), Positives = 22/32 (68%), Gaps = 1/32 (3%)

Query: 315 DEDEDDEDGDD-EEDEDEEESEAEEGEKENKE 345
           DE ++DE+ D+  ED  E+ESE +  ++E+ E
Sbjct: 950 DETDEDEESDESSEDLSEDESENDSSDEEDGE 981


>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen.  This
           family consists of several Theileria P67 surface
           antigens. A stage specific surface antigen of Theileria
           parva, p67, is the basis for the development of an
           anti-sporozoite vaccine for the control of East Coast
           fever (ECF) in cattle. The antigen has been shown to
           contain five distinct linear peptide sequences
           recognised by sporozoite-neutralising murine monoclonal
           antibodies.
          Length = 727

 Score = 32.7 bits (74), Expect = 0.64
 Identities = 9/30 (30%), Positives = 19/30 (63%)

Query: 317 DEDDEDGDDEEDEDEEESEAEEGEKENKET 346
             D E+ DD+ +E++ +S + +  K +K+T
Sbjct: 113 KTDSEEDDDDSEEEDNKSTSSKDGKGSKKT 142


>gnl|CDD|234064 TIGR02936, fdxN_nitrog, ferredoxin III, nif-specific.  Members of
           this family are homodimeric ferredoxins from nitrogen
           fixation regions of many nitrogen-fixing bacteria. As
           characterized in Rhodobacter capsulatus, these proteins
           are homodimeric, with two 4Fe-4S clusters bound per
           monomer. Although nif-specific, this protein family is
           not usiveral, as other nitrogenase systems may
           substitute flavodoxins, or different types of ferredoxin
           [Central intermediary metabolism, Nitrogen fixation].
          Length = 91

 Score = 30.0 bits (68), Expect = 0.68
 Identities = 11/51 (21%), Positives = 25/51 (49%), Gaps = 9/51 (17%)

Query: 294 YDPEYLM--NEEK-------YTTLRREILGDEDEDDEDGDDEEDEDEEESE 335
           + P+++   ++EK       Y    R++L  +  ++E      D+D++E E
Sbjct: 10  WTPQFVTSIDQEKCIGCGRCYKVCGRDVLTLKGINEEGELVASDDDDDEIE 60


>gnl|CDD|218771 pfam05835, Synaphin, Synaphin protein.  This family consists of
           several eukaryotic synaphin 1 and 2 proteins.
           Synaphin/complexin is a cytosolic protein that
           preferentially binds to syntaxin within the SNARE
           complex. Synaphin promotes SNAREs to form precomplexes
           that oligomerise into higher order structures. A peptide
           from the central, syntaxin binding domain of synaphin
           competitively inhibits these two proteins from
           interacting and prevents SNARE complexes from
           oligomerising. It is thought that oligomerisation of
           SNARE complexes into a higher order structure creates a
           SNARE scaffold for efficient, regulated fusion of
           synaptic vesicles. Synaphin promotes neuronal exocytosis
           by promoting interaction between the complementary
           syntaxin and synaptobrevin transmembrane regions that
           reside in opposing membranes prior to fusion.
          Length = 139

 Score = 31.0 bits (70), Expect = 0.71
 Identities = 16/31 (51%), Positives = 20/31 (64%), Gaps = 2/31 (6%)

Query: 314 GDEDEDDEDGDDEEDEDEEES--EAEEGEKE 342
           G EDE DE   +EEDE+ +E+  EAEE  K 
Sbjct: 22  GKEDEGDESDAEEEDEEIQEALREAEEERKA 52



 Score = 29.5 bits (66), Expect = 2.3
 Identities = 11/28 (39%), Positives = 16/28 (57%)

Query: 318 EDDEDGDDEEDEDEEESEAEEGEKENKE 345
              ED  DE D +EE+ E +E  +E +E
Sbjct: 21  GGKEDEGDESDAEEEDEEIQEALREAEE 48



 Score = 28.3 bits (63), Expect = 6.2
 Identities = 10/33 (30%), Positives = 21/33 (63%)

Query: 313 LGDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
           LG ++++ ++ D EE+++E +    E E+E K 
Sbjct: 20  LGGKEDEGDESDAEEEDEEIQEALREAEEERKA 52


>gnl|CDD|239201 cd02803, OYE_like_FMN_family, Old yellow enzyme (OYE)-like FMN
           binding domain. OYE was the first flavin-dependent
           enzyme identified, however its true physiological role
           remains elusive to this day.  Each monomer of OYE
           contains FMN as a non-covalently bound cofactor, uses
           NADPH as a reducing agent with oxygens, quinones, and
           alpha,beta-unsaturated aldehydes and ketones, and can
           act as electron acceptors in the catalytic reaction.
           Members of OYE family include trimethylamine
           dehydrogenase, 2,4-dienoyl-CoA reductase, enoate
           reductase, pentaerythriol tetranitrate reductase,
           xenobiotic reductase, and morphinone reductase.
          Length = 327

 Score = 32.2 bits (74), Expect = 0.74
 Identities = 17/46 (36%), Positives = 24/46 (52%), Gaps = 9/46 (19%)

Query: 219 LESFFS------TDVYNSSKGKLEKRVQYMIEVLAQVRKDNFKDFP 258
           L  F S      TD Y    G LE R ++++E++A VR+    DFP
Sbjct: 168 LSQFLSPYTNKRTDEYG---GSLENRARFLLEIVAAVREAVGPDFP 210


>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
           Transcription initiation factor IIA (TFIIA) is a
           heterotrimer, the three subunits being known as alpha,
           beta, and gamma, in order of molecular weight. The N and
           C-terminal domains of the gamma subunit are represented
           in pfam02268 and pfam02751, respectively. This family
           represents the precursor that yields both the alpha and
           beta subunits. The TFIIA heterotrimer is an essential
           general transcription initiation factor for the
           expression of genes transcribed by RNA polymerase II.
           Together with TFIID, TFIIA binds to the promoter region;
           this is the first step in the formation of a
           pre-initiation complex (PIC). Binding of the rest of the
           transcription machinery follows this step. After
           initiation, the PIC does not completely dissociate from
           the promoter. Some components, including TFIIA, remain
           attached and re-initiate a subsequent round of
           transcription.
          Length = 332

 Score = 32.0 bits (73), Expect = 0.76
 Identities = 9/31 (29%), Positives = 16/31 (51%)

Query: 313 LGDEDEDDEDGDDEEDEDEEESEAEEGEKEN 343
             DED  + D DD +D+  +E   +  + +N
Sbjct: 259 DDDEDAIESDLDDSDDDVSDEDGEDLFDTDN 289



 Score = 30.9 bits (70), Expect = 1.5
 Identities = 9/24 (37%), Positives = 13/24 (54%)

Query: 310 REILGDEDEDDEDGDDEEDEDEEE 333
             I  D++ D  D DD+ED  E +
Sbjct: 245 DGIDSDDEGDGSDDDDDEDAIESD 268



 Score = 30.1 bits (68), Expect = 3.0
 Identities = 11/34 (32%), Positives = 20/34 (58%), Gaps = 1/34 (2%)

Query: 302 EEKYTTLRREILGDEDEDDEDGDDEEDEDEEESE 335
            ++ T  + + +  +DE D   DD++DED  ES+
Sbjct: 236 SKRRTIAQIDGIDSDDEGDGS-DDDDDEDAIESD 268



 Score = 29.7 bits (67), Expect = 4.0
 Identities = 10/31 (32%), Positives = 13/31 (41%), Gaps = 6/31 (19%)

Query: 314 GDEDEDDE------DGDDEEDEDEEESEAEE 338
            D+DED          DD  DED E+    +
Sbjct: 258 DDDDEDAIESDLDDSDDDVSDEDGEDLFDTD 288


>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
           chain; Provisional.
          Length = 1033

 Score = 32.5 bits (74), Expect = 0.78
 Identities = 13/41 (31%), Positives = 25/41 (60%)

Query: 301 NEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEK 341
           +EE+   + R    D D+D+   +DE++++E++ EAE   K
Sbjct: 14  DEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAK 54



 Score = 29.8 bits (67), Expect = 5.0
 Identities = 15/59 (25%), Positives = 28/59 (47%)

Query: 284 DTQDILNVFQYDPEYLMNEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKE 342
           + Q +  +++ +  YLM   +   L+  I   E E+  D    E+++E+E   EEG   
Sbjct: 768 NVQRLTELYEKEVRYLMQAHQKGQLKDTIDVAEPEEPGDPLTAEEQEEKEQLLEEGFST 826


>gnl|CDD|219939 pfam08619, Nha1_C, Alkali metal cation/H+ antiporter Nha1 C
           terminus.  The C terminus of the plasma membrane Nha1
           antiporter plays an important role in the immediate cell
           response to hypo-osmotic shock which prevents an
           execessive loss of ions and water. This domain is found
           with pfam00999.
          Length = 430

 Score = 32.1 bits (73), Expect = 0.80
 Identities = 11/31 (35%), Positives = 18/31 (58%)

Query: 307 TLRREILGDEDEDDEDGDDEEDEDEEESEAE 337
             R     D +++D + DD + +DEEE+ AE
Sbjct: 379 PSRPRSRRDSEDEDTERDDSDSDDEEETPAE 409


>gnl|CDD|218598 pfam05470, eIF-3c_N, Eukaryotic translation initiation factor 3
           subunit 8 N-terminus.  The largest of the mammalian
           translation initiation factors, eIF3, consists of at
           least eight subunits ranging in mass from 35 to 170 kDa.
           eIF3 binds to the 40 S ribosome in an early step of
           translation initiation and promotes the binding of
           methionyl-tRNAi and mRNA.
          Length = 593

 Score = 32.1 bits (73), Expect = 0.85
 Identities = 24/68 (35%), Positives = 35/68 (51%), Gaps = 12/68 (17%)

Query: 283 KDTQDILNVFQYDPEYLMNEEKYTTLRREILGDEDEDDED-GDDEEDEDEEESEAEEGEK 341
           K  +D +  ++ DPE    EE           +EDEDD+D G D+EDEDE+   A E   
Sbjct: 122 KQFEDDITRYREDPESEDEEE-----------EEDEDDDDDGSDDEDEDEDGVGATEEVA 170

Query: 342 ENKETIID 349
            + E+ +D
Sbjct: 171 ASSESGVD 178


>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi.  This family includes
           the fifth essential DNA polymerase in yeast EC:2.7.7.7.
           Pol5p is localised exclusively to the nucleolus and
           binds near or at the enhancer region of rRNA-encoding
           DNA repeating units.
          Length = 784

 Score = 32.2 bits (73), Expect = 0.89
 Identities = 22/79 (27%), Positives = 37/79 (46%), Gaps = 13/79 (16%)

Query: 273 FTHL---MTLDGVKDTQDILNVFQYDPEYLMNEEKYTTLRREILGDEDEDD-EDGDDEED 328
           FT+    +T +G++   D+L           +E K    +     +EDEDD E+ DD+ED
Sbjct: 613 FTYFIEDVTNEGLQLILDVLK---------ADENKSRHQQLFEGEEEDEDDLEETDDDED 663

Query: 329 EDEEESEAEEGEKENKETI 347
           E E   ++E   + + E  
Sbjct: 664 ECEAIEDSESESESDGEDG 682



 Score = 31.0 bits (70), Expect = 2.2
 Identities = 10/25 (40%), Positives = 17/25 (68%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEG 339
           + + + +  D EEDE E+++EA EG
Sbjct: 672 ESESESDGEDGEEDEQEDDAEANEG 696


>gnl|CDD|236876 PRK11191, PRK11191, RNase E inhibitor protein; Provisional.
          Length = 138

 Score = 30.7 bits (70), Expect = 0.90
 Identities = 10/20 (50%), Positives = 14/20 (70%)

Query: 316 EDEDDEDGDDEEDEDEEESE 335
           ED + EDGDDE+  DE++  
Sbjct: 116 EDPNAEDGDDEDFVDEDDDG 135


>gnl|CDD|223003 PHA03169, PHA03169, hypothetical protein; Provisional.
          Length = 413

 Score = 31.9 bits (72), Expect = 1.0
 Identities = 6/30 (20%), Positives = 11/30 (36%)

Query: 317 DEDDEDGDDEEDEDEEESEAEEGEKENKET 346
           DE  E       +    +  +  E E++ T
Sbjct: 209 DEPGEPQSPTPQQAPSPNTQQAVEHEDEPT 238



 Score = 31.1 bits (70), Expect = 1.8
 Identities = 5/31 (16%), Positives = 7/31 (22%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
             +           +   E   EE E    E
Sbjct: 154 SHNPSPNQQPSSFLQPSHEDSPEEPEPPTSE 184



 Score = 30.7 bits (69), Expect = 2.2
 Identities = 7/33 (21%), Positives = 10/33 (30%)

Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENKET 346
                  ED  +E +    E E +       ET
Sbjct: 165 SFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSET 197


>gnl|CDD|191365 pfam05756, S-antigen, S-antigen protein.  S-antigens are heat
           stable proteins that are found in the blood of
           individuals infected with malaria.
          Length = 94

 Score = 29.5 bits (66), Expect = 1.0
 Identities = 16/40 (40%), Positives = 26/40 (65%), Gaps = 1/40 (2%)

Query: 303 EKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKE 342
           +KY  L+ E  G E++D+ED + EE  ++EE+   EG+ E
Sbjct: 55  QKYEDLQEEGEG-ENDDEEDSNSEESNNDEENGLIEGQGE 93


>gnl|CDD|218555 pfam05320, Pox_RNA_Pol_19, Poxvirus DNA-directed RNA polymerase 19
           kDa subunit.  This family contains several DNA-directed
           RNA polymerase 19 kDa polypeptides. The Poxvirus
           DNA-directed RNA polymerase (EC: 2.7.7.6) catalyzes
           DNA-template-directed extension of the 3'-end of an RNA
           strand by one nucleotide at a time.
          Length = 167

 Score = 30.8 bits (70), Expect = 1.0
 Identities = 11/41 (26%), Positives = 20/41 (48%)

Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETN 354
            D  + + D DD E+ +EEE + E+ E      +    ++N
Sbjct: 5   DDIIDYESDDDDSEEYEEEEEDEEDAESLESSDVSSLKQSN 45


>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2.  PPP4R2 (protein phosphatase 4 core
           regulatory subunit R2) is the regulatory subunit of the
           histone H2A phosphatase complex. It has been shown to
           confer resistance to the anticancer drug cisplatin in
           yeast, and may confer resistance in higher eukaryotes.
          Length = 285

 Score = 31.3 bits (71), Expect = 1.1
 Identities = 16/48 (33%), Positives = 29/48 (60%), Gaps = 2/48 (4%)

Query: 301 NEEKYTTLRREILGDEDEDDEDGDDEEDE--DEEESEAEEGEKENKET 346
           +  K + ++ +   DE++DD+DGD  E++   E+E E E  E+E +E 
Sbjct: 235 DGPKGSDVKNKKSDDEEDDDQDGDYVEEKELKEDEEEEETEEEEEEED 282



 Score = 30.6 bits (69), Expect = 1.8
 Identities = 12/24 (50%), Positives = 17/24 (70%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEE 338
           +E E  ED ++EE E+EEE E E+
Sbjct: 261 EEKELKEDEEEEETEEEEEEEDED 284



 Score = 30.6 bits (69), Expect = 1.8
 Identities = 10/27 (37%), Positives = 18/27 (66%)

Query: 311 EILGDEDEDDEDGDDEEDEDEEESEAE 337
            +   E ++DE+ ++ E+E+EEE E E
Sbjct: 259 YVEEKELKEDEEEEETEEEEEEEDEDE 285



 Score = 29.0 bits (65), Expect = 6.8
 Identities = 14/38 (36%), Positives = 21/38 (55%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTE 352
           D++EDD+   D  +E E + + EE E E +E   D  E
Sbjct: 248 DDEEDDDQDGDYVEEKELKEDEEEEETEEEEEEEDEDE 285



 Score = 28.6 bits (64), Expect = 7.9
 Identities = 12/39 (30%), Positives = 22/39 (56%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTET 353
           ++  DDE+ DD++ +  EE E +E E+E +    +  E 
Sbjct: 244 NKKSDDEEDDDQDGDYVEEKELKEDEEEEETEEEEEEED 282


>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
          Length = 991

 Score = 32.0 bits (72), Expect = 1.1
 Identities = 10/26 (38%), Positives = 19/26 (73%)

Query: 312 ILGDEDEDDEDGDDEEDEDEEESEAE 337
           I+ ++D++ E+ + E D DE++S AE
Sbjct: 354 IIVEDDDESEEIESECDPDEDKSGAE 379


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 30.8 bits (70), Expect = 1.2
 Identities = 19/99 (19%), Positives = 36/99 (36%), Gaps = 29/99 (29%)

Query: 294 YDPEYLMNEEKYTTLRREI-------------------LGDEDEDDEDGDDEEDEDEEES 334
           YD EY   ++K   L  EI                      + + D+D  D++ +D+ E 
Sbjct: 53  YDAEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEK 112

Query: 335 EAEEGEKENKETI----------IDNTETNLVALRRTIY 363
           + E+  ++  E +          +   +    AL + IY
Sbjct: 113 KDEKEAEDKLEDLTKSYSETLSTLSELKPRKYALHKDIY 151


>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
          Length = 106

 Score = 29.5 bits (67), Expect = 1.3
 Identities = 8/26 (30%), Positives = 16/26 (61%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGE 340
               +++  ++EE+E++EESE E   
Sbjct: 74  AAAAEEKKEEEEEEEEKEESEEEAAA 99



 Score = 28.8 bits (65), Expect = 2.9
 Identities = 9/24 (37%), Positives = 16/24 (66%)

Query: 316 EDEDDEDGDDEEDEDEEESEAEEG 339
            +E  E+ ++EE+++E E EA  G
Sbjct: 77  AEEKKEEEEEEEEKEESEEEAAAG 100


>gnl|CDD|218003 pfam04281, Tom22, Mitochondrial import receptor subunit Tom22.  The
           mitochondrial protein translocase family, which is
           responsible for movement of nuclear encoded pre-proteins
           into mitochondria, is very complex with at least 19
           components. These proteins include several chaperone
           proteins, four proteins of the outer membrane
           translocase (Tom) import receptor, five proteins of the
           Tom channel complex, five proteins of the inner membrane
           translocase (Tim) and three "motor" proteins. This
           family represents the Tom22 proteins. The N terminal
           region of Tom22 has been shown to have chaperone-like
           activity, and the C terminal region faces the
           intermembrane face.
          Length = 136

 Score = 30.3 bits (69), Expect = 1.3
 Identities = 16/58 (27%), Positives = 28/58 (48%), Gaps = 10/58 (17%)

Query: 302 EEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETNLVALR 359
           +EK    +     + D+DDED  D + +  ++S+ E       ET+ D     +VAL+
Sbjct: 13  QEKPAAPKNLAQEESDDDDEDDTDTDSDISDDSDFEN------ETLYD----RIVALK 60


>gnl|CDD|227494 COG5165, POB3, Nucleosome-binding factor SPN, POB3 subunit
           [Transcription / DNA replication, recombination, and
           repair / Chromatin structure and dynamics].
          Length = 508

 Score = 31.5 bits (71), Expect = 1.4
 Identities = 11/43 (25%), Positives = 20/43 (46%)

Query: 301 NEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKEN 343
           NEE    L+ ++    D +D +     +EDE E E  +   ++
Sbjct: 441 NEEVQERLQTDLGSISDSEDINMGSAGEEDESEDEDFQMVSDS 483


>gnl|CDD|177755 PLN00152, PLN00152, DNA-directed RNA polymerase; Provisional.
          Length = 130

 Score = 29.8 bits (67), Expect = 1.4
 Identities = 14/51 (27%), Positives = 20/51 (39%)

Query: 313 LGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETNLVALRRTIY 363
           +G EDE  E   +E  E++ ES   +  K +     D  E   V   R   
Sbjct: 7   MGYEDEPSEPEIEEGAEEDSESNNNDEVKGSIGENEDKEEQEPVQRPRKTT 57


>gnl|CDD|227880 COG5593, COG5593, Nucleic-acid-binding protein possibly involved in
           ribosomal biogenesis [Translation, ribosomal structure
           and biogenesis].
          Length = 821

 Score = 31.6 bits (71), Expect = 1.5
 Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 13/100 (13%)

Query: 252 DNFKDFPDVIEDLDLVPEEDKFTHLMTLDGVKDTQDILNVFQYDPEYLMNEEKYTTLRRE 311
            +  D  D  +D +L   ED F+     D   D +  L+    D E   +E    + + E
Sbjct: 696 KSRPDVEDDSDDSELDFAEDDFS-----DSTSDDEPKLD--AIDDEDAKSEGSQESDQEE 748

Query: 312 ILG------DEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
            L       D ++D+ D   E  E++E SE E+ E+ENKE
Sbjct: 749 GLDEIFYSFDGEQDNSDSFAESSEEDESSEEEKEEEENKE 788


>gnl|CDD|218747 pfam05786, Cnd2, Condensin complex subunit 2.  This family consists
           of several Barren protein homologues from several
           eukaryotic organisms. In Drosophila Barren (barr) is
           required for sister-chromatid segregation in mitosis.
           barr encodes a novel protein that is present in
           proliferating cells and has homologues in yeast and
           human. Mitotic defects in barr embryos become apparent
           during cycle 16, resulting in a loss of PNS and CNS
           neurons. Centromeres move apart at the
           metaphase-anaphase transition and Cyclin B is degraded,
           but sister chromatids remain connected, resulting in
           chromatin bridging. Barren protein localises to
           chromatin throughout mitosis. Colocalisation and
           biochemical experiments indicate that Barren associates
           with Topoisomerase II throughout mitosis and alters the
           activity of Topoisomerase II. It has been suggested that
           this association is required for proper chromosomal
           segregation by facilitating the decatenation of
           chromatids at anaphase. This family forms one of the
           three non-structural maintenance of chromosomes (SMC)
           subunits of the mitotic condensation complex along with
           Cnd1 and Cnd3.
          Length = 719

 Score = 31.3 bits (71), Expect = 1.5
 Identities = 8/31 (25%), Positives = 15/31 (48%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
             +   E  D E+D++E     E+G K+  +
Sbjct: 157 APETGREGDDGEDDDEEGSDGEEDGAKKKAK 187



 Score = 30.1 bits (68), Expect = 3.3
 Identities = 13/30 (43%), Positives = 17/30 (56%)

Query: 313 LGDEDEDDEDGDDEEDEDEEESEAEEGEKE 342
           +  E EDD D D E+  D  ++E  E EKE
Sbjct: 344 INAEVEDDPDEDAEDGGDGGDAEEFEAEKE 373



 Score = 30.1 bits (68), Expect = 4.1
 Identities = 14/47 (29%), Positives = 23/47 (48%)

Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETNLVALRR 360
           G E +D ED D+E  + EE+   ++ +K+ +       E N  AL  
Sbjct: 161 GREGDDGEDDDEEGSDGEEDGAKKKAKKKRQRKPESTLEKNFEALNV 207



 Score = 29.7 bits (67), Expect = 4.8
 Identities = 10/36 (27%), Positives = 17/36 (47%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDN 350
           D + + ED  DE+ ED  +    E  +  KE ++  
Sbjct: 343 DINAEVEDDPDEDAEDGGDGGDAEEFEAEKEPMLKV 378



 Score = 29.7 bits (67), Expect = 4.9
 Identities = 13/31 (41%), Positives = 18/31 (58%)

Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENK 344
            +  E   +GDD ED+DEE S+ EE   + K
Sbjct: 155 KEAPETGREGDDGEDDDEEGSDGEEDGAKKK 185


>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 156 to 174 amino acids in length. This domain is
           found associated with pfam07780, pfam01728.
          Length = 154

 Score = 30.3 bits (69), Expect = 1.5
 Identities = 14/41 (34%), Positives = 25/41 (60%), Gaps = 4/41 (9%)

Query: 310 REILG----DEDEDDEDGDDEEDEDEEESEAEEGEKENKET 346
           R++LG    +++E++E+  + E+ DEEE   E  EKE  + 
Sbjct: 91  RKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKL 131



 Score = 28.0 bits (63), Expect = 7.3
 Identities = 8/32 (25%), Positives = 19/32 (59%)

Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
            +E+E + +  DEE++ +E  E E  + + ++
Sbjct: 104 EEEEEVEVEELDEEEQIDELLEKELAKLKREK 135


>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
           subunit G; Reviewed.
          Length = 197

 Score = 30.4 bits (68), Expect = 1.6
 Identities = 18/53 (33%), Positives = 28/53 (52%)

Query: 302 EEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETN 354
           EE+    RRE   DE ED E+ + E +E   E ++++ E E  E     TE++
Sbjct: 139 EERMEWERREEKIDEREDQEEQEREREEQTIEEQSDDSEHEIIEQDESETESD 191


>gnl|CDD|225618 COG3076, COG3076, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 135

 Score = 29.8 bits (67), Expect = 1.7
 Identities = 10/20 (50%), Positives = 14/20 (70%)

Query: 316 EDEDDEDGDDEEDEDEEESE 335
           ED + ED DD+ED DE++  
Sbjct: 113 EDPNAEDDDDDEDVDEDDDG 132


>gnl|CDD|239240 cd02930, DCR_FMN, 2,4-dienoyl-CoA reductase (DCR) FMN-binding
           domain.  DCR in E. coli  is an iron-sulfur flavoenzyme
           which contains FMN, FAD, and a 4Fe-4S cluster. It is
           also a monomer, unlike that of its eukaryotic
           counterparts which form homotetramers and lack the
           flavin and iron-sulfur cofactors. Metabolism of
           unsaturated fatty acids requires auxiliary enzymes in
           addition to those used in b-oxidation. After a given
           number of cycles through the b-oxidation pathway, those
           unsaturated fatty acyl-CoAs with double bonds at
           even-numbered carbon positions contain 2-trans, 4-cis
           double bonds that can not be modified by enoyl-CoA
           hydratase. DCR utilizes NADPH to remove the C4-C5 double
           bond. DCR can catalyze the reduction of both natural
           fatty acids with cis double bonds, as well as substrates
           containing trans double bonds. The reaction is initiated
           by hybrid transfer from NADPH to FAD, which in turn
           transfers electrons, one at a time, to FMN via the
           4Fe-4S cluster. The fully reduced FMN provides a hydrid
           ion to the C5 atom of substrate, and Tyr and His are
           proposed to form a catalytic dyad that protonates the C4
           atom of the substrate and completes the reaction.
          Length = 353

 Score = 30.7 bits (70), Expect = 1.8
 Identities = 17/59 (28%), Positives = 28/59 (47%), Gaps = 5/59 (8%)

Query: 213 IQSFQALESFFSTDVYNSSKGKLEKRVQYMIEVLAQVRKDNFKDFPDV--IEDLDLVPE 269
           I  F A  +   TD +    G  E R+++ +E++  VR    +DF  +  +  LDLV  
Sbjct: 164 INQFLAPRTNKRTDEWG---GSFENRMRFPVEIVRAVRAAVGEDFIIIYRLSMLDLVEG 219


>gnl|CDD|219572 pfam07780, Spb1_C, Spb1 C-terminal domain.  This presumed domain is
           found at the C-terminus of a family of FtsJ-like
           methyltransferases. Members of this family are involved
           in 60S ribosomal biogenesis.
          Length = 212

 Score = 30.3 bits (69), Expect = 1.8
 Identities = 13/41 (31%), Positives = 23/41 (56%), Gaps = 1/41 (2%)

Query: 316 EDEDDEDGDDE-EDEDEEESEAEEGEKENKETIIDNTETNL 355
           E + DED + E    +E+E + ++ E ++K+ I D T   L
Sbjct: 1   ESKSDEDDEFEVVPAEEDEKKVDKDEDDDKDDIDDLTAEAL 41


>gnl|CDD|204975 pfam12598, TBX, T-box transcription factor.  This domain family is
           found in eukaryotes, and is typically between 77 and 89
           amino acids in length. The family is found in
           association with pfam00907. There are two completely
           conserved residues (S and P) that may be functionally
           important. T-box genes encode transcription factors
           involved in morphogenesis and organogenesis of
           vertebrates and invertebrates.
          Length = 91

 Score = 28.6 bits (64), Expect = 2.1
 Identities = 11/32 (34%), Positives = 17/32 (53%)

Query: 307 TLRREILGDEDEDDEDGDDEEDEDEEESEAEE 338
            L+R    +ED D E  DD+  E + +SE  +
Sbjct: 44  RLKRLCPSEEDSDVESRDDKIPEADSDSEISD 75


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 30.9 bits (70), Expect = 2.1
 Identities = 24/132 (18%), Positives = 51/132 (38%), Gaps = 21/132 (15%)

Query: 225 TDVYNSSKGKLEKRVQYM-------IEVLAQVRKDNFKDFPDVIEDLDLVPEEDKFTHLM 277
           T + N    KL K VQY+       +E L   +K+N  +        ++  + D    L 
Sbjct: 269 TTLENYDPEKLPKSVQYILAVSYVNLEDLTTTKKENILN--------NISLKSDDNYLLY 320

Query: 278 TLD-GVKDTQDILNVFQY--DPEYLMNEEKYTTLRREILGDEDEDDEDGDDEEDEDEEE- 333
            +  G  + ++ +N+ +   D +  +     T L  E+  + D   +   +   E  ++ 
Sbjct: 321 WIYSGRGEFKEAINIARNLDDNDLTL--LALTKLYEEVKSNTDLSGDKRQELLKEYNKKL 378

Query: 334 SEAEEGEKENKE 345
            +  +   E K+
Sbjct: 379 QDYTKKLGEVKD 390


>gnl|CDD|237063 PRK12329, nusA, transcription elongation factor NusA; Provisional.
          Length = 449

 Score = 30.9 bits (70), Expect = 2.1
 Identities = 16/63 (25%), Positives = 23/63 (36%), Gaps = 1/63 (1%)

Query: 284 DTQDILNVFQYDPEYLMNEEKYTTLRREILGDEDEDDED-GDDEEDEDEEESEAEEGEKE 342
           D +    + Q + E  +  E    L  E     +ED          EDE E E E  E +
Sbjct: 382 DAKVAELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDELEEAQ 441

Query: 343 NKE 345
            +E
Sbjct: 442 PEE 444


>gnl|CDD|217509 pfam03353, Lin-8, Ras-mediated vulval-induction antagonist.  LIN-8
           is a nuclear protein, present at the sites of
           transcriptional repressor complexes, which interacts
           with LIN-35 Rb.Lin35 Rb is a product of the class B
           synMuv gene lin-35 which silences genes required for
           vulval specification through chromatin modification and
           remodelling. The biological role of the interaction has
           not yet been determined however predictions have been
           made. The interaction shows that class A synMuv genes
           control vulval induction through the transcriptional
           regulation of gene expression. LIN-8 normally functions
           as part of a protein complex however when the complex is
           absent, other family members can partially replace LIN-8
           activity.
          Length = 316

 Score = 30.4 bits (69), Expect = 2.1
 Identities = 15/60 (25%), Positives = 24/60 (40%), Gaps = 13/60 (21%)

Query: 307 TLRREILGDEDEDDEDGDD-----------EEDEDEEESEA--EEGEKENKETIIDNTET 353
            LR+E    ED++ +D DD            E +D EE++   E   +E  +    N   
Sbjct: 125 ELRKEPWTGEDQEADDEDDIIYDGIFEDRTRESQDMEENDEVDEVEVEEVPDDYGANLAV 184


>gnl|CDD|221586 pfam12457, TIP_N, Tuftelin interacting protein N terminal.  This
           domain family is found in eukaryotes, and is typically
           between 99 and 114 amino acids in length. The family is
           found in association with pfam08697, pfam01585. There
           are two completely conserved residues (G and F) that may
           be functionally important. TIP is involved in enamel
           assembly by interacting with one of the major proteins
           responsible for biomineralisation of enamel - tuftelin.
          Length = 106

 Score = 28.9 bits (65), Expect = 2.2
 Identities = 7/26 (26%), Positives = 18/26 (69%)

Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEG 339
            + +EDD++ + E+D++ +E + E+ 
Sbjct: 81  KEPEEDDKEDESEDDDESDEDDDEDD 106



 Score = 28.1 bits (63), Expect = 4.1
 Identities = 11/25 (44%), Positives = 17/25 (68%)

Query: 311 EILGDEDEDDEDGDDEEDEDEEESE 335
           E   D+ ED+ + DDE DED++E +
Sbjct: 82  EPEEDDKEDESEDDDESDEDDDEDD 106


>gnl|CDD|222648 pfam14283, DUF4366, Domain of unknown function (DUF4366).  This
           family of proteins is found in bacteria and eukaryotes.
           Proteins in this family are typically between 227 and
           387 amino acids in length.
          Length = 213

 Score = 30.0 bits (68), Expect = 2.2
 Identities = 8/24 (33%), Positives = 13/24 (54%)

Query: 317 DEDDEDGDDEEDEDEEESEAEEGE 340
              D+D D+ +  DE+E E +E  
Sbjct: 189 GAPDDDLDEYDYGDEDEEEDDEPP 212



 Score = 29.6 bits (67), Expect = 3.6
 Identities = 8/22 (36%), Positives = 15/22 (68%)

Query: 315 DEDEDDEDGDDEEDEDEEESEA 336
           D+D D+ D  DE++E+++E   
Sbjct: 192 DDDLDEYDYGDEDEEEDDEPPW 213



 Score = 28.5 bits (64), Expect = 7.3
 Identities = 7/25 (28%), Positives = 15/25 (60%)

Query: 314 GDEDEDDEDGDDEEDEDEEESEAEE 338
           G  D+D ++ D  ++++EE+ E   
Sbjct: 189 GAPDDDLDEYDYGDEDEEEDDEPPW 213


>gnl|CDD|130366 TIGR01299, synapt_SV2, synaptic vesicle protein SV2.  This model
           describes a tightly conserved subfamily of the larger
           family of sugar (and other) transporters described by
           PFAM model pfam00083. Members of this subfamily include
           closely related forms SV2A and SV2B of synaptic vesicle
           protein from vertebrates and a more distantly related
           homolog (below trusted cutoff) from Drosophila
           melanogaster. Members are predicted to have two sets of
           six transmembrane helices.
          Length = 742

 Score = 30.7 bits (69), Expect = 2.4
 Identities = 16/55 (29%), Positives = 27/55 (49%), Gaps = 11/55 (20%)

Query: 302 EEKYTTLRREILGDEDEDDE-----------DGDDEEDEDEEESEAEEGEKENKE 345
           +++Y+        DED+DD+           +  D+E+E E  S+A EG  E+ E
Sbjct: 38  QDEYSQRSYSRFEDEDDDDDFPAPADGYSRGEAADDEEEGEASSDATEGHDEDDE 92


>gnl|CDD|214923 smart00937, PCRF, This domain is found in peptide chain release
           factors. 
          Length = 116

 Score = 29.0 bits (66), Expect = 2.4
 Identities = 21/78 (26%), Positives = 33/78 (42%), Gaps = 15/78 (19%)

Query: 313 LGDEDEDDEDGDDEEDE--DEEESEAEEGEKENKETIIDNTETNLVALRRTIYLTIHS-- 368
           L +E+EDDE  +  E+E  + E    EE E+  K  ++     +     +   L I +  
Sbjct: 2   LLEEEEDDEMRELAEEELEELEAELEEELEELLKLLLLPKDPND----DKNAILEIRAGA 57

Query: 369 ----SLDFEECAHKLLRM 382
               +  F   A  LLRM
Sbjct: 58  GGDEAALF---AGDLLRM 72


>gnl|CDD|222117 pfam13422, DUF4110, Domain of unknown function (DUF4110).  This is
           a family that is found predominantly at the C-terminus
           of Kelch-containing proteins. However, the exact
           function of this region is not known.
          Length = 99

 Score = 28.9 bits (65), Expect = 2.5
 Identities = 16/46 (34%), Positives = 23/46 (50%), Gaps = 4/46 (8%)

Query: 300 MNEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
           + EE+Y  LR E+   EDE++E    E    E  S A+E +    E
Sbjct: 54  LAEERYWELREELRALEDEEEEQAGIE----EVVSLAKEKDWGGGE 95


>gnl|CDD|220102 pfam09073, BUD22, BUD22.  BUD22 has been shown in yeast to be a
           nuclear protein involved in bud-site selection. It plays
           a role in positioning the proximal bud pole signal. More
           recently it has been shown to be involved in ribosome
           biogenesis.
          Length = 424

 Score = 30.6 bits (69), Expect = 2.5
 Identities = 12/31 (38%), Positives = 18/31 (58%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
           DE+E +E      +ED  ESE++E + E  E
Sbjct: 216 DEEEGEEAPSINYNEDTSESESDESDSEISE 246



 Score = 29.4 bits (66), Expect = 5.0
 Identities = 12/40 (30%), Positives = 17/40 (42%)

Query: 302 EEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEK 341
           +E       E    EDE   +   E+D D+EE E  + E 
Sbjct: 164 KESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSED 203



 Score = 28.6 bits (64), Expect = 8.6
 Identities = 6/38 (15%), Positives = 15/38 (39%)

Query: 313 LGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDN 350
           L D  +++E  +       E++   E ++ + E     
Sbjct: 211 LVDSSDEEEGEEAPSINYNEDTSESESDESDSEISESR 248



 Score = 28.6 bits (64), Expect = 9.3
 Identities = 8/27 (29%), Positives = 13/27 (48%)

Query: 316 EDEDDEDGDDEEDEDEEESEAEEGEKE 342
            + +DE   +E  ED+ + E EE    
Sbjct: 175 SESEDESKSEESAEDDSDDEEEEDSDS 201


>gnl|CDD|131353 TIGR02300, FYDLN_acid, TIGR02300 family protein.  Members of this
           family are bacterial proteins with a conserved motif
           [KR]FYDLN, sometimes flanked by a pair of CXXC motifs,
           followed by a long region of low complexity sequence in
           which roughly half the residues are Asp and Glu,
           including multiple runs of five or more acidic residues.
           The function of members of this family is unknown.
          Length = 129

 Score = 29.2 bits (65), Expect = 2.5
 Identities = 11/32 (34%), Positives = 18/32 (56%), Gaps = 5/32 (15%)

Query: 311 EILGDEDEDDEDGD-----DEEDEDEEESEAE 337
           ++  DED  D+D D     DEED+D ++ +  
Sbjct: 93  DLAEDEDIADDDDDVTFLEDEEDDDIDDEDII 124


>gnl|CDD|235636 PRK05886, yajC, preprotein translocase subunit YajC; Validated.
          Length = 109

 Score = 29.0 bits (65), Expect = 2.6
 Identities = 9/28 (32%), Positives = 18/28 (64%)

Query: 310 REILGDEDEDDEDGDDEEDEDEEESEAE 337
           R+ +  +DEDDE+ + + D+D ++   E
Sbjct: 81  RDRILPDDEDDEELNADLDKDVDDVAGE 108


>gnl|CDD|113290 pfam04514, BTV_NS2, Bluetongue virus non-structural protein NS2.
           This family includes NS2 proteins from other members of
           the Orbivirus genus. NS2 is a non-specific
           single-stranded RNA-binding protein that forms large
           homomultimers and accumulates in viral inclusion bodies
           of infected cells. Three RNA binding regions have been
           identified in Bluetongue virus serotype 17 at residues
           2-11, 153-166 and 274-286. NS2 multimers also possess
           nucleotidyl phosphatase activity. The precise function
           of NS2 is not known, but it may be involved in the
           transport and condensation of viral mRNAs.
          Length = 363

 Score = 30.2 bits (68), Expect = 2.6
 Identities = 16/62 (25%), Positives = 29/62 (46%)

Query: 309 RREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETNLVALRRTIYLTIHS 368
           RR  L  E E +E+ + + D+++E  E  E  +   E+ I +     ++  R +     S
Sbjct: 238 RRRDLRKEQELEENVERDSDDEDEHGEDSEDGETKPESYITSEYIERISEIRKMKDERLS 297

Query: 369 SL 370
           SL
Sbjct: 298 SL 299


>gnl|CDD|219285 pfam07065, D123, D123.  This family contains a number of eukaryotic
           D123 proteins approximately 330 residues long. It has
           been shown that mutated variants of D123 exhibit
           temperature-dependent differences in their degradation
           rate. D123 proteins are regulators of eIF2, the central
           regulator of translational initiation.
          Length = 295

 Score = 30.4 bits (69), Expect = 2.7
 Identities = 17/62 (27%), Positives = 28/62 (45%)

Query: 286 QDILNVFQYDPEYLMNEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
           ++ L     D   L +EE    + +E   +E  D  + DD+ED D    E  E  ++ +E
Sbjct: 19  EEFLEYLLQDGILLPSEESSLPIYQESSDNEYSDWFEDDDDEDTDVTRPEFPELHQKIRE 78

Query: 346 TI 347
            I
Sbjct: 79  AI 80


>gnl|CDD|217573 pfam03462, PCRF, PCRF domain.  This domain is found in peptide
           chain release factors.
          Length = 115

 Score = 28.7 bits (65), Expect = 3.0
 Identities = 24/75 (32%), Positives = 31/75 (41%), Gaps = 14/75 (18%)

Query: 318 EDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETNLVAL----RRTIYLTIHS----- 368
           E  E+ DD E  +E E E EE EKE  E +       L+ L     +   L I +     
Sbjct: 1   ELLEEEDDPELREEAEEELEELEKE-LEELEKELLNLLLPLDPYDDKNAILEIRAGAGGT 59

Query: 369 -SLDFEECAHKLLRM 382
            +  F   A  LLRM
Sbjct: 60  EAQLF---AEDLLRM 71


>gnl|CDD|227447 COG5117, NOC3, Protein involved in the nuclear export of
           pre-ribosomes [Translation, ribosomal structure and
           biogenesis / Intracellular trafficking and secretion].
          Length = 657

 Score = 30.4 bits (68), Expect = 3.1
 Identities = 14/69 (20%), Positives = 24/69 (34%), Gaps = 18/69 (26%)

Query: 310 REILGDEDEDDEDGDDEEDEDEEESEAEEGE-----------KENKETI-------IDNT 351
            E L + + +     D + +DE++   EE +              KE I       I+  
Sbjct: 92  EEFLSESESEASLEIDSDIKDEKQKSLEEQKIAPEIPVKQQIDSEKERIASICTKIIEEP 151

Query: 352 ETNLVALRR 360
           E NL  +  
Sbjct: 152 EENLGMMEE 160


>gnl|CDD|220112 pfam09110, HAND, HAND.  The HAND domain adopts a secondary
           structure consisting of four alpha helices, three of
           which (H2, H3, H4) form an L-like configuration. Helix
           H2 runs antiparallel to helices H3 and H4, packing
           closely against helix H4, whilst helix H1 reposes in the
           concave surface formed by these three helices and runs
           perpendicular to them. The domain confers DNA and
           nucleosome binding properties to the protein.
          Length = 109

 Score = 28.4 bits (64), Expect = 3.3
 Identities = 14/32 (43%), Positives = 21/32 (65%), Gaps = 3/32 (9%)

Query: 321 EDGDDEEDEDEEESEAEEGEKENKETIIDNTE 352
           +D  D ++EDEEE EA+  + E ++  IDN E
Sbjct: 64  DDVGDGDEEDEEEREAKR-KLEQEK--IDNAE 92


>gnl|CDD|236765 PRK10810, PRK10810, anti-sigma28 factor FlgM; Provisional.
          Length = 98

 Score = 28.1 bits (63), Expect = 3.4
 Identities = 18/52 (34%), Positives = 27/52 (51%), Gaps = 8/52 (15%)

Query: 32 AKLKLMQQSISDKSSVEYQRISWETLKKSIHGSVNKVNTGNIGIIARKLFQE 83
          A+ KLMQ    D   +  +R+  E LK++I     K++TG    IA  L +E
Sbjct: 47 AQAKLMQPGSQD---INVERV--EALKQAIRNGELKMDTGK---IADALIKE 90


>gnl|CDD|222918 PHA02687, PHA02687, ORF061 late transcription factor VLTF-4;
           Provisional.
          Length = 231

 Score = 29.6 bits (66), Expect = 3.7
 Identities = 14/52 (26%), Positives = 23/52 (44%), Gaps = 4/52 (7%)

Query: 303 EKYTTLRREILGDEDEDDEDGDDEEDED----EEESEAEEGEKENKETIIDN 350
           EK      ++   +D DD++ + E  E+    EE  +AE G      +  DN
Sbjct: 100 EKSPKAVEKLCPPDDRDDKNEEKEPTEEAQRNEESGDAEGGASGRSPSDDDN 151


>gnl|CDD|216194 pfam00922, Phosphoprotein, Vesiculovirus phosphoprotein. 
          Length = 283

 Score = 29.5 bits (66), Expect = 3.8
 Identities = 14/103 (13%), Positives = 31/103 (30%), Gaps = 18/103 (17%)

Query: 253 NFKDFPDVIEDLDLVPE--EDKFTHLMTLDGVKDTQDILNVFQYDPEYLMNEEKYTTLRR 310
           N+      + +++ + E   DK +        + T          P Y + EE   +   
Sbjct: 9   NYPRLDQTLSEIEEMEEQRADKSSTFQEDSVEEHTS---------PSYYLAEELSDS--- 56

Query: 311 EILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTET 353
               + +   ED      +     + E   +   + I D+   
Sbjct: 57  ----ETEPSIEDDQGLYTQLPPAEQVEGFIQGPLDDIADDDID 95


>gnl|CDD|218328 pfam04921, XAP5, XAP5, circadian clock regulator.  This protein is
           found in a wide range of eukaryotes. It is a nuclear
           protein and is suggested to be DNA binding. In plants,
           this family is essential for correct circadian clock
           functioning by acting as a light-quality regulator
           coordinating the activities of blue and red light
           signalling pathways during plant growth - inhibiting
           growth in red light but promoting growth in blue light.
          Length = 233

 Score = 29.6 bits (67), Expect = 3.9
 Identities = 20/80 (25%), Positives = 37/80 (46%), Gaps = 10/80 (12%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKEN----KETIIDNTETNLVALRRTIYLTIHSSL 370
            +D+++ED D+ EDE +   E+ E ++ N    K+ I  N   +      T +L   +  
Sbjct: 14  GDDDEEEDEDEGEDEKKVPKESSEPDEANVNPNKKKIGKNPSVD------TSFLPDKARE 67

Query: 371 DFEECAHKLLRMQLKPGQES 390
           + E    + LR +    QE+
Sbjct: 68  EKEAELREELREEFLKKQEA 87


>gnl|CDD|165509 PHA03250, PHA03250, UL35; Provisional.
          Length = 564

 Score = 30.1 bits (68), Expect = 3.9
 Identities = 12/43 (27%), Positives = 18/43 (41%), Gaps = 2/43 (4%)

Query: 296 PEYLMNEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEE 338
           P YL     +  L R +    D     G D++ E +E +E E 
Sbjct: 332 PGYLHFA-IFALLLR-LYNLRDATHCAGLDDDLEGQESAELEV 372


>gnl|CDD|220605 pfam10156, Med17, Subunit 17 of Mediator complex.  This Mediator
           complex subunit was formerly known as Srb4 in yeasts or
           Trap80 in Drosophila and human. The Med17 subunit is
           located within the head domain and is essential for cell
           viability to the extent that a mutant strain of
           cerevisiae lacking it shows all RNA polymerase
           II-dependent transcription ceasing at non-permissive
           temperatures.
          Length = 454

 Score = 30.0 bits (68), Expect = 4.0
 Identities = 14/64 (21%), Positives = 30/64 (46%), Gaps = 6/64 (9%)

Query: 307 TLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETNLVALRRTIYLTI 366
           +LR EI  +  + D   + +E+ED+EE + ++ E+        +T       R  +   +
Sbjct: 53  SLREEIAKEAAKIDFSEESDEEEDDEEDDNDDSEEN------KDTVEEFPKARDEVLEQL 106

Query: 367 HSSL 370
           + +L
Sbjct: 107 NQAL 110


>gnl|CDD|215079 PLN00151, PLN00151, potassium transporter; Provisional.
          Length = 852

 Score = 30.1 bits (68), Expect = 4.2
 Identities = 9/39 (23%), Positives = 20/39 (51%)

Query: 309 RREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETI 347
            R + G ED+ ++    ++D+D+ +S     E+E    +
Sbjct: 27  SRWVDGSEDDSEDPASLDDDDDDRDSGGGMLEEEEDGNM 65


>gnl|CDD|218303 pfam04874, Mak16, Mak16 protein C-terminal region.  The precise
           function of this eukaryotic protein family is unknown.
           The yeast orthologues have been implicated in cell cycle
           progression and biogenesis of 60S ribosomal subunits.
           The Schistosoma mansoni Mak16 has been shown to target
           protein transport to the nucleolus.
          Length = 97

 Score = 27.9 bits (62), Expect = 4.3
 Identities = 10/35 (28%), Positives = 21/35 (60%)

Query: 312 ILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKET 346
            L  E+ ++ D ++EE+E+EE+    E   +++E 
Sbjct: 54  ALEAEESEENDEEEEEEEEEEDEGEIEYVSDDEEL 88



 Score = 27.9 bits (62), Expect = 4.7
 Identities = 11/38 (28%), Positives = 23/38 (60%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTE 352
           + +E+DE+ ++EE+E++E       + E  E  I++ E
Sbjct: 59  ESEENDEEEEEEEEEEDEGEIEYVSDDEELEEEIEDLE 96



 Score = 27.5 bits (61), Expect = 7.1
 Identities = 13/38 (34%), Positives = 23/38 (60%)

Query: 308 LRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
           L  E   + DE++E+ ++EEDE E E  +++ E E + 
Sbjct: 55  LEAEESEENDEEEEEEEEEEDEGEIEYVSDDEELEEEI 92


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 29.9 bits (68), Expect = 4.5
 Identities = 30/119 (25%), Positives = 52/119 (43%), Gaps = 12/119 (10%)

Query: 231 SKGKLEKRVQYMIEVLAQVRKDNFKDFPDVIEDLDLVPEEDKFTHLMTLDGVKDTQDILN 290
           SK K+ + V   + ++    K N +    +   L+L  EE +F     L G K     + 
Sbjct: 362 SKRKVRREVLPFLSIIF---KHNPELAARLAAFLELTEEEIEF-----LTGSKKATKKIK 413

Query: 291 VFQYDPEYLMNEE----KYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
                 E    EE    K      +   +E+E++++  +EE E+EEE   EE E+E ++
Sbjct: 414 KIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEK 472


>gnl|CDD|147601 pfam05505, Ebola_NP, Ebola nucleoprotein.  This family consists of
           Ebola and Marburg virus nucleoproteins. These proteins
           are responsible for encapsidation of genomic RNA. It has
           been found that nucleoprotein DNA vaccines can offer
           protection from the virus.
          Length = 717

 Score = 29.6 bits (66), Expect = 5.4
 Identities = 20/71 (28%), Positives = 34/71 (47%), Gaps = 10/71 (14%)

Query: 280 DGVKDTQDILNVFQYDPEYLMNEEKYTTLRREILG---DEDEDDEDGDDE-------EDE 329
            G + +QD+ N  Q   +   N    T+LR + L    +EDE ++  DD+       E +
Sbjct: 508 PGNRASQDLNNNNQKQEDESTNPIGKTSLRYQELTPVQEEDEPEDQTDDDDSSLPPLESD 567

Query: 330 DEEESEAEEGE 340
           D+  S+ E+G 
Sbjct: 568 DDPGSDNEQGV 578


>gnl|CDD|219293 pfam07093, SGT1, SGT1 protein.  This family consists of several
           eukaryotic SGT1 proteins. Human SGT1 or hSGT1 is known
           to suppress GCR2 and is highly expressed in the muscle
           and heart. The function of this family is unknown
           although it has been speculated that SGT1 may be
           functionally analogous to the Gcr2p protein of
           Saccharomyces cerevisiae which is known to be a
           regulatory factor of glycolytic gene expression.
          Length = 557

 Score = 29.3 bits (66), Expect = 5.4
 Identities = 12/30 (40%), Positives = 18/30 (60%)

Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKEN 343
             +D DD D DD+ED+DE+E  + +   E 
Sbjct: 481 LPDDSDDADEDDDEDDDEDEDSSSDSTLEE 510


>gnl|CDD|219343 pfam07228, SpoIIE, Stage II sporulation protein E (SpoIIE).  This
           family contains a number of bacterial stage II
           sporulation E proteins (EC:3.1.3.16). These are required
           for formation of a normal polar septum during
           sporulation. The N-terminal region is hydrophobic and is
           expected to contain up to 12 membrane-spanning segments.
          Length = 192

 Score = 28.8 bits (65), Expect = 5.6
 Identities = 10/49 (20%), Positives = 22/49 (44%), Gaps = 3/49 (6%)

Query: 414 EKFFGLLAQRFCQ-INKMYVAPLEQIFRESYSTVHRLDINKLRDDVCVI 461
            + FGL  +R    + + +    E++       + RL   +L DD+ ++
Sbjct: 141 GELFGL--ERLLALLAERHGLSPEELLDALLEDLLRLGGGELEDDITLL 187


>gnl|CDD|240400 PTZ00396, PTZ00396, Casein kinase II subunit beta; Provisional.
          Length = 251

 Score = 29.2 bits (66), Expect = 5.6
 Identities = 11/27 (40%), Positives = 14/27 (51%), Gaps = 1/27 (3%)

Query: 312 ILGDEDEDDEDGDDEEDEDEEESEAEE 338
           IL  E  DDED  ++E   E   EA +
Sbjct: 67  ILDSEPPDDED-LEDEQFLEVYQEASD 92


>gnl|CDD|222800 PHA00458, PHA00458, single-stranded DNA-binding protein.
          Length = 233

 Score = 29.0 bits (65), Expect = 5.7
 Identities = 11/34 (32%), Positives = 19/34 (55%)

Query: 302 EEKYTTLRREILGDEDEDDEDGDDEEDEDEEESE 335
           E+ Y     +    +DE++ D DDEE+ +E+E  
Sbjct: 198 EDGYVASEAKARKPQDEEEWDEDDEEESEEDEDG 231



 Score = 28.6 bits (64), Expect = 7.4
 Identities = 9/29 (31%), Positives = 18/29 (62%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKEN 343
            E +  +  D+EE ++++E E+EE E  +
Sbjct: 204 SEAKARKPQDEEEWDEDDEEESEEDEDGD 232


>gnl|CDD|218556 pfam05327, RRN3, RNA polymerase I specific transcription initiation
           factor RRN3.  This family consists of several eukaryotic
           proteins which are homologous to the yeast RRN3 protein.
           RRN3 is one of the RRN genes specifically required for
           the transcription of rDNA by RNA polymerase I (Pol I) in
           Saccharomyces cerevisiae.
          Length = 554

 Score = 29.2 bits (66), Expect = 5.8
 Identities = 17/44 (38%), Positives = 25/44 (56%), Gaps = 2/44 (4%)

Query: 310 REILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTET 353
             +L DED+DDED   + D+D+EE    + E E   TI + +E 
Sbjct: 231 ERVLADEDDDDEDDMFDMDDDDEEE--SDPEVERTSTIKEVSEK 272


>gnl|CDD|220172 pfam09309, FCP1_C, FCP1, C-terminal.  The C-terminal domain of
           FCP-1 is required for interaction with the carboxy
           terminal domain of RAP74. Interaction relies extensively
           on van der Waals contacts between hydrophobic residues
           situated within alpha-helices in both domains.
          Length = 263

 Score = 29.2 bits (65), Expect = 6.0
 Identities = 13/50 (26%), Positives = 25/50 (50%), Gaps = 4/50 (8%)

Query: 296 PEYLMNEEKYTTLRRE---ILGDE-DEDDEDGDDEEDEDEEESEAEEGEK 341
           P Y + +E   ++ +E   ILG+  D+ D +       +EE+ +A +  K
Sbjct: 136 PLYTLCKEDLESMDKEVDDILGEGSDDSDSEKKKPPGNEEEQEQAPQPRK 185


>gnl|CDD|218421 pfam05086, Dicty_REP, Dictyostelium (Slime Mold) REP protein.  This
           family consists of REP proteins from Dictyostelium
           (Slime molds). REP protein is likely involved in
           transcription regulation and control of DNA replication,
           specifically amplification of plasmid at low copy
           numbers. The formation of homomultimers may be required
           for their regulatory activity.
          Length = 910

 Score = 29.4 bits (66), Expect = 6.0
 Identities = 13/22 (59%), Positives = 17/22 (77%)

Query: 316 EDEDDEDGDDEEDEDEEESEAE 337
           +D+ DED D +EDEDE+E E E
Sbjct: 889 DDDADEDEDQDEDEDEDEDEDE 910



 Score = 29.0 bits (65), Expect = 7.8
 Identities = 14/24 (58%), Positives = 18/24 (75%), Gaps = 1/24 (4%)

Query: 313 LGDEDED-DEDGDDEEDEDEEESE 335
             D+D D DED D++EDEDE+E E
Sbjct: 887 NEDDDADEDEDQDEDEDEDEDEDE 910


>gnl|CDD|185558 PTZ00329, PTZ00329, eukaryotic translation initiation factor 1A;
           Provisional.
          Length = 155

 Score = 28.2 bits (63), Expect = 6.4
 Identities = 10/37 (27%), Positives = 18/37 (48%)

Query: 311 EILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETI 347
           +I  DE +D  +  D+E ++E   E     K + + I
Sbjct: 119 DIFDDEGDDGIEFQDDESDEEATDEMLNLTKADIDDI 155


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
           the C subunit of DNA polymerase delta. It carries the
           essential residues for binding to the Pol1 subunit of
           polymerase alpha, from residues 293-332, which are
           characterized by the motif D--G--VT, referred to as the
           DPIM motif. The first 160 residues of the protein form
           the minimal domain for binding to the B subunit, Cdc1,
           of polymerase delta, the final 10 C-terminal residues,
           362-372, being the DNA sliding clamp, PCNA, binding
           motif.
          Length = 427

 Score = 29.0 bits (65), Expect = 6.8
 Identities = 16/53 (30%), Positives = 23/53 (43%), Gaps = 1/53 (1%)

Query: 310 REILGDEDEDDEDGDDEE-DEDEEESEAEEGEKENKETIIDNTETNLVALRRT 361
           ++++ DEDED+E     E   +EEESE  E     K+       T      R 
Sbjct: 299 KKMMEDEDEDEEMEIVPESPVEEEESEEPEPPPLPKKEEEKEEVTVSPDGGRR 351


>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740).  This
           family consists of several uncharacterized plant
           proteins of unknown function.
          Length = 565

 Score = 29.2 bits (65), Expect = 6.8
 Identities = 12/39 (30%), Positives = 21/39 (53%)

Query: 311 EILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIID 349
           E+  +EDE+  + +  +  DE+  E EE E +  +  ID
Sbjct: 145 EVEMEEDEEYYEKEPGKVVDEKSEEEEEEELKTMKDFID 183


>gnl|CDD|215541 PLN03020, PLN03020, low-temperature-induced protein; Provisional.
          Length = 556

 Score = 28.9 bits (64), Expect = 7.5
 Identities = 12/27 (44%), Positives = 17/27 (62%), Gaps = 1/27 (3%)

Query: 315 DEDEDDEDGDD-EEDEDEEESEAEEGE 340
           DE    +D D  EED+DE+E  AE+ +
Sbjct: 58  DEHRVPDDHDLYEEDDDEDEEMAEDPQ 84


>gnl|CDD|227492 COG5163, NOP7, Protein required for biogenesis of the 60S ribosomal
           subunit [Translation, ribosomal structure and
           biogenesis].
          Length = 591

 Score = 28.9 bits (64), Expect = 7.5
 Identities = 23/75 (30%), Positives = 32/75 (42%), Gaps = 10/75 (13%)

Query: 290 NVFQYDP-EYLMNEEKYTTLRREILGD--EDEDDE-------DGDDEEDEDEEESEAEEG 339
           +V  YDP   LM  E+      E L +  ED   E       D DDEE + ++E E E  
Sbjct: 452 SVDSYDPRASLMTMEETQRHSEEDLVNRFEDVRYEHVAGEEDDDDDEELQAQKELELEAQ 511

Query: 340 EKENKETIIDNTETN 354
             +  ET   + + N
Sbjct: 512 GIKYSETSEADKDVN 526


>gnl|CDD|222440 pfam13897, GOLD_2, Golgi-dynamics membrane-trafficking.  Sec14-like
           Golgi-trafficking domain The GOLD domain is always found
           combined with lipid- or membrane-association domains.
          Length = 136

 Score = 27.7 bits (62), Expect = 8.0
 Identities = 13/28 (46%), Positives = 17/28 (60%), Gaps = 2/28 (7%)

Query: 321 EDGDDEEDEDEEESEAEEG--EKENKET 346
           E  D+EE+E+ EE EAE G  E  +K  
Sbjct: 53  ESSDEEEEEEAEEEEAETGDVEAGSKSQ 80


>gnl|CDD|218970 pfam06278, DUF1032, Protein of unknown function (DUF1032).  This
           family consists of several conserved eukaryotic proteins
           of unknown function.
          Length = 565

 Score = 28.8 bits (64), Expect = 8.4
 Identities = 26/95 (27%), Positives = 39/95 (41%), Gaps = 16/95 (16%)

Query: 302 EEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETNLVALRRT 361
           E+K   L  + L D  EDD D DD  D ++     E  + E  E    + E         
Sbjct: 369 EQKLWPLEEDRLEDSVEDDGDADDFSDPEDYLEPPEGLDPE--EQAFQDLEAE------- 419

Query: 362 IYLTIHSSLDFEECAHKLLRMQL----KPGQESEL 392
               + SSL +EE   + + + +    K  QE+EL
Sbjct: 420 ---AMPSSLSYEELVRRNVDLFIANSQKYAQETEL 451


>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
           [Intracellular trafficking and secretion].
          Length = 776

 Score = 28.8 bits (64), Expect = 8.4
 Identities = 16/42 (38%), Positives = 24/42 (57%)

Query: 311 EILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTE 352
           +IL  E  + E+ + EE E EEE E  E E ++ + I D+ E
Sbjct: 366 DILYFEKMEIENRNPEESEHEEEVEDYEDENDHSKRICDDDE 407


>gnl|CDD|216546 pfam01516, Orbi_VP6, Orbivirus helicase VP6.  The VP6 protein a
           minor protein in the core of the virion is probably the
           viral helicase.
          Length = 322

 Score = 28.5 bits (63), Expect = 8.4
 Identities = 12/43 (27%), Positives = 23/43 (53%), Gaps = 2/43 (4%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKE--NKETIIDNTETNL 355
           +++  DEDG ++   D E  + ++G KE  +KET   + +   
Sbjct: 35  EKEPKDEDGQEQRISDGEGEQKQKGGKEESDKETEDASVDRRT 77


>gnl|CDD|220785 pfam10498, IFT57, Intra-flagellar transport protein 57.  Eukaryotic
           cilia and flagella are specialised organelles found at
           the periphery of cells of diverse organisms.
           Intra-flagellar transport (IFT) is required for the
           assembly and maintenance of eukaryotic cilia and
           flagella, and consists of the bidirectional movement of
           large protein particles between the base and the distal
           tip of the organelle. IFT particles contain multiple
           copies of two distinct protein complexes, A and B, which
           contain at least 6 and 11 protein subunits. IFT57 is
           part of complex B but is not, however, required for the
           core subunits to stay associated. This protein is known
           as Huntington-interacting protein-1 in humans.
          Length = 355

 Score = 28.5 bits (64), Expect = 8.8
 Identities = 16/77 (20%), Positives = 30/77 (38%), Gaps = 14/77 (18%)

Query: 283 KDTQDILNVFQYDPEYLMNEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKE 342
            +  +  NV + D E ++          E + +E E +E  DDE  ++ +    +     
Sbjct: 128 NEEDEEENVDEDDAEIIL----------EEVEEEVEIEEVDDDEGTQETKYKRGDTSLTP 177

Query: 343 NK----ETIIDNTETNL 355
                 E++ID  E  L
Sbjct: 178 QAKDVLESLIDAAEWKL 194


>gnl|CDD|236410 PRK09202, nusA, transcription elongation factor NusA; Validated.
          Length = 470

 Score = 28.7 bits (65), Expect = 9.1
 Identities = 18/83 (21%), Positives = 35/83 (42%), Gaps = 7/83 (8%)

Query: 283 KDTQDILNVFQYDPE------YLMNEEKYTTLRREILGDEDEDDE-DGDDEEDEDEEESE 335
            +   IL++F    +       L+ EE +++L        +E  E +G DEE  +E    
Sbjct: 349 AEFDAILDLFMEALDIDEEIAQLLVEEGFSSLEELAYVPVEELLEIEGFDEETVEELRER 408

Query: 336 AEEGEKENKETIIDNTETNLVAL 358
           A+E  +       +    +L++L
Sbjct: 409 AKEALETEALAQEEKLADDLLSL 431


>gnl|CDD|177868 PLN02222, PLN02222, phosphoinositide phospholipase C 2.
          Length = 581

 Score = 28.8 bits (64), Expect = 9.2
 Identities = 8/22 (36%), Positives = 18/22 (81%)

Query: 316 EDEDDEDGDDEEDEDEEESEAE 337
            D++D +GDD++D+D+ E +++
Sbjct: 288 VDKNDSNGDDDDDDDDGEDKSK 309


>gnl|CDD|218055 pfam04386, SspB, Stringent starvation protein B.  Escherichia coli
           stringent starvation protein B (SspB), is thought to
           enhance the specificity of degradation of tmRNA-tagged
           proteins by the ClpXP protease. The tmRNA tag, also
           known as ssrA, is an 11-aa peptide added to the C
           terminus of proteins stalled during translation, targets
           proteins for degradation by ClpXP and ClpAP. SspB a
           cytoplasmic protein that specifically binds to residues
           1-4 and 7 of the tag. Binding of SspB enhances
           degradation of tagged proteins by ClpX, and masks
           sequence elements important for ClpA interactions,
           inhibiting degradation by ClpA. However, more recent
           work has cast doubt on the importance of SspB in
           wild-type cells. SspB is encoded in an operon whose
           synthesis is stimulated by carbon, amino acid, and
           phosphate starvation. SspB may play a special role
           during nutrient stress, for example by ensuring rapid
           degradation of the products of stalled translation,
           without causing a global increase in degradation of all
           ClpXP substrates.
          Length = 153

 Score = 27.9 bits (63), Expect = 9.2
 Identities = 10/34 (29%), Positives = 18/34 (52%)

Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKETII 348
           +E+EDD+  D  EDED E  +  + +      ++
Sbjct: 119 EEEEDDDADDSPEDEDPEPKDPPKTKGRPSLKVV 152


>gnl|CDD|219912 pfam08574, DUF1762, Protein of unknown function (DUF1762).  This is
           a family of proteins of unknown function. Yeast IWR1 is
           known to interact with RNA polymerase II and deletion of
           this protein results in hypersensitivity to the K1
           killer toxin.
          Length = 77

 Score = 26.6 bits (59), Expect = 9.3
 Identities = 10/27 (37%), Positives = 15/27 (55%)

Query: 311 EILGDEDEDDEDGDDEEDEDEEESEAE 337
           E   D   D++D  D+   D+E+S AE
Sbjct: 40  EYHEDLANDEDDDADQVLSDDEDSNAE 66


>gnl|CDD|215562 PLN03078, PLN03078, Putative tRNA pseudouridine synthase;
           Provisional.
          Length = 513

 Score = 28.7 bits (64), Expect = 9.3
 Identities = 9/47 (19%), Positives = 21/47 (44%)

Query: 317 DEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETNLVALRRTIY 363
           +E+  +  +E++ED   S    G  +  E I+    + +    R ++
Sbjct: 278 EENHGEISEEDEEDLSFSSIPSGSSDENEDILKFQSSQVQIRARWLH 324


>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
           Genome duplication is precisely regulated by
           cyclin-dependent kinases CDKs, which bring about the
           onset of S phase by activating replication origins and
           then prevent relicensing of origins until mitosis is
           completed. The optimum sequence motif for CDK
           phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
           to have at least 11 potential phosphorylation sites.
           Drc1 is required for DNA synthesis and S-M replication
           checkpoint control. Drc1 associates with Cdc2 and is
           phosphorylated at the onset of S phase when Cdc2 is
           activated. Thus Cdc2 promotes DNA replication by
           phosphorylating Drc1 and regulating its association with
           Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
           substrates required for DNA replication.
          Length = 397

 Score = 28.6 bits (64), Expect = 9.3
 Identities = 15/38 (39%), Positives = 21/38 (55%), Gaps = 1/38 (2%)

Query: 309 RREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKET 346
             E LG     DED +DE+DE E + E E+ +K  K+ 
Sbjct: 328 LSEFLGYMGGIDEDDEDEDDE-ESKEEVEKKQKVKKKP 364


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 28.9 bits (65), Expect = 9.5
 Identities = 12/24 (50%), Positives = 18/24 (75%)

Query: 315  DEDEDDEDGDDEEDEDEEESEAEE 338
             ED+DD + DD EDED+E+ E ++
Sbjct: 1365 SEDDDDSEVDDSEDEDDEDDEDDD 1388


>gnl|CDD|219240 pfam06957, COPI_C, Coatomer (COPI) alpha subunit C-terminus.  This
           family represents the C-terminus (approximately 500
           residues) of the eukaryotic coatomer alpha subunit.
           Coatomer (COPI) is a large cytosolic protein complex
           which forms a coat around vesicles budding from the
           Golgi apparatus. Such coatomer-coated vesicles have been
           proposed to play a role in many distinct steps of
           intracellular transport. Note that many family members
           also contain the pfam04053 domain.
          Length = 421

 Score = 28.7 bits (64), Expect = 9.6
 Identities = 10/44 (22%), Positives = 13/44 (29%), Gaps = 11/44 (25%)

Query: 313 LGDEDEDDEDGDDEE-----DEDEE------ESEAEEGEKENKE 345
             D      D   E+     DED          + EEG +   E
Sbjct: 30  AADFAVMGGDDWGEDADLGLDEDGYLEGEDGLLDDEEGPEGGDE 73


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.318    0.135    0.384 

Gapped
Lambda     K      H
   0.267   0.0857    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 27,937,354
Number of extensions: 2795072
Number of successful extensions: 7731
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5817
Number of HSP's successfully gapped: 533
Length of query: 551
Length of database: 10,937,602
Length adjustment: 102
Effective length of query: 449
Effective length of database: 6,413,494
Effective search space: 2879658806
Effective search space used: 2879658806
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 61 (27.0 bits)