RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy8152
         (444 letters)



>gnl|CDD|238121 cd00200, WD40, WD40 domain, found in a number of eukaryotic
           proteins that cover a wide variety of functions
           including adaptor/regulatory modules in signal
           transduction, pre-mRNA processing and cytoskeleton
           assembly; typically contains a GH dipeptide 11-24
           residues from its N-terminus and the WD dipeptide at its
           C-terminus and is 40 residues long, hence the name WD40;
           between GH and WD lies a conserved core; serves as a
           stable propeller-like platform to which proteins can
           bind either stably or reversibly; forms a propeller-like
           structure with several blades where each blade is
           composed of a four-stranded anti-parallel b-sheet;
           instances with few detectable copies are hypothesized to
           form larger structures by dimerization; each WD40
           sequence repeat forms the first three strands of one
           blade and the last strand in the next blade; the last
           C-terminal WD40 repeat completes the blade structure of
           the first WD40 repeat to create the closed ring
           propeller-structure; residues on the top and bottom
           surface of the propeller are proposed to coordinate
           interactions with other proteins and/or small ligands; 7
           copies of the repeat are present in this alignment.
          Length = 289

 Score = 67.7 bits (166), Expect = 1e-12
 Identities = 49/277 (17%), Positives = 105/277 (37%), Gaps = 35/277 (12%)

Query: 42  KASVLALNWGNVEESELLIAYSGQK-IKIYDTDFKNFNTNTDVSFGEGDIVGV-----SR 95
              V  + +    + +LL   SG   IK++D +     T   +   +G    V     S 
Sbjct: 9   TGGVTCVAF--SPDGKLLATGSGDGTIKVWDLE-----TGELLRTLKGHTGPVRDVAASA 61

Query: 96  YDECILTAVKSGDVKLWRYKTEPDQWTNFNVGPQLECMKHNKHSNIIATGGKENDLKLWD 155
               + +      ++LW  +T     T       +  +  +    I+++  ++  +K+WD
Sbjct: 62  DGTYLASGSSDKTIRLWDLETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWD 121

Query: 156 LNTTKNIFSAKNVSHRVLVYSVKPDEFDLRRPVWVNGISFLPNSDKVAVCSKQGYVRVYD 215
           + T K + + +  +                   WVN ++F P+   VA  S+ G ++++D
Sbjct: 122 VETGKCLTTLRGHTD------------------WVNSVAFSPDGTFVASSSQDGTIKLWD 163

Query: 216 PKITTQRRPVLKVDIPDVALSCITTTNQSHYVIVGSTTGDMMAVDLRGKGKPCYKYKGFA 275
            +     + V  +      ++ +  +     ++  S+ G +   DL   GK     +G  
Sbjct: 164 LR---TGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLS-TGKCLGTLRGHE 219

Query: 276 GCVKDIACPLNENLILSVSLDRHFRVHDFHSKKLLRK 312
             V  +A   +  L+ S S D   RV D  + + ++ 
Sbjct: 220 NGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQT 256



 Score = 33.5 bits (77), Expect = 0.17
 Identities = 36/203 (17%), Positives = 84/203 (41%), Gaps = 36/203 (17%)

Query: 24  SKDKCI-IKNIQNVKYITN----KASVLALNWGNVEESELLIAYSGQ--KIKIYDTDFKN 76
           S+DK I + +++  K +T        V ++ +         +A S Q   IK++D   + 
Sbjct: 112 SRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFS---PDGTFVASSSQDGTIKLWD--LRT 166

Query: 77  FNTNTDVSFGEGDIVGVSRYD--ECILTAVKSGDVKLWRYKTEPDQWTNFNVG--PQLEC 132
                 ++   G++  V+     E +L++   G +KLW   T   +      G    +  
Sbjct: 167 GKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLST--GKCLGTLRGHENGVNS 224

Query: 133 MKHNKHSNIIATGGKENDLKLWDLNTTKNIFSAKNVSHRVLVYSVKPDEFDLRRPVWVNG 192
           +  +    ++A+G ++  +++WDL T + + +    ++                   V  
Sbjct: 225 VAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNS------------------VTS 266

Query: 193 ISFLPNSDKVAVCSKQGYVRVYD 215
           +++ P+  ++A  S  G +R++D
Sbjct: 267 LAWSPDGKRLASGSADGTIRIWD 289


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 51.1 bits (122), Expect = 1e-06
 Identities = 30/114 (26%), Positives = 51/114 (44%), Gaps = 7/114 (6%)

Query: 331 ENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDE 390
           E+E + +  E  E KGD   +   EH    +   ++  E ++     D + + +    + 
Sbjct: 778 EDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQEL 837

Query: 391 I--DNDEDKQDESEEDGEEEG-----EEEASENEYYTETDEDEKENEEDDGESD 437
              +  E KQDE   DG         EEE  E E   E +E+E+E EE++ E++
Sbjct: 838 NAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENE 891



 Score = 45.8 bits (108), Expect = 4e-05
 Identities = 31/101 (30%), Positives = 49/101 (48%), Gaps = 10/101 (9%)

Query: 331 ENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDE 390
           E+E + E  E  E +G S+ + D    +     +E  AE++         DE  ++    
Sbjct: 802 EHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAK----QDEKGVDGGGG 857

Query: 391 IDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEE 431
            D  + +++E EE+ EEE EEE  E E      E+E+ENEE
Sbjct: 858 SDGGDSEEEEEEEEEEEEEEEEEEEEE------EEEEENEE 892



 Score = 44.6 bits (105), Expect = 1e-04
 Identities = 26/118 (22%), Positives = 44/118 (37%), Gaps = 9/118 (7%)

Query: 331 ENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDI----- 385
           E E + E +   E +GD  E       E      E+  E +  ++     DE        
Sbjct: 744 EGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEH 803

Query: 386 -EIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENE---EDDGESDGG 439
               +  + DE +     +  + E ++E  E E   E   + K++E   +  G SDGG
Sbjct: 804 EGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGG 861



 Score = 44.6 bits (105), Expect = 1e-04
 Identities = 30/120 (25%), Positives = 55/120 (45%), Gaps = 7/120 (5%)

Query: 331 ENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDES-DIEIVD 389
           E+  + E++  +E KG+++ + +       +   E   E+K   +  + + E  + E   
Sbjct: 662 ESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGET 721

Query: 390 EIDNDEDKQD-ESEEDGEEEGEEEASE----NEYYTETDEDEKENE-EDDGESDGGRRRG 443
           E +  ED+ + E+ E+GEE  +E   E    +E  TE D  E E+E E + E       G
Sbjct: 722 EAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEG 781



 Score = 41.9 bits (98), Expect = 6e-04
 Identities = 30/111 (27%), Positives = 51/111 (45%), Gaps = 8/111 (7%)

Query: 331 ENETKKE-EQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVD 389
           E E   E E E+ EE G      + E     +   EN +E +        + E + EI  
Sbjct: 648 EGERPTEAEGENGEESGG-----EAEQEGETETKGENESEGEIPAERKG-EQEGEGEIEA 701

Query: 390 EIDNDEDKQDESEEDGEEEGEEEASENEYYTETDED-EKENEEDDGESDGG 439
           +  + + + +  E + E E E E +E+E   ET E+ E+  +E +GE++G 
Sbjct: 702 KEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGK 752



 Score = 41.1 bits (96), Expect = 0.001
 Identities = 29/114 (25%), Positives = 48/114 (42%), Gaps = 8/114 (7%)

Query: 331 ENETKKEEQEHSEEKGDSDEK--YDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIV 388
               + E +  +E +G  DE      E  E  +   E  AE K    T     E++ E  
Sbjct: 710 TEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGE 769

Query: 389 DEIDNDEDKQD---ESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGG 439
            E +  ED+ +   ++ EDGE +G+E A   E   E + + +  E+D+ E    
Sbjct: 770 TEAEGKEDEDEGEIQAGEDGEMKGDEGA---EGKVEHEGETEAGEKDEHEGQSE 820



 Score = 38.1 bits (88), Expect = 0.009
 Identities = 26/100 (26%), Positives = 44/100 (44%), Gaps = 2/100 (2%)

Query: 345 KGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEED 404
           KGD  E          +  R   AE + N   S  + E + E   + +N+ + +  +E  
Sbjct: 632 KGDVAEAEHTGERTGEEGERPTEAEGE-NGEESGGEAEQEGETETKGENESEGEIPAERK 690

Query: 405 GEEEGEEEASENEYYTETDEDEKENE-EDDGESDGGRRRG 443
           GE+EGE E    E   + + + +E E E + E++G    G
Sbjct: 691 GEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEG 730



 Score = 35.7 bits (82), Expect = 0.057
 Identities = 23/118 (19%), Positives = 46/118 (38%), Gaps = 4/118 (3%)

Query: 331 ENETKKEEQEHSEEKGDSDEKYDPEHFEMNQV---TRENYAESKTNKNTSDYDDESDIEI 387
           E E + E +    +     E  + EH    +      E   E+       + + E + E 
Sbjct: 692 EQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEG 751

Query: 388 VDEIDNDEDKQDESEEDGEE-EGEEEASENEYYTETDEDEKENEEDDGESDGGRRRGS 444
             E++ + D+++   E   E EG+E+  E E     D + K +E  +G+ +      +
Sbjct: 752 KHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGETEA 809


>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
           (nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
           phosphoribosyltransferase) [Coenzyme metabolism].
          Length = 620

 Score = 48.7 bits (116), Expect = 4e-06
 Identities = 30/104 (28%), Positives = 45/104 (43%), Gaps = 8/104 (7%)

Query: 337 EEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDED 396
           E++   +  G + E  D + F   +V R+        + T D   E D    DE D D+D
Sbjct: 185 EDKAGEDLDGLAAEIDDQQAF--ARVVRDMLGSMDMAEETGDDGIEED---ADEEDGDDD 239

Query: 397 --KQDESEEDGEEEGE-EEASENEYYTETDEDEKENEEDDGESD 437
               +E  E G EE E  + SE +    TD + +E E D  E+ 
Sbjct: 240 QPDNNEDSEAGREESEGSDESEEDEAEATDGEGEEGEMDAAEAS 283



 Score = 38.7 bits (90), Expect = 0.006
 Identities = 20/81 (24%), Positives = 42/81 (51%), Gaps = 7/81 (8%)

Query: 364 RENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESE--EDGEEEGEEEASENEYYTE 421
            E+  E   + +  D +++S+    +   +DE ++DE+E  +   EEGE +A+E      
Sbjct: 228 EEDADEEDGDDDQPDNNEDSEAGREESEGSDESEEDEAEATDGEGEEGEMDAAE-----A 282

Query: 422 TDEDEKENEEDDGESDGGRRR 442
           +++ E +  ++D E+ G   R
Sbjct: 283 SEDSESDESDEDTETPGEDAR 303



 Score = 33.3 bits (76), Expect = 0.32
 Identities = 26/140 (18%), Positives = 49/140 (35%), Gaps = 27/140 (19%)

Query: 324 VRSDFSLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRE------------------ 365
           V  + S   E K ++   S+     D    P    +  + RE                  
Sbjct: 124 VAKNLSAMLEEKYDKANFSKRTDKED---APLEEAVALLVREKLTGDAPPDSAGKVLDLW 180

Query: 366 -NYAESKTNKNTSDYDDESDIE-----IVDEIDNDEDKQDESEEDGEEEGEEEASENEYY 419
            +  E K  ++      E D +     +V ++    D  +E+ +DG EE  +E   ++  
Sbjct: 181 RDEIEDKAGEDLDGLAAEIDDQQAFARVVRDMLGSMDMAEETGDDGIEEDADEEDGDDDQ 240

Query: 420 TETDEDEKENEEDDGESDGG 439
            + +ED +   E+   SD  
Sbjct: 241 PDNNEDSEAGREESEGSDES 260



 Score = 31.0 bits (70), Expect = 1.5
 Identities = 22/115 (19%), Positives = 48/115 (41%), Gaps = 10/115 (8%)

Query: 315 MQSRLTSILVRSDFSLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTREN-----YAE 369
             +R+   ++ S   +  ET  +  E   ++ D D+   P++ E ++  RE       +E
Sbjct: 204 AFARVVRDMLGSM-DMAEETGDDGIEEDADEEDGDDD-QPDNNEDSEAGREESEGSDESE 261

Query: 370 SKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDE 424
               + T    +E +++  +     ED + +  ++  E   E+A     +TE  E
Sbjct: 262 EDEAEATDGEGEEGEMDAAEA---SEDSESDESDEDTETPGEDARPATPFTELME 313


>gnl|CDD|225201 COG2319, COG2319, FOG: WD40 repeat [General function prediction
           only].
          Length = 466

 Score = 47.0 bits (110), Expect = 1e-05
 Identities = 39/255 (15%), Positives = 82/255 (32%), Gaps = 30/255 (11%)

Query: 37  KYITNKASVLALNWGNVEESELLIAYSGQKIKIYDTDFKNFNTNTDVSFGEGDIVGVSRY 96
               +   V +L +       +    S   I+++D        +T     +  +   S  
Sbjct: 193 TLAGHTDPVSSLAFSPDGGLLIASGSSDGTIRLWDLSTGKLLRSTLSGHSDSVVSSFSPD 252

Query: 97  DECILTAVKSGDVKLWRYKTEPDQWTNFNVGPQ-LECMKHNKHSNIIATGGKENDLKLWD 155
              + +    G ++LW  ++        +     +  +  +    ++A+G  +  ++LWD
Sbjct: 253 GSLLASGSSDGTIRLWDLRSSSSLLRTLSGHSSSVLSVAFSPDGKLLASGSSDGTVRLWD 312

Query: 156 LNTTKNIFSAKNVSHRVLVYSVK--------------------------PDEFDLRRPVW 189
           L T K + S     H   V S+                                L     
Sbjct: 313 LETGKLLSSLTLKGHEGPVSSLSFSPDGSLLVSGGSDDGTIRLWDLRTGKPLKTLEGHSN 372

Query: 190 VNGISFLPNSDKVAVCSKQGYVRVYDPKITTQRRPVLKVDIPDVALSCITTTNQSHYVIV 249
           V  +SF P+   V+  S  G VR++D    +  R    +D     ++ +  +     +  
Sbjct: 373 VLSVSFSPDGRVVSSGSTDGTVRLWDLSTGSLLR---NLDGHTSRVTSLDFSPDGKSLAS 429

Query: 250 GSTTGDMMAVDLRGK 264
           GS+   +   DL+  
Sbjct: 430 GSSDNTIRLWDLKTS 444



 Score = 46.2 bits (108), Expect = 2e-05
 Identities = 32/184 (17%), Positives = 66/184 (35%), Gaps = 16/184 (8%)

Query: 132 CMKHNKHSNIIATGGKENDLKLWDLNTTKNIFSAKNVSHRVLVYSVKPDEFDLRRPVWVN 191
            +  +    ++ +G  +  +KLWDL+  + +  +    H   V  +              
Sbjct: 70  SIAFSPDGELLLSGSSDGTIKLWDLDNGEKLIKSLEGLHDSSVSKLA------------- 116

Query: 192 GISFLPNSDKVAVCSKQGYVRVYDPKITTQRRPVLKVDIPDVALSCITTTNQSHYVIVGS 251
                P+ + + + S      V    ++T  + +  ++    +++ +  +     +  GS
Sbjct: 117 --LSSPDGNSILLASSSLDGTVKLWDLSTPGKLIRTLEGHSESVTSLAFSPDGKLLASGS 174

Query: 252 TTGDMMAVDLRGKGKPCYKYKGFAGCVKDIA-CPLNENLILSVSLDRHFRVHDFHSKKLL 310
           +    + +     GKP     G    V  +A  P    LI S S D   R+ D  + KLL
Sbjct: 175 SLDGTIKLWDLRTGKPLSTLAGHTDPVSSLAFSPDGGLLIASGSSDGTIRLWDLSTGKLL 234

Query: 311 RKDY 314
           R   
Sbjct: 235 RSTL 238



 Score = 46.2 bits (108), Expect = 2e-05
 Identities = 58/283 (20%), Positives = 101/283 (35%), Gaps = 38/283 (13%)

Query: 70  YDTDFKNFNTNTDVSFGEGDIV---GVSRYDECILTAVKSGDVKLWRYKTEPDQWTNFNV 126
            D+     + ++ +  G  D +     S   E +L+    G +KLW          +   
Sbjct: 47  SDSLVSLPDLSSLLLRGHEDSITSIAFSPDGELLLSGSSDGTIKLWDLDNGEKLIKSLEG 106

Query: 127 GP-----QLECMKHNKHSNIIATGGKENDLKLWDLNT-TKNIFSAKNVSHRVLVYSVKPD 180
                  +L     + +S ++A+   +  +KLWDL+T  K I + +  S  V   +  PD
Sbjct: 107 LHDSSVSKLALSSPDGNSILLASSSLDGTVKLWDLSTPGKLIRTLEGHSESVTSLAFSPD 166

Query: 181 E---------------FDLRRPV----------WVNGISFLPNSDK-VAVCSKQGYVRVY 214
                           +DLR              V+ ++F P+    +A  S  G +R++
Sbjct: 167 GKLLASGSSLDGTIKLWDLRTGKPLSTLAGHTDPVSSLAFSPDGGLLIASGSSDGTIRLW 226

Query: 215 DPKITTQRRPVLKVDIPDVALSCITTTNQSHYVIVGSTTGDMMAVDLRGKGKPCYKYKGF 274
           D       R  L      V  S    +     +  GS+ G +   DLR          G 
Sbjct: 227 DLSTGKLLRSTLSGHSDSVVSSF---SPDGSLLASGSSDGTIRLWDLRSSSSLLRTLSGH 283

Query: 275 AGCVKDIACPLNENLILSVSLDRHFRVHDFHSKKLLRKDYMQS 317
           +  V  +A   +  L+ S S D   R+ D  + KLL    ++ 
Sbjct: 284 SSSVLSVAFSPDGKLLASGSSDGTVRLWDLETGKLLSSLTLKG 326



 Score = 39.7 bits (91), Expect = 0.003
 Identities = 27/167 (16%), Positives = 59/167 (35%), Gaps = 3/167 (1%)

Query: 149 NDLKLWDLNTTKNIFSAKNVSHRVLVYSVKPDEFDLRRPVWVNGISFLPNSDKVAVCSKQ 208
           +   L   ++   I     +S  ++                +  I+F P+ + +   S  
Sbjct: 27  SLSLLSLGSSESGILLLALLSDSLVSLPDLSSLLLRGHEDSITSIAFSPDGELLLSGSSD 86

Query: 209 GYVRVYDPKITTQRRPVLKVDIPDVALSCITTTNQSHYVIVGSTTGDMMAV--DLRGKGK 266
           G ++++D     +    L+            ++   + +++ S++ D      DL   GK
Sbjct: 87  GTIKLWDLDNGEKLIKSLEGLHDSSVSKLALSSPDGNSILLASSSLDGTVKLWDLSTPGK 146

Query: 267 PCYKYKGFAGCVKDIA-CPLNENLILSVSLDRHFRVHDFHSKKLLRK 312
                +G +  V  +A  P  + L    SLD   ++ D  + K L  
Sbjct: 147 LIRTLEGHSESVTSLAFSPDGKLLASGSSLDGTIKLWDLRTGKPLST 193


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 46.5 bits (111), Expect = 2e-05
 Identities = 28/105 (26%), Positives = 56/105 (53%), Gaps = 12/105 (11%)

Query: 331 ENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDE 390
           E E  +EE++  E   D D++++P+         +N+   +  ++  + +D  D E  DE
Sbjct: 285 EEEDDEEEEDSKESADDLDDEFEPD-------DDDNFGLGQGEEDEEEEEDGVDDE--DE 335

Query: 391 IDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGE 435
            D+D+D ++E E+    + EE+  + +     DED++E EE++ E
Sbjct: 336 EDDDDDLEEEEEDVDLSDEEEDEEDED---SDDEDDEEEEEEEKE 377



 Score = 45.8 bits (109), Expect = 4e-05
 Identities = 25/109 (22%), Positives = 49/109 (44%), Gaps = 6/109 (5%)

Query: 331 ENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDE 390
           E E  KEE E  ++      +       M     ++  E  + ++  D DDE + +  D 
Sbjct: 260 EEELAKEEAERLKKLEAERLR------RMRGEEEDDEEEEDSKESADDLDDEFEPDDDDN 313

Query: 391 IDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGG 439
               + ++DE EE+   + E+E  +++   E +ED   ++E++ E D  
Sbjct: 314 FGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDED 362



 Score = 41.5 bits (98), Expect = 8e-04
 Identities = 26/104 (25%), Positives = 55/104 (52%), Gaps = 7/104 (6%)

Query: 331 ENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVT--RENYAESKTNKNTSDYDD-----ES 383
           ++E +++ +E +++  D  E  D ++F + Q     E   +   +++  D DD     E 
Sbjct: 288 DDEEEEDSKESADDLDDEFEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEE 347

Query: 384 DIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEK 427
           D+++ DE +++ED+  + E+D EEE EE+  + +   E+   E 
Sbjct: 348 DVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSEL 391


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 44.6 bits (105), Expect = 1e-04
 Identities = 32/115 (27%), Positives = 48/115 (41%), Gaps = 10/115 (8%)

Query: 331  ENETKK--EEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIV 388
            ENE     E+ +  E++GD  +  D E  +M +   EN  E+   +      DE  +E  
Sbjct: 3983 ENEDLDLPEDLKLDEKEGDVSKDSDLEDMDM-EAADENKEEAD-AEKDEPMQDEDPLEEN 4040

Query: 389  DEIDNDEDKQD-----ESEEDGEEEG-EEEASENEYYTETDEDEKENEEDDGESD 437
            + +D D  + D     E +E   E+G EE   ENE  TE      E  E     +
Sbjct: 4041 NTLDEDIQQDDFSDLAEDDEKMNEDGFEENVQENEESTEDGVKSDEELEQGEVPE 4095



 Score = 40.8 bits (95), Expect = 0.002
 Identities = 25/112 (22%), Positives = 51/112 (45%), Gaps = 7/112 (6%)

Query: 329  SLENETKKEEQEHSEEKGDSDEKYDPEHFE----MNQVTRENYAESKTNKNTSDYDDESD 384
            +L  +   EE  + E+  +  +  + E  E    MN VT+++      N ++ + + + D
Sbjct: 3830 TLNEDDDLEELANEEDTANQSDLDESEARELESDMNGVTKDSVVSENENSDSEEENQDLD 3889

Query: 385  IEIVDEIDNDEDKQDES---EEDGEEEGEEEASENEYYTETDEDEKENEEDD 433
             E+ D  ++  +  +E    E + E+  E E   NE     +E +  ++EDD
Sbjct: 3890 EEVNDIPEDLSNSLNEKLWDEPNEEDLLETEQKSNEQSAANNESDLVSKEDD 3941



 Score = 38.1 bits (88), Expect = 0.010
 Identities = 28/114 (24%), Positives = 48/114 (42%), Gaps = 11/114 (9%)

Query: 336  KEEQEHSEEK----GDSDEKYDPEHFEMNQVTRENYA-----ESKTNKNTSDYDDESDIE 386
            ++E+E  EE     G  DE         +Q   EN       + K ++   D   +SD+E
Sbjct: 3950 RQEKEDEEEMSDDVGIDDEIQPDIQENNSQPPPENEDLDLPEDLKLDEKEGDVSKDSDLE 4009

Query: 387  IVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGGR 440
             +D    DE+K++   E  E   +E+  E       DED ++++  D   D  +
Sbjct: 4010 DMDMEAADENKEEADAEKDEPMQDEDPLEEN--NTLDEDIQQDDFSDLAEDDEK 4061



 Score = 36.5 bits (84), Expect = 0.031
 Identities = 25/104 (24%), Positives = 47/104 (45%), Gaps = 3/104 (2%)

Query: 336  KEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDE 395
            KE+   + E  D  EK D E    + V  ++  +    +N S    E++   + E    +
Sbjct: 3938 KEDDNKALEDKDRQEKEDEEEMS-DDVGIDDEIQPDIQENNSQPPPENEDLDLPEDLKLD 3996

Query: 396  DKQDESEEDG--EEEGEEEASENEYYTETDEDEKENEEDDGESD 437
            +K+ +  +D   E+   E A EN+   + ++DE   +ED  E +
Sbjct: 3997 EKEGDVSKDSDLEDMDMEAADENKEEADAEKDEPMQDEDPLEEN 4040



 Score = 35.0 bits (80), Expect = 0.10
 Identities = 31/119 (26%), Positives = 45/119 (37%), Gaps = 14/119 (11%)

Query: 329  SLENETKKEEQEHSEEKGD---SDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDI 385
             LE     +E    ++  D    DEK + + FE N    E   ES  +   SD + E   
Sbjct: 4036 PLEENNTLDEDIQQDDFSDLAEDDEKMNEDGFEENVQENE---ESTEDGVKSDEELEQGE 4092

Query: 386  EIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGGRRRGS 444
               D+  ++  K D        E +EE         TD+      E+ GE DG R  G+
Sbjct: 4093 VPEDQAIDNHPKMDAKSTFASAEADEEN--------TDKGIVGENEELGEEDGVRGNGT 4143



 Score = 34.2 bits (78), Expect = 0.16
 Identities = 36/152 (23%), Positives = 58/152 (38%), Gaps = 30/152 (19%)

Query: 315  MQSRLTSILVRSDFSLENE---TKKEEQEHSEEKGDS-------------DEKYDPEHFE 358
            ++S +  +   S  S ENE   +++E Q+  EE  D              DE  + +  E
Sbjct: 3860 LESDMNGVTKDSVVS-ENENSDSEEENQDLDEEVNDIPEDLSNSLNEKLWDEPNEEDLLE 3918

Query: 359  MNQVTRENYAESKTNKNTSDYDDESDIEIVD--------EIDNDEDKQDESEEDGEEEGE 410
              Q + E  A +  +   S  DD   +E  D        E+ +D    DE + D +E   
Sbjct: 3919 TEQKSNEQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSDDVGIDDEIQPDIQENNS 3978

Query: 411  EEASENEYY-----TETDEDEKENEEDDGESD 437
            +   ENE        + DE E +  +D    D
Sbjct: 3979 QPPPENEDLDLPEDLKLDEKEGDVSKDSDLED 4010



 Score = 33.4 bits (76), Expect = 0.32
 Identities = 17/63 (26%), Positives = 33/63 (52%), Gaps = 3/63 (4%)

Query: 375  NTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDG 434
             T+  +++ D+E   E+ N+ED  ++S+ D  E  E E+  N    ++   E EN + + 
Sbjct: 3827 ITNTLNEDDDLE---ELANEEDTANQSDLDESEARELESDMNGVTKDSVVSENENSDSEE 3883

Query: 435  ESD 437
            E+ 
Sbjct: 3884 ENQ 3886



 Score = 31.5 bits (71), Expect = 1.1
 Identities = 13/59 (22%), Positives = 27/59 (45%), Gaps = 1/59 (1%)

Query: 380  DDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTET-DEDEKENEEDDGESD 437
            D  + +   D+++   +++D + +   +E E    E++    T D    ENE  D E +
Sbjct: 3826 DITNTLNEDDDLEELANEEDTANQSDLDESEARELESDMNGVTKDSVVSENENSDSEEE 3884



 Score = 29.2 bits (65), Expect = 5.5
 Identities = 27/107 (25%), Positives = 43/107 (40%), Gaps = 21/107 (19%)

Query: 332  NETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEI 391
            N    E+ E  +EK D                 +   ++   K+  DYDD+  I    E 
Sbjct: 2922 NINVSEQIEEEDEKDD-----------EYGEQEKVSDKNDYLKDFIDYDDDGSISTESEP 2970

Query: 392  DNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDG 438
            + +E ++ ES  DG          N+Y  E D+DEK  +E + E + 
Sbjct: 2971 E-EEKRECESGFDGW---------NDYEEEEDDDEKNEKECELEQNA 3007


>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
           Provisional.
          Length = 2849

 Score = 43.5 bits (102), Expect = 3e-04
 Identities = 20/55 (36%), Positives = 32/55 (58%), Gaps = 2/55 (3%)

Query: 381 DESDIEIVDE--IDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDD 433
            E D+   D   ID+D++ +DE ++D E++ EEE  E E     DEDE++   +D
Sbjct: 141 AEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGED 195



 Score = 29.6 bits (66), Expect = 4.9
 Identities = 16/65 (24%), Positives = 36/65 (55%), Gaps = 6/65 (9%)

Query: 380 DDESDIEIVDEIDNDEDKQDESEED----GEEEGEEEASENEYYTETDEDEKE--NEEDD 433
           DD+ D +  D+ + D+++++E EE+     +E+ E+E  E+  Y +++ D+ +     + 
Sbjct: 155 DDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGEDFTYEKSEVDKTDCFKFIEA 214

Query: 434 GESDG 438
           G  D 
Sbjct: 215 GAGDD 219


>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein.  CDC45 is an essential gene
           required for initiation of DNA replication in S.
           cerevisiae, forming a complex with MCM5/CDC46.
           Homologues of CDC45 have been identified in human, mouse
           and smut fungus among others.
          Length = 583

 Score = 42.7 bits (101), Expect = 3e-04
 Identities = 16/66 (24%), Positives = 34/66 (51%), Gaps = 2/66 (3%)

Query: 379 YDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDG 438
           +DD    E + +    +D   + EED +++ E +  + E  +   ED++++++DD + D 
Sbjct: 101 FDDGDIEEELQDEPRYDDAYRDLEEDDDDDEESDEEDEE--SSKSEDDEDDDDDDDDDDI 158

Query: 439 GRRRGS 444
             R  S
Sbjct: 159 ATRERS 164



 Score = 29.2 bits (66), Expect = 5.8
 Identities = 13/68 (19%), Positives = 29/68 (42%), Gaps = 8/68 (11%)

Query: 375 NTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDG 434
              D     D     E D+D+D++ + E++   + E++        E D+D+ ++++   
Sbjct: 109 ELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDD--------EDDDDDDDDDDIAT 160

Query: 435 ESDGGRRR 442
                 RR
Sbjct: 161 RERSLERR 168


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
           Fas-binding protein) is thought to play a role in
           apoptosis, but precise role played by Daxx remains to be
           determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 42.6 bits (100), Expect = 4e-04
 Identities = 18/71 (25%), Positives = 38/71 (53%)

Query: 369 ESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKE 428
            S  + + +  + E +  + +E + +E++++E +E  EEEGE+E  E E   +   +E+ 
Sbjct: 427 TSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEM 486

Query: 429 NEEDDGESDGG 439
               +G+ DG 
Sbjct: 487 EGSSEGDGDGE 497



 Score = 40.7 bits (95), Expect = 0.001
 Identities = 21/114 (18%), Positives = 48/114 (42%)

Query: 331 ENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDE 390
           +++T++EE+   +E+                 T        + ++  +   E + E  +E
Sbjct: 395 QDDTEEEERRKRQERERQGTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEEEE 454

Query: 391 IDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGGRRRGS 444
            + +E + +E E + EEE EE  ++N    E +   + + + +   +   RR S
Sbjct: 455 EEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNS 508



 Score = 33.0 bits (75), Expect = 0.34
 Identities = 21/113 (18%), Positives = 48/113 (42%), Gaps = 4/113 (3%)

Query: 329 SLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEI- 387
           S E+ +   ++   EE  + +E+ + E  E  Q + E   E +  +   + D+ S+ E+ 
Sbjct: 428 SGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEME 487

Query: 388 ---VDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESD 437
                + D +E ++D    + E  G    SE +    +    +  +E+  + +
Sbjct: 488 GSSEGDGDGEEPEEDAERRNSEMAGISRMSEGQQPRGSSVQPESPQEEPLQPE 540


>gnl|CDD|222792 PHA00435, PHA00435, capsid assembly protein.
          Length = 306

 Score = 41.4 bits (97), Expect = 5e-04
 Identities = 36/126 (28%), Positives = 54/126 (42%), Gaps = 9/126 (7%)

Query: 314 YMQSRLTSILVRSDFSLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTN 373
           Y    + S +V      E+E      + +   GD     D       + + + Y     N
Sbjct: 9   YASFGVNSAVVTGSTPTEHEQNMLALDVAARDGD-----DAIELAEPETSDDPYG----N 59

Query: 374 KNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDD 433
            +    DDE  IE+    D +E++ +E EED EEEGEEE+ E E   +T E+  E  E  
Sbjct: 60  PDPFGEDDEGRIEVRISEDGEEEEVEEGEEDEEEEGEEESEEFEPLGDTPEELTEASEQL 119

Query: 434 GESDGG 439
            E + G
Sbjct: 120 EEHEEG 125


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
           family consists of several hypothetical bacterial
           proteins of around 200 residues in length. The function
           of this family is unknown.
          Length = 214

 Score = 40.5 bits (95), Expect = 7e-04
 Identities = 23/75 (30%), Positives = 36/75 (48%)

Query: 361 QVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYT 420
                +  E+K + +    + E   E   E  N EDK+D+ + + E+E  EE +E E   
Sbjct: 42  DQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEE 101

Query: 421 ETDEDEKENEEDDGE 435
            +DE+EKE EE    
Sbjct: 102 SSDENEKETEEKTES 116



 Score = 34.7 bits (80), Expect = 0.059
 Identities = 25/91 (27%), Positives = 43/91 (47%), Gaps = 10/91 (10%)

Query: 342 SEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDES 401
             ++  +DE+   +  +      E   E +  K  ++ +D+ D    ++ D      +ES
Sbjct: 40  PSDQAAADEQEAKKSDDQETAEIEEVKEEE--KEAANSEDKEDKGDAEKED------EES 91

Query: 402 EEDGEEEGEEEASENEYYTETDEDEKENEED 432
           EE+ EEE EE + ENE   ET+E  + N E 
Sbjct: 92  EEENEEEDEESSDENE--KETEEKTESNVEK 120



 Score = 30.9 bits (70), Expect = 0.98
 Identities = 20/107 (18%), Positives = 46/107 (42%), Gaps = 13/107 (12%)

Query: 319 LTSILVRSDFSLE-NETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTS 377
              +   S  S +    ++E ++  +++    E+   E  E         +E K +K  +
Sbjct: 31  AYQLFFPSSPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAAN------SEDKEDKGDA 84

Query: 378 DYDDESDIEIVDEIDNDEDKQDESEEDG-EEEGEEEASENEYYTETD 423
           + +DE       E +N+E+ ++ S+E+  E E + E++  +  T   
Sbjct: 85  EKEDEES-----EEENEEEDEESSDENEKETEEKTESNVEKEITNPS 126



 Score = 28.2 bits (63), Expect = 7.0
 Identities = 16/62 (25%), Positives = 31/62 (50%), Gaps = 4/62 (6%)

Query: 380 DDESDIEIVDEIDNDEDKQDE--SEEDGEEEGEEEA--SENEYYTETDEDEKENEEDDGE 435
              SD    DE +  +    E    E+ +EE +E A   + E   + +++++E+EE++ E
Sbjct: 38  SSPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEE 97

Query: 436 SD 437
            D
Sbjct: 98  ED 99


>gnl|CDD|130712 TIGR01651, CobT, cobaltochelatase, CobT subunit.  This model
           describes Pseudomonas denitrificans CobT gene product,
           which is a cobalt chelatase subunit that functions in
           cobalamin biosynthesis. Cobalamin (vitamin B12) can be
           synthesized via several pathways, including an aerobic
           pathway (found in Pseudomonas denitrificans) and an
           anaerobic pathway (found in P. shermanii and Salmonella
           typhimurium). These pathways differ in the point of
           cobalt insertion during corrin ring formation. There are
           apparently a number of variations on these two pathways,
           where the major differences seem to be concerned with
           the process of ring contraction. Confusion regarding the
           functions of enzymes found in the aerobic vs. anaerobic
           pathways has arisen because nonhomologous genes in these
           different pathways were given the same gene symbols.
           Thus, cobT in the aerobic pathway (P. denitrificans) is
           not a homolog of cobT in the anaerobic pathway (S.
           typhimurium). It should be noted that E. coli
           synthesizes cobalamin only when it is supplied with the
           precursor cobinamide, which is a complex intermediate.
           Additionally, all E. coli cobalamin synthesis genes
           (cobU, cobS and cobT) were named after their Salmonella
           typhimurium homologs which function in the anaerobic
           cobalamin synthesis pathway. This model describes the
           aerobic cobalamin pathway Pseudomonas denitrificans CobT
           gene product, which is a cobalt chelatase subunit, with
           a MW ~70 kDa. The aerobic pathway cobalt chelatase is a
           heterotrimeric, ATP-dependent enzyme that catalyzes
           cobalt insertion during cobalamin biosynthesis. The
           other two subunits are the P. denitrificans CobS
           (TIGR01650) and CobN (pfam02514 CobN/Magnesium
           Chelatase) proteins. To avoid potential confusion with
           the nonhomologous Salmonella typhimurium/E.coli cobT
           gene product, the P. denitrificans gene symbol is not
           used in the name of this model [Biosynthesis of
           cofactors, prosthetic groups, and carriers, Heme,
           porphyrin, and cobalamin].
          Length = 600

 Score = 41.5 bits (97), Expect = 7e-04
 Identities = 27/80 (33%), Positives = 39/80 (48%), Gaps = 8/80 (10%)

Query: 361 QVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDED--KQDESEEDGEEEGE-EEASENE 417
           +V RE     +  +   D     D E  DE D D+D   ++E EE GE EGE +E S  +
Sbjct: 188 RVVREMLRSMELAEEMGD-----DTESEDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQ 242

Query: 418 YYTETDEDEKENEEDDGESD 437
               TD + +  EE+  +SD
Sbjct: 243 ESEATDRESESGEEEMVQSD 262



 Score = 41.1 bits (96), Expect = 0.001
 Identities = 24/103 (23%), Positives = 40/103 (38%), Gaps = 18/103 (17%)

Query: 340 EHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQD 399
           E +EE GD  E  D E  + +Q T     E++  +      +  +     E         
Sbjct: 198 ELAEEMGDDTESEDEEDGDDDQPT-----ENEQEEQGEGEGEGQEGSAPQE--------S 244

Query: 400 ESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGGRRR 442
           E+ +   E GEEE  ++       +D  +  +DD E+ G   R
Sbjct: 245 EATDRESESGEEEMVQS-----DQDDLPDESDDDSETPGEGAR 282



 Score = 34.5 bits (79), Expect = 0.11
 Identities = 15/79 (18%), Positives = 31/79 (39%), Gaps = 15/79 (18%)

Query: 332 NETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEI 391
           ++ +  E E  E+     E  +              +  + ++ T    +  + E+V   
Sbjct: 216 DDDQPTENEQEEQGEGEGEGQEG-------------SAPQESEATDRESESGEEEMVQS- 261

Query: 392 DNDEDKQDESEEDGEEEGE 410
            + +D  DES++D E  GE
Sbjct: 262 -DQDDLPDESDDDSETPGE 279



 Score = 28.8 bits (64), Expect = 6.4
 Identities = 13/59 (22%), Positives = 29/59 (49%), Gaps = 2/59 (3%)

Query: 380 DDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDG 438
           D ++   +V E+    +  +E  +D E E EE+  +++     +E E++ E +    +G
Sbjct: 182 DQQAFARVVREMLRSMELAEEMGDDTESEDEEDGDDDQ--PTENEQEEQGEGEGEGQEG 238


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 40.7 bits (95), Expect = 0.001
 Identities = 32/123 (26%), Positives = 52/123 (42%), Gaps = 8/123 (6%)

Query: 323 LVRSDFSLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYD-D 381
           L   D   ++E   +E +   E  D DE+   +  +     ++   + K  K   D D D
Sbjct: 213 LKIKDLEGDDEDDGDESDKGGE--DGDEEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDAD 270

Query: 382 ESDIEIVDEIDNDEDKQDESEEDGEEEGEEEAS-----ENEYYTETDEDEKENEEDDGES 436
           E D +  D+   +ED   +S   G +  E E         +   E DED +E+EE+  E 
Sbjct: 271 EYDSDDGDDEGREEDYISDSSASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEE 330

Query: 437 DGG 439
           +GG
Sbjct: 331 EGG 333



 Score = 34.2 bits (78), Expect = 0.14
 Identities = 24/114 (21%), Positives = 48/114 (42%), Gaps = 9/114 (7%)

Query: 330 LENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVD 389
            +++  K   +   ++ DSD+  D    E + ++  + + +   +       E  I    
Sbjct: 256 DDDKKGKRGGDDDADEYDSDDGDDEGR-EEDYISDSSASGNDPEEREDKLSPE--IPAKP 312

Query: 390 EIDNDEDKQDESEEDGEEEGEEEASENEYYT------ETDEDEKENEEDDGESD 437
           EI+ DED ++  EE  EEEG       +           D+D+ ++ +D  +SD
Sbjct: 313 EIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSGDDSDDSD 366



 Score = 30.3 bits (68), Expect = 2.2
 Identities = 20/70 (28%), Positives = 32/70 (45%), Gaps = 15/70 (21%)

Query: 384 DIEIVDEIDNDEDKQDESEEDGEEEGEE---------------EASENEYYTETDEDEKE 428
           D++I D   +DED  DES++ GE+  EE                  + +     D+D  E
Sbjct: 212 DLKIKDLEGDDEDDGDESDKGGEDGDEEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDADE 271

Query: 429 NEEDDGESDG 438
            + DDG+ +G
Sbjct: 272 YDSDDGDDEG 281


>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
           [Transcription / Chromatin structure and dynamics].
          Length = 279

 Score = 40.4 bits (94), Expect = 0.001
 Identities = 29/117 (24%), Positives = 52/117 (44%), Gaps = 3/117 (2%)

Query: 324 VRSDFSLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDES 383
           VR  + + NE     +    +  +  EK D E  E   +  E    ++ N    + +D  
Sbjct: 107 VRVGYYVNNEYPGITKLEKSDVEEPSEKVDEEDVERE-ILAEKPRVTRFNIVWDNDEDND 165

Query: 384 DIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDED--EKENEEDDGESDG 438
           +        ++E+++   E DG EE E+E   ++ Y E + +  E+E EE +G  DG
Sbjct: 166 EAPPAQPDVDNEEEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEEEAEGSDDG 222



 Score = 35.4 bits (81), Expect = 0.046
 Identities = 21/117 (17%), Positives = 50/117 (42%), Gaps = 3/117 (2%)

Query: 325 RSDFSLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESD 384
           R +   +N+   +E   ++   D++E+   E  +  +   E   E   + +  + + E +
Sbjct: 153 RFNIVWDNDEDNDEAPPAQPDVDNEEEERLEESDGRE---EEEDEEVGSDSYGEGNRELN 209

Query: 385 IEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGGRR 441
            E  +E +  +D +D  + +GE   +++  E E   E     +   E++  S+   R
Sbjct: 210 EEEEEEAEGSDDGEDVVDYEGERIDKKQGEEEEMEEEVINLFEIEWEEESPSEEVPR 266



 Score = 28.8 bits (64), Expect = 5.0
 Identities = 27/97 (27%), Positives = 41/97 (42%), Gaps = 5/97 (5%)

Query: 337 EEQEHSEEKGDSDEKYDPE--HFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEIDND 394
           EE+E  EE    +E+ D E       +  RE   E +  +     D E  ++   E +  
Sbjct: 177 EEEERLEESDGREEEEDEEVGSDSYGEGNRE-LNEEEEEEAEGSDDGEDVVD--YEGERI 233

Query: 395 EDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEE 431
           + KQ E EE  EE       E E  + ++E  + NEE
Sbjct: 234 DKKQGEEEEMEEEVINLFEIEWEEESPSEEVPRNNEE 270


>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein.  This is a family of
           fungal and plant proteins and contains many hypothetical
           proteins. VID27 is a cytoplasmic protein that plays a
           potential role in vacuolar protein degradation.
          Length = 794

 Score = 40.9 bits (96), Expect = 0.001
 Identities = 21/78 (26%), Positives = 41/78 (52%)

Query: 361 QVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYT 420
            + ++ + ++K  +     D  S +EI D     +D+++E EE+ EEE E+E    E+  
Sbjct: 355 TLNKQKWTKAKETEQDYILDAFSALEIEDANTERDDEEEEDEEEEEEEDEDEGPSKEHSD 414

Query: 421 ETDEDEKENEEDDGESDG 438
           + + +E + E    +SDG
Sbjct: 415 DEEFEEDDVESKYEDSDG 432



 Score = 28.6 bits (64), Expect = 7.9
 Identities = 13/67 (19%), Positives = 29/67 (43%)

Query: 363 TRENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTET 422
           T ++Y     +    +  +    +  +E + +E+++DE E   +E  ++E  E +     
Sbjct: 367 TEQDYILDAFSALEIEDANTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVESK 426

Query: 423 DEDEKEN 429
            ED   N
Sbjct: 427 YEDSDGN 433


>gnl|CDD|184885 PRK14891, PRK14891, 50S ribosomal protein L24e/unknown domain
           fusion protein; Provisional.
          Length = 131

 Score = 38.4 bits (89), Expect = 0.001
 Identities = 17/75 (22%), Positives = 30/75 (40%)

Query: 363 TRENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTET 422
           T    AE       +   + ++     + D D D+  E++   E + EEE  E    T  
Sbjct: 53  TEAGRAEKGPAAAATAAAEAAEEAEAADADEDADEAAEADAADEADEEEETDEAVDETAD 112

Query: 423 DEDEKENEEDDGESD 437
           + D +  E D+ E +
Sbjct: 113 EADAEAEEADEEEDE 127



 Score = 29.9 bits (67), Expect = 1.0
 Identities = 13/45 (28%), Positives = 24/45 (53%)

Query: 395 EDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGG 439
            +  +E+E    +E  +EA+E +   E DE+E+ +E  D  +D  
Sbjct: 70  AEAAEEAEAADADEDADEAAEADAADEADEEEETDEAVDETADEA 114


>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT.  This family
           consists of several bacterial cobalamin biosynthesis
           (CobT) proteins. CobT is involved in the transformation
           of precorrin-3 into cobyrinic acid.
          Length = 282

 Score = 39.4 bits (92), Expect = 0.002
 Identities = 15/77 (19%), Positives = 35/77 (45%), Gaps = 1/77 (1%)

Query: 361 QVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYT 420
           +V R+  +     +   D  + +D E  +E ++D  + ++ ++  EEE     S +E   
Sbjct: 197 RVVRDMLSSMDMAEELGDEPESADSE-DNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSD 255

Query: 421 ETDEDEKENEEDDGESD 437
            + E+ +  E +  E+ 
Sbjct: 256 ASSEEMESGEMEAAEAS 272



 Score = 32.5 bits (74), Expect = 0.34
 Identities = 13/44 (29%), Positives = 23/44 (52%), Gaps = 4/44 (9%)

Query: 396 DKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGG 439
              D +EE G+E    ++ +NE     ++D KE+E+DD   +  
Sbjct: 204 SSMDMAEELGDEPESADSEDNE----DEDDPKEDEDDDQGEEEE 243



 Score = 31.3 bits (71), Expect = 0.79
 Identities = 12/71 (16%), Positives = 26/71 (36%), Gaps = 5/71 (7%)

Query: 365 ENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDE 424
              A+S+ N++  D  ++ D    D+   +E+         + +   E  E     E  E
Sbjct: 216 PESADSEDNEDEDDPKEDED----DDQGEEEESGSSDSLSEDSDASSEEME-SGEMEAAE 270

Query: 425 DEKENEEDDGE 435
              ++  D  +
Sbjct: 271 ASADDTPDSDD 281



 Score = 31.3 bits (71), Expect = 0.91
 Identities = 22/92 (23%), Positives = 37/92 (40%), Gaps = 14/92 (15%)

Query: 336 KEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDE 395
               + +EE GD  E  D E  E     +E+           + DD+ + E  +   +D 
Sbjct: 203 LSSMDMAEELGDEPESADSEDNEDEDDPKED-----------EDDDQGEEE--ESGSSDS 249

Query: 396 DKQD-ESEEDGEEEGEEEASENEYYTETDEDE 426
             +D ++  +  E GE EA+E       D D+
Sbjct: 250 LSEDSDASSEEMESGEMEAAEASADDTPDSDD 281



 Score = 29.8 bits (67), Expect = 2.9
 Identities = 15/53 (28%), Positives = 26/53 (49%), Gaps = 5/53 (9%)

Query: 386 EIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDG 438
           +++  +D  E+  DE E    E+ E+E    E     DED+ + EE++  S  
Sbjct: 201 DMLSSMDMAEELGDEPESADSEDNEDEDDPKE-----DEDDDQGEEEESGSSD 248


>gnl|CDD|177447 PHA02664, PHA02664, hypothetical protein; Provisional.
          Length = 534

 Score = 39.6 bits (92), Expect = 0.003
 Identities = 20/55 (36%), Positives = 29/55 (52%), Gaps = 4/55 (7%)

Query: 389 DEIDNDEDKQDESEEDGEEEGEEEASENEY-YTETDEDEKENEEDDG---ESDGG 439
           D  D+DED  DE E   E    E+ S++ Y Y+ T  +++ +  DD    ESD G
Sbjct: 448 DRADSDEDDMDEQESGDERADGEDDSDSSYSYSTTSSEDESDSADDSWGDESDSG 502


>gnl|CDD|240226 PTZ00007, PTZ00007, (NAP-L) nucleosome assembly protein -L;
           Provisional.
          Length = 337

 Score = 38.6 bits (90), Expect = 0.004
 Identities = 11/55 (20%), Positives = 23/55 (41%), Gaps = 2/55 (3%)

Query: 390 EIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGGRRRGS 444
           E  +++      E+D + +  +  S +   ++++ D   NEEDD          S
Sbjct: 278 EAIDEDSDYSSDEDDDDYDSYD--SSDSASSDSNSDVDTNEEDDRGEKESNGAKS 330


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 38.8 bits (91), Expect = 0.005
 Identities = 25/103 (24%), Positives = 50/103 (48%), Gaps = 2/103 (1%)

Query: 330 LENETKKEEQEHSEEKGDSDEK--YDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEI 387
           L++E    ++   +   D D+   Y  +   +NQ   ++  +   + +  D DD+ D E 
Sbjct: 96  LKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDED 155

Query: 388 VDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENE 430
            DE D+D+D  DE EE  E +  E+ S+++ +   ++D +   
Sbjct: 156 DDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALR 198



 Score = 38.4 bits (90), Expect = 0.007
 Identities = 24/108 (22%), Positives = 53/108 (49%), Gaps = 6/108 (5%)

Query: 333 ETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEID 392
           + + +  + +E+K   D+  D  + +   V      +   + +  D DD+ DI+  D+ +
Sbjct: 97  KDELDSSKKAEKKNALDKDDDLNYVKDIDVL-NQADDDDDDDDDDDLDDD-DIDDDDDDE 154

Query: 393 NDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGGR 440
           +D++  D+ + D E+E ++EA E     E   D+ +   D+ +S+  R
Sbjct: 155 DDDEDDDDDDVDDEDEEKKEAKE----LEKLSDDDDFVWDEDDSEALR 198



 Score = 34.6 bits (80), Expect = 0.095
 Identities = 18/98 (18%), Positives = 49/98 (50%), Gaps = 1/98 (1%)

Query: 329 SLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIV 388
            L++  K E++   ++  D +   D +         ++  +   + +  D DD+ + +  
Sbjct: 99  ELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDE 158

Query: 389 DEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDE 426
           D+ D+D D +DE +++  +E E+ + ++++  + D+ E
Sbjct: 159 DDDDDDVDDEDEEKKE-AKELEKLSDDDDFVWDEDDSE 195


>gnl|CDD|173534 PTZ00341, PTZ00341, Ring-infected erythrocyte surface antigen;
            Provisional.
          Length = 1136

 Score = 39.0 bits (90), Expect = 0.005
 Identities = 33/110 (30%), Positives = 61/110 (55%), Gaps = 6/110 (5%)

Query: 329  SLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIV 388
            ++E   ++ ++E+ EE  ++ E+YD E+ E  +   E   E    +N  +YD+E+  EI 
Sbjct: 1014 NIEENVEEYDEENVEEVEENVEEYDEENVEEIEENAEENVEENIEENIEEYDEENVEEIE 1073

Query: 389  DEID-NDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESD 437
            + I+ N E+  +E+ E+  EE EE   EN      +E+ +EN E++ E +
Sbjct: 1074 ENIEENIEENVEENVEENVEEIEENVEEN-----VEENAEENAEENAEEN 1118



 Score = 34.4 bits (78), Expect = 0.13
 Identities = 32/110 (29%), Positives = 55/110 (50%), Gaps = 3/110 (2%)

Query: 331  ENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDE 390
            EN  +  E+   E   ++ E+YD E+ E  +   E Y E    +   + ++  +  I + 
Sbjct: 1001 ENIEENVEENVEENIEENVEEYDEENVEEVEENVEEYDEENVEEIEENAEENVEENIEEN 1060

Query: 391  IDN-DEDKQDESEEDGEEEGEEEASEN--EYYTETDEDEKENEEDDGESD 437
            I+  DE+  +E EE+ EE  EE   EN  E   E +E+ +EN E++ E +
Sbjct: 1061 IEEYDEENVEEIEENIEENIEENVEENVEENVEEIEENVEENVEENAEEN 1110



 Score = 29.0 bits (64), Expect = 5.9
 Identities = 26/116 (22%), Positives = 44/116 (37%), Gaps = 11/116 (9%)

Query: 327 DFSLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIE 386
           D S +   +  E+EHS +  + +   D EH E +        +S          +E   E
Sbjct: 429 DGSEDESVEDNEEEHSGDANEEELSVD-EHVEEHNADDSGEQQSDDESGEHQSVNEIVEE 487

Query: 387 ----------IVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEED 432
                      V +I   E   +  EE   +E EE+ + +E+  E    E+  EE+
Sbjct: 488 QSVNEHVEEPTVADIVEQETVDEHVEEPAVDENEEQQTADEHVEEPTIAEEHVEEE 543


>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
           subunit 1; Provisional.
          Length = 319

 Score = 38.1 bits (89), Expect = 0.006
 Identities = 23/69 (33%), Positives = 38/69 (55%), Gaps = 2/69 (2%)

Query: 369 ESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKE 428
           +    K   D+  + + E+V    ++ED ++  E+  EEE E++ SE+E   E DEDE+E
Sbjct: 253 KEVIKKKGGDFKVKGEPEVVG--GDEEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEE 310

Query: 429 NEEDDGESD 437
            E+DD    
Sbjct: 311 EEDDDEGDK 319


>gnl|CDD|218538 pfam05285, SDA1, SDA1.  This family consists of several SDA1
           protein homologues. SDA1 is a Saccharomyces cerevisiae
           protein which is involved in the control of the actin
           cytoskeleton. The protein is essential for cell
           viability and is localised in the nucleus.
          Length = 317

 Score = 38.1 bits (89), Expect = 0.006
 Identities = 21/106 (19%), Positives = 45/106 (42%), Gaps = 18/106 (16%)

Query: 330 LENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVD 389
           LE   ++E ++   E+G   +                  +    +     +DE   +  +
Sbjct: 75  LEKWKEEERKKKEAEQGLESD-----------------DDDDEEEEWEVEEDEDSDDEGE 117

Query: 390 EIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGE 435
            ID + DK+ ES  D E+E E++ +  +   ++DE+  E +E++  
Sbjct: 118 WIDVESDKEIESS-DSEDEEEKDEAAKKAKEDSDEELSEEDEEEAA 162



 Score = 37.7 bits (88), Expect = 0.009
 Identities = 30/108 (27%), Positives = 49/108 (45%), Gaps = 19/108 (17%)

Query: 330 LENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDE-----SD 384
            E E KK+E E   E  D D++ +    E +                SD + E     SD
Sbjct: 79  KEEERKKKEAEQGLESDDDDDEEEEWEVEED--------------EDSDDEGEWIDVESD 124

Query: 385 IEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEED 432
            EI      DE+++DE+ +  +E+ +EE SE +     +E+E E E++
Sbjct: 125 KEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKE 172



 Score = 36.6 bits (85), Expect = 0.019
 Identities = 23/60 (38%), Positives = 32/60 (53%), Gaps = 1/60 (1%)

Query: 377 SDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGES 436
           SD DD+ + E   E D D D + E   D E + E E+S++E   E DE  K+ +ED  E 
Sbjct: 94  SDDDDDEEEEWEVEEDEDSDDEGE-WIDVESDKEIESSDSEDEEEKDEAAKKAKEDSDEE 152



 Score = 33.1 bits (76), Expect = 0.24
 Identities = 15/60 (25%), Positives = 31/60 (51%)

Query: 379 YDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDG 438
            D    +E++++   +E K+ E+E+  E + +++  E     E ++ + E E  D ESD 
Sbjct: 66  VDGIPGLELLEKWKEEERKKKEAEQGLESDDDDDEEEEWEVEEDEDSDDEGEWIDVESDK 125


>gnl|CDD|240433 PTZ00482, PTZ00482, membrane-attack complex/perforin (MACPF)
           Superfamily; Provisional.
          Length = 844

 Score = 38.7 bits (90), Expect = 0.007
 Identities = 26/130 (20%), Positives = 51/130 (39%), Gaps = 12/130 (9%)

Query: 310 LRKDYMQSRLTSILVRSDFSLE-NETKKEEQEHSEEKGDS-DEKYDPEHFEMNQVTRENY 367
           LR +Y+QS +  I   S      +   ++E +H+    +S           +NQ  R++ 
Sbjct: 29  LRSNYLQSTVEDIEGPSAVDERTSGVLRDEGKHANILYNSILCNQKKHASFLNQ--RKSL 86

Query: 368 AESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEK 427
            +   ++    Y+D+ D       D       ES  D +   E    + +  T+ + D+ 
Sbjct: 87  DDDDDDEFDFLYEDDED-------DAGNATSGESSTDDDSLLELPDRDEDADTQANNDQ- 138

Query: 428 ENEEDDGESD 437
            N+ D  +S 
Sbjct: 139 TNDFDQDDSS 148


>gnl|CDD|237171 PRK12678, PRK12678, transcription termination factor Rho;
           Provisional.
          Length = 672

 Score = 38.3 bits (90), Expect = 0.008
 Identities = 17/113 (15%), Positives = 40/113 (35%), Gaps = 1/113 (0%)

Query: 331 ENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDE 390
           E  +  E  +  E +   +        +  +   +   E++ +      ++E D E    
Sbjct: 117 EAASAPEAAQARERRERGEAARRGAARKAGEGGEQPATEARADAAERTEEEERD-ERRRR 175

Query: 391 IDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGGRRRG 443
            D ++ + +    +     E     ++       ++ +  E+ G  DGG RRG
Sbjct: 176 GDREDRQAEAERGERGRREERGRDGDDRDRRDRREQGDRREERGRRDGGDRRG 228



 Score = 33.7 bits (78), Expect = 0.21
 Identities = 16/110 (14%), Positives = 34/110 (30%), Gaps = 9/110 (8%)

Query: 336 KEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYD-DESDIEIVDEIDND 394
           + E         + E  +    E      E   E + ++     D ++   E   E    
Sbjct: 133 RGEAARRGAARKAGEGGEQPATEARADAAERTEEEERDERRRRGDREDRQAE--AERGER 190

Query: 395 EDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGGRRRGS 444
             +++   +  + +  +   +       D  E+    D G+  G RRR  
Sbjct: 191 GRREERGRDGDDRDRRDRREQG------DRREERGRRDGGDRRGRRRRRD 234



 Score = 32.2 bits (74), Expect = 0.61
 Identities = 23/115 (20%), Positives = 42/115 (36%), Gaps = 13/115 (11%)

Query: 331 ENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDE 390
              T++EE++    +GD +++      E  +  R    E   + +  D  D  +      
Sbjct: 161 AERTEEEERDERRRRGDREDRQA----EAERGERGRREERGRDGDDRDRRDRRE------ 210

Query: 391 IDNDEDKQDESEEDGEEEGEEEASENEYYTE--TDEDEKENEEDDGESDGGRRRG 443
              D  ++    + G+  G     +          ED  + + DDGE  GGRR  
Sbjct: 211 -QGDRREERGRRDGGDRRGRRRRRDRRDARGDDNREDRGDRDGDDGEGRGGRRGR 264


>gnl|CDD|223003 PHA03169, PHA03169, hypothetical protein; Provisional.
          Length = 413

 Score = 38.0 bits (88), Expect = 0.009
 Identities = 13/106 (12%), Positives = 25/106 (23%), Gaps = 14/106 (13%)

Query: 337 EEQEHSEEKGDSDEKYD-PEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDE 395
             + H+         +  P H        E+  E      +    D       +   +  
Sbjct: 151 PPESHNPSPNQQPSSFLQPSH--------EDSPEEPEPPTSEPEPDSPGPPQSETPTSSP 202

Query: 396 DKQDESEEDGEEEGE-----EEASENEYYTETDEDEKENEEDDGES 436
             Q   +E GE +          +  +     DE  +   E     
Sbjct: 203 PPQSPPDEPGEPQSPTPQQAPSPNTQQAVEHEDEPTEPEREGPPFP 248



 Score = 36.1 bits (83), Expect = 0.030
 Identities = 14/126 (11%), Positives = 27/126 (21%), Gaps = 4/126 (3%)

Query: 316 QSRLTSILVRSDFSLENETKKE-EQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNK 374
           +SR      R          +          G ++E         N       + +  + 
Sbjct: 78  ESRHGEKEERGQGGPSGSGSESVGSPTPSPSGSAEELAS-GLSPENTSGSSPESPASHSP 136

Query: 375 NTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEED-- 432
             S        E      ++     +     +   E+   E E  T   E +        
Sbjct: 137 PPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSE 196

Query: 433 DGESDG 438
              S  
Sbjct: 197 TPTSSP 202



 Score = 33.4 bits (76), Expect = 0.24
 Identities = 20/119 (16%), Positives = 33/119 (27%), Gaps = 13/119 (10%)

Query: 331 ENETKKEEQEHSE--EKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIV 388
           E     E++E  +    G   E         +  + E  A   + +NTS    ES     
Sbjct: 77  EESRHGEKEERGQGGPSGSGSESVGSPTPSPSG-SAEELASGLSPENTSGSSPESPAS-- 133

Query: 389 DEIDND--------EDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGG 439
                         E    ES      +      +  +    +E E    E + +S G 
Sbjct: 134 HSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGP 192



 Score = 31.5 bits (71), Expect = 0.90
 Identities = 16/110 (14%), Positives = 30/110 (27%), Gaps = 2/110 (1%)

Query: 330 LENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVD 389
           L  E        S     S     P H   ++           N+  S +   S  +  +
Sbjct: 118 LSPENTSGSSPESPA-SHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPE 176

Query: 390 EIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGG 439
           E +     + E +  G  + E   S     +  DE  +       ++   
Sbjct: 177 EPE-PPTSEPEPDSPGPPQSETPTSSPPPQSPPDEPGEPQSPTPQQAPSP 225



 Score = 31.1 bits (70), Expect = 1.1
 Identities = 15/107 (14%), Positives = 19/107 (17%), Gaps = 15/107 (14%)

Query: 337 EEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDED 396
           E    S  +  +     P                           ES             
Sbjct: 121 ENTSGSSPESPASHSPPPS-------------PPSHPGPHEPAPPESHNP--SPNQQPSS 165

Query: 397 KQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGGRRRG 443
               S ED  EE E   SE E  +      +               G
Sbjct: 166 FLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPPDEPG 212



 Score = 28.4 bits (63), Expect = 8.7
 Identities = 11/82 (13%), Positives = 23/82 (28%), Gaps = 3/82 (3%)

Query: 361 QVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDG--EEEGEEEASENEY 418
           Q  R+  ++++T + +   + E   +        E     +       EE     S    
Sbjct: 64  QGHRQTESDTETAEESRHGEKEERGQGGPSGSGSESVGSPTPSPSGSAEELASGLSPENT 123

Query: 419 YTETDEDEKENEEDDGE-SDGG 439
              + E    +       S  G
Sbjct: 124 SGSSPESPASHSPPPSPPSHPG 145


>gnl|CDD|220102 pfam09073, BUD22, BUD22.  BUD22 has been shown in yeast to be a
           nuclear protein involved in bud-site selection. It plays
           a role in positioning the proximal bud pole signal. More
           recently it has been shown to be involved in ribosome
           biogenesis.
          Length = 424

 Score = 37.5 bits (87), Expect = 0.011
 Identities = 29/133 (21%), Positives = 58/133 (43%), Gaps = 4/133 (3%)

Query: 308 KLLRKDYMQSRLTSILVRSDFSLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENY 367
           +LL    +++ L SI+   D  L  ETK ++ +  ++   S +K   E  + +       
Sbjct: 118 RLLNSKPVKNVLESIMEGLDKILGIETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESES 177

Query: 368 AESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESE--EDGEEEGEEEASENEYYTETDED 425
            +    K+    +D+SD E  ++ D+++  Q +    +  +EE  EEA    Y  +T E 
Sbjct: 178 EDES--KSEESAEDDSDDEEEEDSDSEDYSQYDGMLVDSSDEEEGEEAPSINYNEDTSES 235

Query: 426 EKENEEDDGESDG 438
           E +  + +     
Sbjct: 236 ESDESDSEISESR 248



 Score = 30.6 bits (69), Expect = 1.8
 Identities = 23/124 (18%), Positives = 51/124 (41%), Gaps = 4/124 (3%)

Query: 318 RLTSILVRSDFSLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTS 377
             T            ++KK+E + S +K D +E    +  +  +   ++  + +   + S
Sbjct: 142 IETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDS 201

Query: 378 DYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESD 437
           +   + D  +VD  D +E ++  S    E+  E E+ E    ++++  E  +  D  ES 
Sbjct: 202 EDYSQYDGMLVDSSDEEEGEEAPSINYNEDTSESESDE----SDSEISESRSVSDSEESS 257

Query: 438 GGRR 441
              +
Sbjct: 258 PPSK 261


>gnl|CDD|237063 PRK12329, nusA, transcription elongation factor NusA; Provisional.
          Length = 449

 Score = 37.5 bits (87), Expect = 0.013
 Identities = 14/64 (21%), Positives = 32/64 (50%)

Query: 374 KNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDD 433
           K++++YD E++   V E+ +  ++++  + + EE  E E +E         +     ED+
Sbjct: 371 KDSAEYDQEAEDAKVAELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDE 430

Query: 434 GESD 437
            E +
Sbjct: 431 FEDE 434



 Score = 30.1 bits (68), Expect = 2.4
 Identities = 14/79 (17%), Positives = 33/79 (41%)

Query: 336 KEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDE 395
           K+  E+ +E  D+         E  +  +    E    +     ++++ +  +  +  DE
Sbjct: 371 KDSAEYDQEAEDAKVAELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDE 430

Query: 396 DKQDESEEDGEEEGEEEAS 414
            + ++  E+ + E EEEA 
Sbjct: 431 FEDEDELEEAQPEEEEEAR 449


>gnl|CDD|219405 pfam07418, PCEMA1, Acidic phosphoprotein precursor PCEMA1.  This
           family consists of several acidic phosphoprotein
           precursor PCEMA1 sequences which appear to be found
           exclusively in Plasmodium chabaudi. PCEMA1 is an antigen
           that is associated with the membrane of the infected
           erythrocyte throughout the entire intraerythrocytic
           cycle. The exact function of this family is unclear.
          Length = 286

 Score = 36.8 bits (85), Expect = 0.014
 Identities = 24/110 (21%), Positives = 42/110 (38%), Gaps = 14/110 (12%)

Query: 333 ETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEID 392
            T  E+ E  EE      + D E           + + +  +N  + + E    + D  +
Sbjct: 183 STLHEDDELDEEVTSYLNELDDE-------VTSYFNDGENEENDDELEAEVISYLKDGEN 235

Query: 393 NDEDKQ------DESEEDGEEEGEEEAS-ENEYYTETDEDEKENEEDDGE 435
           ++E K+       E + D     E E   E+E   E +  E++  ED GE
Sbjct: 236 DNEVKEKIRREYREWKGDKANTNETEIEDESEDEYEEEAGEEQENEDKGE 285



 Score = 30.6 bits (69), Expect = 1.7
 Identities = 20/78 (25%), Positives = 41/78 (52%), Gaps = 4/78 (5%)

Query: 331 ENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNK-NTSDYDDESDIEIVD 389
           + E ++ + E   E     +  + ++ E+ +  R  Y E K +K NT++ + E + E  D
Sbjct: 212 DGENEENDDELEAEVISYLKDGENDN-EVKEKIRREYREWKGDKANTNETEIEDESE--D 268

Query: 390 EIDNDEDKQDESEEDGEE 407
           E + +  ++ E+E+ GEE
Sbjct: 269 EYEEEAGEEQENEDKGEE 286


>gnl|CDD|217829 pfam03985, Paf1, Paf1.  Members of this family are components of
           the RNA polymerase II associated Paf1 complex. The Paf1
           complex functions during the elongation phase of
           transcription in conjunction with Spt4-Spt5 and
           Spt16-Pob3i.
          Length = 431

 Score = 37.0 bits (86), Expect = 0.015
 Identities = 16/64 (25%), Positives = 29/64 (45%)

Query: 381 DESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGGR 440
           D  D E VDE +++E++Q   E + EE  + E   ++   +   +   +   D ES   +
Sbjct: 363 DPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADK 422

Query: 441 RRGS 444
              S
Sbjct: 423 ESAS 426



 Score = 37.0 bits (86), Expect = 0.018
 Identities = 17/85 (20%), Positives = 39/85 (45%), Gaps = 4/85 (4%)

Query: 355 EHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEAS 414
           E    ++             +    +DE + E     +++E++ ++SEE+G +  E+ +S
Sbjct: 351 ESKMRDKRRARLDPIDFEEVD----EDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSS 406

Query: 415 ENEYYTETDEDEKENEEDDGESDGG 439
           E+     +D + K ++E   +SD  
Sbjct: 407 ESSSDVGSDSESKADKESASDSDSE 431



 Score = 28.9 bits (65), Expect = 5.7
 Identities = 19/112 (16%), Positives = 36/112 (32%), Gaps = 38/112 (33%)

Query: 324 VRSDFSLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDES 383
            R+     +  + +E E  EE+  SDE                                 
Sbjct: 358 RRARLDPIDFEEVDEDEDEEEEQRSDE--------------------------------- 384

Query: 384 DIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGE 435
                 E +  ED ++E  +  E+   E +S+    +E+  D++   + D E
Sbjct: 385 -----HEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESASDSDSE 431


>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
           [Transcription / DNA replication, recombination, and
           repair / Chromatin structure and dynamics].
          Length = 1001

 Score = 37.3 bits (86), Expect = 0.016
 Identities = 20/66 (30%), Positives = 38/66 (57%), Gaps = 1/66 (1%)

Query: 376 TSDYDDESDIEIVDEIDNDEDKQDESEEDGE-EEGEEEASENEYYTETDEDEKENEEDDG 434
           + D  DES+ E+ +   + +D+ DE++ED E +E  E+ SE+E   ++ ++E   + D+ 
Sbjct: 927 SDDESDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEEDGEDWDEL 986

Query: 435 ESDGGR 440
           ES    
Sbjct: 987 ESKAAY 992



 Score = 34.6 bits (79), Expect = 0.13
 Identities = 24/73 (32%), Positives = 39/73 (53%), Gaps = 4/73 (5%)

Query: 370  SKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKEN 429
             ++ +  S+Y+  SD E  DE D DE+  + SE+  E+E E ++S+ E   + DE E + 
Sbjct: 932  DESEEEVSEYEASSDDE-SDETDEDEESDESSEDLSEDESENDSSDEEDGEDWDELESKA 990

Query: 430  EEDDGESDGGRRR 442
              D      G+RR
Sbjct: 991  AYDS---RPGKRR 1000



 Score = 33.8 bits (77), Expect = 0.22
 Identities = 22/76 (28%), Positives = 37/76 (48%), Gaps = 6/76 (7%)

Query: 346 GDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDG 405
           G  DE  + E     +   E  A S    + +D D+ESD E  +++  DE + D S+E+ 
Sbjct: 926 GSDDESDESE-----EEVSEYEASSDDESDETDEDEESD-ESSEDLSEDESENDSSDEED 979

Query: 406 EEEGEEEASENEYYTE 421
            E+ +E  S+  Y + 
Sbjct: 980 GEDWDELESKAAYDSR 995


>gnl|CDD|197651 smart00320, WD40, WD40 repeats.  Note that these repeats are
           permuted with respect to the structural repeats (blades)
           of the beta propeller domain.
          Length = 40

 Score = 32.7 bits (75), Expect = 0.019
 Identities = 7/27 (25%), Positives = 15/27 (55%)

Query: 189 WVNGISFLPNSDKVAVCSKQGYVRVYD 215
            V  ++F P+   +A  S  G ++++D
Sbjct: 14  PVTSVAFSPDGKYLASGSDDGTIKLWD 40



 Score = 30.7 bits (70), Expect = 0.11
 Identities = 10/40 (25%), Positives = 16/40 (40%)

Query: 264 KGKPCYKYKGFAGCVKDIACPLNENLILSVSLDRHFRVHD 303
            G+     KG  G V  +A   +   + S S D   ++ D
Sbjct: 1   SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40



 Score = 28.0 bits (63), Expect = 1.0
 Identities = 6/24 (25%), Positives = 12/24 (50%)

Query: 132 CMKHNKHSNIIATGGKENDLKLWD 155
            +  +     +A+G  +  +KLWD
Sbjct: 17  SVAFSPDGKYLASGSDDGTIKLWD 40


>gnl|CDD|227701 COG5414, COG5414, TATA-binding protein-associated factor
           [Transcription].
          Length = 392

 Score = 37.0 bits (85), Expect = 0.019
 Identities = 25/114 (21%), Positives = 50/114 (43%), Gaps = 1/114 (0%)

Query: 329 SLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIV 388
           S+    K E++   +E+  S E +  E  E          +    +   +   E D+++ 
Sbjct: 221 SVSVVLKDEKELARQERVSSWENFKEEPGEPLSRPALKKEKQGAEEEGEEGMSEEDLDVG 280

Query: 389 D-EIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGGRR 441
             EI+N E  + + E+  EE    EA + E  ++  ++  E +E+D E++   R
Sbjct: 281 AAEIENKEVSEGDKEQQQEEVENAEAHKEEVQSDRPDEIGEEKEEDDENEENER 334


>gnl|CDD|201208 pfam00400, WD40, WD domain, G-beta repeat. 
          Length = 39

 Score = 32.7 bits (75), Expect = 0.022
 Identities = 10/27 (37%), Positives = 16/27 (59%)

Query: 189 WVNGISFLPNSDKVAVCSKQGYVRVYD 215
            V  ++F P+ + +A  S  G VRV+D
Sbjct: 13  PVTSVAFSPDGNLLASGSDDGTVRVWD 39



 Score = 28.9 bits (65), Expect = 0.48
 Identities = 5/24 (20%), Positives = 14/24 (58%)

Query: 132 CMKHNKHSNIIATGGKENDLKLWD 155
            +  +   N++A+G  +  +++WD
Sbjct: 16  SVAFSPDGNLLASGSDDGTVRVWD 39



 Score = 28.5 bits (64), Expect = 0.72
 Identities = 15/39 (38%), Positives = 18/39 (46%)

Query: 265 GKPCYKYKGFAGCVKDIACPLNENLILSVSLDRHFRVHD 303
           GK     KG  G V  +A   + NL+ S S D   RV D
Sbjct: 1   GKLLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39


>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2.  PPP4R2 (protein phosphatase 4 core
           regulatory subunit R2) is the regulatory subunit of the
           histone H2A phosphatase complex. It has been shown to
           confer resistance to the anticancer drug cisplatin in
           yeast, and may confer resistance in higher eukaryotes.
          Length = 285

 Score = 36.3 bits (84), Expect = 0.022
 Identities = 24/101 (23%), Positives = 43/101 (42%), Gaps = 10/101 (9%)

Query: 346 GDSDEKYDPEHFEM-NQVTRENYAESKTNKNTS---DYDDESDIEIVDEIDNDED----- 396
               E  D     + N  +     +S  +KN S    Y+ ES      + D  +      
Sbjct: 185 PGEPEPEDDPKDSLGNGSSTNGLPDSSQDKNKSLEEYYEKESSDAAASQDDGPKGSDVKN 244

Query: 397 KQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESD 437
           K+ + EED +++G+    E E   + +E+E E EE++ + D
Sbjct: 245 KKSDDEEDDDQDGDY-VEEKELKEDEEEEETEEEEEEEDED 284



 Score = 34.0 bits (78), Expect = 0.13
 Identities = 26/97 (26%), Positives = 41/97 (42%), Gaps = 13/97 (13%)

Query: 321 SILVRSDFSLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYD 380
           S       S +++ K  E+ + +E  D+    D               +    KN    D
Sbjct: 202 SSTNGLPDSSQDKNKSLEEYYEKESSDAAASQDDGP------------KGSDVKNKKSDD 249

Query: 381 DESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENE 417
           +E D +  D ++  E K+DE EE+  EE EEE  E+E
Sbjct: 250 EEDDDQDGDYVEEKELKEDE-EEEETEEEEEEEDEDE 285


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
           the C subunit of DNA polymerase delta. It carries the
           essential residues for binding to the Pol1 subunit of
           polymerase alpha, from residues 293-332, which are
           characterized by the motif D--G--VT, referred to as the
           DPIM motif. The first 160 residues of the protein form
           the minimal domain for binding to the B subunit, Cdc1,
           of polymerase delta, the final 10 C-terminal residues,
           362-372, being the DNA sliding clamp, PCNA, binding
           motif.
          Length = 427

 Score = 36.7 bits (85), Expect = 0.022
 Identities = 23/113 (20%), Positives = 51/113 (45%), Gaps = 4/113 (3%)

Query: 331 ENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDE 390
           E E+ K +    +E  +     + E  +  + + E  ++S+      + +    ++ + E
Sbjct: 245 EEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGE-RSDSEEETEEKEKEKRKRLKKMME 303

Query: 391 IDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGGRRRG 443
            ++++++ +   E   EE E E  E     + +E+++E        DGGRRRG
Sbjct: 304 DEDEDEEMEIVPESPVEEEESEEPEPPPLPKKEEEKEEVTV---SPDGGRRRG 353


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 728

 Score = 37.0 bits (86), Expect = 0.023
 Identities = 20/110 (18%), Positives = 44/110 (40%), Gaps = 11/110 (10%)

Query: 336 KEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEI---- 391
           K   E  E++    E+ D +  E +      +   K      + +D+ +   +  +    
Sbjct: 321 KSVSEEDEDEDSDSEEEDEDDDEDDDDGENPWMLRKKLGKLKEGEDDEENSGLLSMKFMQ 380

Query: 392 DNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGGRR 441
             +  K++E++ + EE   E   E       +E ++E  E+  + + GRR
Sbjct: 381 RAEARKKEENDAEIEELRRELEGE-------EESDEEENEEPSKKNVGRR 423



 Score = 32.3 bits (74), Expect = 0.51
 Identities = 33/140 (23%), Positives = 59/140 (42%), Gaps = 20/140 (14%)

Query: 303 DFHSKKLLRKDYMQSRLTSILVRSDFSLENETKKEEQE-----HSEEKGDSDEKYDPEHF 357
           D  +  LL   +MQ        R++   + E   E +E       EE+ D +E  +P   
Sbjct: 367 DEENSGLLSMKFMQ--------RAEARKKEENDAEIEELRRELEGEEESDEEENEEPSK- 417

Query: 358 EMNQVTRENYAESKTNKNTSD--YDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASE 415
               V R  +      K         E+  E  ++ ++DE+++ E EE+ +   E+ A++
Sbjct: 418 --KNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKV--EKVANK 473

Query: 416 NEYYTETDEDEKENEEDDGE 435
               +E  + E+E EE D E
Sbjct: 474 LLKRSEKAQKEEEEEELDEE 493



 Score = 30.4 bits (69), Expect = 2.5
 Identities = 19/80 (23%), Positives = 26/80 (32%), Gaps = 4/80 (5%)

Query: 331 ENETKKEEQEHSEEKGDSDEKYDPE---HFEMNQVTRENYAESKTNKNTSDYDDESDIEI 387
           EN   K      +     D K           N++++      K  K      D  D  I
Sbjct: 493 ENPWLKTTSSVGKSAKKQDSKKKSSSKLDKAANKISKAAVKVKKKKKKEKS-IDLDDDLI 551

Query: 388 VDEIDNDEDKQDESEEDGEE 407
            +E     D  DE +ED EE
Sbjct: 552 DEEDSIKLDVDDEEDEDDEE 571


>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM).  This
           family consists of several Plasmodium falciparum SPAM
           (secreted polymorphic antigen associated with
           merozoites) proteins. Variation among SPAM alleles is
           the result of deletions and amino acid substitutions in
           non-repetitive sequences within and flanking the alanine
           heptad-repeat domain. Heptad repeats in which the a and
           d position contain hydrophobic residues generate
           amphipathic alpha-helices which give rise to helical
           bundles or coiled-coil structures in proteins. SPAM is
           an example of a P. falciparum antigen in which a
           repetitive sequence has features characteristic of a
           well-defined structural element.
          Length = 164

 Score = 35.2 bits (81), Expect = 0.027
 Identities = 20/95 (21%), Positives = 44/95 (46%), Gaps = 4/95 (4%)

Query: 343 EEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESE 402
           E  G + EK   E  +   +       S   ++    +++    + DE   D+++++E +
Sbjct: 3   EFGGGAPEKKPEEEKKDENLLEHVKITSWDKEDIIKENED----VKDEKQEDDEEEEEED 58

Query: 403 EDGEEEGEEEASENEYYTETDEDEKENEEDDGESD 437
           E+  EE E+   E E   + +E+E++ E++    D
Sbjct: 59  EEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDLKD 93



 Score = 33.3 bits (76), Expect = 0.11
 Identities = 21/89 (23%), Positives = 42/89 (47%), Gaps = 3/89 (3%)

Query: 347 DSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGE 406
           +  ++   EH ++    +E+    K N++  D   E D E  +E D +E ++ E  ED E
Sbjct: 16  EKKDENLLEHVKITSWDKEDI--IKENEDVKDEKQEDD-EEEEEEDEEEIEEPEDIEDEE 72

Query: 407 EEGEEEASENEYYTETDEDEKENEEDDGE 435
           E  E+E  E E   +  + +   +++  +
Sbjct: 73  EIVEDEEEEEEDEEDNVDLKDIEKKNIND 101


>gnl|CDD|217861 pfam04050, Upf2, Up-frameshift suppressor 2.  Transcripts
           harbouring premature signals for translation termination
           are recognised and rapidly degraded by eukaryotic cells
           through a pathway known as nonsense-mediated mRNA decay.
           In Saccharomyces cerevisiae, three trans-acting factors
           (Upf1 to Upf3) are required for nonsense-mediated mRNA
           decay.
          Length = 171

 Score = 35.4 bits (82), Expect = 0.027
 Identities = 16/61 (26%), Positives = 32/61 (52%)

Query: 377 SDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGES 436
           SD  +E +    ++ D++   ++E +   +E+ EE  SE E    T ++E+ + E + E 
Sbjct: 7   SDDGEEDEELPEEDEDDESSDEEEVDLPDDEQDEESDSEEEQIFVTRQEEEVDPEAEAEF 66

Query: 437 D 437
           D
Sbjct: 67  D 67



 Score = 33.9 bits (78), Expect = 0.077
 Identities = 10/47 (21%), Positives = 25/47 (53%)

Query: 389 DEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGE 435
            E ++D+ ++DE   + +E+ E    E     + ++DE+ + E++  
Sbjct: 3   SESESDDGEEDEELPEEDEDDESSDEEEVDLPDDEQDEESDSEEEQI 49



 Score = 31.2 bits (71), Expect = 0.53
 Identities = 15/47 (31%), Positives = 24/47 (51%), Gaps = 3/47 (6%)

Query: 392 DNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDG 438
              E + D+ EED E   E+E  E+    + +E +  ++E D ESD 
Sbjct: 1   SGSESESDDGEEDEELPEEDEDDESS---DEEEVDLPDDEQDEESDS 44



 Score = 30.0 bits (68), Expect = 1.4
 Identities = 11/57 (19%), Positives = 30/57 (52%)

Query: 381 DESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESD 437
             S+ E  D  +++E  +++ +++  +E E +  ++E   E+D +E++      E +
Sbjct: 1   SGSESESDDGEEDEELPEEDEDDESSDEEEVDLPDDEQDEESDSEEEQIFVTRQEEE 57


>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
           This family represents Cwf15/Cwc15 (from
           Schizosaccharomyces pombe and Saccharomyces cerevisiae
           respectively) and their homologues. The function of
           these proteins is unknown, but they form part of the
           spliceosome and are thus thought to be involved in mRNA
           splicing.
          Length = 241

 Score = 35.9 bits (83), Expect = 0.027
 Identities = 17/80 (21%), Positives = 42/80 (52%)

Query: 358 EMNQVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENE 417
           E+ +  R + ++ +      D D  ++++  +E D D+D++DE +    EE    +  ++
Sbjct: 62  ELEEAERAHKSKKENKLAIEDADKSTNLDASNEGDEDDDEEDEIKRKRIEEDARNSDADD 121

Query: 418 YYTETDEDEKENEEDDGESD 437
             + +D D  +++ DD +S+
Sbjct: 122 SDSSSDSDSSDDDSDDDDSE 141



 Score = 32.8 bits (75), Expect = 0.29
 Identities = 24/104 (23%), Positives = 42/104 (40%), Gaps = 4/104 (3%)

Query: 329 SLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIV 388
           ++E+  K    + S E  + D++ D    +  +    N     ++   S   D SD +  
Sbjct: 79  AIEDADKSTNLDASNEGDEDDDEEDEIKRKRIEEDARNSDADDSDS--SSDSDSSDDDSD 136

Query: 389 DEIDNDEDKQ--DESEEDGEEEGEEEASENEYYTETDEDEKENE 430
           D+   DE      E E+  +E  EE+  E E     +E  +E E
Sbjct: 137 DDDSEDETAALLRELEKIKKERAEEKEREEEEKAAEEEKAREEE 180


>gnl|CDD|206365 pfam14197, Cep57_CLD_2, Centrosome localisation domain of PPC89.
           The N-terminal region of the fission yeast spindle pole
           body protein PPC89 has low similarity to the human Cep57
           protein. The CLD or centrosome localisation domain of
           Cep57 and PPC89 is found at the N-terminus. This region
           localises to the centrosome internally to gamma-tubulin,
           suggesting that it is either on both centrioles or on a
           centromatrix component. This N-terminal region can also
           multimerise with the N-terminus of other Cep57
           molecules. The C-terminal part, Family Cep57_MT_bd,
           pfam06657, is the microtubule-binding region of Cep57
           and PPC89.
          Length = 69

 Score = 33.1 bits (76), Expect = 0.028
 Identities = 13/54 (24%), Positives = 21/54 (38%), Gaps = 2/54 (3%)

Query: 294 SLDRHFRVHDFHSKKLLRKDYMQSRLTSILVRS--DFSLENETKKEEQEHSEEK 345
           S  R   VH+   K+L R+     R   +      +   ENE  ++E E    +
Sbjct: 16  SATRKVAVHEIELKRLNRERDSALRQLGVAYLECQELKAENEALRKELEELRAQ 69


>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi.  This family includes
           the fifth essential DNA polymerase in yeast EC:2.7.7.7.
           Pol5p is localised exclusively to the nucleolus and
           binds near or at the enhancer region of rRNA-encoding
           DNA repeating units.
          Length = 784

 Score = 36.4 bits (84), Expect = 0.030
 Identities = 19/67 (28%), Positives = 34/67 (50%), Gaps = 6/67 (8%)

Query: 371 KTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENE 430
           K ++N S +    + E   E D D+ ++ + +ED  E  E+  SE+E      ED +E+E
Sbjct: 633 KADENKSRHQQLFEGE---EEDEDDLEETDDDEDECEAIEDSESESE---SDGEDGEEDE 686

Query: 431 EDDGESD 437
           ++D    
Sbjct: 687 QEDDAEA 693



 Score = 34.8 bits (80), Expect = 0.086
 Identities = 15/56 (26%), Positives = 31/56 (55%), Gaps = 3/56 (5%)

Query: 369 ESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDE 424
           E + +   +D DDE + E ++  D++ + + + E+  E+E E++A  NE     D+
Sbjct: 650 EDEDDLEETD-DDEDECEAIE--DSESESESDGEDGEEDEQEDDAEANEGVVPIDK 702



 Score = 34.1 bits (78), Expect = 0.16
 Identities = 21/55 (38%), Positives = 29/55 (52%), Gaps = 6/55 (10%)

Query: 387 IVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEK---ENEEDDGESDG 438
           I+D +  DE+K          EGEEE  ++   T+ DEDE    E+ E + ESDG
Sbjct: 628 ILDVLKADENKSRHQ---QLFEGEEEDEDDLEETDDDEDECEAIEDSESESESDG 679



 Score = 31.4 bits (71), Expect = 1.1
 Identities = 20/84 (23%), Positives = 36/84 (42%), Gaps = 11/84 (13%)

Query: 362 VTRENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQ-DESEEDGEEEGEEEASENEYYT 420
           V + +  +S+  +     +++ D     + D DE +  ++SE + E +GE+         
Sbjct: 631 VLKADENKSRHQQLFEGEEEDEDDLEETDDDEDECEAIEDSESESESDGEDG-------- 682

Query: 421 ETDEDEKENE--EDDGESDGGRRR 442
           E DE E + E  E     D   RR
Sbjct: 683 EEDEQEDDAEANEGVVPIDKAVRR 706



 Score = 29.5 bits (66), Expect = 5.1
 Identities = 18/108 (16%), Positives = 37/108 (34%), Gaps = 7/108 (6%)

Query: 327 DFSLENETKKEEQEHSEEKGDSDEKYD--PEHFEMNQVTRENYAESKTNKNTSDYDDESD 384
           D     E  + E E   E G+ DE+ D    +  +  + +         K  +  D    
Sbjct: 663 DECEAIEDSESESESDGEDGEEDEQEDDAEANEGVVPIDKA--VRRALPKVLNLPDALDG 720

Query: 385 IEIVDEIDNDEDKQDESEEDGEE---EGEEEASENEYYTETDEDEKEN 429
            +  DE   D+++    +    +   E +E         +  + +K+N
Sbjct: 721 GDSEDEEGMDDEQMMRLDTYLAQIFKEKKERIQAGGETKKEAQSQKQN 768


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
           Rpc31.  RNA polymerase III contains seventeen subunits
           in yeasts and in human cells. Twelve of these are akin
           to RNA polymerase I or II and the other five are RNA pol
           III-specific, and form the functionally distinct groups
           (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
           Rpc34 and Rpc82 form a cluster of enzyme-specific
           subunits that contribute to transcription initiation in
           S.cerevisiae and H.sapiens. There is evidence that these
           subunits are anchored at or near the N-terminal Zn-fold
           of Rpc1, itself prolonged by a highly conserved but RNA
           polymerase III-specific domain.
          Length = 221

 Score = 35.5 bits (82), Expect = 0.035
 Identities = 15/67 (22%), Positives = 33/67 (49%)

Query: 371 KTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENE 430
              K       E + +I +++   E K  E E +  +E +E+  E E   E ++++ +++
Sbjct: 133 SKFKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDD 192

Query: 431 EDDGESD 437
           +DD + D
Sbjct: 193 DDDDDDD 199



 Score = 32.4 bits (74), Expect = 0.30
 Identities = 18/55 (32%), Positives = 30/55 (54%), Gaps = 8/55 (14%)

Query: 389 DEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEK----ENEEDDGESDGG 439
           +  D DE+ + + EE+ EEE E+E    ++  + D+D+     EN  D+GE D  
Sbjct: 164 EAEDVDEEDEKDEEEEEEEEEEDE----DFDDDDDDDDDDYNAENYFDNGEDDDY 214



 Score = 30.9 bits (70), Expect = 1.0
 Identities = 14/57 (24%), Positives = 32/57 (56%), Gaps = 3/57 (5%)

Query: 384 DIEIVDEIDNDEDKQDESEEDGEEEGEEEAS-ENEYYTETDEDEKENEEDDGESDGG 439
             ++ +E + DE++++E EE+ E+  +++   +++Y  E   D    E+DD + D  
Sbjct: 165 AEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDDDDDYNAENYFD--NGEDDDYDDDEA 219


>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
           subunit G; Reviewed.
          Length = 197

 Score = 35.0 bits (80), Expect = 0.045
 Identities = 15/49 (30%), Positives = 32/49 (65%), Gaps = 1/49 (2%)

Query: 389 DEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESD 437
           ++ID  ED++++  E  E+  EE++ ++E+    ++DE E E DD +++
Sbjct: 149 EKIDEREDQEEQEREREEQTIEEQSDDSEHEI-IEQDESETESDDDKTE 196


>gnl|CDD|225880 COG3343, RpoE, DNA-directed RNA polymerase, delta subunit
           [Transcription].
          Length = 175

 Score = 34.7 bits (80), Expect = 0.046
 Identities = 24/86 (27%), Positives = 42/86 (48%), Gaps = 7/86 (8%)

Query: 359 MNQVTRENYA--ESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEE-----DGEEEGEE 411
           ++++  E  A  E K  K      D  +    DE+D DEDK++E ++     D E + E+
Sbjct: 84  LDEIDEEIQAMTEKKDIKAKDKEVDAFEEGDEDELDYDEDKEEEEDDEVDSLDDENDDED 143

Query: 412 EASENEYYTETDEDEKENEEDDGESD 437
           E  +       ++DE + +EDD E +
Sbjct: 144 EDDDEIVEILIEDDEVDEDEDDDEDE 169



 Score = 32.4 bits (74), Expect = 0.22
 Identities = 27/89 (30%), Positives = 42/89 (47%), Gaps = 10/89 (11%)

Query: 333 ETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTS---------DYDDES 383
           E  +E Q  +E+K    +  + + FE       +Y E K  +            D +DE 
Sbjct: 86  EIDEEIQAMTEKKDIKAKDKEVDAFEEGDEDELDYDEDKEEEEDDEVDSLDDENDDEDED 145

Query: 384 DIEIVDEIDNDEDKQDESEEDGEEEGEEE 412
           D EIV+ +  D D+ DE E+D E+E +EE
Sbjct: 146 DDEIVEILIED-DEVDEDEDDDEDEEDEE 173


>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein.  The proteins in this family
           are designated YL1. These proteins have been shown to be
           DNA-binding and may be a transcription factor.
          Length = 238

 Score = 35.0 bits (81), Expect = 0.049
 Identities = 18/50 (36%), Positives = 25/50 (50%)

Query: 389 DEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDG 438
           DE        +E E+D E E EEE  E E  ++ D+ E +  E D E +G
Sbjct: 25  DEFFWTYLLFEEEEDDEEFEIEEEEEEEEVDSDFDDSEDDEPESDDEEEG 74



 Score = 31.2 bits (71), Expect = 0.91
 Identities = 15/38 (39%), Positives = 21/38 (55%)

Query: 380 DDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENE 417
            +E + E   + D D+ + DE E D EEEGE+E    E
Sbjct: 45  IEEEEEEEEVDSDFDDSEDDEPESDDEEEGEKELQREE 82



 Score = 28.9 bits (65), Expect = 5.4
 Identities = 17/65 (26%), Positives = 33/65 (50%)

Query: 378 DYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESD 437
           D    + +   +E D++E + +E EE+ E + + + SE++     DE+E E E    E  
Sbjct: 25  DEFFWTYLLFEEEEDDEEFEIEEEEEEEEVDSDFDDSEDDEPESDDEEEGEKELQREERL 84

Query: 438 GGRRR 442
             ++R
Sbjct: 85  KKKKR 89


>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein.  This family includes proteins
           related to Mpp10 (M phase phosphoprotein 10). The U3
           small nucleolar ribonucleoprotein (snoRNP) is required
           for three cleavage events that generate the mature 18S
           rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
           depletion of Mpp10, a U3 snoRNP-specific protein, halts
           18S rRNA production and impairs cleavage at the three U3
           snoRNP-dependent sites.
          Length = 613

 Score = 35.7 bits (82), Expect = 0.052
 Identities = 20/107 (18%), Positives = 49/107 (45%), Gaps = 1/107 (0%)

Query: 330 LENETKKEEQEHSEEKGDSDEKYDP-EHFEMNQVTRENYAESKTNKNTSDYDDESDIEIV 388
           LE    +EE    +E    D   D  E  + ++       E    +   +Y+D  D +  
Sbjct: 204 LEATEAEEEAALGDEDDFEDYFQDDSEDGKDDEDFGSGEDEEDDEEGNIEYEDFFDPKEK 263

Query: 389 DEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGE 435
           D+  +  D  +  +++ ++E  ++ ++++   E +ED+++ ++ D E
Sbjct: 264 DKKKDAGDDAELEDDEPDKEAVKKEADSKPEEEDEEDDEQEDDQDEE 310



 Score = 35.0 bits (80), Expect = 0.080
 Identities = 22/108 (20%), Positives = 43/108 (39%)

Query: 331 ENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDE 390
           E+     + + SE   D  +  D +  +      E   +        D +DE+D+    E
Sbjct: 86  EHIRVLAKLQDSESHEDGSDGSDMDSEDSADDEEEEEEDESLEDEMIDDEDEADLFNESE 145

Query: 391 IDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDG 438
              ++   DE+E+D E++ EEE +  E  +      ++  +  G  D 
Sbjct: 146 SSLEDLSDDETEDDEEKKMEEEEAGEEKESVEQATREKKFDKSGVDDK 193



 Score = 34.2 bits (78), Expect = 0.16
 Identities = 28/132 (21%), Positives = 55/132 (41%), Gaps = 14/132 (10%)

Query: 324 VRSDFSLENETKKEEQEHSEEKGDSDEKYDPEHF--------------EMNQVTRENYAE 369
           +  D + ++E KK E+E + E+ +S E+   E                EMN+      AE
Sbjct: 151 LSDDETEDDEEKKMEEEEAGEEKESVEQATREKKFDKSGVDDKFFKLDEMNEFLEATEAE 210

Query: 370 SKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKEN 429
            +      D  ++   +  ++  +DED     +E+ +EEG  E  +     E D+ +   
Sbjct: 211 EEAALGDEDDFEDYFQDDSEDGKDDEDFGSGEDEEDDEEGNIEYEDFFDPKEKDKKKDAG 270

Query: 430 EEDDGESDGGRR 441
           ++ + E D   +
Sbjct: 271 DDAELEDDEPDK 282



 Score = 33.0 bits (75), Expect = 0.29
 Identities = 20/112 (17%), Positives = 49/112 (43%)

Query: 327 DFSLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIE 386
           D   E+E+  E+    E + D ++K + E     + + E     K    +   D    ++
Sbjct: 139 DLFNESESSLEDLSDDETEDDEEKKMEEEEAGEEKESVEQATREKKFDKSGVDDKFFKLD 198

Query: 387 IVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDG 438
            ++E     + ++E+    E++ E+   ++    + DED    E+++ + +G
Sbjct: 199 EMNEFLEATEAEEEAALGDEDDFEDYFQDDSEDGKDDEDFGSGEDEEDDEEG 250



 Score = 32.7 bits (74), Expect = 0.50
 Identities = 14/60 (23%), Positives = 27/60 (45%)

Query: 380 DDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGG 439
           +D SD   +D  D+ +D+++E E++  E+   +  +        E   E+  DD   D  
Sbjct: 101 EDGSDGSDMDSEDSADDEEEEEEDESLEDEMIDDEDEADLFNESESSLEDLSDDETEDDE 160



 Score = 32.3 bits (73), Expect = 0.54
 Identities = 14/87 (16%), Positives = 38/87 (43%)

Query: 340 EHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQD 399
           +  ++K D+ +  + E  E ++   +  A+SK  +   + D++ D +  +E       + 
Sbjct: 261 KEKDKKKDAGDDAELEDDEPDKEAVKKEADSKPEEEDEEDDEQEDDQDEEEPPEAAMDKV 320

Query: 400 ESEEDGEEEGEEEASENEYYTETDEDE 426
           + +E   E  + E+ +     E  + +
Sbjct: 321 KLDEPVLEGVDLESPKELSSFEKRQAK 347



 Score = 31.1 bits (70), Expect = 1.5
 Identities = 19/107 (17%), Positives = 42/107 (39%), Gaps = 7/107 (6%)

Query: 335 KKEEQEHSEEKGDSDEKYDPEHFEMNQ---VTRENYAESKTNKNTSDYDDESDIEIVDEI 391
             E+ +  E+ G  +++ D E   +        +   + K   + ++ +D+   +   + 
Sbjct: 228 DSEDGKDDEDFGSGEDEEDDEEGNIEYEDFFDPKEKDKKKDAGDDAELEDDEPDKEAVKK 287

Query: 392 DNDEDKQDESEEDGEEEG----EEEASENEYYTETDEDEKENEEDDG 434
           + D   ++E EED E+E     EE         + DE   E  + + 
Sbjct: 288 EADSKPEEEDEEDDEQEDDQDEEEPPEAAMDKVKLDEPVLEGVDLES 334


>gnl|CDD|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region.  The domain
           is found in the primary vegetative sigma factor. The
           function of this domain is unclear and can be removed
           without loss of function.
          Length = 211

 Score = 34.5 bits (80), Expect = 0.065
 Identities = 12/49 (24%), Positives = 25/49 (51%)

Query: 378 DYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDE 426
           D +  +       I+++ D++D  ++D ++E E+E  E E     D +E
Sbjct: 31  DPNAAAAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEADLGPDPEE 79



 Score = 33.7 bits (78), Expect = 0.11
 Identities = 18/58 (31%), Positives = 29/58 (50%), Gaps = 7/58 (12%)

Query: 383 SDIEIVDEID-NDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGG 439
           SD+ I   ID N       +     E  EE+  +++     D+DE E+E+D+ E+D G
Sbjct: 23  SDL-ISGFIDPNAAAAAATAAAIESELDEEDLEDDD-----DDDEDEDEDDEEEADLG 74



 Score = 30.2 bits (69), Expect = 1.5
 Identities = 10/39 (25%), Positives = 19/39 (48%)

Query: 375 NTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEA 413
           N +     +     +  + D +  D+ +ED +E+ EEEA
Sbjct: 33  NAAAAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEA 71


>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
          Length = 619

 Score = 35.2 bits (82), Expect = 0.069
 Identities = 13/56 (23%), Positives = 25/56 (44%), Gaps = 3/56 (5%)

Query: 385 IEIVDEIDNDEDKQDE---SEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESD 437
           +E  D ++N E +  E      D   E +     +E     D++++E EED+ +  
Sbjct: 153 LEWYDRLENGERRLRELIDGFVDPNAEEDPAHVGSELEELDDDEDEEEEEDENDDS 208



 Score = 30.5 bits (70), Expect = 1.8
 Identities = 9/42 (21%), Positives = 17/42 (40%)

Query: 385 IEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDE 426
           I++V+E  + +D      E   +  E+   E      + E E
Sbjct: 57  IQVVEEAPDADDLLLAENEADAQTDEDAEEEAAAALSSVESE 98



 Score = 30.5 bits (70), Expect = 2.0
 Identities = 15/56 (26%), Positives = 26/56 (46%), Gaps = 7/56 (12%)

Query: 382 ESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESD 437
             +  + + ID   D    +EED    G E    ++     DEDE+E E+++ +S 
Sbjct: 161 NGERRLRELIDGFVD--PNAEEDPAHVGSELEELDD-----DEDEEEEEDENDDSL 209



 Score = 29.8 bits (68), Expect = 3.1
 Identities = 10/45 (22%), Positives = 18/45 (40%)

Query: 384 DIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKE 428
           D  +    + D        E+ +++ +EE  E+E       DE E
Sbjct: 171 DGFVDPNAEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADESE 215



 Score = 28.6 bits (65), Expect = 8.2
 Identities = 17/65 (26%), Positives = 26/65 (40%), Gaps = 4/65 (6%)

Query: 380 DDESDIEIVDEID---NDEDKQ-DESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGE 435
           +D  D E +++I    ND   Q  E   D ++    E   +    E  E+E        E
Sbjct: 37  EDIVDSEQIEDIIAMLNDMGIQVVEEAPDADDLLLAENEADAQTDEDAEEEAAAALSSVE 96

Query: 436 SDGGR 440
           S+ GR
Sbjct: 97  SEIGR 101


>gnl|CDD|233870 TIGR02442, Cob-chelat-sub, cobaltochelatase subunit.
           Cobaltochelatase is responsible for the insertion of
           cobalt into the corrin ring of coenzyme B12 during its
           biosynthesis. Two versions have been well described.
           CbiK/CbiX is a monomeric, anaerobic version which acts
           early in the biosynthesis (pfam06180). CobNST is a
           trimeric, ATP-dependent, aerobic version which acts late
           in the biosynthesis (TIGR02257/TIGR01650/TIGR01651). A
           number of genomes (actinobacteria, cyanobacteria,
           betaproteobacteria and pseudomonads) which apparently
           biosynthesize B12, encode a cobN gene but are
           demonstrably lacking cobS and cobT. These genomes do,
           however contain a homolog (modelled here) of the
           magnesium chelatase subunits BchI/BchD family. Aside
           from the cyanobacteria (which have a separate magnesium
           chelatase trimer), these species do not make chlorins,
           so do not have any use for a magnesium chelatase.
           Furthermore, in nearly all cases the members of this
           family are proximal to either CobN itself or other genes
           involved in cobalt transport or B12 biosynthesis.
          Length = 633

 Score = 35.0 bits (81), Expect = 0.077
 Identities = 16/71 (22%), Positives = 26/71 (36%), Gaps = 3/71 (4%)

Query: 373 NKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEED 432
            +     + + + +   E   D +K DE E+D E+ G           E + D    EE 
Sbjct: 315 EQPQGKDEKDLEEKPE-EPGPDPEKPDEGEDDAEQSGPRGHPTPGNDDEKEPD--PQEEA 371

Query: 433 DGESDGGRRRG 443
           DG+       G
Sbjct: 372 DGQGSSTDPAG 382



 Score = 33.5 bits (77), Expect = 0.23
 Identities = 7/54 (12%), Positives = 20/54 (37%)

Query: 391 IDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGGRRRGS 444
             +++D +++ EE G +  + +  E++              DD +    +    
Sbjct: 319 GKDEKDLEEKPEEPGPDPEKPDEGEDDAEQSGPRGHPTPGNDDEKEPDPQEEAD 372


>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
           Transcription initiation factor IIA (TFIIA) is a
           heterotrimer, the three subunits being known as alpha,
           beta, and gamma, in order of molecular weight. The N and
           C-terminal domains of the gamma subunit are represented
           in pfam02268 and pfam02751, respectively. This family
           represents the precursor that yields both the alpha and
           beta subunits. The TFIIA heterotrimer is an essential
           general transcription initiation factor for the
           expression of genes transcribed by RNA polymerase II.
           Together with TFIID, TFIIA binds to the promoter region;
           this is the first step in the formation of a
           pre-initiation complex (PIC). Binding of the rest of the
           transcription machinery follows this step. After
           initiation, the PIC does not completely dissociate from
           the promoter. Some components, including TFIIA, remain
           attached and re-initiate a subsequent round of
           transcription.
          Length = 332

 Score = 34.7 bits (80), Expect = 0.081
 Identities = 14/36 (38%), Positives = 22/36 (61%), Gaps = 4/36 (11%)

Query: 377 SDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEE 412
            D DDE  IE   ++D+ +D  D S+EDGE+  + +
Sbjct: 257 DDDDDEDAIE--SDLDDSDD--DVSDEDGEDLFDTD 288



 Score = 29.7 bits (67), Expect = 3.4
 Identities = 14/64 (21%), Positives = 30/64 (46%), Gaps = 5/64 (7%)

Query: 374 KNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDD 433
           K  +       I  +D ID+D++     ++D E+  E +  +++     D+   E+ ED 
Sbjct: 230 KKQAKSSKRRTIAQIDGIDSDDEGDGSDDDDDEDAIESDLDDSD-----DDVSDEDGEDL 284

Query: 434 GESD 437
            ++D
Sbjct: 285 FDTD 288


>gnl|CDD|182306 PRK10207, PRK10207, dipeptide/tripeptide permease B; Provisional.
          Length = 489

 Score = 34.8 bits (80), Expect = 0.083
 Identities = 13/33 (39%), Positives = 18/33 (54%)

Query: 158 TTKNIFSAKNVSHRVLVYSVKPDEFDLRRPVWV 190
           T+ N F+  NV H +L +S+ P  F    P WV
Sbjct: 290 TSLNFFAINNVHHEILGFSINPVSFQALNPFWV 322


>gnl|CDD|227880 COG5593, COG5593, Nucleic-acid-binding protein possibly involved in
           ribosomal biogenesis [Translation, ribosomal structure
           and biogenesis].
          Length = 821

 Score = 34.6 bits (79), Expect = 0.11
 Identities = 28/111 (25%), Positives = 50/111 (45%), Gaps = 14/111 (12%)

Query: 321 SILVRSDFSLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYD 380
           S LV+S   +E+++   E +   E   SD   D E             E   ++ + + D
Sbjct: 692 SALVKSRPDVEDDSDDSELD-FAEDDFSDSTSDDE-----PKLDAIDDEDAKSEGSQESD 745

Query: 381 DESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEE 431
            E   E +DEI    D + ++ +   E  EE+ S     +E +++E+EN+E
Sbjct: 746 QE---EGLDEIFYSFDGEQDNSDSFAESSEEDES-----SEEEKEEEENKE 788



 Score = 33.1 bits (75), Expect = 0.37
 Identities = 28/111 (25%), Positives = 54/111 (48%), Gaps = 13/111 (11%)

Query: 325 RSDFSLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESD 384
           ++ F  ++E  + E   +  K   D + D +  E++      +AE   + +TSD     D
Sbjct: 676 KASFDSDDEMDENEIWSALVKSRPDVEDDSDDSELD------FAEDDFSDSTSD-----D 724

Query: 385 IEIVDEIDNDEDKQDESEEDGEEEGEEEA--SENEYYTETDEDEKENEEDD 433
              +D ID+++ K + S+E  +EEG +E   S +     +D   + +EED+
Sbjct: 725 EPKLDAIDDEDAKSEGSQESDQEEGLDEIFYSFDGEQDNSDSFAESSEEDE 775



 Score = 30.0 bits (67), Expect = 3.2
 Identities = 22/93 (23%), Positives = 39/93 (41%), Gaps = 12/93 (12%)

Query: 350 EKYDPEHFEMNQVTRENYAESK-TNKNTSDYDDESDIEIVDEIDNDED-----KQDESEE 403
           E   PE     Q   +  A+ K    N + +D +      DE+D +E      K     E
Sbjct: 649 EDIKPEDKFFYQYFTKKTADGKGKKSNKASFDSD------DEMDENEIWSALVKSRPDVE 702

Query: 404 DGEEEGEEEASENEYYTETDEDEKENEEDDGES 436
           D  ++ E + +E+++   T +DE + +  D E 
Sbjct: 703 DDSDDSELDFAEDDFSDSTSDDEPKLDAIDDED 735


>gnl|CDD|115196 pfam06524, NOA36, NOA36 protein.  This family consists of several
           NOA36 proteins which contain 29 highly conserved
           cysteine residues. The function of this protein is
           unknown.
          Length = 314

 Score = 34.2 bits (78), Expect = 0.11
 Identities = 19/113 (16%), Positives = 44/113 (38%), Gaps = 18/113 (15%)

Query: 322 ILVRSDFSLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDD 381
                D S+   + K  ++    + D D+ Y                  K N +++ Y D
Sbjct: 219 TTETKDLSMSTRSYKFGRQTRGFRSDDDQGYG---------------RYKQNLSSNKYGD 263

Query: 382 ESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDG 434
              +    E D+DE       ++ E+  +++  +N+   +T+ +  + + D+G
Sbjct: 264 FKGVNYGYESDDDEGSSSNDYDEEEDGDDDDNEDND---DTNTNHGDEDCDNG 313


>gnl|CDD|235033 PRK02363, PRK02363, DNA-directed RNA polymerase subunit delta;
           Reviewed.
          Length = 129

 Score = 32.7 bits (75), Expect = 0.12
 Identities = 16/59 (27%), Positives = 28/59 (47%)

Query: 352 YDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGE 410
           Y  +  +   +  E   + K  K     DD  D +I+ + D DE+  DE +++ EE+ E
Sbjct: 71  YPVDEIDEEIIPLEEKFDKKKKKFMDGDDDIIDDDILPDDDFDEEDLDEEDDEDEEDEE 129



 Score = 32.7 bits (75), Expect = 0.13
 Identities = 12/55 (21%), Positives = 32/55 (58%)

Query: 381 DESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGE 435
           DE D EI+   +  + K+ +  +  ++  +++   ++ + E D DE+++E+++ E
Sbjct: 74  DEIDEEIIPLEEKFDKKKKKFMDGDDDIIDDDILPDDDFDEEDLDEEDDEDEEDE 128



 Score = 31.5 bits (72), Expect = 0.29
 Identities = 13/62 (20%), Positives = 28/62 (45%)

Query: 364 RENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETD 423
           R  Y   + ++     +++ D +    +D D+D  D+     ++  EE+  E +   E D
Sbjct: 68  RSWYPVDEIDEEIIPLEEKFDKKKKKFMDGDDDIIDDDILPDDDFDEEDLDEEDDEDEED 127

Query: 424 ED 425
           E+
Sbjct: 128 EE 129


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 34.4 bits (79), Expect = 0.13
 Identities = 19/81 (23%), Positives = 36/81 (44%), Gaps = 4/81 (4%)

Query: 367 YAESKTNKNTSDYDDESDI-EIVDEIDN--DEDKQDESEEDGEEEGEEEASENEYYT-ET 422
           Y E K+N + S    +  + E   ++ +   +  + + E D  EE E +A E +    E 
Sbjct: 353 YEEVKSNTDLSGDKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEEN 412

Query: 423 DEDEKENEEDDGESDGGRRRG 443
           ++ +KE  ++D E      R 
Sbjct: 413 EKKQKEQADEDKEKRQKDERK 433



 Score = 31.3 bits (71), Expect = 1.0
 Identities = 11/69 (15%), Positives = 27/69 (39%)

Query: 367 YAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDE 426
            A +K  +      D S  +  + +     K  +  +   E  +E  +  E   +  E++
Sbjct: 347 LALTKLYEEVKSNTDLSGDKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEK 406

Query: 427 KENEEDDGE 435
            + EE++ +
Sbjct: 407 LKQEENEKK 415


>gnl|CDD|181969 PRK09584, tppB, putative tripeptide transporter permease; Reviewed.
          Length = 500

 Score = 34.3 bits (79), Expect = 0.13
 Identities = 11/33 (33%), Positives = 21/33 (63%)

Query: 158 TTKNIFSAKNVSHRVLVYSVKPDEFDLRRPVWV 190
           T+ N F+ +NV H +L  +V+P+++    P W+
Sbjct: 297 TSLNFFAIRNVEHSILGIAVEPEQYQALNPFWI 329


>gnl|CDD|233189 TIGR00924, yjdL_sub1_fam, amino acid/peptide transporter
           (Peptide:H+ symporter), bacterial.  The model describes
           proton-dependent oligopeptide transporters in bacteria.
           This model is restricted in its range in recognizing
           bacterial proton-dependent oligopeptide transporters,
           although they are found in yeast, plants and animals.
           They function by proton symport in a 1:1 stoichiometry,
           which is variable in different species. All of them are
           predicted to contain 12 transmembrane domains, for which
           limited experimental evidence exists [Transport and
           binding proteins, Amino acids, peptides and amines].
          Length = 475

 Score = 34.0 bits (78), Expect = 0.15
 Identities = 13/39 (33%), Positives = 19/39 (48%), Gaps = 1/39 (2%)

Query: 153 LWD-LNTTKNIFSAKNVSHRVLVYSVKPDEFDLRRPVWV 190
           L+  + T+ N F+  N+ H +L  SV    F    P WV
Sbjct: 287 LYAQMPTSLNFFADNNMHHEMLGMSVPVIWFQSLNPFWV 325


>gnl|CDD|217783 pfam03896, TRAP_alpha, Translocon-associated protein (TRAP), alpha
           subunit.  The alpha-subunit of the TRAP complex (TRAP
           alpha) is a single-spanning membrane protein of the
           endoplasmic reticulum (ER) which is found in proximity
           of nascent polypeptide chains translocating across the
           membrane.
          Length = 281

 Score = 33.2 bits (76), Expect = 0.19
 Identities = 17/50 (34%), Positives = 29/50 (58%), Gaps = 1/50 (2%)

Query: 367 YAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASEN 416
            A+  T    ++ DD  D +  DE   +ED+ + +EE+ +EEGE +AS +
Sbjct: 30  SAQDLTEDEEAE-DDVVDEDEEDEAVVEEDENELTEEEEDEEGEVKASPD 78



 Score = 31.7 bits (72), Expect = 0.78
 Identities = 14/40 (35%), Positives = 23/40 (57%)

Query: 389 DEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKE 428
           D  +++E + D  +ED E+E   E  ENE   E +++E E
Sbjct: 33  DLTEDEEAEDDVVDEDEEDEAVVEEDENELTEEEEDEEGE 72


>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
           chain; Provisional.
          Length = 1033

 Score = 33.6 bits (77), Expect = 0.23
 Identities = 13/52 (25%), Positives = 26/52 (50%)

Query: 386 EIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESD 437
           E V+   N+E+ ++E E      G +   +     + DEDE+++EE +  + 
Sbjct: 3   EQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAK 54



 Score = 32.5 bits (74), Expect = 0.59
 Identities = 11/44 (25%), Positives = 24/44 (54%)

Query: 395 EDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDG 438
           E++ +    + E+E E EA      +++D+DE   E++D + + 
Sbjct: 2   EEQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEED 45



 Score = 31.7 bits (72), Expect = 0.97
 Identities = 16/54 (29%), Positives = 28/54 (51%)

Query: 389 DEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGGRRR 442
           +E  N +  ++E EE+ E       S+++      EDE E+EEDD E++   + 
Sbjct: 2   EEQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKA 55



 Score = 29.4 bits (66), Expect = 4.3
 Identities = 20/64 (31%), Positives = 34/64 (53%), Gaps = 4/64 (6%)

Query: 368 AESKTNKNTSDYDDESDIEIVDEI---DNDEDKQDESEEDGEEEGEEEASENEYYTETDE 424
            E + N   ++ +DE ++E V      D+D+D+    +ED +EE +EEA E+    E  +
Sbjct: 1   EEEQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEA-ESPAKAEISK 59

Query: 425 DEKE 428
            EK 
Sbjct: 60  REKA 63


>gnl|CDD|227458 COG5129, MAK16, Nuclear protein with HMG-like acidic region
           [General function prediction only].
          Length = 303

 Score = 33.1 bits (75), Expect = 0.26
 Identities = 18/87 (20%), Positives = 37/87 (42%)

Query: 327 DFSLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIE 386
              L N+   E ++  +EK    E+ +    E+  VT ++  E    K+   +       
Sbjct: 182 REHLWNKAATEREKRQDEKERYVEEEEESDTELEAVTDDSEKEKTKKKDLEKWLGSDQSM 241

Query: 387 IVDEIDNDEDKQDESEEDGEEEGEEEA 413
              E + +E  + ES+ED +E+ + + 
Sbjct: 242 ETSESEEEESSESESDEDEDEDNKGKI 268


>gnl|CDD|216403 pfam01271, Granin, Granin (chromogranin or secretogranin). 
          Length = 585

 Score = 33.4 bits (76), Expect = 0.27
 Identities = 26/125 (20%), Positives = 49/125 (39%), Gaps = 8/125 (6%)

Query: 322 ILVRSDFSLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDD 381
           I+  +    ENE     +E+       ++ +  +H E  +  +    + K  K    Y+D
Sbjct: 88  IITEALRQAENEPGGHPRENQPYNLQVEKNFPTDHSEDYETQQWEEEKLKHMKFPRRYED 147

Query: 382 ESD---IEIVDEIDN---DEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGE 435
            S+    +  +EI     +   Q   +   EE G+ +    +    +DE EK  ++D  E
Sbjct: 148 NSEEKHSKRTNEIVEVFYNPQSQATLKSVFEEVGKLDGQSKQKRERSDEREKSYQDD--E 205

Query: 436 SDGGR 440
            D  R
Sbjct: 206 DDTYR 210



 Score = 33.4 bits (76), Expect = 0.27
 Identities = 20/142 (14%), Positives = 38/142 (26%), Gaps = 21/142 (14%)

Query: 323 LVRSDFSLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYA--ESKTNKNTSDYD 380
            V          + EE +  EE   S E+        ++  R      E +  +   +  
Sbjct: 218 DVVGGEDWNPIEEGEEDQTQEEVKRSKERTHKGRSLPDESKRSGQLGLEDEAEEEKEEKG 277

Query: 381 DESD-----IEIVDEIDNDEDK-QDESEEDGEEEGEEEASENE-------------YYTE 421
            ES         +    N   + + ++  + E   E E  E +                 
Sbjct: 278 QESRGLSAVQTYLLRWVNARGRGRSQNRAERERSEESEEKELDRASPYQEIEITANLQIP 337

Query: 422 TDEDEKENEEDDGESDGGRRRG 443
             +  +   + DG    GR   
Sbjct: 338 PSDLIRMLRKGDGRKPRGRVEE 359


>gnl|CDD|219939 pfam08619, Nha1_C, Alkali metal cation/H+ antiporter Nha1 C
           terminus.  The C terminus of the plasma membrane Nha1
           antiporter plays an important role in the immediate cell
           response to hypo-osmotic shock which prevents an
           execessive loss of ions and water. This domain is found
           with pfam00999.
          Length = 430

 Score = 33.2 bits (76), Expect = 0.27
 Identities = 22/115 (19%), Positives = 36/115 (31%), Gaps = 7/115 (6%)

Query: 329 SLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKT-NKNTSDYDDESDIEI 387
            + N   K  +E   E    +    P   E++        +SK+  K   D         
Sbjct: 304 QIRNLDPKARREVDLETNAPERVKAPARDEVDTNDSSPLTDSKSEEKLEDDLAHSLLGSE 363

Query: 388 VDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGGRRR 442
             E   + +    S         +  SE+E     D D     +D+ E+   RRR
Sbjct: 364 ASERRKNREPSSSSPPSRPRSRRD--SEDEDTERDDSDS----DDEEETPAERRR 412



 Score = 33.2 bits (76), Expect = 0.28
 Identities = 13/88 (14%), Positives = 24/88 (27%)

Query: 346 GDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDG 405
             S     P+              +    +    D E+         ++     E     
Sbjct: 94  PISQSAISPQRRSSTGENSAESDNTSYGLSKLAEDSENIDVRPVYESDERSGISEEGSRP 153

Query: 406 EEEGEEEASENEYYTETDEDEKENEEDD 433
            +  E+E    E Y E D+   EN + +
Sbjct: 154 SKLREQEQRPTEAYQEGDQIIIENRQGE 181


>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457).  This is
           a family of uncharacterized proteins.
          Length = 449

 Score = 33.0 bits (75), Expect = 0.31
 Identities = 20/71 (28%), Positives = 39/71 (54%), Gaps = 1/71 (1%)

Query: 369 ESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKE 428
           E+   ++  + DD+ D +  ++ D+D+D +D+ +ED ++    + S  +   ETD +   
Sbjct: 48  EAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLHDDSSADDGNETDNEAGF 107

Query: 429 NEEDDGESDGG 439
            + DD ESD G
Sbjct: 108 ADSDD-ESDDG 117



 Score = 30.7 bits (69), Expect = 1.9
 Identities = 23/98 (23%), Positives = 38/98 (38%), Gaps = 21/98 (21%)

Query: 333 ETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEID 392
           +  +EE    E+  + D+  D +  E                   D DD+ D +  DE D
Sbjct: 43  KEAEEEAMEEEDDDEEDDDDDDDEDED------------------DDDDDDDEDDEDEDD 84

Query: 393 NDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENE 430
           +D    D+S  D   +G E  +E  +    DE +  +E
Sbjct: 85  DDSTLHDDSSAD---DGNETDNEAGFADSDDESDDGSE 119



 Score = 30.0 bits (67), Expect = 2.6
 Identities = 9/41 (21%), Positives = 26/41 (63%)

Query: 397 KQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESD 437
           ++   EED +EE +++  + +   + D+D++++E++D +  
Sbjct: 47  EEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDS 87



 Score = 29.6 bits (66), Expect = 3.4
 Identities = 23/98 (23%), Positives = 39/98 (39%), Gaps = 20/98 (20%)

Query: 335 KKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEIDND 394
            KE +E + E+ D DE+ D +                      D D++ D +  DE D D
Sbjct: 42  GKEAEEEAMEEEDDDEEDDDD--------------------DDDEDEDDDDDDDDEDDED 81

Query: 395 EDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEED 432
           ED  D +  D     +   ++NE      +DE ++  +
Sbjct: 82  EDDDDSTLHDDSSADDGNETDNEAGFADSDDESDDGSE 119



 Score = 29.6 bits (66), Expect = 3.8
 Identities = 15/43 (34%), Positives = 28/43 (65%)

Query: 395 EDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESD 437
           E++  E E+D EE+ +++  E+E   + D+DE + +EDD +S 
Sbjct: 46  EEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDST 88



 Score = 29.6 bits (66), Expect = 3.9
 Identities = 13/52 (25%), Positives = 29/52 (55%)

Query: 386 EIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESD 437
           E  +E   +ED  +E ++D ++E E++  +++   + DED+ ++   D  S 
Sbjct: 44  EAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLHDDSSA 95



 Score = 29.2 bits (65), Expect = 4.5
 Identities = 13/31 (41%), Positives = 21/31 (67%)

Query: 407 EEGEEEASENEYYTETDEDEKENEEDDGESD 437
           +E EEEA E E   E D+D+ ++E++D + D
Sbjct: 43  KEAEEEAMEEEDDDEEDDDDDDDEDEDDDDD 73



 Score = 28.4 bits (63), Expect = 8.4
 Identities = 12/35 (34%), Positives = 23/35 (65%)

Query: 403 EDGEEEGEEEASENEYYTETDEDEKENEEDDGESD 437
           ++ EEE  EE  ++E   + D+DE E+++DD + +
Sbjct: 43  KEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDE 77


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 32.8 bits (75), Expect = 0.39
 Identities = 15/48 (31%), Positives = 27/48 (56%)

Query: 384 DIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEE 431
            I +  E + +++  DE EE+ +EE +EE  +     E  ++E+E EE
Sbjct: 22  PIYLWVEKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEE 69



 Score = 30.1 bits (68), Expect = 2.4
 Identities = 14/46 (30%), Positives = 26/46 (56%), Gaps = 5/46 (10%)

Query: 382 ESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEK 427
           E + E+ DE + +E ++ + EE+   + EEE  E     E +++EK
Sbjct: 30  EVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDE-----EEEKEEK 70



 Score = 29.7 bits (67), Expect = 3.7
 Identities = 16/62 (25%), Positives = 29/62 (46%), Gaps = 1/62 (1%)

Query: 367 YAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDE 426
           + E +  K   D ++E + E   E + ++    E E D EEE EE+  + +   ET  + 
Sbjct: 26  WVEKEVEKEVPDEEEEEEKEEKKE-EEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEW 84

Query: 427 KE 428
           + 
Sbjct: 85  EL 86


>gnl|CDD|226246 COG3723, RecT, Recombinational DNA repair protein (RecE pathway)
           [DNA replication, recombination, and repair].
          Length = 276

 Score = 32.1 bits (73), Expect = 0.44
 Identities = 17/56 (30%), Positives = 26/56 (46%), Gaps = 3/56 (5%)

Query: 381 DESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGES 436
           DE++     ++  +ED +   + D E  GE EA E +   E   D  E +E   ES
Sbjct: 224 DEAERREPLDVTPEEDSE---QLDSEPSGEPEAQETDEENEKPIDVSEEKEPITES 276


>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
          Length = 211

 Score = 32.0 bits (73), Expect = 0.45
 Identities = 9/59 (15%), Positives = 25/59 (42%), Gaps = 1/59 (1%)

Query: 378 DYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGES 436
           +  D     + ++   + + ++E +   E+   EE  + E   E  E+  E + ++ + 
Sbjct: 4   ECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEE-IIEDSEESNEVKIEELKD 61



 Score = 31.3 bits (71), Expect = 0.75
 Identities = 14/52 (26%), Positives = 25/52 (48%), Gaps = 3/52 (5%)

Query: 389 DEIDNDEDKQDESEEDGEEEGE---EEASENEYYTETDEDEKENEEDDGESD 437
            E +  + K +  EED  +E E   E+  + E     + +++E  ED  ES+
Sbjct: 1   MEKECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESN 52



 Score = 31.3 bits (71), Expect = 0.84
 Identities = 20/85 (23%), Positives = 37/85 (43%), Gaps = 5/85 (5%)

Query: 355 EHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEE--EGEEE 412
              E      EN  E    +N    + E D    ++++ +E +++E  ED EE  E + E
Sbjct: 1   MEKECKDAKHENMEEDCCKENE---NKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIE 57

Query: 413 ASENEYYTETDEDEKENEEDDGESD 437
             ++E     +E++K   E +   D
Sbjct: 58  ELKDENNKLKEENKKLENELEALKD 82


>gnl|CDD|217450 pfam03247, Prothymosin, Prothymosin/parathymosin family.
           Prothymosin alpha and parathymosin are two ubiquitous
           small acidic nuclear proteins that are thought to be
           involved in cell cycle progression, proliferation, and
           cell differentiation.
          Length = 106

 Score = 30.3 bits (68), Expect = 0.52
 Identities = 19/59 (32%), Positives = 34/59 (57%)

Query: 380 DDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDG 438
           DDE + E   + D++E++ +  EE+GEEE E E +  +   E +ED+ E ++   + D 
Sbjct: 48  DDEMEEEEEVDEDDEEEEGEGEEEEGEEEEETEGATGKRAAEDEEDDAETKKQKTDEDD 106


>gnl|CDD|185618 PTZ00438, PTZ00438, gamete antigen 27/25-like protein; Provisional.
          Length = 374

 Score = 32.3 bits (73), Expect = 0.53
 Identities = 25/80 (31%), Positives = 43/80 (53%), Gaps = 6/80 (7%)

Query: 348 SDEKYDPEH---FEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDE---S 401
           +++K D E+    E   +  +N  E  T K   + +D  +IE V+E++  E++ D+   S
Sbjct: 78  ANDKSDNENDVELEGLNIIVKNEEERGTQKEEEEDEDVEEIEEVEEVEVVEEEYDDDEDS 137

Query: 402 EEDGEEEGEEEASENEYYTE 421
           E+D E+E + E  ENE   E
Sbjct: 138 EKDDEKESDAEGDENELAGE 157



 Score = 29.3 bits (65), Expect = 4.7
 Identities = 22/68 (32%), Positives = 39/68 (57%), Gaps = 8/68 (11%)

Query: 379 YDDESDIEIV---DEIDNDEDK--QDESEEDGEEEGEEEASENEYYTET---DEDEKENE 430
            D+E+D+E+      + N+E++  Q E EED + E  EE  E E   E    DED ++++
Sbjct: 82  SDNENDVELEGLNIIVKNEEERGTQKEEEEDEDVEEIEEVEEVEVVEEEYDDDEDSEKDD 141

Query: 431 EDDGESDG 438
           E + +++G
Sbjct: 142 EKESDAEG 149


>gnl|CDD|218747 pfam05786, Cnd2, Condensin complex subunit 2.  This family consists
           of several Barren protein homologues from several
           eukaryotic organisms. In Drosophila Barren (barr) is
           required for sister-chromatid segregation in mitosis.
           barr encodes a novel protein that is present in
           proliferating cells and has homologues in yeast and
           human. Mitotic defects in barr embryos become apparent
           during cycle 16, resulting in a loss of PNS and CNS
           neurons. Centromeres move apart at the
           metaphase-anaphase transition and Cyclin B is degraded,
           but sister chromatids remain connected, resulting in
           chromatin bridging. Barren protein localises to
           chromatin throughout mitosis. Colocalisation and
           biochemical experiments indicate that Barren associates
           with Topoisomerase II throughout mitosis and alters the
           activity of Topoisomerase II. It has been suggested that
           this association is required for proper chromosomal
           segregation by facilitating the decatenation of
           chromatids at anaphase. This family forms one of the
           three non-structural maintenance of chromosomes (SMC)
           subunits of the mitotic condensation complex along with
           Cnd1 and Cnd3.
          Length = 719

 Score = 32.0 bits (73), Expect = 0.63
 Identities = 23/94 (24%), Positives = 32/94 (34%), Gaps = 7/94 (7%)

Query: 351 KYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIV-DEIDNDEDKQDESEEDGEEEG 409
               + F       EN  ES  N +    D++   ++  D     ED  DE  EDG + G
Sbjct: 305 GSLDDPFLKAPEDTENEDESDDNAD-DFDDNDKKGDVGFDINAEVEDDPDEDAEDGGDGG 363

Query: 410 EEEASENEYYT-----ETDEDEKENEEDDGESDG 438
           + E  E E          D    E  + D   D 
Sbjct: 364 DAEEFEAEKEPMLKVCRVDRQVTEIIDGDEPVDY 397


>gnl|CDD|107300 cd06305, PBP1_methylthioribose_binding_like,
           Methylthioribose-binding protein-like of ABC-type
           transport systems that belong to a family of
           pentose/hexose sugar-binding proteins of the type I
           periplasmic binding protein (PBP1) superfamily.
           Methylthioribose-binding protein-like of ABC-type
           transport systems that belong to a family of
           pentose/hexose sugar-binding proteins of the type I
           periplasmic binding protein (PBP1) superfamily, which
           consists of two alpha/beta globular domains connected by
           a three-stranded hinge. This Venus flytrap-like domain
           undergoes transition from an open to a closed
           conformational state upon ligand binding. The
           sugar-binding domain of the periplasmic proteins in this
           group is also homologous to the ligand-binding domain of
           eukaryotic receptors such as metabotropic glutamate
           receptor (mGluR), DNA-binding transcriptional repressors
           such as LacI and GalR.
          Length = 273

 Score = 31.9 bits (73), Expect = 0.63
 Identities = 16/57 (28%), Positives = 23/57 (40%), Gaps = 3/57 (5%)

Query: 224 PVLKVDIPDVALSCITTTNQSHYVIVGSTTGDMMAVDLRGKGKPCYKY-KGFAGCVK 279
           PV+  D+ D     +  T Q  Y +    + D +  DL GKG   Y    GF    +
Sbjct: 82  PVVAFDV-DSDNPKVNNTTQDDYSL-ARLSLDQLVKDLGGKGNVGYVNVAGFPPLDR 136


>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
          Length = 1036

 Score = 32.1 bits (73), Expect = 0.64
 Identities = 15/102 (14%), Positives = 37/102 (36%), Gaps = 16/102 (15%)

Query: 334 TKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEI---VDE 390
           ++K  Q+++ +K DS              T  +       + TS+  +E  ++     + 
Sbjct: 23  SQKRIQQNNGDKEDS-------------STSTSSLSVSAVEKTSNAKEEIQVDFQHNSES 69

Query: 391 IDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEED 432
              + + +DE E +  +    ++S          D    ++D
Sbjct: 70  AVEEVEAEDEIEVEQNQSDVLKSSSIVKEESISTDMDGIDDD 111


>gnl|CDD|216194 pfam00922, Phosphoprotein, Vesiculovirus phosphoprotein. 
          Length = 283

 Score = 31.8 bits (72), Expect = 0.65
 Identities = 13/69 (18%), Positives = 28/69 (40%), Gaps = 2/69 (2%)

Query: 370 SKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKEN 429
           S+  +   +Y        + EI+  E+++ +     +E+  EE +   YY   +  + E 
Sbjct: 1   SRLRQYLKNYPRLDQT--LSEIEEMEEQRADKSSTFQEDSVEEHTSPSYYLAEELSDSET 58

Query: 430 EEDDGESDG 438
           E    +  G
Sbjct: 59  EPSIEDDQG 67


>gnl|CDD|217502 pfam03343, SART-1, SART-1 family.  SART-1 is a protein involved in
           cell cycle arrest and pre-mRNA splicing. It has been
           shown to be a component of U4/U6 x U5 tri-snRNP complex
           in human, Schizosaccharomyces pombe and Saccharomyces
           cerevisiae. SART-1 is a known tumour antigen in a range
           of cancers recognised by T cells.
          Length = 603

 Score = 32.0 bits (73), Expect = 0.75
 Identities = 23/127 (18%), Positives = 43/127 (33%), Gaps = 31/127 (24%)

Query: 330 LENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVD 389
              + +KEEQE  +   D D+           + ++     K  K     D      I  
Sbjct: 324 DSPKKRKEEQEDDDFVEDDDDLQ-------ASLAKQRRLAQKKRKKLRPED------IAR 370

Query: 390 EIDNDEDKQDESEEDGEEEG------------------EEEASENEYYTETDEDEKENEE 431
           +I  +  + +E   +  + G                  EE+    +   E   D +E++E
Sbjct: 371 QIAEERSEDEEEAAENNDNGIVIDETSEFVRSLQKEPLEEKPENKDESVEEISDAEEDDE 430

Query: 432 DDGESDG 438
           D+ + DG
Sbjct: 431 DEEDEDG 437



 Score = 30.1 bits (68), Expect = 2.8
 Identities = 32/148 (21%), Positives = 55/148 (37%), Gaps = 27/148 (18%)

Query: 313 DYMQSRLTSILVRSDFSLENETKKEEQE-HSEEKGDSDEKYDPEHFEMNQVTRENY---A 368
           D  +    SIL + D  +E + KK +     +  G +D++ + +  E+ +  + N     
Sbjct: 196 DDDKFNKRSILSKYDEEIEGKKKKSDNLFTLDSGGSTDDEAEKKRQEVKKKLKINNVSLD 255

Query: 369 ESKTNKNTSDYDDESDIEIV--------------DEIDNDE---------DKQDESEEDG 405
           +  T    SDY D S++                  ++D DE              S +D 
Sbjct: 256 DDSTETPASDYYDVSEMVKFKKPKKKKKKKKKRRKDLDEDELEPEAEGLGSSDSGSRKDV 315

Query: 406 EEEGEEEASENEYYTETDEDEKENEEDD 433
           EEE        +   E  ED+   E+DD
Sbjct: 316 EEENARLEDSPKKRKEEQEDDDFVEDDD 343


>gnl|CDD|219069 pfam06512, Na_trans_assoc, Sodium ion transport-associated.
           Members of this family contain a region found
           exclusively in eukaryotic sodium channels or their
           subunits, many of which are voltage-gated. Members very
           often also contain between one and four copies of
           pfam00520 and, less often, one copy of pfam00612.
          Length = 230

 Score = 31.2 bits (71), Expect = 0.80
 Identities = 18/67 (26%), Positives = 27/67 (40%)

Query: 365 ENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDE 424
           EN  +  T+  +S    E   +   E    E+K   S E    + E    E E   E +E
Sbjct: 115 ENLNDEDTSSESSYGFKEESKKGSAETLKLEEKDSSSSEGSTVDLEPPEEEEEEIAEEEE 174

Query: 425 DEKENEE 431
           + KE E+
Sbjct: 175 EVKEPED 181



 Score = 30.0 bits (68), Expect = 2.2
 Identities = 22/131 (16%), Positives = 42/131 (32%), Gaps = 34/131 (25%)

Query: 337 EEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAES--KTNKNTSDYDD------------- 381
           E +++  +K   D   +    E N  T     +   KT        D             
Sbjct: 45  EAEDNLLDKAVDDILSNHYLKEGNGTTSVAIGDGMEKTIHGDVKESDPMSFITNPSLTVS 104

Query: 382 ------ESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYT-------------ET 422
                 ESD+E +++ D   +     +E+ ++   E     E  +             E 
Sbjct: 105 VPIASGESDLENLNDEDTSSESSYGFKEESKKGSAETLKLEEKDSSSSEGSTVDLEPPEE 164

Query: 423 DEDEKENEEDD 433
           +E+E   EE++
Sbjct: 165 EEEEIAEEEEE 175


>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
           biogenesis [Translation, ribosomal structure and
           biogenesis].
          Length = 1077

 Score = 32.0 bits (72), Expect = 0.83
 Identities = 23/94 (24%), Positives = 42/94 (44%), Gaps = 2/94 (2%)

Query: 340 EHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQD 399
           E  EE    DEK + E  E  + + +N  E     + +  ++ES  E+  E + +E+ + 
Sbjct: 590 EGEEELIQDDEKGNFEDLEDEENSSDNEMEESRGSSVTAENEESADEVDYETEREENARK 649

Query: 400 ESEEDG--EEEGEEEASENEYYTETDEDEKENEE 431
           + E  G  E E   +  + +    T+E  K  E+
Sbjct: 650 KEELRGNFELEERGDPEKKDVDWYTEEKRKIEEQ 683


>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin.  Nucleoplasmins are also
           known as chromatin decondensation proteins. They bind to
           core histones and transfer DNA to them in a reaction
           that requires ATP. This is thought to play a role in the
           assembly of regular nucleosomal arrays.
          Length = 146

 Score = 30.4 bits (69), Expect = 0.95
 Identities = 14/30 (46%), Positives = 23/30 (76%)

Query: 388 VDEIDNDEDKQDESEEDGEEEGEEEASENE 417
            +E ++D+D++DE EED EE+ +E+ SE E
Sbjct: 109 SEEDESDDDEEDEEEEDDEEDDDEDESEEE 138



 Score = 28.1 bits (63), Expect = 6.2
 Identities = 13/27 (48%), Positives = 18/27 (66%), Gaps = 2/27 (7%)

Query: 378 DYDDESDIEIVDEIDNDEDKQDESEED 404
             DDE D E  +E D ++D +DESEE+
Sbjct: 114 SDDDEEDEE--EEDDEEDDDEDESEEE 138



 Score = 27.7 bits (62), Expect = 6.9
 Identities = 15/35 (42%), Positives = 24/35 (68%), Gaps = 5/35 (14%)

Query: 380 DDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEAS 414
           +DESD +     + DE+++D+ E+D E+E EEE S
Sbjct: 111 EDESDDD-----EEDEEEEDDEEDDDEDESEEEES 140



 Score = 27.3 bits (61), Expect = 9.9
 Identities = 16/35 (45%), Positives = 24/35 (68%), Gaps = 5/35 (14%)

Query: 397 KQDESEEDGEEEGEEEASENEYYTETDEDEKENEE 431
           ++DES++D E+E EE+  E++     DEDE E EE
Sbjct: 110 EEDESDDDEEDEEEEDDEEDD-----DEDESEEEE 139


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 31.6 bits (72), Expect = 0.96
 Identities = 26/122 (21%), Positives = 47/122 (38%), Gaps = 4/122 (3%)

Query: 311 RKDYMQSRLTSILVRSDFSLENETKKEEQE-HSEEKGDSDEKYDPEHFEMNQVTRENYAE 369
            K+ +++    +  R     E E    E E   EE  +       E  E+ +  RE  AE
Sbjct: 328 LKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAE 387

Query: 370 SKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKEN 429
                  ++  +E + E+  EI++ E++ +   E  E+  EE         E   + +E 
Sbjct: 388 --LEAELAEIRNELE-ELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEEL 444

Query: 430 EE 431
            E
Sbjct: 445 NE 446


>gnl|CDD|218303 pfam04874, Mak16, Mak16 protein C-terminal region.  The precise
           function of this eukaryotic protein family is unknown.
           The yeast orthologues have been implicated in cell cycle
           progression and biogenesis of 60S ribosomal subunits.
           The Schistosoma mansoni Mak16 has been shown to target
           protein transport to the nucleolus.
          Length = 97

 Score = 29.5 bits (66), Expect = 0.99
 Identities = 16/41 (39%), Positives = 24/41 (58%)

Query: 395 EDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGE 435
           E ++ E  ++ EEE EEE  E E    +D++E E E +D E
Sbjct: 56  EAEESEENDEEEEEEEEEEDEGEIEYVSDDEELEEEIEDLE 96



 Score = 28.3 bits (63), Expect = 2.3
 Identities = 15/49 (30%), Positives = 22/49 (44%), Gaps = 1/49 (2%)

Query: 384 DIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEED 432
                 +    E+ ++  EE+ EEE EEE      Y   DE+ +E  ED
Sbjct: 47  SQSAFKKALEAEESEENDEEEEEEE-EEEDEGEIEYVSDDEELEEEIED 94



 Score = 27.9 bits (62), Expect = 3.8
 Identities = 12/43 (27%), Positives = 24/43 (55%)

Query: 386 EIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKE 428
            +  E   + D+++E EE+ E+EGE E   ++   E + ++ E
Sbjct: 54  ALEAEESEENDEEEEEEEEEEDEGEIEYVSDDEELEEEIEDLE 96


>gnl|CDD|227931 COG5644, COG5644, Uncharacterized conserved protein [Function
           unknown].
          Length = 869

 Score = 31.6 bits (71), Expect = 1.0
 Identities = 22/133 (16%), Positives = 55/133 (41%), Gaps = 25/133 (18%)

Query: 326 SDFSLENETKKEEQE-HSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESD 384
             F  E+E +  +   ++ + G S++ +      +N     +  +S  + N+   ++E  
Sbjct: 60  EAFDEEDEKRFADWSFNASKSGKSNKDHK----NLNNTKEISLNDSDDSVNSDKLENEGS 115

Query: 385 IEIVDE---------IDNDEDKQDESEEDGEEE-----------GEEEASENEYYTETDE 424
           +  +DE         +DND+ +++ES  +                  ++   E  +E++ 
Sbjct: 116 VSSIDENELVDLDTLLDNDQPEKNESGNNDHATDKENLLESDASSSNDSESEESDSESEI 175

Query: 425 DEKENEEDDGESD 437
           +  +++ DD  SD
Sbjct: 176 ESSDSDHDDENSD 188



 Score = 30.1 bits (67), Expect = 2.8
 Identities = 22/113 (19%), Positives = 47/113 (41%), Gaps = 13/113 (11%)

Query: 327 DFSLENETKKEEQEHSEEKGDSDEKYDP---------EHFEMNQVTRENYAESKTNKNTS 377
           D    N TK+     S++  +SD+  +          E  +++ +   +  E   + N  
Sbjct: 86  DHKNLNNTKEISLNDSDDSVNSDKLENEGSVSSIDENELVDLDTLLDNDQPEKNESGNN- 144

Query: 378 DYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENE 430
             D  +D E + E  +     D   E+ + E E E+S++++  E  + + +N 
Sbjct: 145 --DHATDKENLLE-SDASSSNDSESEESDSESEIESSDSDHDDENSDSKLDNL 194


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
           consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 31.5 bits (71), Expect = 1.1
 Identities = 17/106 (16%), Positives = 42/106 (39%), Gaps = 3/106 (2%)

Query: 331 ENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDY---DDESDIEI 387
           E   K++E ++  +  D+    + +    NQ      A+ +  KN  +     D    ++
Sbjct: 249 EVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDL 308

Query: 388 VDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDD 433
             E    E + ++ E + +++ E  A + +      E +  +  +D
Sbjct: 309 KQESKASEKEAEDKELEAQKKREPVAEDLQKTKPQVEAQPTSLNED 354


>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa).  Members of this
           family of proteins are part of the yeast nuclear pore
           complex-associated pre-60S ribosomal subunit. The family
           functions as a highly conserved exonuclease that is
           required for the 5'-end maturation of 5.8S and 25S
           rRNAs, demonstrating that 5'-end processing also has a
           redundant pathway. Nop25 binds late pre-60S ribosomes,
           accompanying them from the nucleolus to the nuclear
           periphery; and there is evidence for both physical and
           functional links between late 60S subunit processing and
           export.
          Length = 134

 Score = 30.0 bits (68), Expect = 1.1
 Identities = 11/51 (21%), Positives = 25/51 (49%)

Query: 390 EIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGGR 440
           E     +++++ EED E E  E+  ++E+    +    + EE+  + D  +
Sbjct: 74  EALKLLEEENDDEEDAETEDTEDVEDDEWEGFPEPTVTDYEEEYIDEDKYK 124


>gnl|CDD|237622 PRK14140, PRK14140, heat shock protein GrpE; Provisional.
          Length = 191

 Score = 30.4 bits (69), Expect = 1.2
 Identities = 16/52 (30%), Positives = 26/52 (50%)

Query: 380 DDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEE 431
            +E   E V+E + +E  +DE EE+  EE  E    +E   +  E E + +E
Sbjct: 4   KNEQVEEEVEETEVEEAVEDEVEEETVEEESEAELLDEEQAKIAELEAKLDE 55



 Score = 28.0 bits (63), Expect = 6.6
 Identities = 14/40 (35%), Positives = 19/40 (47%)

Query: 393 NDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEED 432
            +E  ++E EE   EE  E+  E E   E  E E  +EE 
Sbjct: 4   KNEQVEEEVEETEVEEAVEDEVEEETVEEESEAELLDEEQ 43


>gnl|CDD|218003 pfam04281, Tom22, Mitochondrial import receptor subunit Tom22.  The
           mitochondrial protein translocase family, which is
           responsible for movement of nuclear encoded pre-proteins
           into mitochondria, is very complex with at least 19
           components. These proteins include several chaperone
           proteins, four proteins of the outer membrane
           translocase (Tom) import receptor, five proteins of the
           Tom channel complex, five proteins of the inner membrane
           translocase (Tim) and three "motor" proteins. This
           family represents the Tom22 proteins. The N terminal
           region of Tom22 has been shown to have chaperone-like
           activity, and the C terminal region faces the
           intermembrane face.
          Length = 136

 Score = 29.9 bits (68), Expect = 1.2
 Identities = 12/49 (24%), Positives = 25/49 (51%), Gaps = 1/49 (2%)

Query: 388 VDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGES 436
           + E++ DE  Q++         EE   ++E  T+TD D  ++ + + E+
Sbjct: 4   LTEVE-DETFQEKPAAPKNLAQEESDDDDEDDTDTDSDISDDSDFENET 51



 Score = 28.0 bits (63), Expect = 4.9
 Identities = 11/47 (23%), Positives = 25/47 (53%), Gaps = 5/47 (10%)

Query: 355 EHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDES 401
           E F+      +N A+ +   +  D +D++D +   +I +D D ++E+
Sbjct: 10  ETFQEKPAAPKNLAQEE---SDDDDEDDTDTD--SDISDDSDFENET 51


>gnl|CDD|221490 pfam12253, CAF1A, Chromatin assembly factor 1 subunit A.  The CAF-1
           or chromatin assembly factor-1 consists of three
           subunits, and this is the first, or A. The A domain is
           uniquely required for the progression of S phase in
           mouse cells, independent of its ability to promote
           histone deposition but dependent on its ability to
           interact with HP1 - heterochromatin protein 1-rich
           heterochromatin domains next to centromeres that are
           crucial for chromosome segregation during mitosis. This
           HP1-CAF-1 interaction module functions as a built-in
           replication control for heterochromatin, which, like a
           control barrier, has an impact on S-phase progression in
           addition to DNA-based checkpoints.
          Length = 76

 Score = 28.7 bits (65), Expect = 1.2
 Identities = 15/36 (41%), Positives = 23/36 (63%), Gaps = 1/36 (2%)

Query: 378 DYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEA 413
           DYD +SD E  +E +  ED + E EED EE+ +++ 
Sbjct: 39  DYDYDSDAEW-EEEEEGEDLESEDEEDEEEDDDDDM 73



 Score = 28.0 bits (63), Expect = 1.9
 Identities = 19/49 (38%), Positives = 27/49 (55%), Gaps = 10/49 (20%)

Query: 390 EIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDG 438
            +D D D   E EE  EEEGE+         E++++E E E+DD + DG
Sbjct: 37  GLDYDYDSDAEWEE--EEEGED--------LESEDEEDEEEDDDDDMDG 75


>gnl|CDD|221173 pfam11702, DUF3295, Protein of unknown function (DUF3295).  This
           family is conserved in fungi but the function is not
           known.
          Length = 509

 Score = 31.1 bits (70), Expect = 1.2
 Identities = 14/60 (23%), Positives = 24/60 (40%), Gaps = 5/60 (8%)

Query: 369 ESKTNKNTSDYDDESDIEIVDEIDN-----DEDKQDESEEDGEEEGEEEASENEYYTETD 423
            ++T    +  DDE  IE  ++  +     D+D   + E+  EE G     E   +   D
Sbjct: 275 VTRTFPERTSDDDEDAIETEEDDVDESAIEDDDDDSDWEDSVEESGRSSVDEKTMFQRVD 334


>gnl|CDD|235548 PRK05657, PRK05657, RNA polymerase sigma factor RpoS; Validated.
          Length = 325

 Score = 30.7 bits (70), Expect = 1.4
 Identities = 10/48 (20%), Positives = 22/48 (45%)

Query: 368 AESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASE 415
                + N     DE+ +E+ DE    E++  +++   EE   + A++
Sbjct: 2   TLKVHDLNEDADFDENGVEVFDEKALVEEEPSDNDLAEEELLSQGATQ 49


>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO.  Bacillus
           spores are protected by a protein shell consisting of
           over 50 different polypeptides, known as the coat. This
           family of proteins has an important morphogenetic role
           in coat assembly, it is involved in the assembly of at
           least 5 different coat proteins including CotB, CotG,
           CotS, CotSA and CotW. It is likely to act at a late
           stage of coat assembly.
          Length = 185

 Score = 30.2 bits (68), Expect = 1.5
 Identities = 16/85 (18%), Positives = 35/85 (41%), Gaps = 1/85 (1%)

Query: 351 KYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGE 410
           K +     M +      A+ K  +  +  +     E   +I+ +E ++++   + E E  
Sbjct: 19  KLEEAKANMQKTFIIKKADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDI 78

Query: 411 EEASENEYYTETDEDEKENEEDDGE 435
            E  E E     +E+++E  ED  +
Sbjct: 79  AEQQEKE-EIAQEEEKEEEAEDVKQ 102


>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein.  This
           family includes the radial spoke head proteins RSP4 and
           RSP6 from Chlamydomonas reinhardtii, and several
           eukaryotic homologues, including mammalian RSHL1, the
           protein product of a familial ciliary dyskinesia
           candidate gene.
          Length = 481

 Score = 30.8 bits (70), Expect = 1.6
 Identities = 17/49 (34%), Positives = 23/49 (46%), Gaps = 4/49 (8%)

Query: 388 VDEIDNDEDKQDESEEDGEEEGEEEASENEY----YTETDEDEKENEED 432
           V+    DE+++ E EE+ EEE E E  E E      T   ED     +D
Sbjct: 342 VNPEQKDEEEEQEDEEEEEEEEEPEEPEPEEGPPLLTPISEDAPLPNDD 390



 Score = 29.6 bits (67), Expect = 3.8
 Identities = 15/42 (35%), Positives = 22/42 (52%), Gaps = 11/42 (26%)

Query: 393 NDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDG 434
           N E K +E EE  +EE EEE          +E+ +E E ++G
Sbjct: 343 NPEQKDEE-EEQEDEEEEEE----------EEEPEEPEPEEG 373


>gnl|CDD|218556 pfam05327, RRN3, RNA polymerase I specific transcription initiation
           factor RRN3.  This family consists of several eukaryotic
           proteins which are homologous to the yeast RRN3 protein.
           RRN3 is one of the RRN genes specifically required for
           the transcription of rDNA by RNA polymerase I (Pol I) in
           Saccharomyces cerevisiae.
          Length = 554

 Score = 30.7 bits (70), Expect = 1.6
 Identities = 13/54 (24%), Positives = 28/54 (51%), Gaps = 6/54 (11%)

Query: 384 DIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESD 437
           D+EI +E+D+ +D+++E     E++ +E+   +       +D+ E E D     
Sbjct: 215 DVEIQNELDDIDDEEEERVLADEDDDDEDDMFD------MDDDDEEESDPEVER 262



 Score = 29.2 bits (66), Expect = 4.7
 Identities = 13/37 (35%), Positives = 21/37 (56%)

Query: 380 DDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASEN 416
           D+E +  + DE D+DED   + ++D EEE + E    
Sbjct: 227 DEEEERVLADEDDDDEDDMFDMDDDDEEESDPEVERT 263


>gnl|CDD|117592 pfam09026, Cenp-B_dimeris, Centromere protein B dimerisation
           domain.  The centromere protein B (CENP-B) dimerisation
           domain is composed of two alpha-helices, which are
           folded into an antiparallel configuration. Dimerisation
           of CENP-B is mediated by this domain, in which monomers
           dimerise to form a symmetrical, antiparallel, four-helix
           bundle structure with a large hydrophobic patch in which
           23 residues of one monomer form van der Waals contacts
           with the other monomer. This CENP-B dimer configuration
           may be suitable for capturing two distant CENP-B boxes
           during centromeric heterochromatin formation.
          Length = 101

 Score = 29.0 bits (64), Expect = 1.7
 Identities = 13/34 (38%), Positives = 23/34 (67%), Gaps = 1/34 (2%)

Query: 400 ESEEDGEEEGEEEASENEYYTETDEDEKENEEDD 433
           E EED + + +EE  +++   E D+DE ++E+DD
Sbjct: 7   EGEEDSDSDSDEE-EDDDDEDEEDDDEDDDEDDD 39


>gnl|CDD|165563 PHA03308, PHA03308, transcriptional regulator ICP4; Provisional.
          Length = 1463

 Score = 30.9 bits (69), Expect = 1.7
 Identities = 13/31 (41%), Positives = 19/31 (61%)

Query: 413  ASENEYYTETDEDEKENEEDDGESDGGRRRG 443
            +S +   +E D DEK  +ED   + GG+RRG
Sbjct: 1267 SSSDSSSSEEDGDEKNEKEDRERAGGGKRRG 1297


>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II).  Bone
           sialoprotein (BSP) is a major structural protein of the
           bone matrix that is specifically expressed by
           fully-differentiated osteoblasts. The expression of bone
           sialoprotein (BSP) is normally restricted to mineralised
           connective tissues of bones and teeth where it has been
           associated with mineral crystal formation. However, it
           has been found that ectopic expression of BSP occurs in
           various lesions, including oral and extraoral
           carcinomas, in which it has been associated with the
           formation of microcrystalline deposits and the
           metastasis of cancer cells to bone.
          Length = 291

 Score = 30.4 bits (68), Expect = 1.8
 Identities = 31/118 (26%), Positives = 48/118 (40%), Gaps = 13/118 (11%)

Query: 331 ENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDY-----DDESDI 385
           E ET  EE+ + +  G+ DE+ + E+  ++ VT     ++                ++  
Sbjct: 65  EEETSNEEENNEDSDGNEDEEAEAENTTLSTVTLGYGGDATPGTGNIGLAALQLPKKAGN 124

Query: 386 EIVDEIDNDEDKQDESEEDGEEEGEEEASENEY--------YTETDEDEKENEEDDGE 435
                   DE  +DE EE+ EEE E E  ENE          TE D     +  D+GE
Sbjct: 125 AGKKATKEDESDEDEEEEEEEEEEEAEVEENEQGTNGTSTNSTEVDHGNGSSGGDNGE 182



 Score = 30.0 bits (67), Expect = 2.2
 Identities = 15/42 (35%), Positives = 22/42 (52%), Gaps = 2/42 (4%)

Query: 399 DESEE--DGEEEGEEEASENEYYTETDEDEKENEEDDGESDG 438
           D SEE  DG+   EE   E     E +ED   NE+++ E++ 
Sbjct: 49  DSSEENGDGDSSEEEGEEETSNEEENNEDSDGNEDEEAEAEN 90


>gnl|CDD|237284 PRK13108, PRK13108, prolipoprotein diacylglyceryl transferase;
           Reviewed.
          Length = 460

 Score = 30.7 bits (69), Expect = 1.8
 Identities = 14/66 (21%), Positives = 19/66 (28%), Gaps = 2/66 (3%)

Query: 377 SDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEE-EASENEYYTETDEDEKEN-EEDDG 434
            +   E+DIE     D         + D E      E          DE E E  E+   
Sbjct: 363 VEETSEADIEREQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPEVPEKAAP 422

Query: 435 ESDGGR 440
             D  +
Sbjct: 423 IPDPAK 428


>gnl|CDD|227447 COG5117, NOC3, Protein involved in the nuclear export of
           pre-ribosomes [Translation, ribosomal structure and
           biogenesis / Intracellular trafficking and secretion].
          Length = 657

 Score = 30.4 bits (68), Expect = 2.0
 Identities = 15/90 (16%), Positives = 39/90 (43%), Gaps = 1/90 (1%)

Query: 328 FSLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEI 387
           + L+  +  EE++  E +      ++ E +    +  ++   +  N N  ++  ES+ E 
Sbjct: 44  YDLKKSSSDEEEQDYELRPRVSSSWNNESYNRLPIKTKDNVVADVN-NGEEFLSESESEA 102

Query: 388 VDEIDNDEDKQDESEEDGEEEGEEEASENE 417
             EID+D   + +   + ++   E   + +
Sbjct: 103 SLEIDSDIKDEKQKSLEEQKIAPEIPVKQQ 132



 Score = 30.0 bits (67), Expect = 3.0
 Identities = 26/138 (18%), Positives = 49/138 (35%), Gaps = 32/138 (23%)

Query: 325 RSDFSLENET-KKEEQEHSEEKG--------DSDEK---------YDPEHFEMNQVTREN 366
            S  S E  T KK E+E ++E          D  +K          D E  +     R +
Sbjct: 6   NSQSSTEPRTTKKAERELTQEDIEFFNENPEDLGKKVIYDLKKSSSDEEEQDYELRPRVS 65

Query: 367 YAES---------KTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENE 417
            + +         KT  N       +D+   +E  ++ + +   E D + + E++ S  E
Sbjct: 66  SSWNNESYNRLPIKTKDNV-----VADVNNGEEFLSESESEASLEIDSDIKDEKQKSLEE 120

Query: 418 YYTETDEDEKENEEDDGE 435
                +   K+  + + E
Sbjct: 121 QKIAPEIPVKQQIDSEKE 138


>gnl|CDD|218504 pfam05215, Spiralin, Spiralin.  This family consists of Spiralin
           proteins found in spiroplasma bacteria. Spiroplasmas are
           helically shaped pathogenic bacteria related to the
           mycoplasmas. The surface of spiroplasma bacteria is
           crowded with the membrane-anchored lipoprotein spiralin
           whose structure and function are unknown although its
           cellular function is thought to be a structural and
           mechanical one rather than a catalytic one.
          Length = 239

 Score = 29.9 bits (67), Expect = 2.3
 Identities = 17/39 (43%), Positives = 21/39 (53%), Gaps = 4/39 (10%)

Query: 197 PNSDKVAVCSKQGYVRVYDPKITTQRRPVLKVDIPDVAL 235
           P  DK  V  K GY++V  PK +     V KVDI DV +
Sbjct: 111 PAKDKTVVIGKTGYIKVTLPKNS----EVKKVDISDVTV 145


>gnl|CDD|164860 PHA01625, PHA01625, hypothetical protein.
          Length = 249

 Score = 29.8 bits (67), Expect = 2.3
 Identities = 17/55 (30%), Positives = 34/55 (61%), Gaps = 3/55 (5%)

Query: 21  VNLSKDKCIIKNIQNVKYITNKASVLALNWGNVEESELLIAYSGQKIKIYDTDFK 75
           +NL+  + +I++I ++K I      L L+  +++ ++LL+ YS   I++YD  FK
Sbjct: 77  INLTYRQKMIRDIDDLKQILES---LNLSKVSLKTNKLLVKYSNYSIELYDIYFK 128


>gnl|CDD|217830 pfam03986, Autophagy_N, Autophagocytosis associated protein (Atg3),
           N-terminal domain.  Autophagocytosis is a
           starvation-induced process responsible for transport of
           cytoplasmic proteins to the lysosome/vacuole. Atg3 is a
           ubiquitin like modifier that is topologically similar to
           the canonical E2 enzyme. It catalyzes the conjugation of
           Atg8 and phosphatidylethanolamine.
          Length = 146

 Score = 29.2 bits (66), Expect = 2.3
 Identities = 10/44 (22%), Positives = 21/44 (47%)

Query: 370 SKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEA 413
           +         DD +D E + EI +D+D   +S +  E++ ++  
Sbjct: 103 TTHGNRDKQKDDIADEEDIPEIGDDDDDVVDSSDADEDDDDDIP 146


>gnl|CDD|233148 TIGR00844, c_cpa1, na(+)/h(+) antiporter.  The Monovalent
           Cation:Proton Antiporter-1 (CPA1) Family (TC 2.A.36) The
           CPA1 family is a large family of proteins derived from
           Gram-positive and Gram-negative bacteria, blue green
           bacteria, yeast, plants and animals. Transporters from
           eukaryotes have been functionally characterized, and all
           of these catalyze Na+:H+ exchange. Their primary
           physiological functions may be in (1) cytoplasmic pH
           regulation, extruding the H+ generated during
           metabolism, and (2) salt tolerance (in plants), due to
           Na+ uptake into vacuoles. This model is specific for the
           fungal members of this family [Transport and binding
           proteins, Cations and iron carrying compounds].
          Length = 810

 Score = 30.3 bits (68), Expect = 2.3
 Identities = 22/74 (29%), Positives = 33/74 (44%), Gaps = 7/74 (9%)

Query: 365 ENYAESKTNK-----NTSDYDD-ESDIEIVDEIDNDEDKQDESEEDG-EEEGEEEASENE 417
           +  A    N+      TS YD  ES+I+ V   +ND +    SE    ++  EEE     
Sbjct: 589 DKLARDTENRDVTYVPTSRYDGIESEIDDVYTYENDSESIASSERRRIKKLREEEQQAYI 648

Query: 418 YYTETDEDEKENEE 431
            YTE ++   EN +
Sbjct: 649 AYTEDNQVIIENRQ 662


>gnl|CDD|227925 COG5638, COG5638, Uncharacterized conserved protein [Function
           unknown].
          Length = 622

 Score = 30.1 bits (67), Expect = 2.4
 Identities = 21/93 (22%), Positives = 43/93 (46%), Gaps = 4/93 (4%)

Query: 324 VRSDFSLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDES 383
            ++  S++   +   Q+ + ++ D   + + E  E +++T      S +++ T +YD   
Sbjct: 47  FKTTASVDRYGRPLNQDKATKEIDRLYELENESSESSEITDNEEVASASSELTDEYDPAR 106

Query: 384 DIEIVDEIDNDEDKQDESEEDGEEEGEEEASEN 416
                + I +  +  DES E+ EEE   E SE 
Sbjct: 107 G----EGIISTSESSDESREESEEEKANEISEK 135


>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen.  This
           family consists of several Theileria P67 surface
           antigens. A stage specific surface antigen of Theileria
           parva, p67, is the basis for the development of an
           anti-sporozoite vaccine for the control of East Coast
           fever (ECF) in cattle. The antigen has been shown to
           contain five distinct linear peptide sequences
           recognised by sporozoite-neutralising murine monoclonal
           antibodies.
          Length = 727

 Score = 30.4 bits (68), Expect = 2.4
 Identities = 18/80 (22%), Positives = 37/80 (46%), Gaps = 4/80 (5%)

Query: 361 QVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEE----ASEN 416
            V++E+  +  T +N      ++D E  D+   +ED +  S +DG+   + +     S  
Sbjct: 92  PVSQESEVQDNTEQNQDTKGSKTDSEEDDDDSEEEDNKSTSSKDGKGSKKTQPGVSTSSG 151

Query: 417 EYYTETDEDEKENEEDDGES 436
              + TD + K+++   G S
Sbjct: 152 STTSGTDLNTKQSQTGLGAS 171


>gnl|CDD|113290 pfam04514, BTV_NS2, Bluetongue virus non-structural protein NS2.
           This family includes NS2 proteins from other members of
           the Orbivirus genus. NS2 is a non-specific
           single-stranded RNA-binding protein that forms large
           homomultimers and accumulates in viral inclusion bodies
           of infected cells. Three RNA binding regions have been
           identified in Bluetongue virus serotype 17 at residues
           2-11, 153-166 and 274-286. NS2 multimers also possess
           nucleotidyl phosphatase activity. The precise function
           of NS2 is not known, but it may be involved in the
           transport and condensation of viral mRNAs.
          Length = 363

 Score = 30.2 bits (68), Expect = 2.4
 Identities = 22/116 (18%), Positives = 41/116 (35%), Gaps = 25/116 (21%)

Query: 331 ENETKKEEQEHS--------EEKGDSDEKYDPEHFEMNQVT-----RENYAESKTN--KN 375
           E E +KEEQ               DSDE  +P   E   +T     ++   E +    + 
Sbjct: 164 EREKEKEEQPMKPAFKPERWMGGPDSDEDENPLDEEAPDMTPETSKQDQKEERRAAVERR 223

Query: 376 TSDYDDESDIEIVDEIDNDEDKQ----------DESEEDGEEEGEEEASENEYYTE 421
            ++  +  +  + +   +   +Q          D+ +E GE+  + E     Y T 
Sbjct: 224 LAELVEMINWNLEERRRDLRKEQELEENVERDSDDEDEHGEDSEDGETKPESYITS 279


>gnl|CDD|215079 PLN00151, PLN00151, potassium transporter; Provisional.
          Length = 852

 Score = 30.5 bits (69), Expect = 2.4
 Identities = 7/35 (20%), Positives = 14/35 (40%)

Query: 376 TSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGE 410
            S+ D E    + D+ D+ +      EE+ +    
Sbjct: 32  GSEDDSEDPASLDDDDDDRDSGGGMLEEEEDGNMR 66


>gnl|CDD|227563 COG5238, RNA1, Ran GTPase-activating protein (RanGAP) involved in
           mRNA processing and transport [Signal transduction
           mechanisms / RNA processing and modification].
          Length = 388

 Score = 29.9 bits (67), Expect = 2.6
 Identities = 14/52 (26%), Positives = 22/52 (42%), Gaps = 6/52 (11%)

Query: 383 SDIEIVD--EIDND-EDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEE 431
             +E+V+  E D   E+  DE  ED  E  E    E     E +  E + ++
Sbjct: 330 EVVEVVEKQEGDVVTEESTDEESEDEVEIDESVIEEV---AEMELLEVQVDD 378


>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
           [Intracellular trafficking and secretion].
          Length = 776

 Score = 30.3 bits (68), Expect = 2.6
 Identities = 15/57 (26%), Positives = 27/57 (47%), Gaps = 3/57 (5%)

Query: 383 SDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGG 439
            DI   ++++ +    +ESE   EEE E+   EN++     +D+ E E     +D  
Sbjct: 365 DDILYFEKMEIENRNPEESEH--EEEVEDYEDENDHSKRICDDD-ELENHFRAADEK 418



 Score = 29.2 bits (65), Expect = 5.0
 Identities = 30/126 (23%), Positives = 45/126 (35%), Gaps = 11/126 (8%)

Query: 286 NENLILSVSLDRHFRVHDFHSKKLLRKDYMQSRLTSILVRSDFSLENETK-KEEQEHSEE 344
           N   + +   D  F          L      S+    L R+   L NE    EE E  + 
Sbjct: 295 NTTFVWNYMEDNVFHSFSLRFLDNLDFLQFLSKYIGCLWRN---LNNEKWGNEEAERKDY 351

Query: 345 KGDSD------EKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQ 398
             DS       +  D  +FE  ++   N  ES+  +   DY+DE+D       D+DE + 
Sbjct: 352 ILDSSSVPLEKQFDDILYFEKMEIENRNPEESEHEEEVEDYEDEND-HSKRICDDDELEN 410

Query: 399 DESEED 404
                D
Sbjct: 411 HFRAAD 416


>gnl|CDD|218555 pfam05320, Pox_RNA_Pol_19, Poxvirus DNA-directed RNA polymerase 19
           kDa subunit.  This family contains several DNA-directed
           RNA polymerase 19 kDa polypeptides. The Poxvirus
           DNA-directed RNA polymerase (EC: 2.7.7.6) catalyzes
           DNA-template-directed extension of the 3'-end of an RNA
           strand by one nucleotide at a time.
          Length = 167

 Score = 29.3 bits (66), Expect = 2.8
 Identities = 13/36 (36%), Positives = 20/36 (55%), Gaps = 1/36 (2%)

Query: 380 DDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASE 415
           +D  DI   +  D+D ++ +E EED EE+ E   S 
Sbjct: 2   EDSDDIIDYESDDDDSEEYEEEEED-EEDAESLESS 36


>gnl|CDD|239286 cd02988, Phd_like_VIAF, Phosducin (Phd)-like family, Viral
           inhibitor of apoptosis (IAP)-associated factor (VIAF)
           subfamily; VIAF is a Phd-like protein that functions in
           caspase activation during apoptosis. It was identified
           as an IAP binding protein through a screen of a human
           B-cell library using a prototype IAP. VIAF lacks a
           consensus IAP binding motif and while it does not
           function as an IAP antagonist, it still plays a
           regulatory role in the complete activation of caspases.
           VIAF itself is a substrate for IAP-mediated
           ubiquitination, suggesting that it may be a target of
           IAPs in the prevention of cell death. The similarity of
           VIAF to Phd points to a potential role distinct from
           apoptosis regulation. Phd functions as a cytosolic
           regulator of G protein by specifically binding to G
           protein betagamma (Gbg)-subunits. The C-terminal domain
           of Phd adopts a thioredoxin fold, but it does not
           contain a CXXC motif. Phd interacts with G protein beta
           mostly through the N-terminal helical domain.
          Length = 192

 Score = 29.2 bits (66), Expect = 3.0
 Identities = 14/40 (35%), Positives = 19/40 (47%), Gaps = 1/40 (2%)

Query: 398 QDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESD 437
             E EE+  E   +EA EN    +   DE + E D+ E D
Sbjct: 20  PKEEEEEALELAIQEAHENAL-EKKLLDELDEELDEEEDD 58


>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
           Genome duplication is precisely regulated by
           cyclin-dependent kinases CDKs, which bring about the
           onset of S phase by activating replication origins and
           then prevent relicensing of origins until mitosis is
           completed. The optimum sequence motif for CDK
           phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
           to have at least 11 potential phosphorylation sites.
           Drc1 is required for DNA synthesis and S-M replication
           checkpoint control. Drc1 associates with Cdc2 and is
           phosphorylated at the onset of S phase when Cdc2 is
           activated. Thus Cdc2 promotes DNA replication by
           phosphorylating Drc1 and regulating its association with
           Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
           substrates required for DNA replication.
          Length = 397

 Score = 29.8 bits (67), Expect = 3.0
 Identities = 26/111 (23%), Positives = 42/111 (37%), Gaps = 9/111 (8%)

Query: 323 LVRSDFSLENETKKEEQEH--SEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYD 380
           ++R   + E      E+E   S+   D   +   +  +     R      +   +     
Sbjct: 252 VLREIEAEEAGIGPIEEEVVDSQAANDEPRRVFKKKGQKRTTRRVKMRPVRAKPSDEPSL 311

Query: 381 DESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEE 431
            ESDI   +EI   ++K   SE  G   G +E  E+E     DE+ KE  E
Sbjct: 312 PESDIH--EEIPKLDEKS-LSEFLGYMGGIDEDDEDE----DDEESKEEVE 355


>gnl|CDD|191716 pfam07263, DMP1, Dentin matrix protein 1 (DMP1).  This family
           consists of several mammalian dentin matrix protein 1
           (DMP1) sequences. The dentin matrix acidic
           phosphoprotein 1 (DMP1) gene has been mapped to human
           chromosome 4q21. DMP1 is a bone and teeth specific
           protein initially identified from mineralised dentin.
           DMP1 is primarily localised in the nuclear compartment
           of undifferentiated osteoblasts. In the nucleus, DMP1
           acts as a transcriptional component for activation of
           osteoblast-specific genes like osteocalcin. During the
           early phase of osteoblast maturation, Ca(2+) surges into
           the nucleus from the cytoplasm, triggering the
           phosphorylation of DMP1 by a nuclear isoform of casein
           kinase II. This phosphorylated DMP1 is then exported out
           into the extracellular matrix, where it regulates
           nucleation of hydroxyapatite. DMP1 is a unique molecule
           that initiates osteoblast differentiation by
           transcription in the nucleus and orchestrates
           mineralised matrix formation extracellularly, at later
           stages of osteoblast maturation. The DMP1 gene has been
           found to be ectopically expressed in lung cancer
           although the reason for this is unknown.
          Length = 514

 Score = 30.0 bits (67), Expect = 3.2
 Identities = 28/121 (23%), Positives = 58/121 (47%), Gaps = 9/121 (7%)

Query: 329 SLENETKKEEQEHSEEKGDSDEKYDP--EHFEMNQVTRENYAESKTNKNTSDYDDESDIE 386
           S EN ++ +E+  SE +GD+ +      E  E ++ + E+  ++ ++  +   ++++D E
Sbjct: 343 SQENSSESQEEVVSESRGDNPDNTTSHSEDQEDSESSEEDSLDTPSSSESQSTEEQADSE 402

Query: 387 IVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTET------DEDEKENEEDDGESDGGR 440
             + + + E+   ES ED     +E    +   TE+       E +  +EEDD +S    
Sbjct: 403 SNESLSSSEESP-ESTEDENSSSQEGLQSHSASTESRSQESQSEQDSRSEEDDSDSQDSS 461

Query: 441 R 441
           R
Sbjct: 462 R 462


>gnl|CDD|177755 PLN00152, PLN00152, DNA-directed RNA polymerase; Provisional.
          Length = 130

 Score = 28.7 bits (64), Expect = 3.4
 Identities = 19/53 (35%), Positives = 24/53 (45%), Gaps = 2/53 (3%)

Query: 392 DNDED--KQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGGRRR 442
            ND D   +DE  E   EEG EE SE+    E      ENE+ + +    R R
Sbjct: 2   YNDMDMGYEDEPSEPEIEEGAEEDSESNNNDEVKGSIGENEDKEEQEPVQRPR 54


>gnl|CDD|220284 pfam09538, FYDLN_acid, Protein of unknown function (FYDLN_acid).
           Members of this family are bacterial proteins with a
           conserved motif [KR]FYDLN, sometimes flanked by a pair
           of CXXC motifs, followed by a long region of low
           complexity sequence in which roughly half the residues
           are Asp and Glu, including multiple runs of five or more
           acidic residues. The function of members of this family
           is unknown.
          Length = 104

 Score = 28.0 bits (63), Expect = 3.5
 Identities = 9/56 (16%), Positives = 29/56 (51%), Gaps = 3/56 (5%)

Query: 378 DYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDD 433
           D   + + E  ++    +D  D+ ++D   + +++  + +   + D+D  E+++D+
Sbjct: 52  DAAKKDEDEEDEDDVVLDDDDDDDDDDDLPDLDDDDVDLD---DDDDDFLEDDDDE 104


>gnl|CDD|221668 pfam12619, MCM2_N, Mini-chromosome maintenance protein 2.  This
           domain family is found in eukaryotes, and is typically
           between 138 and 153 amino acids in length. The family is
           found in association with pfam00493. Mini-chromosome
           maintenance (MCM) proteins are essential for DNA
           replication. These proteins use ATPase activity to
           perform this function.
          Length = 145

 Score = 28.8 bits (65), Expect = 3.5
 Identities = 19/72 (26%), Positives = 35/72 (48%), Gaps = 10/72 (13%)

Query: 378 DYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEE-------EASENEYYTETDEDEKENE 430
           + + + + E+ D+ID D D++ E EEDGE+   +          E + Y E   D++ + 
Sbjct: 1   EDEPDEEEELEDDID-DLDEEAEEEEDGEDLFGDNMERDYRANPELDRYDEDGLDDESDY 59

Query: 431 EDDGESDGGRRR 442
           E+   +   RR 
Sbjct: 60  EELSAAA--RRA 69


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 30.0 bits (68), Expect = 3.6
 Identities = 10/111 (9%), Positives = 35/111 (31%)

Query: 324  VRSDFSLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDES 383
            V +         K     S            +  +  + +     + K ++  +    +S
Sbjct: 1278 VSAVQYSPPPPSKRPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKS 1337

Query: 384  DIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDG 434
               +     +   +        + +   E  ++    ++++++ E++EDD 
Sbjct: 1338 KTRVKQASASQSSRLLRRPRKKKSDSSSEDDDDSEVDDSEDEDDEDDEDDD 1388


>gnl|CDD|222800 PHA00458, PHA00458, single-stranded DNA-binding protein.
          Length = 233

 Score = 29.4 bits (66), Expect = 3.6
 Identities = 17/44 (38%), Positives = 26/44 (59%)

Query: 394 DEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESD 437
           ++D  DE EEDG    E +A + +   E DED++E  E+D + D
Sbjct: 189 EDDWADEDEEDGYVASEAKARKPQDEEEWDEDDEEESEEDEDGD 232


>gnl|CDD|223042 PHA03322, PHA03322, tegument protein VP11/12; Provisional.
          Length = 674

 Score = 29.6 bits (66), Expect = 3.8
 Identities = 29/146 (19%), Positives = 50/146 (34%), Gaps = 16/146 (10%)

Query: 301 VHDFHSKKLLRKDYMQSRLTSILVRSDFSLENETKKEEQEHSEEK-GDSDEKYDPEHFEM 359
           +H    K  L     +SR   +LVR   S +          ++   G   +K  P ++ +
Sbjct: 487 IHMDSPKSPLSISSFKSRARRVLVRPSTSTDTTRNDPSYWFADGDDGRIRDKTYPLNYTL 546

Query: 360 NQVTREN-YAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEY 418
           N+ T E   +   T+++    D+ ++       D     + ES +   E  E   S   Y
Sbjct: 547 NKATYEGPVSGPGTSEDGEYVDETNN-------DTATGPKCESPDHVYENIEPAESNYSY 599

Query: 419 YTETDED-------EKENEEDDGESD 437
             +TD D             D G   
Sbjct: 600 ENDTDFDSVASDLGRNHRSTDGGSPV 625


>gnl|CDD|227494 COG5165, POB3, Nucleosome-binding factor SPN, POB3 subunit
           [Transcription / DNA replication, recombination, and
           repair / Chromatin structure and dynamics].
          Length = 508

 Score = 29.6 bits (66), Expect = 3.9
 Identities = 16/78 (20%), Positives = 31/78 (39%), Gaps = 3/78 (3%)

Query: 365 ENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQ-DESEEDGEEEGEEE--ASENEYYTE 421
           E +  SK  K  ++   E     +  I + ED     + E+ E E E+    S+++   E
Sbjct: 429 EQFLHSKGIKARNEEVQERLQTDLGSISDSEDINMGSAGEEDESEDEDFQMVSDSDVAEE 488

Query: 422 TDEDEKENEEDDGESDGG 439
            D     ++ + G  +  
Sbjct: 489 YDLQAALSDAEGGSDEER 506


>gnl|CDD|220377 pfam09747, DUF2052, Coiled-coil domain containing protein
           (DUF2052).  This entry is of sequences of two conserved
           domains separated by a region of low complexity,
           spanning some 200 residues. The function is unknown.
          Length = 178

 Score = 28.9 bits (65), Expect = 4.0
 Identities = 13/49 (26%), Positives = 23/49 (46%)

Query: 387 IVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGE 435
           +   +D +E + +  E+  EEE  EE  E+      +E+E+      GE
Sbjct: 60  LESSLDREEARLELLEQQEEEESAEEEEEDPEEENEEEEEEYQRGPFGE 108



 Score = 28.5 bits (64), Expect = 5.6
 Identities = 21/71 (29%), Positives = 27/71 (38%), Gaps = 5/71 (7%)

Query: 377 SDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASE-----NEYYTETDEDEKENEE 431
           S  D E     + E   +E+  +E EED EEE EEE  E          E D D  +   
Sbjct: 62  SSLDREEARLELLEQQEEEESAEEEEEDPEEENEEEEEEYQRGPFGEEEEEDGDSYDELP 121

Query: 432 DDGESDGGRRR 442
              E +  R  
Sbjct: 122 TPEEREELREE 132


>gnl|CDD|220605 pfam10156, Med17, Subunit 17 of Mediator complex.  This Mediator
           complex subunit was formerly known as Srb4 in yeasts or
           Trap80 in Drosophila and human. The Med17 subunit is
           located within the head domain and is essential for cell
           viability to the extent that a mutant strain of
           cerevisiae lacking it shows all RNA polymerase
           II-dependent transcription ceasing at non-permissive
           temperatures.
          Length = 454

 Score = 29.6 bits (67), Expect = 4.1
 Identities = 14/38 (36%), Positives = 19/38 (50%)

Query: 389 DEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDE 426
           +EI  +  K D SEE  EEE +EE   ++     D  E
Sbjct: 56  EEIAKEAAKIDFSEESDEEEDDEEDDNDDSEENKDTVE 93


>gnl|CDD|219293 pfam07093, SGT1, SGT1 protein.  This family consists of several
           eukaryotic SGT1 proteins. Human SGT1 or hSGT1 is known
           to suppress GCR2 and is highly expressed in the muscle
           and heart. The function of this family is unknown
           although it has been speculated that SGT1 may be
           functionally analogous to the Gcr2p protein of
           Saccharomyces cerevisiae which is known to be a
           regulatory factor of glycolytic gene expression.
          Length = 557

 Score = 29.3 bits (66), Expect = 4.9
 Identities = 12/47 (25%), Positives = 26/47 (55%)

Query: 389 DEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGE 435
           DEIDND     +  ++ ++E ++E  ++   +  +E E+  ++ D E
Sbjct: 475 DEIDNDLPDDSDDADEDDDEDDDEDEDSSSDSTLEELEEYMDQMDAE 521



 Score = 28.5 bits (64), Expect = 7.7
 Identities = 25/106 (23%), Positives = 44/106 (41%), Gaps = 9/106 (8%)

Query: 337 EEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDI--EIVDEIDND 394
           E+ +  ++  D D  +D + F       +N    K ++  +D  D+SD   E  DE D+D
Sbjct: 442 EDDDEPDDSEDKDVSFDEDEFFEFL---KNMLGLKDDEIDNDLPDDSDDADEDDDE-DDD 497

Query: 395 EDKQDESE-EDGEEEGEEEASENEYY--TETDEDEKENEEDDGESD 437
           ED+   S+    E E   +  + E      ++  +  N    G  D
Sbjct: 498 EDEDSSSDSTLEELEEYMDQMDAELKQTDSSNNADISNSGSSGAED 543


>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain.  This
           family represents the C-terminus (approximately 300
           residues) of proteins that are involved as binding
           partners for Prp19 as part of the nuclear pore complex.
           The family in Drosophila is necessary for pre-mRNA
           splicing, and the human protein has been found in
           purifications of the spliceosome. In the past this
           family was thought, erroneously, to be associated with
           microfibrillin.
          Length = 277

 Score = 29.1 bits (65), Expect = 5.0
 Identities = 13/31 (41%), Positives = 20/31 (64%), Gaps = 1/31 (3%)

Query: 395 EDKQDESEEDGEEEGEEEASENEYYTETDED 425
           E  + E E++  EE EEE SE E  T++++D
Sbjct: 3   EVLELEEEDESGEE-EEEESEEEEETDSEDD 32



 Score = 28.3 bits (63), Expect = 7.3
 Identities = 26/109 (23%), Positives = 47/109 (43%), Gaps = 4/109 (3%)

Query: 331 ENETKKEEQEHSEEKGDSDEKYDPEHFE----MNQVTRENYAESKTNKNTSDYDDESDIE 386
           E+E+ +EE+E SEE+ ++D + D E         +  R    E +         +E    
Sbjct: 10  EDESGEEEEEESEEEEETDSEDDMEPRLKPVFTRKKDRITIQEREREAAKEKALEEEAKR 69

Query: 387 IVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGE 435
             +E   +  K  E E   E E ++  +  E   +  + + ENEE++ E
Sbjct: 70  KAEERKRETLKIVEEEVKKELELKKRNTLLEANIDDVDTDDENEEEEYE 118


>gnl|CDD|147685 pfam05663, DUF809, Protein of unknown function (DUF809).  This
           family consists of several proteins of unknown function
           Raphanus sativus (Radish) and Brassica napus (Rape).
          Length = 138

 Score = 28.2 bits (62), Expect = 5.1
 Identities = 15/36 (41%), Positives = 22/36 (61%)

Query: 395 EDKQDESEEDGEEEGEEEASENEYYTETDEDEKENE 430
           E K+++ E  GE EG+EE  E +   E  E++KE E
Sbjct: 98  EGKEEKKEGKGEIEGKEEKKEGKGEIEGKEEKKEVE 133


>gnl|CDD|131353 TIGR02300, FYDLN_acid, TIGR02300 family protein.  Members of this
           family are bacterial proteins with a conserved motif
           [KR]FYDLN, sometimes flanked by a pair of CXXC motifs,
           followed by a long region of low complexity sequence in
           which roughly half the residues are Asp and Glu,
           including multiple runs of five or more acidic residues.
           The function of members of this family is unknown.
          Length = 129

 Score = 28.0 bits (62), Expect = 5.2
 Identities = 19/92 (20%), Positives = 43/92 (46%), Gaps = 3/92 (3%)

Query: 350 EKYDPEHFEMNQVTRENYAESKTNKNTSD--YDDESDIEIVDEIDNDEDKQDESEEDGEE 407
           E++ PE    ++  R +  +        D   D+ +++  ++E D + D    +   G++
Sbjct: 34  EQFPPEEALKSRRVRADDEKEAVAVKEVDPEIDEAAELVSLEEADEEVDPGAPTPAGGDD 93

Query: 408 EGEEEASENEYYTET-DEDEKENEEDDGESDG 438
             E+E   ++    T  EDE++++ DD +  G
Sbjct: 94  LAEDEDIADDDDDVTFLEDEEDDDIDDEDIIG 125


>gnl|CDD|233044 TIGR00600, rad2, DNA excision repair protein (rad2).  All proteins
           in this family for which functions are known are flap
           endonucleases that generate the 3' incision next to DNA
           damage as part of nucleotide excision repair. This
           family is related to many other flap endonuclease
           families including the fen1 family. This family is based
           on the phylogenomic analysis of JA Eisen (1999, Ph.D.
           Thesis, Stanford University) [DNA metabolism, DNA
           replication, recombination, and repair].
          Length = 1034

 Score = 29.5 bits (66), Expect = 5.2
 Identities = 24/111 (21%), Positives = 46/111 (41%), Gaps = 10/111 (9%)

Query: 332 NETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEI 391
           N  + E  E  +E+ +SD  +     E++ V+     +  +    +D  +E+    V  I
Sbjct: 640 NPMEVEPMESEKEESESDGSFI----EVDSVSSTLELQVPSKSQPTDESEENAENKVASI 695

Query: 392 DNDEDKQ------DESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGES 436
           + +  K+      DESEED      EE  + + +    +D    E +  E+
Sbjct: 696 EGEHRKEIEDLLFDESEEDNIVGMIEEEKDADDFKNEWQDISLEELEALEA 746


>gnl|CDD|234055 TIGR02907, spore_VI_D, stage VI sporulation protein D.  SpoVID, the
           stage VI sporulation protein D, is restricted to
           endospore-forming members of the bacteria, all of which
           are found among the Firmicutes. It is widely distributed
           but not quite universal in this group. Between
           well-conserved N-terminal and C-terminal domains is a
           poorly conserved, low-complexity region of variable
           length, rich enough in glutamic acid to cause spurious
           BLAST search results unless a filter is used. The seed
           alignment for this model was trimmed, in effect, by
           choosing member sequences in which these regions are
           relatively short. SpoVID is involved in spore coat
           assembly by the mother cell compartment late in the
           process of sporulation [Cellular processes, Sporulation
           and germination].
          Length = 338

 Score = 28.7 bits (64), Expect = 5.7
 Identities = 20/116 (17%), Positives = 44/116 (37%), Gaps = 7/116 (6%)

Query: 322 ILVRSDFSLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDD 381
           + + +D S+    ++   +    + D +E+ +    E     +E  A  +   +    + 
Sbjct: 125 LQLTADISISGIQQENNLDAEPAREDEEEE-ESFSAEFEHPAQEETAGEEERTDEPKVEH 183

Query: 382 ESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESD 437
           E+  +     D+D D+   S        E    E+E     +E+  E  ED+ E +
Sbjct: 184 EAHEQHEQPADDDPDEWKISA------SEPFQLESEVEASPEEENYEEYEDETELE 233


>gnl|CDD|237047 PRK12298, obgE, GTPase CgtA; Reviewed.
          Length = 390

 Score = 28.7 bits (65), Expect = 5.9
 Identities = 22/140 (15%), Positives = 48/140 (34%), Gaps = 30/140 (21%)

Query: 305 HSKKLLRKDYMQSRLTSILV--RSDFSLENETKKEEQEHSEEKGDSDEKYDPEHFEMNQV 362
           +S KL  K          LV  + D   E E ++  +   E  G          + ++  
Sbjct: 269 YSPKLAEKP-------RWLVFNKIDLLDEEEAEERAKAIVEALGWEGP-----VYLISAA 316

Query: 363 TRENYAESKTNKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEE---EGEEEASENEYY 419
           +     E                +++  I+ +  ++ E  E  E+     ++   E    
Sbjct: 317 SGLGVKELC-------------WDLMTFIEENPREEAEEAEAPEKVEFMWDDYHREQLEE 363

Query: 420 TETDEDEKENEEDDGESDGG 439
            E ++D+  +++ D + D G
Sbjct: 364 VEEEDDDDWDDDWDEDDDEG 383


>gnl|CDD|227504 COG5177, COG5177, Uncharacterized conserved protein [Function
           unknown].
          Length = 769

 Score = 28.9 bits (64), Expect = 6.0
 Identities = 18/67 (26%), Positives = 27/67 (40%), Gaps = 3/67 (4%)

Query: 369 ESKTNKNTSDYDD---ESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDED 425
           E + +   +D +      D +   E DN+E   DE     + EG EE S  E   +  E 
Sbjct: 396 EEEEDGQCNDEESTMSAIDDDDPKENDNEEVAGDEESAIDDNEGFEELSPEEEERQLREF 455

Query: 426 EKENEED 432
               +ED
Sbjct: 456 RDMEKED 462


>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
           includes archaeal L12p, the protein that is functionally
           equivalent to L7/L12 in bacteria and the P1 and P2
           proteins in eukaryotes. L12p is homologous to P1 and P2
           but is not homologous to bacterial L7/L12. It is located
           in the L12 stalk, with proteins L10, L11, and 23S rRNA.
           L12p is the only protein in the ribosome to occur as
           multimers, always appearing as sets of dimers. Recent
           data indicate that most archaeal species contain six
           copies of L12p (three homodimers), while eukaryotes have
           four copies (two heterodimers), and bacteria may have
           four or six copies (two or three homodimers), depending
           on the species. The organization of proteins within the
           stalk has been characterized primarily in bacteria,
           where L7/L12 forms either two or three homodimers and
           each homodimer binds to the extended C-terminal helix of
           L10. L7/L12 is attached to the ribosome through L10 and
           is the only ribosomal protein that does not directly
           interact with rRNA. Archaeal L12p is believed to
           function in a similar fashion. However, hybrid ribosomes
           containing the large subunit from E. coli with an
           archaeal stalk are able to bind archaeal and eukaryotic
           elongation factors but not bacterial elongation factors.
           In several mesophilic and thermophilic archaeal species,
           the binding of 23S rRNA to protein L11 and to the
           L10/L12p pentameric complex was found to be
           temperature-dependent and cooperative.
          Length = 106

 Score = 27.5 bits (61), Expect = 6.2
 Identities = 9/26 (34%), Positives = 17/26 (65%)

Query: 390 EIDNDEDKQDESEEDGEEEGEEEASE 415
           E   +E ++++ +E+ +EE EEEA  
Sbjct: 75  EEKAEEKEEEKKKEEEKEEEEEEALA 100


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 29.0 bits (64), Expect = 6.7
 Identities = 26/121 (21%), Positives = 58/121 (47%), Gaps = 3/121 (2%)

Query: 324  VRSDFSLENETKKEEQEHSEE---KGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYD 380
            V   +  E + K EE + +EE   K +  +K + E  ++ Q+ ++   E K  +     +
Sbjct: 1597 VMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAE 1656

Query: 381  DESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGGR 440
            +E+ I+  +E    E+ + ++EE  + E +E+ +      E +E +K  E    E++  +
Sbjct: 1657 EENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKK 1716

Query: 441  R 441
            +
Sbjct: 1717 K 1717


>gnl|CDD|221429 pfam12118, SprA-related, SprA-related family.  This protein is
           found in bacteria. Proteins in this family are typically
           between 234 to 465 amino acids in length. There is a
           conserved GEV sequence motif.Most members are annotated
           as being SprA-related.
          Length = 261

 Score = 28.6 bits (64), Expect = 6.9
 Identities = 15/57 (26%), Positives = 21/57 (36%), Gaps = 2/57 (3%)

Query: 331 ENETKKE-EQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIE 386
           ENE + E  QE  EE  + DE  +      NQ+T        T +     +    I 
Sbjct: 206 ENEARAELAQERQEEASEEDEA-EALAGAENQITALEPLAPSTPQVDGQVEATGAIA 261


>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
          Length = 106

 Score = 27.2 bits (61), Expect = 7.0
 Identities = 11/22 (50%), Positives = 17/22 (77%)

Query: 394 DEDKQDESEEDGEEEGEEEASE 415
           +E K++E EE+ +EE EEEA+ 
Sbjct: 78  EEKKEEEEEEEEKEESEEEAAA 99



 Score = 26.8 bits (60), Expect = 8.2
 Identities = 10/35 (28%), Positives = 16/35 (45%), Gaps = 3/35 (8%)

Query: 405 GEEEGEEEASENEYYTETDEDEKENEEDDGESDGG 439
                 EE  E E   E +E+++E+EE+     G 
Sbjct: 72  AAAAAAEEKKEEE---EEEEEKEESEEEAAAGLGA 103


>gnl|CDD|218803 pfam05904, DUF863, Plant protein of unknown function (DUF863).
           This family consists of a number of hypothetical
           proteins from Arabidopsis thaliana and Oryza sativa. The
           function of this family is unknown.
          Length = 766

 Score = 28.9 bits (64), Expect = 7.0
 Identities = 23/122 (18%), Positives = 36/122 (29%), Gaps = 22/122 (18%)

Query: 337 EEQEHSEEKGDSDEKYDPEHFEMNQVTRE------NYAESKTNKNTSDYDDES---DIEI 387
           EE +   E     +K D    E+ ++  E         ES  +  T            E+
Sbjct: 521 EEDQKRSEVLSPQQKDDDSSDELIKIAAESIVAICRDVESAASSLTDALSKSPLHWFAEL 580

Query: 388 VDEIDNDEDKQDESEEDGEEE-------------GEEEASENEYYTETDEDEKENEEDDG 434
                 D + + + E   E                 EE  E EY  +    E  N E+ G
Sbjct: 581 ASSSKEDLEDKIDKEGCVESSSSDEIDYFEAMTLQLEETKEEEYMPKPLVPENLNFEETG 640

Query: 435 ES 436
            +
Sbjct: 641 TA 642


>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family.  This family of
           proteins contain a band 4.1 domain (pfam00373), at their
           amino terminus. This family represents the rest of these
           proteins.
          Length = 244

 Score = 28.2 bits (63), Expect = 7.2
 Identities = 17/80 (21%), Positives = 31/80 (38%), Gaps = 1/80 (1%)

Query: 331 ENETKKEEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDE 390
           E ET++ +QE  E +   +             T   +  +  N       D++  E   +
Sbjct: 102 EAETRQLQQELREAQEAHERARQELLEAAAAPTAPPHVAAPVN-GEQLEPDDNGEEASAD 160

Query: 391 IDNDEDKQDESEEDGEEEGE 410
           ++ D D +D SEE+     E
Sbjct: 161 LETDPDMKDRSEEERVTYAE 180


>gnl|CDD|217956 pfam04194, PDCD2_C, Programmed cell death protein 2, C-terminal
           putative domain. 
          Length = 165

 Score = 27.8 bits (62), Expect = 7.3
 Identities = 14/56 (25%), Positives = 22/56 (39%), Gaps = 7/56 (12%)

Query: 378 DYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDD 433
           +Y+      + +E +  E    E  E     G+E+        ETDE EK   +D 
Sbjct: 12  EYETLHAPSLPEEPEEREIDAFEELEKEGGGGKEDE-------ETDESEKAKSDDK 60


>gnl|CDD|220972 pfam11081, DUF2890, Protein of unknown function (DUF2890).  This
           family is conserved in dsDNA adenoviruses of
           vertebrates. The function is not known.
          Length = 172

 Score = 28.0 bits (62), Expect = 7.4
 Identities = 14/43 (32%), Positives = 21/43 (48%)

Query: 395 EDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESD 437
           E+  D   E+ EE+ EE     +   E DE+ +E EE+   S 
Sbjct: 21  EEDWDSQAEEVEEDEEEMEDWEDSLDEEDEEAEEVEEETAASS 63


>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
           [Translation, ribosomal structure and biogenesis].
          Length = 109

 Score = 27.0 bits (60), Expect = 7.7
 Identities = 8/38 (21%), Positives = 15/38 (39%), Gaps = 5/38 (13%)

Query: 397 KQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDG 434
               +  +   E +E   E     E +E+ +E  +DD 
Sbjct: 71  AAAAAGAEAAAEADEAEEE-----EKEEEAEEESDDDM 103


>gnl|CDD|240413 PTZ00423, PTZ00423, glideosome-associated protein 45; Provisional.
          Length = 193

 Score = 28.2 bits (62), Expect = 7.8
 Identities = 16/51 (31%), Positives = 26/51 (50%), Gaps = 1/51 (1%)

Query: 388 VDEIDNDEDKQDESEEDGEEEGEEEASE-NEYYTETDEDEKENEEDDGESD 437
           +DE+   E  + E EE  E++ E+   E  +   E  E E+ENEE   + +
Sbjct: 18  IDELAEREKLKKEVEEIPEQKPEDIVEELEDQPEEPPEQEEENEEQKPKEE 68


>gnl|CDD|185626 PTZ00447, PTZ00447, apical membrane antigen 1-like protein;
           Provisional.
          Length = 508

 Score = 28.4 bits (63), Expect = 8.0
 Identities = 22/80 (27%), Positives = 39/80 (48%), Gaps = 6/80 (7%)

Query: 360 NQVTRENYAESKTNKNTSDYDDE--SDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENE 417
           +++ +E Y +  + + T+   DE  +D+   DEI  D    DE E D  +E E E    +
Sbjct: 403 HKMIKEMYVQKDSLETTAGERDEFKNDVLEEDEISGDNIGPDEIENDHYQEKEIENDNIQ 462

Query: 418 YYTETDEDEKENEEDDGESD 437
           Y    DE++ E + D   ++
Sbjct: 463 Y----DENKNEGKNDHIPNN 478


>gnl|CDD|215521 PLN02967, PLN02967, kinase.
          Length = 581

 Score = 28.5 bits (63), Expect = 8.3
 Identities = 27/107 (25%), Positives = 39/107 (36%), Gaps = 3/107 (2%)

Query: 334 TKKEEQEHSEEKGDSDEKY--DPEHFEMNQVTRENYAESKTNKNTSDYDDESDIEIVDEI 391
           TKK   E SE   +  E    +    +            K    +SD ++E   + V + 
Sbjct: 81  TKKTVVEISEPLEEGSELVVNEDAALDKESKKTPRRTRRKAAAASSDVEEEKTEKKVRK- 139

Query: 392 DNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDG 438
                K DE  ED   E E    E   +  + E+E E E D  + DG
Sbjct: 140 RRKVKKMDEDVEDQGSESEVSDVEESEFVTSLENESEEELDLEKDDG 186


>gnl|CDD|227492 COG5163, NOP7, Protein required for biogenesis of the 60S ribosomal
           subunit [Translation, ribosomal structure and
           biogenesis].
          Length = 591

 Score = 28.5 bits (63), Expect = 8.4
 Identities = 17/67 (25%), Positives = 28/67 (41%), Gaps = 1/67 (1%)

Query: 365 ENYAESKT-NKNTSDYDDESDIEIVDEIDNDEDKQDESEEDGEEEGEEEASENEYYTETD 423
           ENY   K    + S +      +    +   E+ Q  SEED     E+   E+    E D
Sbjct: 435 ENYCVGKRLPPHLSPFASVDSYDPRASLMTMEETQRHSEEDLVNRFEDVRYEHVAGEEDD 494

Query: 424 EDEKENE 430
           +D++E +
Sbjct: 495 DDDEELQ 501


>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P.  This model
           represents the L12P protein of the large (50S) subunit
           of the archaeal ribosome.
          Length = 105

 Score = 26.9 bits (60), Expect = 9.2
 Identities = 12/35 (34%), Positives = 17/35 (48%), Gaps = 3/35 (8%)

Query: 405 GEEEGEEEASENEYYTETDEDEKENEEDDGESDGG 439
                EEE  E E   E +E+E+E+EE+     G 
Sbjct: 71  AAAAAEEEEEEEE---EEEEEEEESEEEAMAGLGA 102


>gnl|CDD|218576 pfam05395, DARPP-32, Protein phosphatase inhibitor 1/DARPP-32.
           This family consists of several mammalian protein
           phosphatase inhibitor 1 (IPP-1) and dopamine- and
           cAMP-regulated neuronal phosphoprotein (DARPP-32)
           proteins. Protein phosphatase inhibitor-1 is involved in
           signal transduction and is an endogenous inhibitor of
           protein phosphatase-1. It has been demonstrated that
           DARPP-32, if phosphorylated, can inhibit
           protein-phosphatase-1. DARPP-32 has a key role in many
           neurotransmitter pathways throughout the brain and has
           been shown to be involved in controlling receptors, ion
           channels and other physiological factors including the
           brain's response to drugs of abuse, such as cocaine,
           opiates and nicotine. DARPP-32 is reciprocally regulated
           by the two neurotransmitters that are most often
           implicated in schizophrenia - dopamine and glutamate.
           Dopamine activates DARPP-32 through the D1 receptor
           pathway and disables DARPP-32 through the D2 receptor.
           Glutamate, acting through the N-methyl-d-aspartate
           receptor, renders DARPP-32 inactive. A mutant form of
           DARPP-32 has been linked with gastric cancers.
          Length = 170

 Score = 27.6 bits (61), Expect = 9.5
 Identities = 10/54 (18%), Positives = 16/54 (29%)

Query: 391 IDNDEDKQDESEEDGEEEGEEEASENEYYTETDEDEKENEEDDGESDGGRRRGS 444
             + E+ Q   EED  +  +E          T+  + E        D     G 
Sbjct: 94  QGSLEENQASEEEDELDSPDELGYPTGETESTEAQDSEEPGSCPTQDAQSEVGL 147


>gnl|CDD|215541 PLN03020, PLN03020, low-temperature-induced protein; Provisional.
          Length = 556

 Score = 28.1 bits (62), Expect = 10.0
 Identities = 19/80 (23%), Positives = 34/80 (42%), Gaps = 9/80 (11%)

Query: 337 EEQEHSEEKGDSDEKYDPEHFEMNQVTRENYAESKTNKNT-------SDYDDESDIEIVD 389
           EE  H+     + E  D  H E   V ++   ++K  K+T         +D+    +  D
Sbjct: 10  EEDPHNVGLHSATEGED--HHEKKSVLKKVKDKAKKIKDTITKHGHGHHHDEHRVPDDHD 67

Query: 390 EIDNDEDKQDESEEDGEEEG 409
             + D+D+ +E  ED +  G
Sbjct: 68  LYEEDDDEDEEMAEDPQIHG 87


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.311    0.131    0.379 

Gapped
Lambda     K      H
   0.267   0.0696    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 22,230,928
Number of extensions: 2140521
Number of successful extensions: 6086
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3913
Number of HSP's successfully gapped: 567
Length of query: 444
Length of database: 10,937,602
Length adjustment: 100
Effective length of query: 344
Effective length of database: 6,502,202
Effective search space: 2236757488
Effective search space used: 2236757488
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 61 (27.3 bits)