RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy12224
         (904 letters)



>gnl|CDD|214744 smart00611, SEC63, Domain of unknown function in Sec63p, Brr2p and
           other proteins. 
          Length = 312

 Score =  179 bits (456), Expect = 3e-50
 Identities = 70/268 (26%), Positives = 118/268 (44%), Gaps = 32/268 (11%)

Query: 109 GMWWYKSIRYTGDKVLL-ETINFYYAFFQITPHMALKRVLMILGASLEFDRRFNSEIIER 167
           G+W     R      +   TI  +     + P M  K +L IL  S EFD+         
Sbjct: 1   GIWPTDLGRIASYYYISYTTIRTF--NELLKPKMTTKDLLRILSMSSEFDQ--------- 49

Query: 168 PSDEIEVPQLIRQIPNLGEKNRERPLYHKYSIKARALIYAHLSNMQLTSETLEADRMYIV 227
                E   L+ ++        E P      +KA  L+ AHLS ++L S  LE+D +Y++
Sbjct: 50  IPVRHEEDLLLEELAEKLPIRLENPSLDDPHVKANLLLQAHLSRLKLPSFALESDTVYVL 109

Query: 228 KKCPYLLQEMVTCISQLILLAYAQRVPRLIHIETLENVMKLCPMIVQGMWDFKNPLLQLP 287
           +    LLQ MV    +   L+ A             N + L  MI+Q +W   +PLLQLP
Sbjct: 110 QNAGRLLQAMVDIALERGWLSTA------------LNALNLSQMIIQALWPTDSPLLQLP 157

Query: 288 YVTDDHLKHFICKKRYIKSLQQFAQMKNEERRSVVKFMNDEQYSDMLKVLGNMPYVDLHV 347
           ++ ++ LK    +K+ + SL+   ++++EER  ++  + D +   + KVL  +P +++ +
Sbjct: 158 HLPEEILKR--LEKKKVLSLEDLLELEDEERGELLG-LLDAEGERVYKVLSRLPKLNIEI 214

Query: 348 QPEVIDDEATTEYTAGAIITVTCTLMRK 375
             E I          G  +T+T  L   
Sbjct: 215 SLEPITRT-----VLGVEVTLTVDLTWD 237



 Score = 83.5 bits (207), Expect = 2e-17
 Identities = 27/73 (36%), Positives = 36/73 (49%), Gaps = 1/73 (1%)

Query: 556 PEDKQEFWWIYISDRKSRTLLTSPYH-ITELVDQEQVQLKFTAPRWPGVYTFSVCLRCDS 614
              KQE WW+ I D     LL      + +    E+V+L FTAP   G Y +++ L  DS
Sbjct: 240 IHGKQEGWWLVIGDSDGNELLHIERFSLNKKNVSEEVKLDFTAPATEGNYQYTLRLVSDS 299

Query: 615 YLGFDQMQDIKLD 627
           YLG DQ   +  D
Sbjct: 300 YLGCDQEYPLSFD 312


>gnl|CDD|217270 pfam02889, Sec63, Sec63 Brl domain.  This domain (also known as the
           Brl domain) is required for assembly of functional
           endoplasmic reticulum translocons.
          Length = 309

 Score =  143 bits (362), Expect = 1e-37
 Identities = 61/255 (23%), Positives = 116/255 (45%), Gaps = 40/255 (15%)

Query: 105 PTAVGM---WWYKSIRYTGDKVLLETINFYYAFFQITPHMALKRVLMILGASLEFDRRFN 161
           PT +G     +Y         +L ET+  +     + P+  LK +L IL ++ EF+    
Sbjct: 1   PTELGRIASHYY---------ILYETMETF--NTSLKPNTTLKDLLEILSSASEFE---- 45

Query: 162 SEIIERPSDEIEVPQLIRQIPN-LGEKNRERPLYHKYSIKARALIYAHLSNMQLTSETLE 220
            EI  R  ++ E+ +L+ ++P  +   + + P       K   L+ A+LS ++L   +L 
Sbjct: 46  -EIPVREEEKKELKKLLERVPIPVKSPSIDDP-----HAKVNLLLQAYLSRLKLPDFSLV 99

Query: 221 ADRMYIVKKCPYLLQEMVTCISQLILLAYAQRVPRLIHIETLENVMKLCPMIVQGMWDFK 280
           +D  YI++    LL+ +       I L+          +    N ++L  MI Q +W   
Sbjct: 100 SDLNYILQNAGRLLRALF-----EIALSKG-------WLSPALNALELSKMIEQRLWPSD 147

Query: 281 NPLLQLPYVTDDHLKHFICKKRYIKSLQQFAQMKNEERRSVVKFMNDEQYSDMLKVLGNM 340
           +PL QLP++  + LK    +K+ I SL+    +  EE   ++   N +Q  D+ + +   
Sbjct: 148 SPLRQLPHIPPEVLKK--LEKKGISSLEDLMDLDPEELGELLG-NNPKQGKDIAEFVNRF 204

Query: 341 PYVDLHVQPEVIDDE 355
           P +++  + + I  E
Sbjct: 205 PRLEIEAEVQPITRE 219



 Score = 40.7 bits (96), Expect = 0.002
 Identities = 23/79 (29%), Positives = 35/79 (44%), Gaps = 4/79 (5%)

Query: 550 VHCPRFPEDKQEFWWIYISDRKSRTLLTSPYHITELVDQEQVQLKFTAPRWPGVYTFSVC 609
               RF   K E +WI + D  +  LL      T    +++ +L+FT P   G     + 
Sbjct: 233 EWDERFH-GKSEGFWIVVEDSDNNELLAIER-FTLNKKKDEHELEFTVPLSGGPLPPQLT 290

Query: 610 LR--CDSYLGFDQMQDIKL 626
           +R   DS+LG DQ   + L
Sbjct: 291 IRLVSDSWLGCDQEVPVSL 309


>gnl|CDD|227694 COG5407, SEC63, Preprotein translocase subunit Sec63 [Intracellular
           trafficking and secretion].
          Length = 610

 Score =  120 bits (301), Expect = 6e-28
 Identities = 107/554 (19%), Positives = 190/554 (34%), Gaps = 96/554 (17%)

Query: 13  GASEASIKKAYRKQSLILHPDK---ETGD-----EKAFMKLTKAYQALTDEESRRNFEKY 64
             SE  IKK YR  S+  HPDK      +     E+ +  +TKAY  LTD++ R N+  Y
Sbjct: 109 DTSERDIKKRYRMLSMKFHPDKAPPMVNELRSEYEEKYKTITKAYGLLTDKKRRENYLNY 168

Query: 65  GNPDGPGAMSFGIALPSYIVEKENSVWVLGLYALVFMVALPTAVGMWWYKSIRYTGDKVL 124
           G PD P   S GIALP  IV  E S++   +Y+L+  V LP  V  WW +   YT   V 
Sbjct: 169 GTPDSPQHTSEGIALPKVIVRSERSMYAFVMYSLLLGVFLPYWVYRWWREIRDYTKVGVH 228

Query: 125 LETINFYYAFFQITPHMALKRVLMILGASLEFDRRFNSEIIERPSDEIEVPQLIRQIPNL 184
             T+  +Y   +I   + L  +L +  +S E  R                       P  
Sbjct: 229 FVTMEMFYE--RIDGSLTLDELLSLFASSKELTRMN---------------------PKG 265

Query: 185 GEKNRERPLYHKYSIKARALIYAHLSNMQLTSETLEADRMYIVKKCPYLLQEMVTCISQL 244
            +   E+            L+  HL N      ++E +   I      LL  ++      
Sbjct: 266 PKCTLEK------------LLGDHL-NRA---RSVEFNEYRIKSNVEGLLGALLR----- 304

Query: 245 ILLAYAQRVPRLIHIETLENVMKLCPMIVQGMWDFKNPLLQLPYVTDDHLKHFICKKRYI 304
           I   +A  +         + +++  P+      D    +LQLP    ++ +      R I
Sbjct: 305 IASNFAFPLKECGKENKGQYIVQAIPL------DHLFRILQLPRSDVEYAQRV--SLRLI 356

Query: 305 KSLQQFAQMKNEERRSVVKFMNDEQYSDMLKVLGNMPYVDLHVQPEVIDDEATTEYTAGA 364
           + ++    +  +   +V+K +   +  ++  V   +P   L   P ++ ++      +  
Sbjct: 357 EGMKAIGSLIAKRYGNVLKNLVVLELMEIQAVADGIPRYFLLQAPFLVFEDLFITEKSKE 416

Query: 365 IITVTCTLMRKPMSVLFGDDTIKVQFLEPSKEGEGEIEEEEEKPNGTITPVVNKEKQQTH 424
            +++   L   P              +E S      IE    K + + +P        T 
Sbjct: 417 RVSLKGYLGAIPGPEHRTSALNVYNQVEISDFEASVIETGAIKNDSSDSPYSEAPDFATR 476

Query: 425 RPV--WMKNKK---------GGKKKKFTKNKHDKKSSGQ--------------------- 452
                 ++  K         GG     +K   D++  G                      
Sbjct: 477 NDSEWAVRVAKCEQTVYIIPGGSIATVSKVTLDRRIQGDVAPVDKTGGKQLIVHAPFMVG 536

Query: 453 ----KKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKN 508
               K I           +  K++  D  E+  +V    +     S  +++  V  +E  
Sbjct: 537 AFSVKWIVMLTVDNVIYGEQLKIQVVDDFEKKANVRFSIDVGELKSTCDDEVFVGDEEDL 596

Query: 509 EDSNNKSEESSDDD 522
                K  ++  ++
Sbjct: 597 SRYVEKDTDTKVEE 610



 Score = 30.7 bits (69), Expect = 3.9
 Identities = 14/161 (8%), Positives = 43/161 (26%), Gaps = 7/161 (4%)

Query: 509 EDSNNKSEESSDDDDDDWEKYQTGLNKRDK-VLEGRSKQSHSVHCPRFPEDKQEFWWIYI 567
           E   +     +  +  D+E         +   ++  S  S     P F       W + +
Sbjct: 431 EHRTSALNVYNQVEISDFEASV-----IETGAIKNDSSDSPYSEAPDFATRNDSEWAVRV 485

Query: 568 SDRKSRTLLTSPYHITEL-VDQEQVQLKFTAPRWPGVYTFSVCLRCDSYLGFDQMQDIKL 626
           +  +    +     I  +       +++             + +     +G   ++ I +
Sbjct: 486 AKCEQTVYIIPGGSIATVSKVTLDRRIQGDVAPVDKTGGKQLIVHAPFMVGAFSVKWIVM 545

Query: 627 DVKEAPEVPTEHPQWEMSGEEDEGDEEMGGSDVSEFTTDED 667
              +      +     +   E + +           +T +D
Sbjct: 546 LTVDNVIYGEQLKIQVVDDFEKKANVRFSIDVGELKSTCDD 586


>gnl|CDD|214946 smart00973, Sec63, Sec63 Brl domain.  This domain was named after
           the yeast Sec63 (or NPL1) (also known as the Brl domain)
           protein in which it was found. This protein is required
           for assembly of functional endoplasmic reticulum
           translocons. Other yeast proteins containing this domain
           include pre-mRNA splicing helicase BRR2, HFM1 protein
           and putative helicases.
          Length = 314

 Score = 85.5 bits (212), Expect = 6e-18
 Identities = 59/256 (23%), Positives = 104/256 (40%), Gaps = 39/256 (15%)

Query: 105 PTAVGM---WWYKSIRYTGDKVLLETINFYYAFFQITPHMALKRVLMILGASLEFDRRFN 161
           PT +G    ++Y  I Y       ETI  +     + P   LK +L IL  + EF     
Sbjct: 1   PTELGRIASYYY--ISY-------ETIETFNQ--SLKPTTTLKDILEILSRASEFK---- 45

Query: 162 SEIIERPSDEIEVPQLIRQIP-NLGEKNRERPLYHKYSIKARALIYAHLSNMQLTSETLE 220
            EI  R +++ E+ +L +++P  + E   + P       K   L+ AHLS + L    L 
Sbjct: 46  -EIPVRHNEKKELNELNKRVPIPVKEGIIDSP-----HAKVNLLLQAHLSRLPLPDFDLV 99

Query: 221 ADRMYIVKKCPYLLQEMVTCISQLILLAYAQRVPRLIHIETLENVMKLCPMIVQGMWDFK 280
           +D  YI++  P +L+ +V        L  A             N + L  M+VQ +W+  
Sbjct: 100 SDLKYILQNAPRILRALVDIALSKGWLRTAL------------NALDLSQMVVQRLWEDS 147

Query: 281 N-PLLQLPYVTDDHLKHFICKKRYIKSLQQFAQMKNEERRSVVKFMNDEQYSDMLKVLGN 339
           + PL QLP+   + +   +  K   +S +    M   E    +  +         ++L  
Sbjct: 148 DSPLKQLPHFLIEDVYDKLELKDGSRSFELLLDMNAAELGEFLNRLPPNGRLIY-ELLRR 206

Query: 340 MPYVDLHVQPEVIDDE 355
            P +++  +   I  +
Sbjct: 207 FPKIEVEAEVLPITRD 222



 Score = 50.4 bits (121), Expect = 2e-06
 Identities = 34/150 (22%), Positives = 56/150 (37%), Gaps = 5/150 (3%)

Query: 481 KDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDDWEKYQTGLNKRDKVL 540
            ++   S     L D+   E+ +   +   +     E          + +     RD  L
Sbjct: 166 LELKDGSRSFELLLDMNAAELGEFLNRLPPNGRLIYELLRRFPKIEVEAEVLPITRDLTL 225

Query: 541 EGRSKQS--HSVHCPRFPEDKQEFWWIYISDRKSRTLLTSPY-HITELVDQEQVQLKFTA 597
               + +   +   PR  + K E WW+ + D  +  LL      + +     +V+L FT 
Sbjct: 226 RVELEITPVFAWDLPR-HKGKSESWWLVVGDSDTNELLAIKRVTLRKKKKSNEVKLDFTV 284

Query: 598 PR-WPGVYTFSVCLRCDSYLGFDQMQDIKL 626
           P   PG   ++V L  DSYLG DQ     L
Sbjct: 285 PLSEPGPENYTVYLISDSYLGCDQEVSFSL 314


>gnl|CDD|215804 pfam00226, DnaJ, DnaJ domain.  DnaJ domains (J-domains) are
          associated with hsp70 heat-shock system and it is
          thought that this domain mediates the interaction.
          DnaJ-domain is therefore part of a chaperone (protein
          folding) system. The T-antigens, although not in
          Prosite are confirmed as DnaJ containing domains from
          literature.
          Length = 63

 Score = 66.8 bits (164), Expect = 7e-14
 Identities = 25/54 (46%), Positives = 34/54 (62%), Gaps = 3/54 (5%)

Query: 11 PSGASEASIKKAYRKQSLILHPDKETGD---EKAFMKLTKAYQALTDEESRRNF 61
          P  AS+  IKKAYRK +L  HPDK  GD   E+ F ++ +AY+ L+D E R  +
Sbjct: 9  PRDASDEEIKKAYRKLALKYHPDKNPGDPAAEEKFKEINEAYEVLSDPEKRAIY 62


>gnl|CDD|223560 COG0484, DnaJ, DnaJ-class molecular chaperone with C-terminal Zn
          finger domain [Posttranslational modification, protein
          turnover, chaperones].
          Length = 371

 Score = 67.7 bits (166), Expect = 8e-12
 Identities = 28/73 (38%), Positives = 41/73 (56%), Gaps = 7/73 (9%)

Query: 11 PSGASEASIKKAYRKQSLILHPDKETGDEKA---FMKLTKAYQALTDEESRRNFEKYG-- 65
             ASE  IKKAYRK +   HPD+  GD++A   F ++ +AY+ L+D E R  ++++G  
Sbjct: 13 SKDASEEEIKKAYRKLAKKYHPDRNPGDKEAEEKFKEINEAYEVLSDPEKRAAYDQFGHA 72

Query: 66 --NPDGPGAMSFG 76
               G G   FG
Sbjct: 73 GFKAGGFGGFGFG 85


>gnl|CDD|99751 cd06257, DnaJ, DnaJ domain or J-domain.  DnaJ/Hsp40 (heat shock
          protein 40) proteins are highly conserved and play
          crucial roles in protein translation, folding,
          unfolding, translocation, and degradation. They act
          primarily by stimulating the ATPase activity of Hsp70s,
          an important chaperonine family. Hsp40 proteins are
          characterized by the presence of a J domain, which
          mediates the interaction with Hsp70. They may contain
          other domains as well, and the architectures provide a
          means of classification.
          Length = 55

 Score = 59.9 bits (146), Expect = 1e-11
 Identities = 22/47 (46%), Positives = 30/47 (63%), Gaps = 3/47 (6%)

Query: 11 PSGASEASIKKAYRKQSLILHPDKETGDEKA---FMKLTKAYQALTD 54
          P  AS+  IKKAYRK +L  HPDK   D +A   F ++ +AY+ L+D
Sbjct: 9  PPDASDEEIKKAYRKLALKYHPDKNPDDPEAEEKFKEINEAYEVLSD 55


>gnl|CDD|197617 smart00271, DnaJ, DnaJ molecular chaperone homology domain. 
          Length = 60

 Score = 59.9 bits (146), Expect = 2e-11
 Identities = 24/51 (47%), Positives = 31/51 (60%), Gaps = 4/51 (7%)

Query: 11 PSGASEASIKKAYRKQSLILHPDKETGD----EKAFMKLTKAYQALTDEES 57
          P  AS   IKKAYRK +L  HPDK  GD    E+ F ++ +AY+ L+D E 
Sbjct: 10 PRDASLDEIKKAYRKLALKYHPDKNPGDKEEAEEKFKEINEAYEVLSDPEK 60


>gnl|CDD|237658 PRK14284, PRK14284, chaperone protein DnaJ; Provisional.
          Length = 391

 Score = 64.5 bits (157), Expect = 8e-11
 Identities = 32/69 (46%), Positives = 45/69 (65%), Gaps = 6/69 (8%)

Query: 13 GASEASIKKAYRKQSLILHPDKETGD---EKAFMKLTKAYQALTDEESRRNFEKYGNPDG 69
           AS   IKKAYRK ++  HPDK  GD   EK F ++++AY+ L+D + R ++++YG  DG
Sbjct: 12 TASPEEIKKAYRKLAVKYHPDKNPGDAEAEKRFKEVSEAYEVLSDAQKRESYDRYGK-DG 70

Query: 70 P--GAMSFG 76
          P  GA  FG
Sbjct: 71 PFAGAGGFG 79


>gnl|CDD|184611 PRK14297, PRK14297, chaperone protein DnaJ; Provisional.
          Length = 380

 Score = 64.0 bits (156), Expect = 1e-10
 Identities = 27/67 (40%), Positives = 44/67 (65%), Gaps = 3/67 (4%)

Query: 13 GASEASIKKAYRKQSLILHPDKETGDEKA---FMKLTKAYQALTDEESRRNFEKYGNPDG 69
          GAS+  IKKA+RK ++  HPDK  G+++A   F ++ +AYQ L+D + +  ++++G  D 
Sbjct: 15 GASDDEIKKAFRKLAIKYHPDKNKGNKEAEEKFKEINEAYQVLSDPQKKAQYDQFGTADF 74

Query: 70 PGAMSFG 76
           GA  FG
Sbjct: 75 NGAGGFG 81


>gnl|CDD|233829 TIGR02349, DnaJ_bact, chaperone protein DnaJ.  This model
          represents bacterial forms of DnaJ, part of the
          DnaK-DnaJ-GrpE chaperone system. The three components
          typically are encoded by consecutive genes. DnaJ
          homologs occur in many genomes, typically not near DnaK
          and GrpE-like genes; most such genes are not included
          by this family. Eukaryotic (mitochondrial and
          chloroplast) forms are not included in the scope of
          this family.
          Length = 354

 Score = 61.8 bits (151), Expect = 5e-10
 Identities = 25/68 (36%), Positives = 38/68 (55%), Gaps = 3/68 (4%)

Query: 11 PSGASEASIKKAYRKQSLILHPD--KETGDEKAFMKLTKAYQALTDEESRRNFEKYGNPD 68
             ASE  IKKAYRK +   HPD  K+   E+ F ++ +AY+ L+D E R  ++++G+  
Sbjct: 9  SKDASEEEIKKAYRKLAKKYHPDRNKDKEAEEKFKEINEAYEVLSDPEKRAQYDQFGH-A 67

Query: 69 GPGAMSFG 76
          G      G
Sbjct: 68 GFNGGGGG 75


>gnl|CDD|237653 PRK14276, PRK14276, chaperone protein DnaJ; Provisional.
          Length = 380

 Score = 62.0 bits (151), Expect = 6e-10
 Identities = 27/65 (41%), Positives = 39/65 (60%), Gaps = 4/65 (6%)

Query: 14 ASEASIKKAYRKQSLILHPD--KETGDEKAFMKLTKAYQALTDEESRRNFEKYGNPDGPG 71
          AS+  IKKAYRK S   HPD  KE G E+ + ++ +AY+ L+D + R  +++YG     G
Sbjct: 16 ASQDEIKKAYRKLSKKYHPDINKEPGAEEKYKEVQEAYETLSDPQKRAAYDQYGAAGANG 75

Query: 72 AMSFG 76
             FG
Sbjct: 76 G--FG 78


>gnl|CDD|237662 PRK14292, PRK14292, chaperone protein DnaJ; Provisional.
          Length = 371

 Score = 61.8 bits (150), Expect = 6e-10
 Identities = 29/71 (40%), Positives = 40/71 (56%), Gaps = 8/71 (11%)

Query: 3  GKSRTPLKPSGASEASIKKAYRKQSLILHPD--KETGDEKAFMKLTKAYQALTDEESRRN 60
          G SRT      AS   IK AYRK +L  HPD  KE G  + F ++ +AY  L+D E R +
Sbjct: 9  GVSRT------ASADEIKSAYRKLALKYHPDRNKEKGAAEKFAQINEAYAVLSDAEKRAH 62

Query: 61 FEKYGNPDGPG 71
          ++++G   G G
Sbjct: 63 YDRFGTAPGAG 73


>gnl|CDD|240236 PTZ00037, PTZ00037, DnaJ_C chaperone protein; Provisional.
          Length = 421

 Score = 59.4 bits (144), Expect = 4e-09
 Identities = 22/52 (42%), Positives = 37/52 (71%), Gaps = 1/52 (1%)

Query: 14 ASEASIKKAYRKQSLILHPDKETGDEKAFMKLTKAYQALTDEESRRNFEKYG 65
           + + IKKAYRK ++  HPDK  GD + F ++++AY+ L+D E R+ +++YG
Sbjct: 40 CTTSEIKKAYRKLAIKHHPDK-GGDPEKFKEISRAYEVLSDPEKRKIYDEYG 90


>gnl|CDD|236757 PRK10767, PRK10767, chaperone protein DnaJ; Provisional.
          Length = 371

 Score = 58.6 bits (143), Expect = 7e-09
 Identities = 28/71 (39%), Positives = 42/71 (59%), Gaps = 8/71 (11%)

Query: 13 GASEASIKKAYRKQSLILHPDKETGDEKA---FMKLTKAYQALTDEESRRNFEKYG---- 65
           ASE  IKKAYRK ++  HPD+  GD++A   F ++ +AY+ L+D + R  +++YG    
Sbjct: 15 NASEDEIKKAYRKLAMKYHPDRNPGDKEAEEKFKEIKEAYEVLSDPQKRAAYDQYGHAAF 74

Query: 66 NPDGPGAMSFG 76
             G G   FG
Sbjct: 75 EQGGGGG-GFG 84


>gnl|CDD|237656 PRK14280, PRK14280, chaperone protein DnaJ; Provisional.
          Length = 376

 Score = 57.0 bits (138), Expect = 2e-08
 Identities = 27/74 (36%), Positives = 43/74 (58%), Gaps = 10/74 (13%)

Query: 13 GASEASIKKAYRKQSLILHPD--KETGDEKAFMKLTKAYQALTDEESRRNFEKYGNPD-- 68
           AS+  IKKAYRK S   HPD  KE G ++ F ++++AY+ L+D++ R  ++++G+    
Sbjct: 15 SASKDEIKKAYRKLSKKYHPDINKEEGADEKFKEISEAYEVLSDDQKRAQYDQFGHAGPN 74

Query: 69 ------GPGAMSFG 76
                G G   FG
Sbjct: 75 QGFGGGGFGGGDFG 88


>gnl|CDD|184599 PRK14277, PRK14277, chaperone protein DnaJ; Provisional.
          Length = 386

 Score = 56.0 bits (135), Expect = 5e-08
 Identities = 26/73 (35%), Positives = 42/73 (57%), Gaps = 7/73 (9%)

Query: 11 PSGASEASIKKAYRKQSLILHPDKETGDEKA---FMKLTKAYQALTDEESRRNFEKYG-- 65
             A+E  IKKAYR+ +   HPD   GD++A   F ++ +AY+ L+D + R  ++++G  
Sbjct: 14 DRNATEEEIKKAYRRLAKKYHPDLNPGDKEAEQKFKEINEAYEILSDPQKRAQYDQFGHA 73

Query: 66 --NPDGPGAMSFG 76
            +P G G   FG
Sbjct: 74 AFDPGGFGQGGFG 86


>gnl|CDD|237664 PRK14294, PRK14294, chaperone protein DnaJ; Provisional.
          Length = 366

 Score = 54.4 bits (131), Expect = 1e-07
 Identities = 23/55 (41%), Positives = 37/55 (67%), Gaps = 3/55 (5%)

Query: 14 ASEASIKKAYRKQSLILHPDKETGDEKA---FMKLTKAYQALTDEESRRNFEKYG 65
          ASE  IKK+YRK ++  HPD+  GD++A   F +  +AY+ L+D + R  +++YG
Sbjct: 16 ASEEEIKKSYRKLAMKYHPDRNPGDKEAEELFKEAAEAYEVLSDPKKRGIYDQYG 70


>gnl|CDD|184612 PRK14298, PRK14298, chaperone protein DnaJ; Provisional.
          Length = 377

 Score = 54.1 bits (130), Expect = 2e-07
 Identities = 23/58 (39%), Positives = 36/58 (62%), Gaps = 2/58 (3%)

Query: 11 PSGASEASIKKAYRKQSLILHPD--KETGDEKAFMKLTKAYQALTDEESRRNFEKYGN 66
             AS   IKKAYRK ++  HPD  KE   E+ F ++++AY  L+D E R  ++++G+
Sbjct: 14 SKDASVEDIKKAYRKLAMKYHPDKNKEPDAEEKFKEISEAYAVLSDAEKRAQYDRFGH 71


>gnl|CDD|184604 PRK14283, PRK14283, chaperone protein DnaJ; Provisional.
          Length = 378

 Score = 53.3 bits (128), Expect = 3e-07
 Identities = 22/56 (39%), Positives = 38/56 (67%), Gaps = 2/56 (3%)

Query: 13 GASEASIKKAYRKQSLILHPD--KETGDEKAFMKLTKAYQALTDEESRRNFEKYGN 66
           A +  IKKAYRK +   HPD  +E G E+ F ++++AY  L+D+E R+ ++++G+
Sbjct: 16 NADKKEIKKAYRKLARKYHPDVSEEEGAEEKFKEISEAYAVLSDDEKRQRYDQFGH 71


>gnl|CDD|237661 PRK14291, PRK14291, chaperone protein DnaJ; Provisional.
          Length = 382

 Score = 53.6 bits (129), Expect = 3e-07
 Identities = 21/57 (36%), Positives = 35/57 (61%), Gaps = 2/57 (3%)

Query: 11 PSGASEASIKKAYRKQSLILHPD--KETGDEKAFMKLTKAYQALTDEESRRNFEKYG 65
             A++  IKKAYR+ +   HPD  K    E+ F ++ +AYQ L+D E R+ ++++G
Sbjct: 12 SRNATQEEIKKAYRRLARKYHPDFNKNPEAEEKFKEINEAYQVLSDPEKRKLYDQFG 68


>gnl|CDD|237657 PRK14281, PRK14281, chaperone protein DnaJ; Provisional.
          Length = 397

 Score = 53.3 bits (128), Expect = 3e-07
 Identities = 21/55 (38%), Positives = 38/55 (69%), Gaps = 3/55 (5%)

Query: 14 ASEASIKKAYRKQSLILHPDKETGDEKA---FMKLTKAYQALTDEESRRNFEKYG 65
          A +  IKKAYRK +L  HPDK   +++A   F ++ +AY+ L++++ RR ++++G
Sbjct: 15 ADKDEIKKAYRKLALKYHPDKNPDNKEAEEHFKEVNEAYEVLSNDDKRRRYDQFG 69


>gnl|CDD|172773 PRK14285, PRK14285, chaperone protein DnaJ; Provisional.
          Length = 365

 Score = 53.5 bits (128), Expect = 3e-07
 Identities = 26/66 (39%), Positives = 41/66 (62%), Gaps = 3/66 (4%)

Query: 13 GASEASIKKAYRKQSLILHPDKETGDEKA---FMKLTKAYQALTDEESRRNFEKYGNPDG 69
          GAS+  IKKAYRK ++  HPDK  G+++A   F + T+AY+ L D+  R  ++++G+   
Sbjct: 14 GASKDEIKKAYRKIAIKYHPDKNKGNKEAESIFKEATEAYEVLIDDNKRAQYDRFGHTAF 73

Query: 70 PGAMSF 75
           G   F
Sbjct: 74 EGGGGF 79


>gnl|CDD|237654 PRK14278, PRK14278, chaperone protein DnaJ; Provisional.
          Length = 378

 Score = 52.7 bits (127), Expect = 5e-07
 Identities = 26/63 (41%), Positives = 37/63 (58%), Gaps = 4/63 (6%)

Query: 13 GASEASIKKAYRKQSLILHPDKETGDEKA---FMKLTKAYQALTDEESRRNFEKYGNPDG 69
           AS+A IK+AYRK +  LHPD    DE+A   F +++ AY+ L+D E RR  +  G+P  
Sbjct: 14 NASDAEIKRAYRKLARELHPDV-NPDEEAQEKFKEISVAYEVLSDPEKRRIVDLGGDPLE 72

Query: 70 PGA 72
             
Sbjct: 73 SAG 75


>gnl|CDD|237655 PRK14279, PRK14279, chaperone protein DnaJ; Provisional.
          Length = 392

 Score = 52.8 bits (127), Expect = 5e-07
 Identities = 22/54 (40%), Positives = 31/54 (57%), Gaps = 3/54 (5%)

Query: 12 SGASEASIKKAYRKQSLILHPDKETGDEKA---FMKLTKAYQALTDEESRRNFE 62
          S AS   IKKAYRK +  LHPD   GD  A   F  +++A+  L+D   R+ ++
Sbjct: 19 SDASAEEIKKAYRKLARELHPDANPGDPAAEERFKAVSEAHDVLSDPAKRKEYD 72


>gnl|CDD|237665 PRK14295, PRK14295, chaperone protein DnaJ; Provisional.
          Length = 389

 Score = 52.5 bits (126), Expect = 6e-07
 Identities = 31/76 (40%), Positives = 44/76 (57%), Gaps = 10/76 (13%)

Query: 11 PSGASEASIKKAYRKQSLILHPDKETGDEKA---FMKLTKAYQALTDEESRRNFEK---- 63
          P  A+EA IKKAYRK +   HPD   GD KA   F ++++AY  L+DE+ R+ +++    
Sbjct: 18 PKDATEAEIKKAYRKLAREYHPDANKGDAKAEERFKEISEAYDVLSDEKKRKEYDEARSL 77

Query: 64 YGNP---DGPGAMSFG 76
          +GN     GPG    G
Sbjct: 78 FGNGGFRPGPGGGGGG 93


>gnl|CDD|237667 PRK14299, PRK14299, chaperone protein DnaJ; Provisional.
          Length = 291

 Score = 51.5 bits (123), Expect = 9e-07
 Identities = 24/61 (39%), Positives = 35/61 (57%), Gaps = 2/61 (3%)

Query: 11 PSGASEASIKKAYRKQSLILHPD--KETGDEKAFMKLTKAYQALTDEESRRNFEKYGNPD 68
          P  AS+  IKKA++K +   HPD  K  G E+ F ++ +AY  L+D E RR ++ YG   
Sbjct: 13 PKNASQDEIKKAFKKLARKYHPDVNKSPGAEEKFKEINEAYTVLSDPEKRRIYDTYGTTA 72

Query: 69 G 69
           
Sbjct: 73 A 73


>gnl|CDD|172774 PRK14286, PRK14286, chaperone protein DnaJ; Provisional.
          Length = 372

 Score = 51.1 bits (122), Expect = 2e-06
 Identities = 25/68 (36%), Positives = 40/68 (58%), Gaps = 4/68 (5%)

Query: 13 GASEASIKKAYRKQSLILHPDKETGD---EKAFMKLTKAYQALTDEESRRNFEKYGNPD- 68
           A++  IK AYRK ++  HPDK  G+   E+ F + T+AY+ L D + R+ ++++G    
Sbjct: 15 SANDEEIKSAYRKLAIKYHPDKNKGNKESEEKFKEATEAYEILRDPKKRQAYDQFGKAGV 74

Query: 69 GPGAMSFG 76
            GA  FG
Sbjct: 75 NAGAGGFG 82


>gnl|CDD|237660 PRK14289, PRK14289, chaperone protein DnaJ; Provisional.
          Length = 386

 Score = 51.0 bits (122), Expect = 2e-06
 Identities = 24/62 (38%), Positives = 39/62 (62%), Gaps = 3/62 (4%)

Query: 14 ASEASIKKAYRKQSLILHPDKETGDEKA---FMKLTKAYQALTDEESRRNFEKYGNPDGP 70
          A+   IKKAYRK+++  HPDK  GD++A   F +  +AY  L+D + R  ++++G+    
Sbjct: 17 ATVDEIKKAYRKKAIQYHPDKNPGDKEAEEKFKEAAEAYDVLSDPDKRSRYDQFGHAGVG 76

Query: 71 GA 72
          GA
Sbjct: 77 GA 78


>gnl|CDD|237668 PRK14301, PRK14301, chaperone protein DnaJ; Provisional.
          Length = 373

 Score = 50.5 bits (121), Expect = 2e-06
 Identities = 26/66 (39%), Positives = 38/66 (57%), Gaps = 3/66 (4%)

Query: 14 ASEASIKKAYRKQSLILHPDKETGD---EKAFMKLTKAYQALTDEESRRNFEKYGNPDGP 70
          ASE  IKKAYRK +L  HPD+   +   E+ F +  +AY+ L D E R  ++++G+    
Sbjct: 16 ASEDEIKKAYRKLALQYHPDRNPDNPEAEQKFKEAAEAYEVLRDAEKRARYDRFGHAGVN 75

Query: 71 GAMSFG 76
          G   FG
Sbjct: 76 GNGGFG 81


>gnl|CDD|237663 PRK14293, PRK14293, chaperone protein DnaJ; Provisional.
          Length = 374

 Score = 50.4 bits (121), Expect = 3e-06
 Identities = 23/65 (35%), Positives = 38/65 (58%), Gaps = 2/65 (3%)

Query: 14 ASEASIKKAYRKQSLILHPD--KETGDEKAFMKLTKAYQALTDEESRRNFEKYGNPDGPG 71
          A +  +K+AYR+ +   HPD  KE G E  F ++ +AY+ L+D E+R  ++++G     G
Sbjct: 15 ADKDELKRAYRRLARKYHPDVNKEPGAEDRFKEINRAYEVLSDPETRARYDQFGEAGVSG 74

Query: 72 AMSFG 76
          A  F 
Sbjct: 75 AAGFP 79


>gnl|CDD|225124 COG2214, CbpA, DnaJ-class molecular chaperone [Posttranslational
          modification, protein turnover, chaperones].
          Length = 237

 Score = 48.3 bits (114), Expect = 6e-06
 Identities = 26/59 (44%), Positives = 35/59 (59%), Gaps = 4/59 (6%)

Query: 11 PSGASEASIKKAYRKQSLILHPDKETGDEKA----FMKLTKAYQALTDEESRRNFEKYG 65
          P  AS   IKKAYRK +L  HPD+  GD K     F ++ +AY+ L+D E R  ++K G
Sbjct: 15 PPNASLEEIKKAYRKLALKYHPDRNPGDPKVAEEKFKEINEAYEILSDPERRAEYDKIG 73


>gnl|CDD|172776 PRK14288, PRK14288, chaperone protein DnaJ; Provisional.
          Length = 369

 Score = 47.4 bits (112), Expect = 2e-05
 Identities = 22/55 (40%), Positives = 38/55 (69%), Gaps = 3/55 (5%)

Query: 14 ASEASIKKAYRKQSLILHPDKETGDEKA---FMKLTKAYQALTDEESRRNFEKYG 65
          +++ +IKK+YRK +L  HPD+  GD++A   F  + +AY  L+DE+ R  +++YG
Sbjct: 15 SNQETIKKSYRKLALKYHPDRNAGDKEAEEKFKLINEAYGVLSDEKKRALYDRYG 69


>gnl|CDD|172778 PRK14290, PRK14290, chaperone protein DnaJ; Provisional.
          Length = 365

 Score = 45.3 bits (107), Expect = 9e-05
 Identities = 24/67 (35%), Positives = 41/67 (61%), Gaps = 5/67 (7%)

Query: 14 ASEASIKKAYRKQSLILHPDKETGD----EKAFMKLTKAYQALTDEESRRNFEKYGNPD- 68
          AS+  IKKA+R+ +   HPD   G+    E+ F ++++AY+ L+D + RR +++ G  D 
Sbjct: 15 ASQEDIKKAFRELAKKWHPDLHPGNKAEAEEKFKEISEAYEVLSDPQKRRQYDQTGTVDF 74

Query: 69 GPGAMSF 75
          G G  +F
Sbjct: 75 GAGGSNF 81


>gnl|CDD|172788 PRK14300, PRK14300, chaperone protein DnaJ; Provisional.
          Length = 372

 Score = 45.4 bits (107), Expect = 1e-04
 Identities = 22/55 (40%), Positives = 34/55 (61%), Gaps = 2/55 (3%)

Query: 14 ASEASIKKAYRKQSLILHPDK-ETGD-EKAFMKLTKAYQALTDEESRRNFEKYGN 66
          AS+A +KKAY K +   HPD  +  D EK F ++  AY  L DE+ R  ++++G+
Sbjct: 15 ASQADLKKAYLKLAKQYHPDTTDAKDAEKKFKEINAAYDVLKDEQKRAAYDRFGH 69


>gnl|CDD|237666 PRK14296, PRK14296, chaperone protein DnaJ; Provisional.
          Length = 372

 Score = 44.6 bits (105), Expect = 2e-04
 Identities = 19/65 (29%), Positives = 35/65 (53%), Gaps = 2/65 (3%)

Query: 14 ASEASIKKAYRKQSLILHPD--KETGDEKAFMKLTKAYQALTDEESRRNFEKYGNPDGPG 71
          ASE  I++AYRK +   HPD  K        +++ +A   L D++ R+ ++++G+    G
Sbjct: 16 ASEQEIRQAYRKLAKQYHPDLNKSPDAHDKMVEINEAADVLLDKDKRKQYDQFGHAAFDG 75

Query: 72 AMSFG 76
          +  F 
Sbjct: 76 SSGFS 80


>gnl|CDD|237659 PRK14287, PRK14287, chaperone protein DnaJ; Provisional.
          Length = 371

 Score = 44.6 bits (105), Expect = 2e-04
 Identities = 25/71 (35%), Positives = 39/71 (54%), Gaps = 8/71 (11%)

Query: 14 ASEASIKKAYRKQSLILHPD--KETGDEKAFMKLTKAYQALTDEESRRNFEKYGNPD--- 68
          AS   +KKAYRK +   HPD  K    E  F ++ +AY  L+D + + +++++G+ D   
Sbjct: 16 ASVDEVKKAYRKLARKYHPDVNKAPDAEDKFKEVKEAYDTLSDPQKKAHYDQFGHTDPNQ 75

Query: 69 ---GPGAMSFG 76
             G GA  FG
Sbjct: 76 GFGGGGAGDFG 86


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 43.7 bits (103), Expect = 3e-04
 Identities = 21/167 (12%), Positives = 62/167 (37%)

Query: 395 KEGEGEIEEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSGQKK 454
           ++ + E ++++EKP         KE+ +  RP   K K+  KK +  +++ ++K   + +
Sbjct: 111 EQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVR 170

Query: 455 ITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNK 514
             +  ++        K ++    E+ +  + E+ +        N+E   +++  +D    
Sbjct: 171 AKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREKEEDDGKDRETT 230

Query: 515 SEESSDDDDDDWEKYQTGLNKRDKVLEGRSKQSHSVHCPRFPEDKQE 561
           +    +D+     +     +   K  +     +           +  
Sbjct: 231 TSPMEEDESRQSSEISRRSSSSLKKPDPSPSMASPETRESSKRTETR 277



 Score = 31.4 bits (71), Expect = 2.5
 Identities = 25/144 (17%), Positives = 42/144 (29%), Gaps = 20/144 (13%)

Query: 387 KVQFLEPSKEGEGEIEEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHD 446
           K +  E     E E E+E++           +E +          KK  + +  ++ K  
Sbjct: 134 KEEAKEKRPPKEKEKEKEKKV----------EEPRDREEE-----KKRERVRAKSRPKKP 178

Query: 447 KKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKE 506
            K     K     E+ +      +  K    E   +     EE     D   D       
Sbjct: 179 PKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVN-----EEREKEEDDGKDRETTTSP 233

Query: 507 KNEDSNNKSEESSDDDDDDWEKYQ 530
             ED + +S E S       +K  
Sbjct: 234 MEEDESRQSSEISRRSSSSLKKPD 257



 Score = 31.0 bits (70), Expect = 3.5
 Identities = 25/182 (13%), Positives = 59/182 (32%), Gaps = 15/182 (8%)

Query: 392 EPSKEGEGEIEEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSG 451
                 + +  +E +  +G       ++ ++  +    K K+  K +K  +   +K+   
Sbjct: 85  SKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPP- 143

Query: 452 QKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVEND------------ 499
           ++K     ++ E P D  + +K++          +   +    + + +            
Sbjct: 144 KEKEKEKEKKVEEPRDREEEKKRERVRAKS--RPKKPPKKKPPNKKKEPPEEEKQRQAAR 201

Query: 500 EVVDKKEKNEDSNNKSEESSDDDDDDWEKYQTGLNKRDKVLEGRSKQSHSVHCPRFPEDK 559
           E V  K +  D N + E+  DD  D             +     S++S S      P   
Sbjct: 202 EAVKGKPEEPDVNEEREKEEDDGKDRETTTSPMEEDESRQSSEISRRSSSSLKKPDPSPS 261

Query: 560 QE 561
             
Sbjct: 262 MA 263


>gnl|CDD|234368 TIGR03835, termin_org_DnaJ, terminal organelle assembly protein
          TopJ.  This model describes TopJ (MG_200, CbpA), a DnaJ
          homolog and probable assembly protein of the Mycoplasma
          terminal organelle. The terminal organelle is involved
          in both cytadherence and gliding motility [Cellular
          processes, Chemotaxis and motility].
          Length = 871

 Score = 42.5 bits (99), Expect = 0.001
 Identities = 22/66 (33%), Positives = 33/66 (50%), Gaps = 4/66 (6%)

Query: 14 ASEASIKKAYRKQSLILHPD--KETGDEKAFMKLTKAYQALTDEESRRNFEKYGNP--DG 69
          A E  IKKA+RK +   HPD  K       F ++ +A   L++ + R N++KYG+   D 
Sbjct: 14 ADEQEIKKAFRKLAKKYHPDRNKAPDAASIFAEINEANDVLSNPKKRANYDKYGHDGVDR 73

Query: 70 PGAMSF 75
               F
Sbjct: 74 EDDFDF 79


>gnl|CDD|222986 PHA03102, PHA03102, Small T antigen; Reviewed.
          Length = 153

 Score = 40.0 bits (94), Expect = 0.001
 Identities = 13/46 (28%), Positives = 22/46 (47%), Gaps = 1/46 (2%)

Query: 15 SEASIKKAYRKQSLILHPDKETGDEKAFMKLTKAYQALTDEESRRN 60
          +   ++KAY ++ L  HPDK  GDE+   +L   Y+   +      
Sbjct: 20 NLPLMRKAYLRKCLEFHPDK-GGDEEKMKELNTLYKKFRESVKSLR 64


>gnl|CDD|184603 PRK14282, PRK14282, chaperone protein DnaJ; Provisional.
          Length = 369

 Score = 41.7 bits (98), Expect = 0.001
 Identities = 16/59 (27%), Positives = 34/59 (57%), Gaps = 4/59 (6%)

Query: 11 PSGASEASIKKAYRKQSLILHPDKETGD----EKAFMKLTKAYQALTDEESRRNFEKYG 65
             A++  IK+AY++     HPD+   +    E+ F ++ +AY+ L+D + R  ++++G
Sbjct: 13 SRNATQEEIKRAYKRLVKEWHPDRHPENRKEAEQKFKEIQEAYEVLSDPQKRAMYDRFG 71


>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi.  This family includes
           the fifth essential DNA polymerase in yeast EC:2.7.7.7.
           Pol5p is localised exclusively to the nucleolus and
           binds near or at the enhancer region of rRNA-encoding
           DNA repeating units.
          Length = 784

 Score = 41.0 bits (96), Expect = 0.003
 Identities = 27/105 (25%), Positives = 41/105 (39%), Gaps = 24/105 (22%)

Query: 437 KKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDV 496
           K    K++H +   G+++  + +E+T+          +D  E  +D  SESE ESD  D 
Sbjct: 633 KADENKSRHQQLFEGEEEDEDDLEETDD--------DEDECEAIED--SESESESDGEDG 682

Query: 497 ENDEVVDKKEKNEDSNNKSEESS--------------DDDDDDWE 527
           E DE  D  E NE      +                   D +D E
Sbjct: 683 EEDEQEDDAEANEGVVPIDKAVRRALPKVLNLPDALDGGDSEDEE 727



 Score = 29.8 bits (67), Expect = 8.6
 Identities = 18/59 (30%), Positives = 30/59 (50%), Gaps = 3/59 (5%)

Query: 467 DVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDD 525
           DV K ++  +  +      E EEE +    E D+  D+ E  EDS ++SE   +D ++D
Sbjct: 630 DVLKADENKSRHQQL---FEGEEEDEDDLEETDDDEDECEAIEDSESESESDGEDGEED 685


>gnl|CDD|220102 pfam09073, BUD22, BUD22.  BUD22 has been shown in yeast to be a
           nuclear protein involved in bud-site selection. It plays
           a role in positioning the proximal bud pole signal. More
           recently it has been shown to be involved in ribosome
           biogenesis.
          Length = 424

 Score = 40.2 bits (94), Expect = 0.004
 Identities = 24/116 (20%), Positives = 41/116 (35%), Gaps = 18/116 (15%)

Query: 431 NKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKK---------DTPEESK 481
             K  K K   K K  KK   ++      E+     D +K E+          +   +S+
Sbjct: 143 ETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSE 202

Query: 482 DV---------SSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDDWEK 528
           D          SS+ EE  +   +  +E   + E +E  +  SE  S  D ++   
Sbjct: 203 DYSQYDGMLVDSSDEEEGEEAPSINYNEDTSESESDESDSEISESRSVSDSEESSP 258



 Score = 31.3 bits (71), Expect = 2.7
 Identities = 23/110 (20%), Positives = 43/110 (39%), Gaps = 26/110 (23%)

Query: 435 GKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLS 494
           G + K  K K  KK+   KK     ++ +  +D    E+ ++ +ESK    ES E+    
Sbjct: 141 GIETKAKKGKAKKKTKKSKK-----KEAKESSDKDDEEESESEDESKS--EESAEDDSDD 193

Query: 495 DVE-------------------NDEVVDKKEKNEDSNNKSEESSDDDDDD 525
           + E                   ++E  ++      + + SE  SD+ D +
Sbjct: 194 EEEEDSDSEDYSQYDGMLVDSSDEEEGEEAPSINYNEDTSESESDESDSE 243



 Score = 31.0 bits (70), Expect = 3.1
 Identities = 18/73 (24%), Positives = 27/73 (36%), Gaps = 12/73 (16%)

Query: 478 EESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNK------------SEESSDDDDDD 525
            ES +  SE  E   +SD E      KK K + +++             SE+  DDD+D 
Sbjct: 235 SESDESDSEISESRSVSDSEESSPPSKKPKEKKTSSTFLPSLMGGYFSGSEDEDDDDEDI 294

Query: 526 WEKYQTGLNKRDK 538
                     + K
Sbjct: 295 DPDQVVKKPVKRK 307


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 40.4 bits (95), Expect = 0.004
 Identities = 27/133 (20%), Positives = 47/133 (35%)

Query: 393 PSKEGEGEIEEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSGQ 452
            +KE   E  E ++K    I  V+                   KKK  T  K     +  
Sbjct: 33  ITKEEIKEALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPA 92

Query: 453 KKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSN 512
           KK       +   A+      KD             +  D  D ++D+ +D  + ++D +
Sbjct: 93  KKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDD 152

Query: 513 NKSEESSDDDDDD 525
           ++ ++  DDDDD 
Sbjct: 153 DEDDDEDDDDDDV 165



 Score = 39.6 bits (93), Expect = 0.007
 Identities = 23/174 (13%), Positives = 45/174 (25%), Gaps = 18/174 (10%)

Query: 384 DTIKVQFLEPSKEGEGEIEEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKN 443
           + IK       K  E   +                E     +      K    K    K 
Sbjct: 36  EEIKEALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKK 95

Query: 444 KHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSE----------------- 486
             D+  S +K         +   +  K        +  D   +                 
Sbjct: 96  LKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDED 155

Query: 487 SEEESDLSDVENDEVVDKKEKNEDSN-NKSEESSDDDDDDWEKYQTGLNKRDKV 539
            +E+ D  DV++++   K+ K  +   +  +   D+DD +  +      K    
Sbjct: 156 DDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQARKDAKLTAT 209



 Score = 35.4 bits (82), Expect = 0.14
 Identities = 20/125 (16%), Positives = 35/125 (28%), Gaps = 2/125 (1%)

Query: 404 EEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTE 463
            + K  G IT    K +    +    +             K    ++         +   
Sbjct: 25  AKSKSKGFITKEEIK-EALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAA 83

Query: 464 APADVAKVEKKDTPEESKD-VSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDD 522
             A      KK   +E      +E +   D  D  N           D ++  ++  D D
Sbjct: 84  KAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLD 143

Query: 523 DDDWE 527
           DDD +
Sbjct: 144 DDDID 148


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 39.6 bits (92), Expect = 0.008
 Identities = 31/147 (21%), Positives = 51/147 (34%), Gaps = 15/147 (10%)

Query: 393 PSKEGEGEIEEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSGQ 452
              +  GE  +EE+          NK+K    +    K K+GG       +  D    G+
Sbjct: 227 DESDKGGEDGDEEKSKKKKKKLAKNKKKLDDDK----KGKRGGDDDADEYDSDDGDDEGR 282

Query: 453 KKI-----------TNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEV 501
           ++                E   +P   AK E +   +  +    ++EEE  LS       
Sbjct: 283 EEDYISDSSASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLK 342

Query: 502 VDKKEKNEDSNNKSEESSDDDDDDWEK 528
             K +KN    + S+   D DD D + 
Sbjct: 343 KLKGKKNGLDKDDSDSGDDSDDSDIDG 369



 Score = 35.3 bits (81), Expect = 0.14
 Identities = 23/119 (19%), Positives = 46/119 (38%), Gaps = 7/119 (5%)

Query: 432 KKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEES 491
            +   KKK  K   +KK     K      +     D  + +  D  +E ++    S+  +
Sbjct: 237 DEEKSKKKKKKLAKNKKKLDDDK----KGKRGGDDDADEYDSDDGDDEGREEDYISDSSA 292

Query: 492 DLSDVENDEVV---DKKEKNEDSNNKSEESSDDDDDDWEKYQTGLNKRDKVLEGRSKQS 547
             +D E  E     +   K E   ++  E S+++ ++ E   +   K+ K L+G+    
Sbjct: 293 SGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGL 351



 Score = 30.7 bits (69), Expect = 4.2
 Identities = 24/129 (18%), Positives = 48/129 (37%), Gaps = 17/129 (13%)

Query: 401 IEEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSGQKKITNTVE 460
            EE EE+          ++K      + M   K  K         D+++ G+K       
Sbjct: 161 AEEAEEEMK-------RRKKTANGFQLMMM--KAAKNGPAAFGDEDEETEGEK------G 205

Query: 461 QTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSD 520
                 D+   + +   E+  D S +  E+ D  + ++ +   K  KN+   +  ++   
Sbjct: 206 GGGRGKDLKIKDLEGDDEDDGDESDKGGEDGD--EEKSKKKKKKLAKNKKKLDDDKKGKR 263

Query: 521 DDDDDWEKY 529
             DDD ++Y
Sbjct: 264 GGDDDADEY 272


>gnl|CDD|215527 PLN02976, PLN02976, amine oxidase.
          Length = 1713

 Score = 39.5 bits (92), Expect = 0.009
 Identities = 33/163 (20%), Positives = 56/163 (34%), Gaps = 22/163 (13%)

Query: 394 SKEGEGEIE----EEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKS 449
           SK G   I+     ++++P G++  +                 K  K  K  K   +   
Sbjct: 10  SKRGLKVIQIGDDSDDDEPIGSLFKL-----------------KRPKNSKKVKVGLESTG 52

Query: 450 SGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNE 509
             ++K++   E +E   D     +K      K V S S   S  + +EN E VD    NE
Sbjct: 53  KREEKLSALDEDSEGMDDTLASFRKRLKGPKKGVGSVSARMSQGALLENQEEVDTV-LNE 111

Query: 510 DSNNKSEESSDDDDDDWEKYQTGLNKRDKVLEGRSKQSHSVHC 552
            S    + + +D    + +       R        KQ + V  
Sbjct: 112 GSKRSLDGNLEDSLSAFLREAQAGLSRKSCANSCGKQLNRVQG 154


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
           family consists of several hypothetical bacterial
           proteins of around 200 residues in length. The function
           of this family is unknown.
          Length = 214

 Score = 38.2 bits (89), Expect = 0.009
 Identities = 16/98 (16%), Positives = 41/98 (41%), Gaps = 9/98 (9%)

Query: 440 FTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVEND 499
           F  +    +++  ++     +  E        E ++  EE K+ ++  ++E      + D
Sbjct: 35  FFPSSPSDQAAADEQEAKKSDDQET------AEIEEVKEEEKEAANSEDKEDKGDAEKED 88

Query: 500 EVV---DKKEKNEDSNNKSEESSDDDDDDWEKYQTGLN 534
           E     +++E  E S+   +E+ +  + + EK  T  +
Sbjct: 89  EESEEENEEEDEESSDENEKETEEKTESNVEKEITNPS 126



 Score = 34.3 bits (79), Expect = 0.19
 Identities = 16/89 (17%), Positives = 32/89 (35%), Gaps = 1/89 (1%)

Query: 428 WMKNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSES 487
                    +    + +  KKS  Q+       + E        +K+D  +  K+     
Sbjct: 34  LFFPSSPSDQAAADEQE-AKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESE 92

Query: 488 EEESDLSDVENDEVVDKKEKNEDSNNKSE 516
           EE  +  +  +DE   + E+  +SN + E
Sbjct: 93  EENEEEDEESSDENEKETEEKTESNVEKE 121


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 39.2 bits (91), Expect = 0.014
 Identities = 50/260 (19%), Positives = 101/260 (38%), Gaps = 41/260 (15%)

Query: 451  GQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNED 510
            G + ITNT+ + +   ++A  E     +   D S   E ESD++ V  D VV + E N D
Sbjct: 3823 GAEDITNTLNEDDDLEELANEEDTAN-QSDLDESEARELESDMNGVTKDSVVSENE-NSD 3880

Query: 511  SNNKSEESSDDDDDDWEKYQTGLNKR-------DKVLEGRSKQSHSVHC----------- 552
            S  ++++  ++ +D  E     LN++       + +LE   K +                
Sbjct: 3881 SEEENQDLDEEVNDIPEDLSNSLNEKLWDEPNEEDLLETEQKSNEQSAANNESDLVSKED 3940

Query: 553  -PRFPEDKQEFWWIYISDRKSRTLLTSPYHITELVDQEQVQLKFTAPRWPGVYTFSVC-- 609
              +  EDK         +++    ++    I + + Q  +Q   + P  P      +   
Sbjct: 3941 DNKALEDKDR------QEKEDEEEMSDDVGIDDEI-QPDIQENNSQPP-PENEDLDLPED 3992

Query: 610  LRCDSYLGFDQ----MQDIKLDVKEAPEVPTEHPQWEMSGEEDEG------DEEMGGSDV 659
            L+ D   G       ++D+ ++  +  +   +  + E   +ED        DE++   D 
Sbjct: 3993 LKLDEKEGDVSKDSDLEDMDMEAADENKEEADAEKDEPMQDEDPLEENNTLDEDIQQDDF 4052

Query: 660  SEFTTDEDVEDDLKLEPCAQ 679
            S+   D++  ++   E   Q
Sbjct: 4053 SDLAEDDEKMNEDGFEENVQ 4072



 Score = 36.5 bits (84), Expect = 0.085
 Identities = 24/129 (18%), Positives = 50/129 (38%), Gaps = 4/129 (3%)

Query: 402  EEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQ 461
            EEE +  +  +  +        +  +W +  +    +   K+     ++ +  + +  + 
Sbjct: 3882 EEENQDLDEEVNDIPEDLSNSLNEKLWDEPNEEDLLETEQKSNEQSAANNESDLVSKEDD 3941

Query: 462  TEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEK---NEDSNNKSEES 518
             +A  D  + EK+D  E S DV  + E + D+ +  N +   + E     ED     +E 
Sbjct: 3942 NKALEDKDRQEKEDEEEMSDDVGIDDEIQPDIQE-NNSQPPPENEDLDLPEDLKLDEKEG 4000

Query: 519  SDDDDDDWE 527
                D D E
Sbjct: 4001 DVSKDSDLE 4009



 Score = 33.4 bits (76), Expect = 0.78
 Identities = 32/160 (20%), Positives = 61/160 (38%), Gaps = 5/160 (3%)

Query: 391  LEPSKEGEGEIEEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKH---DK 447
            LE     E E EEE     G    +    ++   +P          +      K     K
Sbjct: 3945 LEDKDRQEKEDEEEMSDDVGIDDEIQPDIQENNSQPPPENEDLDLPEDLKLDEKEGDVSK 4004

Query: 448  KSSGQKKITNTVEQTEAPADVAKVE--KKDTPEESKDVSSESEEESDLSDVENDEVVDKK 505
             S  +       ++ +  AD  K E  + + P E  +   E  ++ D SD+  D+    +
Sbjct: 4005 DSDLEDMDMEAADENKEEADAEKDEPMQDEDPLEENNTLDEDIQQDDFSDLAEDDEKMNE 4064

Query: 506  EKNEDSNNKSEESSDDDDDDWEKYQTGLNKRDKVLEGRSK 545
            +  E++  ++EES++D     E+ + G    D+ ++   K
Sbjct: 4065 DGFEENVQENEESTEDGVKSDEELEQGEVPEDQAIDNHPK 4104



 Score = 30.0 bits (67), Expect = 7.8
 Identities = 15/68 (22%), Positives = 31/68 (45%), Gaps = 1/68 (1%)

Query: 460  EQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESS 519
             Q +  +D+A+ ++K   E+  + + +  EES    V++DE +++ E  ED    +    
Sbjct: 4047 IQQDDFSDLAEDDEKMN-EDGFEENVQENEESTEDGVKSDEELEQGEVPEDQAIDNHPKM 4105

Query: 520  DDDDDDWE 527
            D       
Sbjct: 4106 DAKSTFAS 4113


>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM).  This
           family consists of several Plasmodium falciparum SPAM
           (secreted polymorphic antigen associated with
           merozoites) proteins. Variation among SPAM alleles is
           the result of deletions and amino acid substitutions in
           non-repetitive sequences within and flanking the alanine
           heptad-repeat domain. Heptad repeats in which the a and
           d position contain hydrophobic residues generate
           amphipathic alpha-helices which give rise to helical
           bundles or coiled-coil structures in proteins. SPAM is
           an example of a P. falciparum antigen in which a
           repetitive sequence has features characteristic of a
           well-defined structural element.
          Length = 164

 Score = 37.1 bits (86), Expect = 0.016
 Identities = 20/85 (23%), Positives = 45/85 (52%), Gaps = 4/85 (4%)

Query: 447 KKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKE 506
           KK   +KK  N +E  +  +     +K+D  +E++DV  E +E+ +  + E++E +++ E
Sbjct: 11  KKPEEEKKDENLLEHVKITS----WDKEDIIKENEDVKDEKQEDDEEEEEEDEEEIEEPE 66

Query: 507 KNEDSNNKSEESSDDDDDDWEKYQT 531
             ED     E+  ++++D+ +    
Sbjct: 67  DIEDEEEIVEDEEEEEEDEEDNVDL 91



 Score = 34.1 bits (78), Expect = 0.15
 Identities = 20/95 (21%), Positives = 46/95 (48%), Gaps = 5/95 (5%)

Query: 434 GGKKKKFTKNKHDKKSSGQKKIT-----NTVEQTEAPADVAKVEKKDTPEESKDVSSESE 488
           G  +KK  + K D+      KIT     + +++ E   D  + + ++  EE ++   E E
Sbjct: 7   GAPEKKPEEEKKDENLLEHVKITSWDKEDIIKENEDVKDEKQEDDEEEEEEDEEEIEEPE 66

Query: 489 EESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDD 523
           +  D  ++  DE  +++++ ++ + K  E  + +D
Sbjct: 67  DIEDEEEIVEDEEEEEEDEEDNVDLKDIEKKNIND 101


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
           the C subunit of DNA polymerase delta. It carries the
           essential residues for binding to the Pol1 subunit of
           polymerase alpha, from residues 293-332, which are
           characterized by the motif D--G--VT, referred to as the
           DPIM motif. The first 160 residues of the protein form
           the minimal domain for binding to the B subunit, Cdc1,
           of polymerase delta, the final 10 C-terminal residues,
           362-372, being the DNA sliding clamp, PCNA, binding
           motif.
          Length = 427

 Score = 38.3 bits (89), Expect = 0.016
 Identities = 25/142 (17%), Positives = 51/142 (35%), Gaps = 1/142 (0%)

Query: 384 DTIKVQFLEPSKEGEGEIEEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKN 443
             +  +    S +   +  +E           V     + + P          KKK TK 
Sbjct: 170 SIMSPEVKVKSAKKTQDTSKETTTEKTEGKTSVKAASLKRNPPKKSNIMSSFFKKK-TKE 228

Query: 444 KHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVD 503
           K +KK + +  +    E+     DV   ++   P    +   E E +      +++E  +
Sbjct: 229 KKEKKEASESTVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETE 288

Query: 504 KKEKNEDSNNKSEESSDDDDDD 525
           +KEK +    K     +D+D++
Sbjct: 289 EKEKEKRKRLKKMMEDEDEDEE 310


>gnl|CDD|222912 PHA02624, PHA02624, large T antigen; Provisional.
          Length = 647

 Score = 38.4 bits (90), Expect = 0.018
 Identities = 16/60 (26%), Positives = 25/60 (41%), Gaps = 11/60 (18%)

Query: 19 IKKAYRKQSLILHPDKETGDEKAFMKLTKAYQAL----------TDEESRRNFEKYGNPD 68
          ++KAY ++    HPDK  GDE+   +L   Y+ L             +       YG P+
Sbjct: 30 MRKAYLRKCKEYHPDK-GGDEEKMKRLNSLYKKLQEGVKSARQSFGTQDSSEIPTYGTPE 88


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 728

 Score = 38.5 bits (90), Expect = 0.018
 Identities = 24/103 (23%), Positives = 44/103 (42%), Gaps = 1/103 (0%)

Query: 447 KKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKE 506
           KK+ G++K      + EA +   K E K+  +E K+   E E E +             +
Sbjct: 417 KKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLK 476

Query: 507 KNEDSNNKSEESSDDDDDDWEKYQTGLNKRDKVLEGRSKQSHS 549
           ++E +  + EE   D+++ W K  T    +    +   K+S S
Sbjct: 477 RSEKAQKEEEEEELDEENPWLK-TTSSVGKSAKKQDSKKKSSS 518



 Score = 37.3 bits (87), Expect = 0.035
 Identities = 26/134 (19%), Positives = 47/134 (35%), Gaps = 1/134 (0%)

Query: 392 EPSKEGEGEIEEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSG 451
           E   E E    E E +            K+   R      + G K+ +  K K + K+  
Sbjct: 389 ENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRK-FGPENGEKEAESKKLKKENKNEF 447

Query: 452 QKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDS 511
           ++K  +  E+     + AKVEK       +   ++ EEE +  D EN  +       + +
Sbjct: 448 KEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKSA 507

Query: 512 NNKSEESSDDDDDD 525
             +  +       D
Sbjct: 508 KKQDSKKKSSSKLD 521



 Score = 33.5 bits (77), Expect = 0.62
 Identities = 23/131 (17%), Positives = 51/131 (38%), Gaps = 8/131 (6%)

Query: 395 KEGEGEIEEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSGQKK 454
           KE E +  ++E K          KEK+++     +++++  K +K       +    QK+
Sbjct: 432 KEAESKKLKKENKNEF-------KEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKE 484

Query: 455 ITNTVEQTEAPADVAKVE-KKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNN 513
                   E P         K   ++     S S+ +   + +    V  KK+K ++ + 
Sbjct: 485 EEEEELDEENPWLKTTSSVGKSAKKQDSKKKSSSKLDKAANKISKAAVKVKKKKKKEKSI 544

Query: 514 KSEESSDDDDD 524
             ++   D++D
Sbjct: 545 DLDDDLIDEED 555



 Score = 32.3 bits (74), Expect = 1.5
 Identities = 30/158 (18%), Positives = 53/158 (33%), Gaps = 26/158 (16%)

Query: 383 DDTIKVQFLEPSKEGEGEIEEEEEKPNGTITPVVNKEKQQTHRPV--WMKNKKGGKKKK- 439
           +  ++    +  K  E   +EEEE+      P +               K K   K  K 
Sbjct: 464 EAKVEKVANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQDSKKKSSSKLDKA 523

Query: 440 -FTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDV-- 496
               +K   K   +KK   +++  +   D     K D  +E      E E++ +L  +  
Sbjct: 524 ANKISKAAVKVKKKKKKEKSIDLDDDLIDEEDSIKLDVDDE------EDEDDEELPFLFK 577

Query: 497 ---------ENDEVVDKKEKNEDSNNKSEESSDDDDDD 525
                      D+VV + EK      K E   ++D  +
Sbjct: 578 QKDLIKEAFAGDDVVAEFEK-----EKKEVIEEEDPKE 610



 Score = 32.0 bits (73), Expect = 1.8
 Identities = 17/56 (30%), Positives = 29/56 (51%), Gaps = 1/56 (1%)

Query: 470 KVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDD 525
           K+E K   EE +D  S+SEEE +  D ++D+  +          K +E  DD+++ 
Sbjct: 317 KIEGKSVSEEDEDEDSDSEEEDEDDDEDDDDGENPW-MLRKKLGKLKEGEDDEENS 371



 Score = 29.6 bits (67), Expect = 9.3
 Identities = 31/177 (17%), Positives = 66/177 (37%), Gaps = 39/177 (22%)

Query: 392 EPSKEGEGEIEEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSG 451
           E  ++ + + EEE+E  +       N          WM  KK GK K+   ++ +     
Sbjct: 325 EEDEDEDSDSEEEDEDDDEDDDDGENP---------WMLRKKLGKLKEGEDDEENSGLLS 375

Query: 452 QKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKE----- 506
            K +     +          +K++   E +++  E E E +  + EN+E   K       
Sbjct: 376 MKFMQRAEAR----------KKEENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRKF 425

Query: 507 ---------------KNEDSNNKSEESSDDDDDDWEKYQTGLNKRDKVLEGRSKQSH 548
                          K   +  K ++ SD++++  ++ +  + K    L  RS+++ 
Sbjct: 426 GPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQ 482


>gnl|CDD|112562 pfam03753, HHV6-IE, Human herpesvirus 6 immediate early protein.
           The proteins in this family are poorly characterized,
           but an investigation has indicated that the immediate
           early protein is required the down-regulation of MHC
           class I expression in dendritic cells. Human herpesvirus
           6 immediate early protein is also referred to as U90.
          Length = 993

 Score = 38.5 bits (89), Expect = 0.020
 Identities = 37/128 (28%), Positives = 55/128 (42%), Gaps = 10/128 (7%)

Query: 430 KNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEE 489
              K  KKK  + N+  KK + ++K   TV      +   K +   T      +SSESE+
Sbjct: 643 TFYKRDKKKDISTNRKVKKRTAKRK---TVGYKTDKSKKIKSDSLPTDTNVIVISSESED 699

Query: 490 ESDLSDVENDEVVDKK--EKNEDSNNKSEESSDDDDDDWEKYQTGLNKRDKVLEGRSKQS 547
           E D        ++ K   +K   S  KSE SS+ DD   E  +  L+  DKV+   + +S
Sbjct: 700 EED-----GFNIIKKSQLKKKIKSELKSESSSESDDCTSEDNELHLSDYDKVINNGNCES 754

Query: 548 HSVHCPRF 555
                P F
Sbjct: 755 KGFPSPVF 762


>gnl|CDD|220284 pfam09538, FYDLN_acid, Protein of unknown function (FYDLN_acid).
           Members of this family are bacterial proteins with a
           conserved motif [KR]FYDLN, sometimes flanked by a pair
           of CXXC motifs, followed by a long region of low
           complexity sequence in which roughly half the residues
           are Asp and Glu, including multiple runs of five or more
           acidic residues. The function of members of this family
           is unknown.
          Length = 104

 Score = 35.4 bits (82), Expect = 0.024
 Identities = 18/64 (28%), Positives = 29/64 (45%), Gaps = 2/64 (3%)

Query: 462 TEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDD 521
            E P +VAK        E      E EE+ D   +++D+  D  +   D ++  ++   D
Sbjct: 34  EEVPPEVAKSRAPAADAEDAAKKDEDEEDEDDVVLDDDDDDDDDDDLPDLDD--DDVDLD 91

Query: 522 DDDD 525
           DDDD
Sbjct: 92  DDDD 95



 Score = 27.7 bits (62), Expect = 9.9
 Identities = 15/64 (23%), Positives = 28/64 (43%), Gaps = 12/64 (18%)

Query: 462 TEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDD 521
             A       +K +  E+  DV  + +++    D ++D++ D  + + D         DD
Sbjct: 45  APAADAEDAAKKDEDEEDEDDVVLDDDDD----DDDDDDLPDLDDDDVD--------LDD 92

Query: 522 DDDD 525
           DDDD
Sbjct: 93  DDDD 96


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 38.0 bits (88), Expect = 0.032
 Identities = 27/123 (21%), Positives = 36/123 (29%), Gaps = 7/123 (5%)

Query: 686  PAAAPHAGRLMKNNSTNSSDRLRDLCSVGR----RRVLSESFSTTQERPIL-PRPTPGAP 740
            P+ A +        +    +R RD  + GR    RR      +     P   PR    A 
Sbjct: 2631 PSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRR-RAAR 2689

Query: 741  IMVAPTTSVQFLNGSKATQFVFVPANSLQPVPIPPAKKRGYVKQKPPPILPAPAPTVPTP 800
              V   TS+        T     P   +   P+PP            P  PAP      P
Sbjct: 2690 PTVGSLTSLADPPPPPPTP-EPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGP 2748

Query: 801  IAP 803
              P
Sbjct: 2749 ATP 2751



 Score = 35.7 bits (82), Expect = 0.14
 Identities = 19/122 (15%), Positives = 28/122 (22%), Gaps = 7/122 (5%)

Query: 686  PAAAPHAGRLMKNNSTNSSDRLRDLCSVGRRRVLSESFSTTQERPILPRPTPGAPIMVAP 745
            P           + S  S     D        +   +       P  P P P +    AP
Sbjct: 2780 PRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAP 2839

Query: 746  TTSVQFLNGSKATQFVFVPANSL-------QPVPIPPAKKRGYVKQKPPPILPAPAPTVP 798
                     S        P   +        P   P A  R  V++   P +     +  
Sbjct: 2840 PPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFA 2899

Query: 799  TP 800
             P
Sbjct: 2900 LP 2901



 Score = 34.1 bits (78), Expect = 0.38
 Identities = 17/91 (18%), Positives = 24/91 (26%), Gaps = 3/91 (3%)

Query: 713  VGRRRVLSESFSTTQERPILPRPTPGAPIMVAPTTSVQFLNGSKATQFVFVPANSLQPVP 772
             G  R      +     P  P      P       +V  L+ S+ +        +  P  
Sbjct: 2752 GGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRES-LPSPWDPADPPAA 2810

Query: 773  IPPAKKRGYVKQKPPPILPAPAPTVPTPIAP 803
            +            P    P P PT   P AP
Sbjct: 2811 VLAPAAALPPAASPAG--PLPPPTSAQPTAP 2839



 Score = 33.8 bits (77), Expect = 0.65
 Identities = 19/81 (23%), Positives = 29/81 (35%), Gaps = 7/81 (8%)

Query: 730  PILPRPTPGAPIMVA------PTTSVQFLNGSKATQFVFVPANSLQPVPIPPAKKRGYVK 783
             +  RP   +P          P   +     S++T+   +P +  +  P P A       
Sbjct: 2861 DVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPP-QP 2919

Query: 784  QKPPPILPAPAPTVPTPIAPE 804
            Q  PP  P P P  P P  P+
Sbjct: 2920 QPQPPPPPQPQPPPPPPPRPQ 2940



 Score = 30.7 bits (69), Expect = 4.7
 Identities = 30/124 (24%), Positives = 36/124 (29%), Gaps = 13/124 (10%)

Query: 685  RPAAAPHAGRLMKNNSTNSSDRLRDLCSVG---RRRVLSESFS--TTQERPILPRPTPGA 739
            RP   P  GR+ +        R     S     RRR    +    T+   P  P PTP  
Sbjct: 2651 RPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEP 2710

Query: 740  PIMVAPTTSVQFLNGSKATQFVFVPANSLQPVPIPPAKKRGYVKQKPPPILPAPAPTVPT 799
                 P   V               + +L   P PPA   G       P  PA     PT
Sbjct: 2711 A----PHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPAT----PGGPARPARPPT 2762

Query: 800  PIAP 803
               P
Sbjct: 2763 TAGP 2766



 Score = 29.9 bits (67), Expect = 7.7
 Identities = 26/120 (21%), Positives = 42/120 (35%), Gaps = 24/120 (20%)

Query: 727  QERPILPRPTPGAP-IMVAPTTSVQFLNGSKATQFVFVPANSLQPVPIPPAKKRGYVKQK 785
              +P  P P P  P   +APTT                PA + +P    P    G +   
Sbjct: 2926 PPQPQPPPPPPPRPQPPLAPTTD---------------PAGAGEPSGAVPQPWLGALVPG 2970

Query: 786  PPPI----LPAPAPTVPTPIAPEFLWSRESKMSIANRIVSEATTMFLETMLQPDILSYIQ 841
               +    +P PAP+   P +     +        +R+ S A+++ L     P  +S  Q
Sbjct: 2971 RVAVPRFRVPQPAPSREAPASS----TPPLTGHSLSRVSSWASSLALHEETDPPPVSLKQ 3026


>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein.  CDC45 is an essential gene
           required for initiation of DNA replication in S.
           cerevisiae, forming a complex with MCM5/CDC46.
           Homologues of CDC45 have been identified in human, mouse
           and smut fungus among others.
          Length = 583

 Score = 37.3 bits (87), Expect = 0.037
 Identities = 24/82 (29%), Positives = 39/82 (47%), Gaps = 3/82 (3%)

Query: 467 DVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDDW 526
           D   +E++   E   D +    EE D  D E+DE  ++  K+ED     ++  DDDDDD 
Sbjct: 102 DDGDIEEELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDE---DDDDDDDDDDI 158

Query: 527 EKYQTGLNKRDKVLEGRSKQSH 548
              +  L +R +  E   K++ 
Sbjct: 159 ATRERSLERRRRRREWEEKRAE 180


>gnl|CDD|227594 COG5269, ZUO1, Ribosome-associated chaperone zuotin [Translation,
           ribosomal structure and biogenesis / Posttranslational
           modification, protein turnover, chaperones].
          Length = 379

 Score = 36.9 bits (85), Expect = 0.037
 Identities = 16/54 (29%), Positives = 26/54 (48%), Gaps = 5/54 (9%)

Query: 14  ASEASIKKAYRKQSLILHPDKET-----GDEKAFMKLTKAYQALTDEESRRNFE 62
           A    I KA++K+    HPDK       G ++ F  + KA + L D + R  ++
Sbjct: 58  AIPPQILKAHKKKVYKYHPDKTAAGGNKGCDEFFKLIQKAREVLGDRKLRLQYD 111


>gnl|CDD|218538 pfam05285, SDA1, SDA1.  This family consists of several SDA1
           protein homologues. SDA1 is a Saccharomyces cerevisiae
           protein which is involved in the control of the actin
           cytoskeleton. The protein is essential for cell
           viability and is localised in the nucleus.
          Length = 317

 Score = 36.6 bits (85), Expect = 0.044
 Identities = 15/54 (27%), Positives = 32/54 (59%)

Query: 475 DTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDDWEK 528
           D+ +E + +  ES++E + SD E++E  D+  K    ++  E S +D+++  E+
Sbjct: 111 DSDDEGEWIDVESDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEE 164



 Score = 29.2 bits (66), Expect = 8.9
 Identities = 21/82 (25%), Positives = 39/82 (47%)

Query: 447 KKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKE 506
           KK  G++++ + +   E      + E+K    E    S + ++E +  +VE DE  D + 
Sbjct: 57  KKGYGEEEVVDGIPGLELLEKWKEEERKKKEAEQGLESDDDDDEEEEWEVEEDEDSDDEG 116

Query: 507 KNEDSNNKSEESSDDDDDDWEK 528
           +  D  +  E  S D +D+ EK
Sbjct: 117 EWIDVESDKEIESSDSEDEEEK 138


>gnl|CDD|182347 PRK10266, PRK10266, curved DNA-binding protein CbpA; Provisional.
          Length = 306

 Score = 36.3 bits (84), Expect = 0.052
 Identities = 17/48 (35%), Positives = 30/48 (62%), Gaps = 2/48 (4%)

Query: 18 SIKKAYRKQSLILHPD--KETGDEKAFMKLTKAYQALTDEESRRNFEK 63
          +IK AYR+ +   HPD  KE   E  F ++ +A++ L+DE+ R  +++
Sbjct: 20 TIKTAYRRLARKYHPDVSKEPDAEARFKEVAEAWEVLSDEQRRAEYDQ 67


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 36.9 bits (86), Expect = 0.055
 Identities = 20/113 (17%), Positives = 42/113 (37%), Gaps = 2/113 (1%)

Query: 413  TPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVE 472
               V        +    ++  G K    TK K   K   +  +    ++ ++    A+ +
Sbjct: 1278 VSAVQYSPPPPSKRPDGESNGGSKPSSPTKKK--VKKRLEGSLAALKKKKKSEKKTARKK 1335

Query: 473  KKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDD 525
            K  T  +    S  S         ++D   +  + +E  +++ E+  DD+DDD
Sbjct: 1336 KSKTRVKQASASQSSRLLRRPRKKKSDSSSEDDDDSEVDDSEDEDDEDDEDDD 1388


>gnl|CDD|217502 pfam03343, SART-1, SART-1 family.  SART-1 is a protein involved in
           cell cycle arrest and pre-mRNA splicing. It has been
           shown to be a component of U4/U6 x U5 tri-snRNP complex
           in human, Schizosaccharomyces pombe and Saccharomyces
           cerevisiae. SART-1 is a known tumour antigen in a range
           of cancers recognised by T cells.
          Length = 603

 Score = 36.7 bits (85), Expect = 0.059
 Identities = 38/132 (28%), Positives = 63/132 (47%), Gaps = 13/132 (9%)

Query: 428 WMKNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKK-DTPEESKDV--- 483
           W+K  K  +KKK  + K       ++K     E T       KV  K +  EE +DV   
Sbjct: 94  WLKKSKKRQKKKEAERKKALLLDEKEK-ERAAEYTSEDLAGLKVGHKVEEFEEGEDVILT 152

Query: 484 -----SSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDDWEKYQTG-LNKRD 537
                  E E+E D   +EN E+V+K++  ++   K ++   D DDD +  +   L+K D
Sbjct: 153 LKDTGVLEDEDEGDE--LENVELVEKEKDKKNLELKKKKPDYDPDDDDKFNKRSILSKYD 210

Query: 538 KVLEGRSKQSHS 549
           + +EG+ K+S +
Sbjct: 211 EEIEGKKKKSDN 222



 Score = 30.1 bits (68), Expect = 7.0
 Identities = 19/87 (21%), Positives = 30/87 (34%), Gaps = 10/87 (11%)

Query: 454 KITNTVE-----QTEAPADVAKVEKKDTPEESKDVSSESEEE--SDLSDVENDEVVDKKE 506
            I  T E     Q E   +  + + +   E S     + +EE      DVE   V + +E
Sbjct: 392 VIDETSEFVRSLQKEPLEEKPENKDESVEEISDAEEDDEDEEDEDGDGDVEMSAVDNDEE 451

Query: 507 KNEDSNNKSEESSDDDDDDWEKYQTGL 533
           K E+     E       ++      GL
Sbjct: 452 KEEEDK---EAIPSTILEEEPTVGGGL 475


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 36.5 bits (85), Expect = 0.063
 Identities = 24/83 (28%), Positives = 42/83 (50%), Gaps = 2/83 (2%)

Query: 472 EKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDDWEKYQT 531
            ++D  EE   V  E EE+ D    E +E VD    +E+ + + E+S D+DD++ E+ + 
Sbjct: 319 GEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVD--LSDEEEDEEDEDSDDEDDEEEEEEEK 376

Query: 532 GLNKRDKVLEGRSKQSHSVHCPR 554
              K+      RS+   +  CP+
Sbjct: 377 EKKKKKSAESTRSELPFTFPCPK 399



 Score = 31.5 bits (72), Expect = 2.6
 Identities = 13/31 (41%), Positives = 18/31 (58%)

Query: 641 WEMSGEEDEGDEEMGGSDVSEFTTDEDVEDD 671
               GEEDE +EE G  D  E   D+D+E++
Sbjct: 315 GLGQGEEDEEEEEDGVDDEDEEDDDDDLEEE 345


>gnl|CDD|237057 PRK12323, PRK12323, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 700

 Score = 36.0 bits (83), Expect = 0.093
 Identities = 13/76 (17%), Positives = 18/76 (23%), Gaps = 4/76 (5%)

Query: 728 ERPILPRPTPGAPIMVAPTTSVQFLNGSKATQFVFVPANSLQPVPIPPAKKRGYVKQKPP 787
             P  P   P A    A          ++ +        +L       A+  G      P
Sbjct: 397 PAPAAPPAAPAAAPAAAAAARAVAAAPARRSP----APEALAAARQASARGPGGAPAPAP 452

Query: 788 PILPAPAPTVPTPIAP 803
               APA       A 
Sbjct: 453 APAAAPAAAARPAAAG 468


>gnl|CDD|219939 pfam08619, Nha1_C, Alkali metal cation/H+ antiporter Nha1 C
           terminus.  The C terminus of the plasma membrane Nha1
           antiporter plays an important role in the immediate cell
           response to hypo-osmotic shock which prevents an
           execessive loss of ions and water. This domain is found
           with pfam00999.
          Length = 430

 Score = 35.6 bits (82), Expect = 0.12
 Identities = 25/134 (18%), Positives = 41/134 (30%), Gaps = 19/134 (14%)

Query: 400 EIEEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSGQKKITNTV 459
           +I    +       PVV++            +  G K  K        ++   K      
Sbjct: 269 KINPHGKPERDRSNPVVSR----------ALSWVGLKGTKHD-FIKQIRNLDPKARREVD 317

Query: 460 EQTEAPADVAKVEKKDTPE----ESKDVSSESEEESDLSD----VENDEVVDKKEKNEDS 511
            +T AP  V    + +          D  SE + E DL+      E  E    +E +  S
Sbjct: 318 LETNAPERVKAPARDEVDTNDSSPLTDSKSEEKLEDDLAHSLLGSEASERRKNREPSSSS 377

Query: 512 NNKSEESSDDDDDD 525
                 S  D +D+
Sbjct: 378 PPSRPRSRRDSEDE 391


>gnl|CDD|215521 PLN02967, PLN02967, kinase.
          Length = 581

 Score = 35.8 bits (82), Expect = 0.12
 Identities = 23/86 (26%), Positives = 34/86 (39%), Gaps = 6/86 (6%)

Query: 442 KNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEV 501
           K         ++K        E      KV K+   ++  +   +   ES++SDVE  E 
Sbjct: 108 KESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKMDEDVEDQGSESEVSDVEESEF 167

Query: 502 VDKKEKNEDSNNKSEESSDDDDDDWE 527
           V   E      N+SEE  D + DD E
Sbjct: 168 VTSLE------NESEEELDLEKDDGE 187



 Score = 32.3 bits (73), Expect = 1.5
 Identities = 24/117 (20%), Positives = 38/117 (32%), Gaps = 9/117 (7%)

Query: 405 EEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEA 464
           E        P  N            K K     K+ TK    + S   ++ +  V   +A
Sbjct: 53  ESALAVDEEPDENGAV--------SKKKPTRSVKRATKKTVVEISEPLEEGSELVVNEDA 104

Query: 465 PADVAKVEKKD-TPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSD 520
             D    +    T  ++   SS+ EEE     V     V K +++ +      E SD
Sbjct: 105 ALDKESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKMDEDVEDQGSESEVSD 161



 Score = 30.8 bits (69), Expect = 3.5
 Identities = 30/116 (25%), Positives = 48/116 (41%), Gaps = 8/116 (6%)

Query: 431 NKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESK-------DV 483
           NK  G++ +  +NK    +  +KKI + +   E P +   V KK      K         
Sbjct: 29  NKFRGERWR-LQNKFLLAAGSRKKIESALAVDEEPDENGAVSKKKPTRSVKRATKKTVVE 87

Query: 484 SSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDDWEKYQTGLNKRDKV 539
            SE  EE     V  D  +DK+ K      + + ++   D + EK +  + KR KV
Sbjct: 88  ISEPLEEGSELVVNEDAALDKESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKV 143


>gnl|CDD|119392 cd06174, MFS, The Major Facilitator Superfamily (MFS) is a large
           and diverse group of secondary transporters that
           includes uniporters, symporters, and antiporters. MFS
           proteins facilitate the transport across cytoplasmic or
           internal membranes of a variety of substrates including
           ions, sugar phosphates, drugs, neurotransmitters,
           nucleosides, amino acids, and peptides. They do so using
           the electrochemical potential of the transported
           substrates. Uniporters transport a single substrate,
           while symporters and antiporters transport two
           substrates in the same or in opposite directions,
           respectively, across membranes. MFS proteins are
           typically 400 to 600 amino acids in length, and the
           majority contain 12 transmembrane alpha helices (TMs)
           connected by hydrophilic loops. The N- and C-terminal
           halves of these proteins display weak similarity and may
           be the result of a gene duplication/fusion event. Based
           on kinetic studies and the structures of a few bacterial
           superfamily members, GlpT (glycerol-3-phosphate
           transporter), LacY (lactose permease), and EmrD
           (multidrug transporter), MFS proteins are thought to
           function through a single substrate binding site,
           alternating-access mechanism involving a rocker-switch
           type of movement. Bacterial members function primarily
           for nutrient uptake, and as drug-efflux pumps to confer
           antibiotic resistance. Some MFS proteins have medical
           significance in humans such as the glucose transporter
           Glut4, which is impaired in type II diabetes, and
           glucose-6-phosphate transporter (G6PT), which causes
           glycogen storage disease when mutated.
          Length = 352

 Score = 35.4 bits (82), Expect = 0.14
 Identities = 16/92 (17%), Positives = 32/92 (34%), Gaps = 14/92 (15%)

Query: 73  MSFGIALPSYIVEKENSV----WVLGLYALVFMVALPTAVGMWWYKSIRYTGDKVLLETI 128
                ALP    +   S      ++  ++L + +     +  +   S R+   +VLL  +
Sbjct: 16  GLLSPALPLLAEDLGLSASQAGLIVSAFSLGYALGSL--LAGYL--SDRFGRRRVLLLGL 71

Query: 129 ------NFYYAFFQITPHMALKRVLMILGASL 154
                 +   AF      + + R L+ LG   
Sbjct: 72  LLFALGSLLLAFASSLWLLLVGRFLLGLGGGA 103


>gnl|CDD|178135 PLN02520, PLN02520, bifunctional 3-dehydroquinate
           dehydratase/shikimate dehydrogenase.
          Length = 529

 Score = 34.7 bits (80), Expect = 0.21
 Identities = 24/87 (27%), Positives = 44/87 (50%), Gaps = 11/87 (12%)

Query: 570 RKSRTLLTSPYHITELVDQEQVQLKFTAPRWPGVYTFSVCLRCDSYLGFDQMQDIKLDVK 629
           R++ TL+  P  + + VD+  +++         +    V +R D    F+  +D+K  +K
Sbjct: 19  RRNPTLICVPI-MADSVDKMLIEMAKAKELGADL----VEIRLDFLKNFNPREDLKTLIK 73

Query: 630 EAPEVP---TEHPQWEMSGEEDEGDEE 653
           ++P +P   T  P+WE  G + EGDE 
Sbjct: 74  QSP-LPTLVTYRPKWE--GGQYEGDEN 97


>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
          Length = 191

 Score = 33.2 bits (76), Expect = 0.31
 Identities = 12/71 (16%), Positives = 24/71 (33%)

Query: 453 KKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSN 512
           KK   T  + +      K +K+            S+ E +  + +  +    K +N  + 
Sbjct: 52  KKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEENDKKVDYELPKVQNTAAE 111

Query: 513 NKSEESSDDDD 523
              E+  D  D
Sbjct: 112 VNHEDVIDLSD 122


>gnl|CDD|234474 TIGR04113, cas_csx17, CRISPR-associated protein Csx17, subtype
           Dpsyc.  Members of this protein family are found
           exclusively in CRISPR-associated (cas) type I system
           gene clusters of the Dpsyc subtype. Markers for that
           type include a variant form of cas3 (model TIGR02621)
           and the GSU0054-like protein family (model TIGR02165).
           This family occurs in less than half of known Dpsyc
           clusters.
          Length = 704

 Score = 34.3 bits (79), Expect = 0.32
 Identities = 16/52 (30%), Positives = 23/52 (44%), Gaps = 6/52 (11%)

Query: 703 SSDRLRDLCSVGRRRVLSESFSTTQERPILPRPTPGAPIMVAPTTSVQFLNG 754
             D  R+L +V  RR+L E+     +RP       G P   + +   QFL G
Sbjct: 516 GGDLARNLAAVLERRLLEEARKDNDDRPQ------GGPCPASLSDIAQFLRG 561


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 34.1 bits (79), Expect = 0.33
 Identities = 24/89 (26%), Positives = 39/89 (43%), Gaps = 13/89 (14%)

Query: 445 HDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDK 504
             KK++  KKI   VE+ E   +  K EKK      K    E EEE            +K
Sbjct: 404 GSKKAT--KKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEE-----------KEK 450

Query: 505 KEKNEDSNNKSEESSDDDDDDWEKYQTGL 533
           KE+ ++   +  E   +++++ +K Q  L
Sbjct: 451 KEEEKEEEEEEAEEEKEEEEEKKKKQATL 479



 Score = 32.6 bits (75), Expect = 1.1
 Identities = 19/72 (26%), Positives = 34/72 (47%)

Query: 436 KKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSD 495
             KK TK         +KK     ++ +  A   K ++++  EE +    E EEE + ++
Sbjct: 404 GSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAE 463

Query: 496 VENDEVVDKKEK 507
            E +E  +KK+K
Sbjct: 464 EEKEEEEEKKKK 475


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 33.1 bits (76), Expect = 0.34
 Identities = 17/61 (27%), Positives = 32/61 (52%), Gaps = 4/61 (6%)

Query: 469 AKVEKKDTPEESKDVSSESEEE----SDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDD 524
           AK +KK+  EE + V  E EE+          +  +  DK +K++  ++KSE+  + + +
Sbjct: 60  AKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAE 119

Query: 525 D 525
           D
Sbjct: 120 D 120


>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
           This family represents Cwf15/Cwc15 (from
           Schizosaccharomyces pombe and Saccharomyces cerevisiae
           respectively) and their homologues. The function of
           these proteins is unknown, but they form part of the
           spliceosome and are thus thought to be involved in mRNA
           splicing.
          Length = 241

 Score = 33.5 bits (77), Expect = 0.35
 Identities = 24/142 (16%), Positives = 49/142 (34%), Gaps = 36/142 (25%)

Query: 423 THRPVWM----KNKKGGKKKK-------------FTKNKHDKKSSG------QKKITNTV 459
            HRP +     K ++GG +                TK K+ +   G      ++ +   +
Sbjct: 4   AHRPTFDPARGKEEQGGNRLSGPSSIYSSRDLPAHTKLKYRQPGQGTEDELRKRDLRAEL 63

Query: 460 EQTEAPADVAKVEKK-----DTPEESKDVSSESEEESD--------LSDVENDEVVDKKE 506
           E+ E      K  K      D        +   E++ +        + +   +   D  +
Sbjct: 64  EEAERAHKSKKENKLAIEDADKSTNLDASNEGDEDDDEEDEIKRKRIEEDARNSDADDSD 123

Query: 507 KNEDSNNKSEESSDDDDDDWEK 528
            + DS++  ++S DDD +D   
Sbjct: 124 SSSDSDSSDDDSDDDDSEDETA 145


>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
          Length = 1036

 Score = 34.1 bits (78), Expect = 0.38
 Identities = 24/118 (20%), Positives = 48/118 (40%), Gaps = 2/118 (1%)

Query: 434 GGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDL 493
              +K+  +N  DK+ S     + +V   E  ++  +  + D    S+    E E E ++
Sbjct: 21  ESSQKRIQQNNGDKEDSSTSTSSLSVSAVEKTSNAKEEIQVDFQHNSESAVEEVEAEDEI 80

Query: 494 SDVENDEVVDKKEKNEDSNNKSEESSDDDDDDWE-KYQTGL-NKRDKVLEGRSKQSHS 549
              +N   V K        + S +    DDD  + K +    N R + +E  ++++ S
Sbjct: 81  EVEQNQSDVLKSSSIVKEESISTDMDGIDDDSLDRKLKLERENLRKREIEELAEENFS 138


>gnl|CDD|218621 pfam05518, Totivirus_coat, Totivirus coat protein. 
          Length = 753

 Score = 34.0 bits (78), Expect = 0.40
 Identities = 24/117 (20%), Positives = 32/117 (27%), Gaps = 10/117 (8%)

Query: 687 AAAPHAGRLMKNNSTNSSDRLRDLCSVGRRRVLSESFSTTQERPILPRPTPGAPIMVAPT 746
           A A   GR      T     +     V +  +    ++   E      P P    M A T
Sbjct: 617 AQARTFGRA-----TVGEMIISGFPPVFKTALPRPDYNRGGEAGGPGVPGPVPVGMPAHT 671

Query: 747 TSVQFL---NGSKATQFVFVPANSLQPVPIPPAKKRGYVKQKPPPILPAPAPTVPTP 800
                +   +  + T             P P     G     PPP LPA A   P  
Sbjct: 672 ARPSRVARGDPVRPTAHHAAL--RAPQAPRPGGPPGGGGGLPPPPDLPAAAGPAPCG 726


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
           Fas-binding protein) is thought to play a role in
           apoptosis, but precise role played by Daxx remains to be
           determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 33.7 bits (77), Expect = 0.50
 Identities = 21/99 (21%), Positives = 38/99 (38%), Gaps = 2/99 (2%)

Query: 463 EAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDD 522
           E+  + +  E+++  EE ++   ESEEE    + E +EV       E+    SE   D +
Sbjct: 438 ESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGE 497

Query: 523 DDDWEKYQTGLNKRDKVLEGRSKQSHSVHCPRFPEDKQE 561
           + + +  +              +Q         PE  QE
Sbjct: 498 EPEEDAERRNSEMAGISRMSEGQQPRGS--SVQPESPQE 534


>gnl|CDD|235943 PRK07133, PRK07133, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 725

 Score = 33.6 bits (77), Expect = 0.51
 Identities = 24/152 (15%), Positives = 56/152 (36%), Gaps = 7/152 (4%)

Query: 390 FLEPSKEGEGEIEEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKS 449
             E  +E E EI+ ++ + N      + ++K +    +  + K   K  +      D K+
Sbjct: 355 LSELEEEDENEIKFKKIEENSIDNLDIKEKKIENENDI--EGKSDTKNLEEGFETKDNKN 412

Query: 450 SGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNE 509
                I  T           ++ +K T      ++ E+ +E +   + ND +  +  +  
Sbjct: 413 KNSSFINKTENILTNSPLKDELLEKTTEI----INIENPQEFEFGQIGNDIISTEIAQL- 467

Query: 510 DSNNKSEESSDDDDDDWEKYQTGLNKRDKVLE 541
           D N    ++ + D ++           +K+ E
Sbjct: 468 DENQNLIDTGEFDLENNFSNSFNPENGNKIDE 499


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
           Rpc31.  RNA polymerase III contains seventeen subunits
           in yeasts and in human cells. Twelve of these are akin
           to RNA polymerase I or II and the other five are RNA pol
           III-specific, and form the functionally distinct groups
           (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
           Rpc34 and Rpc82 form a cluster of enzyme-specific
           subunits that contribute to transcription initiation in
           S.cerevisiae and H.sapiens. There is evidence that these
           subunits are anchored at or near the N-terminal Zn-fold
           of Rpc1, itself prolonged by a highly conserved but RNA
           polymerase III-specific domain.
          Length = 221

 Score = 32.8 bits (75), Expect = 0.51
 Identities = 17/53 (32%), Positives = 28/53 (52%), Gaps = 1/53 (1%)

Query: 475 DTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDDWE 527
            T EE       S  E  L ++E ++V ++ EK+E+   + EE  D+D DD +
Sbjct: 142 FTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEE-EEEEEEEEDEDFDDDD 193



 Score = 30.5 bits (69), Expect = 3.0
 Identities = 27/108 (25%), Positives = 48/108 (44%), Gaps = 11/108 (10%)

Query: 425 RPVWMKNKKGGKK---KKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESK 481
             V   NKK GKK    KF K K    +  ++ I   +   E      K+++ +  +  +
Sbjct: 117 YSVMGINKKAGKKLALSKF-KRKVGLFTEEEEDIDEKLSMLEK-----KLKELEAEDVDE 170

Query: 482 DVSSESEEESDLSDVENDEVVDKKEKNEDSNNKS--EESSDDDDDDWE 527
           +   + EEE +  + + D   D  + ++D N ++  +   DDD DD E
Sbjct: 171 EDEKDEEEEEEEEEEDEDFDDDDDDDDDDYNAENYFDNGEDDDYDDDE 218


>gnl|CDD|227504 COG5177, COG5177, Uncharacterized conserved protein [Function
           unknown].
          Length = 769

 Score = 33.5 bits (76), Expect = 0.63
 Identities = 31/128 (24%), Positives = 51/128 (39%), Gaps = 7/128 (5%)

Query: 463 EAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVV----DKKEKNEDSNNKSEES 518
           E  +  A V K  +  ++K    E EE+   +D E+        D KE + +     EES
Sbjct: 373 EQTSKKAAVPKGTSFYQAKWAEDEEEEDGQCNDEESTMSAIDDDDPKENDNEEVAGDEES 432

Query: 519 SDDDDDDWEKYQTGLNKRDKVLEGRSKQSHSVHCPRFPEDKQEFWWI--YISDRKSRTLL 576
           + DD++ +E+      +R ++ E R  +      P   E +     I  Y   R  R L 
Sbjct: 433 AIDDNEGFEELSPEEEER-QLREFRDMEKEDREFPDEAELQPSESAIERYKEYRGLRNLY 491

Query: 577 TSPYHITE 584
           T  +   E
Sbjct: 492 TCSWRSDE 499


>gnl|CDD|227931 COG5644, COG5644, Uncharacterized conserved protein [Function
           unknown].
          Length = 869

 Score = 33.5 bits (76), Expect = 0.64
 Identities = 26/98 (26%), Positives = 45/98 (45%), Gaps = 2/98 (2%)

Query: 442 KNKHDKKSSGQ--KKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVEND 499
           + KH +K   Q   KI ++ E+  A  D  ++E  D    S  V+SE +EE D  +  ++
Sbjct: 5   QGKHQRKGKKQLENKILHSYEEESAGFDSEELEDNDEQGYSFGVNSEDDEEIDSDEAFDE 64

Query: 500 EVVDKKEKNEDSNNKSEESSDDDDDDWEKYQTGLNKRD 537
           E   +      + +KS +S+ D  +     +  LN  D
Sbjct: 65  EDEKRFADWSFNASKSGKSNKDHKNLNNTKEISLNDSD 102


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 33.6 bits (76), Expect = 0.69
 Identities = 28/140 (20%), Positives = 57/140 (40%), Gaps = 9/140 (6%)

Query: 384  DTIKVQFLEPSKEGEGEIEEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKN 443
            D  K +  E  K  E + + EE K          K+ ++  +       +        + 
Sbjct: 1305 DEAKKKAEEAKKADEAKKKAEEAKKKAD---AAKKKAEEAKKAAEAAKAEAEAAADEAEA 1361

Query: 444  KHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVD 503
              +K  + +KK      + +A A   K E+K   +E+K  + E ++++D    E  +   
Sbjct: 1362 AEEKAEAAEKKKEE--AKKKADAAKKKAEEKKKADEAKKKAEEDKKKAD----ELKKAAA 1415

Query: 504  KKEKNEDSNNKSEESSDDDD 523
             K+K +++  K+EE    D+
Sbjct: 1416 AKKKADEAKKKAEEKKKADE 1435



 Score = 30.9 bits (69), Expect = 3.7
 Identities = 33/143 (23%), Positives = 55/143 (38%), Gaps = 3/143 (2%)

Query: 384  DTIKVQFLEPSKEGEGEIEEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKN 443
            D  K +  E  K  E    E E   +         E  +  +    K     KKK   K 
Sbjct: 1332 DAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKK 1391

Query: 444  KHD---KKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDE 500
            K D   KK+   KK  + +++  A    A   KK   E+ K   ++ + E      E  +
Sbjct: 1392 KADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKK 1451

Query: 501  VVDKKEKNEDSNNKSEESSDDDD 523
              ++ +K E++  K+EE+   D+
Sbjct: 1452 KAEEAKKAEEAKKKAEEAKKADE 1474


>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT.  This family
           consists of several bacterial cobalamin biosynthesis
           (CobT) proteins. CobT is involved in the transformation
           of precorrin-3 into cobyrinic acid.
          Length = 282

 Score = 32.9 bits (75), Expect = 0.73
 Identities = 20/60 (33%), Positives = 30/60 (50%)

Query: 466 ADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDD 525
           AD    E +D P+E +D     EEES  SD  +++     E+ E    ++ E+S DD  D
Sbjct: 219 ADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEMEAAEASADDTPD 278



 Score = 30.6 bits (69), Expect = 3.1
 Identities = 16/85 (18%), Positives = 37/85 (43%), Gaps = 8/85 (9%)

Query: 446 DKKSSGQKKITNTVEQTEAPADVAK--VEKKDTPEESKDVSSESEEE------SDLSDVE 497
           DK      ++   V+  +A A V +  +   D  EE  D    ++ E          D +
Sbjct: 176 DKAGKDLDRLAACVDDQQAFARVVRDMLSSMDMAEELGDEPESADSEDNEDEDDPKEDED 235

Query: 498 NDEVVDKKEKNEDSNNKSEESSDDD 522
           +D+  +++  + DS ++  ++S ++
Sbjct: 236 DDQGEEEESGSSDSLSEDSDASSEE 260


>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein. 
          Length = 198

 Score = 32.4 bits (74), Expect = 0.74
 Identities = 19/80 (23%), Positives = 38/80 (47%), Gaps = 2/80 (2%)

Query: 447 KKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLS-DVENDEVVDKK 505
            K   +K+ T T  +        K +K+   EE K   S+ ++  +   D +N+E     
Sbjct: 48  PKQPKKKRPT-TPRKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENENDPDNNEESGDS 106

Query: 506 EKNEDSNNKSEESSDDDDDD 525
           +++  +N+ S+  ++DD DD
Sbjct: 107 QESASANSLSDIDNEDDMDD 126


>gnl|CDD|240433 PTZ00482, PTZ00482, membrane-attack complex/perforin (MACPF)
           Superfamily; Provisional.
          Length = 844

 Score = 32.9 bits (75), Expect = 0.81
 Identities = 30/180 (16%), Positives = 67/180 (37%), Gaps = 17/180 (9%)

Query: 386 IKVQFLEPSKEGEGEIEEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKH 445
           ++  +L+ + E        +E+ +G +     K     +  +    KK      F     
Sbjct: 29  LRSNYLQSTVEDIEGPSAVDERTSGVL-RDEGKHANILYNSILCNQKK---HASFLN--- 81

Query: 446 DKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKK 505
                 Q+K  +  +  E        + +D    +    S ++++S L   + DE  D +
Sbjct: 82  ------QRKSLDDDDDDEFDFLY--EDDEDDAGNATSGESSTDDDSLLELPDRDEDADTQ 133

Query: 506 EKNEDSNNKSEESSDDDD-DDWEKYQTGLNKRDKVLEGRSKQSHSV-HCPRFPEDKQEFW 563
             N+ +N+  ++ S +   D   K    L+  +K++E +  Q+ +      F  D +E  
Sbjct: 134 ANNDQTNDFDQDDSSNSQTDQGLKQSVNLSSAEKLIEEKKGQTENTFKFYNFGNDGEEAA 193



 Score = 31.0 bits (70), Expect = 4.1
 Identities = 13/71 (18%), Positives = 25/71 (35%), Gaps = 9/71 (12%)

Query: 471 VEKKDTPEESK----DVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDDW 526
           +E+K    E+     +  ++ EE +       D    K       N+   +  D D +  
Sbjct: 169 IEEKKGQTENTFKFYNFGNDGEEAA-----AKDGGKSKSSDPGPLNDSDGQGDDGDPESA 223

Query: 527 EKYQTGLNKRD 537
           E+ +   N R 
Sbjct: 224 EEDKAASNTRA 234


>gnl|CDD|226907 COG4530, COG4530, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 129

 Score = 31.0 bits (70), Expect = 0.92
 Identities = 15/42 (35%), Positives = 22/42 (52%)

Query: 630 EAPEVPTEHPQWEMSGEEDEGDEEMGGSDVSEFTTDEDVEDD 671
              EV  E+ + E+   ED  D+  GG D+ +   DEDV+ D
Sbjct: 57  VVKEVDAENEEVEVVSLEDADDDPKGGDDLPDLGDDEDVDLD 98



 Score = 30.2 bits (68), Expect = 2.2
 Identities = 18/63 (28%), Positives = 30/63 (47%), Gaps = 6/63 (9%)

Query: 469 AKVEKKDTPEESKDVSSESEEE------SDLSDVENDEVVDKKEKNEDSNNKSEESSDDD 522
             V++ D   E  +V S  + +       DL D+ +DE VD  + ++D+  + EE  DDD
Sbjct: 56  EVVKEVDAENEEVEVVSLEDADDDPKGGDDLPDLGDDEDVDLDDDDDDTFLEDEEDDDDD 115

Query: 523 DDD 525
              
Sbjct: 116 VSG 118


>gnl|CDD|234229 TIGR03490, Mycoplas_LppA, mycoides cluster lipoprotein, LppA/P72
           family.  Members of this protein family occur in
           Mycoplasma mycoides, Mycoplasma hyopneumoniae, and
           related Mycoplasmas in small paralogous families that
           may also include truncated forms and/or pseudogenes.
           Members are predicted lipoproteins with a conserved
           signal peptidase II processing and lipid attachment
           site. Note that the name for certain characterized
           members, p72, reflects an anomalous apparent molecular
           weight, given a theoretical MW of about 61 kDa.
          Length = 541

 Score = 32.9 bits (75), Expect = 0.92
 Identities = 17/96 (17%), Positives = 38/96 (39%), Gaps = 1/96 (1%)

Query: 430 KNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEE 489
             KK   K      K  KK   ++   N   ++       +      PE+  D  S+++E
Sbjct: 34  PEKKPEIKPNENTPKIPKKPDNKEPSENNNNKSNNENKDEENPSSTNPEKKPDP-SKNKE 92

Query: 490 ESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDD 525
           E +    E  +   K + ++ +N  +++ +++  D 
Sbjct: 93  EIEKPKDEPKKPDKKPQADQPNNVHADQPNNNKVDF 128


>gnl|CDD|184885 PRK14891, PRK14891, 50S ribosomal protein L24e/unknown domain
           fusion protein; Provisional.
          Length = 131

 Score = 31.1 bits (70), Expect = 0.94
 Identities = 10/70 (14%), Positives = 26/70 (37%)

Query: 458 TVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEE 517
                 A    A+  ++    ++ + + E+ E     + + +E  D+          +E 
Sbjct: 59  EKGPAAAATAAAEAAEEAEAADADEDADEAAEADAADEADEEEETDEAVDETADEADAEA 118

Query: 518 SSDDDDDDWE 527
              D+++D E
Sbjct: 119 EEADEEEDEE 128



 Score = 29.5 bits (66), Expect = 3.8
 Identities = 15/72 (20%), Positives = 34/72 (47%), Gaps = 6/72 (8%)

Query: 460 EQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSD-VENDEVVDKKEKNEDSNNKSEES 518
               A A+ A+  +    +E  D ++E++   +  +  E DE VD     E ++    E+
Sbjct: 64  AAATAAAEAAEEAEAADADEDADEAAEADAADEADEEEETDEAVD-----ETADEADAEA 118

Query: 519 SDDDDDDWEKYQ 530
            + D+++ E+ +
Sbjct: 119 EEADEEEDEEAE 130


>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
          Length = 401

 Score = 32.6 bits (74), Expect = 0.96
 Identities = 29/138 (21%), Positives = 42/138 (30%), Gaps = 25/138 (18%)

Query: 671 DLKLEPCAQLMLYLRPAAAPHAG----RLMKNNSTNSSDRLRDLCSVGRRRVLSES--FS 724
            L L+   Q +L  R AA    G    R+    +TN+S     L +      L+      
Sbjct: 120 TLTLDLARQPLLRARGAARAVVGLYVLRVWVEGATNASLFPLGLAAFPAEGTLAAPPLGE 179

Query: 725 TTQERPILPRPTPGAPIMVAPTTSVQFLNGSKATQFVFVPANSLQPVPIPPAKKRGYVKQ 784
            + +    P     AP  + P               VFVPA      P P  +     + 
Sbjct: 180 GSADGSCDPALPLSAP-RLGPAD-------------VFVPA-----TPRPTPRTTASPET 220

Query: 785 KPPPILPAPAPTVPTPIA 802
            P P      P+   P  
Sbjct: 221 TPTPSTTTSPPSTTIPAP 238


>gnl|CDD|225677 COG3135, BenE, Uncharacterized protein involved in benzoate
           metabolism [Secondary metabolites biosynthesis,
           transport, and catabolism].
          Length = 402

 Score = 32.3 bits (74), Expect = 1.0
 Identities = 13/46 (28%), Positives = 20/46 (43%), Gaps = 9/46 (19%)

Query: 787 PPILPAPAPTVPTPIAPEFLWSRESKMSIANRIVSEATTMFLETML 832
              L A   + PT + PEF ++          ++S A  +FL TM 
Sbjct: 200 HTALVALEISTPTWVTPEFSFA---------AMLSLALPLFLVTMA 236


>gnl|CDD|227880 COG5593, COG5593, Nucleic-acid-binding protein possibly involved in
           ribosomal biogenesis [Translation, ribosomal structure
           and biogenesis].
          Length = 821

 Score = 32.7 bits (74), Expect = 1.0
 Identities = 25/95 (26%), Positives = 36/95 (37%), Gaps = 9/95 (9%)

Query: 431 NKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEE 490
            KK    K    NK    S          E  E     A V+ +   E+  D S     E
Sbjct: 663 TKKTADGKGKKSNKASFDSDD--------EMDENEIWSALVKSRPDVEDDSDDSELDFAE 714

Query: 491 SDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDD 525
            D SD  +D+   K +  +D + KSE S + D ++
Sbjct: 715 DDFSDSTSDD-EPKLDAIDDEDAKSEGSQESDQEE 748


>gnl|CDD|177464 PHA02682, PHA02682, ORF080 virion core protein; Provisional.
          Length = 280

 Score = 32.1 bits (72), Expect = 1.1
 Identities = 28/88 (31%), Positives = 34/88 (38%), Gaps = 11/88 (12%)

Query: 733 PRPTPGAPIMVAPTTSVQFLNGSKATQFVFVPANSLQ---------PVPIPPAKKRGYVK 783
           P P P  P   APT     +  + A      P ++ Q         P P P AK      
Sbjct: 110 PAPAPACPPATAPTCPPPAVCPAPARPAPACPPSTRQCPPAPPLPTPKPAPAAKPIFLHN 169

Query: 784 QKPPPILPAPA-PTVPT-PIAPEFLWSR 809
           Q PPP  PA + PT+ T P A   L  R
Sbjct: 170 QLPPPDYPAASCPTIETAPAASPVLEPR 197


>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
           [Transcription / DNA replication, recombination, and
           repair / Chromatin structure and dynamics].
          Length = 1001

 Score = 32.7 bits (74), Expect = 1.2
 Identities = 22/52 (42%), Positives = 30/52 (57%), Gaps = 4/52 (7%)

Query: 480 SKDVSSESEEESD----LSDVENDEVVDKKEKNEDSNNKSEESSDDDDDDWE 527
           S D S ESEEE       SD E+DE  + +E +E S + SE+ S++D  D E
Sbjct: 927 SDDESDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEE 978



 Score = 31.9 bits (72), Expect = 1.7
 Identities = 11/51 (21%), Positives = 25/51 (49%), Gaps = 2/51 (3%)

Query: 477 PEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDDWE 527
            E S+  +S  +E  +  + E  +   +    ++S N S +  ++D +DW+
Sbjct: 936 EEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSD--EEDGEDWD 984


>gnl|CDD|225519 COG2972, COG2972, Predicted signal transduction protein with a
           C-terminal ATPase domain [Signal transduction
           mechanisms].
          Length = 456

 Score = 32.4 bits (74), Expect = 1.2
 Identities = 31/138 (22%), Positives = 53/138 (38%), Gaps = 18/138 (13%)

Query: 130 FYYAFFQITPHMALKRVLMILGASLEFDRRFNSEIIERPSDEIEVPQLIRQIPNLGEKNR 189
                   +    L  +L ++         + SE+ E  S+ I   ++  ++  L E+N 
Sbjct: 191 VILLLSSFSSRSILLPILGLILLLSRIALGYLSELKEI-SETIN--EMEERLEYLIEENY 247

Query: 190 ERPLYHKYSIKARAL---IYAH-LSNMQLTSETLEADRMYIVKKCPYLLQEMVTCISQLI 245
                     + RAL   I  H L N      TLE  RM   +  P    ++V  +S+L+
Sbjct: 248 SLEQEQLE-AELRALQSQINPHFLYN------TLETIRMLAEEDDPEEAAKVVKALSKLL 300

Query: 246 LLAYAQRVPRLIHIETLE 263
             + +     L +I TLE
Sbjct: 301 RYSLS----NLDNIVTLE 314


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 32.7 bits (74), Expect = 1.2
 Identities = 29/134 (21%), Positives = 63/134 (47%), Gaps = 4/134 (2%)

Query: 392 EPSKEGEGEIEEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSG 451
           E   EGE E E +E++  G I    + E +        K +  G+ +   K++H+ +S  
Sbjct: 763 ETEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAE-GKVEHEGETEAGEKDEHEGQSET 821

Query: 452 QKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDS 511
           Q   T   ++T    ++    + +  ++ K V  +    SD  D E +E  +++E+ E+ 
Sbjct: 822 QADDTEVKDETGE-QELNAENQGEAKQDEKGV--DGGGGSDGGDSEEEEEEEEEEEEEEE 878

Query: 512 NNKSEESSDDDDDD 525
             + EE  ++++++
Sbjct: 879 EEEEEEEEEEENEE 892



 Score = 31.9 bits (72), Expect = 1.8
 Identities = 30/131 (22%), Positives = 56/131 (42%), Gaps = 10/131 (7%)

Query: 396 EGEGEIEEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSGQKKI 455
           E EGEI+  E+           K + +       K++  G+ +    +   K  +G++++
Sbjct: 778 EDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQEL 837

Query: 456 TNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKS 515
            N   Q EA  D    EK        D   +SEEE +    E +E  +++E+ E+   + 
Sbjct: 838 -NAENQGEAKQD----EKGVDGGGGSD-GGDSEEEEE----EEEEEEEEEEEEEEEEEEE 887

Query: 516 EESSDDDDDDW 526
           EE+ +    +W
Sbjct: 888 EENEEPLSLEW 898


>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
           Transcription initiation factor IIA (TFIIA) is a
           heterotrimer, the three subunits being known as alpha,
           beta, and gamma, in order of molecular weight. The N and
           C-terminal domains of the gamma subunit are represented
           in pfam02268 and pfam02751, respectively. This family
           represents the precursor that yields both the alpha and
           beta subunits. The TFIIA heterotrimer is an essential
           general transcription initiation factor for the
           expression of genes transcribed by RNA polymerase II.
           Together with TFIID, TFIIA binds to the promoter region;
           this is the first step in the formation of a
           pre-initiation complex (PIC). Binding of the rest of the
           transcription machinery follows this step. After
           initiation, the PIC does not completely dissociate from
           the promoter. Some components, including TFIIA, remain
           attached and re-initiate a subsequent round of
           transcription.
          Length = 332

 Score = 32.0 bits (73), Expect = 1.2
 Identities = 17/75 (22%), Positives = 26/75 (34%), Gaps = 14/75 (18%)

Query: 451 GQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNED 510
            Q K      +    A +  ++  D  + S D   E   ESDL D ++D           
Sbjct: 227 KQPKKQAKSSKRRTIAQIDGIDSDDEGDGSDDDDDEDAIESDLDDSDDDV---------- 276

Query: 511 SNNKSEESSDDDDDD 525
               S+E  +D  D 
Sbjct: 277 ----SDEDGEDLFDT 287


>gnl|CDD|219069 pfam06512, Na_trans_assoc, Sodium ion transport-associated.
           Members of this family contain a region found
           exclusively in eukaryotic sodium channels or their
           subunits, many of which are voltage-gated. Members very
           often also contain between one and four copies of
           pfam00520 and, less often, one copy of pfam00612.
          Length = 230

 Score = 32.0 bits (73), Expect = 1.2
 Identities = 28/153 (18%), Positives = 45/153 (29%), Gaps = 15/153 (9%)

Query: 384 DTIKVQFLEPSKEGEGEIEEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKK---F 440
           D  K    +  KE   E  E E+         +       H                   
Sbjct: 27  DFFKKLLKKKPKE-IDEQTEAEDNLLDKAVDDILSN----HYLKEGNGTTSVAIGDGMEK 81

Query: 441 TKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDE 500
           T +   K+S     ITN       P  +A  E        +D SSES       +     
Sbjct: 82  TIHGDVKESDPMSFITNPSLTVSVP--IASGESDLENLNDEDTSSESSYGF--KEESKKG 137

Query: 501 VVDKKEKNEDSNNKSEESSDD---DDDDWEKYQ 530
             +  +  E  ++ SE S+ D    +++ E+  
Sbjct: 138 SAETLKLEEKDSSSSEGSTVDLEPPEEEEEEIA 170


>gnl|CDD|237866 PRK14952, PRK14952, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 584

 Score = 32.5 bits (74), Expect = 1.3
 Identities = 12/46 (26%), Positives = 16/46 (34%)

Query: 764 PANSLQPVPIPPAKKRGYVKQKPPPILPAPAPTVPTPIAPEFLWSR 809
           P  +  P    P  K     +  P + P PA   P   A   +WS 
Sbjct: 396 PQAAPAPSAAAPEPKHQPAPEPRPVLAPTPASGEPNAAAVRSMWST 441


>gnl|CDD|219293 pfam07093, SGT1, SGT1 protein.  This family consists of several
           eukaryotic SGT1 proteins. Human SGT1 or hSGT1 is known
           to suppress GCR2 and is highly expressed in the muscle
           and heart. The function of this family is unknown
           although it has been speculated that SGT1 may be
           functionally analogous to the Gcr2p protein of
           Saccharomyces cerevisiae which is known to be a
           regulatory factor of glycolytic gene expression.
          Length = 557

 Score = 32.4 bits (74), Expect = 1.3
 Identities = 22/116 (18%), Positives = 49/116 (42%), Gaps = 9/116 (7%)

Query: 432 KKGGKKKKFTKNKHDKKSSGQ-KKITNTVEQ-TEAPADVAKVEKKDTPEESKDVSSESEE 489
           ++ G KK    NK D       +++ +++E+     +     E  D  +E  D   +SE+
Sbjct: 393 ERQGDKKDLKSNKEDANEVDDLEEVVSSMEEFLNKVSSFEGAEFADDEDEDDDEPDDSED 452

Query: 490 ESDLSDVEN-DEVVDKK------EKNEDSNNKSEESSDDDDDDWEKYQTGLNKRDK 538
           +    D +   E +         E + D  + S+++ +DDD+D ++ +   +    
Sbjct: 453 KDVSFDEDEFFEFLKNMLGLKDDEIDNDLPDDSDDADEDDDEDDDEDEDSSSDSTL 508



 Score = 32.0 bits (73), Expect = 1.6
 Identities = 17/61 (27%), Positives = 26/61 (42%), Gaps = 2/61 (3%)

Query: 467 DVAKVEKKDTPEESKDVSSESEEESD--LSDVENDEVVDKKEKNEDSNNKSEESSDDDDD 524
           D A  +  +  +E +D SS+S  E      D  + E+      N    + S  S  +DDD
Sbjct: 486 DDADEDDDEDDDEDEDSSSDSTLEELEEYMDQMDAELKQTDSSNNADISNSGSSGAEDDD 545

Query: 525 D 525
           D
Sbjct: 546 D 546


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 32.4 bits (74), Expect = 1.3
 Identities = 12/110 (10%), Positives = 39/110 (35%), Gaps = 8/110 (7%)

Query: 436 KKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSD 495
           K +  +    +     ++ I +T  + E        +K    +++ +   +SE++     
Sbjct: 20  KLQPISYIYSNVLVLSKE-ILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKK 78

Query: 496 VENDEVVDKKEKNE-DSNNKSEESS------DDDDDDWEKYQTGLNKRDK 538
            + +   + + K    +  KS+++         ++D    +       +K
Sbjct: 79  EKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAFNKIAELAEK 128


>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
          Length = 1465

 Score = 32.5 bits (74), Expect = 1.3
 Identities = 28/149 (18%), Positives = 52/149 (34%), Gaps = 25/149 (16%)

Query: 402  EEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQ 461
            +++++     ++     +K+   +P     K         K       SGQK +T  ++ 
Sbjct: 1314 DDDDDDFAVEVSLAERLKKKGGRKPAAANKKAAKPPAAAKKRGPATVQSGQKLLTEMLKP 1373

Query: 462  TEA-----PADVAKVEKKDTPEESKDV--------SSESEEESDLSDVENDEVVD----- 503
             EA        V K+      ++S  V         +ES E    S     + +D     
Sbjct: 1374 AEAIGISPEKKVRKMRASPFNKKSGSVLGRAATNKETESSENVSGSSSSEKDEIDVSAKP 1433

Query: 504  -------KKEKNEDSNNKSEESSDDDDDD 525
                   K+     S+++SE + D D DD
Sbjct: 1434 RPQRANRKQTTYVLSDSESESADDSDFDD 1462


>gnl|CDD|236138 PRK07994, PRK07994, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 647

 Score = 32.1 bits (74), Expect = 1.3
 Identities = 15/72 (20%), Positives = 17/72 (23%), Gaps = 11/72 (15%)

Query: 729 RPILPRPTPGAPIMVAPTTSVQFLNGSKATQFVFVPANSLQPVPIPPAKKRGYVKQKPPP 788
            P  P P P  P   A   +      +       V       VP PP           P 
Sbjct: 360 HPAAPLPEPEVPPQSAAPAASAQATAAPTAA---VAPPQAPAVPPPP--------ASAPQ 408

Query: 789 ILPAPAPTVPTP 800
             PA      T 
Sbjct: 409 QAPAVPLPETTS 420


>gnl|CDD|220611 pfam10169, Laps, Learning-associated protein.  This is a family of
           121-amino acid secretory proteins. Laps functions in the
           regulation of neuronal cell adhesion and/or movement and
           synapse attachment. Laps binds to the ApC/EBP (Aplysia
           CCAAT/enhancer binding protein) promoter and activates
           the transcription of ApC/EBP mRNA.
          Length = 124

 Score = 30.6 bits (69), Expect = 1.4
 Identities = 22/69 (31%), Positives = 30/69 (43%), Gaps = 5/69 (7%)

Query: 391 LEPSKEGEGEIEEEEEKPNGTI-TPVVNKE---KQQTHRPVWMKNKKGGKKKKFTKNKHD 446
            E  K+   E EE++EK        V NK+    Q    P WM  ++  KK K  + K  
Sbjct: 56  KEKKKKEGTESEEDDEKMEVDNKAAVRNKKTLRDQHGQYPAWMNQRQ-AKKLKKKREKKK 114

Query: 447 KKSSGQKKI 455
           KK  G K +
Sbjct: 115 KKKKGAKNL 123


>gnl|CDD|178151 PLN02536, PLN02536, diaminopimelate epimerase.
          Length = 267

 Score = 31.3 bits (71), Expect = 1.9
 Identities = 20/54 (37%), Positives = 23/54 (42%), Gaps = 1/54 (1%)

Query: 363 GAIITVTCTLMRKPMSVLFGDDTIKVQFLEPSKEGEGEIEEEEEKPNGTITPVV 416
           G    VTC  M  P  V FG+  +KV  L   K G  + E  E  P  T T  V
Sbjct: 135 GKTWLVTCVSMGNPHCVTFGEKELKVDDLPLEKIGP-KFEHHEMFPARTNTEFV 187


>gnl|CDD|225880 COG3343, RpoE, DNA-directed RNA polymerase, delta subunit
           [Transcription].
          Length = 175

 Score = 30.9 bits (70), Expect = 2.0
 Identities = 15/42 (35%), Positives = 19/42 (45%)

Query: 486 ESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDDWE 527
           E +E   L D  +DE  D  E  E      E   D+DDD+ E
Sbjct: 128 EDDEVDSLDDENDDEDEDDDEIVEILIEDDEVDEDEDDDEDE 169


>gnl|CDD|177328 PHA01929, PHA01929, putative scaffolding protein.
          Length = 306

 Score = 31.2 bits (70), Expect = 2.1
 Identities = 25/78 (32%), Positives = 33/78 (42%), Gaps = 5/78 (6%)

Query: 730 PILPRPTPGAPIMVAPTTSVQFLNGSKATQFVFVPANSLQPVPIPPAKKRGYVKQKPPPI 789
           P    PTP    ++ P   VQ      A Q + +P    QP P+P +    +V Q+ P  
Sbjct: 20  PPAAAPTPQPNPVIQPQAPVQPGQ-PGAPQQLAIPTQ--QPQPVPTSAMTPHVVQQAPA- 75

Query: 790 LPAP-APTVPTPIAPEFL 806
            PAP AP       PE L
Sbjct: 76  QPAPAAPPAAGAALPEAL 93


>gnl|CDD|233222 TIGR00986, 3a0801s05tom22, mitochondrial import receptor subunit
           Tom22.  The mitochondrial protein translocase (MPT)
           family, which brings nuclearly encoded preproteins into
           mitochondria, is very complex with 19 currently
           identified protein constituents.These proteins include
           several chaperone proteins, four proteins of the outer
           membrane translocase (Tom) import receptor, five
           proteins of the Tom channel complex, five proteins of
           the inner membrane translocase (Tim) and three "motor"
           proteins. This family is specific for the Tom22 proteins
           [Transport and binding proteins, Amino acids, peptides
           and amines].
          Length = 145

 Score = 30.3 bits (68), Expect = 2.2
 Identities = 13/52 (25%), Positives = 24/52 (46%), Gaps = 1/52 (1%)

Query: 454 KITNTVEQTEA-PADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDK 504
           ++T   ++    P     VE ++  E+  DV SE   +SD   +E +   D+
Sbjct: 3   QLTEVEDEHFQQPEQQTVVEPEEDDEDFTDVDSEDSVDSDFESLEEETFTDR 54


>gnl|CDD|185638 PTZ00459, PTZ00459, mucin-associated surface protein (MASP);
           Provisional.
          Length = 291

 Score = 31.3 bits (70), Expect = 2.2
 Identities = 22/71 (30%), Positives = 37/71 (52%), Gaps = 9/71 (12%)

Query: 469 AKVEKKDTPEESKDVSSESEEESDL----SDV-ENDEVVDKKEKNEDSNNKSEESSDDD- 522
           A+ +   +P ESK + + S+   DL    +   EN   +  +E +ED ++ SEE  DDD 
Sbjct: 34  AQQDGGKSPPESKGLETSSQGTQDLKGGAAGAKENSPPLPTEEDDEDVDDDSEEGDDDDG 93

Query: 523 ---DDDWEKYQ 530
              D++ EK +
Sbjct: 94  GAEDEEEEKVR 104


>gnl|CDD|219928 pfam08602, Mgr1, Mgr1-like, i-AAA protease complex subunit.  The S.
           cerevisiae Mgr1 protein has been shown to be required
           for mitochondrial viability in yeast lacking
           mitochondrial DNA. It is a mitochondrial inner membrane
           protein, which interacts with Yme1 and is a new subunit
           of the i-AAA protease complex.
          Length = 362

 Score = 31.4 bits (71), Expect = 2.3
 Identities = 18/68 (26%), Positives = 29/68 (42%), Gaps = 4/68 (5%)

Query: 95  LYALVFMVALPTAVGMWWYKSIRYTGDKVLLETINFYYAFFQITPHMALKRVLMILGASL 154
           L  L  +V L T +G++++K  R  G K+    I    A F      +     M +   +
Sbjct: 54  LPGLYTLVGLQTGIGLFFFKRFRTLGKKLRKRDI----ADFPSLNRFSTTHGDMHMAPVI 109

Query: 155 EFDRRFNS 162
            + RRF S
Sbjct: 110 PWQRRFVS 117


>gnl|CDD|217783 pfam03896, TRAP_alpha, Translocon-associated protein (TRAP), alpha
           subunit.  The alpha-subunit of the TRAP complex (TRAP
           alpha) is a single-spanning membrane protein of the
           endoplasmic reticulum (ER) which is found in proximity
           of nascent polypeptide chains translocating across the
           membrane.
          Length = 281

 Score = 31.3 bits (71), Expect = 2.4
 Identities = 17/60 (28%), Positives = 27/60 (45%), Gaps = 2/60 (3%)

Query: 465 PADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDD 524
           PA +        P  S    +E EE  D  DV +++  D+    ED N  +EE  D++ +
Sbjct: 15  PAVLLTFGNGKGPFASAQDLTEDEEAED--DVVDEDEEDEAVVEEDENELTEEEEDEEGE 72


>gnl|CDD|185618 PTZ00438, PTZ00438, gamete antigen 27/25-like protein; Provisional.
          Length = 374

 Score = 31.2 bits (70), Expect = 2.5
 Identities = 21/77 (27%), Positives = 35/77 (45%), Gaps = 4/77 (5%)

Query: 455 ITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEK----NED 510
           I N     E   ++  +      EE +    E EE+ D+ ++E  E V+  E+    +ED
Sbjct: 77  IANDKSDNENDVELEGLNIIVKNEEERGTQKEEEEDEDVEEIEEVEEVEVVEEEYDDDED 136

Query: 511 SNNKSEESSDDDDDDWE 527
           S    E+ SD + D+ E
Sbjct: 137 SEKDDEKESDAEGDENE 153



 Score = 29.6 bits (66), Expect = 7.3
 Identities = 20/95 (21%), Positives = 50/95 (52%), Gaps = 2/95 (2%)

Query: 439 KFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEE--SKDVSSESEEESDLSDV 496
              KN+ ++ +  +++    VE+ E   +V  VE++   +E   KD   ES+ E D +++
Sbjct: 95  IIVKNEEERGTQKEEEEDEDVEEIEEVEEVEVVEEEYDDDEDSEKDDEKESDAEGDENEL 154

Query: 497 ENDEVVDKKEKNEDSNNKSEESSDDDDDDWEKYQT 531
             + ++++ + +    N++  S D+ +D  + Y++
Sbjct: 155 AGEYIIEEVDDDVAILNENINSIDELEDPLDSYKS 189


>gnl|CDD|236304 PRK08581, PRK08581, N-acetylmuramoyl-L-alanine amidase; Validated.
          Length = 619

 Score = 31.3 bits (71), Expect = 2.5
 Identities = 6/72 (8%), Positives = 27/72 (37%)

Query: 478 EESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDDWEKYQTGLNKRD 537
            +    +  +  +S  S+ +        +  + ++N +  + D++D  +    +  +  +
Sbjct: 30  PQKDSTAKTTSHDSKKSNDDETSKDTSSKDTDKADNNNTSNQDNNDKKFSTIDSSTSDSN 89

Query: 538 KVLEGRSKQSHS 549
            +++   K    
Sbjct: 90  NIIDFIYKNLPQ 101



 Score = 30.9 bits (70), Expect = 3.5
 Identities = 13/57 (22%), Positives = 25/57 (43%), Gaps = 5/57 (8%)

Query: 466 ADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDD 522
           A     +K  T + +   S +S ++   S   + +  DK     D+NN S + ++D 
Sbjct: 25  AYADDPQKDSTAKTTSHDSKKSNDDET-SKDTSSKDTDK----ADNNNTSNQDNNDK 76


>gnl|CDD|227458 COG5129, MAK16, Nuclear protein with HMG-like acidic region
           [General function prediction only].
          Length = 303

 Score = 31.2 bits (70), Expect = 2.6
 Identities = 23/89 (25%), Positives = 43/89 (48%), Gaps = 3/89 (3%)

Query: 461 QTEAPADVAKVEKKDTPEESKDVSSESEEE-SDLSDVENDEVVDKKEKNEDSNNKSEESS 519
           Q E    V + E+ DT  E+  V+ +SE+E +   D+E     D+  +  +S  +    S
Sbjct: 197 QDEKERYVEEEEESDTELEA--VTDDSEKEKTKKKDLEKWLGSDQSMETSESEEEESSES 254

Query: 520 DDDDDDWEKYQTGLNKRDKVLEGRSKQSH 548
           + D+D+ E  +  + KR      +S++ H
Sbjct: 255 ESDEDEDEDNKGKIRKRKTDDAKKSRKPH 283


>gnl|CDD|223035 PHA03294, PHA03294, envelope glycoprotein H; Provisional.
          Length = 835

 Score = 31.2 bits (71), Expect = 2.7
 Identities = 20/103 (19%), Positives = 26/103 (25%), Gaps = 15/103 (14%)

Query: 725 TTQERPILPRPTPGAPIMVAPTTS-------VQFLNGSKATQFVFVPA-NSLQPVPIPPA 776
           T  + P    PT   P  VAP +          +L       F  VP   SL       +
Sbjct: 106 TFAKLPARA-PTGRLPPPVAPLSGLLGNPNLAPYLRTRHLVDFSVVPDPRSLTRWVFDRS 164

Query: 777 KKRGYVKQKPPPILPAPAPT------VPTPIAPEFLWSRESKM 813
                        LP P          P  I P    +  +  
Sbjct: 165 DTAATKAHPSGVALPPPRAPPPRNTTDPATIKPNDHLNPATTF 207


>gnl|CDD|221866 pfam12935, Sec16_N, Vesicle coat trafficking protein Sec16
           N-terminus.  Sec16 is a multi-domain vesicle coat
           protein. The overall function of Sec16 is in mediating
           the movement of protein-cargo between the organelles of
           the secretory pathway. Over-expression of truncated
           mutants of only the N-terminus are lethal, and this
           portion does not appear to be essential for function so
           may act as a stabilising region.
          Length = 246

 Score = 30.6 bits (69), Expect = 2.8
 Identities = 24/161 (14%), Positives = 54/161 (33%), Gaps = 12/161 (7%)

Query: 383 DDTIKVQFLEPSKEGEGEI-----EEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKK 437
           D+  + + +E S      I     E+E+++ +     +   E  +  +P     +K   +
Sbjct: 47  DNGEQNEPVEESAPQTVAIDSVFVEDEDDEGSDFFNSLHEGEAVEEQQPPPHLTRKSTSQ 106

Query: 438 KKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVE 497
              +   +    S            +            + + + +  S+SE  S+     
Sbjct: 107 VLDSLGLNPDSLSSPASAEPLDPTAQNEFSNVL---AASTDGNPESESQSEPSSEEELAA 163

Query: 498 NDEVVDKK-EKNEDSNNKSEESS---DDDDDDWEKYQTGLN 534
             E+ D + E     ++ +E      D+DDD     +T L 
Sbjct: 164 RAELSDDESESTPTEDDLAERWQAFLDNDDDLLLDDETALA 204


>gnl|CDD|227891 COG5604, COG5604, Uncharacterized conserved protein [Function
           unknown].
          Length = 523

 Score = 31.0 bits (70), Expect = 2.9
 Identities = 17/71 (23%), Positives = 29/71 (40%), Gaps = 5/71 (7%)

Query: 430 KNKKGGKKKKFTKNKHDKKSSG----QKKITNTVEQTEAPADVAKVEKKD-TPEESKDVS 484
           K  K   K    + K   +S      + + ++T  + E+  +     K+  +   S   S
Sbjct: 10  KFTKNHLKNTIDRRKQLARSKKVYGTKNRNSHTENKMESGTNDNNKNKEKLSKLYSDVDS 69

Query: 485 SESEEESDLSD 495
           S SEEE D S+
Sbjct: 70  SSSEEEEDGSE 80


>gnl|CDD|219901 pfam08555, DUF1754, Eukaryotic family of unknown function
           (DUF1754).  This is a eukaryotic protein family of
           unknown function.
          Length = 90

 Score = 28.9 bits (65), Expect = 2.9
 Identities = 19/61 (31%), Positives = 28/61 (45%), Gaps = 4/61 (6%)

Query: 430 KNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEE 489
           K KK   KK   K K  KK +  K+   T E+ E     A+ + K+  E+  +   E EE
Sbjct: 13  KGKKIDVKK---KKKKKKKKNKSKEEVVT-EKEEEEKSSAESDLKEGEEDEDNEKIEQEE 68

Query: 490 E 490
           +
Sbjct: 69  D 69


>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457).  This is
           a family of uncharacterized proteins.
          Length = 449

 Score = 31.1 bits (70), Expect = 3.1
 Identities = 25/107 (23%), Positives = 46/107 (42%), Gaps = 6/107 (5%)

Query: 428 WMKNKKGGKKKKFTKNKHDKKSSGQKKITNTVE-QTEAPADVAKVEKKDTPEESKDVSSE 486
           W+       K+K T N   KK +  +K+    E +     D  + +  D  +E +D   +
Sbjct: 15  WV-RGSLDYKEKLTLNDTMKKENAIRKLGKEAEEEAMEEEDDDEEDDDDDDDEDEDDDDD 73

Query: 487 SEEESDLSDVENDEVVDKKEKNEDSNNKSEES----SDDDDDDWEKY 529
            ++E D  + ++D  +      +D N    E+    SDD+ DD  +Y
Sbjct: 74  DDDEDDEDEDDDDSTLHDDSSADDGNETDNEAGFADSDDESDDGSEY 120


>gnl|CDD|240226 PTZ00007, PTZ00007, (NAP-L) nucleosome assembly protein -L;
           Provisional.
          Length = 337

 Score = 30.9 bits (70), Expect = 3.2
 Identities = 10/38 (26%), Positives = 23/38 (60%)

Query: 488 EEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDD 525
           +E+SD S  E+D+  D  + ++ +++ S    D +++D
Sbjct: 281 DEDSDYSSDEDDDDYDSYDSSDSASSDSNSDVDTNEED 318


>gnl|CDD|233692 TIGR02031, BchD-ChlD, magnesium chelatase ATPase subunit D.  This
           model represents one of two ATPase subunits of the
           trimeric magnesium chelatase responsible for insertion
           of magnesium ion into protoporphyrin IX. This is an
           essential step in the biosynthesis of both chlorophyll
           and bacteriochlorophyll. This subunit is found in green
           plants, photosynthetic algae, cyanobacteria and other
           photosynthetic bacteria. Unlike subunit I (TIGR02030),
           this subunit is not found in archaea [Biosynthesis of
           cofactors, prosthetic groups, and carriers, Chlorophyll
           and bacteriochlorphyll].
          Length = 589

 Score = 30.9 bits (70), Expect = 3.3
 Identities = 21/92 (22%), Positives = 35/92 (38%), Gaps = 10/92 (10%)

Query: 756 KATQFVFVPANSLQPVPIPPAKKRGYVKQKPPPILPAPAPTVPTPIAPEFLWSRESKMSI 815
            A + V +P  +  P P P          +PPP  P P P  P     E   +       
Sbjct: 253 LAVELVLLPRATRLPEPEP----------QPPPPPPPPEPPEPEEEPDEPDQTDPDDGEE 302

Query: 816 ANRIVSEATTMFLETMLQPDILSYIQKGEGKK 847
            ++I  E     +E  L  +IL+ +Q  + ++
Sbjct: 303 TDQIPEELMFDAVEADLPDNILATLQTVQRRR 334


>gnl|CDD|217829 pfam03985, Paf1, Paf1.  Members of this family are components of
           the RNA polymerase II associated Paf1 complex. The Paf1
           complex functions during the elongation phase of
           transcription in conjunction with Spt4-Spt5 and
           Spt16-Pob3i.
          Length = 431

 Score = 30.9 bits (70), Expect = 3.3
 Identities = 19/79 (24%), Positives = 30/79 (37%)

Query: 456 TNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKS 515
            +T E        A+++  D  E  +D   E E+ SD  + E  E  +++      +  S
Sbjct: 347 PSTKESKMRDKRRARLDPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSS 406

Query: 516 EESSDDDDDDWEKYQTGLN 534
           E SSD   D   K      
Sbjct: 407 ESSSDVGSDSESKADKESA 425


>gnl|CDD|147374 pfam05160, DSS1_SEM1, DSS1/SEM1 family.  This family contains the
           breast cancer tumour suppressor BRCA2-interacting
           protein DSS1 and its homologue SEM1, both of which are
           short acidic proteins. DSS1 has been shown to be a
           conserved component of the Rae1 mediated mRNA export
           pathway in Schizosaccharomyces pombe.
          Length = 63

 Score = 28.0 bits (63), Expect = 3.3
 Identities = 12/38 (31%), Positives = 22/38 (57%)

Query: 488 EEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDD 525
           EE+ +  D   ++  +++ + E+ NN  EE  DDDD +
Sbjct: 12  EEDDEFEDFPVEDWPEEETEKENGNNLWEEDWDDDDVE 49


>gnl|CDD|237304 PRK13206, ureC, urease subunit alpha; Reviewed.
          Length = 573

 Score = 30.8 bits (70), Expect = 3.3
 Identities = 15/30 (50%), Positives = 16/30 (53%), Gaps = 2/30 (6%)

Query: 722 SFSTTQERPILPRPTPGAPIMVAPTTSVQF 751
           S  T Q  P+LPRP  GA    A  TSV F
Sbjct: 470 SIPTPQ--PVLPRPMFGAAPAAAAATSVHF 497


>gnl|CDD|236512 PRK09430, djlA, Dna-J like membrane chaperone protein; Provisional.
          Length = 267

 Score = 30.6 bits (70), Expect = 3.5
 Identities = 10/23 (43%), Positives = 12/23 (52%)

Query: 12  SGASEASIKKAYRKQSLILHPDK 34
               +  IK+AYRK     HPDK
Sbjct: 210 ESDDDQEIKRAYRKLMSEHHPDK 232


>gnl|CDD|235826 PRK06549, PRK06549, acetyl-CoA carboxylase biotin carboxyl carrier
           protein subunit; Validated.
          Length = 130

 Score = 29.4 bits (66), Expect = 3.5
 Identities = 12/44 (27%), Positives = 17/44 (38%)

Query: 758 TQFVFVPANSLQPVPIPPAKKRGYVKQKPPPILPAPAPTVPTPI 801
            Q       +  PVP+P         Q P P   A A  +P+P+
Sbjct: 25  AQAAAPAQPASTPVPVPTEASPQVEAQAPQPAAAAGADAMPSPM 68


>gnl|CDD|219668 pfam07964, Red1, Rec10 / Red1.  Rec10 / Red1 is involved in meiotic
           recombination and chromosome segregation during
           homologous chromosome formation. This protein localises
           to the synaptonemal complex in S. cerevisiae and the
           analogous structures (linear elements) in S. pombe. This
           family is currently only found in fungi.
          Length = 706

 Score = 31.0 bits (70), Expect = 3.5
 Identities = 38/189 (20%), Positives = 61/189 (32%), Gaps = 20/189 (10%)

Query: 345 LHVQPEVIDDEATTEYTAGAIITVTCTLMRKPMSVLFGDDTIKVQFLEPSKEGEGEIEEE 404
           L    EV +D+A++     A          K    + GD  +K   +   K+   ++  E
Sbjct: 363 LESLKEVFEDQASSVKNKQAQSEENLNESPKTPIAVTGDPHLKDPTIIAGKKLMNKLTSE 422

Query: 405 EEKPNGTITPVVNKEKQQTH--------------RPVWMKNKKGGKKKKFTKNKHDKKSS 450
           +      +   V+K K                   PV  + +K   K K TK K+ K   
Sbjct: 423 KINNPVKV-VKVSKYKGNKSEKKRDINVLDTIFASPVSKELRKKVGKSKQTKLKNFKPVP 481

Query: 451 GQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNED 510
            + K       ++      KV K  T  ++                 N E  DK+     
Sbjct: 482 NKSKKQLANNNSQNI-KSKKVVKAKTNNKANLQDVGECSSPPN----NKEKNDKQTSTSS 536

Query: 511 SNNKSEESS 519
           S  KS+ SS
Sbjct: 537 SVLKSDRSS 545


>gnl|CDD|132386 TIGR03343, biphenyl_bphD, 2-hydroxy-6-oxo-6-phenylhexa-2,4-dienoate
           hydrolase.  Members of this family are
           2-hydroxy-6-oxo-6-phenylhexa-2,4-dienoate hydrolase, or
           HOPD hydrolase, the BphD protein of biphenyl
           degradation. BphD acts on the product of ring
           meta-cleavage by BphC. Many species carrying bphC and
           bphD are capable of degrading polychlorinated biphenyls
           as well as biphenyl itself.
          Length = 282

 Score = 30.3 bits (68), Expect = 3.8
 Identities = 21/83 (25%), Positives = 43/83 (51%), Gaps = 12/83 (14%)

Query: 272 IVQGMWDFKNPLLQLPYVTDDHLKHFI--CKKRYIKSLQQFAQMKNEERRSVVKFMNDEQ 329
           ++QG W     + + P    +HLK+F+   +K  + +    A++   + +++V +  D++
Sbjct: 183 LLQGRW---ENIQRQP----EHLKNFLISSQKAPLSTWDVTARLGEIKAKTLVTWGRDDR 235

Query: 330 YSDM---LKVLGNMPYVDLHVQP 349
           +  +   LK+L NMP   LHV  
Sbjct: 236 FVPLDHGLKLLWNMPDAQLHVFS 258


>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
           [Transcription / Chromatin structure and dynamics].
          Length = 279

 Score = 30.3 bits (68), Expect = 3.9
 Identities = 17/71 (23%), Positives = 30/71 (42%), Gaps = 6/71 (8%)

Query: 467 DVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDDW 526
           DV   E++   E       E EE    S  E +  ++++E+ E       E SDD +D  
Sbjct: 173 DVDNEEEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEEEA------EGSDDGEDVV 226

Query: 527 EKYQTGLNKRD 537
           +     ++K+ 
Sbjct: 227 DYEGERIDKKQ 237



 Score = 29.2 bits (65), Expect = 9.3
 Identities = 15/69 (21%), Positives = 26/69 (37%)

Query: 457 NTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSE 516
           N V   +   D A   + D   E ++   ES+   +  D E       +   E +  + E
Sbjct: 155 NIVWDNDEDNDEAPPAQPDVDNEEEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEE 214

Query: 517 ESSDDDDDD 525
           E+   DD +
Sbjct: 215 EAEGSDDGE 223


>gnl|CDD|240413 PTZ00423, PTZ00423, glideosome-associated protein 45; Provisional.
          Length = 193

 Score = 30.1 bits (67), Expect = 4.4
 Identities = 30/112 (26%), Positives = 52/112 (46%), Gaps = 9/112 (8%)

Query: 429 MKNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQT--EAPADVAKVEKKDTPEESKDVSSE 486
           M N     K K  K +   + + ++K+   VE+   + P D+ + E +D PEE  +   E
Sbjct: 1   MGNYCRKNKAKEPKRRDIDELAEREKLKKEVEEIPEQKPEDIVE-ELEDQPEEPPEQEEE 59

Query: 487 SEEESDLSDVENDEVVDK--KEKN----EDSNNKSEESSDDDDDDWEKYQTG 532
           +EE+    +++     +K   EKN    E SN+     S   D+  +K +TG
Sbjct: 60  NEEQKPKEEIDYPIQENKSFDEKNLDDLERSNSDIYSESHKYDNASDKLETG 111


>gnl|CDD|191716 pfam07263, DMP1, Dentin matrix protein 1 (DMP1).  This family
           consists of several mammalian dentin matrix protein 1
           (DMP1) sequences. The dentin matrix acidic
           phosphoprotein 1 (DMP1) gene has been mapped to human
           chromosome 4q21. DMP1 is a bone and teeth specific
           protein initially identified from mineralised dentin.
           DMP1 is primarily localised in the nuclear compartment
           of undifferentiated osteoblasts. In the nucleus, DMP1
           acts as a transcriptional component for activation of
           osteoblast-specific genes like osteocalcin. During the
           early phase of osteoblast maturation, Ca(2+) surges into
           the nucleus from the cytoplasm, triggering the
           phosphorylation of DMP1 by a nuclear isoform of casein
           kinase II. This phosphorylated DMP1 is then exported out
           into the extracellular matrix, where it regulates
           nucleation of hydroxyapatite. DMP1 is a unique molecule
           that initiates osteoblast differentiation by
           transcription in the nucleus and orchestrates
           mineralised matrix formation extracellularly, at later
           stages of osteoblast maturation. The DMP1 gene has been
           found to be ectopically expressed in lung cancer
           although the reason for this is unknown.
          Length = 514

 Score = 30.4 bits (68), Expect = 4.5
 Identities = 19/61 (31%), Positives = 34/61 (55%), Gaps = 7/61 (11%)

Query: 472 EKKDTPEESKDV---SSESEEESDLSDVEND----EVVDKKEKNEDSNNKSEESSDDDDD 524
           E+  + E+S++V   SSES +E+DL   EN     E V  + + ++ +N +  S D +D 
Sbjct: 317 EENQSQEDSQEVQDPSSESSQEADLPSQENSSESQEEVVSESRGDNPDNTTSHSEDQEDS 376

Query: 525 D 525
           +
Sbjct: 377 E 377



 Score = 29.6 bits (66), Expect = 8.1
 Identities = 32/120 (26%), Positives = 53/120 (44%), Gaps = 9/120 (7%)

Query: 448 KSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVS-----SESEEESDLSDVENDEVV 502
           +SS +  + +    +E+  +V    + D P+ +   S     SES EE  L    + E  
Sbjct: 334 ESSQEADLPSQENSSESQEEVVSESRGDNPDNTTSHSEDQEDSESSEEDSLDTPSSSESQ 393

Query: 503 DKKEK----NEDSNNKSEESSDDDDDDWEKYQTGLNKRDKVLEGRSKQSHSVHCPRFPED 558
             +E+    + +S + SEES +  +D+    Q GL       E RS++S S    R  ED
Sbjct: 394 STEEQADSESNESLSSSEESPESTEDENSSSQEGLQSHSASTESRSQESQSEQDSRSEED 453


>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
          Length = 211

 Score = 29.7 bits (67), Expect = 4.7
 Identities = 14/69 (20%), Positives = 26/69 (37%)

Query: 478 EESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDDWEKYQTGLNKRD 537
           E         EE+    +   +E   K+E  E    + EE  +D ++  E     L   +
Sbjct: 4   ECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDEN 63

Query: 538 KVLEGRSKQ 546
             L+  +K+
Sbjct: 64  NKLKEENKK 72



 Score = 29.7 bits (67), Expect = 5.6
 Identities = 14/79 (17%), Positives = 28/79 (35%)

Query: 442 KNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEV 501
           + K  K  + ++      E  E      +  + +  E+ + +    E      +   DE 
Sbjct: 4   ECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDEN 63

Query: 502 VDKKEKNEDSNNKSEESSD 520
              KE+N+   N+ E   D
Sbjct: 64  NKLKEENKKLENELEALKD 82



 Score = 29.3 bits (66), Expect = 7.0
 Identities = 14/53 (26%), Positives = 25/53 (47%), Gaps = 1/53 (1%)

Query: 470 KVEKKDTPEESKDVSSESEEES-DLSDVENDEVVDKKEKNEDSNNKSEESSDD 521
           K  K +  EE     +E++EE     +    E ++K+E  EDS   +E   ++
Sbjct: 6   KDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEE 58


>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
           N-terminal.  SRP is a complex of six distinct
           polypeptides and a 7S RNA that is essential for
           transferring nascent polypeptide chains that are
           destined for export from the cell to the translocation
           apparatus of the endoplasmic reticulum (ER) membrane.
           SRP binds hydrophobic signal sequences as they emerge
           from the ribosome, and arrests translation.
          Length = 272

 Score = 30.1 bits (68), Expect = 4.8
 Identities = 15/93 (16%), Positives = 26/93 (27%), Gaps = 4/93 (4%)

Query: 432 KKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEES 491
           K     + F ++K  KK+          E         K ++  T  ES   S+ +    
Sbjct: 106 KSPKAMRTFEESKKSKKTVDSMIERKPKEPGLKRKQRKKAQESATSPESSPSSTPNSSRP 165

Query: 492 DLSDVENDEVV----DKKEKNEDSNNKSEESSD 520
               +   +       KK     S   S +   
Sbjct: 166 STPHLLKAKEGPSRRAKKAAKLSSTASSGDEKS 198


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 30.5 bits (69), Expect = 4.9
 Identities = 13/46 (28%), Positives = 26/46 (56%)

Query: 483 VSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDDWEK 528
           +  E E E ++ D E +E  ++K++ E+     EE  D++++  EK
Sbjct: 25  LWVEKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEK 70


>gnl|CDD|233148 TIGR00844, c_cpa1, na(+)/h(+) antiporter.  The Monovalent
           Cation:Proton Antiporter-1 (CPA1) Family (TC 2.A.36) The
           CPA1 family is a large family of proteins derived from
           Gram-positive and Gram-negative bacteria, blue green
           bacteria, yeast, plants and animals. Transporters from
           eukaryotes have been functionally characterized, and all
           of these catalyze Na+:H+ exchange. Their primary
           physiological functions may be in (1) cytoplasmic pH
           regulation, extruding the H+ generated during
           metabolism, and (2) salt tolerance (in plants), due to
           Na+ uptake into vacuoles. This model is specific for the
           fungal members of this family [Transport and binding
           proteins, Cations and iron carrying compounds].
          Length = 810

 Score = 30.7 bits (69), Expect = 4.9
 Identities = 22/98 (22%), Positives = 42/98 (42%), Gaps = 13/98 (13%)

Query: 429 MKNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKD---VSS 485
           ++++ G   +++  + H  K   +K+  + V   E  A    +E      E  +   + S
Sbjct: 723 IEDESGEVFRRYRISPHGGKRKIKKRNDSVVSVDEEKA----IEGPSRVPERGNHDLLHS 778

Query: 486 ESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDD 523
           E E   D ++ EN       +  EDS++ + ES D  D
Sbjct: 779 EDEMADDEAESENM------DDYEDSDDNAYESKDHAD 810


>gnl|CDD|221173 pfam11702, DUF3295, Protein of unknown function (DUF3295).  This
           family is conserved in fungi but the function is not
           known.
          Length = 509

 Score = 30.3 bits (68), Expect = 5.0
 Identities = 16/56 (28%), Positives = 23/56 (41%), Gaps = 2/56 (3%)

Query: 474 KDTPEESKDVSSESEEES--DLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDDWE 527
                  K  +S  E+       +  +D+  D  E  ED  ++S    DDDD DWE
Sbjct: 258 PSLMSPRKKTASFKEQVVTRTFPERTSDDDEDAIETEEDDVDESAIEDDDDDSDWE 313


>gnl|CDD|240370 PTZ00342, PTZ00342, acyl-CoA synthetase; Provisional.
          Length = 746

 Score = 30.5 bits (69), Expect = 5.4
 Identities = 16/74 (21%), Positives = 33/74 (44%), Gaps = 8/74 (10%)

Query: 475 DTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDD---DWEKYQT 531
           DT  +SK+++   EE++         V +   KN     K  + S++ +D      +Y  
Sbjct: 208 DTLIKSKEININKEEKN-----NGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEYDK 262

Query: 532 GLNKRDKVLEGRSK 545
              ++ K L+ ++K
Sbjct: 263 EKLEKIKDLKEKAK 276


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 30.4 bits (68), Expect = 5.5
 Identities = 19/76 (25%), Positives = 28/76 (36%), Gaps = 5/76 (6%)

Query: 730 PILPRPTPGA---PIMVAPTTSVQFLNGSKATQFVFVPANSLQP--VPIPPAKKRGYVKQ 784
           P  P PTP A   P   +P  +       + +    + A SL P  +P P    +     
Sbjct: 191 PSAPPPTPSAQAVPPQGSPIAAQPAPQPQQPSPLSLISAPSLHPQRLPSPHPPLQPQTAS 250

Query: 785 KPPPILPAPAPTVPTP 800
           +  P  PAP+   P  
Sbjct: 251 QQSPQPPAPSSRHPQS 266



 Score = 30.0 bits (67), Expect = 6.6
 Identities = 22/106 (20%), Positives = 33/106 (31%), Gaps = 21/106 (19%)

Query: 720 SESFSTTQERPILPRPTPGAPIMVAPTTSVQFLNGSKATQFVFVPAN----SLQPVPIPP 775
           +ES S +  +  L +P     I V P  ++             VP      + QP P P 
Sbjct: 160 NESDSDSSAQQQLLQPQGPPSIQVPPGAALAPSAPPPTPSAQAVPPQGSPIAAQPAPQPQ 219

Query: 776 AKKRGYVKQKP---PPILPAP--------------APTVPTPIAPE 804
                 +   P   P  LP+P               P  P+   P+
Sbjct: 220 QPSPLSLISAPSLHPQRLPSPHPPLQPQTASQQSPQPPAPSSRHPQ 265



 Score = 30.0 bits (67), Expect = 7.1
 Identities = 16/71 (22%), Positives = 22/71 (30%), Gaps = 3/71 (4%)

Query: 733 PRPTPGAPIMVAPTTSVQFLNGSKATQFVFVPANSLQPVPIPPAKKRGYVKQKPPPILPA 792
            +P    P      T  Q L    +T      +      P  P  +  +     P I P 
Sbjct: 412 SQPLQSVPAQPPVLTQSQSLPPKASTH---PHSGLHSGPPQSPFAQHPFTSGGLPAIGPP 468

Query: 793 PAPTVPTPIAP 803
           P+    TP AP
Sbjct: 469 PSLPTSTPAAP 479


>gnl|CDD|222095 pfam13388, DUF4106, Protein of unknown function (DUF4106).  This
           family of proteins are found in large numbers in the
           Trichomonas vaginalis proteome. The function of this
           protein is unknown.
          Length = 422

 Score = 30.0 bits (67), Expect = 5.6
 Identities = 21/77 (27%), Positives = 30/77 (38%), Gaps = 3/77 (3%)

Query: 730 PILPR--PTPGAPIMVAPTTSVQFLNGSKATQFVFVPANSLQPVPIPPAKKRGYVKQKPP 787
           P  PR  P PG P     +   +  +  K TQ   V   + QP    PA++    + +  
Sbjct: 174 PNPPREAPAPGLPKTFTSSHGHRHRHAPKPTQQPTVQNPAQQPTVQNPAQQPQ-QQPQQQ 232

Query: 788 PILPAPAPTVPTPIAPE 804
           P+ PA  PT   P    
Sbjct: 233 PVQPAQQPTPQNPAQQP 249


>gnl|CDD|216161 pfam00865, Osteopontin, Osteopontin. 
          Length = 293

 Score = 30.1 bits (67), Expect = 5.6
 Identities = 24/95 (25%), Positives = 45/95 (47%), Gaps = 5/95 (5%)

Query: 434 GGKKKKFTKNKHDKKSSGQKKITNTVEQTE-APADVAKVEKKDTPEESK--DVSSESEEE 490
           G  ++K   NKH    +   K   + +Q   AP +    E+ D  ++      S+ES E 
Sbjct: 6   GSSEEKQLYNKHPDAVATWLKPDPSQKQNLLAPQNSVSSEETDDLKQETLPSKSNESHEH 65

Query: 491 SDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDD 525
            D  D ++D+  D  +  + ++N+S+++   DD D
Sbjct: 66  MD--DDDDDDDDDHVDSQDSNSNESDDADHTDDSD 98


>gnl|CDD|235033 PRK02363, PRK02363, DNA-directed RNA polymerase subunit delta;
           Reviewed.
          Length = 129

 Score = 28.8 bits (65), Expect = 5.7
 Identities = 14/56 (25%), Positives = 25/56 (44%), Gaps = 4/56 (7%)

Query: 472 EKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDDWE 527
           E+    EE  D   +   + D   +++D + D     ED +    E  D+D++D E
Sbjct: 78  EEIIPLEEKFDKKKKKFMDGDDDIIDDDILPDDDFDEEDLD----EEDDEDEEDEE 129



 Score = 28.4 bits (64), Expect = 8.9
 Identities = 8/58 (13%), Positives = 31/58 (53%)

Query: 467 DVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDD 524
            V +++++  P E K    + +      D+ +D+++   + +E+  ++ ++  ++D++
Sbjct: 72  PVDEIDEEIIPLEEKFDKKKKKFMDGDDDIIDDDILPDDDFDEEDLDEEDDEDEEDEE 129


>gnl|CDD|217469 pfam03276, Gag_spuma, Spumavirus gag protein. 
          Length = 582

 Score = 30.3 bits (68), Expect = 5.9
 Identities = 18/100 (18%), Positives = 29/100 (29%), Gaps = 3/100 (3%)

Query: 707 LRDLCSVGRRRVLSESF---STTQERPILPRPTPGAPIMVAPTTSVQFLNGSKATQFVFV 763
           LR++    +      +            LP   PG+  +    +S       + +   F+
Sbjct: 166 LREMLVELQIGGRGGNIPGAIQPPPPSSLPGLPPGSSSLAPSASSTPGNRLPRVSFNPFL 225

Query: 764 PANSLQPVPIPPAKKRGYVKQKPPPILPAPAPTVPTPIAP 803
           P  S      PPA             +  P    P PI P
Sbjct: 226 PGPSPAQPSAPPASIPAPPIPPVIQYVAPPPVPPPQPIIP 265


>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein.  This is a family of
           fungal and plant proteins and contains many hypothetical
           proteins. VID27 is a cytoplasmic protein that plays a
           potential role in vacuolar protein degradation.
          Length = 794

 Score = 30.1 bits (68), Expect = 6.9
 Identities = 17/53 (32%), Positives = 22/53 (41%), Gaps = 3/53 (5%)

Query: 471 VEKKDTPEESKDVSSESEEESDLSDVEND---EVVDKKEKNEDSNNKSEESSD 520
           +E +D   E  D   E EEE +  D +     E  D +E  ED      E SD
Sbjct: 379 LEIEDANTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVESKYEDSD 431


>gnl|CDD|191249 pfam05279, Asp-B-Hydro_N, Aspartyl beta-hydroxylase N-terminal
           region.  This family includes the N-terminal regions of
           the junctin, junctate and aspartyl beta-hydroxylase
           proteins. Junctate is an integral ER/SR membrane calcium
           binding protein, which comes from an alternatively
           spliced form of the same gene that generates aspartyl
           beta-hydroxylase and junctin. Aspartyl beta-hydroxylase
           catalyzes the post-translational hydroxylation of
           aspartic acid or asparagine residues contained within
           epidermal growth factor (EGF) domains of proteins.
          Length = 240

 Score = 29.5 bits (66), Expect = 7.0
 Identities = 19/134 (14%), Positives = 44/134 (32%), Gaps = 1/134 (0%)

Query: 384 DTIKVQFLEPSKEGEGEIEEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKN 443
              K +  E  KE    + E+    +        KE Q       +      +++     
Sbjct: 95  RKTKQKVEEEVKEQLQSLLEKIV-VSKQEEDGPGKEPQLDEDKFLLAEDSDDRQETLEAG 153

Query: 444 KHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVD 503
           K  +++     +  T  +        K  +++  +  + V      +++  DV  ++  +
Sbjct: 154 KVHEETEDSYHVEETASEQYKQDMKEKASEQENEDSKEPVEKAERTKAETDDVTEEDYDE 213

Query: 504 KKEKNEDSNNKSEE 517
           +    EDS    EE
Sbjct: 214 EDNPVEDSKAIKEE 227


>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
           [Intracellular trafficking and secretion].
          Length = 776

 Score = 30.0 bits (67), Expect = 7.5
 Identities = 11/60 (18%), Positives = 23/60 (38%), Gaps = 3/60 (5%)

Query: 468 VAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKE---KNEDSNNKSEESSDDDDD 524
           +   +  +   E KD   +S         ++    +K E   +N + +   EE  D +D+
Sbjct: 336 LNNEKWGNEEAERKDYILDSSSVPLEKQFDDILYFEKMEIENRNPEESEHEEEVEDYEDE 395


>gnl|CDD|131353 TIGR02300, FYDLN_acid, TIGR02300 family protein.  Members of this
           family are bacterial proteins with a conserved motif
           [KR]FYDLN, sometimes flanked by a pair of CXXC motifs,
           followed by a long region of low complexity sequence in
           which roughly half the residues are Asp and Glu,
           including multiple runs of five or more acidic residues.
           The function of members of this family is unknown.
          Length = 129

 Score = 28.4 bits (63), Expect = 8.1
 Identities = 14/76 (18%), Positives = 28/76 (36%), Gaps = 4/76 (5%)

Query: 456 TNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEE----SDLSDVENDEVVDKKEKNEDS 511
               +  +    V +V+ +           E++EE    +      +D   D+   ++D 
Sbjct: 46  RVRADDEKEAVAVKEVDPEIDEAAELVSLEEADEEVDPGAPTPAGGDDLAEDEDIADDDD 105

Query: 512 NNKSEESSDDDDDDWE 527
           +    E  +DDD D E
Sbjct: 106 DVTFLEDEEDDDIDDE 121


>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein.  This family includes proteins
           related to Mpp10 (M phase phosphoprotein 10). The U3
           small nucleolar ribonucleoprotein (snoRNP) is required
           for three cleavage events that generate the mature 18S
           rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
           depletion of Mpp10, a U3 snoRNP-specific protein, halts
           18S rRNA production and impairs cleavage at the three U3
           snoRNP-dependent sites.
          Length = 613

 Score = 29.6 bits (66), Expect = 8.3
 Identities = 23/114 (20%), Positives = 45/114 (39%), Gaps = 6/114 (5%)

Query: 430 KNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEE 489
             +   + + F     D K   +KK      + E      +  KK+   + ++   E +E
Sbjct: 247 DEEGNIEYEDF----FDPKEKDKKKDAGDDAELEDDEPDKEAVKKEADSKPEEEDEEDDE 302

Query: 490 ESDLSDVE--NDEVVDKKEKNEDSNNKSEESSDDDDDDWEKYQTGLNKRDKVLE 541
           + D  D E   +  +DK + +E      +  S  +   +EK Q  L ++ + LE
Sbjct: 303 QEDDQDEEEPPEAAMDKVKLDEPVLEGVDLESPKELSSFEKRQAKLKQQIEQLE 356


>gnl|CDD|237865 PRK14951, PRK14951, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 618

 Score = 29.7 bits (67), Expect = 8.3
 Identities = 18/76 (23%), Positives = 21/76 (27%), Gaps = 9/76 (11%)

Query: 730 PILPRPTPGAPIMVAPTTSVQFLNGSKATQFVFVPANSLQPVPIPPAKKRGYVKQKPPPI 789
           P  P     A   VA   +        A       A +  P   PPA         P   
Sbjct: 381 PARPEAAAPAAAPVAQAAAAP---APAAAPAAAASAPAAPPAAAPPAPV-----AAPAAA 432

Query: 790 LPAPAPT-VPTPIAPE 804
            PA AP   P  +A  
Sbjct: 433 APAAAPAAAPAAVALA 448


>gnl|CDD|221403 pfam12064, DUF3544, Domain of unknown function (DUF3544).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is typically
           between 198 to 216 amino acids in length. This domain is
           found associated with pfam00628, pfam01753, pfam00439,
           pfam00855.
          Length = 207

 Score = 29.2 bits (65), Expect = 8.5
 Identities = 24/79 (30%), Positives = 37/79 (46%), Gaps = 3/79 (3%)

Query: 441 TKNKHDKKSSGQKKITNTVEQTEAPADVAKV-EKKDTPEESKDVSSESEEESDLSDVEND 499
           T  K   +S  Q  +  T+E  +A   + ++ E   T  E  D  SE  E+SD SD E  
Sbjct: 131 TSPKRQIRSRFQLNLDKTIESCKAQLGINEISEDAYTGVEHSD--SEDSEKSDSSDSEYS 188

Query: 500 EVVDKKEKNEDSNNKSEES 518
              ++K KNE  +   +E+
Sbjct: 189 SDDEQKAKNEQEDASDKEA 207


>gnl|CDD|177433 PHA02608, 67, prohead core protein; Provisional.
          Length = 80

 Score = 27.4 bits (61), Expect = 9.0
 Identities = 15/40 (37%), Positives = 21/40 (52%), Gaps = 6/40 (15%)

Query: 486 ESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDD 525
           E EE  D  D E+D+  D K+  +D      +  DDD+DD
Sbjct: 46  EGEEPEDDDDDEDDDDDDDKDDKDD------DDDDDDEDD 79


>gnl|CDD|234055 TIGR02907, spore_VI_D, stage VI sporulation protein D.  SpoVID, the
           stage VI sporulation protein D, is restricted to
           endospore-forming members of the bacteria, all of which
           are found among the Firmicutes. It is widely distributed
           but not quite universal in this group. Between
           well-conserved N-terminal and C-terminal domains is a
           poorly conserved, low-complexity region of variable
           length, rich enough in glutamic acid to cause spurious
           BLAST search results unless a filter is used. The seed
           alignment for this model was trimmed, in effect, by
           choosing member sequences in which these regions are
           relatively short. SpoVID is involved in spore coat
           assembly by the mother cell compartment late in the
           process of sporulation [Cellular processes, Sporulation
           and germination].
          Length = 338

 Score = 29.5 bits (66), Expect = 9.0
 Identities = 14/75 (18%), Positives = 38/75 (50%), Gaps = 4/75 (5%)

Query: 455 ITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNK 514
           I+   ++    A+ A+ ++++    S +    ++EE+       +E  D+ +   +++ +
Sbjct: 133 ISGIQQENNLDAEPAREDEEEEESFSAEFEHPAQEETA----GEEERTDEPKVEHEAHEQ 188

Query: 515 SEESSDDDDDDWEKY 529
            E+ +DDD D+W+  
Sbjct: 189 HEQPADDDPDEWKIS 203


>gnl|CDD|200340 TIGR03927, T7SS_EssA_Firm, type VII secretion protein EssA.
           Members of this family are associated with type VII
           secretion of WXG100 family targets in the Firmicutes,
           but not in the Actinobacteria. This highly divergent
           protein family consists largely of a central region of
           highly polar low-complexity sequence containing
           occasional LF motifs in weak repeats about 17 residues
           in length, flanked by hydrophobic N- and C-terminal
           regions [Protein fate, Protein and peptide secretion and
           trafficking].
          Length = 150

 Score = 28.5 bits (64), Expect = 9.1
 Identities = 17/58 (29%), Positives = 27/58 (46%), Gaps = 3/58 (5%)

Query: 437 KKKFTKNKHDKKSSGQKKITNTV--EQTEAPADVAKVEKKDTPEESKDVSSESEEESD 492
           +KK T  KH +K   Q+++ N +  E       V   +K+    E +  SS SE  S+
Sbjct: 59  QKKITFQKHKEK-PEQEELKNQLFSENATENNTVKATKKQLFSSEYEQTSSSSESTSE 115


>gnl|CDD|113815 pfam05058, ActA, ActA Protein.  The ActA family is found in
           Listeria and is associated with motility. ActA protein
           acts as a scaffold to assemble and activate host cell
           actin cytoskeletal factors at the bacterial surface,
           resulting in directional actin polymerisation and
           propulsion of the bacterium through the cytoplasm of the
           host cell.
          Length = 601

 Score = 29.4 bits (65), Expect = 9.5
 Identities = 28/124 (22%), Positives = 56/124 (45%), Gaps = 13/124 (10%)

Query: 367 TVTCTLMRKPMSVLFGDDTIKVQFLEPSKEGEGEIEEEEEKPNGTITPVVNKEKQQTHRP 426
           T T T+++K   V       K+  L  +K  E  I E +        P + K+ +   +P
Sbjct: 454 TTTKTVLKKITPV---KIAPKLAELPATKPQETAIGENK-------APFIEKQAETNKQP 503

Query: 427 VWMKNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSE 486
           + M +    +K+   ++K + K   ++K+   VE++E   D    ++    EE K ++  
Sbjct: 504 IDMPSLPVIQKEATERDKEEMKPQTEEKM---VEESEPANDANGKKRSAGIEEGKLIAKS 560

Query: 487 SEEE 490
           +E+E
Sbjct: 561 AEDE 564


>gnl|CDD|171499 PRK12438, PRK12438, hypothetical protein; Provisional.
          Length = 991

 Score = 29.4 bits (66), Expect = 9.9
 Identities = 21/78 (26%), Positives = 27/78 (34%), Gaps = 19/78 (24%)

Query: 733 PRPTPGAPIMVAPTTSVQFLN-------GSKATQFVFVPANSLQPVPIPPAKKRGYVKQK 785
           PR   G  +  APT + + L+       G  AT      A++  P   PPA         
Sbjct: 873 PRTEGGVRVGYAPTLA-EALDQVFGPGTGRVATAPGGDAASAPPPGAGPPA--------- 922

Query: 786 PPPILPAPAPTVPTPIAP 803
           PP     P P    P A 
Sbjct: 923 PPQ--AVPPPRTTQPPAA 938


>gnl|CDD|131515 TIGR02462, pyranose_ox, pyranose oxidase.  Pyranose oxidase (also
           called glucose 2-oxidase) converts D-glucose and
           molecular oxygen to 2-dehydro-D-glucose and hydrogen
           peroxide. Peroxide production is believed to be
           important to the wood rot fungi in which this enzyme is
           found for lignin degradation.
          Length = 544

 Score = 29.4 bits (66), Expect = 9.9
 Identities = 24/102 (23%), Positives = 37/102 (36%), Gaps = 6/102 (5%)

Query: 712 SVGRRRVLSESFSTTQERPILPRPTPGAPIMVAPTT--SVQFLNGSKATQFVFVPANSLQ 769
           +V   ++L  S      RP    P P  P +    T  S+ F     +T+ V    +  +
Sbjct: 277 AVHNPQILVNSGFGQLGRPDPTNPPPLLPSLGRYITEQSMTFCQIVLSTELVDSVRSDPR 336

Query: 770 PVPIPPAKKRGYVKQKPPPILPAP----APTVPTPIAPEFLW 807
            +     K   ++ + P   LP P     P V TP   E  W
Sbjct: 337 GLDWWKEKVANHMMKHPEDPLPIPFRDPEPQVTTPFTEEHPW 378


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.315    0.132    0.384 

Gapped
Lambda     K      H
   0.267   0.0705    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 45,948,009
Number of extensions: 4577789
Number of successful extensions: 7245
Number of sequences better than 10.0: 1
Number of HSP's gapped: 6415
Number of HSP's successfully gapped: 351
Length of query: 904
Length of database: 10,937,602
Length adjustment: 106
Effective length of query: 798
Effective length of database: 6,236,078
Effective search space: 4976390244
Effective search space used: 4976390244
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.5 bits)
S2: 64 (28.5 bits)