RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy15183
         (1973 letters)



>gnl|CDD|99850 cd06147, Rrp6p_like_exo, DEDDy 3'-5' exonuclease domain of yeast
           Rrp6p, human polymyositis/scleroderma autoantigen
           100kDa, and similar proteins.  Yeast Rrp6p and its human
           homolog, the polymyositis/scleroderma autoantigen 100kDa
           (PM/Scl-100), are exosome-associated proteins involved
           in the degradation and processing of precursors to
           stable RNAs. Both proteins contain a DEDDy-type
           DnaQ-like 3'-5' exonuclease domain possessing three
           conserved sequence motifs termed ExoI, ExoII and ExoIII,
           with a specific YX(3)D pattern at ExoIII. The motifs are
           clustered around the active site and contain four
           conserved acidic residues that serve as ligands for the
           two metal ions required for catalysis. PM/Scl-100, an
           autoantigen present in the nucleolar compartment of the
           cell, reacts with autoantibodies produced by about 50%
           of patients with polymyositis-scleroderma overlap
           syndrome.
          Length = 192

 Score =  328 bits (844), Expect = e-103
 Identities = 118/192 (61%), Positives = 149/192 (77%), Gaps = 1/192 (0%)

Query: 317 SDTPLMMITEPEQVTQLVSELKQQQEIAIDLEYHNYRSYQGYTCLMQISTRDKDYIVDTL 376
            +TPL  +   E++ +LV +LK  +EIA+DLE+H+YRSY G+TCLMQISTR++DYIVDTL
Sbjct: 1   DETPLTFVDTEEKLEELVEKLKNCKEIAVDLEHHSYRSYLGFTCLMQISTREEDYIVDTL 60

Query: 377 KLREDLEVLNEVLTDKNIVKVFHGADSDIKWLQKDFGLYVVGMFDTHQACKFLPMPRQSL 436
           KLR+D+ +LNEV TD NI+KVFHGADSDI WLQ+DFGLYVV +FDT QA + L +PR SL
Sbjct: 61  KLRDDMHILNEVFTDPNILKVFHGADSDIIWLQRDFGLYVVNLFDTGQAARVLNLPRHSL 120

Query: 437 AYLLKHYCDVDSDKTFQLFDWRHRPLPEPAIQYARTDTHYLLYVYDCMKLDL-SAAAHGK 495
           AYLL+ YC+VD+DK +QL DWR RPLPE  I+YAR DTHYLLY+YD ++ +L   A    
Sbjct: 121 AYLLQKYCNVDADKKYQLADWRIRPLPEEMIKYAREDTHYLLYIYDRLRNELLERANALA 180

Query: 496 QNLVLSTFTNSR 507
            NL+ S    SR
Sbjct: 181 PNLLESVLNCSR 192



 Score =  328 bits (844), Expect = e-103
 Identities = 118/192 (61%), Positives = 149/192 (77%), Gaps = 1/192 (0%)

Query: 1762 SDTPLMMITEPEQVTQLVSELKQQQEIAIDLEYHNYRSYQGYTCLMQISTRDKDYIVDTL 1821
             +TPL  +   E++ +LV +LK  +EIA+DLE+H+YRSY G+TCLMQISTR++DYIVDTL
Sbjct: 1    DETPLTFVDTEEKLEELVEKLKNCKEIAVDLEHHSYRSYLGFTCLMQISTREEDYIVDTL 60

Query: 1822 KLREDLEVLNEVLTDKNIVKVFHGADSDIKWLQKDFGLYVVGMFDTHQACKFLPMPRQSL 1881
            KLR+D+ +LNEV TD NI+KVFHGADSDI WLQ+DFGLYVV +FDT QA + L +PR SL
Sbjct: 61   KLRDDMHILNEVFTDPNILKVFHGADSDIIWLQRDFGLYVVNLFDTGQAARVLNLPRHSL 120

Query: 1882 AYLLKHYCDVDSDKTFQLFDWRHRPLPEPAIQYARTDTHYLLYVYDCMKLDL-SAAAHGK 1940
            AYLL+ YC+VD+DK +QL DWR RPLPE  I+YAR DTHYLLY+YD ++ +L   A    
Sbjct: 121  AYLLQKYCNVDADKKYQLADWRIRPLPEEMIKYAREDTHYLLYIYDRLRNELLERANALA 180

Query: 1941 QNLVLSTFTNSR 1952
             NL+ S    SR
Sbjct: 181  PNLLESVLNCSR 192


>gnl|CDD|223426 COG0349, Rnd, Ribonuclease D [Translation, ribosomal structure and
           biogenesis].
          Length = 361

 Score =  194 bits (496), Expect = 2e-54
 Identities = 86/287 (29%), Positives = 141/287 (49%), Gaps = 9/287 (3%)

Query: 324 ITEPEQVTQLVSELKQQQEIAIDLEYHNYRSYQGYTCLMQISTRDKDYIVDTLKLREDLE 383
           IT  + +    + L+  + IAID E+   R+Y    CL+QIS  +   ++D L    DL 
Sbjct: 1   ITTGDLLAAACALLRGSKAIAIDTEFMRLRTYYPRLCLIQISDGEGASLIDPLAGILDLP 60

Query: 384 VLNEVLTDKNIVKVFHGADSDIKWLQKDFGLYVVGMFDTHQACKFLPMP-RQSLAYLLKH 442
            L  +L D N+VK+FH A  D++ L   FGL    +FDT  A K         LA L++ 
Sbjct: 61  PLVALLADPNVVKIFHAARFDLEVLLNLFGLLPTPLFDTQIAAKLAGFGTSHGLADLVEE 120

Query: 443 YCDVDSDKTFQLFDWRHRPLPEPAIQYARTDTHYLLYVYDCMKLDLSAAAHGKQNLVLST 502
              V+ DK+ Q  DW  RPL E  ++YA  D  YLL +YD +  +L+    G+       
Sbjct: 121 LLGVELDKSEQRSDWLARPLSEAQLEYAAADVEYLLPLYDKLTEELARE--GRLEWAEDE 178

Query: 503 FTNSRNICKLKYEKPVFNEEGYMNIFRSHALLNNQQKYALRELYKWRDRIARDKDESTGY 562
           F     +   +    V  E+ +  I  +H  L+ ++   LREL  WR+R AR++D +  +
Sbjct: 179 FR----LLPTRRTYKVLPEDAWREIKIAH-SLDPRELAVLRELAAWREREARERDLARNF 233

Query: 563 VLPNHMLLQMAQSIPRDIQGIFACCNPVPQTVKEHVLDIHAIILKAR 609
           VL +  L ++A+  P++++        +P+  + H   + A++  A 
Sbjct: 234 VLKDEALWELARYTPKNLKE-LDALGLIPKERRRHGKLLLALLANAL 279



 Score =  155 bits (395), Expect = 5e-41
 Identities = 62/182 (34%), Positives = 90/182 (49%), Gaps = 3/182 (1%)

Query: 1769 ITEPEQVTQLVSELKQQQEIAIDLEYHNYRSYQGYTCLMQISTRDKDYIVDTLKLREDLE 1828
            IT  + +    + L+  + IAID E+   R+Y    CL+QIS  +   ++D L    DL 
Sbjct: 1    ITTGDLLAAACALLRGSKAIAIDTEFMRLRTYYPRLCLIQISDGEGASLIDPLAGILDLP 60

Query: 1829 VLNEVLTDKNIVKVFHGADSDIKWLQKDFGLYVVGMFDTHQACKFLPMP-RQSLAYLLKH 1887
             L  +L D N+VK+FH A  D++ L   FGL    +FDT  A K         LA L++ 
Sbjct: 61   PLVALLADPNVVKIFHAARFDLEVLLNLFGLLPTPLFDTQIAAKLAGFGTSHGLADLVEE 120

Query: 1888 YCDVDSDKTFQLFDWRHRPLPEPAIQYARTDTHYLLYVYDCMKLDLSAAAHGKQNLVLST 1947
               V+ DK+ Q  DW  RPL E  ++YA  D  YLL +YD +  +L+    G+       
Sbjct: 121  LLGVELDKSEQRSDWLARPLSEAQLEYAAADVEYLLPLYDKLTEELARE--GRLEWAEDE 178

Query: 1948 FT 1949
            F 
Sbjct: 179  FR 180


>gnl|CDD|176654 cd06142, RNaseD_exo, DEDDy 3'-5' exonuclease domain of Ribonuclease
           D and similar proteins.  Ribonuclease (RNase) D is a
           bacterial enzyme involved in the maturation of small
           stable RNAs and the 3' maturation of tRNA. It contains a
           DEDDy-type DnaQ-like 3'-5' exonuclease domain possessing
           three conserved sequence motifs termed ExoI, ExoII and
           ExoIII, with a specific YX(3)D pattern at ExoIII. These
           motifs are clustered around the active site and contain
           four conserved acidic residues that serve as ligands for
           the two metal ions required for catalysis. In vivo,
           RNase D only becomes essential upon removal of other
           ribonucleases. Eukaryotic RNase D homologs include yeast
           Rrp6p, human PM/Scl-100, and the Drosophila melanogaster
           egalitarian protein.
          Length = 178

 Score =  180 bits (458), Expect = 1e-51
 Identities = 61/164 (37%), Positives = 90/164 (54%), Gaps = 2/164 (1%)

Query: 329 QVTQLVSELKQQQEIAIDLEYHNYRSYQGYTCLMQISTRDKDYIVDTLKLREDLEVLNEV 388
           ++  L   L     IA+D E+    +Y    CL+QIST  + Y++D L +  DL  L E+
Sbjct: 1   ELEDLCERLASAGVIAVDTEFMRLNTYYPRLCLIQISTGGEVYLIDPLAI-GDLSPLKEL 59

Query: 389 LTDKNIVKVFHGADSDIKWLQKDFGLYVVGMFDTHQACKFLPMPRQ-SLAYLLKHYCDVD 447
           L D NIVKVFH A  D++ L++DFG+    +FDT  A + L +     LA L++    V+
Sbjct: 60  LADPNIVKVFHAAREDLELLKRDFGILPQNLFDTQIAARLLGLGDSVGLAALVEELLGVE 119

Query: 448 SDKTFQLFDWRHRPLPEPAIQYARTDTHYLLYVYDCMKLDLSAA 491
            DK  Q  DW  RPL +  ++YA  D  YLL +Y+ +K +L   
Sbjct: 120 LDKGEQRSDWSKRPLTDEQLEYAALDVRYLLPLYEKLKEELEEE 163



 Score =  180 bits (458), Expect = 1e-51
 Identities = 61/164 (37%), Positives = 90/164 (54%), Gaps = 2/164 (1%)

Query: 1774 QVTQLVSELKQQQEIAIDLEYHNYRSYQGYTCLMQISTRDKDYIVDTLKLREDLEVLNEV 1833
            ++  L   L     IA+D E+    +Y    CL+QIST  + Y++D L +  DL  L E+
Sbjct: 1    ELEDLCERLASAGVIAVDTEFMRLNTYYPRLCLIQISTGGEVYLIDPLAI-GDLSPLKEL 59

Query: 1834 LTDKNIVKVFHGADSDIKWLQKDFGLYVVGMFDTHQACKFLPMPRQ-SLAYLLKHYCDVD 1892
            L D NIVKVFH A  D++ L++DFG+    +FDT  A + L +     LA L++    V+
Sbjct: 60   LADPNIVKVFHAAREDLELLKRDFGILPQNLFDTQIAARLLGLGDSVGLAALVEELLGVE 119

Query: 1893 SDKTFQLFDWRHRPLPEPAIQYARTDTHYLLYVYDCMKLDLSAA 1936
             DK  Q  DW  RPL +  ++YA  D  YLL +Y+ +K +L   
Sbjct: 120  LDKGEQRSDWSKRPLTDEQLEYAALDVRYLLPLYEKLKEELEEE 163


>gnl|CDD|216604 pfam01612, DNA_pol_A_exo1, 3'-5' exonuclease.  This domain is
           responsible for the 3'-5' exonuclease proofreading
           activity of E. coli DNA polymerase I (polI) and other
           enzymes, it catalyzes the hydrolysis of unpaired or
           mismatched nucleotides. This domain consists of the
           amino-terminal half of the Klenow fragment in E. coli
           polI it is also found in the Werner syndrome helicase
           (WRN), focus forming activity 1 protein (FFA-1) and
           ribonuclease D (RNase D). Werner syndrome is a human
           genetic disorder causing premature aging; the WRN
           protein has helicase activity in the 3'-5' direction.
           The FFA-1 protein is required for formation of a
           replication foci and also has helicase activity; it is a
           homologue of the WRN protein. RNase D is a 3'-5'
           exonuclease involved in tRNA processing. Also found in
           this family is the autoantigen PM/Scl thought to be
           involved in polymyositis-scleroderma overlap syndrome.
          Length = 172

 Score =  174 bits (442), Expect = 1e-49
 Identities = 67/169 (39%), Positives = 98/169 (57%), Gaps = 3/169 (1%)

Query: 323 MITEPEQVTQLVSELKQQQEIAIDLEYHNYRSYQGY--TCLMQISTRDKDYIVDTLKLRE 380
           ++T  +++  L+ EL   + +A+D E  +  +Y      CL+QIST +  YI+D L L +
Sbjct: 3   IVTTEDELEDLIEELLNAKVVAVDTETTSLDTYSYTLVGCLIQISTGEGAYIIDPLALGD 62

Query: 381 DLEVLNEVLTDKNIVKVFHGADSDIKWLQKDFGLYVVGMFDTHQACKFLPMPRQ-SLAYL 439
           DLE L  +L D NI KV H A  D++ L +DFG+ +  +FDT  A   L  PR  SL  L
Sbjct: 63  DLEGLKRLLEDPNITKVGHNAKFDLEVLARDFGIKLENLFDTMLAAYLLGYPRSHSLDDL 122

Query: 440 LKHYCDVDSDKTFQLFDWRHRPLPEPAIQYARTDTHYLLYVYDCMKLDL 488
            + Y  V+ DK+ Q  DWR RPL E  ++YA  D  YLL +YD ++ +L
Sbjct: 123 AEKYLGVELDKSEQCADWRARPLSEEQLRYAAEDADYLLRLYDKLRKEL 171



 Score =  174 bits (442), Expect = 1e-49
 Identities = 67/169 (39%), Positives = 98/169 (57%), Gaps = 3/169 (1%)

Query: 1768 MITEPEQVTQLVSELKQQQEIAIDLEYHNYRSYQGY--TCLMQISTRDKDYIVDTLKLRE 1825
            ++T  +++  L+ EL   + +A+D E  +  +Y      CL+QIST +  YI+D L L +
Sbjct: 3    IVTTEDELEDLIEELLNAKVVAVDTETTSLDTYSYTLVGCLIQISTGEGAYIIDPLALGD 62

Query: 1826 DLEVLNEVLTDKNIVKVFHGADSDIKWLQKDFGLYVVGMFDTHQACKFLPMPRQ-SLAYL 1884
            DLE L  +L D NI KV H A  D++ L +DFG+ +  +FDT  A   L  PR  SL  L
Sbjct: 63   DLEGLKRLLEDPNITKVGHNAKFDLEVLARDFGIKLENLFDTMLAAYLLGYPRSHSLDDL 122

Query: 1885 LKHYCDVDSDKTFQLFDWRHRPLPEPAIQYARTDTHYLLYVYDCMKLDL 1933
             + Y  V+ DK+ Q  DWR RPL E  ++YA  D  YLL +YD ++ +L
Sbjct: 123  AEKYLGVELDKSEQCADWRARPLSEEQLRYAAEDADYLLRLYDKLRKEL 171


>gnl|CDD|176650 cd06129, RNaseD_like, DEDDy 3'-5' exonuclease domain of RNase D,
           WRN, and similar proteins.  The RNase D-like group is
           composed of RNase D, WRN, and similar proteins. They
           contain a DEDDy-type, DnaQ-like, 3'-5' exonuclease
           domain that contains three conserved sequence motifs
           termed ExoI, ExoII and ExoIII, with a specific YX(3)D
           pattern at ExoIII. These motifs are clustered around the
           active site and contain four conserved acidic residues
           that serve as ligands for the two metal ions required
           for catalysis. RNase D is involved in the 3'-end
           processing of tRNA precursors. RNase D-like proteins in
           eukaryotes include yeast Rrp6p, human PM/Scl-100 and
           Drosophila melanogaster egalitarian (Egl) protein. WRN
           is a unique DNA helicase possessing exonuclease
           activity. Mutation in the WRN gene is implicated in
           Werner syndrome, a disease associated with premature
           aging and increased predisposition to cancer. Yeast
           Rrp6p and the human Polymyositis/scleroderma autoantigen
           100kDa (PM/Scl-100) are exosome-associated proteins
           involved in the degradation and processing of precursors
           to stable RNAs. Egl is a component of an mRNA-binding
           complex which is required for oocyte specification. The
           Egl subfamily does not possess a completely conserved
           YX(3)D pattern at the ExoIII motif.
          Length = 161

 Score =  155 bits (394), Expect = 2e-43
 Identities = 52/146 (35%), Positives = 77/146 (52%), Gaps = 2/146 (1%)

Query: 342 EIAIDLEYHNYRSYQGYTCLMQISTRD-KDYIVDTLKLREDLEVLNEVLTDKNIVKVFHG 400
            IA D+E+   R Y G   L+Q+   + K Y+ D L L  D + L  +L + +IVK  HG
Sbjct: 15  VIAFDMEWPPGRRYYGEVALIQLCVSEEKCYLFDPLSLSVDWQGLKMLLENPSIVKALHG 74

Query: 401 ADSDIKWLQKDFGLYVVGMFDTHQACKFLPMP-RQSLAYLLKHYCDVDSDKTFQLFDWRH 459
            + D+  L +DFG  +  +FDT  A     +P R SLA L++H+     DK+    DW +
Sbjct: 75  IEGDLWKLLRDFGEKLQRLFDTTIAANLKGLPERWSLASLVEHFLGKTLDKSISCADWSY 134

Query: 460 RPLPEPAIQYARTDTHYLLYVYDCMK 485
           RPL E    YA  D + LL +Y  ++
Sbjct: 135 RPLTEDQKLYAAADVYALLIIYTKLR 160



 Score =  155 bits (394), Expect = 2e-43
 Identities = 52/146 (35%), Positives = 77/146 (52%), Gaps = 2/146 (1%)

Query: 1787 EIAIDLEYHNYRSYQGYTCLMQISTRD-KDYIVDTLKLREDLEVLNEVLTDKNIVKVFHG 1845
             IA D+E+   R Y G   L+Q+   + K Y+ D L L  D + L  +L + +IVK  HG
Sbjct: 15   VIAFDMEWPPGRRYYGEVALIQLCVSEEKCYLFDPLSLSVDWQGLKMLLENPSIVKALHG 74

Query: 1846 ADSDIKWLQKDFGLYVVGMFDTHQACKFLPMP-RQSLAYLLKHYCDVDSDKTFQLFDWRH 1904
             + D+  L +DFG  +  +FDT  A     +P R SLA L++H+     DK+    DW +
Sbjct: 75   IEGDLWKLLRDFGEKLQRLFDTTIAANLKGLPERWSLASLVEHFLGKTLDKSISCADWSY 134

Query: 1905 RPLPEPAIQYARTDTHYLLYVYDCMK 1930
            RPL E    YA  D + LL +Y  ++
Sbjct: 135  RPLTEDQKLYAAADVYALLIIYTKLR 160


>gnl|CDD|214681 smart00474, 35EXOc, 3'-5' exonuclease.  3\' -5' exonuclease
           proofreading domain present in DNA polymerase I, Werner
           syndrome helicase, RNase D and other enzymes.
          Length = 172

 Score =  138 bits (349), Expect = 3e-37
 Identities = 63/171 (36%), Positives = 97/171 (56%), Gaps = 7/171 (4%)

Query: 323 MITEPEQVTQLVSELKQQQ-EIAIDLEYHNYRSYQGYTCLMQIS-TRDKDYIVDTLKLRE 380
           ++T+ E + +L+ +L+    E+A+D E     SY G   L+QIS T +  +I+D L L +
Sbjct: 3   VVTDSETLEELLEKLRAAGGEVALDTETTGLDSYSGKLVLIQISVTGEGAFIIDPLALGD 62

Query: 381 DLEVLNEVLTDKNIVKVFHGADSDIKWLQKDFGLYVVGMFDTHQACKFLPMPRQS---LA 437
           DLE+L ++L D+ I KV H A  D+  L + FG+ +  +FDT  A  +L +   S   LA
Sbjct: 63  DLEILKDLLEDETITKVGHNAKFDLHVLAR-FGIELENIFDTMLA-AYLLLGGPSKHGLA 120

Query: 438 YLLKHYCDVDSDKTFQLFDWRHRPLPEPAIQYARTDTHYLLYVYDCMKLDL 488
            LL  Y  V+ DK  Q  DW  RPL E  ++YA  D   LL +Y+ ++ +L
Sbjct: 121 TLLLGYLGVELDKEEQKSDWGARPLSEEQLEYAAEDADALLRLYEKLEKEL 171



 Score =  138 bits (349), Expect = 3e-37
 Identities = 63/171 (36%), Positives = 97/171 (56%), Gaps = 7/171 (4%)

Query: 1768 MITEPEQVTQLVSELKQQQ-EIAIDLEYHNYRSYQGYTCLMQIS-TRDKDYIVDTLKLRE 1825
            ++T+ E + +L+ +L+    E+A+D E     SY G   L+QIS T +  +I+D L L +
Sbjct: 3    VVTDSETLEELLEKLRAAGGEVALDTETTGLDSYSGKLVLIQISVTGEGAFIIDPLALGD 62

Query: 1826 DLEVLNEVLTDKNIVKVFHGADSDIKWLQKDFGLYVVGMFDTHQACKFLPMPRQS---LA 1882
            DLE+L ++L D+ I KV H A  D+  L + FG+ +  +FDT  A  +L +   S   LA
Sbjct: 63   DLEILKDLLEDETITKVGHNAKFDLHVLAR-FGIELENIFDTMLA-AYLLLGGPSKHGLA 120

Query: 1883 YLLKHYCDVDSDKTFQLFDWRHRPLPEPAIQYARTDTHYLLYVYDCMKLDL 1933
             LL  Y  V+ DK  Q  DW  RPL E  ++YA  D   LL +Y+ ++ +L
Sbjct: 121  TLLLGYLGVELDKEEQKSDWGARPLSEEQLEYAAEDADALLRLYEKLEKEL 171


>gnl|CDD|227602 COG5277, COG5277, Actin and related proteins [Cytoskeleton].
          Length = 444

 Score =  141 bits (357), Expect = 2e-35
 Identities = 103/413 (24%), Positives = 171/413 (41%), Gaps = 51/413 (12%)

Query: 870  PIVFDNGAWCCRVGWASCEKPNLIFKNLIAKPRKERGKKDGETQ----VGNDISNIEAV- 924
             IV DNG+   + G+A  + P  +F +++ + R E    +   +    VGN+  N     
Sbjct: 8    TIVIDNGSGTTKAGFAGNDTPTTVFPSIVGRRRDEDSVMEDTEEKDTYVGNEAQNDRDNS 67

Query: 925  RFQLKTQFDKNVVTHYDIQEQIFDYAFS---HLSINTEGNVNHPIVLTEPFLNPNYSRSL 981
              +L+   +  ++ ++D  EQI+DY F     L  + E    HP++LTEP LNP  +R  
Sbjct: 68   LLELRYPIENGIILNWDAMEQIWDYTFFNKGDLLPSPEE---HPLLLTEPPLNPPSNREK 124

Query: 982  MSELLFECYQVPSVCYGIDSLFSYQYNGWEGQSGVIISCGYQCTHVIPVINGCIDASKAK 1041
            ++ELLFE   VP++   I ++ S   +G   ++G++I  G   THVIPV++G +     K
Sbjct: 125  ITELLFETLNVPALYLAIQAVLSLYASGSSDETGLVIDSGDSVTHVIPVVDGIVLPKAVK 184

Query: 1042 RIDLGGFSVIHHLHKLLQLKYPSHINS------------ITPSRSEELL---WDYGFVAT 1086
            RID+GG  +  +L KLL+ KYP                 I     EE+     +  +V+ 
Sbjct: 185  RIDIGGRDITDYLKKLLREKYPPSRGYNLKSELVEYSSEIVNEIKEEVCETDDESAYVSL 244

Query: 1087 DYREHLRKWLDAEFYDSNVVKVQL-PYAVPVPNLTTEQQKDRRKELAKKLVEMNAKKREE 1145
            D  E   +  +     S     QL             ++    +E+  +       K  E
Sbjct: 245  DAEEEFEEEEEKPAEKSTESTFQLSKETSIAKE---SKELPDGEEI--EFGNEERFKAPE 299

Query: 1146 RLVDDERHLNELLELREIVELTPSDHSHAREAFKSMGINNIQDLNKSINQ-LQQKIEKTK 1204
             L   E  ++ L E  +I E      +   E   +   N+I  L + + Q +Q   E  +
Sbjct: 300  ILFKPELPISGLEEAGKIDESKQELVAENYEISPTNLGNDIAGLPELVYQSIQICDEDVR 359

Query: 1205 AKIIAYNN-----GEDLTEEPKAKLSKEI-----------AVPESEAEFKAWL 1241
                 Y+N     G         +L KE+            +P  +    AWL
Sbjct: 360  KS--LYSNIVLTGGTSKIPGFAERLQKELTSLAPSIWKVSVIPPPDPSLDAWL 410



 Score = 76.7 bits (189), Expect = 4e-14
 Identities = 46/190 (24%), Positives = 82/190 (43%), Gaps = 2/190 (1%)

Query: 1282 ELARKEKRDDDFGMRDEDWDVYKVINKDAGDTDSEEEQERLIELEEILRQHDPEFTSLNQ 1341
            E   +E+        +  + + K  +  A ++    + E +    E   +  PE     +
Sbjct: 248  EEFEEEEEKPAEKSTESTFQLSKETS-IAKESKELPDGEEIEFGNEERFKA-PEILFKPE 305

Query: 1342 EQELSPKEANQLHIGVERMCGPECLFQPSMLGSIQAGISETLNFVLNSYPQHIAQSLANN 1401
                  +EA ++    + +        P+ LG+  AG+ E +   +    + + +SL +N
Sbjct: 306  LPISGLEEAGKIDESKQELVAENYEISPTNLGNDIAGLPELVYQSIQICDEDVRKSLYSN 365

Query: 1402 IFVTGSLCQLPGFVERLNKDLLENRPFQSHFSVSLAENPELDAWSGARQFSLSENFHDFA 1461
            I +TG   ++PGF ERL K+L    P     SV    +P LDAW GA   +  E F    
Sbjct: 366  IVLTGGTSKIPGFAERLQKELTSLAPSIWKVSVIPPPDPSLDAWLGASILASLETFQQLW 425

Query: 1462 VTQSDYQEKG 1471
            +T+ +Y+E G
Sbjct: 426  ITKEEYEEHG 435


>gnl|CDD|214592 smart00268, ACTIN, Actin.  ACTIN subfamily of
            ACTIN/mreB/sugarkinase/Hsp70 superfamily.
          Length = 373

 Score =  131 bits (332), Expect = 1e-32
 Identities = 70/271 (25%), Positives = 125/271 (46%), Gaps = 40/271 (14%)

Query: 870  PIVFDNGAWCCRVGWASCEKPNLIFKNLIAKPRKERGKK--DGETQVGNDISNIEAVRFQ 927
             IV DNG+   + G+A  + P ++F +++ +P+  +G      +  VG++         +
Sbjct: 3    AIVIDNGSGTIKAGFAGEDFPQVVFPSIVGRPKDGKGMVGDAKDIFVGDEAQEKRGG-LE 61

Query: 928  LKTQFDKNVVTHYDIQEQIFDYAF-SHLSINTEGNVNHPIVLTEPFLNPNYSRSLMSELL 986
            LK   +  +V ++D  E+I+DY F + L +  E    HP++LTEP +NP  +R  + E++
Sbjct: 62   LKYPIENGIVENWDDMEKIWDYTFFNELRVEPE---EHPVLLTEPPMNPKSNREKILEIM 118

Query: 987  FECYQVPSVCYGIDSLFSYQYNGWEGQSGVIISCGYQCTHVIPVINGCIDASKAKRIDLG 1046
            FE +  P++   I ++ S   +G    +G++I  G   THV+PV++G +     KRID+ 
Sbjct: 119  FETFNFPALYIAIQAVLSLYASG--RTTGLVIDSGDGVTHVVPVVDGYVLPHAIKRIDIA 176

Query: 1047 GFSVIHHLHKLLQLKYPSHINSITPSRSEELLWDYGFVATDYREHLRKWLDAEFYDSNVV 1106
            G  +  +L +LL  +     NS                             AEF     +
Sbjct: 177  GRDITDYLKELLS-ERGYQFNSS----------------------------AEFEIVREI 207

Query: 1107 KVQLPYAVPVPNLTTEQQKDRRKELAKKLVE 1137
            K +L Y V   +   E +  R    + KL +
Sbjct: 208  KEKLCY-VA-EDFEKEMKLARESSESSKLEK 236



 Score = 86.9 bits (216), Expect = 1e-17
 Identities = 39/146 (26%), Positives = 64/146 (43%)

Query: 1326 EEILRQHDPEFTSLNQEQELSPKEANQLHIGVERMCGPECLFQPSMLGSIQAGISETLNF 1385
            E+ ++       S   E+     + N + +G ER   PE LF P ++G  Q GI E +  
Sbjct: 219  EKEMKLARESSESSKLEKTYELPDGNTIKVGNERFRIPEILFSPELIGLEQKGIHELVYE 278

Query: 1386 VLNSYPQHIAQSLANNIFVTGSLCQLPGFVERLNKDLLENRPFQSHFSVSLAENPELDAW 1445
             +      + + L  NI ++G    +PGF ERL K+L +  P +    V      +   W
Sbjct: 279  SIQKCDIDVRKDLYENIVLSGGSTLIPGFGERLEKELKQLAPKKLKVKVIAPPERKYSVW 338

Query: 1446 SGARQFSLSENFHDFAVTQSDYQEKG 1471
             G    +    F D  +T+ +Y+E G
Sbjct: 339  LGGSILASLSTFEDMWITKKEYEESG 364


>gnl|CDD|130455 TIGR01388, rnd, ribonuclease D.  This model describes ribonuclease
           D, a 3'-exonuclease shown to act on tRNA both in vitro
           and when overexpressed in vivo. Trusted members of this
           family are restricted to the Proteobacteria; Aquifex,
           Mycobacterial, and eukaryotic homologs are not
           full-length homologs. Ribonuclease D is not essential in
           E. coli and is deleterious when overexpressed. Its
           precise biological role is still unknown [Transcription,
           RNA processing].
          Length = 367

 Score =  123 bits (311), Expect = 4e-30
 Identities = 79/307 (25%), Positives = 140/307 (45%), Gaps = 15/307 (4%)

Query: 323 MITEPEQVTQLVSELKQQQEIAIDLEYHNYRSYQGYTCLMQISTRDKDYIVDTLKLREDL 382
            IT  +++  +   ++    +A+D E+   R++     L+Q++  ++  ++D L +  D 
Sbjct: 1   WITTDDELATVCEAVRTFPFVALDTEFVRERTFWPQLGLIQVADGEQLALIDPLVI-IDW 59

Query: 383 EVLNEVLTDKNIVKVFHGADSDIKWLQKDFGLYVVGMFDTHQACKFLPMPRQ-SLAYLLK 441
             L E+L D+++VKV H A  D++     FG     +FDT  A  F         A L++
Sbjct: 60  SPLKELLRDESVVKVLHAASEDLEVFLNLFGELPQPLFDTQIAAAFCGFGMSMGYAKLVQ 119

Query: 442 HYCDVDSDKTFQLFDWRHRPLPEPAIQYARTDTHYLLYVYDCMKLDLSAAAHGKQNLVLS 501
               V+ DK+    DW  RPL +  ++YA  D  YLL +Y   KL       G+   +  
Sbjct: 120 EVLGVELDKSESRTDWLARPLTDAQLEYAAADVTYLLPLYA--KLMERLEESGRLAWLEE 177

Query: 502 TFTNSRNICKLKYEKPVFNEEGYMNIFRSHALLNNQQKYALRELYKWRDRIARDKDESTG 561
             T   +    +    V  E+ + +I ++   L  QQ   L+ L  WR+R AR++D    
Sbjct: 178 ECTLLTD----RRTYVVNPEDAWRDI-KNAWQLRPQQLAVLQALAAWREREARERDLPRN 232

Query: 562 YVLPNHMLLQMAQSIPRDIQGIFACCNPVPQTVKEHVLDIHAIILKARLQSLTKPVEKL- 620
           +VL    L ++A+  P ++    A   P    +++H   + A++  A    L  P + L 
Sbjct: 233 FVLKEEALWELARQAPGNL-TELASLGPKGSEIRKHGDTLLALVKTA----LALPEDALP 287

Query: 621 QPSLDGM 627
           Q  L+ M
Sbjct: 288 QAPLNLM 294



 Score = 94.1 bits (234), Expect = 4e-20
 Identities = 46/170 (27%), Positives = 82/170 (48%), Gaps = 2/170 (1%)

Query: 1768 MITEPEQVTQLVSELKQQQEIAIDLEYHNYRSYQGYTCLMQISTRDKDYIVDTLKLREDL 1827
             IT  +++  +   ++    +A+D E+   R++     L+Q++  ++  ++D L +  D 
Sbjct: 1    WITTDDELATVCEAVRTFPFVALDTEFVRERTFWPQLGLIQVADGEQLALIDPLVI-IDW 59

Query: 1828 EVLNEVLTDKNIVKVFHGADSDIKWLQKDFGLYVVGMFDTHQACKFLPMPRQ-SLAYLLK 1886
              L E+L D+++VKV H A  D++     FG     +FDT  A  F         A L++
Sbjct: 60   SPLKELLRDESVVKVLHAASEDLEVFLNLFGELPQPLFDTQIAAAFCGFGMSMGYAKLVQ 119

Query: 1887 HYCDVDSDKTFQLFDWRHRPLPEPAIQYARTDTHYLLYVYDCMKLDLSAA 1936
                V+ DK+    DW  RPL +  ++YA  D  YLL +Y  +   L  +
Sbjct: 120  EVLGVELDKSESRTDWLARPLTDAQLEYAAADVTYLLPLYAKLMERLEES 169


>gnl|CDD|200935 pfam00022, Actin, Actin. 
          Length = 367

 Score =  106 bits (267), Expect = 3e-24
 Identities = 51/203 (25%), Positives = 99/203 (48%), Gaps = 23/203 (11%)

Query: 870  PIVFDNGAWCCRVGWASCEKPNLIFKNLIAKPRKERGKKD---GETQVGNDISNIEAVRF 926
             +V DNG+   + G+A  + P  +  +++ +PR          G+  + +    +E    
Sbjct: 5    ALVIDNGSGTTKAGFAGEDAPRAVIPSVVGRPRGRGVMVKYYVGDEAL-SKRPGLE---- 59

Query: 927  QLKTQFDKNVVTHYDIQEQIFDYAF-SHLSINTEGNVNHPIVLTEPFLNPNYSRSLMSEL 985
             ++   +  +V ++D  E+I+++ F   L ++ E    HP++LTEP LNP  +R   +E+
Sbjct: 60   -VRYPIEDGIVENWDAMEKIWEHTFFEELRVDPE---EHPLLLTEPPLNPPANREKATEI 115

Query: 986  LFECYQVPSVCYGID---SLFSYQYNGWEGQS-GVIISCGYQCTHVIPVINGCIDASKAK 1041
            +FE + VP++        S ++       G++ G+++  G   T V+PV  G +     +
Sbjct: 116  MFETFGVPALYLAKQAVLSAYAS------GRTTGLVVDSGAGVTSVVPVYEGYVLQKAIR 169

Query: 1042 RIDLGGFSVIHHLHKLLQLKYPS 1064
            R DL G  +  +L KLL  +  S
Sbjct: 170  RSDLAGDDLTDYLRKLLSSRTYS 192



 Score = 85.0 bits (211), Expect = 5e-17
 Identities = 41/118 (34%), Positives = 57/118 (48%), Gaps = 1/118 (0%)

Query: 1355 IGVERMCGPECLFQPSMLGSIQAGISETLNFVLNSYPQHIAQSLANNIFVTGSLCQLPGF 1414
            +G ER   PE LF PS++GS  AGI E +   +N+    +  SL  NI VTG     PGF
Sbjct: 241  LGNERFRVPEILFNPSLIGSESAGIPELIYDSINACDVDLRPSLLANIVVTGGTTLFPGF 300

Query: 1415 VERLNKDLLENRPFQSHFSVSLAEN-PELDAWSGARQFSLSENFHDFAVTQSDYQEKG 1471
             ERL K+L +  P      +    N  +  AW G    +    F    V++ +Y+E G
Sbjct: 301  TERLEKELAQLAPSGVKVKIIAPPNERKYSAWIGGSILASLGTFQQMWVSKQEYEEHG 358


>gnl|CDD|240225 PTZ00004, PTZ00004, actin-2; Provisional.
          Length = 378

 Score =  100 bits (251), Expect = 4e-22
 Identities = 59/249 (23%), Positives = 121/249 (48%), Gaps = 20/249 (8%)

Query: 871  IVFDNGAWCCRVGWASCEKPNLIFKNLIAKPRKER---GKKDGETQVGNDISNIEAVRFQ 927
             V DNG+   + G+A  + P  +F +++ +P+      G ++ +  VG++  +   +   
Sbjct: 9    AVVDNGSGMVKAGFAGDDAPRCVFPSIVGRPKNPGIMVGMEEKDCYVGDEAQDKRGI-LT 67

Query: 928  LKTQFDKNVVTHYDIQEQIFDYAF-SHLSINTEGNVNHPIVLTEPFLNPNYSRSLMSELL 986
            LK   +  +VT++D  E+I+ + F + L +  E    HP++LTE  LNP  +R  M++++
Sbjct: 68   LKYPIEHGIVTNWDDMEKIWHHTFYNELRVAPE---EHPVLLTEAPLNPKANREKMTQIM 124

Query: 987  FECYQVPSVCYGIDSLFSYQYNGWEGQSGVIISCGYQCTHVIPVINGCIDASKAKRIDLG 1046
            FE + VP++   I ++ S   +G    +G+++  G   +H +P+  G        R+D+ 
Sbjct: 125  FETHNVPAMYVAIQAVLSLYASGR--TTGIVLDSGDGVSHTVPIYEGYSLPHAIHRLDVA 182

Query: 1047 GFSVIHHLHKLLQLKYPSHINSITPSRSEELLWD----YGFVATDYREHLRKWLDAEFYD 1102
            G  +  ++ K+L         + T +  +E++ D      ++A D+ E +     A   D
Sbjct: 183  GRDLTEYMMKIL----HERGTTFTTTAEKEIVRDIKEKLCYIALDFDEEMGN--SAGSSD 236

Query: 1103 SNVVKVQLP 1111
                  +LP
Sbjct: 237  KYEESYELP 245



 Score = 60.9 bits (148), Expect = 3e-09
 Identities = 37/122 (30%), Positives = 50/122 (40%), Gaps = 1/122 (0%)

Query: 1351 NQLHIGVERMCGPECLFQPSMLGSIQA-GISETLNFVLNSYPQHIAQSLANNIFVTGSLC 1409
              + +G ER   PE LFQPS++G  +  GI E     +N     I + L  NI ++G   
Sbjct: 248  TIITVGSERFRCPEALFQPSLIGKEEPPGIHELTFQSINKCDIDIRKDLYGNIVLSGGTT 307

Query: 1410 QLPGFVERLNKDLLENRPFQSHFSVSLAENPELDAWSGARQFSLSENFHDFAVTQSDYQE 1469
               G  ERL K+L    P      V      +   W G    S    F    VT+ +Y E
Sbjct: 308  MYRGLPERLTKELTTLAPSTMKIKVVAPPERKYSVWIGGSILSSLPTFQQMWVTKEEYDE 367

Query: 1470 KG 1471
             G
Sbjct: 368  SG 369


>gnl|CDD|145428 pfam02268, TFIIA_gamma_N, Transcription initiation factor IIA,
          gamma subunit, helical domain.  Accurate transcription
          in vivo requires at least six general transcription
          initiation factors, in addition to RNA polymerase II.
          Transcription initiation factor IIA (TFIIA) is a
          multimeric protein which facilitates the binding of
          TFIID to the TATA box. The N-terminal domain of the
          gamma subunit is a 4 helix bundle.
          Length = 49

 Score = 90.4 bits (225), Expect = 5e-22
 Identities = 34/49 (69%), Positives = 38/49 (77%)

Query: 1  MSYQLYRNTTLGNTLQESLDELIQYGTITPTLAMKVLLQFDKSINGALP 49
          M Y+LYR +TLG  L ++LDELI  G ITP LAMKVLLQFDKSIN AL 
Sbjct: 1  MYYELYRRSTLGMALTDALDELISDGRITPQLAMKVLLQFDKSINEALA 49


>gnl|CDD|199901 cd10145, TFIIA_gamma_N, Gamma subunit of transcription initiation
          factor IIA, N-terminal helical domain.  Transcription
          factor II A (TFIIA) is one of the general transcription
          factors for RNA polymerase II. TFIIA increases the
          affinity of the TATA-binding protein (TBP) for DNA, in
          order to assemble the initiation complex. TFIIA also
          functions as an activator during development and
          differentiation, and is involved in transcription from
          TATA-less promoters. TFIIA is composed of more than one
          subunit in various organisms. Mammalian TFIIA large
          subunits (TFIIA alpha and beta), and the smaller
          subunit (TFIIA gamma) form a heterotrimer. TFIIA alpha
          and beta are encoded by a single TFIIA_alpha_beta gene
          and post-translationally processed and cleaved. TOA1
          and TOA2 are the two subunits of Yeast TFIIA which
          correspond to Mammalian TFIIA_alpha_beta and TFIIA
          gamma, respectively. TOA1 and TOA2 form a heterodimeric
          protein complex. The TFIIA gamma subunit is highly
          conserved between humans, Drosophila and yeast and it
          is required for TFIIA function. The N-terminal domain
          of the gamma subunit forms a 4-helix bundle together
          with the alpha subunit.
          Length = 49

 Score = 87.6 bits (218), Expect = 5e-21
 Identities = 37/48 (77%), Positives = 40/48 (83%)

Query: 2  SYQLYRNTTLGNTLQESLDELIQYGTITPTLAMKVLLQFDKSINGALP 49
           Y+LYRNTTLG  LQ++LDELIQ G ITP LAMKVLLQFDKSIN AL 
Sbjct: 1  YYELYRNTTLGIALQDALDELIQDGQITPQLAMKVLLQFDKSINEALA 48


>gnl|CDD|190407 pfam02751, TFIIA_gamma_C, Transcription initiation factor IIA, gamma
            subunit.  Accurate transcription in vivo requires at
            least six general transcription initiation factors, in
            addition to RNA polymerase II. Transcription initiation
            factor IIA (TFIIA) is a multimeric protein which
            facilitates the binding of TFIID to the TATA box. The
            C-terminal domain of the gamma subunit is a 12 stranded
            beta-barrel.
          Length = 51

 Score = 86.2 bits (214), Expect = 1e-20
 Identities = 31/51 (60%), Positives = 40/51 (78%), Gaps = 2/51 (3%)

Query: 1484 KVKSRLTFKSGKLNTYRFCDNVWTFVLTDVEFRE-VAEIARVNKLKIVACD 1533
            KVKS+LTFK G L+TYRFCDNVWTF++ + +F+    E   V+K+KIVACD
Sbjct: 1    KVKSKLTFK-GHLDTYRFCDNVWTFIVKNAQFKLDDQETVTVDKVKIVACD 50


>gnl|CDD|240426 PTZ00466, PTZ00466, actin-like protein; Provisional.
          Length = 380

 Score = 91.5 bits (227), Expect = 4e-19
 Identities = 59/206 (28%), Positives = 106/206 (51%), Gaps = 14/206 (6%)

Query: 859  EYLDELRDSSLPIVFDNGAWCCRVGWASCEKPNLIFKNLIAKPRKER---GKKDGETQVG 915
            EY  +L  S+ PI+ DNG    + G+A  + PNL+F + + +P+ +R   G  +G   VG
Sbjct: 4    EYAKQLY-SNQPIIIDNGTGYIKAGFAGEDVPNLVFPSYVGRPKYKRVMAGAVEGNIFVG 62

Query: 916  NDISNIEAVRFQLKTQFDKN--VVTHYDIQEQIFDYAFSHLSINTEGNVNHPIVLTEPFL 973
            N     E  R  LK  +  N  ++ +++  E I+ + ++ + IN+E    HP++LTE  L
Sbjct: 63   N---KAEEYRGLLKVTYPINHGIIENWNDMENIWIHVYNSMKINSE---EHPVLLTEAPL 116

Query: 974  NPNYSRSLMSELLFECYQVPSVCYGIDSLFSYQYNGWEGQSGVIISCGYQCTHVIPVING 1033
            NP  ++  ++E+ FE + VP++   I ++ S    G    +G ++ CG    H + +  G
Sbjct: 117  NPQKNKEKIAEVFFETFNVPALFISIQAILSLYSCG--KTNGTVLDCGDGVCHCVSIYEG 174

Query: 1034 CIDASKAKRIDLGGFSVIHHLHKLLQ 1059
                +   R D+ G  +  +L  LL+
Sbjct: 175  YSITNTITRTDVAGRDITTYLGYLLR 200



 Score = 46.9 bits (111), Expect = 7e-05
 Identities = 29/123 (23%), Positives = 56/123 (45%)

Query: 1349 EANQLHIGVERMCGPECLFQPSMLGSIQAGISETLNFVLNSYPQHIAQSLANNIFVTGSL 1408
            + +Q+ IG ER   PE LF PS+LG    G+SE +   +      + ++L ++I ++G  
Sbjct: 249  DGSQILIGSERYRAPEVLFNPSILGLEYLGLSELIVTSITRADMDLRRTLYSHIVLSGGT 308

Query: 1409 CQLPGFVERLNKDLLENRPFQSHFSVSLAENPELDAWSGARQFSLSENFHDFAVTQSDYQ 1468
                GF +RL  ++ +  P      +S     +   + G    +    F    +++ ++ 
Sbjct: 309  TMFHGFGDRLLNEIRKFAPKDITIRISAPPERKFSTFIGGSILASLATFKKIWISKQEFD 368

Query: 1469 EKG 1471
            E G
Sbjct: 369  EYG 371


>gnl|CDD|173506 PTZ00281, PTZ00281, actin; Provisional.
          Length = 376

 Score = 91.3 bits (226), Expect = 4e-19
 Identities = 57/229 (24%), Positives = 115/229 (50%), Gaps = 14/229 (6%)

Query: 871  IVFDNGAWCCRVGWASCEKPNLIFKNLIAKPRKE-----RGKKDGETQVGNDISNIEAVR 925
            +V DNG+  C+ G+A  + P  +F +++ +PR        G+KD  + VG++  +   + 
Sbjct: 9    LVIDNGSGMCKAGFAGDDAPRAVFPSIVGRPRHTGVMVGMGQKD--SYVGDEAQSKRGI- 65

Query: 926  FQLKTQFDKNVVTHYDIQEQIFDYAF-SHLSINTEGNVNHPIVLTEPFLNPNYSRSLMSE 984
              LK   +  +VT++D  E+I+ + F + L +  E    HP++LTE  LNP  +R  M++
Sbjct: 66   LTLKYPIEHGIVTNWDDMEKIWHHTFYNELRVAPE---EHPVLLTEAPLNPKANREKMTQ 122

Query: 985  LLFECYQVPSVCYGIDSLFSYQYNGWEGQSGVIISCGYQCTHVIPVINGCIDASKAKRID 1044
            ++FE +  P++   I ++ S   +G    +G+++  G   +H +P+  G        R+D
Sbjct: 123  IMFETFNTPAMYVAIQAVLSLYASG--RTTGIVMDSGDGVSHTVPIYEGYALPHAILRLD 180

Query: 1045 LGGFSVIHHLHKLLQLKYPSHINSITPSRSEELLWDYGFVATDYREHLR 1093
            L G  +  ++ K+L  +  S   +       ++     +VA D+   ++
Sbjct: 181  LAGRDLTDYMMKILTERGYSFTTTAEREIVRDIKEKLAYVALDFEAEMQ 229



 Score = 60.9 bits (147), Expect = 3e-09
 Identities = 41/137 (29%), Positives = 61/137 (44%), Gaps = 6/137 (4%)

Query: 1337 TSLNQEQELSPKEANQLHIGVERMCGPECLFQPSMLGSIQAGISETLNFVLNSYPQHIAQ 1396
            ++L +  EL   +   + IG ER   PE LFQPS LG   AGI ET    +      I +
Sbjct: 235  SALEKSYELP--DGQVITIGNERFRCPEALFQPSFLGMESAGIHETTYNSIMKCDVDIRK 292

Query: 1397 SLANNIFVTGSLCQLPGFVERLNKDLLENRPFQSHFSVSLAENPE--LDAWSGARQFSLS 1454
             L  N+ ++G     PG  +R+NK+L    P  S   + +   PE     W G    +  
Sbjct: 293  DLYGNVVLSGGTTMFPGIADRMNKELTALAP--STMKIKIIAPPERKYSVWIGGSILASL 350

Query: 1455 ENFHDFAVTQSDYQEKG 1471
              F    +++ +Y E G
Sbjct: 351  STFQQMWISKEEYDESG 367


>gnl|CDD|199900 cd10014, TFIIA_gamma_C, Gamma subunit of transcription initiation
            factor IIA, C-terminal domain.  Transcription factor II A
            (TFIIA) is one of the general transcription factors for
            RNA polymerase II. TFIIA increases the affinity of the
            TATA-binding protein (TBP) for DNA, in order to assemble
            the initiation complex. TFIIA also functions as an
            activator during development and differentiation, and is
            involved in transcription from TATA-less promoters. TFIIA
            is composed of more than one subunit in various
            organisms. Mammalian TFIIA large subunits (TFIIA alpha
            and beta), and the smaller subunit (TFIIA gamma) form a
            heterotrimer. TFIIA alpha and beta are encoded by a
            single TFIIA_alpha_beta gene and post-translationally
            processed and cleaved. TOA1 and TOA2 are the two subunits
            of Yeast TFIIA which correspond to Mammalian
            TFIIA_alpha_beta and TFIIA gamma, respectively. TOA1 and
            TOA2 form a heterodimeric protein complex. The TFIIA
            gamma subunit is highly conserved between humans,
            Drosophila and yeast and it is required for TFIIA
            function. The C-terminal domain of the gamma
            (TFIIA_gamma_C) subunit forms a beta-barrel structure
            together with TFIIA beta.
          Length = 47

 Score = 80.0 bits (198), Expect = 2e-18
 Identities = 32/46 (69%), Positives = 37/46 (80%), Gaps = 2/46 (4%)

Query: 1489 LTFKSGKLNTYRFCDNVWTFVLTDVEFREV-AEIARVNKLKIVACD 1533
            LTFK G L TYRFCDNVWTF+L DVEF+E   E+  V+K+KIVACD
Sbjct: 1    LTFK-GHLKTYRFCDNVWTFILNDVEFKESNTEVVTVDKVKIVACD 45


>gnl|CDD|236771 PRK10829, PRK10829, ribonuclease D; Provisional.
          Length = 373

 Score = 84.7 bits (210), Expect = 5e-17
 Identities = 71/259 (27%), Positives = 112/259 (43%), Gaps = 15/259 (5%)

Query: 323 MITEPEQVTQLVSELKQQQEIAIDLEYHNYRSYQGYTCLMQISTRDKDYIVDTLKLREDL 382
           MIT  + +  +    +    IA+D E+   R+Y     L+Q+   ++  ++D L +  D 
Sbjct: 5   MITTDDALASVCEAARAFPAIALDTEFVRTRTYYPQLGLIQLYDGEQLSLIDPLGIT-DW 63

Query: 383 EVLNEVLTDKNIVKVFHGADSDIKWLQKDFGLYVVGMFDTHQACKFLPMPRQS-LAYLLK 441
                +L D  + K  H    D++     FG     + DT     F   P     A +++
Sbjct: 64  SPFKALLRDPQVTKFLHAGSEDLEVFLNAFGELPQPLIDTQILAAFCGRPLSCGFASMVE 123

Query: 442 HYCDVDSDKTFQLFDWRHRPLPEPAIQYARTDTHYLLYVYDCMKLDLSAAAHGKQNLVLS 501
            Y  V  DK+    DW  RPL E   +YA  D  YLL +    KL     A G     L 
Sbjct: 124 EYTGVTLDKSESRTDWLARPLSERQCEYAAADVFYLLPIAA--KLMAETEAAGWLPAALD 181

Query: 502 TFTNSRNICKLKYE--KPVFNEEGYMNIFRSHAL-LNNQQKYALRELYKWRDRIARDKDE 558
                R +C+ + E   P   EE Y +I  ++A  L  +Q   L+ L  WR R AR++D 
Sbjct: 182 ---ECRLLCQRRQEVLAP---EEAYRDI--TNAWQLRTRQLACLQLLADWRLRKARERDL 233

Query: 559 STGYVLPNHMLLQMAQSIP 577
           +  +V+    L Q+A+ +P
Sbjct: 234 AVNFVVREEHLWQVARYMP 252



 Score = 69.3 bits (170), Expect = 6e-12
 Identities = 42/157 (26%), Positives = 66/157 (42%), Gaps = 2/157 (1%)

Query: 1768 MITEPEQVTQLVSELKQQQEIAIDLEYHNYRSYQGYTCLMQISTRDKDYIVDTLKLREDL 1827
            MIT  + +  +    +    IA+D E+   R+Y     L+Q+   ++  ++D L +  D 
Sbjct: 5    MITTDDALASVCEAARAFPAIALDTEFVRTRTYYPQLGLIQLYDGEQLSLIDPLGIT-DW 63

Query: 1828 EVLNEVLTDKNIVKVFHGADSDIKWLQKDFGLYVVGMFDTHQACKFLPMPRQS-LAYLLK 1886
                 +L D  + K  H    D++     FG     + DT     F   P     A +++
Sbjct: 64   SPFKALLRDPQVTKFLHAGSEDLEVFLNAFGELPQPLIDTQILAAFCGRPLSCGFASMVE 123

Query: 1887 HYCDVDSDKTFQLFDWRHRPLPEPAIQYARTDTHYLL 1923
             Y  V  DK+    DW  RPL E   +YA  D  YLL
Sbjct: 124  EYTGVTLDKSESRTDWLARPLSERQCEYAAADVFYLL 160


>gnl|CDD|99851 cd06148, Egl_like_exo, DEDDy 3'-5' exonuclease domain of Drosophila
           Egalitarian (Egl) and similar proteins.  The Egalitarian
           (Egl) protein subfamily is composed of Drosophila Egl
           and similar proteins. Egl is a component of an
           mRNA-binding complex which is required for oocyte
           specification. Egl contains a DEDDy-type DnaQ-like 3'-5'
           exonuclease domain possessing three conserved sequence
           motifs termed ExoI, ExoII and ExoIII, with a specific
           YX(3)D pattern at ExoIII. The motifs are clustered
           around the active site and contain four conserved acidic
           residues that serve as ligands for the two metal ions
           required for catalysis. The conservation of this
           subfamily throughout eukaryotes suggests that its
           members may be part of ancient RNA processing complexes
           that are likely to participate in the regulated
           processing of specific mRNAs. Some members of this
           subfamily do not have a completely conserved YX(3)D
           pattern at the ExoIII motif.
          Length = 197

 Score = 76.2 bits (188), Expect = 3e-15
 Identities = 56/181 (30%), Positives = 76/181 (41%), Gaps = 29/181 (16%)

Query: 332 QLVSELKQQQEIAIDLEYHNYRSYQGYTCLMQISTRDKD-YIVDTLKLREDLEVLN---- 386
           + +  LK+Q+ I +D E  N    +G  CL+QI+TR    Y+ D LKL     V      
Sbjct: 2   EAIIHLKKQKVIGLDCEGVNL-GRKGKLCLVQIATRTGQIYLFDILKL--GSIVFINGLK 58

Query: 387 EVLTDKNIVKVFHGA--DSDIKWLQKDFGLYVVGMFDTHQA---------CKFLPMPRQS 435
           ++L  K I+KV H    DSD   L   +G+ +  +FDT  A           F P    S
Sbjct: 59  DILESKKILKVIHDCRRDSDA--LYHQYGIKLNNVFDTQVADALLQEQETGGFNPDRVIS 116

Query: 436 LAYLLKHYCDVDSDKTFQLFD--------WRHRPLPEPAIQYARTDTHYLLYVYDCMKLD 487
           L  LL  Y  +       +          W  RPL E  I+YA  D   LL +Y  M   
Sbjct: 117 LVQLLDKYLYISISLKEDVKKLMREDPKFWALRPLTEDMIRYAALDVLCLLPLYYAMLDA 176

Query: 488 L 488
           L
Sbjct: 177 L 177



 Score = 76.2 bits (188), Expect = 3e-15
 Identities = 56/181 (30%), Positives = 76/181 (41%), Gaps = 29/181 (16%)

Query: 1777 QLVSELKQQQEIAIDLEYHNYRSYQGYTCLMQISTRDKD-YIVDTLKLREDLEVLN---- 1831
            + +  LK+Q+ I +D E  N    +G  CL+QI+TR    Y+ D LKL     V      
Sbjct: 2    EAIIHLKKQKVIGLDCEGVNL-GRKGKLCLVQIATRTGQIYLFDILKL--GSIVFINGLK 58

Query: 1832 EVLTDKNIVKVFHGA--DSDIKWLQKDFGLYVVGMFDTHQA---------CKFLPMPRQS 1880
            ++L  K I+KV H    DSD   L   +G+ +  +FDT  A           F P    S
Sbjct: 59   DILESKKILKVIHDCRRDSDA--LYHQYGIKLNNVFDTQVADALLQEQETGGFNPDRVIS 116

Query: 1881 LAYLLKHYCDVDSDKTFQLFD--------WRHRPLPEPAIQYARTDTHYLLYVYDCMKLD 1932
            L  LL  Y  +       +          W  RPL E  I+YA  D   LL +Y  M   
Sbjct: 117  LVQLLDKYLYISISLKEDVKKLMREDPKFWALRPLTEDMIRYAALDVLCLLPLYYAMLDA 176

Query: 1933 L 1933
            L
Sbjct: 177  L 177


>gnl|CDD|227452 COG5123, TOA2, Transcription initiation factor IIA, gamma subunit
          [Transcription].
          Length = 113

 Score = 71.1 bits (174), Expect = 2e-14
 Identities = 35/67 (52%), Positives = 45/67 (67%), Gaps = 1/67 (1%)

Query: 3  YQLYRNTTLGNTLQESLDELIQYGTITPTLAMKVLLQFDKSINGALPSKVKSRLTFKSGK 62
          Y+LYR + +G  L+++LDELI  G I+P LAM VL  FDKS+   L   V S LTFK GK
Sbjct: 5  YELYRRSMIGKVLEDALDELISAGVISPNLAMHVLETFDKSVPEVLNKSVTSSLTFK-GK 63

Query: 63 LNTFLFI 69
          L+T+ F 
Sbjct: 64 LDTYNFC 70



 Score = 53.8 bits (129), Expect = 2e-08
 Identities = 26/58 (44%), Positives = 32/58 (55%), Gaps = 8/58 (13%)

Query: 1482 NNKVKSRLTFKSGKLNTYRFCDNVWTFVLT------DVEFREVAEIA-RVNKLKIVAC 1532
            N  V S LTFK GKL+TY FCD VWTF+        + E+    E+     K +IVAC
Sbjct: 51   NKSVTSSLTFK-GKLDTYNFCDGVWTFITKNFVMSLNNEYFNAGEVNIVACKARIVAC 107


>gnl|CDD|176653 cd06141, WRN_exo, DEDDy 3'-5' exonuclease domain of WRN and similar
           proteins.  WRN is a unique RecQ DNA helicase exhibiting
           an exonuclease activity. It contains a DEDDy-type
           DnaQ-like 3'-5' exonuclease domain possessing three
           conserved sequence motifs termed ExoI, ExoII and ExoIII,
           with a specific YX(3)D pattern at ExoIII. These motifs
           are clustered around the active site and contain four
           conserved acidic residues that serve as ligands for the
           two metal ions required for catalysis. Mutations in the
           WRN gene cause Werner syndrome, an autosomal recessive
           disorder associated with premature aging and increased
           susceptibility to cancer and type II diabetes. WRN
           interacts with key proteins involved in DNA replication,
           recombination, and repair. It is believed to maintain
           genomic stability and life span by participating in DNA
           processes. WRN is stimulated by Ku70/80, an important
           regulator of genomic stability.
          Length = 170

 Score = 73.0 bits (180), Expect = 2e-14
 Identities = 41/169 (24%), Positives = 75/169 (44%), Gaps = 7/169 (4%)

Query: 324 ITEPEQVTQLVSELKQQQE-IAIDLEY-HNYRSYQGY-TCLMQISTRDKDYIVDTLKLRE 380
               +   + V EL  +++ +  D E+  ++R  +     L+Q++T  +  +     + +
Sbjct: 1   TDSAQDAEEAVKELLGKEKVVGFDTEWRPSFRKGKRNKVALLQLATESRCLLFQLAHMDK 60

Query: 381 DLEVLNEVLTDKNIVKVFHGADSDIKWLQKDFGLYVVGMFDTHQACK--FLPMPRQSLAY 438
               L ++L D +I+KV  G   D + L +DFG+ V G+ D     K         SLA 
Sbjct: 61  LPPSLKQLLEDPSILKVGVGIKGDARKLARDFGIEVRGVVDLSHLAKRVGPRRKLVSLAR 120

Query: 439 LLKHY--CDVDSDKTFQLFDWRHRPLPEPAIQYARTDTHYLLYVYDCMK 485
           L++      +   K  +  +W  RPL +  I YA TD +  L +Y  + 
Sbjct: 121 LVEEVLGLPLSKPKKVRCSNWEARPLSKEQILYAATDAYASLELYRKLL 169



 Score = 73.0 bits (180), Expect = 2e-14
 Identities = 41/169 (24%), Positives = 75/169 (44%), Gaps = 7/169 (4%)

Query: 1769 ITEPEQVTQLVSELKQQQE-IAIDLEY-HNYRSYQGY-TCLMQISTRDKDYIVDTLKLRE 1825
                +   + V EL  +++ +  D E+  ++R  +     L+Q++T  +  +     + +
Sbjct: 1    TDSAQDAEEAVKELLGKEKVVGFDTEWRPSFRKGKRNKVALLQLATESRCLLFQLAHMDK 60

Query: 1826 DLEVLNEVLTDKNIVKVFHGADSDIKWLQKDFGLYVVGMFDTHQACK--FLPMPRQSLAY 1883
                L ++L D +I+KV  G   D + L +DFG+ V G+ D     K         SLA 
Sbjct: 61   LPPSLKQLLEDPSILKVGVGIKGDARKLARDFGIEVRGVVDLSHLAKRVGPRRKLVSLAR 120

Query: 1884 LLKHY--CDVDSDKTFQLFDWRHRPLPEPAIQYARTDTHYLLYVYDCMK 1930
            L++      +   K  +  +W  RPL +  I YA TD +  L +Y  + 
Sbjct: 121  LVEEVLGLPLSKPKKVRCSNWEARPLSKEQILYAATDAYASLELYRKLL 169


>gnl|CDD|201312 pfam00570, HRDC, HRDC domain.  The HRDC (Helicase and RNase D
           C-terminal) domain has a putative role in nucleic acid
           binding. Mutations in the HRDC domain cause human
           disease. It is interesting to note that the RecQ
           helicase in Deinococcus radiodurans has three tandem
           HRDC domains.
          Length = 68

 Score = 69.1 bits (170), Expect = 3e-14
 Identities = 18/68 (26%), Positives = 37/68 (54%)

Query: 538 QKYALRELYKWRDRIARDKDESTGYVLPNHMLLQMAQSIPRDIQGIFACCNPVPQTVKEH 597
           Q   L+ L +WRD +AR++D     +  +  LL++A+ +PR ++ +       P+ ++ +
Sbjct: 1   QLALLKALREWRDELAREEDVPPYVIFSDATLLEIAEKLPRTLEELLRIPGVGPRKLERY 60

Query: 598 VLDIHAII 605
             +I  II
Sbjct: 61  GEEILEII 68


>gnl|CDD|176655 cd06146, mut-7_like_exo, DEDDy 3'-5' exonuclease domain of
           Caenorhabditis elegans mut-7 and similar proteins.  The
           mut-7 subfamily is composed of Caenorhabditis elegans
           mut-7 and similar proteins found in plants and
           metazoans. Mut-7 is implicated in posttranscriptional
           gene silencing. It contains a DEDDy-type DnaQ-like 3'-5'
           exonuclease domain possessing three conserved sequence
           motifs, termed ExoI, ExoII and ExoIII, with a specific
           YX(3)D pattern at ExoIII. These motifs are clustered
           around the active site and contain four conserved acidic
           residues that serve as ligands for the two metal ions
           required for catalysis.
          Length = 193

 Score = 73.1 bits (180), Expect = 3e-14
 Identities = 43/187 (22%), Positives = 76/187 (40%), Gaps = 27/187 (14%)

Query: 323 MITEPEQVTQLVSE--LKQQQEIAIDLEYHNYRSYQGYT--CLMQISTRDKDYIVDTLKL 378
           ++   E++  L+    L+  + + ID E+             ++Q++T D+ +++D L L
Sbjct: 3   IVDSEEELEALLLALSLEAGRVVGIDSEWKPSFLGDSDPRVAILQLATEDEVFLLDLLAL 62

Query: 379 REDL-----EVLNEVLTDKNIVKVFHGADSDIKWLQKDFGLY------VVGMFDTHQACK 427
                     +L  +  D +++K+  G   D+K L   +         V  + D     K
Sbjct: 63  ENLESEDWDRLLKRLFEDPDVLKLGFGFKQDLKALSASYPALKCMFERVQNVLDLQNLAK 122

Query: 428 ------------FLPMPRQSLAYLLKHYCDVDSDKTFQLFDWRHRPLPEPAIQYARTDTH 475
                        LP   + LA L++       DK+ Q  +W  RPL E  I YA  D +
Sbjct: 123 ELQKSDMGRLKGNLPSKTKGLADLVQEVLGKPLDKSEQCSNWERRPLREEQILYAALDAY 182

Query: 476 YLLYVYD 482
            LL V+D
Sbjct: 183 CLLEVFD 189



 Score = 73.1 bits (180), Expect = 3e-14
 Identities = 43/187 (22%), Positives = 76/187 (40%), Gaps = 27/187 (14%)

Query: 1768 MITEPEQVTQLVSE--LKQQQEIAIDLEYHNYRSYQGYT--CLMQISTRDKDYIVDTLKL 1823
            ++   E++  L+    L+  + + ID E+             ++Q++T D+ +++D L L
Sbjct: 3    IVDSEEELEALLLALSLEAGRVVGIDSEWKPSFLGDSDPRVAILQLATEDEVFLLDLLAL 62

Query: 1824 REDL-----EVLNEVLTDKNIVKVFHGADSDIKWLQKDFGLY------VVGMFDTHQACK 1872
                      +L  +  D +++K+  G   D+K L   +         V  + D     K
Sbjct: 63   ENLESEDWDRLLKRLFEDPDVLKLGFGFKQDLKALSASYPALKCMFERVQNVLDLQNLAK 122

Query: 1873 ------------FLPMPRQSLAYLLKHYCDVDSDKTFQLFDWRHRPLPEPAIQYARTDTH 1920
                         LP   + LA L++       DK+ Q  +W  RPL E  I YA  D +
Sbjct: 123  ELQKSDMGRLKGNLPSKTKGLADLVQEVLGKPLDKSEQCSNWERRPLREEQILYAALDAY 182

Query: 1921 YLLYVYD 1927
             LL V+D
Sbjct: 183  CLLEVFD 189


>gnl|CDD|128635 smart00341, HRDC, Helicase and RNase D C-terminal.  Hypothetical
           role in nucleic acid binding. Mutations in the HRDC
           domain cause human disease.
          Length = 81

 Score = 68.5 bits (168), Expect = 6e-14
 Identities = 23/81 (28%), Positives = 41/81 (50%)

Query: 535 NNQQKYALRELYKWRDRIARDKDESTGYVLPNHMLLQMAQSIPRDIQGIFACCNPVPQTV 594
             +Q   LR L +WRD IAR +D    +VLP+  L++MA ++P ++  + A      +  
Sbjct: 1   RERQLRLLRRLRQWRDEIARREDVPPYFVLPDETLIKMAAALPTNVSELLAIDGVGEEKA 60

Query: 595 KEHVLDIHAIILKARLQSLTK 615
           + +  D+ A+I +A       
Sbjct: 61  RRYGKDLLAVIQEASDSPSEA 81


>gnl|CDD|185631 PTZ00452, PTZ00452, actin; Provisional.
          Length = 375

 Score = 70.6 bits (172), Expect = 2e-12
 Identities = 65/270 (24%), Positives = 122/270 (45%), Gaps = 26/270 (9%)

Query: 871  IVFDNGAWCCRVGWASCEKPNLIFKNLIAKPRK---ERGKKDGETQVGNDISNIEAVRFQ 927
            +V DNG+  C++G A  + P   F  ++ + ++        + E  VG +      V   
Sbjct: 8    VVIDNGSGYCKIGIAGDDAPTSCFPAIVGRSKQNDGIFSTFNKEYYVGEEAQAKRGV-LA 66

Query: 928  LKTQFDKNVVTHYDIQEQIFDYAF-SHLSINTEGNVNHPIVLTEPFLNPNYSRSLMSELL 986
            +K      ++  +D  E I+ +AF + L ++ E   + P+ +T+  +N  ++R  M++++
Sbjct: 67   IKEPIQNGIINSWDDIEIIWHHAFYNELCMSPE---DQPVFMTDAPMNSKFNRERMTQIM 123

Query: 987  FECYQVPSVCYGIDSLFSYQYNGWEGQSGVIISCGYQCTHVIPVINGCIDASKAKRIDLG 1046
            FE +  P +    +++ S   +G     G+++  G   TH +PV  G        +I+L 
Sbjct: 124  FETFNTPCLYISNEAVLSLYTSG--KTIGLVVDSGEGVTHCVPVFEGHQIPQAITKINLA 181

Query: 1047 GFSVIHHLHKLLQ-LKYP---SHINSITPSRSEELLWDYGFVATDYREHLRKWLDAEFYD 1102
            G     +L ++LQ L Y     H   I  +  E L     + A D ++  R + ++   D
Sbjct: 182  GRLCTDYLTQILQELGYSLTEPHQRIIVKNIKERLC----YTALDPQDEKRIYKESNSQD 237

Query: 1103 SNVVKVQLPYAVPVPN-LTTEQQKDRRKEL 1131
            S       PY +P  N LT + QK R  E+
Sbjct: 238  S-------PYKLPDGNILTIKSQKFRCSEI 260



 Score = 46.3 bits (109), Expect = 1e-04
 Identities = 36/135 (26%), Positives = 58/135 (42%), Gaps = 2/135 (1%)

Query: 1338 SLNQEQELSPKEANQLHIGVERMCGPECLFQPSMLGSIQAGISETLNFVLNSYPQHIAQS 1397
            S +Q+      + N L I  ++    E LFQP ++G   AGI       +      + Q 
Sbjct: 233  SNSQDSPYKLPDGNILTIKSQKFRCSEILFQPKLIGLEVAGIHHLAYSSIKKCDLDLRQE 292

Query: 1398 LANNIFVTGSLCQLPGFVERLNKDLLENRPFQSHFSVSLAENPELDAW-SGARQFSLSEN 1456
            L  NI ++G     PG   RL+ +L    P Q    V+   +    AW  G+ Q +LS  
Sbjct: 293  LCRNIVLSGGTTLFPGIANRLSNELTNLVPSQLKIQVAAPPDRRFSAWIGGSIQCTLSTQ 352

Query: 1457 FHDFAVTQSDYQEKG 1471
               + + + +Y E+G
Sbjct: 353  QPQW-IKRQEYDEQG 366


>gnl|CDD|240343 PTZ00280, PTZ00280, Actin-related protein 3; Provisional.
          Length = 414

 Score = 69.8 bits (171), Expect = 5e-12
 Identities = 64/280 (22%), Positives = 120/280 (42%), Gaps = 40/280 (14%)

Query: 867  SSLPI-VFDNGAWCCRVGWASCEKPNLIFKNLIA------KPRKERGKKDGETQVGNDIS 919
            S+LP+ V DNG    ++G+A   +P  I   LIA      + R ++G +D +  +G++  
Sbjct: 2    STLPVVVIDNGTGYTKMGYAGNTEPTYIIPTLIADNSKQSRRRSKKGFEDLDFYIGDEAL 61

Query: 920  NIEAVRFQLKTQFDKNVVTHYDIQEQIFDYA-FSHLSINTEGNVNHPIVLTEPFLNPNYS 978
               +  + L       +V  +D+ E+ ++   F +L    E    H  +LTEP +NP  +
Sbjct: 62   A-ASKSYTLTYPMKHGIVEDWDLMEKFWEQCIFKYLRCEPE---EHYFILTEPPMNPPEN 117

Query: 979  RSLMSELLFECYQVPSVCYGID---SLFSYQYNGWEGQ-----SGVIISCGYQCTHVIPV 1030
            R   +E++FE + V  +   +    +L +   +    +     +G +I  G   THVIPV
Sbjct: 118  REYTAEIMFETFNVKGLYIAVQAVLALRASWTSKKAKELGGTLTGTVIDSGDGVTHVIPV 177

Query: 1031 INGCIDASKAKRIDLGGFSVIHHLHKLLQLKYPSHINSITPSRSEELLWDYGFVATDYRE 1090
            ++G +  S  K I L G  + + + ++L+ +            ++ +   Y +VA D   
Sbjct: 178  VDGYVIGSSIKHIPLAGRDITNFIQQMLRERGEPIPAEDILLLAQRIKEKYCYVAPD--- 234

Query: 1091 HLRKWLDAEF--YDSNVVKVQLPYAVPVPNLTTEQQKDRR 1128
                 +  EF  YDS+          P  +       +  
Sbjct: 235  -----IAKEFEKYDSD----------PKNHFKKYTAVNSV 259



 Score = 42.0 bits (99), Expect = 0.002
 Identities = 33/153 (21%), Positives = 56/153 (36%), Gaps = 18/153 (11%)

Query: 1336 FTSLNQEQELSPKEANQLHIGVERMCGPECLFQPSML-GSIQAGISETLNFVLNSYPQHI 1394
            F        ++ K+   + +G ER  GPE  F P +        + E ++  + S P   
Sbjct: 250  FKKYTAVNSVT-KKPYTVDVGYERFLGPEMFFHPEIFSSEWTTPLPEVVDDAIQSCPIDC 308

Query: 1395 AQSLANNIFVTGSLCQLPGFVERLNKD--------------LLENRPFQSHFSVSLAENP 1440
             + L  NI ++G      GF +RL +D              L   +       V++  +P
Sbjct: 309  RRPLYKNIVLSGGSTMFKGFDKRLQRDVRKRVDRRLKKAEELSGGKLKPIPIDVNVVSHP 368

Query: 1441 --ELDAWSGARQFSLSENFHDFAVTQSDYQEKG 1471
                  W G    + S  F     T+++Y E G
Sbjct: 369  RQRYAVWYGGSMLASSPEFEKVCHTKAEYDEYG 401


>gnl|CDD|176656 cd09018, DEDDy_polA_RNaseD_like_exo, DEDDy 3'-5' exonuclease domain
           of family-A DNA polymerases, RNase D, WRN, and similar
           proteins.  DEDDy exonucleases, part of the DnaQ-like (or
           DEDD) exonuclease superfamily, catalyze the excision of
           nucleoside monophosphates at the DNA or RNA termini in
           the 3'-5' direction. They contain four invariant acidic
           residues in three conserved sequence motifs termed ExoI,
           ExoII and ExoIII. DEDDy exonucleases are classified as
           such because of the presence of a specific YX(3)D
           pattern at ExoIII. The four conserved acidic residues
           serve as ligands for the two metal ions required for
           catalysis. This family of DEDDy exonucleases includes
           the proofreading domains of family A DNA polymerases, as
           well as RNases such as RNase D and yeast Rrp6p. The
           Egalitarian (Egl) and Bacillus-like DNA Polymerase I
           subfamilies do not possess a completely conserved YX(3)D
           pattern at the ExoIII motif. In addition, the
           Bacillus-like DNA polymerase I subfamily has inactive
           3'-5' exonuclease domains which do not possess the
           metal-binding residues necessary for activity.
          Length = 150

 Score = 58.0 bits (140), Expect = 2e-09
 Identities = 29/144 (20%), Positives = 51/144 (35%), Gaps = 5/144 (3%)

Query: 343 IAIDLEYHNYRSYQGYTCLMQIS-TRDKDYIVDTLKLREDLEVLNEVLTDKNIVKVFHGA 401
            A D E  +  +      L+Q++       ++        LE+L  +L D+  +KV    
Sbjct: 2   FAFDTETDSLDNISANLVLIQLAIEPGVAALIPVAHDYLALELLKPLLEDEKALKVGQNL 61

Query: 402 DSDIKWLQKDFGLYVVGMFDTHQACKFL--PMPRQSLAYLLKHYCDV--DSDKTFQLFDW 457
             D   L   F       FDT      L     R  +  L++ +        ++     W
Sbjct: 62  KYDRGILLNYFIELRGIAFDTMLEAYILNSVAGRWDMDSLVERWLGHKLIKFESIAGKLW 121

Query: 458 RHRPLPEPAIQYARTDTHYLLYVY 481
            ++PL E   +YA  D    L ++
Sbjct: 122 FNQPLTEEQGRYAAEDADVTLQIH 145



 Score = 58.0 bits (140), Expect = 2e-09
 Identities = 29/144 (20%), Positives = 51/144 (35%), Gaps = 5/144 (3%)

Query: 1788 IAIDLEYHNYRSYQGYTCLMQIS-TRDKDYIVDTLKLREDLEVLNEVLTDKNIVKVFHGA 1846
             A D E  +  +      L+Q++       ++        LE+L  +L D+  +KV    
Sbjct: 2    FAFDTETDSLDNISANLVLIQLAIEPGVAALIPVAHDYLALELLKPLLEDEKALKVGQNL 61

Query: 1847 DSDIKWLQKDFGLYVVGMFDTHQACKFL--PMPRQSLAYLLKHYCDV--DSDKTFQLFDW 1902
              D   L   F       FDT      L     R  +  L++ +        ++     W
Sbjct: 62   KYDRGILLNYFIELRGIAFDTMLEAYILNSVAGRWDMDSLVERWLGHKLIKFESIAGKLW 121

Query: 1903 RHRPLPEPAIQYARTDTHYLLYVY 1926
             ++PL E   +YA  D    L ++
Sbjct: 122  FNQPLTEEQGRYAAEDADVTLQIH 145


>gnl|CDD|212657 cd00012, NBD_sugar-kinase_HSP70_actin, Nucleotide-Binding Domain of
            the sugar kinase/HSP70/actin superfamily.  This
            superfamily includes the actin family, the HSP70 family
            of molecular chaperones and nucleotide exchange factors,
            the ROK (repressor, ORF, kinase) family, the hexokinase
            family, the FGGY family (which includes glycerol kinase
            and similar carbohydrate kinases such as rhamnulokinase
            and xylulokinase), the exopolyphosphatase/guanosine
            pentaphosphate phosphohydrolase/nucleoside triphosphate
            diphosphohydrolase family, propionate kinase/acetate
            kinase family, glycerol dehydratase reactivase,
            2-hydroxyglutaryl-CoA dehydratase component A,
            N-acetylglucosamine kinase, butyrate kinase 2,
            Escherichia coli YeaZ and similar glycoproteases, the
            cell shape-determining protein MreB, the plasmid DNA
            segregation factor ParM, cell cycle proteins FtsA, Pili
            assembly protein PilM, ethanolamine utilization protein
            EutJ, and similar proteins. The nucleotide-binding site
            residues are conserved; the nucleotide sits in a deep
            cleft formed between the two lobes of the
            nucleotide-binding domain (NBD). Substrate binding to
            superfamily members is associated with closure of this
            catalytic site cleft. The functional activities of
            several members of the superfamily, including
            hexokinases, actin, and HSP70s, are modulated by
            allosteric effectors, which may act on the cleft closure.
          Length = 185

 Score = 48.7 bits (116), Expect = 5e-06
 Identities = 27/168 (16%), Positives = 47/168 (27%), Gaps = 20/168 (11%)

Query: 913  QVGNDISNIEAVRFQLKTQFDKNVVTHYDIQEQIFDYAFSHLSINTEGNVNHPIVLTEPF 972
             V +    I               VT  D  E+             +  +   + +TEP 
Sbjct: 13   GVADLDGEILPEEIVPTPVGRPGAVTDLDELEEALRELLKEALRQLKSEI-DAVGITEPG 71

Query: 973  LNPNYSRSLMS---------ELLFECYQVPSVCYGID---SLFSYQYNGWEGQSGVIISC 1020
              P  +R ++           L  E      V    D   +  +    G E  + +++  
Sbjct: 72   GVPKENREVIILPNLLLIPLALALEDLGGVPVAVVNDAVAAALAEGLFGKEEDTVLVVDL 131

Query: 1021 GYQCTHVIPVINGCIDASKAKRIDLGGFSVIHHLHKLLQL--KYPSHI 1066
            G   T +  V +G     K      G   +   L  +L L    P  +
Sbjct: 132  GTGTTGIAIVEDG-----KGGVGAAGELGIAEALAAVLNLLDGLPDAV 174


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC, primarily
            archaeal type.  SMC (structural maintenance of
            chromosomes) proteins bind DNA and act in organizing and
            segregating chromosomes for partition. SMC proteins are
            found in bacteria, archaea, and eukaryotes. It is found
            in a single copy and is homodimeric in prokaryotes, but
            six paralogs (excluded from this family) are found in
            eukarotes, where SMC proteins are heterodimeric. This
            family represents the SMC protein of archaea and a few
            bacteria (Aquifex, Synechocystis, etc); the SMC of other
            bacteria is described by TIGR02168. The N- and C-terminal
            domains of this protein are well conserved, but the
            central hinge region is skewed in composition and highly
            divergent [Cellular processes, Cell division, DNA
            metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 47.0 bits (112), Expect = 1e-04
 Identities = 43/263 (16%), Positives = 95/263 (36%), Gaps = 34/263 (12%)

Query: 1126 DRRKELAK-------------KLVEMNAKKREERLVDDERHLNELLELREIV-ELTPSDH 1171
            DR+KE A               L+    +++ ERL  +         L +   E    + 
Sbjct: 169  DRKKEKALEELEEVEENIERLDLIIDEKRQQLERLRREREKAERYQALLKEKREYEGYEL 228

Query: 1172 SHAREAF---KSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNG-EDLTEEPKAKLSKE 1227
               +EA    K      +  L + + +L ++I + + ++       E+L ++ K    +E
Sbjct: 229  LKEKEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEE 288

Query: 1228 IAVPESEAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISE----- 1282
                      K  + E + + A +      +++  +D  +R      E  +L++E     
Sbjct: 289  QL------RVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELE 342

Query: 1283 --LARKEKRDDDFGMR-DEDWDVYKVINKDAG--DTDSEEEQERLIELEEILRQHDPEFT 1337
              +  + KR D       E  +  + +  +    D +  E ++ L +  E L +   E  
Sbjct: 343  REIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREIN 402

Query: 1338 SLNQEQELSPKEANQLHIGVERM 1360
             L +E +   +E  +L   +  +
Sbjct: 403  ELKRELDRLQEELQRLSEELADL 425



 Score = 42.7 bits (101), Expect = 0.002
 Identities = 49/253 (19%), Positives = 99/253 (39%), Gaps = 29/253 (11%)

Query: 1122 EQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKSM 1181
            E+  +   EL K+L E+     E      +    E L ++E +    ++ +    +    
Sbjct: 254  EKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEK 313

Query: 1182 GINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEP--KAKLSKEIA---------- 1229
                ++D  + + +L+ +I+K  A+I       ++ EE   + KL++E A          
Sbjct: 314  E-RELEDAEERLAKLEAEIDKLLAEIEELE--REIEEERKRRDKLTEEYAELKEELEDLR 370

Query: 1230 --VPESEAEFKAWLIETKKKRAYI---IDKKNARKQRRQDLAKRRTAAAQERMRLISELA 1284
              + E + EF     E K  R  +     + N  K+    L +     ++E   L + +A
Sbjct: 371  AELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIA 430

Query: 1285 RKEKRDDDFGMRDEDWDVYKVINKDAGD-----TDSEEEQERLIELEEILRQHDPEFTSL 1339
              E + ++     E  D    I K          D  + ++ L +L+E   + + E + L
Sbjct: 431  GIEAKINELEEEKE--DKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKL 488

Query: 1340 NQEQELSPKEANQ 1352
              ++EL+  EA  
Sbjct: 489  --QRELAEAEAQA 499



 Score = 36.6 bits (85), Expect = 0.15
 Identities = 45/248 (18%), Positives = 92/248 (37%), Gaps = 24/248 (9%)

Query: 1123 QQKDRRKELAKKLVEMNAKKREERLVDD----ERHLNELLEL-----REIVELTPSDHSH 1173
             +    + L ++L  +  K+    L  +    E  L+EL +      R+I E+       
Sbjct: 671  SEPAELQRLRERLEGL--KRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQL 728

Query: 1174 AREAFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPES 1233
             +E  K      +++L + ++ L+Q+IE  K+++       +  EE   KL + +   E+
Sbjct: 729  EQEEEKLKER--LEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEA 786

Query: 1234 EAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDF 1293
                        +     +++    +  R +   R       R+ L  E   KE ++   
Sbjct: 787  RLSHSRIPEIQAELSK--LEE----EVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQE 840

Query: 1294 GMRDEDWDVYKVINK---DAGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQELSPKEA 1350
                 D        +   +  +   EE +E L ELE  LR  +     L +E++    + 
Sbjct: 841  QRI--DLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQL 898

Query: 1351 NQLHIGVE 1358
             +L   +E
Sbjct: 899  RELERKIE 906



 Score = 34.3 bits (79), Expect = 0.83
 Identities = 22/105 (20%), Positives = 45/105 (42%), Gaps = 6/105 (5%)

Query: 1125 KDRRKELAKKLVEMNAKKRE--ERLVDDERHLNELLELREIVELTPSDHSHAREAFKSMG 1182
            K++ K + K++  +N KK E  E L   E     L +L   +     +        + + 
Sbjct: 846  KEQIKSIEKEIENLNGKKEELEEEL---EELEAALRDLESRLGDLKKERDELEAQLRELE 902

Query: 1183 INNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKE 1227
               I++L   I + ++++ + KAK+ A        E+PK +  + 
Sbjct: 903  -RKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEI 946



 Score = 31.6 bits (72), Expect = 6.0
 Identities = 32/123 (26%), Positives = 60/123 (48%), Gaps = 5/123 (4%)

Query: 1120 TTEQQKDRRKELAKKLVEMNAKKREE-RLVDDERHLN-ELLELREIVELTPSDHSHAREA 1177
            T ++ KD R++L K   E+N  KRE  RL ++ + L+ EL +L   +    +  +   E 
Sbjct: 383  TRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEE 442

Query: 1178 FKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEF 1237
             +   +  I+     + QL   + K + ++       D  E+  +KL +E+A  E+EA+ 
Sbjct: 443  KEDKAL-EIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELA--EAEAQA 499

Query: 1238 KAW 1240
            +A 
Sbjct: 500  RAS 502


>gnl|CDD|235591 PRK05755, PRK05755, DNA polymerase I; Provisional.
          Length = 880

 Score = 45.9 bits (110), Expect = 3e-04
 Identities = 30/122 (24%), Positives = 49/122 (40%), Gaps = 4/122 (3%)

Query: 304 FLKCEEPKQALPLSDTPLMMITEPEQVTQLVSELKQQQEIAIDLEYHNYRSYQGYTCLMQ 363
            L+     +A PL +     I + E++   +++LK     A D E  +    Q     + 
Sbjct: 279 LLRRAAAAEAAPLDEEDYETILDEEELEAWLAKLKAAGLFAFDTETTSLDPMQAELVGLS 338

Query: 364 ISTRDKD--YIVDTLKLREDLEVLNEVLTDKNIVKVFHGADSDIKWLQKDFGLYVVGM-F 420
            +    +  YI      RE L  L  +L D  I KV      D+  L   +G+ + G+ F
Sbjct: 339 FAVEPGEAAYIPLDQLDREVLAALKPLLEDPAIKKVGQNLKYDLHVL-ARYGIELRGIAF 397

Query: 421 DT 422
           DT
Sbjct: 398 DT 399



 Score = 45.9 bits (110), Expect = 3e-04
 Identities = 30/122 (24%), Positives = 49/122 (40%), Gaps = 4/122 (3%)

Query: 1749 FLKCEEPKQALPLSDTPLMMITEPEQVTQLVSELKQQQEIAIDLEYHNYRSYQGYTCLMQ 1808
             L+     +A PL +     I + E++   +++LK     A D E  +    Q     + 
Sbjct: 279  LLRRAAAAEAAPLDEEDYETILDEEELEAWLAKLKAAGLFAFDTETTSLDPMQAELVGLS 338

Query: 1809 ISTRDKD--YIVDTLKLREDLEVLNEVLTDKNIVKVFHGADSDIKWLQKDFGLYVVGM-F 1865
             +    +  YI      RE L  L  +L D  I KV      D+  L   +G+ + G+ F
Sbjct: 339  FAVEPGEAAYIPLDQLDREVLAALKPLLEDPAIKKVGQNLKYDLHVL-ARYGIELRGIAF 397

Query: 1866 DT 1867
            DT
Sbjct: 398  DT 399


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
            chromosome partitioning].
          Length = 1163

 Score = 43.5 bits (103), Expect = 0.001
 Identities = 39/237 (16%), Positives = 87/237 (36%), Gaps = 26/237 (10%)

Query: 1118 NLTTEQQKDRRKELAKKLVEMNAKKREERLVDDERHL-NELLELREIVELTPSDHSHARE 1176
             L  + ++  R+    K      ++  E+L      L  EL EL E +E          E
Sbjct: 706  ELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEE 765

Query: 1177 AFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAE 1236
                            +  L++ + K K +I          +E   +L +E+   E   +
Sbjct: 766  E---------------LESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLD 810

Query: 1237 FKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGMR 1296
                 +E+ ++R   ++++    +   +  + +    +E +  + +   + K + +  + 
Sbjct: 811  ALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELE-ELE 869

Query: 1297 DEDWDVYKVINKDAGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQELSPKEANQL 1353
             E  ++   +         +E +E   ELEE LR+ + E   L +E E   +   +L
Sbjct: 870  AEKEELEDEL---------KELEEEKEELEEELRELESELAELKEEIEKLRERLEEL 917



 Score = 39.3 bits (92), Expect = 0.027
 Identities = 39/212 (18%), Positives = 87/212 (41%), Gaps = 12/212 (5%)

Query: 1122 EQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSD---HSHAREAF 1178
            +++ +  +E  + L E    K +E + + E     L E  E +E    +      A E  
Sbjct: 757  QERLEELEEELESLEEA-LAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERE 815

Query: 1179 KSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFK 1238
                    + L + I +L+++IE+ + K+       +  E+   +L +E+   E EAE +
Sbjct: 816  LESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELE--ELEAEKE 873

Query: 1239 AWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGMRDE 1298
               +E + K     ++K   ++  ++L        +E  +L   L   E + +   +   
Sbjct: 874  E--LEDELKELE--EEKEELEEELRELESELAELKEEIEKLRERLEELEAKLERLEVELP 929

Query: 1299 DWDVYKVINKDAGDTDSEEEQERLIELEEILR 1330
              ++ + + ++  DT   E +  +  LEE + 
Sbjct: 930  --ELEEELEEEYEDTLETELEREIERLEEEIE 959



 Score = 37.0 bits (86), Expect = 0.12
 Identities = 22/136 (16%), Positives = 51/136 (37%), Gaps = 9/136 (6%)

Query: 1122 EQQKDRRKELAKKLVEMNAKKRE-----ERLVDDERHLNELLELREIVELTPSDHSHARE 1176
            E+  +   ELA+   E+   KRE     ERL      L +L E  + +E    +     E
Sbjct: 383  EELAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELE 442

Query: 1177 AFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAE 1236
                     +++L + + +L+ ++++ + ++          E+  + L   +   E+E  
Sbjct: 443  ELNE----ELEELEEQLEELRDRLKELERELAELQEELQRLEKELSSLEARLDRLEAEQR 498

Query: 1237 FKAWLIETKKKRAYII 1252
                +    +     +
Sbjct: 499  ASQGVRAVLEALESGL 514



 Score = 33.1 bits (76), Expect = 1.7
 Identities = 42/229 (18%), Positives = 84/229 (36%), Gaps = 48/229 (20%)

Query: 1125 KDRRKELAKKL--VEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKSMG 1182
            K+R++E  +KL   E N ++ E+ L + E+ L +L    E  E      +  RE   ++ 
Sbjct: 171  KERKEEAERKLERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELRELELALL 230

Query: 1183 INNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFKAWLI 1242
            +  +++L K + +L++++ + + ++          +E   +  KEI              
Sbjct: 231  LAKLKELRKELEELEEELSRLEEELEEL-------QEELEEAEKEI-------------- 269

Query: 1243 ETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGMRDEDWDV 1302
                            K   ++L +      +E + L  E+   E        R      
Sbjct: 270  -------------EELKSELEELREELEELQEELLELKEEIEELEGEISLLRER------ 310

Query: 1303 YKVINKDAGDTDSEEEQERLIELEEILRQHDPEFTSL-NQEQELSPKEA 1350
                  +  + + EE +ERL EL+E +     E        +EL    A
Sbjct: 311  -----LEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLA 354


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 42.6 bits (101), Expect = 0.002
 Identities = 17/64 (26%), Positives = 34/64 (53%)

Query: 764 KSEEKVEMEKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPKENDSS 823
           K+ +K++   EK +K   E ++EK+++    K K E+ EE  EK +  K + E +  +  
Sbjct: 407 KATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEK 466

Query: 824 KGKS 827
           + + 
Sbjct: 467 EEEE 470



 Score = 38.4 bits (90), Expect = 0.035
 Identities = 18/74 (24%), Positives = 36/74 (48%), Gaps = 1/74 (1%)

Query: 752 KITQEPVVLKQIKSEEKVEMEKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRH 811
           K   + +     K+E+K E EK+++KK      ++KEE+   +K K E+ +E  E+    
Sbjct: 406 KKATKKIKKIVEKAEKKREEEKKEKKK-KAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEE 464

Query: 812 KIKSEPKENDSSKG 825
           + + E ++      
Sbjct: 465 EKEEEEEKKKKQAT 478



 Score = 35.7 bits (83), Expect = 0.29
 Identities = 14/66 (21%), Positives = 36/66 (54%)

Query: 761 KQIKSEEKVEMEKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPKEN 820
            +  +++  ++ ++  KK   E++E+K++  A KK + E+ EE  +K +  + + E  E 
Sbjct: 405 SKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEE 464

Query: 821 DSSKGK 826
           +  + +
Sbjct: 465 EKEEEE 470



 Score = 31.4 bits (72), Expect = 6.1
 Identities = 23/115 (20%), Positives = 52/115 (45%), Gaps = 6/115 (5%)

Query: 701 KTSRFET-PYQRFLKSK--EYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEP 757
           +  R E  P+   +     E A  +   ++     +++I+ LT   K   +    + +  
Sbjct: 364 RKVRREVLPFLSIIFKHNPELAARLAAFLE---LTEEEIEFLTGSKKATKKIKKIVEKAE 420

Query: 758 VVLKQIKSEEKVEMEKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHK 812
              ++ K E+K +    K+K+   E E+EK+E+   ++ +  + E+  E+ K+ K
Sbjct: 421 KKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 42.8 bits (100), Expect = 0.002
 Identities = 32/117 (27%), Positives = 58/117 (49%), Gaps = 9/117 (7%)

Query: 713  LKSKEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEME 772
             K  E AK   E++ K   E+KK++ L  +   E ++  ++ +     K   +EE  + E
Sbjct: 1612 AKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAE 1671

Query: 773  KEKRK--KILREREEEKEEQPAMKKIKVE-------KPEESNEKTKRHKIKSEPKEN 820
            ++K+K  +  +  E+EK+   A+KK   E       K +E+ EK K  ++K   +EN
Sbjct: 1672 EDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEEN 1728



 Score = 42.4 bits (99), Expect = 0.003
 Identities = 31/111 (27%), Positives = 54/111 (48%)

Query: 714  KSKEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEMEK 773
            K++E  K   E + KE  E KK + L  +   E ++  ++ +     K    E K E E+
Sbjct: 1682 KAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEE 1741

Query: 774  EKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPKENDSSK 824
            +K+K    +++EE++++ A  K + EK  E   K K   I+ E  E D  +
Sbjct: 1742 DKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKR 1792



 Score = 36.7 bits (84), Expect = 0.19
 Identities = 36/145 (24%), Positives = 67/145 (46%), Gaps = 8/145 (5%)

Query: 686  DKMKIGSNKYQKIKLKTSRFETPYQRFLKSKEYAKAIQEKVDKENAEQKKIDALTPQVKT 745
            D++K    K +  + K +  +       K  E AK   E   K    +KK DA     K 
Sbjct: 1281 DELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAA----KK 1336

Query: 746  EPEENVKITQEPVVLKQIKSEEKVEMEKEKRKKILREREEEKEEQPAMKKI--KVEKPEE 803
            + EE  K   E    +   + ++ E  +EK +   +++EE K++  A KK   + +K +E
Sbjct: 1337 KAEEAKK-AAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADE 1395

Query: 804  SNEKTKRHKIKSEP-KENDSSKGKS 827
            + +K +  K K++  K+  ++K K+
Sbjct: 1396 AKKKAEEDKKKADELKKAAAAKKKA 1420



 Score = 35.5 bits (81), Expect = 0.40
 Identities = 33/138 (23%), Positives = 63/138 (45%), Gaps = 14/138 (10%)

Query: 694  KYQKIKLKTSRFETPYQRFLKSKEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKI 753
            K ++ K          ++  ++K+  +A  E+V K   E+KK+ A   + K   E  +K 
Sbjct: 1565 KAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKA--EEAKKAEEAKIKA 1622

Query: 754  TQ---------EPVVLKQIKSEEKV---EMEKEKRKKILREREEEKEEQPAMKKIKVEKP 801
             +         +   LK+ ++EEK    E++K + +  ++  EE K+ +   KK +  K 
Sbjct: 1623 EELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKK 1682

Query: 802  EESNEKTKRHKIKSEPKE 819
             E +EK     +K E +E
Sbjct: 1683 AEEDEKKAAEALKKEAEE 1700



 Score = 34.0 bits (77), Expect = 1.2
 Identities = 30/139 (21%), Positives = 60/139 (43%), Gaps = 7/139 (5%)

Query: 694  KYQKIKLKTSRFETPYQRFLKSKEYAKAIQEKVDKE---NAEQKKIDAL-TPQVKTEPEE 749
            K ++ K K    +   +   K+ E AKA  E    E     E+ +       + K + + 
Sbjct: 1323 KAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADA 1382

Query: 750  NVKITQEPVVLKQIKSEEKVEMEKEKRKKILREREEEKEEQPAMKKI-KVEKPEESNEKT 808
              K  +E    K  ++++K E +K+K  ++ +    +K+   A KK  + +K +E+ +K 
Sbjct: 1383 AKKKAEE--KKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKA 1440

Query: 809  KRHKIKSEPKENDSSKGKS 827
            +  K   E K+      K+
Sbjct: 1441 EEAKKADEAKKKAEEAKKA 1459



 Score = 33.6 bits (76), Expect = 1.5
 Identities = 30/113 (26%), Positives = 57/113 (50%), Gaps = 4/113 (3%)

Query: 714  KSKEYAKAIQEKV--DKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEM 771
            K++E  KA ++K   + + AE+ K  AL    + +  E  +I +   + ++ K  +  E 
Sbjct: 1553 KAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEA 1612

Query: 772  EKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPKENDSSK 824
            +K +  KI  + EE K+ +   KK++  K +E+ EK K  ++K   +EN    
Sbjct: 1613 KKAEEAKI--KAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKA 1663



 Score = 33.2 bits (75), Expect = 1.9
 Identities = 29/109 (26%), Positives = 51/109 (46%), Gaps = 10/109 (9%)

Query: 713  LKSKEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEME 772
            LK    AK   ++  K+  E+KK D    + K + EE  K  +     ++ K  E+ + +
Sbjct: 1410 LKKAAAAKKKADEAKKKAEEKKKAD----EAKKKAEEAKKADEAKKKAEEAKKAEEAKKK 1465

Query: 773  KEKRKKI--LREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPKE 819
             E+ KK    +++ EE ++    KK    K EE+ +K    K  +E K+
Sbjct: 1466 AEEAKKADEAKKKAEEAKKADEAKK----KAEEAKKKADEAKKAAEAKK 1510



 Score = 32.8 bits (74), Expect = 2.4
 Identities = 32/113 (28%), Positives = 59/113 (52%), Gaps = 8/113 (7%)

Query: 713  LKSKEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLK----QIKSEEK 768
            LK +       E++ K+ AE+KK      ++K   EEN KI  E    +    + K+EE 
Sbjct: 1694 LKKEAEEAKKAEELKKKEAEEKK---KAEELKKAEEEN-KIKAEEAKKEAEEDKKKAEEA 1749

Query: 769  VEMEKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPKEND 821
             + E+EK+K    ++EEEK+ +   K+ +    EE +E+ ++ +++ + K  D
Sbjct: 1750 KKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKD 1802



 Score = 32.8 bits (74), Expect = 2.6
 Identities = 31/123 (25%), Positives = 50/123 (40%), Gaps = 13/123 (10%)

Query: 714  KSKEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEMEK 773
            K  E AK   +   K+  E KK      +      E      E    K   +E+K E  K
Sbjct: 1322 KKAEEAKKKADAAKKKAEEAKKAA----EAAKAEAEAAADEAEAAEEKAEAAEKKKEEAK 1377

Query: 774  EKRKKILREREEEKEEQPAMKKI--------KVEKPEESNEKTKRHKIKSEPKEN-DSSK 824
            +K     ++ EE+K+   A KK         +++K   + +K    K K+E K+  D +K
Sbjct: 1378 KKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAK 1437

Query: 825  GKS 827
             K+
Sbjct: 1438 KKA 1440



 Score = 32.4 bits (73), Expect = 3.2
 Identities = 31/136 (22%), Positives = 57/136 (41%), Gaps = 9/136 (6%)

Query: 686  DKMKIGSNKYQKIKLKTSRFETPYQRFLKSK-EYAKAIQEKVDKENAEQKKIDALTPQV- 743
             + K    K +    K    E       K K E  K   +++ K  A +KK D    +  
Sbjct: 1369 AEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAE 1428

Query: 744  -KTEPEENVKITQEPVVLKQIKSEEKVEMEKEKRKKILREREEEKEEQPAMKKIKVE--K 800
             K + +E  K  +E     +   E K + E+ K+ +  +++ EE ++    KK   E  K
Sbjct: 1429 EKKKADEAKKKAEE----AKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKK 1484

Query: 801  PEESNEKTKRHKIKSE 816
             +E+ +K +  K K++
Sbjct: 1485 ADEAKKKAEEAKKKAD 1500



 Score = 32.4 bits (73), Expect = 3.8
 Identities = 24/106 (22%), Positives = 49/106 (46%), Gaps = 2/106 (1%)

Query: 724  EKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEMEKEKRKKILRER 783
            E+  K+  E KK +      +    E  ++         IK+EE  + ++ K+ +  ++ 
Sbjct: 1233 EEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKA 1292

Query: 784  EEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSE--PKENDSSKGKS 827
            +E K+ +   K  + +K  E  +K    K K+E   K+ D++K K+
Sbjct: 1293 DEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKA 1338



 Score = 32.0 bits (72), Expect = 4.1
 Identities = 45/222 (20%), Positives = 85/222 (38%), Gaps = 13/222 (5%)

Query: 1119 LTTEQQKDRRKELAKKLVEM-NAKKREERLVDDERHLNELLELREIVELTPSDHSHAREA 1177
               E+ K +     KK  E   A + +++  +D++  +EL +     +        A E 
Sbjct: 1371 KKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEK 1430

Query: 1178 FKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEF 1237
             K+       +  K  ++ ++K E+ K    A    E+  +  +AK   E A    EA+ 
Sbjct: 1431 KKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKK 1490

Query: 1238 KAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARK--------EKR 1289
            KA   E  KK+A    K    K++  +  K   A   +  +   E A+K        EK+
Sbjct: 1491 KA---EEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEE-AKKADEAKKAEEKK 1546

Query: 1290 DDDFGMRDEDWDVYKVINKDAGDTDSEEEQERLIELEEILRQ 1331
              D   + E+    +   K      +EE++   +   E  ++
Sbjct: 1547 KADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKK 1588



 Score = 31.6 bits (71), Expect = 5.2
 Identities = 27/143 (18%), Positives = 59/143 (41%), Gaps = 3/143 (2%)

Query: 686  DKMKIGSNKYQKIKLKTSRFETPYQRFLKSKEYAKAIQEKVDKENAEQKKIDALTPQVKT 745
             +      +    + + +  +       K +   KA  +   K+  E+KK D    + + 
Sbjct: 1345 AEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKA--DAAKKKAEEKKKADEAKKKAEE 1402

Query: 746  EPEENVKITQEPVVLKQIKSEEKVEMEKEKRKKILREREEEKEEQPAMKKI-KVEKPEES 804
            + ++  ++ +     K+    +K   EK+K  +  ++ EE K+   A KK  + +K EE+
Sbjct: 1403 DKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEA 1462

Query: 805  NEKTKRHKIKSEPKENDSSKGKS 827
             +K +  K   E K+      K+
Sbjct: 1463 KKKAEEAKKADEAKKKAEEAKKA 1485



 Score = 31.6 bits (71), Expect = 5.5
 Identities = 25/109 (22%), Positives = 47/109 (43%), Gaps = 4/109 (3%)

Query: 714  KSKEYAKAIQEKVDKENA----EQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKV 769
            K+ E  KA + K   + A    E KK D      + +  +  K  +E     ++K  E++
Sbjct: 1498 KADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEEL 1557

Query: 770  EMEKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPK 818
            +  +EK+K    ++ EE +     K  + +K EE+  +      + E K
Sbjct: 1558 KKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKK 1606


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 41.4 bits (97), Expect = 0.005
 Identities = 25/108 (23%), Positives = 51/108 (47%), Gaps = 1/108 (0%)

Query: 724 EKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEMEKEKRKKILRER 783
           E   +E  E++++     + K +P+E  K  +     K+ +  ++ E EKEK+ +  R+R
Sbjct: 101 ESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDR 160

Query: 784 EEEKEEQPAMKKIKVEKPEES-NEKTKRHKIKSEPKENDSSKGKSGGT 830
           EEEK+ +    K + +KP +      K+   + E +   + +   G  
Sbjct: 161 EEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKP 208



 Score = 37.9 bits (88), Expect = 0.058
 Identities = 38/161 (23%), Positives = 75/161 (46%), Gaps = 7/161 (4%)

Query: 693 NKYQKIKLKTSRFETPYQRFLKSKEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENVK 752
            K +K K K    E P  R  K KE AK  +   +KE  ++KK++    + + +  E V+
Sbjct: 113 VKEEKKKKKEKPKEEPKDR--KPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVR 170

Query: 753 ITQEPVVLKQIKSEEKVE--MEKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKR 810
               P    + K   K +   E+EK+++  RE  + K E+P + + + ++ ++  ++   
Sbjct: 171 AKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREKEEDDGKDRETT 230

Query: 811 HKIKSEPKENDSSKGKSGGTISTVDFSKVNYNKYMAKPDTK 851
               S  +E++S +       S+    K + +  MA P+T+
Sbjct: 231 ---TSPMEEDESRQSSEISRRSSSSLKKPDPSPSMASPETR 268



 Score = 37.2 bits (86), Expect = 0.10
 Identities = 29/114 (25%), Positives = 54/114 (47%), Gaps = 4/114 (3%)

Query: 714 KSKEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEMEK 773
           +SK  +    ++V+K  ++     A T   K    E+ K  +E    +Q+K E+K + EK
Sbjct: 68  ESKLSSDEAVKRVEKGGSKGPA--AKTKPAKEPKNESGK--EEEKEKEQVKEEKKKKKEK 123

Query: 774 EKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPKENDSSKGKS 827
            K +   R+ +EE +E+   K+ + EK ++  E   R + K   +    S+ K 
Sbjct: 124 PKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKK 177


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
           found at the N terminus of SMC proteins. The SMC
           (structural maintenance of chromosomes) superfamily
           proteins have ATP-binding domains at the N- and
           C-termini, and two extended coiled-coil domains
           separated by a hinge in the middle. The eukaryotic SMC
           proteins form two kind of heterodimers: the SMC1/SMC3
           and the SMC2/SMC4 types. These heterodimers constitute
           an essential part of higher order complexes, which are
           involved in chromatin and DNA dynamics. This family also
           includes the RecF and RecN proteins that are involved in
           DNA metabolism and recombination.
          Length = 1162

 Score = 41.1 bits (96), Expect = 0.006
 Identities = 31/140 (22%), Positives = 57/140 (40%), Gaps = 5/140 (3%)

Query: 696 QKIKLKTSRFETPYQRFLKSKEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQ 755
           + ++ +    E+  Q   K +E    + ++  +E  E+K  +     +  E EE      
Sbjct: 245 ELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELL 304

Query: 756 EPVVLKQIKSEEKVEMEKEKRKKI-----LREREEEKEEQPAMKKIKVEKPEESNEKTKR 810
           +    K    E+  E EKE +K        +E  EE E++    +IK E  EE  E+ ++
Sbjct: 305 KLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEK 364

Query: 811 HKIKSEPKENDSSKGKSGGT 830
            + K E  E +    K   +
Sbjct: 365 LQEKLEQLEEELLAKKKLES 384



 Score = 41.1 bits (96), Expect = 0.007
 Identities = 25/160 (15%), Positives = 57/160 (35%), Gaps = 3/160 (1%)

Query: 664 NTHLSSYDPQDKKIAQIGLFFEDKMKIGSNKYQKIKLKTSRFETPYQRFLKSKEYAKAIQ 723
              L+  + + +K+       E             +      E   +      E  + I+
Sbjct: 774 EKELAEEEEKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAEL-LEEEQLLIEQEEKIK 832

Query: 724 EKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEME--KEKRKKILR 781
           E+  +E A + K +    ++  E  E ++       L Q    ++ E+E  K K +   +
Sbjct: 833 EEELEELALELKEEQKLEKLAEEELERLEEEITKEELLQELLLKEEELEEQKLKDELESK 892

Query: 782 EREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPKEND 821
           E +E++E++   ++ + +   E  E     +I  E     
Sbjct: 893 EEKEKEEKKELEEESQKDNLLEEKENEIEERIAEEAIILL 932



 Score = 35.7 bits (82), Expect = 0.34
 Identities = 30/249 (12%), Positives = 85/249 (34%), Gaps = 21/249 (8%)

Query: 1121 TEQQKDRRKELAKK------LVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHA 1174
            + +++ +++ L K       L E+     E +L + +        L         +    
Sbjct: 166  SREKRKKKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEE 225

Query: 1175 REAFKSMG---INNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVP 1231
               +          I  L + +   Q++IE +K ++           +   +  KE  + 
Sbjct: 226  NLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQ 285

Query: 1232 ESEAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQ---ERMRLISELARKEK 1288
            E E +  A   E  K     ++++    + +   +++     +   ++ +   E   KE 
Sbjct: 286  EEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKEL 345

Query: 1289 RDDDFGMRDEDWDVYKVINKDAGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQELSPK 1348
                     ++ ++ +   ++  +   + +++     EE+L +   E   L+   +L  +
Sbjct: 346  ---------KELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEE 396

Query: 1349 EANQLHIGV 1357
            E    +   
Sbjct: 397  ELELKNEEE 405


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
            recombination, and repair].
          Length = 908

 Score = 41.3 bits (97), Expect = 0.007
 Identities = 37/188 (19%), Positives = 74/188 (39%), Gaps = 23/188 (12%)

Query: 1118 NLTTEQQKDRRKELAKKLVEMNAKKREERLVDD---------------ERHLNELLE-LR 1161
             L  +Q K+  ++L  +L E+     E RL+                 ++ L EL E L 
Sbjct: 551  KLQLQQLKEELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKLKELEERLS 610

Query: 1162 EIVELTPSDHSHAREAFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPK 1221
            ++ EL  S      E          ++L   + +L  + E  +    A    E+  EE +
Sbjct: 611  QLEELLQSLELSEAENEL---EEAEEELESELEKLNLQAELEELLQAALEELEEKVEELE 667

Query: 1222 AKLSKEIAVPESEAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLIS 1281
            A++ +E+   E+E + +    E  ++   + ++    ++  ++L K+     Q    L S
Sbjct: 668  AEIRRELQRIENEEQLE----EKLEELEQLEEELEQLREELEELLKKLGEIEQLIEELES 723

Query: 1282 ELARKEKR 1289
              A  E+ 
Sbjct: 724  RKAELEEL 731



 Score = 31.7 bits (72), Expect = 5.2
 Identities = 22/146 (15%), Positives = 63/146 (43%), Gaps = 11/146 (7%)

Query: 684 FEDKMKIGSNKYQKIKLKTSRFETPYQRFLKSKEYAKAIQEKVDKENAEQKKIDALTPQV 743
            E  ++    K ++++      E   +     +   + ++E ++K  + +++++ L  ++
Sbjct: 279 LERLLEELEEKIERLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKL 338

Query: 744 KTEPEENVKITQEPVVLKQIKSEEKVEMEKEKRKKILREREE--EKEEQPAMKKIKVEKP 801
           +    E  ++ +E         + ++    E+R K L ER E  EKE + A++++K  + 
Sbjct: 339 EKLESELEELAEE---------KNELAKLLEERLKELEERLEELEKELEKALERLKQLEE 389

Query: 802 EESNEKTKRHKIKSEPKENDSSKGKS 827
                K +  ++ +  +E      + 
Sbjct: 390 AIQELKEELAELSAALEEIQEELEEL 415


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
            bacterial type.  SMC (structural maintenance of
            chromosomes) proteins bind DNA and act in organizing and
            segregating chromosomes for partition. SMC proteins are
            found in bacteria, archaea, and eukaryotes. This family
            represents the SMC protein of most bacteria. The smc gene
            is often associated with scpB (TIGR00281) and scpA genes,
            where scp stands for segregation and condensation
            protein. SMC was shown (in Caulobacter crescentus) to be
            induced early in S phase but present and bound to DNA
            throughout the cell cycle [Cellular processes, Cell
            division, DNA metabolism, Chromosome-associated
            proteins].
          Length = 1179

 Score = 40.4 bits (95), Expect = 0.011
 Identities = 44/234 (18%), Positives = 93/234 (39%), Gaps = 29/234 (12%)

Query: 1122 EQQKDRRKELAKKLVEMNAKKREERLVDDERH--LNELLELREIVELTPSDHSHAREAFK 1179
            E+ ++ +KEL     E++  +++++++ +       +L EL   +E   S      E   
Sbjct: 281  EEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELA 340

Query: 1180 SMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFKA 1239
             +    +++L + +  L+ ++E+ +A++    +  +  EE    L  ++A  E +     
Sbjct: 341  ELE-EKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIA--- 396

Query: 1240 WLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGMRDED 1299
                   +   +       + R + L  RR    QE   L+ +L   E    +     E+
Sbjct: 397  ---SLNNEIERL-------EARLERLEDRRERLQQEIEELLKKLEEAEL--KELQAELEE 444

Query: 1300 WDVYKVINKDAGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQELSPKEANQL 1353
                     +    + +EE ERL E  E LR+   E     Q  + + +E  QL
Sbjct: 445  --------LEEELEELQEELERLEEALEELRE---ELEEAEQALDAAERELAQL 487



 Score = 40.0 bits (94), Expect = 0.015
 Identities = 48/243 (19%), Positives = 107/243 (44%), Gaps = 15/243 (6%)

Query: 1122 EQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELR-EIVELTPSDHSHAREAFKS 1180
            E+ ++  +EL ++L E   +  E      E    +L ELR E+ EL   +    ++   +
Sbjct: 235  EELREELEELQEELKEAEEELEELTAELQELEE-KLEELRLEVSELE-EEIEELQKELYA 292

Query: 1181 MGI------NNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESE 1234
            +           Q L + +  L++++E+ +A++    +  D   E  A+L +++   E +
Sbjct: 293  LANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLE--ELK 350

Query: 1235 AEFKAWLIETKKKRAYIIDKKNARKQRRQDL--AKRRTAAAQERMRLI-SELARKEKRDD 1291
             E ++   E ++  A + + ++  ++  + L   + + A  + ++  + +E+ R E R +
Sbjct: 351  EELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLE 410

Query: 1292 DFGMRDEDWD-VYKVINKDAGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQELSPKEA 1350
                R E      + + K   + + +E Q  L ELEE L +   E   L +  E   +E 
Sbjct: 411  RLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREEL 470

Query: 1351 NQL 1353
             + 
Sbjct: 471  EEA 473



 Score = 38.9 bits (91), Expect = 0.035
 Identities = 44/236 (18%), Positives = 84/236 (35%), Gaps = 36/236 (15%)

Query: 1122 EQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKSM 1181
             Q      EL  ++ E+  +  E      E    E+ EL   +E    +    REA   +
Sbjct: 750  AQLSKELTELEAEIEELEERLEEAEEELAE-AEAEIEELEAQIEQLKEELKALREALDEL 808

Query: 1182 GINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIA------------ 1229
                +  LN+    L++++E  + +I A     +  EE   +LS++I             
Sbjct: 809  R-AELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEEL 867

Query: 1230 VPESEAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKR 1289
            + E E+E +A L E       +       +   ++L++       +R  L  EL    ++
Sbjct: 868  IEELESELEALLNERASLEEAL----ALLRSELEELSEELRELESKRSELRRELEELREK 923

Query: 1290 DDDFGMR------------DEDWDVYKVI------NKDAGDTDSEEEQERLIELEE 1327
                 +R            +   + Y +        ++  + D EE + RL  LE 
Sbjct: 924  LAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALENKIEDDEEEARRRLKRLEN 979



 Score = 35.0 bits (81), Expect = 0.58
 Identities = 43/242 (17%), Positives = 92/242 (38%), Gaps = 60/242 (24%)

Query: 1125 KDRRKELAKKLV--EMNAKKREERLVDDERHLNEL----------LELREIVELTPSDHS 1172
            K+RRKE  +KL     N  + E+ L + ER L  L           EL+   EL      
Sbjct: 171  KERRKETERKLERTRENLDRLEDILNELERQLKSLERQAEKAERYKELKA--EL------ 222

Query: 1173 HAREAFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPE 1232
              RE   ++ +  +++L + + +LQ+++++               EE   +L+ E+   E
Sbjct: 223  --RELELALLVLRLEELREELEELQEELKE--------------AEEELEELTAELQELE 266

Query: 1233 SEAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDD 1292
             +       +E  +     ++++    Q+         +  +++ +++ E   +    + 
Sbjct: 267  EK-------LEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRE---RLANLER 316

Query: 1293 FGMRDEDWDVYKVINKDAGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQELSPKEANQ 1352
                            +  +   EE + +L EL E L + + +   L +E E    E  +
Sbjct: 317  --------------QLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEE 362

Query: 1353 LH 1354
            L 
Sbjct: 363  LE 364



 Score = 34.3 bits (79), Expect = 0.86
 Identities = 49/247 (19%), Positives = 89/247 (36%), Gaps = 18/247 (7%)

Query: 1117 PNLTTEQQKDRRKELAKKLVEMNAKKRE-----ERLVDDERHLNELLELREIVELTPSDH 1171
             N +  +++   +EL +K+ E+  K  E       L  +   L E LE         S  
Sbjct: 668  TNSSILERRREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQ 727

Query: 1172 SHAREAFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVP 1231
              A     +     ++ L + I QL +++ + +A+I       +  EE  A+   EI   
Sbjct: 728  ISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIE-- 785

Query: 1232 ESEAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDD 1291
            E EA+ +    E K  R   +D+  A       L +      +    L   +A  E+R +
Sbjct: 786  ELEAQIEQLKEELKALRE-ALDELRAELTL---LNEEAANLRERLESLERRIAATERRLE 841

Query: 1292 DF-----GMRDEDWDVYKVINKDAGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQELS 1346
            D       + ++   +   I +   +   EE +  L  L       +     L  E E  
Sbjct: 842  DLEEQIEELSEDIESLAAEIEE--LEELIEELESELEALLNERASLEEALALLRSELEEL 899

Query: 1347 PKEANQL 1353
             +E  +L
Sbjct: 900  SEELREL 906



 Score = 33.9 bits (78), Expect = 1.2
 Identities = 32/213 (15%), Positives = 77/213 (36%), Gaps = 22/213 (10%)

Query: 1155 NELLELREIVELTPSDHSHAREAFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGE 1214
             E+ EL E +E      +   +A        + +L K + +L++++E+ + ++   +   
Sbjct: 677  REIEELEEKIEELEEKIAELEKA--------LAELRKELEELEEELEQLRKELEELSRQI 728

Query: 1215 DLTEEPKAKLSKEIAVPESE--------AEFKAWLIETKKKRAYIIDKKNARKQRRQDLA 1266
                +  A+L  E+   E           E +A + E +++     ++    +   ++L 
Sbjct: 729  SALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELE 788

Query: 1267 KRRTAAAQERMRLISELARKEKRDDDFGMRDEDWDVYKVINKDAGDT-DSEEEQERLIEL 1325
             +     +E   L   L           +   + +   +  +           + RL +L
Sbjct: 789  AQIEQLKEELKALREALDELRAE-----LTLLNEEAANLRERLESLERRIAATERRLEDL 843

Query: 1326 EEILRQHDPEFTSLNQEQELSPKEANQLHIGVE 1358
            EE + +   +  SL  E E   +   +L   +E
Sbjct: 844  EEQIEELSEDIESLAAEIEELEELIEELESELE 876


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 39.8 bits (94), Expect = 0.017
 Identities = 40/143 (27%), Positives = 62/143 (43%), Gaps = 25/143 (17%)

Query: 1122 EQQKDRRKELAKKLVEMNAK--KREERLVDDERHLN---ELLELRE-IVELTPSDHSHAR 1175
            E+    R E  K+L E   +  K E+RL+  E +L+   ELLE RE  +E    +    +
Sbjct: 64   EEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQ 123

Query: 1176 EAFKSMGINNIQDLNKSINQLQQKIEK----TK--AKIIAYNNGEDLTEEPKAKLSKEIA 1229
            +  +       ++L + I +  Q++E+    T   AK I     E+      A L KEI 
Sbjct: 124  QELEKK----EEELEELIEEQLQELERISGLTAEEAKEILLEKVEEEARHEAAVLIKEI- 178

Query: 1230 VPESEAEFKAWLIETKKKRAYII 1252
              E EA+      E   K+A  I
Sbjct: 179  --EEEAK------EEADKKAKEI 193



 Score = 32.1 bits (74), Expect = 3.6
 Identities = 17/56 (30%), Positives = 27/56 (48%)

Query: 761 KQIKSEEKVEMEKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSE 816
           K+I  E K E E  K++ +L  +EE  + +   +K   E+  E  +  KR   K E
Sbjct: 41  KRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEE 96



 Score = 31.7 bits (73), Expect = 4.2
 Identities = 27/143 (18%), Positives = 58/143 (40%), Gaps = 22/143 (15%)

Query: 1189 LNKSINQLQQKIEKTKAKII---AYNNGEDLTEEPKAKLSKEIAVPESEAEFKAWLIETK 1245
            + K I + + K  + +AK I   A    E + +E   +  +EI    +E E      E +
Sbjct: 24   VRKKIAEAKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEK-----ELR 78

Query: 1246 KKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGMRDEDWDVYKV 1305
            ++R  +   +    Q+ ++L ++     +    L  +    E++  +   ++E+ +    
Sbjct: 79   ERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELE---- 134

Query: 1306 INKDAGDTDSEEEQERLIELEEI 1328
                      E  +E+L ELE I
Sbjct: 135  ----------ELIEEQLQELERI 147



 Score = 31.3 bits (72), Expect = 6.7
 Identities = 28/122 (22%), Positives = 59/122 (48%), Gaps = 15/122 (12%)

Query: 696 QKIKLKTSRFETPYQRFLKSKEYAKAIQEKVDKENAE-QKKIDALTPQVKTEPEENVKIT 754
           ++++ + +  +   +R L+ +E      E ++K   E +KK   L  +     ++ ++  
Sbjct: 75  KELRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQK-----QQELEKK 129

Query: 755 QEPVVLKQIKSEEKVEME------KEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKT 808
           +E   L+++  E+  E+E       E+ K+IL E+ EE+    A   IK E  EE+ E+ 
Sbjct: 130 EE--ELEELIEEQLQELERISGLTAEEAKEILLEKVEEEARHEAAVLIK-EIEEEAKEEA 186

Query: 809 KR 810
            +
Sbjct: 187 DK 188


>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
           RPA34.5.  This is a family of proteins conserved from
           yeasts to human. Subunit A34.5 of RNA polymerase I is a
           non-essential subunit which is thought to help Pol I
           overcome topological constraints imposed on ribosomal
           DNA during the process of transcription.
          Length = 193

 Score = 38.2 bits (89), Expect = 0.018
 Identities = 20/63 (31%), Positives = 28/63 (44%)

Query: 765 SEEKVEMEKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPKENDSSK 824
             E    EKE   K+ +E E E+EE+   KK K  K E+  +K K+ K+          K
Sbjct: 129 GSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKK 188

Query: 825 GKS 827
            K 
Sbjct: 189 KKK 191



 Score = 30.8 bits (70), Expect = 5.3
 Identities = 20/67 (29%), Positives = 42/67 (62%), Gaps = 1/67 (1%)

Query: 743 VKTEPEENVKITQEPVVLKQIKSEEKVEMEKEKRKKILREREEEKEEQPAMKKIKVEKPE 802
           + +E E + K T   V  K+ + EE+ + EK+K+K++ +E++E+K+++  M + K  K +
Sbjct: 128 LGSESETSEKETTAKVE-KEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKK 186

Query: 803 ESNEKTK 809
           +  +K K
Sbjct: 187 KKKKKKK 193


>gnl|CDD|200340 TIGR03927, T7SS_EssA_Firm, type VII secretion protein EssA.
           Members of this family are associated with type VII
           secretion of WXG100 family targets in the Firmicutes,
           but not in the Actinobacteria. This highly divergent
           protein family consists largely of a central region of
           highly polar low-complexity sequence containing
           occasional LF motifs in weak repeats about 17 residues
           in length, flanked by hydrophobic N- and C-terminal
           regions [Protein fate, Protein and peptide secretion and
           trafficking].
          Length = 150

 Score = 37.4 bits (87), Expect = 0.025
 Identities = 22/108 (20%), Positives = 41/108 (37%), Gaps = 12/108 (11%)

Query: 677 IAQIGLFFEDKMKIGSNKYQK--IKLKTS--RFETPYQRFLKSKEYAKAIQEKVDKENAE 732
            A      + K+++  N+Y+K  I++ T   + ET   + L + E  K I  +  KE  E
Sbjct: 13  SADSAAENDGKLQVKPNRYEKKDIEINTDYLQEETELDKELFTPEEQKKITFQKHKEKPE 72

Query: 733 QKKIDALTPQVKTEPEENVKITQ--------EPVVLKQIKSEEKVEME 772
           Q+++              VK T+        E        + E+   +
Sbjct: 73  QEELKNQLFSENATENNTVKATKKQLFSSEYEQTSSSSESTSEEETKK 120


>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
           membranes [Cell envelope biogenesis, outer membrane].
          Length = 244

 Score = 38.2 bits (89), Expect = 0.026
 Identities = 21/141 (14%), Positives = 40/141 (28%), Gaps = 6/141 (4%)

Query: 721 AIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEME-KEKRKKI 779
              E V              P  + +PE      Q     +     E    + KEK K  
Sbjct: 32  VGIELVPLAVFLLAAKVLEAPTEEPQPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPE 91

Query: 780 LREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPKENDSSKGKSGGTISTV--DFS 837
             ++ ++ + +P  K     K +   +  K    K+  K   +    +    +      +
Sbjct: 92  --KKPKKPKPKPKPKPKPKPKVKPQPKPKKPPS-KTAAKAPAAPNQPARPPSAASASGAA 148

Query: 838 KVNYNKYMAKPDTKILPDPYY 858
                 Y++     I   P Y
Sbjct: 149 TGPSASYLSGLRRAIRRAPRY 169


>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin.  Trichoplein
           or mitostatin, was first defined as a meiosis-specific
           nuclear structural protein. It has since been linked
           with mitochondrial movement. It is associated with the
           mitochondrial outer membrane, and over-expression leads
           to reduction in mitochondrial motility whereas lack of
           it enhances mitochondrial movement. The activity appears
           to be mediated through binding the mitochondria to the
           actin intermediate filaments (IFs).
          Length = 349

 Score = 37.2 bits (87), Expect = 0.080
 Identities = 23/100 (23%), Positives = 52/100 (52%), Gaps = 11/100 (11%)

Query: 710 QRFLKSKEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKV 769
           +R  + +E    +QE++ +E  ++++ +    +   E E+  +I       ++I+ E++ 
Sbjct: 64  KRKEERREGRAVLQEQI-EEREKRRQEEYE--ERLQEREQMDEII------ERIQEEDEA 114

Query: 770 EME--KEKRKKILREREEEKEEQPAMKKIKVEKPEESNEK 807
           E +  +EK+KK+  E +E  EE+   K+ + E+  E   K
Sbjct: 115 EAQEKREKQKKLREEIDEFNEERIERKEEEKEREREEELK 154



 Score = 31.8 bits (73), Expect = 3.7
 Identities = 28/174 (16%), Positives = 70/174 (40%), Gaps = 13/174 (7%)

Query: 1122 EQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKSM 1181
             ++K+ R+E    L E   ++ + R  + E  L E  ++ EI+E    +     +  +  
Sbjct: 63   RKRKEERREGRAVLQEQIEEREKRRQEEYEERLQEREQMDEIIERIQEEDEAEAQEKREK 122

Query: 1182 GINNIQDLNKSINQLQQKIEKTKA-------KIIAYNNGEDLTEEPKAKLSKEIAVPESE 1234
                 +++++   +  ++ E+ K        KI+ Y   +   EE +    +E      +
Sbjct: 123  QKKLREEIDEFNEERIERKEEEKEREREEELKILEYQREKAEREEEREAERRER-----K 177

Query: 1235 AEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEK 1288
             E +  +   + ++    D++    + R DL +      +ER +   E  ++ +
Sbjct: 178  EEKEREVARLRAQQEEAEDEREELDELRADLYQEE-YERKERQKEKEEAEKRRR 230


>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon. 
          Length = 431

 Score = 36.9 bits (85), Expect = 0.094
 Identities = 29/131 (22%), Positives = 57/131 (43%), Gaps = 16/131 (12%)

Query: 723 QEKVDKENAEQKKIDALTPQVK---------TEPEENVKITQEPVVLKQIKSEEKVEMEK 773
           +EK  + + E+   + +T ++K               V+  +E   LKQ + E  +E+E+
Sbjct: 159 EEKPKRGSLEENNGEFMTHKLKHTENTFSRGGAEGAQVEAGKEFEKLKQKQQEAALELEE 218

Query: 774 EK-----RKKILREREEEKEEQPAMKKIK--VEKPEESNEKTKRHKIKSEPKENDSSKGK 826
            K     R+K+L E E+ ++++ A +K +   EK     E  +R    +E ++     G 
Sbjct: 219 LKKKREERRKVLEEEEQRRKQEEADRKSREEEEKRRLKEEIERRRAEAAEKRQKVPEDGL 278

Query: 827 SGGTISTVDFS 837
           S        F+
Sbjct: 279 SEDKKPFKCFT 289



 Score = 33.1 bits (75), Expect = 1.8
 Identities = 26/104 (25%), Positives = 46/104 (44%), Gaps = 3/104 (2%)

Query: 728 KENAEQKKI-DALTPQVKTEPEENVKITQEPVVLKQIKS-EEKVEMEKEKRKKILREREE 785
           +E    ++  +AL  Q + +P    +   EP    Q  S  E   +E+E++++   EREE
Sbjct: 66  REERRDERFSEALERQKEFKPTSTDQSLSEPSRRMQEDSGAENETVEEEEKEESREEREE 125

Query: 786 EKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPKENDSSKGKSGG 829
            +E +   K  +     ++ E  K  K   EP+E +  K  S  
Sbjct: 126 VEETEGVTKSEQKNDWRDAEECQKEEKEP-EPEEEEKPKRGSLE 168



 Score = 31.9 bits (72), Expect = 3.4
 Identities = 28/135 (20%), Positives = 57/135 (42%), Gaps = 16/135 (11%)

Query: 673 QDKKIAQIGLFFEDKMKIGSNKYQKIKLKTSRFETPYQRFLKSKEYAKAIQEKVDKENAE 732
           +       G F   K+K   N + +   + ++ E   +     ++  +A  E +++   +
Sbjct: 164 RGSLEENNGEFMTHKLKHTENTFSRGGAEGAQVEAGKEFEKLKQKQQEAALE-LEELKKK 222

Query: 733 QKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEMEKEKRKKILREREEEKEEQPA 792
           ++       + K   EE     Q     KQ +++ K   E+EKR+  L+E  E +  + A
Sbjct: 223 RE------ERRKVLEEE----EQRR---KQEEADRKSREEEEKRR--LKEEIERRRAEAA 267

Query: 793 MKKIKVEKPEESNEK 807
            K+ KV +   S +K
Sbjct: 268 EKRQKVPEDGLSEDK 282


>gnl|CDD|218744 pfam05781, MRVI1, MRVI1 protein.  This family consists of mammalian
           MRVI1 proteins which are related to the
           lymphoid-restricted membrane protein (JAW1) and the IP3
           receptor associated cGMP kinase substrates A and B
           (IRAGA and IRAGB). The function of MRVI1 is unknown
           although mutations in the Mrvi1 gene induces myeloid
           leukaemia by altering the expression of a gene important
           for myeloid cell growth and/or differentiation so it has
           been speculated that Mrvi1 is a tumour suppressor gene.
           IRAG is very similar in sequence to MRVI1 and is an
           essential NO/cGKI-dependent regulator of IP3-induced
           calcium release. Activation of cGKI decreases
           IP3-stimulated elevations in intracellular calcium,
           induces smooth muscle relaxation and contributes to the
           antiproliferative and pro-apoptotic effects of NO/cGMP.
           Jaw1 is a member of a class of proteins with
           COOH-terminal hydrophobic membrane anchors and is
           structurally similar to proteins involved in vesicle
           targeting and fusion. This suggests that the function
           and/or the structure of the ER in lymphocytes may be
           modified by lymphoid-restricted resident ER proteins.
          Length = 538

 Score = 36.5 bits (84), Expect = 0.16
 Identities = 24/84 (28%), Positives = 37/84 (44%), Gaps = 6/84 (7%)

Query: 757 PVVLKQIKSEEKVEMEKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSE 816
           P +L+ I +    + E+EK+ K L++  E + E+   +  K    EE  E  +       
Sbjct: 400 PSLLRFISTYSWADAEEEKKTKKLQDLREPEGEEAVERTRKPSLSEEVAETEE------W 453

Query: 817 PKENDSSKGKSGGTISTVDFSKVN 840
            KE  SSK  S  T   V + K N
Sbjct: 454 DKEQRSSKLASWVTFLQVSYRKAN 477


>gnl|CDD|219714 pfam08066, PMC2NT, PMC2NT (NUC016) domain.  This domain is found at
           the N-terminus of 3'-5' exonucleases with HRDC domains,
           and also in putative exosome components.
          Length = 91

 Score = 33.4 bits (77), Expect = 0.20
 Identities = 20/90 (22%), Positives = 43/90 (47%), Gaps = 4/90 (4%)

Query: 105 IVKCTKTAQSFPSTHENSLLIASPNYISGVAGTSDKVMTLVDSLLKTQ----NISKSMSK 160
           +V+ T+ A S  +   +      P +   +   S ++++L++ LL++      I     +
Sbjct: 1   LVQTTRAASSLAAQDIDFYRSFDPEFSQSLDEQSQRLLSLINDLLQSADSRGRIPDRSEE 60

Query: 161 LYLEGQKDILTEANDKLLESINTRIDVMAG 190
             +E Q   +++ ND LLE  +T +D + G
Sbjct: 61  DDVEDQWKAVSDVNDSLLEKADTLLDELKG 90



 Score = 33.4 bits (77), Expect = 0.20
 Identities = 20/90 (22%), Positives = 43/90 (47%), Gaps = 4/90 (4%)

Query: 1550 IVKCTKTAQSFPSTHENSLLIASPNYISGVAGTSDKVMTLVDSLLKTQ----NISKSMSK 1605
            +V+ T+ A S  +   +      P +   +   S ++++L++ LL++      I     +
Sbjct: 1    LVQTTRAASSLAAQDIDFYRSFDPEFSQSLDEQSQRLLSLINDLLQSADSRGRIPDRSEE 60

Query: 1606 LYLEGQKDILTEANDKLLESINTRIDVMAG 1635
              +E Q   +++ ND LLE  +T +D + G
Sbjct: 61   DDVEDQWKAVSDVNDSLLEKADTLLDELKG 90


>gnl|CDD|180240 PRK05759, PRK05759, F0F1 ATP synthase subunit B; Validated.
          Length = 156

 Score = 34.4 bits (80), Expect = 0.23
 Identities = 23/93 (24%), Positives = 43/93 (46%), Gaps = 12/93 (12%)

Query: 1195 QLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFKAWLIETKKKRAYIIDK 1254
             + + +E+ + KI      + L    +AK   E+A  + EA+    L E + + A II+ 
Sbjct: 28   PIMKALEERQKKIA-----DGLAAAERAKKELELAQAKYEAQ----LAEARAEAAEIIE- 77

Query: 1255 KNARKQRRQDLAKRRTAAAQERMRLISELARKE 1287
              A+K+  Q + + +  A  E  R+    A+ E
Sbjct: 78   -QAKKRAAQIIEEAKAEAEAEAARIK-AQAQAE 108


>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584).  This
            protein is found in bacteria and eukaryotes. Proteins in
            this family are typically between 943 to 1234 amino acids
            in length. This family contains a P-loop motif suggesting
            it is a nucleotide binding protein. It may be involved in
            replication.
          Length = 1198

 Score = 36.2 bits (84), Expect = 0.23
 Identities = 53/269 (19%), Positives = 97/269 (36%), Gaps = 49/269 (18%)

Query: 1119 LTTEQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLE-LREIVELTPSDHSHAREA 1177
            L  EQQ  + K   +  +    ++ E +L   +  L +LLE  +  +E    D    R  
Sbjct: 661  LQNEQQSLKDK--LELAIAERKQQAETQLRQLDAQLKQLLEQQQAFLEALKDDFRELRTE 718

Query: 1178 FKSMGINNIQDLNKSINQLQQKIEK--------TKAKIIAYNN---GEDLTEEPKAKLSK 1226
              +       +L+  + QL   IE          K     Y+      D+      +L +
Sbjct: 719  RLAKWQVVEGELDNQLAQLSAAIEAARTQAKARLKELKKQYDRELASLDVDPNTVKELKR 778

Query: 1227 EI----------AVPESEA-EFKAWLIETKKKRAYIIDKKNARKQR-----------RQD 1264
            +I          AV   E  E++A++ ET   R  + +++     +           +Q+
Sbjct: 779  QIEELETTIERIAVRRPEVREYRAFMQETWLHRDSLREERPNLAIQLRELESSAEELQQE 838

Query: 1265 LAKRRTAAAQERMRLISELARKEKRDDDF-----GMRDEDWDVYKVINKDAGDTDSEEEQ 1319
            L +        R +L  E    EK+ D       G+RDE     + + +     ++ + +
Sbjct: 839  LTRLIKDTKLRRKKLEQERKALEKQLDQLDELLRGLRDE----MRQLAELKEPANANQAE 894

Query: 1320 ----ERLIELEEILRQHDPEFTSLNQEQE 1344
                ERL +LEE  R+       L +  E
Sbjct: 895  GSISERLDQLEEFKRKRKRLSGDLKKFLE 923


>gnl|CDD|113514 pfam04747, DUF612, Protein of unknown function, DUF612.  This
           family includes several uncharacterized proteins from
           Caenorhabditis elegans.
          Length = 517

 Score = 35.8 bits (81), Expect = 0.23
 Identities = 28/115 (24%), Positives = 59/115 (51%), Gaps = 4/115 (3%)

Query: 706 ETPYQRFLKSKEYAKAIQEKVDKENAEQKKIDALTPQVK----TEPEENVKITQEPVVLK 761
           E P Q     K   K  Q+++ K++  ++K++A     K     E E   +  QE    +
Sbjct: 62  EQPQQVEKVKKSEKKKAQKQIAKDHEAEQKVNAKKAAEKEARRAEAEAKKRAAQEEEHKQ 121

Query: 762 QIKSEEKVEMEKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSE 816
               +E+++ E+EK++  L++ + EK+++ A+K  K EK E++ + +    ++ E
Sbjct: 122 WKAEQERIQKEQEKKEADLKKLQAEKKKEKAVKAEKAEKAEKTKKASTPAPVEEE 176


>gnl|CDD|193205 pfam12729, 4HB_MCP_1, Four helix bundle sensory module for signal
            transduction.  This family is a four helix bundle that
            operates as a ubiquitous sensory module in prokaryotic
            signal-transduction. The 4HB_MCP is always found between
            two predicted transmembrane helices indicating that it
            detects only extracellular signals. In many cases the
            domain is associated with a cytoplasmic HAMP domain
            suggesting that most proteins carrying the bundle might
            share the mechanism of transmembrane signalling which is
            well-characterized in E coli chemoreceptors.
          Length = 181

 Score = 34.5 bits (80), Expect = 0.26
 Identities = 27/104 (25%), Positives = 39/104 (37%), Gaps = 27/104 (25%)

Query: 1183 INNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFKAWLI 1242
            + +I++L   I++L +K EKT            LTEE K              EFK  L 
Sbjct: 81   LKDIEELRAEIDKLLKKYEKTI-----------LTEEEKKLF----------NEFKEQLK 119

Query: 1243 ETKKKRAYIID--KKN----ARKQRRQDLAKRRTAAAQERMRLI 1280
              +K R  ++D  K      A      +L   R A  +    LI
Sbjct: 120  AYRKVRNKVLDLAKAGKNDEAYALYLTELEPARDAVIEALDELI 163


>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
           Transcription of the anti-viral guanylate-binding
           protein (GBP) is induced by interferon-gamma during
           macrophage induction. This family contains GBP1 and
           GPB2, both GTPases capable of binding GTP, GDP and GMP.
          Length = 297

 Score = 35.3 bits (82), Expect = 0.27
 Identities = 32/128 (25%), Positives = 56/128 (43%), Gaps = 24/128 (18%)

Query: 682 LFFEDKMKIGSNKYQKIKLKTSRFETPYQRFLKSKE---------------YAKAIQEKV 726
           LF E++ K+   KY ++  K  + E   Q FL SKE                 KAI+ + 
Sbjct: 152 LFLEERDKL-EAKYNQVPRKGVKAEEVLQEFLNSKEAVEEAILQTDQALTAKEKAIEAER 210

Query: 727 DKENAEQKKIDALTPQVKTEP---EENVKITQEPVVLKQIKS---EEKVEMEKEKRKKIL 780
            K  A + + + L  + K E    E   +  QE V  KQ+      E+ ++  E+ + + 
Sbjct: 211 AKAEAAEAEQELLREKQKEEEQMMEAQERSYQEHV--KQLIEKMEAEREKLLAEQERMLE 268

Query: 781 REREEEKE 788
            + +E++E
Sbjct: 269 HKLQEQEE 276


>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 926

 Score = 36.1 bits (83), Expect = 0.27
 Identities = 29/132 (21%), Positives = 43/132 (32%), Gaps = 24/132 (18%)

Query: 777 KKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPKENDSSKGKSGGTISTVDF 836
           K   R +++ KE+ P  K+IK+E P    E  +    + E K  D+              
Sbjct: 792 KAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAP------------- 838

Query: 837 SKVNYNKYMAKPD-----TKILPD--PYYEYLDELRDSSLPIVFDNGAWCCRVGWASCEK 889
                N    KP      T+ILP    Y  ++    D  +P+    G           E 
Sbjct: 839 --AILNVKKKKPYKVDNMTRILPQQSRYISFI--KDDRFVPVRKFKGGVVVLRDREPKEP 894

Query: 890 PNLIFKNLIAKP 901
             LI      K 
Sbjct: 895 VALIETVRQMKD 906


>gnl|CDD|219572 pfam07780, Spb1_C, Spb1 C-terminal domain.  This presumed domain is
            found at the C-terminus of a family of FtsJ-like
            methyltransferases. Members of this family are involved
            in 60S ribosomal biogenesis.
          Length = 212

 Score = 34.6 bits (80), Expect = 0.32
 Identities = 27/94 (28%), Positives = 42/94 (44%), Gaps = 15/94 (15%)

Query: 1197 QQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFKAWLIETKKKRAYIIDKK- 1255
             +++EK K K  A    ED++E  KAK  K++         KA   + K+K+ Y++ KK 
Sbjct: 125  LKRLEKAKKKAEAIAEDEDISEREKAKQIKKLYK-------KAKKKKKKRKKTYVVAKKG 177

Query: 1256 NARKQRRQDLAKRRTAAAQERMRLISELARKEKR 1289
            N  K  R    K +      RM       +K+ R
Sbjct: 178  NKGKAGRPKGVKGKYKVVDPRM-------KKDLR 204


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
           the C subunit of DNA polymerase delta. It carries the
           essential residues for binding to the Pol1 subunit of
           polymerase alpha, from residues 293-332, which are
           characterized by the motif D--G--VT, referred to as the
           DPIM motif. The first 160 residues of the protein form
           the minimal domain for binding to the B subunit, Cdc1,
           of polymerase delta, the final 10 C-terminal residues,
           362-372, being the DNA sliding clamp, PCNA, binding
           motif.
          Length = 427

 Score = 35.2 bits (81), Expect = 0.35
 Identities = 24/120 (20%), Positives = 43/120 (35%), Gaps = 5/120 (4%)

Query: 714 KSKEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIK----SEEKV 769
           K ++   +     ++   E  K D +      EP    +   E       +     EE  
Sbjct: 229 KKEKKEASESTVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETE 288

Query: 770 EMEKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPKENDSSKGKSGG 829
           E EKEKRK+ L++  E+++E   M+ +     EE   +        + +E       S  
Sbjct: 289 EKEKEKRKR-LKKMMEDEDEDEEMEIVPESPVEEEESEEPEPPPLPKKEEEKEEVTVSPD 347



 Score = 32.9 bits (75), Expect = 1.7
 Identities = 19/94 (20%), Positives = 41/94 (43%), Gaps = 12/94 (12%)

Query: 731 AEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEMEKEKRKKILREREEEKEEQ 790
            ++KK      +   + E   +  +  V+L+  +S E   ++++         E+E E +
Sbjct: 226 TKEKKEKKEASESTVKEESEEESGKRDVILED-ESAEPTGLDED---------EDEDEPK 275

Query: 791 PAMKKIKVEKPEESNEKTKRHKIKSEPKENDSSK 824
           P+ ++   E+  E  EK KR ++K      D  +
Sbjct: 276 PSGERSDSEEETEEKEKEKRKRLKK--MMEDEDE 307


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 35.3 bits (82), Expect = 0.40
 Identities = 30/105 (28%), Positives = 50/105 (47%), Gaps = 13/105 (12%)

Query: 710 QRFLKSKEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKV 769
           ++   ++E AK I E+  KE AE  K +AL  + K   EE  K+  E  + +++K     
Sbjct: 25  KKLGSAEELAKRIIEEAKKE-AETLKKEALL-EAK---EEVHKLRAE--LERELKERRNE 77

Query: 770 EMEKEKRKKILREREE---EKEEQPAMKKIKVEKPEESNEKTKRH 811
               E+R   L +REE    K E    K+  +EK E+     +++
Sbjct: 78  LQRLERR---LLQREETLDRKMESLDKKEENLEKKEKELSNKEKN 119



 Score = 32.2 bits (74), Expect = 2.9
 Identities = 34/130 (26%), Positives = 61/130 (46%), Gaps = 14/130 (10%)

Query: 1232 ESEAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDD 1291
             +E   K  +IE  KK A  + KK A  + ++++ K R    +E     +EL R E+R  
Sbjct: 29   SAEELAKR-IIEEAKKEAETL-KKEALLEAKEEVHKLRAELERELKERRNELQRLERRLL 86

Query: 1292 DFGMRDEDWDVYKVINKDAGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQE------- 1344
                R+E  D  K+ + D  + + E++++ L   E+ L + + E   L  EQ        
Sbjct: 87   Q---REETLD-RKMESLDKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQREELERIS 142

Query: 1345 -LSPKEANQL 1353
             L+ +EA ++
Sbjct: 143  GLTQEEAKEI 152



 Score = 31.4 bits (72), Expect = 5.4
 Identities = 24/89 (26%), Positives = 51/89 (57%), Gaps = 3/89 (3%)

Query: 713 LKSKEYAKAIQEKVDKENAEQ-KKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEM 771
           L++KE    ++ ++++E  E+  ++  L  ++  + EE +    E +  K+ ++ EK E 
Sbjct: 54  LEAKEEVHKLRAELERELKERRNELQRLERRLL-QREETLDRKMESLDKKE-ENLEKKEK 111

Query: 772 EKEKRKKILREREEEKEEQPAMKKIKVEK 800
           E   ++K L E+EEE EE  A ++ ++E+
Sbjct: 112 ELSNKEKNLDEKEEELEELIAEQREELER 140


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 34.8 bits (80), Expect = 0.45
 Identities = 32/119 (26%), Positives = 45/119 (37%), Gaps = 10/119 (8%)

Query: 719 AKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEMEKEKRKK 778
            KA  E   K  AE  K  A   + K E E   K   E    K+ ++E K +   E +KK
Sbjct: 161 KKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAE--AKKKAEAEAKKKAAAEAKKK 218

Query: 779 ILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPKEND------SSKG--KSGG 829
              E +    +  A  K   EK   +    K    K+  + +D      S K   K+GG
Sbjct: 219 AAAEAKAAAAKAAAEAKAAAEKAAAAKAAEKAAAAKAAAEVDDLFGGLDSGKNAPKTGG 277



 Score = 33.2 bits (76), Expect = 1.6
 Identities = 26/113 (23%), Positives = 46/113 (40%), Gaps = 10/113 (8%)

Query: 710 QRFLKSKEYAKAIQEKVDKENAEQKKIDA---LTPQVKTEPEENVKITQEPVVLKQIKSE 766
           QR  K ++ A+ +Q+K   E    K+++       + K + EE  K        KQ ++E
Sbjct: 80  QRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAAL----KQKQAE 135

Query: 767 EKVEMEKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPKE 819
           E         K   +   E K    A KK   E  +++  +  + K  +E K+
Sbjct: 136 EAAAKAAAAAK--AKAEAEAKRAAAAAKKAAAEAKKKAEAEAAK-KAAAEAKK 185


>gnl|CDD|223820 COG0749, PolA, DNA polymerase I - 3'-5' exonuclease and polymerase
           domains [DNA replication, recombination, and repair].
          Length = 593

 Score = 34.9 bits (81), Expect = 0.51
 Identities = 24/102 (23%), Positives = 40/102 (39%), Gaps = 4/102 (3%)

Query: 324 ITEPEQVTQLVSELKQQQEIAIDLEYHNYRSYQGYTCLMQISTRDKDYIVDTLKLREDLE 383
           IT+   +   +++L     IA D E      +      + +++ ++   +  L   E L 
Sbjct: 6   ITDLAVLNAWLTKLNAAANIAFDTETDGLDPHGADLVGLSVASEEEAAYIPLLHGPEQLN 65

Query: 384 V---LNEVLTDKNIVKVFHGADSDIKWLQKDFGLYVVGMFDT 422
           V   L  +L D+ I KV      D K L    G+     FDT
Sbjct: 66  VLAALKPLLEDEGIKKVGQNLKYDYKVLAN-LGIEPGVAFDT 106



 Score = 34.9 bits (81), Expect = 0.51
 Identities = 24/102 (23%), Positives = 40/102 (39%), Gaps = 4/102 (3%)

Query: 1769 ITEPEQVTQLVSELKQQQEIAIDLEYHNYRSYQGYTCLMQISTRDKDYIVDTLKLREDLE 1828
            IT+   +   +++L     IA D E      +      + +++ ++   +  L   E L 
Sbjct: 6    ITDLAVLNAWLTKLNAAANIAFDTETDGLDPHGADLVGLSVASEEEAAYIPLLHGPEQLN 65

Query: 1829 V---LNEVLTDKNIVKVFHGADSDIKWLQKDFGLYVVGMFDT 1867
            V   L  +L D+ I KV      D K L    G+     FDT
Sbjct: 66   VLAALKPLLEDEGIKKVGQNLKYDYKVLAN-LGIEPGVAFDT 106


>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like).  This family
           includes outer membrane proteins such as OmpH among
           others. Skp (OmpH) has been characterized as a molecular
           chaperone that interacts with unfolded proteins as they
           emerge in the periplasm from the Sec translocation
           machinery.
          Length = 157

 Score = 33.4 bits (77), Expect = 0.53
 Identities = 17/93 (18%), Positives = 49/93 (52%), Gaps = 7/93 (7%)

Query: 709 YQRFLKSKEYAKAIQEKVDKENAE-QKKIDALTPQVKTEPE----ENVKITQEPVVLKQI 763
            Q+ L      KA Q++++KE  + Q ++     +++ E +    +   +++E    KQ 
Sbjct: 23  VQKVLSESPAGKAAQKQLEKEFKKLQAELQKKEKELQKEEQKLQKQAATLSEEARKAKQQ 82

Query: 764 KSEEKVEMEKEKRKKILREREEEKEEQPAMKKI 796
           + ++K +  ++K++    ++E ++++Q  ++ I
Sbjct: 83  ELQQKQQELQQKQQAA--QQELQQKQQELLQPI 113


>gnl|CDD|151681 pfam11239, DUF3040, Protein of unknown function (DUF3040).  Some
            members in this family of proteins with unknown function
            are annotated as membrane proteins however this cannot be
            confirmed.
          Length = 82

 Score = 31.8 bits (73), Expect = 0.54
 Identities = 12/24 (50%), Positives = 15/24 (62%)

Query: 1315 SEEEQERLIELEEILRQHDPEFTS 1338
            SE EQ RL E+E  L   DP+F +
Sbjct: 4    SEHEQRRLEEIERALYAEDPKFAA 27


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 35.0 bits (81), Expect = 0.55
 Identities = 55/216 (25%), Positives = 99/216 (45%), Gaps = 33/216 (15%)

Query: 1122 EQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKSM 1181
            E++++R +EL KKL E+  +KR E L +      E    +E                K +
Sbjct: 334  EEKEERLEELKKKLKEL--EKRLEELEERHELYEEAKAKKE----------ELERLKKRL 381

Query: 1182 GINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESE-AEFKAW 1240
                 + L K +    +++EK K +I          EE  +K++  I   + E  E K  
Sbjct: 382  TGLTPEKLEKEL----EELEKAKEEI----------EEEISKITARIGELKKEIKELKKA 427

Query: 1241 LIETKK-KRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGMRDED 1299
            + E KK K    +  +   ++ R++L +  TA   E  R+  EL   E+++    +R E 
Sbjct: 428  IEELKKAKGKCPVCGRELTEEHRKELLEEYTA---ELKRIEKELKEIEEKERK--LRKEL 482

Query: 1300 WDVYKVINKDAGDTDSEEEQERLIELEEILRQHDPE 1335
             ++ KV+ K++     +E  E+L ELEE L++++ E
Sbjct: 483  RELEKVLKKESELIKLKELAEQLKELEEKLKKYNLE 518



 Score = 33.5 bits (77), Expect = 1.4
 Identities = 26/134 (19%), Positives = 63/134 (47%), Gaps = 6/134 (4%)

Query: 689 KIGSNKYQKIKLKTSRFETPYQRFLKSKEYAKAIQEKVDKENAEQKKIDALTPQVKTEPE 748
           ++G    ++++ +    E  Y  +L+ K+  K ++ +  +    ++++D    ++    +
Sbjct: 581 ELGFESVEELEERLKELEPFYNEYLELKDAEKELEREEKELKKLEEELDKAFEELAETEK 640

Query: 749 ENVKITQEPVVLKQIKSEEKVE------MEKEKRKKILREREEEKEEQPAMKKIKVEKPE 802
              ++ +E   L++  SEE+ E      +E  +    LR   EE E++    K  +EK +
Sbjct: 641 RLEELRKELEELEKKYSEEEYEELREEYLELSRELAGLRAELEELEKRREEIKKTLEKLK 700

Query: 803 ESNEKTKRHKIKSE 816
           E  E+ ++ K + E
Sbjct: 701 EELEEREKAKKELE 714



 Score = 33.5 bits (77), Expect = 1.6
 Identities = 32/145 (22%), Positives = 59/145 (40%), Gaps = 10/145 (6%)

Query: 684 FEDKMKIGSNKYQKIKLKTSRFETPYQRFLKSKEYAKAIQEKVDKENAE-QKKIDALTPQ 742
           F ++      + +K   +        +  +K  E  +   E++ K+  E +K+++ L  +
Sbjct: 301 FYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEER 360

Query: 743 VKTEPEENVKITQEPVVLKQIKSEEKVEMEKEKRKKILREREEEKEE-QPAMKKIKVEKP 801
            +   E   K  +    L+++K         EK +K L E E+ KEE +  + KI     
Sbjct: 361 HELYEEAKAKKEE----LERLKKRLTGL-TPEKLEKELEELEKAKEEIEEEISKITARIG 415

Query: 802 EESNEKTKRHKIKSEPKENDSSKGK 826
           E   E  +  K   E K    +KGK
Sbjct: 416 ELKKEIKELKKAIEELK---KAKGK 437



 Score = 33.1 bits (76), Expect = 1.8
 Identities = 38/169 (22%), Positives = 79/169 (46%), Gaps = 20/169 (11%)

Query: 1129 KELAKKLVEMNAKKREERLVDDERHLNELL----ELREIVELTPSDHSHARE-----AFK 1179
             E   + +E + +K EE++ + E  + EL     EL E V+        A E      F 
Sbjct: 243  LEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFY 302

Query: 1180 SMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFKA 1239
               ++ ++++ K +++L+++I   + +I      ++L EE + +L +E+     E E + 
Sbjct: 303  EEYLDELREIEKRLSRLEEEINGIEERI------KEL-EEKEERL-EELKKKLKELEKR- 353

Query: 1240 WLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEK 1288
              +E  ++R  + ++  A+K+  + L KR T    E++    E   K K
Sbjct: 354  --LEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAK 400



 Score = 32.3 bits (74), Expect = 3.7
 Identities = 52/227 (22%), Positives = 95/227 (41%), Gaps = 36/227 (15%)

Query: 1122 EQQKDRRKELAKKLVEMNAKKREERLVDDE-----RHLNELLELREIVELTPSDHSHARE 1176
            E  K++ KEL + L E+N    E   + +E     + + EL EL+E +E         +E
Sbjct: 193  ELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIE------ELEKE 246

Query: 1177 AFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAE 1236
                 G  + + L + I +L+++IE+ K +I      E+L  E K K  KE+   + +AE
Sbjct: 247  LESLEG--SKRKLEEKIRELEERIEELKKEI------EEL--EEKVKELKEL---KEKAE 293

Query: 1237 FKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGMR 1296
                L E  ++    +D+    ++R   L +            I EL  KE+R ++   +
Sbjct: 294  EYIKLSEFYEE---YLDELREIEKRLSRLEEE----INGIEERIKELEEKEERLEELKKK 346

Query: 1297 DEDWDVYKVINKDAGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQ 1343
             +     ++  +     +  E  E     +E L +     T L  E+
Sbjct: 347  LK-----ELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEK 388


>gnl|CDD|234792 PRK00558, uvrC, excinuclease ABC subunit C; Validated.
          Length = 598

 Score = 34.7 bits (81), Expect = 0.58
 Identities = 22/96 (22%), Positives = 39/96 (40%), Gaps = 20/96 (20%)

Query: 1098 AEFYDSNVV--KVQLPYAVPVPNLTTEQQKDRRKELA-------KKLVEM---NAK-KRE 1144
             +FY    +  ++ +P+ +    L       R+ ++        K+L+E+   NA+   E
Sbjct: 298  GQFYLQKSIPKEILVPHELDDEELAEALLAGRKVKIRVPQRGDKKELLELAEKNAREALE 357

Query: 1145 ERLVDDERHLNELLELREIVEL--TPS-----DHSH 1173
             +L D       L EL E++ L   P      D SH
Sbjct: 358  RKLSDQSTQQQALEELAELLGLPEPPYRIECFDISH 393


>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
          Length = 880

 Score = 34.6 bits (80), Expect = 0.65
 Identities = 45/213 (21%), Positives = 91/213 (42%), Gaps = 25/213 (11%)

Query: 1139 NAKKRE-ERLVDDERHLNELLELRE-------IVELTPSDHSHAREAFKSMGINNI--QD 1188
            NA   + + ++DD   L +L E RE        VE   SD   + +  K+  I     +D
Sbjct: 145  NATPSDRQDMIDDLLQLGKLEEYRERASDARLGVERVLSDQRGSLDQLKAQ-IEEKEEKD 203

Query: 1189 LNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLS----------KEIAVPESE-AEF 1237
            L++ +N L+ ++ +   +I  Y    +   E + +            +E+   E+E  + 
Sbjct: 204  LHERLNGLESELAELDEEIERYEEQREQARETRDEADEVLEEHEERREELETLEAEIEDL 263

Query: 1238 KAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGMRD 1297
            +  + ET+++R  + ++    ++R ++L + R     E     ++    E R ++   RD
Sbjct: 264  RETIAETEREREELAEEVRDLRERLEELEEERDDLLAEAGLDDADAEAVEARREELEDRD 323

Query: 1298 EDW-DVYKVINKDAGDTDSEEE--QERLIELEE 1327
            E+  D  +     A   + E E  +E   +LEE
Sbjct: 324  EELRDRLEECRVAAQAHNEEAESLREDADDLEE 356



 Score = 30.8 bits (70), Expect = 9.7
 Identities = 47/264 (17%), Positives = 87/264 (32%), Gaps = 53/264 (20%)

Query: 1120 TTEQQKDRRKELAKKLVEMNAKKREER------LVDDERHL-----------NELLELRE 1162
               ++ +  +E A  L E   + REE       L +    +            E+ ELRE
Sbjct: 339  AHNEEAESLREDADDLEERAEELREEAAELESELEEAREAVEDRREEIEELEEEIEELRE 398

Query: 1163 IVELTPSDHSHAREAFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNN---------- 1212
                 P D  +A E F         +L +   +L+  + +T  + +              
Sbjct: 399  RFGDAPVDLGNA-EDFLEELREERDELREREAELEATL-RTARERVEEAEALLEAGKCPE 456

Query: 1213 -GEDLTEEP-----------KAKLSKEIAVPESEAEFKAWLIETKKKRAYIIDKKNARKQ 1260
             G+ +   P             +L  E+   E E E     +E  +      D+    ++
Sbjct: 457  CGQPVEGSPHVETIEEDRERVEELEAELEDLEEEVEEVEERLERAEDLVEAEDRIERLEE 516

Query: 1261 RRQDLAKR----RTAAAQERMRLISELARKEKRDDDFGMRDEDWDVYKVINKDAGDTDSE 1316
            RR+DL +     R    ++R R      R  + + +   + E             + ++E
Sbjct: 517  RREDLEELIAERRETIEEKRERAEELRERAAELEAEAEEKREA--------AAEAEEEAE 568

Query: 1317 EEQERLIELEEILRQHDPEFTSLN 1340
            E +E + EL   L +      SL 
Sbjct: 569  EAREEVAELNSKLAELKERIESLE 592


>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division and
            chromosome partitioning].
          Length = 420

 Score = 34.3 bits (79), Expect = 0.68
 Identities = 30/117 (25%), Positives = 54/117 (46%), Gaps = 21/117 (17%)

Query: 1187 QDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESE-AEFKAWLIET- 1244
               +K + Q+Q++I   + KI          ++ +AKL K++   E+E A  +A LIET 
Sbjct: 34   AADDKQLKQIQKEIAALEKKI-------REQQDQRAKLEKQLKSLETEIASLEAQLIETA 86

Query: 1245 ------KKKRAYIIDKKNA----RKQRRQDLAKRRTAAAQERMRLISELARKEKRDD 1291
                  +K+ A +  + NA     +++R+ LA+    AA +R       A     +D
Sbjct: 87   DDLKKLRKQIADLNARLNALEVQEREQRRRLAE--QLAALQRSGRNPPPALLVSPED 141



 Score = 32.4 bits (74), Expect = 2.3
 Identities = 29/175 (16%), Positives = 62/175 (35%), Gaps = 10/175 (5%)

Query: 1123 QQKDRRKELAKKLVEMNA--KKREERLVDDERHLNELLELREIVELTPS----DHSHARE 1176
            +  D  K+L K++ ++NA     E +  +  R L E L   +     P           +
Sbjct: 84   ETADDLKKLRKQIADLNARLNALEVQEREQRRRLAEQLAALQRSGRNPPPALLVSPEDAQ 143

Query: 1177 AFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAE 1236
                + I     LN +  +    ++ T  ++ A     ++  E     +        +A+
Sbjct: 144  RSVRLAIY-YGALNPARAERIDALKATLKQLAAV--RAEIAAEQAELTTLLSEQRAQQAK 200

Query: 1237 FKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARK-EKRD 1290
                L E KK  A +  + +A +++ ++L    +    E     +  A+  E   
Sbjct: 201  LAQLLEERKKTLAQLNSELSADQKKLEELRANESRLKNEIASAEAAAAKAREAAA 255



 Score = 30.5 bits (69), Expect = 9.5
 Identities = 33/189 (17%), Positives = 66/189 (34%), Gaps = 21/189 (11%)

Query: 1113 AVPVPNLTTEQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHS 1172
            AV     +      + K++ K++    A +++ R   D+R   +L +  + +E   +   
Sbjct: 25   AVLAAAFSAAADDKQLKQIQKEI---AALEKKIREQQDQR--AKLEKQLKSLETEIASLE 79

Query: 1173 HAREAFKSMGIN----NIQDLNKSINQLQQKIEKTK---AKIIAYNNGEDLTEEPKAKLS 1225
                      +      I DLN  +N L+ +  + +   A+ +A          P   +S
Sbjct: 80   AQLIETA-DDLKKLRKQIADLNARLNALEVQEREQRRRLAEQLAALQRSGRNPPPALLVS 138

Query: 1226 KEIAVPESEAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRL---ISE 1282
                 PE         I          ++ +A K   + LA  R   A E+  L   +SE
Sbjct: 139  -----PEDAQRSVRLAIYYGALNPARAERIDALKATLKQLAAVRAEIAAEQAELTTLLSE 193

Query: 1283 LARKEKRDD 1291
               ++ +  
Sbjct: 194  QRAQQAKLA 202


>gnl|CDD|218434 pfam05103, DivIVA, DivIVA protein.  The Bacillus subtilis divIVA1
            mutation causes misplacement of the septum during cell
            division, resulting in the formation of small, circular,
            anucleate mini-cells. Inactivation of divIVA produces a
            mini-cell phenotype, whereas overproduction of DivIVA
            results in a filamentation phenotype. These proteins
            appear to contain coiled-coils.
          Length = 131

 Score = 32.6 bits (75), Expect = 0.68
 Identities = 23/107 (21%), Positives = 44/107 (41%), Gaps = 11/107 (10%)

Query: 1183 INNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAE-FKAWL 1241
            I + + L K   +L+++IE+ + K+  Y   E+        L K + V +  AE  KA  
Sbjct: 31   IKDYEALYKENEELKEEIERLEEKLEEYKELEE-------TLQKTLVVAQETAEEVKA-- 81

Query: 1242 IETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEK 1288
               +K+   II +  A+ +R  + A             +   +R  +
Sbjct: 82   -NAQKEAELIIKEAEAKAERIVNDANEEAKKLATEYDELKRQSRLFR 127


>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein.  This domain is found in
            cell division proteins which are required for
            kinetochore-spindle association.
          Length = 312

 Score = 33.8 bits (78), Expect = 0.72
 Identities = 21/113 (18%), Positives = 42/113 (37%), Gaps = 23/113 (20%)

Query: 1123 QQKDRRKELAKKLVEMNA-----KKREERLVDD-----------ERHLNELLEL-REIVE 1165
              +  ++ L + L  +        K  E L              E  L +L +L  E+ +
Sbjct: 141  LLEGLKEGLDENLEGLKEDYKLLMKELELLNSIKPKLRDRKDALEEELRQLKQLEDELED 200

Query: 1166 LTPSDHSHAREAFKSM------GINNIQDLNKSINQLQQKIEKTKAKIIAYNN 1212
              P++   A+E  K +       +  +++L + + +L+ KIE    K    N 
Sbjct: 201  CDPTELDRAKEKLKKLLQEIMIKVKKLEELEEELQELESKIEDLTNKKSELNT 253


>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
            potential nuclease [General function prediction only].
          Length = 290

 Score = 33.7 bits (77), Expect = 0.78
 Identities = 56/221 (25%), Positives = 94/221 (42%), Gaps = 43/221 (19%)

Query: 1042 RIDLGGFSVIHHLHKLLQ-----LKYPSHINSITPSRSEELLW------DYGFVATD--- 1087
            RI + G S +  + K LQ      +Y   I+SI P+ + E+        D   +ATD   
Sbjct: 10   RIIVEGASDVEVISKALQRLALGSEYNITISSIIPTTNVEIAKRAAEGADLVLIATDADR 69

Query: 1088 --------YREHLRKWL------------DAEFYDSNVVKVQLPYAVPVPNLTTEQQ--- 1124
                    + E LR  +            D E  D  +V+ +L  A+    L T Q+   
Sbjct: 70   VGRELAEKFFEELRGMVGHIERMKIPIGHDVEHIDVELVRKELKNALVRAGLKTLQRVPE 129

Query: 1125 ----KDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKS 1180
                K+  +EL +KL E+  +K EE L + E    E  E++E ++    ++S   E  K 
Sbjct: 130  YMDLKEDYEELKEKLEELQKEK-EELLKELEELEAEYEEVQERLKRLEVENSRLEEMLKK 188

Query: 1181 MGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPK 1221
            +    + DL K  ++L+  +E  + ++I+    E L   PK
Sbjct: 189  L-PGEVYDLKKRWDELEPGVELPEEELISDLVKETLNLAPK 228


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 34.2 bits (79), Expect = 0.87
 Identities = 22/133 (16%), Positives = 47/133 (35%), Gaps = 3/133 (2%)

Query: 1199 KIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFKAWLIETKKKRAYIIDKKNAR 1258
            K    +   +       + +   A  S         A   A      KK+       + +
Sbjct: 46   KKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKK 105

Query: 1259 KQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGMRDEDWDVYK---VINKDAGDTDS 1315
             +++  L K       + + ++++    +  DDD  + D+D D        ++D  D D 
Sbjct: 106  AEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDV 165

Query: 1316 EEEQERLIELEEI 1328
            ++E E   E +E+
Sbjct: 166  DDEDEEKKEAKEL 178


>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria with
            the myosin-like domain [Function unknown].
          Length = 499

 Score = 33.8 bits (77), Expect = 0.89
 Identities = 46/228 (20%), Positives = 80/228 (35%), Gaps = 32/228 (14%)

Query: 1123 QQKDRRKELAKKLVEMNAKKREERLVDDERHL--NELLELREIVELTPSDHSHAREAFKS 1180
            Q  D R +L     E+   + E+R  + ER    +EL + R+  E    + + AR+    
Sbjct: 75   QLDDIRPQLRALRTELGTAQGEKRAAETEREAARSELQKARQEREAVRQELAAARQNLAK 134

Query: 1181 MGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFKAW 1240
                  Q+L +   Q Q    + K           L  + ++  + +  +  S  + K+ 
Sbjct: 135  A----QQELARLTKQAQDLQTRLKTLAEQR---RQLEAQAQSLQASQKQLQASATQLKSQ 187

Query: 1241 LIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGMRDEDW 1300
            +++ K + A I       +Q  Q+LA R  AA      L    A  ++       RD   
Sbjct: 188  VLDLKLRSAQI-------EQEAQNLATRANAAQARTEELARRAAAAQQTAQAIQQRDA-- 238

Query: 1301 DVYKVINKD----AGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQE 1344
               ++  K     A      E + +L  LE            L QE  
Sbjct: 239  ---QISQKAQQIAARAEQIRERERQLQRLET-------AQARLEQEVA 276


>gnl|CDD|215969 pfam00521, DNA_topoisoIV, DNA gyrase/topoisomerase IV, subunit A. 
          Length = 427

 Score = 34.0 bits (79), Expect = 0.91
 Identities = 31/132 (23%), Positives = 51/132 (38%), Gaps = 26/132 (19%)

Query: 1126 DRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKSMGINN 1185
            + R E+ K+  E   +K EERL   E  L  L ++  ++E+                I  
Sbjct: 315  EHRLEVYKRRKEYLLEKLEERLHILEGLLKALNKIDFVIEV----------------IRG 358

Query: 1186 IQDLNKSINQLQQKIEKTKAKIIA----YNNGEDLTEEPKAKLSKEIAVPESEAEFKAWL 1241
              DL K+  +L +++ + +A  +           LT+E   KL KEI   E E E     
Sbjct: 359  SIDLKKAKKELIEELSEIQADYLLDMRLRR----LTKEEIEKLEKEIE--ELEKEIAELE 412

Query: 1242 IETKKKRAYIID 1253
                 ++   I 
Sbjct: 413  KILASEKKLWIK 424


>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex.  This
           entry is characterized by proteins with alternating
           conserved and low-complexity regions. Bud13 together
           with Snu17p and a newly identified factor,
           Pml1p/Ylr016c, form a novel trimeric complex. called The
           RES complex, pre-mRNA retention and splicing complex.
           Subunits of this complex are not essential for viability
           of yeasts but they are required for efficient splicing
           in vitro and in vivo. Furthermore, inactivation of this
           complex causes pre-mRNA leakage from the nucleus. Bud13
           contains a unique, phylogenetically conserved C-terminal
           region of unknown function.
          Length = 141

 Score = 32.3 bits (74), Expect = 0.96
 Identities = 21/99 (21%), Positives = 45/99 (45%), Gaps = 6/99 (6%)

Query: 720 KAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEMEKEKRKKI 779
           + I  +  +E  E++K      + K   EE  K   + +V K+ + +   E+EK K K +
Sbjct: 10  RIIDIEEKREEKEREK------EEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPL 63

Query: 780 LREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPK 818
            R  ++E  ++   ++ + + P     + K+ K   + K
Sbjct: 64  ARYADDEDYDEELKEQERWDDPMAQFLRKKKEKTDKKGK 102


>gnl|CDD|237478 PRK13709, PRK13709, conjugal transfer nickase/helicase TraI;
            Provisional.
          Length = 1747

 Score = 34.0 bits (78), Expect = 1.00
 Identities = 18/78 (23%), Positives = 32/78 (41%), Gaps = 1/78 (1%)

Query: 740  TPQVKTEPEENVKITQEPVVLKQIKSEEKVEMEKEKR-KKILREREEEKEEQPAMKKIKV 798
             P    +P           VL Q ++EE +  E E+R  +I+R+  E K + P  K  + 
Sbjct: 1624 IPDNSVQPGAGNGEPVTAEVLAQRQAEEAIRRETERRADEIVRKMAENKPDLPDGKTEQA 1683

Query: 799  EKPEESNEKTKRHKIKSE 816
             +     E+ +    + E
Sbjct: 1684 VRDIAGQERDRAAISERE 1701


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
           consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 33.8 bits (77), Expect = 1.1
 Identities = 20/112 (17%), Positives = 51/112 (45%), Gaps = 9/112 (8%)

Query: 713 LKSKEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEME 772
           LK    +  + + VD ++   KK+      ++ + E+ V   ++   LK+ +S+E     
Sbjct: 160 LKKNILSGNVSD-VDTDSISDKKVVE---ALREDNEKGVNFRRDMTDLKERESQEDA--- 212

Query: 773 KEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPKENDSSK 824
             KR + L+E  ++K+      + K +  +++ +K +    + + +  +  K
Sbjct: 213 --KRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNLPK 262



 Score = 31.5 bits (71), Expect = 4.8
 Identities = 27/120 (22%), Positives = 46/120 (38%), Gaps = 4/120 (3%)

Query: 714 KSKEYAKAIQEKVDKE-NAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEME 772
           K  +  KA Q+    + NA++++ +    Q + +       T  P   KQ+   +K E+E
Sbjct: 225 KQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIE 284

Query: 773 KEKRKKILREREEEKEEQPAMKKIKVEKPEESNE---KTKRHKIKSEPKENDSSKGKSGG 829
           K + +    + E  K +      +K E      E   K    + K EP   D  K K   
Sbjct: 285 KAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKREPVAEDLQKTKPQV 344


>gnl|CDD|223616 COG0542, clpA, ATP-binding subunits of Clp protease and DnaK/DnaJ
            chaperones [Posttranslational modification, protein
            turnover, chaperones].
          Length = 786

 Score = 33.8 bits (78), Expect = 1.2
 Identities = 23/123 (18%), Positives = 51/123 (41%), Gaps = 19/123 (15%)

Query: 1215 DLTEEPKAKLSKEIAVPESEAEFKAWLIETKKKRAYIIDKK--NARKQRRQDLAKRRTAA 1272
            DL +E  A++  EI  PE   E +  L + + ++  +  ++    +K   + +  +    
Sbjct: 384  DLLDEAGARVRLEIDKPEELDELERELAQLEIEKEALEREQDEKEKKLIDEIIKLKEGRI 443

Query: 1273 AQERMRLISELARKEKRDDDFGMRDEDWDVYKVINKDAG---DTDSEEEQERLIELEEIL 1329
             +    L +E+                 D+ +V+ +  G       E+E+E+L+ LE  L
Sbjct: 444  PELEKELEAEVDED--------------DIAEVVARWTGIPVAKLLEDEKEKLLNLERRL 489

Query: 1330 RQH 1332
            ++ 
Sbjct: 490  KKR 492


>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207).  This
           family is found in eukaryotes; it has several conserved
           tryptophan residues. The function is not known.
          Length = 261

 Score = 33.2 bits (76), Expect = 1.2
 Identities = 33/150 (22%), Positives = 64/150 (42%), Gaps = 11/150 (7%)

Query: 694 KYQKIKLKTSRFETPYQRFLKS--KEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENV 751
           K QK+  +  + E   +R      +  AK   E+  ++ A+Q          K   E   
Sbjct: 103 KLQKLLEEKQKQEREKEREEAELRQRLAKEKYEEWCRQKAQQAAKQRTPKHKKEAAESAS 162

Query: 752 KITQEPVVLKQIKSEEKV-----EMEKEKRKKILREREEEKEEQPAMKKIKVEKPEESNE 806
                    ++  S+E+      E E +K K+  ++REEE+ +Q   ++ + E+ +++ E
Sbjct: 163 SSLSGSAKPERNVSQEEAKKRLQEWELKKLKQQQQKREEERRKQRKKQQEEEERKQKAEE 222

Query: 807 KTKR--HKIKSEPKENDSSKGKSG--GTIS 832
             ++    +K  PK    ++G     GTIS
Sbjct: 223 AWQKWMKNVKQRPKPVPLNQGMDSLRGTIS 252


>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
           subunit G; Reviewed.
          Length = 197

 Score = 32.7 bits (74), Expect = 1.4
 Identities = 16/66 (24%), Positives = 38/66 (57%)

Query: 761 KQIKSEEKVEMEKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPKEN 820
           +Q + E +  ME E+R++ + ERE+++E++   ++  +E+  + +E     + +SE + +
Sbjct: 132 RQEREELEERMEWERREEKIDEREDQEEQEREREEQTIEEQSDDSEHEIIEQDESETESD 191

Query: 821 DSSKGK 826
           D    K
Sbjct: 192 DDKTEK 197


>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
            Mitofilin controls mitochondrial cristae morphology.
            Mitofilin is enriched in the narrow space between the
            inner boundary and the outer membranes, where it forms a
            homotypic interaction and assembles into a large
            multimeric protein complex. The first 78 amino acids
            contain a typical amino-terminal-cleavable mitochondrial
            presequence rich in positive-charged and hydroxylated
            residues and a membrane anchor domain. In addition, it
            has three centrally located coiled coil domains.
          Length = 493

 Score = 33.5 bits (77), Expect = 1.5
 Identities = 30/177 (16%), Positives = 69/177 (38%), Gaps = 20/177 (11%)

Query: 1114 VPVPNLTTEQQKDRRKELAKKLVEMNAKKREERLVDDERHL-NELLEL-------REIVE 1165
            +P  +  +E+  +   +      E    K  + LV+    L  ELL+        +E+V 
Sbjct: 84   IPRQSGLSEEVAEAEAKATSVAAEATTPKSIQELVEALEELLEELLKETASDPVVQELVS 143

Query: 1166 LTPS--DHSHAREAFKSMG------INNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLT 1217
            +     D          +          +  L+K + +L+ + E+   + +     E L 
Sbjct: 144  IFNDLIDSIKEDNLKDDLESLIASAKEELDQLSKKLAELKAEEEEELERALKEKREELL- 202

Query: 1218 EEPKAKLSKEIAVPESEAEFKAWLIETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQ 1274
             + + +L   +   E+  E K   +E ++++  +  +K   ++ RQ+L ++  A  Q
Sbjct: 203  SKLEEELLARLESKEAALE-KQLRLEFEREKEEL--RKKYEEKLRQELERQAEAHEQ 256


>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain.  This
           family represents the C-terminus (approximately 300
           residues) of proteins that are involved as binding
           partners for Prp19 as part of the nuclear pore complex.
           The family in Drosophila is necessary for pre-mRNA
           splicing, and the human protein has been found in
           purifications of the spliceosome. In the past this
           family was thought, erroneously, to be associated with
           microfibrillin.
          Length = 277

 Score = 33.0 bits (75), Expect = 1.6
 Identities = 32/123 (26%), Positives = 62/123 (50%), Gaps = 10/123 (8%)

Query: 714 KSKEYAKAIQ-EKVDKENAEQKKIDA---LTPQVKTEPEENVKITQEPVVLKQIKSE--- 766
           + +E AK    E+  K  AE++K +    +  +VK E E   + T     +  + ++   
Sbjct: 53  REREAAKEKALEEEAKRKAEERKRETLKIVEEEVKKELELKKRNTLLEANIDDVDTDDEN 112

Query: 767 EKVEMEKEKRKKILR-EREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPK--ENDSS 823
           E+ E E  K +++ R +R+ E+ E+   +K ++EK     E+ +R +++  PK   N + 
Sbjct: 113 EEEEYEAWKLRELKRIKRDREEREEMEREKAEIEKMRNMTEEERRAELRKNPKVSTNKAK 172

Query: 824 KGK 826
           KGK
Sbjct: 173 KGK 175


>gnl|CDD|176652 cd06140, DNA_polA_I_Bacillus_like_exo, inactive DEDDy 3'-5'
           exonuclease domain of Bacillus stearothermophilus DNA
           polymerase I and similar family-A DNA polymerases.
           Bacillus stearothermophilus-like Polymerase I (Pol I), a
           subgroup of the family-A DNA polymerases, contains an
           inactive DnaQ-like 3'-5' exonuclease domain in the same
           polypeptide chain as the polymerase region. The
           exonuclease-like domain of these proteins possess the
           same fold as the Klenow fragment (KF) of Escherichia
           coli Pol I, but does not contain the four critical
           metal-binding residues necessary for activity. The
           function of this domain is unknown. It might act as a
           spacer between the polymerase and the 5'-3' exonuclease
           domains. Some members of this subgroup, such as those
           from Bacillus sphaericus and Thermus aquaticus, are
           thermostable DNA polymerases.
          Length = 178

 Score = 32.2 bits (74), Expect = 1.6
 Identities = 27/101 (26%), Positives = 39/101 (38%), Gaps = 13/101 (12%)

Query: 341 QEIAIDLEYHNYRSYQGYTCLMQISTRDKDYIVDTLKLREDLEVLNEVLTDKNIVKVFHG 400
            E+A+ +E      +      + ++     Y +       DL  L E L D+ I KV H 
Sbjct: 4   DEVALYVELLGENYHTADIIGLALANGGGAYYIPLELALLDLAALKEWLEDEKIPKVGHD 63

Query: 401 ADSDIKWLQKDFGLYVVGM-FDTHQACKFLPMPRQSLAYLL 440
           A      L K  G+ + G+ FDT  A           AYLL
Sbjct: 64  AKRAYVAL-KRHGIELAGVAFDTMLA-----------AYLL 92



 Score = 32.2 bits (74), Expect = 1.6
 Identities = 27/101 (26%), Positives = 39/101 (38%), Gaps = 13/101 (12%)

Query: 1786 QEIAIDLEYHNYRSYQGYTCLMQISTRDKDYIVDTLKLREDLEVLNEVLTDKNIVKVFHG 1845
             E+A+ +E      +      + ++     Y +       DL  L E L D+ I KV H 
Sbjct: 4    DEVALYVELLGENYHTADIIGLALANGGGAYYIPLELALLDLAALKEWLEDEKIPKVGHD 63

Query: 1846 ADSDIKWLQKDFGLYVVGM-FDTHQACKFLPMPRQSLAYLL 1885
            A      L K  G+ + G+ FDT  A           AYLL
Sbjct: 64   AKRAYVAL-KRHGIELAGVAFDTMLA-----------AYLL 92


>gnl|CDD|203138 pfam04977, DivIC, Septum formation initiator.  DivIC from B. subtilis
            is necessary for both vegetative and sporulation septum
            formation. These proteins are mainly composed of an amino
            terminal coiled-coil.
          Length = 80

 Score = 30.2 bits (69), Expect = 1.7
 Identities = 8/36 (22%), Positives = 17/36 (47%)

Query: 1184 NNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEE 1219
              I  L   + +L+ + E+ +A++    +  D  EE
Sbjct: 24   QEIAALQAELAKLKAENEELEAEVKDLKSDPDYIEE 59


>gnl|CDD|218637 pfam05558, DREPP, DREPP plasma membrane polypeptide.  This family
           contains several plant plasma membrane proteins termed
           DREPPs as they are developmentally regulated plasma
           membrane polypeptides.
          Length = 201

 Score = 32.3 bits (73), Expect = 1.8
 Identities = 21/72 (29%), Positives = 32/72 (44%), Gaps = 1/72 (1%)

Query: 721 AIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEMEKEKRKKIL 780
            + EKV     E+KK +   P  +TE +  VK  +E VV ++ K EE V +      K  
Sbjct: 121 FVFEKVSTFIVEEKKPEEEAPAAETEKKPAVK-EKEIVVEEEKKEEEAVPVAAAPSTKAP 179

Query: 781 REREEEKEEQPA 792
             + EE   +  
Sbjct: 180 ETKVEETPAKAP 191


>gnl|CDD|236410 PRK09202, nusA, transcription elongation factor NusA; Validated.
          Length = 470

 Score = 32.9 bits (76), Expect = 1.8
 Identities = 16/89 (17%), Positives = 38/89 (42%)

Query: 716 KEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEMEKEK 775
            E+   +   ++  + +++    L  +  +  EE   +  E ++  +   EE VE  +E+
Sbjct: 349 AEFDAILDLFMEALDIDEEIAQLLVEEGFSSLEELAYVPVEELLEIEGFDEETVEELRER 408

Query: 776 RKKILREREEEKEEQPAMKKIKVEKPEES 804
            K+ L      +EE+ A   + +E  +  
Sbjct: 409 AKEALETEALAQEEKLADDLLSLEGLDRE 437


>gnl|CDD|218852 pfam06008, Laminin_I, Laminin Domain I.  coiled-coil structure. It
            has been suggested that the domains I and II from laminin
            A, B1 and B2 may come together to form a triple helical
            coiled-coil structure.
          Length = 263

 Score = 32.4 bits (74), Expect = 1.9
 Identities = 29/134 (21%), Positives = 60/134 (44%), Gaps = 13/134 (9%)

Query: 1113 AVPVPNLTTEQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELL----ELREIVELTP 1168
            A+P P       ++  ++L+ +L  ++A KR  +L D E+ L++L      L++      
Sbjct: 14   ALPAPYKILYNIENLTQQLSGRLSPISATKR--QLEDLEKALSKLAQDAENLQKKANQAL 71

Query: 1169 SDHSHAREAFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEI 1228
            +       A +   + + + L ++I  L+  I++   K+     GE+    P + LS+ +
Sbjct: 72   AKAQKVNAAIERT-LGHAKTLAEAIKNLRDNIKEILEKLAGL--GENAFALPGSALSRML 128

Query: 1229 AVPESEAEFKAWLI 1242
            A    EA+     I
Sbjct: 129  A----EAQRMLGEI 138


>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
            and conversion].
          Length = 161

 Score = 31.5 bits (72), Expect = 2.1
 Identities = 24/99 (24%), Positives = 44/99 (44%), Gaps = 19/99 (19%)

Query: 1189 LNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFKAWLIETKKKR 1248
            + K++++ Q KI             +DL E  + K   +  + E E E    L E +++ 
Sbjct: 31   ILKALDERQAKIA------------DDLAEAERLKEEAQALLAEYEQE----LEEAREQA 74

Query: 1249 AYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKE 1287
            + II+   A+K+  Q   + +  A +E  R+  E A  E
Sbjct: 75   SEIIE--QAKKEAEQIAEEIKAEAEEELERIK-EAAEAE 110



 Score = 29.6 bits (67), Expect = 9.0
 Identities = 21/99 (21%), Positives = 44/99 (44%), Gaps = 5/99 (5%)

Query: 714 KSKEYAKAIQEKVDKENAEQKK-----IDALTPQVKTEPEENVKITQEPVVLKQIKSEEK 768
           + KE A+A+  + ++E  E ++     I+    + +   EE     +E +   +  +E +
Sbjct: 51  RLKEEAQALLAEYEQELEEAREQASEIIEQAKKEAEQIAEEIKAEAEEELERIKEAAEAE 110

Query: 769 VEMEKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEK 807
           +E EKE+  + LR    E     A K +  +  E + + 
Sbjct: 111 IEAEKERALEELRAEVAELAVAIAEKLLGKKVDEAAQKD 149


>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342).  This
           family of bacterial proteins has no known function. The
           proteins are in the region of 500-600 amino acid
           residues in length.
          Length = 450

 Score = 32.6 bits (75), Expect = 2.1
 Identities = 22/94 (23%), Positives = 44/94 (46%), Gaps = 8/94 (8%)

Query: 711 RFLKSKEYAKAIQEKVDKENAEQKKIDA----LTPQVKTEPEENVKITQEPVVLKQIKSE 766
            F + KE  K ++E++ +   E +KI      L  + + +     +   E ++  + K  
Sbjct: 328 DFPELKEELKELEEELKELEEELEKIKKLLKKLPKKARGQLPPEKREQLEKLLETKEKLS 387

Query: 767 EKVEMEKEKRKKILREREEEKEEQPAMKKIKVEK 800
           E++E  +E+ K++    +EE E   +  KI V K
Sbjct: 388 EELEELEEELKEL----KEELESLYSEGKISVNK 417


>gnl|CDD|237358 PRK13351, PRK13351, elongation factor G; Reviewed.
          Length = 687

 Score = 33.0 bits (76), Expect = 2.2
 Identities = 23/103 (22%), Positives = 33/103 (32%), Gaps = 9/103 (8%)

Query: 1291 DDFGMRDEDWDV----YKVINKDAGDTDSEEEQERLIELEEILRQHDPEFTSLNQEQELS 1346
            D     +  +        V      +   EE +E   +L E L + D E   L  E E  
Sbjct: 175  DLITEPELHFSEGDGGSTVEEGPIPEELLEEVEEAREKLIEALAEFDDELLELYLEGEEL 234

Query: 1347 PKEANQLHIGVERMCGPECLFQPSMLGSIQ--AGISETLNFVL 1387
              E  QL   +           P + GS     GI   L+ V+
Sbjct: 235  SAE--QLRAPLREGTR-SGHLVPVLFGSALKNIGIEPLLDAVV 274


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 32.5 bits (74), Expect = 2.2
 Identities = 24/106 (22%), Positives = 43/106 (40%), Gaps = 5/106 (4%)

Query: 709 YQRFLKSKEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEK 768
            QR    K   +A  E+  K+  E++K      + K +     K   E    K+ K E K
Sbjct: 95  EQRAAAEKAAKQA--EQAAKQAEEKQKQAE---EAKAKQAAEAKAKAEAEAEKKAKEEAK 149

Query: 769 VEMEKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIK 814
            + E+E + K   E +++  E     + + +   E+  K K  + K
Sbjct: 150 KQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAK 195



 Score = 31.3 bits (71), Expect = 5.5
 Identities = 29/111 (26%), Positives = 39/111 (35%), Gaps = 9/111 (8%)

Query: 714 KSKEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEMEK 773
           K  E A    E+  K+  E K   A   + K E E   K  +E    K+   EE      
Sbjct: 105 KQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEE---AKKQAEEEAKAKAA 161

Query: 774 EKRKKILRE-----REEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPKE 819
            + KK   E       E K +  A  K K E+ +   E  K  K  +E   
Sbjct: 162 AEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKAKAEAAKA-KAAAEAAA 211


>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
           component YidC; Validated.
          Length = 429

 Score = 32.5 bits (74), Expect = 2.3
 Identities = 26/127 (20%), Positives = 48/127 (37%), Gaps = 13/127 (10%)

Query: 717 EYAKAIQEKVDKENAEQKKIDA--LTP-QVKTEPEENVKITQEPVVLKQIKSEEKVEMEK 773
           +Y +  +EK       ++      +TP +      EN +I +        K+E K   ++
Sbjct: 293 QYREKQKEKKAFLWTLRRNRLRMIITPWRAPELHAENAEIKKT---RTAEKNEAKARKKE 349

Query: 774 EKRKKILREREEEKEE-QPAMKKIKVEKPEESNEKTKRHKIK------SEPKENDSSKGK 826
             +K+   ERE  +E  Q     +   +   +  K K+  +         P EN+ SKG 
Sbjct: 350 IAQKRRAAEREINREARQERAAAMARARARRAAVKAKKKGLIDASPNEDTPSENEESKGS 409

Query: 827 SGGTIST 833
                +T
Sbjct: 410 PPQVEAT 416


>gnl|CDD|234366 TIGR03825, FliH_bacil, flagellar assembly protein FliH.  This
            bacillus clade of FliH proteins is not found by the Pfam
            FliH model pfam02108, but is closely related to the
            sequences identified by that model. Sequences identified
            by this model are observed in flagellar operons in an
            analogous position relative to other flagellar operon
            genes.
          Length = 255

 Score = 32.1 bits (73), Expect = 2.3
 Identities = 33/149 (22%), Positives = 58/149 (38%), Gaps = 20/149 (13%)

Query: 1189 LNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEFKAWLIETKKKR 1248
            ++  I   +  I   +  II      D+ E  + +L+ E    E   E      + + + 
Sbjct: 1    MSNIIKSTKSVIPAQERGIIPLRQVTDVPET-EQELADEEQEFEQILE------KAEAEA 53

Query: 1249 AYIIDKKNAR-KQRRQDLAKRRTAAAQERMRLISELARKEKRDDDFGM-RDEDWDVYKVI 1306
            A II++  A+    R+ + + R    +ER RLI E A++E  +  F     E   +Y+  
Sbjct: 54   AQIIEQAEAQAAAIREQIEQERAQWEEERERLIQE-AKQEGYEAGFQAGESEALSIYQST 112

Query: 1307 NKDAGDTDS----------EEEQERLIEL 1325
              +A               E  Q  +IEL
Sbjct: 113  IDEANAIVEEAKDDYEEKIESAQPLIIEL 141


>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
           envelope biogenesis, outer membrane].
          Length = 387

 Score = 32.6 bits (74), Expect = 2.4
 Identities = 15/105 (14%), Positives = 47/105 (44%), Gaps = 4/105 (3%)

Query: 716 KEYAKAIQEKVDKENAEQKKIDALTPQV----KTEPEENVKITQEPVVLKQIKSEEKVEM 771
           ++Y +   ++   +  EQ++              +  E  ++ Q      + + ++K   
Sbjct: 62  QQYGRIQSQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAE 121

Query: 772 EKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSE 816
           E EK+ ++ ++++EE+  + A ++ K  +  ++    +  K+K+ 
Sbjct: 122 EAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKLKAA 166



 Score = 30.7 bits (69), Expect = 9.2
 Identities = 26/105 (24%), Positives = 39/105 (37%), Gaps = 12/105 (11%)

Query: 714 KSKEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEMEK 773
           K  E AKA         A + K  A   + K + EE  K   E    K   +  K + E 
Sbjct: 147 KKAEAAKAKA----AAEAAKLKAAA---EAKKKAEEAAK-AAEEAKAKAEAAAAKKKAEA 198

Query: 774 EKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPK 818
           E +    + + E + +  A KK +      + EK    K K+  K
Sbjct: 199 EAKAAAEKAKAEAEAKAKAEKKAE----AAAEEKAAAEKKKAAAK 239


>gnl|CDD|225123 COG2213, MtlA, Phosphotransferase system, mannitol-specific IIBC
            component [Carbohydrate transport and metabolism].
          Length = 472

 Score = 32.7 bits (75), Expect = 2.5
 Identities = 24/114 (21%), Positives = 49/114 (42%), Gaps = 14/114 (12%)

Query: 1119 LTTEQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIV-----ELTPSDH-- 1171
             +  + +D  +    K+ EM  + + +         N+L  +++I+      +  S    
Sbjct: 337  TSKTESEDDLEAATAKMKEMKGESKGQSSALAADVKNDLSNVKKIIFACDAGMGSSAMGA 396

Query: 1172 SHAREAFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLS 1225
            S  R+  K+ G+N+I   N +IN L        A ++  +  +DLT+  K K+ 
Sbjct: 397  SILRKKLKNAGLNDISVTNYAINNL-----PADADLVITH--KDLTDRAKRKVP 443


>gnl|CDD|238288 cd00520, RRF, Ribosome recycling factor (RRF). Ribosome recycling
            factor dissociates the posttermination complex, composed
            of the ribosome, deacylated tRNA, and mRNA, after
            termination of translation.  Thus ribosomes are
            "recycled" and ready for another round of protein
            synthesis.  RRF is believed to bind the ribosome at the
            A-site in a manner that mimics tRNA, but the specific
            mechanisms remain unclear.  RRF is essential for
            bacterial growth.  It is not necessary for cell growth in
            archaea or eukaryotes, but is found in mitochondria or
            chloroplasts of some eukaryotic species.
          Length = 179

 Score = 31.5 bits (72), Expect = 2.5
 Identities = 18/96 (18%), Positives = 41/96 (42%), Gaps = 13/96 (13%)

Query: 1114 VPVPNLTTEQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSH 1173
            V +P LT E    RRKEL K   ++  + +           +++ +L +  E++  +   
Sbjct: 96   VNLPPLTEE----RRKELVKDAKKIAEEAKVAIRNIRRDANDKIKKLEKEKEISEDEVKK 151

Query: 1174 AREAFKSMGINNIQDLNK-SINQLQQKIEKTKAKII 1208
            A E         +Q L    I ++ + ++  + +++
Sbjct: 152  AEED--------LQKLTDEYIKKIDELLKSKEKELL 179


>gnl|CDD|223599 COG0525, ValS, Valyl-tRNA synthetase [Translation, ribosomal
            structure and biogenesis].
          Length = 877

 Score = 32.6 bits (75), Expect = 2.5
 Identities = 31/147 (21%), Positives = 63/147 (42%), Gaps = 24/147 (16%)

Query: 1118 NLTTEQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREA 1177
            NL  E        L  K+V + +++ E+RL  +E  +  L  L E+  L+P       E 
Sbjct: 736  NLRAEMNLSPSAPL--KVVLVGSEELEDRLEANEDDIKGLANLEELEILSPDPEEPPVEV 793

Query: 1178 FKSMGI--------------NNIQDLNKSINQLQQKIEKTKAKIIAYNNGED-LTEEPKA 1222
               +G                 +  L K + +L+++I++ + K+    + E  + + P+ 
Sbjct: 794  TAVVGGAELFLPLAGLIDLAAELARLEKELEKLEKEIDRIEKKL----SNEGFVAKAPEE 849

Query: 1223 KLSKEIAVPESEAEFKAWLIETKKKRA 1249
             + KE    E  AE++  L + +++ A
Sbjct: 850  VVEKEK---EKLAEYQVKLAKLEERLA 873


>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain.  This is a family of proteins of
           approximately 300 residues, found in plants and
           vertebrates. They contain a highly conserved DDRGK
           motif.
          Length = 189

 Score = 31.6 bits (72), Expect = 2.5
 Identities = 19/62 (30%), Positives = 37/62 (59%), Gaps = 2/62 (3%)

Query: 760 LKQIKSE--EKVEMEKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEP 817
            KQ + +  E  E E+E+RKK+  +RE E++E+  +++ + +K EE   K +  + + E 
Sbjct: 13  EKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQ 72

Query: 818 KE 819
           +E
Sbjct: 73  EE 74



 Score = 30.4 bits (69), Expect = 6.2
 Identities = 19/96 (19%), Positives = 42/96 (43%), Gaps = 12/96 (12%)

Query: 713 LKSKEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEME 772
           L+ K+  +  +E  ++E  E+KK++      + E EE              +  EK + E
Sbjct: 11  LEEKQARRQQREAEEEEREERKKLEEKREGERKEEEE------------LEEEREKKKEE 58

Query: 773 KEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKT 808
           +E++++  + R+E++E +       VE+        
Sbjct: 59  EERKEREEQARKEQEEYEKLKSSFVVEEEGTDKLSA 94


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 32.4 bits (74), Expect = 2.5
 Identities = 26/172 (15%), Positives = 61/172 (35%), Gaps = 24/172 (13%)

Query: 696 QKIKLKTSRFETPYQRFLKSKEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQ 755
           +K   K +   +      +          + + ++    +      +   +P    + T 
Sbjct: 221 EKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESAWEGFESEYEPINKPVRPKRKT- 279

Query: 756 EPVVLKQIKSEEKVEMEKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKS 815
                K  +++EK   E E+  K  ++ +++  +   +K+I  E  ++   + ++ + + 
Sbjct: 280 -----KAQRNKEKRRKELEREAKEEKQLKKKLAQLARLKEIAKEVAQKEKARARKKEQRK 334

Query: 816 EPKENDSSKGKSGGTISTVDFSKVNYNKYMAKPDTKILPDPYYEYLDELRDS 867
           E  E    K +           K+  +KY   P   +LP       DEL  S
Sbjct: 335 ERGEKKKLKRR-----------KLGKHKYPEPPLEVVLP-------DELSGS 368


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
           subunit.  This is a family of proteins which are
           subunits of the eukaryotic translation initiation factor
           3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
           cerevisiae protein eIF3j (HCR1) has been shown to be
           required for processing of 20S pre-rRNA and binds to 18S
           rRNA and eIF3 subunits Rpg1p and Prt1p.
          Length = 242

 Score = 31.9 bits (73), Expect = 2.9
 Identities = 20/60 (33%), Positives = 34/60 (56%), Gaps = 2/60 (3%)

Query: 746 EPEENVKITQEPVVLKQI--KSEEKVEMEKEKRKKILREREEEKEEQPAMKKIKVEKPEE 803
           + EE  K+  +    K +  K EEK + ++EK +K LRE EE+  E    +K+++ K +E
Sbjct: 45  KEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKLRLRKLQE 104


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
           family consists of several hypothetical bacterial
           proteins of around 200 residues in length. The function
           of this family is unknown.
          Length = 214

 Score = 31.6 bits (72), Expect = 3.0
 Identities = 20/92 (21%), Positives = 37/92 (40%), Gaps = 3/92 (3%)

Query: 742 QVKTEPEENVKITQEPVVLKQIKSEEKVEMEKEKRKKILREREEEKEEQPAMKKIKVEKP 801
               E E      QE   ++++K EEK     E ++      +E++E +   ++ + E  
Sbjct: 44  AAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESE---EENEEEDE 100

Query: 802 EESNEKTKRHKIKSEPKENDSSKGKSGGTIST 833
           E S+E  K  + K+E          S   + T
Sbjct: 101 ESSDENEKETEEKTESNVEKEITNPSWKPVGT 132


>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein.  Function of MutS2 is
           unknown. It should not be considered a DNA mismatch
           repair protein. It is likely a DNA mismatch binding
           protein of unknown cellular function [DNA metabolism,
           Other].
          Length = 771

 Score = 32.5 bits (74), Expect = 3.0
 Identities = 31/130 (23%), Positives = 53/130 (40%), Gaps = 7/130 (5%)

Query: 715 SKEYAKAIQEKVDKE-NAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEMEK 773
            +E    I++    E   EQK         + E  +  ++ QE   LK+ +  +K+E+EK
Sbjct: 510 KEEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKK-ELEQEMEELKERERNKKLELEK 568

Query: 774 EKRK--KILREREEEKEEQPAMKKIKVEKPEESNEKTKRH---KIKSEPKENDSSKGKSG 828
           E ++  K L++  E    +   KKI   K  +S E   +    K K   K  +    K G
Sbjct: 569 EAQEALKALKKEVESIIRELKEKKIHKAKEIKSIEDLVKLKETKQKIPQKPTNFQADKIG 628

Query: 829 GTISTVDFSK 838
             +    F +
Sbjct: 629 DKVRIRYFGQ 638


>gnl|CDD|227570 COG5245, DYN1, Dynein, heavy chain [Cytoskeleton].
          Length = 3164

 Score = 32.7 bits (74), Expect = 3.0
 Identities = 31/132 (23%), Positives = 47/132 (35%), Gaps = 18/132 (13%)

Query: 684  FEDKMKIGSNKYQKIK----LKT--SRFETPYQRFLKSKEYAKAIQEKVDKENAEQKKID 737
            FE  M  GS    KIK    LK   +R     +    +      + E V K N   + + 
Sbjct: 2056 FEISMIEGSLGESKIKFIGGLKVYDARCVIYIEELDCTNVN---LVEGVRKYNEYGRGMG 2112

Query: 738  ALTPQVK------TEPEENVKITQEPVVLKQIKSEEKVEMEKEKRKKILREREEE--KEE 789
             L  Q+          E+N          ++++ E K     E  + +L   EEE  K +
Sbjct: 2113 ELKEQLSNTVVILGVKEKNADDALSGTPGERLEREVKSVF-VEAPRDMLFLLEEEVRKRK 2171

Query: 790  QPAMKKIKVEKP 801
               MK    +KP
Sbjct: 2172 GSVMKFKSSKKP 2183


>gnl|CDD|227708 COG5421, COG5421, Transposase [DNA replication, recombination, and
           repair].
          Length = 480

 Score = 32.4 bits (74), Expect = 3.1
 Identities = 18/108 (16%), Positives = 39/108 (36%), Gaps = 14/108 (12%)

Query: 714 KSKEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEMEK 773
           KS E       KV      Q  +  L+ +++ + E+ +                ++E E 
Sbjct: 249 KSDERGSPYGTKVVYGGVGQLWVFLLSYKLQKKEEQTL--------------RTRIEKEL 294

Query: 774 EKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPKEND 821
           EK +K L + +  +       +I  EK  +     +  ++  + K+  
Sbjct: 295 EKAEKSLEKLKGREFNCEKDARIAAEKILKDYSSVEFLEVDFKSKKKR 342


>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812).  This
           family consists of several eukaryotic proteins of
           unknown function.
          Length = 536

 Score = 32.4 bits (74), Expect = 3.2
 Identities = 20/111 (18%), Positives = 45/111 (40%), Gaps = 10/111 (9%)

Query: 711 RFLKSKEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKS-EEKV 769
           RF ++++  +A     +   AE    +  T + + E E          + +QI   E  +
Sbjct: 236 RFTRTEKVEQAATAAAEVPPAEMDTEEDRTKEREAELEA---------LQEQIDELESSI 286

Query: 770 EMEKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPKEN 820
           E    + K +  + ++  EE   +++   E  EE   K +  ++  + + N
Sbjct: 287 EEVLSEIKALASKIKQVNEELTTVRQENEELEEEYKIKKRTVELLPDAENN 337


>gnl|CDD|239776 cd04243, AAK_AK-HSDH-like, AAK_AK-HSDH-like: Amino Acid Kinase
            Superfamily (AAK), AK-HSDH-like; this family includes the
            N-terminal catalytic domain of aspartokinase (AK) of the
            bifunctional enzyme AK- homoserine dehydrogenase (HSDH).
            These aspartokinases are found in such bacteria as E.
            coli (AKI-HSDHI, ThrA  and  AKII-HSDHII, MetL) and in
            higher plants (Z. mays AK-HSDH). AK and HSDH are the
            first and third enzymes in the biosynthetic pathway of
            the aspartate family of amino acids. AK catalyzes the
            phosphorylation of Asp to P-aspartyl phosphate. HSDH
            catalyzes the NADPH-dependent conversion of Asp
            3-semialdehyde to homoserine. ThrA and MetL are involved
            in threonine and methionine biosynthesis, respectively.
            In E. coli, ThrA is subject to allosteric regulation by
            the end product L-threonine and the native enzyme is
            reported to be tetrameric. As with bacteria, plant AK and
            HSDH are feedback inhibited by pathway end products.
            Maize AK-HSDH is a Thr-sensitive 180-kD enzyme.
            Arabidopsis AK-HSDH is an alanine-activated,
            threonine-sensitive enzyme whose ACT domains, located
            C-terminal to the AK catalytic domain, were shown to be
            involved in allosteric activation. Also included in this
            CD is the catalytic domain of the aspartokinase (AK) of
            the lysine-sensitive aspartokinase isoenzyme AKIII, a
            monofunctional class enzyme (LysC) found in some bacteria
            such as E. coli. In E. coli, LysC is reported to be a
            homodimer of 50 kD subunits. Also included in this CD is 
            the catalytic domain of aspartokinase (AK) of the
            bifunctional enzyme AK - DAP decarboxylase (DapDC) found
            in some bacteria. DapDC, which is the lysA gene product,
            catalyzes the decarboxylation of DAP to lysine.
          Length = 293

 Score = 31.8 bits (73), Expect = 3.2
 Identities = 21/87 (24%), Positives = 41/87 (47%), Gaps = 8/87 (9%)

Query: 1130 ELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREAFKSMGINNIQDL 1189
             LA+     +  +        ERHL+ + EL     L+    +    A  S+ +  ++DL
Sbjct: 48   ALAELAASGDDAQAIVLQEIRERHLDLIKEL-----LSGESAAELLAALDSL-LERLKDL 101

Query: 1190 NKSINQLQQKIEKTKAKIIAYNNGEDL 1216
             + I  L +  +KT+A+++++  GE L
Sbjct: 102  LEGIRLLGELSDKTRAEVLSF--GELL 126


>gnl|CDD|220818 pfam10595, UPF0564, Uncharacterized protein family UPF0564.  This
           family of proteins has no known function. However, one
           of the members is annotated as an EF-hand family
           protein.
          Length = 349

 Score = 32.1 bits (73), Expect = 3.4
 Identities = 28/113 (24%), Positives = 47/113 (41%), Gaps = 12/113 (10%)

Query: 711 RFLKSKEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIK-SEEKV 769
           R  KS   +K  +E    E  E++       +    P       + P+  + ++ +EE+ 
Sbjct: 19  REKKSIRQSKLEEELNKLEKKEEEAECKKKFKANPVP----ASVKLPLYEEIMEQNEERR 74

Query: 770 EMEKEKRKKILRERE-----EEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEP 817
           E  +EK K IL   +      E+EEQ   K I   K   S  + +  K K++P
Sbjct: 75  EEVREKSKAILLSSQKPFSFYEREEQK--KAILPRKLRSSTSEREPKKFKAKP 125


>gnl|CDD|225368 COG2811, NtpF, Archaeal/vacuolar-type H+-ATPase subunit H [Energy
           production and conversion].
          Length = 108

 Score = 30.1 bits (68), Expect = 3.4
 Identities = 15/59 (25%), Positives = 32/59 (54%)

Query: 761 KQIKSEEKVEMEKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPKE 819
            +I ++E++E  KE+ ++I++E  EE  E     + + EK  +   +  R + + E +E
Sbjct: 15  AEISADEEIEEAKEEAEQIIKEAREEAREIIEEAEEEAEKLAQEILEEAREEAEEEAEE 73


>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM).  This
           family consists of several Plasmodium falciparum SPAM
           (secreted polymorphic antigen associated with
           merozoites) proteins. Variation among SPAM alleles is
           the result of deletions and amino acid substitutions in
           non-repetitive sequences within and flanking the alanine
           heptad-repeat domain. Heptad repeats in which the a and
           d position contain hydrophobic residues generate
           amphipathic alpha-helices which give rise to helical
           bundles or coiled-coil structures in proteins. SPAM is
           an example of a P. falciparum antigen in which a
           repetitive sequence has features characteristic of a
           well-defined structural element.
          Length = 164

 Score = 31.0 bits (70), Expect = 3.5
 Identities = 26/97 (26%), Positives = 46/97 (47%)

Query: 728 KENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEMEKEKRKKILREREEEK 787
           K+  E+KK + L   VK    +   I +E   +K  K E+  E E+E  ++I    + E 
Sbjct: 11  KKPEEEKKDENLLEHVKITSWDKEDIIKENEDVKDEKQEDDEEEEEEDEEEIEEPEDIED 70

Query: 788 EEQPAMKKIKVEKPEESNEKTKRHKIKSEPKENDSSK 824
           EE+    + + E+ EE N   K  + K+     +S++
Sbjct: 71  EEEIVEDEEEEEEDEEDNVDLKDIEKKNINDIFNSTQ 107


>gnl|CDD|221389 pfam12037, DUF3523, Domain of unknown function (DUF3523).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is typically
           between 257 to 277 amino acids in length. This domain is
           found associated with pfam00004. This domain has a
           conserved LER sequence motif.
          Length = 276

 Score = 31.7 bits (72), Expect = 3.8
 Identities = 24/78 (30%), Positives = 33/78 (42%), Gaps = 6/78 (7%)

Query: 748 EENVKITQEPVVLKQIKSEEKVEMEKEKRKKILREREEEKEEQPAMKKIKVE-----KPE 802
           EE +K  QE  VL+Q       E E  + ++   E E E E +    KI+ E     K E
Sbjct: 132 EELLK-MQEESVLRQEAMRRATEEEILEMRRETIEEEAELERENIRAKIEAEARGRAKEE 190

Query: 803 ESNEKTKRHKIKSEPKEN 820
             NE   R  +K +  E 
Sbjct: 191 RENEDINREMLKLKANEE 208


>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
          Length = 1463

 Score = 32.0 bits (73), Expect = 3.9
 Identities = 28/154 (18%), Positives = 52/154 (33%), Gaps = 32/154 (20%)

Query: 698  IKLKTSRFETPYQRFLKSKEYAKAIQEKVDKENAEQ----KKIDA--------------- 738
            +  K  R     +RF   K+    +Q     E AE+     KID                
Sbjct: 847  LIQKFGRSLKAKKRFSLLKKETIYLQSAQRVELAERQLQELKIDVKSISSLKLVNLELES 906

Query: 739  ----LTPQVKTEPEENVK-----ITQEPVVLKQIKSEEKVEMEKEKRKKILREREEEKEE 789
                L   + ++  EN++     I +   +L  I  EE   +E  K  ++ +  E E + 
Sbjct: 907  EIIELKKSLSSDLIENLEFKTELIARLKKLLNNIDLEEGPSIEYVKLPELNKLHEVESKL 966

Query: 790  QPAMKK----IKVEKPEESNEKTKRHKIKSEPKE 819
            +   ++    +K              ++K+  KE
Sbjct: 967  KETSEEYEDLLKKSTILVREGNKANSELKNFKKE 1000


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 31.7 bits (72), Expect = 4.0
 Identities = 22/86 (25%), Positives = 41/86 (47%), Gaps = 7/86 (8%)

Query: 693 NKYQKIKLKTSRFETPYQRFLKSKEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENVK 752
             Y+++K  T       Q  LK  EY K +Q+   K    + + DA + + + + +E   
Sbjct: 351 KLYEEVKSNTDLSGDKRQELLK--EYNKKLQDYTKKLGEVKDETDA-SEEAEAKAKEEKL 407

Query: 753 ITQEPVVLKQIKSEEKVEMEKEKRKK 778
             +E     + K +E+ + +KEKR+K
Sbjct: 408 KQEE----NEKKQKEQADEDKEKRQK 429


>gnl|CDD|224755 COG1842, PspA, Phage shock protein A (IM30), suppresses
            sigma54-dependent transcription [Transcription / Signal
            transduction mechanisms].
          Length = 225

 Score = 31.1 bits (71), Expect = 4.0
 Identities = 25/98 (25%), Positives = 42/98 (42%), Gaps = 11/98 (11%)

Query: 1120 TTEQQKDRRKELAKKLVEM--NAKKREER-----LVDDERHLNELLELREIVELTPSDHS 1172
               Q   R+K+L +KL E    A+K EE+        +E    E LE ++ +E       
Sbjct: 46   ALAQAIARQKQLERKLEEAQARAEKLEEKAELALQAGNEDLAREALEEKQSLEDLAKALE 105

Query: 1173 HAREAFKSMGINNIQDLNKSINQLQQKIEKTKAKIIAY 1210
               +  +      ++ L K +  L+QKI + +AK  A 
Sbjct: 106  AELQQAEEQ----VEKLKKQLAALEQKIAELRAKKEAL 139


>gnl|CDD|217630 pfam03592, Terminase_2, Terminase small subunit.  Packaging of
            double-stranded viral DNA concatemers requires
            interaction of the prohead with virus DNA. This process
            is mediated by a phage-encoded DNA recognition and
            terminase protein. The terminase enzymes described so
            far, which are hetero-oligomers composed of a small and a
            large subunit, do not have a significant level of
            sequence homology. The small terminase subunit is thought
            to form a nucleoprotein structure that helps to position
            the terminase large subunit at the packaging initiation
            site.
          Length = 141

 Score = 30.3 bits (69), Expect = 4.2
 Identities = 26/115 (22%), Positives = 40/115 (34%), Gaps = 24/115 (20%)

Query: 1245 KKKRAYIIDKKNARK--------QRRQDLAKRRTAAAQERMRLISELAR---KEKRDDDF 1293
             KK A  I  +   K        +  ++  KR    A E +  ++E+AR    E      
Sbjct: 28   SKKTARSIGSELLTKPKIKAYIDELMKEREKRTIITADEVLEELTEIARGDETELVLVGT 87

Query: 1294 GMRDEDWDVYKVINKDAGDTDSEEEQERLIELEEILRQHDPEFT-SLNQEQELSP 1347
               + +    KV  KD           RL  LE + + H   FT  +  E E+  
Sbjct: 88   KEGEVEKIEKKVKLKD-----------RLKALELLGK-HYGLFTDKVEVEGEVKK 130


>gnl|CDD|237301 PRK13200, psaA, photosystem I P700 chlorophyll a apoprotein A1;
            Provisional.
          Length = 766

 Score = 32.1 bits (73), Expect = 4.3
 Identities = 15/48 (31%), Positives = 23/48 (47%), Gaps = 4/48 (8%)

Query: 1431 HFSVSLAENPELDAWSGARQFSLSENFHDFAVTQSDYQEKGGEFFRVH 1478
            HF  +LA+ P+   W     ++L  + HDF    SD +E   + F  H
Sbjct: 33   HFDRTLAKGPKTTTWI----WNLHADAHDFDTHTSDLEEVSRKIFSAH 76


>gnl|CDD|218704 pfam05701, DUF827, Plant protein of unknown function (DUF827).
           This family consists of several plant proteins of
           unknown function. Several sequences in this family are
           described as being "myosin heavy chain-like".
          Length = 484

 Score = 31.5 bits (71), Expect = 4.6
 Identities = 25/108 (23%), Positives = 51/108 (47%), Gaps = 1/108 (0%)

Query: 713 LKSKEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEME 772
           L+    A A  E + KE +    + ++  +++       K  +E   L+ +    K E+E
Sbjct: 231 LEKLAEASAELESLQKEISIMASVASVLKELEEAKANLEKAAEEEKSLRNLVESLKQELE 290

Query: 773 KEKRK-KILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPKE 819
           +EK++ + LRE+E E EE  +  + ++ + +   E   R K+   P +
Sbjct: 291 EEKKELEELREKEGEAEEAASSLEAELNRTKSEKESKAREKMVEIPLK 338


>gnl|CDD|234275 TIGR03599, YloV, DAK2 domain fusion protein YloV.  This model
            describes a protein family that contains an N-terminal
            DAK2 domain (pfam02734), so named because of similarity
            to the dihydroxyacetone kinase family family. The
            GTP-binding protein CgtA (a member of the obg family) is
            a bacterial GTPase associated with ribosome biogenesis,
            and it has a characteristic extension (TIGR03595) in
            certain lineages. This protein family described here was
            found, by the method of partial phylognetic profiling, to
            have a phylogenetic distribution strongly correlated to
            that of TIGR03595. This correlation implies some form of
            functional coupling.
          Length = 530

 Score = 31.8 bits (73), Expect = 4.6
 Identities = 16/47 (34%), Positives = 23/47 (48%), Gaps = 6/47 (12%)

Query: 1174 AREAFKSMGINNI----QDLNKSINQLQQKIEKTKAK--IIAYNNGE 1214
              E FKS+G + +    Q +N S   + + IEK  AK  I+  NN  
Sbjct: 326  IAELFKSLGADVVIEGGQTMNPSTEDILKAIEKVNAKNVIVLPNNKN 372


>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa).  Members of this
           family of proteins are part of the yeast nuclear pore
           complex-associated pre-60S ribosomal subunit. The family
           functions as a highly conserved exonuclease that is
           required for the 5'-end maturation of 5.8S and 25S
           rRNAs, demonstrating that 5'-end processing also has a
           redundant pathway. Nop25 binds late pre-60S ribosomes,
           accompanying them from the nucleolus to the nuclear
           periphery; and there is evidence for both physical and
           functional links between late 60S subunit processing and
           export.
          Length = 134

 Score = 30.0 bits (68), Expect = 4.9
 Identities = 22/80 (27%), Positives = 39/80 (48%), Gaps = 4/80 (5%)

Query: 761 KQIKSEEKVEMEK--EKRKKILREREEEK-EEQPAMKKIKVEKPEESNEKTKRHK-IKSE 816
           K+I+ E K E+EK  ++RK+ L+  EEE  +E+ A  +   +  ++  E          E
Sbjct: 55  KRIREERKQELEKQLKERKEALKLLEEENDDEEDAETEDTEDVEDDEWEGFPEPTVTDYE 114

Query: 817 PKENDSSKGKSGGTISTVDF 836
            +  D  K K+  T+  +D 
Sbjct: 115 EEYIDEDKYKTTVTVEELDP 134


>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
          Length = 746

 Score = 31.7 bits (73), Expect = 5.0
 Identities = 17/88 (19%), Positives = 37/88 (42%), Gaps = 2/88 (2%)

Query: 733 QKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEMEKEKRKKILREREEEKEEQPA 792
           + ++  L  ++    +E ++  +E  +  ++KS      E+E RK+  +   EE+ +  A
Sbjct: 3   KVRVYELAKELGVSSKELLEKLKELGI--EVKSHSSTVEEEEARKEEAKREAEEEAKAEA 60

Query: 793 MKKIKVEKPEESNEKTKRHKIKSEPKEN 820
            +    E  EE+  +        E  E 
Sbjct: 61  EEAAAAEAEEEAKAEAAAAAPAEEAAEA 88


>gnl|CDD|237258 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
          Length = 925

 Score = 31.6 bits (72), Expect = 5.0
 Identities = 22/147 (14%), Positives = 47/147 (31%), Gaps = 15/147 (10%)

Query: 1118 NLTTEQQKDRRKELAKKLVEMNAKKREERLVDDERHLNELLELREIVELTPSDHSHAREA 1177
             +      DR +EL  +++E   ++ +   +  E  L    EL  I E          + 
Sbjct: 794  GINNSDINDRPQELIDQIIESEEERLKALRIQREEMLMRPEELELINEEQ--------KN 845

Query: 1178 FKSMGINNIQDLNKSINQLQQKIEKTKAKIIAYNNGEDLTEEPKAKLSKEIAVPESEAEF 1237
             K      + ++ ++  ++Q   E         N   +   +PK      IA        
Sbjct: 846  LKQEIKLELSEIQEAEEEIQNINEN-------KNEFVEFKNDPKKLNKLIIAKDVLIKLV 898

Query: 1238 KAWLIETKKKRAYIIDKKNARKQRRQD 1264
             +     + ++     KK+  K   + 
Sbjct: 899  ISSDEIKQDEKTTKKKKKDLEKTDEEA 925


>gnl|CDD|114629 pfam05917, DUF874, Helicobacter pylori protein of unknown function
           (DUF874).  This family consists of several hypothetical
           proteins specific to Helicobacter pylori. The function
           of this family is unknown.
          Length = 417

 Score = 31.4 bits (70), Expect = 5.1
 Identities = 31/101 (30%), Positives = 53/101 (52%), Gaps = 10/101 (9%)

Query: 723 QEKVDKENAEQKKIDA---LTPQVKTEPEENVKITQEPVVL--KQIKSE-EKVEMEKEKR 776
           Q K + ENA  +   +   L  + +   +E  K  +E + L   QIK+E EK + E+EK+
Sbjct: 132 QAKKEAENARDRANKSGIELEQEEQKTEQEKQKTEKEGIELANSQIKAEQEKQKTEQEKQ 191

Query: 777 KKILREREEEKEEQPAMK-KIKVEKPEESNEKTKRHKIKSE 816
           K    E+E++K    A K  I++E+ ++  E  K+  IK +
Sbjct: 192 KT---EQEKQKTSNIANKNAIELEQEKQKTENEKQDLIKEQ 229


>gnl|CDD|178867 PRK00106, PRK00106, hypothetical protein; Provisional.
          Length = 535

 Score = 31.4 bits (71), Expect = 5.3
 Identities = 29/129 (22%), Positives = 62/129 (48%), Gaps = 14/129 (10%)

Query: 700 LKTSRFETPYQR---FLKSKEYAKAIQEKVDKE----NAEQKKIDA-LTPQVKTEPEENV 751
            KT++ E+   +    L++KE A+  +E++++E      E K+I++ LT +  +   ++ 
Sbjct: 59  KKTAKRESKALKKELLLEAKEEARKYREEIEQEFKSERQELKQIESRLTERATSLDRKDE 118

Query: 752 KITQEPVVLKQIKSEEKVEMEKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRH 811
            ++ +   L      E  E     + K + EREE+ E+    KK ++E+    ++   R 
Sbjct: 119 NLSSKEKTL------ESKEQSLTDKSKHIDEREEQVEKLEEQKKAELERVAALSQAEARE 172

Query: 812 KIKSEPKEN 820
            I +E +  
Sbjct: 173 IILAETENK 181


>gnl|CDD|173502 PTZ00266, PTZ00266, NIMA-related protein kinase; Provisional.
          Length = 1021

 Score = 31.6 bits (71), Expect = 5.4
 Identities = 31/125 (24%), Positives = 61/125 (48%), Gaps = 19/125 (15%)

Query: 725 KVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEMEKEKRKKILRER- 783
           +VDK++AE+ +I+      K      +KI  E   +++++ EE+  +E+E+ ++I RER 
Sbjct: 429 RVDKDHAERARIEKENAHRKAL---EMKIL-EKKRIERLEREERERLERERMERIERERL 484

Query: 784 -----EEEKEEQPAMKKIKVEKPEESN---------EKTKRHKIKSEPKENDSSKGKSGG 829
                E E+ E+  +++ ++++ E            EK +R+    +  EN  S G   G
Sbjct: 485 ERERLERERLERDRLERDRLDRLERERVDRLERDRLEKARRNSYFLKGMENGLSAGGGPG 544

Query: 830 TISTV 834
               V
Sbjct: 545 DGPGV 549


>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62. 
          Length = 217

 Score = 30.9 bits (70), Expect = 5.4
 Identities = 19/71 (26%), Positives = 34/71 (47%), Gaps = 4/71 (5%)

Query: 709 YQRFLKSKEYAKAIQEKVDKENAE-QKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEE 767
            +R +++ E  K    K DK N E   KI++    ++          Q    +K++ S+E
Sbjct: 9   AKRVVRALESEKYKANK-DKGNPEIYNKINSQDKAIEKFKLL--IKAQMAERVKKLHSQE 65

Query: 768 KVEMEKEKRKK 778
           K E +K+ +KK
Sbjct: 66  KKEEKKKPKKK 76


>gnl|CDD|225629 COG3087, FtsN, Cell division protein [Cell division and chromosome
           partitioning].
          Length = 264

 Score = 31.0 bits (70), Expect = 5.5
 Identities = 14/110 (12%), Positives = 35/110 (31%), Gaps = 6/110 (5%)

Query: 699 KLKTSRFETPYQRFLKSKEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEPV 758
             + +  +   +   + ++  + ++  VD++         L  Q +    E    TQ   
Sbjct: 80  PTEPAAVKDAERLTPEQRQLLEQME--VDQKAQPT----QLGEQPEQARIEEQPRTQSQK 133

Query: 759 VLKQIKSEEKVEMEKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKT 808
              Q  + +   ++ + R +  +        +P  K  K E       K 
Sbjct: 134 AQSQATTVQTQPVKPKPRPEKPQPVAPAPAPEPVEKAPKAEAAPPPKPKA 183


>gnl|CDD|212668 cd10225, MreB_like, MreB and similar proteins.  MreB is a bacterial
            protein which assembles into filaments resembling those
            of eukaryotic F-actin. It is involved in determining the
            shape of rod-like bacterial cells, by assembling into
            large fibrous spirals beneath the cell membrane. MreB has
            also been implicated in chromosome segregation;
            specifically MreB is thought to bind to and segregate the
            replication origin of bacterial chromosomes.
          Length = 320

 Score = 31.2 bits (72), Expect = 5.6
 Identities = 11/47 (23%), Positives = 18/47 (38%), Gaps = 8/47 (17%)

Query: 1398 LANNIFVTGSLCQLPGFVERLNKDLLENRPFQSHFSVSLAENPELDA 1444
            L   I +TG    L G  E ++++            V +AE+P    
Sbjct: 270  LDRGIVLTGGGALLRGLDELISEET--------GLPVRVAEDPLTCV 308


>gnl|CDD|226169 COG3643, COG3643, Glutamate formiminotransferase [Amino acid
           transport and metabolism].
          Length = 302

 Score = 31.3 bits (71), Expect = 5.8
 Identities = 24/105 (22%), Positives = 39/105 (37%), Gaps = 12/105 (11%)

Query: 758 VVLKQIKSEEKVEMEKEKRKKI---------LREREEEKEEQPAMKKIKVEKPEESNEKT 808
           V LK   +EE VE+ KE  K++         L E    + E+  +  I+  + E   EK 
Sbjct: 94  VPLKDTTTEECVEISKELGKRVGEELGIPVYLYEDAATRPERQNLADIRKGQYEGLKEKI 153

Query: 809 KRHKIKSEPKENDSSKG-KSGGTISTVDFSKVNYNKYMAKPDTKI 852
           K  K    P          +G T        + +N  +   + +I
Sbjct: 154 KEPKWA--PDFGPREVHPTAGVTAIGARMPLIAFNVNLDTSNMEI 196


>gnl|CDD|223994 COG1067, LonB, Predicted ATP-dependent protease [Posttranslational
           modification, protein turnover, chaperones].
          Length = 647

 Score = 31.6 bits (72), Expect = 5.8
 Identities = 20/90 (22%), Positives = 37/90 (41%), Gaps = 6/90 (6%)

Query: 737 DALTPQVKTEPEE--NVKITQEPVVLKQ-IKSEEKVEMEKEKRKKILREREEEKEEQPAM 793
           D   P VK+   +  ++ +  EP V K  +       +  E+ + I  +   E  E+P +
Sbjct: 33  DEAVPIVKSAAFQRRHILLIGEPGVGKSMLAQAMSFLLPLEELEDIRVKPNPEDPERPKV 92

Query: 794 KKIKVEKPEESNEKTKRHKIKSEPKENDSS 823
           + +   +PE  +E       K+E K    S
Sbjct: 93  ETVPAGRPESFSE---LLGQKAEEKAEYLS 119


>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 438

 Score = 31.3 bits (71), Expect = 5.8
 Identities = 21/97 (21%), Positives = 43/97 (44%), Gaps = 6/97 (6%)

Query: 732 EQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEMEKEKRKKI------LREREE 785
            ++++     + K    E +K  +  +  +  K E    +  EK K++      L E  +
Sbjct: 69  LEEQLINQKKEQKNLFNEQIKQFELALQDEIAKLEALELLNLEKDKELELLEKELDELSK 128

Query: 786 EKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPKENDS 822
           E ++Q       +EK  E+N+  +R K ++E K  +S
Sbjct: 129 ELQKQLQNTAEIIEKKRENNKNEERLKFENEKKLEES 165


>gnl|CDD|224141 COG1220, HslU, ATP-dependent protease HslVU (ClpYQ), ATPase subunit
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 444

 Score = 31.5 bits (72), Expect = 5.8
 Identities = 9/53 (16%), Positives = 23/53 (43%), Gaps = 7/53 (13%)

Query: 769 VEMEKEKRKKILREREEEKEEQ-------PAMKKIKVEKPEESNEKTKRHKIK 814
           V++ +E++ + ++++ EE  E+       P  K    +   +      R K +
Sbjct: 108 VKLVREEKIEKVKDKAEELAEERILDALVPPAKNFWGQSENKQESSATREKFR 160


>gnl|CDD|218684 pfam05672, MAP7, MAP7 (E-MAP-115) family.  The organisation of
           microtubules varies with the cell type and is presumably
           controlled by tissue-specific microtubule-associated
           proteins (MAPs). The 115-kDa epithelial MAP
           (E-MAP-115/MAP7) has been identified as a
           microtubule-stabilising protein predominantly expressed
           in cell lines of epithelial origin. The binding of this
           microtubule associated protein is nucleotide
           independent.
          Length = 171

 Score = 30.4 bits (68), Expect = 5.9
 Identities = 18/77 (23%), Positives = 43/77 (55%), Gaps = 2/77 (2%)

Query: 742 QVKTEPEENVKITQEPVVLKQIKSEEKVEMEKEKRKKILREREEEKEEQPAMKKIKVEKP 801
           Q + E EE  ++ +E   LK+  +EE++  E+E R++      E++E+     + + ++ 
Sbjct: 44  QERREQEEQDRLEREE--LKRRAAEERLRREEEARRQEEERAREKEEKAKRKAEEEEKQE 101

Query: 802 EESNEKTKRHKIKSEPK 818
           +E  E+ ++ K ++E +
Sbjct: 102 QEEQERIQKQKEEAEAR 118



 Score = 29.7 bits (66), Expect = 9.7
 Identities = 25/98 (25%), Positives = 51/98 (52%), Gaps = 1/98 (1%)

Query: 719 AKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEMEKEKRKK 778
           A+  +E+ ++E  EQ++ D L  +         ++ +E    +Q +   + + EK KRK 
Sbjct: 35  AREQREQEEQERREQEEQDRLEREELKRRAAEERLRREEEARRQEEERAREKEEKAKRKA 94

Query: 779 ILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSE 816
              E++E +EEQ  ++K K E    + E+ +R +++ E
Sbjct: 95  EEEEKQE-QEEQERIQKQKEEAEARAREEAERMRLERE 131


>gnl|CDD|237561 PRK13923, PRK13923, putative spore coat protein regulator protein
           YlbO; Provisional.
          Length = 170

 Score = 30.2 bits (68), Expect = 6.0
 Identities = 14/62 (22%), Positives = 28/62 (45%), Gaps = 2/62 (3%)

Query: 734 KKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEMEKEKRKKILREREEEKEEQPAM 793
           ++I  L  + +    EN  + QE  + ++      V M + +R  IL   E+E++   A 
Sbjct: 111 EQIGKLQEEEEKLSWENQTLKQELAITEEDYRALIVIMNRARRMAIL--VEDERKSLAAF 168

Query: 794 KK 795
           + 
Sbjct: 169 EM 170


>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and
            autophagy-related subunit 14.  The Atg14 or Apg14
            proteins are hydrophilic proteins with a predicted
            molecular mass of 40.5 kDa, and have a coiled-coil motif
            at the N terminus region. Yeast cells with mutant Atg14
            are defective not only in autophagy but also in sorting
            of carboxypeptidase Y (CPY), a vacuolar-soluble
            hydrolase, to the vacuole. Subcellular fractionation
            indicate that Apg14p and Apg6p are peripherally
            associated with a membrane structure(s). Apg14p was
            co-immunoprecipitated with Apg6p, suggesting that they
            form a stable protein complex. These results imply that
            Apg6/Vps30p has two distinct functions: in the autophagic
            process and in the vacuolar protein sorting pathway.
            Apg14p may be a component specifically required for the
            function of Apg6/Vps30p through the autophagic pathway.
            There are 17 auto-phagosomal component proteins which are
            categorized into six functional units, one of which is
            the AS-PI3K complex (Vps30/Atg6 and Atg14). The AS-PI3K
            complex and the Atg2-Atg18 complex are essential for
            nucleation, and the specific function of the AS-PI3K
            apparently is to produce phosphatidylinositol 3-phosphate
            (PtdIns(3)P) at the pre-autophagosomal structure (PAS).
            The localisation of this complex at the PAS is controlled
            by Atg14. Autophagy mediates the cellular response to
            nutrient deprivation, protein aggregation, and pathogen
            invasion in humans, and malfunction of autophagy has been
            implicated in multiple human diseases including cancer.
            This effect seems to be mediated through direct
            interaction of the human Atg14 with Beclin 1 in the human
            phosphatidylinositol 3-kinase class III complex.
          Length = 307

 Score = 31.2 bits (71), Expect = 6.0
 Identities = 15/47 (31%), Positives = 23/47 (48%)

Query: 1243 ETKKKRAYIIDKKNARKQRRQDLAKRRTAAAQERMRLISELARKEKR 1289
            E ++KR  I + K A  QRR DL+       + R   + +L  + KR
Sbjct: 86   EIEQKRERIEELKRALAQRRSDLSSASYQLEKRRASQLEKLQDEIKR 132


>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 142

 Score = 30.0 bits (68), Expect = 6.4
 Identities = 18/71 (25%), Positives = 39/71 (54%)

Query: 760 LKQIKSEEKVEMEKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPKE 819
           L+ +  + K E E E+ ++   E++ + EE+ A K+ K +K ++  +K K+ K  ++ +E
Sbjct: 52  LELMDEKWKKETEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEE 111

Query: 820 NDSSKGKSGGT 830
            + SK     +
Sbjct: 112 KEGSKSSEESS 122


>gnl|CDD|185654 PTZ00475, PTZ00475, RESA-like protein; Provisional.
          Length = 282

 Score = 30.8 bits (69), Expect = 6.4
 Identities = 29/109 (26%), Positives = 48/109 (44%), Gaps = 9/109 (8%)

Query: 77  NQEIPKSD----EANKELSKTLDVLVQNAYKSIVKCTKTAQSFPSTHENSLLIASPNYIS 132
           N  I K D    EAN+ + +   VL Q   + I   T     +P   +N+  I    Y+S
Sbjct: 126 NSGINKKDIYLKEANERMIRNSIVLRQCKSRFISIIT---NYYPFKEQNNPFIKQAQYVS 182

Query: 133 GVAGTSDKVMTLVDSLLKTQNISKSMSKLYLEGQKDILTEANDKLLESI 181
                 D ++  +D      NI +++  LY E   ++L E  +++LE I
Sbjct: 183 SSNYVLDDIINNID--YSIDNIHRAIDNLYYEHILNLLEEEKNEILEEI 229


>gnl|CDD|215521 PLN02967, PLN02967, kinase.
          Length = 581

 Score = 31.2 bits (70), Expect = 6.4
 Identities = 34/140 (24%), Positives = 61/140 (43%), Gaps = 13/140 (9%)

Query: 691 GSNKYQKIKLK-TSRFETPYQRFLKSKEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEE 749
           G NK++  + +  ++F        K  E A A+ E+ D+  A  KK    + + +   + 
Sbjct: 27  GFNKFRGERWRLQNKFLLAAGSR-KKIESALAVDEEPDENGAVSKKKPTRSVK-RATKKT 84

Query: 750 NVKITQEPVVLKQIKSEEKVEMEKEKRKKILRER----------EEEKEEQPAMKKIKVE 799
            V+I++      ++   E   ++KE +K   R R          EEEK E+   K+ KV+
Sbjct: 85  VVEISEPLEEGSELVVNEDAALDKESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVK 144

Query: 800 KPEESNEKTKRHKIKSEPKE 819
           K +E  E        S+ +E
Sbjct: 145 KMDEDVEDQGSESEVSDVEE 164


>gnl|CDD|237377 PRK13405, bchH, magnesium chelatase subunit H; Provisional.
          Length = 1209

 Score = 31.2 bits (71), Expect = 7.3
 Identities = 12/25 (48%), Positives = 17/25 (68%)

Query: 406 KWLQKDFGLYVVGMFDTHQACKFLP 430
           +WL++DFG   V  F TH A +F+P
Sbjct: 589 RWLREDFGADAVLHFGTHGALEFMP 613



 Score = 31.2 bits (71), Expect = 7.3
 Identities = 12/25 (48%), Positives = 17/25 (68%)

Query: 1851 KWLQKDFGLYVVGMFDTHQACKFLP 1875
            +WL++DFG   V  F TH A +F+P
Sbjct: 589  RWLREDFGADAVLHFGTHGALEFMP 613


>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
          Length = 651

 Score = 31.2 bits (71), Expect = 7.5
 Identities = 27/106 (25%), Positives = 56/106 (52%), Gaps = 9/106 (8%)

Query: 716 KEYAKAIQEKVDKENAEQKKIDALTPQVKTE--PEENVKITQEPVVLKQIKSEEKVEMEK 773
           ++ AKA + K+   + ++K++  L  +++ E  P   ++I  +P      K ++K E+++
Sbjct: 497 RDAAKA-EMKLISLDKKKKQLLQLCDKLRDEWLPNLGIRIEDKPDGPSVWKLDDKEELQR 555

Query: 774 EKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPKE 819
           EK      E +E  +EQ  ++K+K ++ ++  E  K  K K  P E
Sbjct: 556 EK------EEKEALKEQKRLRKLKKQEEKKKKELEKLEKAKIPPAE 595


>gnl|CDD|233605 TIGR01865, cas_Csn1, CRISPR-associated protein Cas9/Csn1, subtype
           II/NMEMI.  CRISPR loci appear to be mobile elements with
           a wide host range. This model represents a protein found
           only in CRISPR-containing species, near other
           CRISPR-associated proteins (cas), as part of the NMENI
           subtype of CRISPR/Cas locus. The species range so far
           for this protein is animal pathogens and commensals only
           [Mobile and extrachromosomal element functions, Other].
          Length = 805

 Score = 31.2 bits (71), Expect = 7.7
 Identities = 22/108 (20%), Positives = 40/108 (37%), Gaps = 15/108 (13%)

Query: 715 SKEYAKAIQEK---VDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKV-- 769
                         + K N  + K + L P VK    +  K+  E  ++K+    +K+  
Sbjct: 463 GNRNFMQNINDSQLLPKINITKAKDEILNPVVKRALLQARKVVNE--LVKKYGPPDKIVI 520

Query: 770 EMEKE--------KRKKILREREEEKEEQPAMKKIKVEKPEESNEKTK 809
           EM +E        +  K   ++ E+K ++ A    K    EE  E + 
Sbjct: 521 EMAREEQGTNFGKRNSKERYKKNEDKIKEFASALGKEILKEEPTENSS 568


>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6.  The surfeit locus
           protein SURF-6 is shown to be a component of the
           nucleolar matrix and has a strong binding capacity for
           nucleic acids.
          Length = 206

 Score = 30.4 bits (69), Expect = 7.8
 Identities = 33/139 (23%), Positives = 66/139 (47%), Gaps = 17/139 (12%)

Query: 695 YQKIKLKTSRFETPYQRFLKSKEYAKA-IQEKVDKENAEQKKIDALTPQVKTEPEEN--- 750
           + K++           +  K K+  K   ++ + K  A +KK++ L      E EE    
Sbjct: 66  FSKVEFADGEQAKKDLKLKKKKKKKKTDYKQLLKKLEARKKKLEELDEDKAAEIEEKEKW 125

Query: 751 ---------VKITQEPVVLKQIKSEEKVEMEKEKRKKILREREEEKEEQPAMKKIKVEK- 800
                    VK+  +  +LK  K+ ++ E +K+K KK  +ER+E+ E++ A ++ K E+ 
Sbjct: 126 TKALAKAEGVKVKDDEKLLK--KALKRKEKQKKKSKKEWKERKEKVEKKKAERQKKREEN 183

Query: 801 -PEESNEKTKRHKIKSEPK 818
             +  ++K  + K K++ K
Sbjct: 184 LKKRKDDKKNKKKKKAKKK 202


>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
          Length = 1437

 Score = 31.3 bits (72), Expect = 7.9
 Identities = 23/105 (21%), Positives = 43/105 (40%), Gaps = 10/105 (9%)

Query: 732 EQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEMEKEKRKKILREREEEKEEQP 791
            +++++    Q + E E+  K   E +   + + +++ +    K   +   ++ +KEE  
Sbjct: 168 SKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPVQIGKKIDKEEIT 227

Query: 792 AMKKIKVEKPEESNEKTKRHKI-KSEPKENDSSKGKSGGTISTVD 835
            MK+I     EE         + K E KE      KSG  I T  
Sbjct: 228 PMKEI----NEEERRVVVEGYVFKVEIKE-----LKSGRHILTFK 263


>gnl|CDD|233787 TIGR02223, ftsN, cell division protein FtsN.  FtsN is a poorly
           conserved protein active in cell division in a number of
           Proteobacteria. The N-terminal 30 residue region tends
           to by Lys/Arg-rich, and is followed by a
           membrane-spanning region. This is followed by an acidic
           low-complexity region of variable length and a
           well-conserved C-terminal domain of two tandem regions
           matched by pfam05036 (Sporulation related repeat), found
           in several cell division and sporulation proteins. The
           role of FtsN as a suppressor for other cell division
           mutations is poorly understood; it may involve cell wall
           hydrolysis [Cellular processes, Cell division].
          Length = 298

 Score = 30.4 bits (68), Expect = 8.2
 Identities = 25/117 (21%), Positives = 43/117 (36%), Gaps = 10/117 (8%)

Query: 716 KEYAKAIQEKVDKENAEQKKIDALTPQVKTEPEENV------KITQEPVVLKQIKSE--- 766
           +E A+   E+       Q  + A    + T P E        K T E    K   +E   
Sbjct: 109 EESAQLTAEQRQLLEQMQADMRAAEKVLATAPSEQTVAVEARKQTAEKKPQKARTAEAQK 168

Query: 767 EKVEMEKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSEPKENDSS 823
             VE EK   K    +++++   +   +     KP E+  K      K++PK  + +
Sbjct: 169 TPVETEKIASKVKEAKQKQKALPKQTAETQSNSKPIETAPKA-DKADKTKPKPKEKA 224


>gnl|CDD|143435 cd07117, ALDH_StaphAldA1, Uncharacterized Staphylococcus aureus AldA1
            (SACOL0154) aldehyde dehydrogenase-like.  Uncharacterized
            aldehyde dehydrogenase from Staphylococcus aureus (AldA1,
            locus SACOL0154) and other similar sequences are present
            in this CD.
          Length = 475

 Score = 30.9 bits (70), Expect = 8.3
 Identities = 24/105 (22%), Positives = 50/105 (47%), Gaps = 15/105 (14%)

Query: 1160 LREIVELTPSDHSHA----REAFKSMGINNIQDLNKSINQLQQKIEKTK---AKIIAYNN 1212
            L EI + T +D   A    +EAFK+     + +    +N++   I++ K   A +   +N
Sbjct: 29   LSEITDATDADVDRAVKAAQEAFKTWRKTTVAERANILNKIADIIDENKELLAMVETLDN 88

Query: 1213 GEDLTEEPKAKLSKEIAVPESEAEFK--AWLIETKKKRAYIIDKK 1255
            G+ + E      ++ + +P +   F+  A +I  ++  A +ID+ 
Sbjct: 89   GKPIRE------TRAVDIPLAADHFRYFAGVIRAEEGSANMIDED 127


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 30.8 bits (70), Expect = 8.3
 Identities = 19/99 (19%), Positives = 35/99 (35%), Gaps = 3/99 (3%)

Query: 746 EPEENVKITQEPVVLKQIKSEEKV---EMEKEKRKKILREREEEKEEQPAMKKIKVEKPE 802
               NV +  + ++    + E KV     +K+K++    E +++ E++   KK K E   
Sbjct: 26  YIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKS 85

Query: 803 ESNEKTKRHKIKSEPKENDSSKGKSGGTISTVDFSKVNY 841
           E   K      K   K                 F+K+  
Sbjct: 86  EGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAFNKIAE 124


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 30.0 bits (68), Expect = 8.3
 Identities = 22/106 (20%), Positives = 51/106 (48%), Gaps = 5/106 (4%)

Query: 740 TPQVKTEPEENVKITQEPVVLKQIKSEEKVEMEKEKRKKILREREEEKEEQPAMKKIKVE 799
           TP    E  E  K  +E   L +   + K E E++++ K   ++++ K+++   K  K +
Sbjct: 50  TPIYDAEYTEAKKKKKE---LAEEIEKVKKEYEEKQKWK--WKKKKSKKKKDKDKDKKDD 104

Query: 800 KPEESNEKTKRHKIKSEPKENDSSKGKSGGTISTVDFSKVNYNKYM 845
           K ++ +EK    + + + ++   S  ++  T+S +   K   +K +
Sbjct: 105 KKDDKSEKKDEKEAEDKLEDLTKSYSETLSTLSELKPRKYALHKDI 150


>gnl|CDD|236335 PRK08724, fliD, flagellar capping protein; Validated.
          Length = 673

 Score = 31.0 bits (70), Expect = 8.4
 Identities = 22/93 (23%), Positives = 37/93 (39%), Gaps = 8/93 (8%)

Query: 719 AKAIQEKVDKENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQI--KSEEKVEMEKEKR 776
             AI +  D           LT    T  E   +I Q+    K     + EK E+  E+ 
Sbjct: 334 EAAIAKAPDVPGWSNTASGTLTDSYVTPKEAQAEIEQKLAQEKAQLDAAVEKGELTPEQA 393

Query: 777 KKILREREEEKEEQPAMKKIKVEKPEESNEKTK 809
           K+I R + E +E +      ++EK +++    K
Sbjct: 394 KQIARAKLEPEERE------RLEKIDKAQAALK 420


>gnl|CDD|239570 cd03488, Topoisomer_IB_N_htopoI_like, Topoisomer_IB_N_htopoI_like :
           N-terminal DNA binding fragment found in eukaryotic DNA
           topoisomerase (topo) IB proteins similar to the
           monomeric yeast and human topo I.  Topo I enzymes are
           divided into:  topo type IA (bacterial) and type IB
           (eukaryotic). Topo I relaxes superhelical tension in
           duplex DNA by creating a single-strand nick, the broken
           strand can then rotate around the unbroken strand to
           remove DNA supercoils and, the nick is religated,
           liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit religation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  This family may represent more than
           one structural domain.
          Length = 215

 Score = 30.0 bits (68), Expect = 8.8
 Identities = 12/28 (42%), Positives = 19/28 (67%)

Query: 762 QIKSEEKVEMEKEKRKKILREREEEKEE 789
           + + EEK  M KE++K I  E+E+ +EE
Sbjct: 95  KAQKEEKKAMSKEEKKAIKAEKEKLEEE 122


>gnl|CDD|234045 TIGR02876, spore_yqfD, sporulation protein YqfD.  YqfD is part of
           the sigma-E regulon in the sporulation program of
           endospore-forming Gram-positive bacteria. Mutation
           results in a sporulation defect in Bacillus subtilis.
           Members are found in all currently known
           endospore-forming bacteria, including the genera
           Bacillus, Symbiobacterium, Carboxydothermus,
           Clostridium, and Thermoanaerobacter [Cellular processes,
           Sporulation and germination].
          Length = 382

 Score = 30.8 bits (70), Expect = 8.8
 Identities = 25/131 (19%), Positives = 53/131 (40%), Gaps = 15/131 (11%)

Query: 675 KKIAQIGLFFEDKMKIGSNKYQKIKLKTSRFETPYQRFLKSKEYAKAIQEKVDKENAEQK 734
           +K  ++ L  + +++ G  K  +  L     +   ++    KE+AK   E+  K+    +
Sbjct: 248 EKTVEVPLKTQREVRTG-KKIVRYYLNIGGKKIYLKK--PPKEFAKYDVEEKTKKFGLWR 304

Query: 735 KIDALTPQVKTEPEE--NVKITQEPVVLKQIKSEEKVEMEKEKRKKILREREEEKEEQPA 792
                +  ++   E    VK   E V  ++   + K + EKE +K++      +   +  
Sbjct: 305 W----SLPIEIVKETYYEVKEKVEKVTKEEAIEKAKEKAEKELKKEL------DPNAKIV 354

Query: 793 MKKIKVEKPEE 803
             KI  E+ E 
Sbjct: 355 SDKILSERVEG 365


>gnl|CDD|235638 PRK05896, PRK05896, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 605

 Score = 31.0 bits (70), Expect = 9.0
 Identities = 13/91 (14%), Positives = 33/91 (36%), Gaps = 7/91 (7%)

Query: 729 ENAEQKKIDALTPQVKTEPEENVKITQEPVVLKQIKSEEKVEMEKEKRKKILREREEEKE 788
            NA   K++    ++  +PE+      + +  K+I          E+  K     +   +
Sbjct: 361 INANPDKLENKKIEISIKPEKKTINNSQIITTKKIN-------IVEESNKNSVHFDTLYK 413

Query: 789 EQPAMKKIKVEKPEESNEKTKRHKIKSEPKE 819
            +    K K+ +  +  +  K   ++ E  +
Sbjct: 414 TKIFYHKKKINQNNKEQDIKKEELLEKEFVK 444


>gnl|CDD|220172 pfam09309, FCP1_C, FCP1, C-terminal.  The C-terminal domain of
           FCP-1 is required for interaction with the carboxy
           terminal domain of RAP74. Interaction relies extensively
           on van der Waals contacts between hydrophobic residues
           situated within alpha-helices in both domains.
          Length = 263

 Score = 30.4 bits (68), Expect = 9.4
 Identities = 15/54 (27%), Positives = 22/54 (40%), Gaps = 4/54 (7%)

Query: 763 IKSEEKVEMEKEKRKKILREREEEKEEQPAMKKIKVEKPEESNEKTKRHKIKSE 816
           I  E   + + EK+K    E E+E+  QP     K + P    E  +     SE
Sbjct: 155 ILGEGSDDSDSEKKKPPGNEEEQEQAPQPR----KQQPPGPRREPDEGTPSSSE 204


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.316    0.132    0.383 

Gapped
Lambda     K      H
   0.267   0.0788    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 99,129,109
Number of extensions: 9908410
Number of successful extensions: 14188
Number of sequences better than 10.0: 1
Number of HSP's gapped: 13095
Number of HSP's successfully gapped: 578
Length of query: 1973
Length of database: 10,937,602
Length adjustment: 111
Effective length of query: 1862
Effective length of database: 6,014,308
Effective search space: 11198641496
Effective search space used: 11198641496
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 67 (29.5 bits)